llm

#llm

@eyad_khrais: https://x.com/eyad_khrais/status/2069552027382980882

X AI KOLs Timeline ↗ · 2h ago Cached

A comprehensive guide to building AI agent harnesses, covering tool execution, context management, state/memory, and guardrails, based on lessons from building Claude Code and other harnesses for enterprise.

0 favorites 0 likes

#llm

@karpathy: This is a new paradigm for interacting with Claude that is significantly more "inline" with all the other human activit…

X AI KOLs ↗ · 2h ago Cached

Karpathy describes a new paradigm for interacting with Claude where it becomes a persistent, async team member with org-wide tools and context, similar to Claude Tag in Slack, marking a third major redesign of LLM user interfaces.

0 favorites 0 likes

#llm

I Figured Out What Causes 'Super Weights'

Reddit r/ArtificialInteligence ↗ · 4h ago

Explains that super weights in large language models arise from the SoftMax-Attention interaction creating a 'Nothing Dump' token that serves as a stable reference point; removing these weights cripples performance.

0 favorites 0 likes

#llm

Modal Auto Endpoints: Optimized inference you own

Hacker News Top ↗ · 6h ago Cached

Modal introduces Auto Endpoints, a self-serve service for optimized, production-grade LLM inference with full code ownership, transparent metrics, and autoscaling, built on their serverless GPU infrastructure.

0 favorites 0 likes

#llm

A Potential Alignment Vulnerability in LLMs: Behavioral and Hidden-State Evidence from Gemma-3-12B . Pre-token hidden state shift as an alignment policy traversal vector in instruction-tuned LLMs

Reddit r/AI_Agents ↗ · 7h ago

This paper investigates an alignment vulnerability in instruction-tuned LLMs, specifically Gemma-3-12B, by showing that pre-token hidden state shifts can act as an alignment policy traversal vector, potentially enabling bypass of safety measures.

0 favorites 0 likes

#llm

How are you evaluating AI features in production?

Reddit r/AI_Agents ↗ · 7h ago

A discussion on the methodologies and challenges involved in evaluating AI features once they are deployed in production environments.

0 favorites 0 likes

#llm

I benchmarked 8 LLMs for medical scribing. Hallucinations were rare; omissions need attention.

Reddit r/LocalLLaMA ↗ · 8h ago

A benchmark of 8 LLMs for medical scribing found hallucinations rare but omissions a concern.

0 favorites 0 likes

#llm

Real-world GLM 5.2 experiences only — skip generic benchmark scores, how does it hold up on complex production business workloads?

Reddit r/AI_Agents ↗ · 10h ago

Discusses real-world experiences with GLM 5.2 in complex production business workloads, focusing on practical performance beyond benchmark scores.

0 favorites 0 likes

#llm

Vulnerability Reports Are Not Special Anymore

Lobsters Hottest ↗ · 11h ago Cached

Filippo Valsorda argues that LLMs have made vulnerability reports no longer special, as AI can now generate insights that were once exclusive to human researchers, shifting the bottleneck from discovery to triage.

0 favorites 0 likes

#llm

What you read before a question changes how a language model answers it — even when the question has nothing to do with what you read. Potential Alignment Vulnerability in LLMs: Behavioral and Hidden-State Evidence from Gemma-3-12B

Reddit r/ArtificialInteligence ↗ · 19h ago

The article reports a potential alignment vulnerability in LLMs where processing a structured passage before an unrelated question can alter the model's response, with mechanistic evidence from Gemma-3-12B showing hidden-state separation.

0 favorites 0 likes

#llm

@ankrgyl: 2025: engineer LLM APIs into your harness 2026: engineer harnesses to work in your agent

X AI KOLs Following ↗ · yesterday Cached

A brief prediction that in 2025 engineers will integrate LLM APIs into their test harnesses, and in 2026 they will design harnesses to work within their agents.

0 favorites 0 likes

#llm

@julien_c: https://x.com/julien_c/status/2069144929100571134

X AI KOLs Following ↗ · yesterday Cached

An LLM was given access to a thermal camera pointing at the Raspberry Pi it runs on, and it began conducting experiments by toggling the fan to observe temperature changes.

0 favorites 0 likes

#llm

@MiaAI_lab: FYI the best Qwen 3.6 35b nvfp4 to run is the @NVIDIAAI nvfp4. Do not use unsloth nvfp4, it performs worse. https://hug…

X AI KOLs Timeline ↗ · yesterday Cached

NVIDIA's nvfp4 quantized version of Qwen 3.6 35B is recommended over the Unsloth variant, offering better performance. The model is available on HuggingFace for use in AI applications.

0 favorites 0 likes

#llm

Prompt Injection as Role Confusion

Hacker News Top ↗ · yesterday Cached

This paper presents a theory that prompt injection attacks on LLMs stem from a fundamental flaw in how models perceive roles, treating roles as a type system for language. It explains existing attacks, predicts new ones, and proposes a research agenda for a science of roles.

0 favorites 0 likes

#llm

Attention Is All You Need

Reddit r/ArtificialInteligence ↗ · yesterday

A reflection on the landmark 'Attention Is All You Need' paper, highlighting how removing recurrence and relying solely on attention mechanisms revolutionized AI and led to modern LLMs like GPT and Claude.

0 favorites 0 likes

#llm

@Anitahityou: The prompt engineering still being touted in '24 is already dead. Today's LLM is an intent reconstructor. Clarity is important, but richness is more important. Because human real thinking is not linear; it is jumpy, chaotic, emotional. An over-compressed prompt can...

X AI KOLs Following ↗ · yesterday Cached

This article argues that traditional prompt engineering is obsolete; modern LLMs are intent reconstructors, and interactions should be through natural, rich conversation rather than condensed instructions.

0 favorites 0 likes

#llm

@AlphaSignalAI: https://x.com/AlphaSignalAI/status/2069064122218717387

X AI KOLs Timeline ↗ · yesterday Cached

This article explores how AI agents can automatically write and optimize their skill files using techniques like SkillOpt from Microsoft Research, which treats skill documents as trainable state and delivers significant performance improvements. It addresses the challenge of manual skill tuning and presents frameworks like GEPA and EvoSkill as evolutionary approaches.

0 favorites 0 likes

#llm

Use AI for reviewing code especially when the diff is huge

Hacker News Top ↗ · yesterday Cached

The article argues that human code reviewers should use AI to handle large diffs, and instead contribute their out-of-distribution knowledge and high-level context.

0 favorites 0 likes

#llm

@huskydogewoof: Just spent 20 mins reading Alisa’s job-search blog. Lots of useful stuff in it! She’s joining OpenAI next week, and sha…

X AI KOLs Timeline ↗ · yesterday Cached

Alisa Liu is joining OpenAI next week and shared a blog with job-search notes, including LLM and math resources.

0 favorites 0 likes

#llm

@VukRosic99: Test Time Reinforcement Learning 1. Take an unlabeled question 2. Sample many answers from the LLM 3. Majority vote → t…

X AI KOLs Timeline ↗ · yesterday Cached

Introduces Test-Time Reinforcement Learning (TTRL), a method that uses majority voting on unlabeled data to create pseudo-labels for RL training, enabling self-improvement of LLMs without ground-truth answers. Achieves significant gains (e.g., +159-211% on AIME 2024 for Qwen-2.5-Math-7B).

1 favorites 1 likes

llm

Submit Feedback