Reddit

Articles from Reddit

Cards List

A Potential Alignment Vulnerability in LLMs: Behavioral and Hidden-State Evidence from Gemma-3-12B . Pre-token hidden state shift as an alignment policy traversal vector in instruction-tuned LLMs

Reddit r/AI_Agents · 38m ago

This paper investigates an alignment vulnerability in instruction-tuned LLMs, specifically Gemma-3-12B, by showing that pre-token hidden state shifts can act as an alignment policy traversal vector, potentially enabling bypass of safety measures.

0 favorites 0 likes

AI coding agents need a local safety boundary before they touch files or run commands

Reddit r/AI_Agents · 41m ago

Discussion on the need for local safety boundaries in AI coding agents to prevent unauthorized file access or command execution.

0 favorites 0 likes

How do you use AI in education, and is it better than a human editor?

Reddit r/artificial · 50m ago

The article explores how AI is used in education and whether AI editing tools can replace human editors, seeking insights on their real benefits.

0 favorites 0 likes

Diffusion Model that can turn any Image into a Playable Hallucination! BUT LOCALLY, NOT ON DATACENTER

Reddit r/ArtificialInteligence · 51m ago

A diffusion model that can transform any image into an interactive, playable hallucination, running locally on user hardware.

0 favorites 0 likes

How are you evaluating AI features in production?

Reddit r/AI_Agents · 1h ago

A discussion on the methodologies and challenges involved in evaluating AI features once they are deployed in production environments.

0 favorites 0 likes

China's AI chip independence is mostly theater, according to former White House AI advisor Dean Ball

Reddit r/ArtificialInteligence · 1h ago

Former White House AI advisor Dean Ball argues that China's efforts to achieve AI chip independence are largely performative and not substantive.

0 favorites 0 likes

Openrouter model prices implying heavier quantization?

Reddit r/LocalLLaMA · 1h ago

An analysis questioning whether OpenRouter's API pricing for open models like GLM-5.2 implies more aggressive quantization than assumed, given the economics of running large models on expensive hardware like 8xH200.

0 favorites 0 likes

GLM 5.2 on Mac Studio Speedup PR

Reddit r/LocalLLaMA · 1h ago

GLM 5.2 delivers major performance gains on Mac Studio with 512GB RAM, achieving prefill speeds above 100 t/s at high context lengths and enabling 4-bit quantization for contexts over 100k tokens, as detailed in a pull request by the oMLX creator.

0 favorites 0 likes

I benchmarked 8 LLMs for medical scribing. Hallucinations were rare; omissions need attention.

Reddit r/LocalLLaMA · 2h ago

A benchmark of 8 LLMs for medical scribing found hallucinations rare but omissions a concern.

0 favorites 0 likes

Gemini and AI Hallucination

Reddit r/artificial · 2h ago

Discussion of AI hallucination issues in Google's Gemini model, highlighting challenges in reliability and accuracy of large language models.

0 favorites 0 likes

7 Chinese companies are already shipping H100/H200-class AI chips, most IPO'd in the last 6 months. I mapped all of them.

Reddit r/LocalLLaMA · 2h ago

At least seven Chinese companies are shipping H100/H200-class AI accelerators, most having recently IPO'd, with several founded by former NVIDIA/AMD architects. Huawei's Ascend 950 targets H200-class performance, and China's domestic market share is rising as NVIDIA's declines.

0 favorites 0 likes

Your agent and your team should have the same source of truth, but most setups don't

Reddit r/AI_Agents · 3h ago

Highlights the common disconnect between AI agents and human teams sharing the same source of truth, and how most current setups fail to achieve this.

0 favorites 0 likes

AGIBOT is now streaming live their G2 humanoid robots working at real tablet factory

Reddit r/singularity · 3h ago Cached

AGIBOT is live streaming their G2 humanoid robots working on a real tablet production line, showcasing real-world deployment in manufacturing.

0 favorites 0 likes

Krea 2 released on Hugging Face

Reddit r/LocalLLaMA · 3h ago Cached

Krea 2 is a 12-billion parameter text-to-image diffusion model released open-weight on Hugging Face, with Raw (base) and Turbo (post-trained) checkpoints available.

0 favorites 0 likes

I mapped the KLD of KV cache quantization for Qwen3.6-35B-A3B and Gemma4-E2B QAT

Reddit r/LocalLLaMA · 3h ago

The author maps the Kullback-Leibler divergence of KV cache quantization for the Qwen3.6-35B-A3B and Gemma4-E2B QAT models.

0 favorites 0 likes

June delays 5.6

Reddit r/singularity · 3h ago

Multiple AI model releases are delayed: GPT-5.6 now expected mid-July, DeepMind's 3.5 Pro postponed, while OpenAI's Bidi voice model and Claude Sonnet 5 for enterprises see progress.

0 favorites 0 likes

Agent Profiles Make AI Runs Safer, More Focused and Reusable

Reddit r/artificial · 3h ago

Agent Profiles is a new method that enhances AI safety, focus, and reusability by defining structured profiles for AI agents.

0 favorites 0 likes

Real-world GLM 5.2 experiences only — skip generic benchmark scores, how does it hold up on complex production business workloads?

Reddit r/AI_Agents · 4h ago

Discusses real-world experiences with GLM 5.2 in complex production business workloads, focusing on practical performance beyond benchmark scores.

0 favorites 0 likes

Best cheap model for content writing, realistic image generation & vibe coding?

Reddit r/AI_Agents · 4h ago

Asks for recommendations on affordable AI models for content writing, image generation, and vibe coding.

0 favorites 0 likes

Anthropic cofounder predicts singularity in 2028

Reddit r/singularity · 4h ago

Anthropic cofounder Dario Amodei predicts the technological singularity will arrive by 2028.

0 favorites 0 likes
Next →
← Back to home

Submit Feedback