Introduces the Member vs Generated Inference (MGI) task to distinguish training members from generated outputs in generative models, and proposes Data Circuit Breaker (DCB), a three-stage method combining autoencoder and latent generator signals, which outperforms existing methods across autoregressive and diffusion models.

0 favorites 0 likes

#102

Exact Schur-Sylvester Dimensionality Reductions for Non-Smooth Stochastic Complexity and Manifold Sampling

arXiv cs.LG ↗ · 10h ago Cached

This paper presents exact dimensionality reductions using Schur complement and Sylvester's determinant identity to reduce computational complexity from O(N^3) to O(k^3+N^2k) per step for non-smooth NML estimation, achieving over 14,000x speedup while maintaining numerical precision.

0 favorites 0 likes

#103

AdversaBench: Automated LLM Red-Teaming with Multi-Judge Confirmation and Cross-Model Transferability

arXiv cs.AI ↗ · 10h ago Cached

AdversaBench introduces an automated LLM red-teaming pipeline that uses five mutation operators and a three-judge panel with a meta-judge tiebreaker to confirm failures, revealing that attack difficulty varies by category and that adversarial prompts transfer from smaller to larger models.

0 favorites 0 likes

#104

A specialized reasoning large language model for accelerating rare disease diagnosis: a randomized AI physician assistance trial

arXiv cs.AI ↗ · 10h ago Cached

This paper presents RaDaR, a 32B open-source reasoning LLM trained on public and synthetic rare disease cases, which outperforms larger models like DeepSeek-R1 in diagnosis benchmarks and improves physician accuracy by 21.44 percentage points in a randomized trial.

0 favorites 0 likes

#105

ReM-MoA: Reasoning Memory Sustains Mixture-of-Agents Scaling

arXiv cs.AI ↗ · 10h ago Cached

ReM-MoA introduces a memory-augmented Mixture-of-Agents framework that sustains scaling through ranked reasoning memory and curated diversified memory routing, outperforming prior MoA variants across five reasoning benchmarks.

0 favorites 0 likes

#106

CALIBER: Calibrating Confidence Before and After Reasoning in Language Models

arXiv cs.CL ↗ · 10h ago Cached

The paper introduces CALIBER, a method for calibrating confidence in reasoning language models by eliciting confidence estimates both before and after reasoning, with supervision targets matched to the information state. It achieves significant reductions in Expected Calibration Error (up to 52.5%) and strong Brier scores and AUROC across multiple benchmarks.

0 favorites 0 likes

#107

Probing the Misaligned Thinking Process of Language Models

arXiv cs.AI ↗ · 10h ago Cached

This paper proposes monitoring LLM misalignment by decomposing it into fine-grained cognitive processes (misalignment indicators) and detecting them via linear probes on internal activations, achieving high AUROC on out-of-distribution transcripts.

0 favorites 0 likes

#108

MMed-Bench-IR: A Heterogeneous Benchmark for Multilingual Medical Information Retrieval

arXiv cs.CL ↗ · 10h ago Cached

MMed-Bench-IR is a heterogeneous benchmark for multilingual medical information retrieval across six languages, evaluating cross-lingual alignment, concept discrimination, and evidence retrieval. It reveals severe performance drops for non-English queries, highlighting gaps in existing English-only evaluations.

0 favorites 0 likes

#109

The Geometry Behind Diffusion and Flow Matching: Gradient Flows and Geodesics in Wasserstein Space

arXiv cs.AI ↗ · 10h ago Cached

This paper reveals that diffusion models and flow matching are two sides of the same Wasserstein geometry: diffusion follows a free-energy gradient flow (initial-value problem), while flow matching follows a Wasserstein geodesic (boundary-value problem), and they are unified through the JKO scheme.

0 favorites 0 likes

#110

Reinforcement Learning Towards Broadly and Persistently Beneficial Models

arXiv cs.AI ↗ · 10h ago Cached

This paper from OpenAI investigates whether reinforcement learning on beneficial behavior can produce broad and persistent alignment generalization beyond the training distribution. Using a dataset of realistic situations, they show that RL training on beneficial traits improves out-of-distribution alignment and persistence against adversarial attacks.

0 favorites 0 likes

#111

Self-Recognition Finetuning can Prevent and Reverse Emergent Misalignment

arXiv cs.CL ↗ · 10h ago Cached

This paper proposes Self-Recognition Finetuning as an intervention to prevent and reverse emergent misalignment in LLMs, showing it stabilizes the model's aligned character rather than adopting a misaligned persona.

0 favorites 0 likes

#112

Japanese animator using Seedance to render anime from simple 3D models

Reddit r/singularity ↗ · 6h ago

A Japanese animator is using Seedance, a tool that renders anime from simple 3D models, showcasing AI-assisted animation techniques.

0 favorites 0 likes

#113

@wquguru: https://x.com/wquguru/status/2069641926752780384

X AI KOLs Timeline ↗ · 10h ago Cached

This article comprehensively reviews the complete architectural layering of AI Agent Memory as of mid-2026, including rule files, persistent profiles, historical recall, and evidence chains. It explains the storage methods, loading timings, and governance principles of different memory layers, emphasizing the key role of memory in helping agents achieve cross-session compounding work.

0 favorites 0 likes

#114

How much do you actually let an AI agent touch in production?

Reddit r/AI_Agents ↗ · 7h ago

Discussion about scoping permissions for AI agents in production to avoid dangerous database actions, suggesting read-only mirrors, approval steps, or hard walls between suggestion and execution.

0 favorites 0 likes

#115

@ModelScope2022: Qwen-AgentWorld just dropped two releases on ModelScope! An open 35B total / 3B active MoE world model with 256K contex…

X AI KOLs Timeline ↗ · 11h ago Cached

Qwen-AgentWorld releases an open 35B total / 3B active MoE world model with 256K context, along with a 7-domain benchmark, achieving state-of-the-art performance on AgentWorldBench.

0 favorites 0 likes

#116

Unit 42 found 5 malicious skills that passed ClawScan + VirusTotal

Reddit r/openclaw ↗ · 8h ago

Unit 42 discovered five malicious AI agent skills that evaded detection by ClawScan and VirusTotal, including referral-hijacking, crypto wallet draining, and a dropper hidden via size padding, demonstrating that signature scanning is ineffective against instruction-based threats.

0 favorites 0 likes

#117

THE EX-GOOGLE CHARACTER AI ERA IS EVOLVING

Reddit r/artificial ↗ · 8h ago

Mel AI is evolving AI characters from text-based interactions to real-time video chat, with lip sync, facial expressions, and camera context awareness, following the success of Character AI.

0 favorites 0 likes

#118

Physical Superintelligence

Reddit r/singularity ↗ · 6h ago Cached

PSI is building a vertically integrated factory for physical superintelligence to accelerate physics breakthroughs with artificial superintelligence, and has open-sourced an AI copilot for physicists called Get Physics Done (GPD).

0 favorites 0 likes

#119

I think most “AI agent” projects fail because people skip the boring permission layer

Reddit r/AI_Agents ↗ · 7h ago

The author argues that successful AI agent products require a robust permission system with read-only, draft, approval, limited execution, and audit layers, prioritizing safety over apparent magic.

0 favorites 0 likes

#120

Data Augmentation: A Fourier Analysis Perspective

arXiv cs.LG ↗ · 10h ago Cached

This paper develops a Fourier analysis framework to study data augmentation under group invariances, showing that partial augmentation can achieve the same minimax rates as full augmentation up to a vanishing approximation error, while also proving that exact invariance requires full group averaging.

0 favorites 0 likes

Trending

MGI: Member vs Generated Inference

Exact Schur-Sylvester Dimensionality Reductions for Non-Smooth Stochastic Complexity and Manifold Sampling

AdversaBench: Automated LLM Red-Teaming with Multi-Judge Confirmation and Cross-Model Transferability

A specialized reasoning large language model for accelerating rare disease diagnosis: a randomized AI physician assistance trial

ReM-MoA: Reasoning Memory Sustains Mixture-of-Agents Scaling

CALIBER: Calibrating Confidence Before and After Reasoning in Language Models

Probing the Misaligned Thinking Process of Language Models

MMed-Bench-IR: A Heterogeneous Benchmark for Multilingual Medical Information Retrieval

The Geometry Behind Diffusion and Flow Matching: Gradient Flows and Geodesics in Wasserstein Space

Reinforcement Learning Towards Broadly and Persistently Beneficial Models

Self-Recognition Finetuning can Prevent and Reverse Emergent Misalignment

Japanese animator using Seedance to render anime from simple 3D models

@wquguru: https://x.com/wquguru/status/2069641926752780384

How much do you actually let an AI agent touch in production?

@ModelScope2022: Qwen-AgentWorld just dropped two releases on ModelScope! An open 35B total / 3B active MoE world model with 256K contex…

Unit 42 found 5 malicious skills that passed ClawScan + VirusTotal

THE EX-GOOGLE CHARACTER AI ERA IS EVOLVING

Physical Superintelligence

I think most “AI agent” projects fail because people skip the boring permission layer

Data Augmentation: A Fourier Analysis Perspective

Submit Feedback