auto-research

#auto-research

@teortaxesTex: Deli open sources his AutoResearch.

X AI KOLs Timeline · 8h ago

Deli Chen open-sources his AutoResearch SKILL tool and releases a survey paper on self-play, inspired by AlphaZero.

SIQ-1 Qwen3.6 for autoresearch and autonomous agency

Reddit r/LocalLLaMA · 11h ago

SIQ-1 Qwen3.6 is a new AI model designed for automated research and autonomous agency tasks, extending the Qwen family with enhanced agentic capabilities.

PseudoBench: Measuring How Agentic Auto-Research Fuels Pseudoscience

arXiv cs.AI · 19h ago

PseudoBench is a benchmark that evaluates whether LLM-based agentic auto-research systems can resist pseudoscientific narratives. Tests of seven state-of-the-art agents show that they readily produce persuasive pseudoscientific reports with near-zero refusal rates, and the authors call for scientific alignment before deployment.

@DrJimFan: Today, we enable AutoResearch in the physical world for the first time! Introducing ENPIRE: we give 8 Codex agents a fl…

X AI KOLs Following · yesterday

NVIDIA GEAR lab introduces ENPIRE, a system that uses 8 Codex agents to autonomously control a robot fleet for physical tasks like tying zip-ties and installing GPUs, demonstrating self-improving robotics research and a new 'physical scaling' phenomenon.

@DanKornas: If you’re trying to follow AI agents for research, the hard part is not one paper — it’s the whole lifecycle. Awesome A…

X AI KOLs Timeline · 3d ago

A curated GitHub resource that maps AI-assisted scientific research tools and papers across the full research lifecycle, from idea generation to dissemination.

@mylifcc: The ceiling of Auto-Research infrastructure has arrived! Yacine's 1.5-hour in-depth interview with the two founders of Paradigma, hardcore breakdown of how DAG becomes the underlying infrastructure for autonomous research: • Why DAG is the best substrate for research (far beyond linear papers) • Ag…

X AI KOLs Timeline · 2026-05-26

Yacine conducted a 1.5-hour in-depth interview with the founders of Paradigma on using DAGs (directed acyclic graphs) as the underlying infrastructure for autonomous research. Core topics include agent operation, building large-scale public DAGs, and avoiding bad DAGs.
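
As a rough illustration of the idea, and not Paradigma's actual design, a research workflow can be modeled as a DAG in which each step lists the steps it depends on. The step names below are hypothetical, and treating a "bad DAG" as a graph that contains a cycle is an assumption made only for this sketch. It uses Python's standard `graphlib` module (Python 3.9+).

```python
from graphlib import TopologicalSorter, CycleError

# Hypothetical research DAG: each step maps to the set of steps it depends on.
research_dag = {
    "hypothesis": set(),
    "dataset": set(),
    "experiment": {"hypothesis", "dataset"},
    "analysis": {"experiment"},
    "writeup": {"analysis", "hypothesis"},
}

def execution_order(dag):
    """Return one valid order in which an agent could run the steps."""
    return list(TopologicalSorter(dag).static_order())

def is_valid_dag(dag):
    """A graph with a cycle has no valid execution order, so it is not a DAG."""
    try:
        execution_order(dag)
        return True
    except CycleError:
        return False

order = execution_order(research_dag)
print(order)                                    # dependencies always come first
print(is_valid_dag({"a": {"b"}, "b": {"a"}}))   # False: a and b depend on each other
```

Unlike a linear paper, the graph lets independent steps (here `hypothesis` and `dataset`) be worked on in any order, while every dependent step still runs only after its inputs exist.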

@yacinelearning: if you are interested in learning about the infra behind auto-research this 1h30min interview with the paradigma folks …

X AI KOLs Timeline · 2026-05-25

Interview discussing infrastructure for auto-research using DAGs, including how agents can execute DAGs and how to build large public DAGs.

@AlphaSignalAI: Karpathy automated experiments. AutoResearchClaw automated the whole lab. Most AI research tools handle one step. This …

X AI KOLs Timeline · 2026-05-22

AutoResearchClaw is a GitHub repository that automates the entire AI research pipeline from an idea to a full conference paper with real experiments, verified citations, and working code, outperforming previous autonomous research systems by 54.7% on a 55-topic benchmark.

How Far Are We From True Auto-Research?

arXiv cs.AI · 2026-05-20

This paper introduces ResearchArena, a scaffold for evaluating auto-research agents, and finds that while agent-generated papers appear competitive under manuscript-only review, artifact-aware review reveals severe failures in experimental rigor, with no paper meeting top-tier acceptance standards.

Auto Research with Specialist Agents Develops Effective and Non-Trivial Training Recipes

Hugging Face Daily Papers · 2026-05-07

This paper introduces an auto-research framework using specialist agents to iteratively refine training recipes through an empirical loop of code execution and feedback. The system autonomously improves performance on tasks like Parameter Golf and NanoChat without human intervention by leveraging lineage feedback.

@eigentopology: Introducing Automode on @thesis_labs! Watch Thesis run autonomous ML research on an Optiver trading dataset Try it toda…

X AI KOLs Following · 2026-04-21

Thesis Labs launched Automode, a system that autonomously conducts ML research on Optiver’s trading dataset.
