@dair_ai: NEW paper from Meta: Agentic Discovery of Neural Architectures. This is a hot new area of research! Keep an eye on it.
Summary
Meta's new paper presents an agentic system that autonomously discovers neural architectures outperforming Llama 3.2 at 350M, 1B, and 3B scales within a 24-hour compute budget.
View Cached Full Text
Cached at: 05/19/26, 12:40 AM
NEW paper from Meta: Agentic Discovery of Neural Architectures.
This is a hot new area of research! Keep an eye on it.
elvis (@omarsar0): NEW paper from Meta.
(bookmark it)
It’s an agent system that autonomously discovers neural architectures that beat Llama 3.2 at 350M, 1B, and 3B scales, all under a 24-hour compute budget.
They get this work by splitting the search into two agents:
> AIRA-Compose searches the
Similar Articles
Agentic Discovery of Neural Architectures: AIRA-Compose and AIRA-Design
This paper introduces AIRA-Compose and AIRA-Design, dual frameworks using AI agents to autonomously discover neural architectures that outperform standard Transformers and scale efficiently.
@dair_ai: NEW paper worth reading. A full agentic workflow can be distilled into model weights and run at roughly 100x lower infe…
This paper demonstrates that agentic workflows can be distilled into small fine-tuned models, achieving near-frontier quality while reducing inference cost by two orders of magnitude compared to orchestration approaches.
@rohanpaul_ai: Very important Meta paper brings Autodata, an agentic data scientist to create high quality synthetic data. The main re…
Meta's new paper 'Autodata' introduces an agentic data scientist that generates and meta-optimizes synthetic training data, significantly outperforming standard methods and enabling a small 4B model to beat a 397B baseline in legal tasks.
@dair_ai: https://x.com/dair_ai/status/2061104052818108476
A roundup of three notable AI papers: SkillOpt treats skill documents as trainable parameters to optimize frozen agents; a new method compiles agentic workflows into model weights for 100x cost reduction; and AutoScientists introduces a decentralized agent team for long-running science without a central planner.
@dair_ai: https://x.com/dair_ai/status/2053495521243799717
DAIR AI's weekly roundup highlights top research papers including HeavySkill, which improves model performance via internalized parallel reasoning, and Sakana AI's Conductor, which uses RL to optimize agent orchestration. It also covers Meta FAIR's work on self-improving pretraining.