Tag
Sakana AI has released AI Scientist-v2, an end-to-end automated research system that can autonomously go from generating research hypotheses to writing papers. One of its generated papers was accepted at an ICLR 2025 workshop after peer review.
RocketSmith is an agentic system that uses large language models to automate the design and additive manufacturing of high-powered rockets, achieving successful flight tests with simulation results matching 80% of predicted apogee.
OpenMontage is the world's first open-source agentic video production system. From a single-sentence prompt, it automates the entire workflow from research and scripting to dubbing and compositing. It supports 12 major pipelines and 52 tools, with costs as low as a few cents, and has over 16k stars on GitHub.
A 22-chapter skeleton course on building production AI agents, using an innovative approach where the AI partner fills in details. The course covers tool calling, agent loops, memory, multi-agent collaboration, and more.
Introduces ModSleuth, an agentic system that recursively reconstructs large-scale dependency graphs for LLM development by analyzing public artifacts, revealing multi-hop license obligations and documentation inconsistencies.
A forthcoming guide by Evangelos Pappas on building a secure agentic system with sandboxing, parallel sub-agents, tool calling with control policies, inference routing, and protection against injection and role escalation attacks.
Introduces Benchmark Agent, a fully autonomous system for creating diverse benchmarks with minimal human intervention, enabling continuous model assessment across domains.
Adaptive Auto-Harness is a framework for sustained self-improvement of agentic systems deployed on open-ended task streams, outperforming baselines via a stateful multi-agent evolver, harness tree, and human-steering hooks.
A developer shares their experience building a local autonomous agent with long-term and short-term memory using Qwen 3.6 27B, noting that memory dramatically improves the agent's usefulness and realism. They invite others building similar agents to discuss memory techniques and potential agentic meetups.
The team of Assistant Professor Elliott Wu at the University of Cambridge has released Articraft, an agentic coding system that automatically generates interactive 3D assets with parts, joints, and realistic motion. The team also open-sourced the Articraft-10K dataset, significantly lowering the barrier to obtaining assets for robot training and physical AI.
Microsoft's new multi-model agentic security system (MDASH) tops the CyberGym leaderboard for vulnerability discovery, achieving 35 zero-day findings and demonstrating advanced AI-driven defense capabilities.
The author shares a locally runnable AI companion built with Python, Gemini, and Ollama, featuring a custom cognitive architecture based on Global Workspace Theory and an Integrated Information Theory proxy for personality modeling.