Memory-Augmented Reinforcement Learning Agent for CAD Generation

arXiv cs.AI Papers

Summary

This paper proposes a memory-augmented reinforcement learning framework for CAD generation agents that integrates geometric kernel toolchains, dual-track memory, and dynamic utility retrieval to handle complex CAD models with long operation sequences and geometric constraints, achieving improved success rate and geometric consistency.

arXiv:2605.19748v1 Announce Type: new Abstract: Automatic generation of computer-aided design (CAD) models is a core technology for enabling intelligence in advanced manufacturing. Existing generation methods based on large language models (LLMs) often fall short when handling complex CAD models characterized by long operation sequences, diverse operation types, and strong geometric constraints, primarily because reasoning chains break and effective error-correction mechanisms are lacking. To address this problem, this paper proposes a memory-augmented reinforcement learning framework for CAD generation agents. The framework encapsulates the underlying geometric kernel into a structured toolchain callable by the agent and builds a closed-loop mechanism of design intent understanding, global planning, execution, and multi-dimensional verification. It also designs a dual-track memory module consisting of a case library and a skill library, and proposes a dynamic utility retrieval algorithm. By introducing reinforcement learning into retrieval and policy optimization, the agent can effectively avoid retrieval traps in which examples are semantically similar but geometrically infeasible, enabling online self-correction and continual evolution without additional large-scale annotated data. Experiments show that the proposed method significantly improves both the success rate and geometric consistency on complex CAD model generation tasks.
Original Article

Similar Articles

Self-Improving CAD Generation Agents with Finite Element Analysis as Feedback

Hugging Face Daily Papers

This paper introduces a new task formulation for CAD generation that incorporates finite element analysis as feedback, along with improved supervision signals like a text-only blueprint schema and multi-view image renderer, leading to better geometric reconstruction on benchmarks.

Learning to Learn from Multimodal Experience

arXiv cs.AI

This paper introduces AutoMMemo, a framework that enables multimodal agents to automatically design memory mechanisms (expressible as executable memo programs) for learning from multimodal interaction trajectories, outperforming no-memory and fixed-memory baselines on GUI/Web navigation and visual reasoning benchmarks.

GenCAD

Hacker News Top

GenCAD introduces an image-conditional model that generates full parametric CAD command histories using transformers and diffusion priors, enabling precise and modifiable 3D modeling from images.