neural-architecture-search

Tag

Cards List
#neural-architecture-search

Agentic Discovery of Neural Architectures: AIRA-Compose and AIRA-Design

Hugging Face Daily Papers · 3d ago Cached

This paper introduces AIRA-Compose and AIRA-Design, dual frameworks using AI agents to autonomously discover neural architectures that outperform standard Transformers and scale efficiently.

0 favorites 0 likes
#neural-architecture-search

Learngene Search Across Multiple Datasets for Building Variable-Sized Models

arXiv cs.LG · 6d ago Cached

This paper introduces LSAMD, a method for extracting 'learngenes' across multiple datasets to initialize variable-sized Vision Transformer models, significantly reducing training costs and storage while maintaining performance comparable to pretrain-finetune methods.

0 favorites 0 likes
← Back to home

Submit Feedback