Sebastian Raschka reviews recent innovations in LLM architectures focused on long-context efficiency, including KV sharing, compressed convolutional attention, and layer-wise attention budgeting from models like Gemma 4, ZAYA1, Laguna XS.2, and DeepSeek V4.
This paper proposes a two-dimensional classification framework for AI agent design patterns, organized along a cognitive-function axis and an execution-topology axis; it identifies 27 named patterns and derives empirical laws from cross-domain analysis.