causal-transformers

Tag

Cards List
#causal-transformers

Simplifying the Modeling of Arbitrary Conditionals in Natural Language

arXiv cs.CL · 6d ago Cached

Proposes ac-gpt, a simple modification to causal Transformers that enables evaluating and sampling from arbitrary conditionals (past, future, mixed) in a single forward pass while preserving left-to-right ordering and next-token prediction, allowing existing LLMs to be fine-tuned for arbitrary conditioning.

0 favorites 0 likes
#causal-transformers

Long Context Pre-Training with Lighthouse Attention

Hugging Face Daily Papers · 2026-05-07 Cached

Lighthouse Attention is a training-only hierarchical selection-based attention algorithm that reduces computational complexity for long sequence training of causal transformers, enabling faster pre-training with competitive final loss after a recovery phase.

0 favorites 0 likes
← Back to home

Submit Feedback