efficient-training

#efficient-training

Long Context Pre-Training with Lighthouse Attention

Hugging Face Daily Papers · 2026-05-07

Lighthouse Attention is a training-only, hierarchical, selection-based attention algorithm that reduces the computational complexity of long-sequence training for causal transformers, enabling faster pre-training with a competitive final loss after a recovery phase.

#efficient-training

Motif-Video 2B: Technical Report

Hugging Face Daily Papers · 2026-04-14

Motif-Video 2B is a 2B-parameter text-to-video generation model that scores 83.76% on VBench, surpassing Wan2.1 14B with 7x fewer parameters. It was trained on fewer than 10M clips in under 100,000 H200 GPU hours. The model uses a specialized architecture with shared cross-attention and a three-part backbone that separates prompt alignment, temporal consistency, and detail refinement.

#efficient-training

LlamaFactory: Unified Efficient Fine-Tuning of 100+ Language Models

Papers with Code Trending · 2024-03-20

LlamaFactory is a unified framework for efficient fine-tuning of over 100 large language models through a web-based interface, with no coding required.

