convergence-speed


Three-Phase Transformer

Hugging Face Daily Papers · 2026-04-15

This research paper introduces the Three-Phase Transformer (3PT), which applies Tesla's polyphase geometry to transformer architectures by organizing the residual stream into three phases offset by 120°. The approach reports a 7.2% perplexity improvement on WikiText-103 with a parameter overhead of only 0.00124%, along with a 1.93× convergence speedup.
