tiny-stories

Tag

Cards List
#tiny-stories

High Dimensional, Dynamic Rotary Positional Embedding [P]

Reddit r/MachineLearning · 3d ago

Introduces HDD-RoPE, an extension of rotary positional embeddings that uses high-dimensional chunks and data-dependent rotation rates, showing faster convergence on TinyStories compared to xPos.

0 favorites 0 likes
← Back to home

Submit Feedback