model-family

Apertus LLM Family Expansion via Distillation and Quantization

arXiv cs.LG · 2026-05-29

This paper validates distillation and quantization as cost-effective methods to expand the Apertus LLM family to new sizes and hardware formats, producing Apertus-v1.1 models with up to 4B parameters trained on 1.7T tokens.
