vram-optimized

Tag

Cards List
#vram-optimized

Ornith-1.0-35B Q3_K_M: ~17 GB VRAM, KLD-checked against BF16

Reddit r/LocalLLaMA · 2d ago

Ornith-1.0-35B Q3_K_M is a 3-bit quantized version of a 35B parameter model, requiring about 17 GB VRAM, with KLD checking against BF16 to ensure fidelity.

0 favorites 0 likes
← Back to home

Submit Feedback