Ornith-1.0-35B Q3_K_M: ~17 GB VRAM, KLD-checked against BF16

Reddit r/LocalLLaMA Models

Summary

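Ornith-1.0-35B Q3_K_M is a Q3_K_M (nominally 3-bit) quantization of the 35B-parameter Ornith-1.0 model. It needs about 17 GB of VRAM, and its outputs were checked against the BF16 original using KL divergence (KLD) to gauge how much fidelity the quantization loses.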
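For readers unfamiliar with a KLD check, the idea can be sketched as follows: run the same text through the BF16 reference and the quantized model, turn each model's logits into a next-token distribution, and average KL(reference || quantized) over token positions. This is a minimal illustration in plain Python, not the procedure the post used (the source does not describe it); the function names and the toy logits are assumptions made up for the example.

```python
import math

def log_softmax(logits):
    """Log-probabilities from raw logits, using the log-sum-exp trick for stability."""
    m = max(logits)
    lse = m + math.log(sum(math.exp(x - m) for x in logits))
    return [x - lse for x in logits]

def kl_divergence(ref_logits, quant_logits):
    """KL(P_ref || P_quant) in nats for a single token position."""
    log_p = log_softmax(ref_logits)
    log_q = log_softmax(quant_logits)
    return sum(math.exp(lp) * (lp - lq) for lp, lq in zip(log_p, log_q))

def mean_kld(ref_rows, quant_rows):
    """Average the per-position KLD over all token positions."""
    values = [kl_divergence(r, q) for r, q in zip(ref_rows, quant_rows)]
    return sum(values) / len(values)

if __name__ == "__main__":
    # Hypothetical logits over a 3-token vocabulary at two positions.
    # Real logits would come from running both models on the same text.
    ref_logits = [[2.0, 1.0, 0.1], [0.5, 2.5, 0.0]]
    quant_logits = [[1.9, 1.1, 0.1], [0.4, 2.6, 0.1]]
    print(f"mean KLD: {mean_kld(ref_logits, quant_logits):.6f} nats")
```

A mean KLD near zero means the quantized model's next-token distributions closely track the reference; larger values indicate more drift. What counts as acceptable is a judgment call that the summary does not state.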


Similar Articles

Ornith-1.0 released on Hugging Face

Reddit r/LocalLLaMA

Ornith-1.0 has been released on Hugging Face as a collection of models ranging from 9B to 397B parameters, in both dense and MoE architectures. The release claims state-of-the-art performance on various benchmarks.