ornith-35b

#ornith-35b

Ornith 35B works reasonably well with the Qwen3.6 35B DFlash speculative model

Reddit r/LocalLLaMA · 13h ago

Ornith 35B shows a 30-40% token generation speedup when paired with the Qwen3.6 35B DFlash speculative model in llama-server, with an 80% draft acceptance rate on mixed code and text, though prompt processing speed suffers.
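For readers who want to check figures like these on their own setup, here is a minimal sketch of the arithmetic behind the two numbers. The function names and sample values are hypothetical and are not taken from the post or from llama-server; the sketch assumes you have already measured tokens per second with and without the draft model, and counted drafted versus accepted tokens.

```python
def speedup_percent(baseline_tps: float, speculative_tps: float) -> float:
    """Percent gain in generation speed relative to the baseline run."""
    if baseline_tps <= 0:
        raise ValueError("baseline_tps must be positive")
    return (speculative_tps / baseline_tps - 1.0) * 100.0


def acceptance_rate(accepted_tokens: int, drafted_tokens: int) -> float:
    """Fraction of drafted tokens that the target model accepted."""
    if drafted_tokens <= 0:
        raise ValueError("drafted_tokens must be positive")
    return accepted_tokens / drafted_tokens


if __name__ == "__main__":
    # Hypothetical measurements, chosen only to illustrate the calculation.
    gain = speedup_percent(baseline_tps=50.0, speculative_tps=67.5)
    rate = acceptance_rate(accepted_tokens=800, drafted_tokens=1000)
    print(f"speedup: {gain:.1f}%  acceptance: {rate:.0%}")
```

Note that a high acceptance rate does not translate one-to-one into speedup, since running the draft model adds its own cost per step.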
