apex-quantization

Qwen3.6-35B-A3B APEX on a Single RTX 3090 - Getting the Most Out of It

Reddit r/LocalLLaMA · yesterday

A detailed guide on running the Qwen3.6-35B-A3B APEX model on an RTX 3090, comparing two llama.cpp forks and quantization methods for optimal speed and quality.
