deep-context

#deep-context

Nemotron - King of the Deep? Comparison of 4 models <=120B

Reddit r/LocalLLaMA ↗ · yesterday

A comparison of four large language models (≤120B parameters) on deep-context performance, run on Strix Halo hardware. Nemotron Super processes prompts faster than the GPT-OSS and Qwen models at large context depths.


