tokens-per-second

#tokens-per-second

RTX 5080 and RTX 3090 Setup: 80 Tok/s on Qwen 3.6 27B Q8

Hacker News Top ↗ · 2026-06-13

A setup using RTX 5080 and RTX 3090 GPUs achieves 80 tokens per second on the Qwen 3.6 27B Q8 model.

0 favorites 0 likes

#tokens-per-second

How fast is 10 tokens per second really?

Simon Willison's Blog ↗ · 2026-05-20 Cached

Simon Willison explores the practical meaning of 10 tokens per second speed for large language models, offering context on how fast that feels and its implications for usability.

0 favorites 0 likes

tokens-per-second

RTX 5080 and RTX 3090 Setup: 80 Tok/s on Qwen 3.6 27B Q8

How fast is 10 tokens per second really?

Submit Feedback