@andrewchen: finding the main downside with experimenting with local AI models is that you end up buying one GPU, then another, then…

X AI KOLs Following News

Summary

Andrew Chen shares his experience of buying multiple GPUs for local AI experimentation, running Qwen3.6 27B dense at 100 tok/s on a 5090 eGPU, and compares it to Sonnet 4.6.

finding the main downside with experimenting with local AI models is that you end up buying one GPU, then another, then another, then another… But I’m running qwen3.6 27b dense at 100 tok/s now on a 5090 eGPU! It feels like sonnet 4.6? Fast and highly usable I figure the GPUs I have will now increase in value over the next few years so it’ll all be worth it
Original Article
View Cached Full Text

Cached at: 05/19/26, 02:42 AM

finding the main downside with experimenting with local AI models is that you end up buying one GPU, then another, then another, then another…

But I’m running qwen3.6 27b dense at 100 tok/s now on a 5090 eGPU! It feels like sonnet 4.6? Fast and highly usable

I figure the GPUs I have will now increase in value over the next few years so it’ll all be worth it

Similar Articles

we really all are going to make it, aren't we? 2x3090 setup.

Reddit r/LocalLLaMA

A user shares their experience setting up a dual 3090 GPU system to run the Qwen 3.6 27b model locally, achieving over 100 tokens/second after switching to Ubuntu and using the club-3090 tool with custom patches. They express excitement about the future of local AI.