@DeRonin_: My current local AI setup: - 2x DGX Spark linked (256gb) > GLM 5.2 @ 2bit, reasoning + agent loops - Mac Studio M3 Ultr…

X AI KOLs Following 06/30/26, 12:51 PM News

local-ai self-hosting chinese-models qwen kimi dgx-spark hardware-setup

Summary

A user describes their fully local AI stack using multiple hardware devices running Chinese models like GLM, Qwen, and Kimi, claiming 87% cost savings compared to frontier models like GPT-5.5 and Opus 4.8, while noting plans to self-host video generation.

My current local AI setup: - 2x DGX Spark linked (256gb) > GLM 5.2 @ 2bit, reasoning + agent loops - Mac Studio M3 Ultra 96gb > Wan 2.2, image generation - Mac mini M5 Pro 64gb > Qwen3.6-35B, code + content drafts - MB Air M5 24gb > Qwen3 30B-A3B, bulk processing - iPhone > Qwen3 4B, on-device every model above runs on hardware i own, weights downloaded, no api key in the loop the one thing i don't self-host yet is video.. the open video models want a dedicated gpu box, so that's my next build (when i figure out how to make $100k MRR on it lol) the other one i'm scaling toward is Kimi K2.7 fully local.. it's a 1T model so it needs a real gpu server, adding it as the revenue grows with MiMo V2.5 the same as with Kimi and Kling frontier ai used to need someone else's datacenter.. now it fits on my desk

Original Article

View Cached Full Text

Cached at: 06/30/26, 01:46 PM

My current local AI setup:

2x DGX Spark linked (256gb) > GLM 5.2 @ 2bit, reasoning + agent loops
Mac Studio M3 Ultra 96gb > Wan 2.2, image generation
Mac mini M5 Pro 64gb > Qwen3.6-35B, code + content drafts
MB Air M5 24gb > Qwen3 30B-A3B, bulk processing
iPhone > Qwen3 4B, on-device

every model above runs on hardware i own, weights downloaded, no api key in the loop

the one thing i don’t self-host yet is video.. the open video models want a dedicated gpu box, so that’s my next build (when i figure out how to make $100k MRR on it lol)

the other one i’m scaling toward is Kimi K2.7 fully local.. it’s a 1T model so it needs a real gpu server, adding it as the revenue grows

with MiMo V2.5 the same as with Kimi and Kling

frontier ai used to need someone else’s datacenter.. now it fits on my desk

currently i guess it’s valued around $20k

@DeRonin_: My current local AI setup: - 2x DGX Spark linked (256gb) > GLM 5.2 @ 2bit, reasoning + agent loops - Mac Studio M3 Ultr…

Similar Articles

@TheAhmadOsman: Gentle reminder that all you need to start with Local AI is: - 2x RTX 3090s (pick up for $700-$900 on r/hardwareswap) -…

@DeRonin_: My entire AI stack is now Chinese 87% cheaper. same revenue swaps by task: 1. reasoning / backend brain Opus 4.8 → Kimi…

@RayFernando1337: https://x.com/RayFernando1337/status/2070621713952579990

@andrewchen: finding the main downside with experimenting with local AI models is that you end up buying one GPU, then another, then…

@rohanpaul_ai: atomic[.]chat (a desktop app that runs LLMs locally) ran a very revealing comparison for local AI agents, on a MacBook …

Submit Feedback

Similar Articles

@TheAhmadOsman: Gentle reminder that all you need to start with Local AI is: - 2x RTX 3090s (pick up for $700-$900 on r/hardwareswap) -…

@DeRonin_: My entire AI stack is now Chinese 87% cheaper. same revenue swaps by task: 1. reasoning / backend brain Opus 4.8 → Kimi…

@RayFernando1337: https://x.com/RayFernando1337/status/2070621713952579990

@andrewchen: finding the main downside with experimenting with local AI models is that you end up buying one GPU, then another, then…

@rohanpaul_ai: atomic[.]chat (a desktop app that runs LLMs locally) ran a very revealing comparison for local AI agents, on a MacBook …