m3-ultra

Tag

Cards List
#m3-ultra

@cryptoresetlife: 本地无审核版 GLM5.2 754B 参数模型 231GB 在我的MAC studio M3 ultra 512gb 上部署成功了 @support_huihui 948 tokens / 4分25秒 = 948 / 265 ≈ 3.6 …

X AI KOLs Timeline · yesterday Cached

An uncensored version of the GLM5.2 754B parameter model (231GB GGUF) was successfully deployed on a Mac Studio M3 Ultra with 512GB RAM, achieving approximately 3.6 tokens/s.

0 favorites 0 likes
#m3-ultra

@antirez: Cool use case

X AI KOLs Following · 4d ago Cached

A user reports that running Hermes Agent as a game master using DeepSeek V4 Flash locally on M3 Ultra yields nearly identical quality to the online version.

0 favorites 0 likes
#m3-ultra

"AWS secures rare Mac Studios while ordinary Apple customers remain completely locked out"

Reddit r/LocalLLaMA · 2026-05-20

AWS has secured a large number of Apple's M3 Ultra Mac Studio units for cloud services, while regular consumers face continued shortages and limited availability.

0 favorites 0 likes
#m3-ultra

@antirez: I didn't expect DeepSeek v4 PRO (not Flash) to run well on the Mac Studio M3 Ultra with 512GB of RAM. This is 2 bit qua…

X AI KOLs Timeline · 2026-05-17 Cached

Antirez reports that DeepSeek v4 PRO runs well on a Mac Studio M3 Ultra with 512GB RAM using 2-bit quantization, achieving 130 t/s prefill and 13 t/s generation.

0 favorites 0 likes
#m3-ultra

@Prince_Canuma: My home compute for MLX and research: • M3 Ultra — 512GB (sponsored by community + @wai_protocol) • RTX PRO 6000 — 96GB…

X AI KOLs Timeline · 2026-04-19

A researcher shares their home compute setup for MLX and AI research, featuring M3 Ultra with 512GB, RTX PRO 6000 with 96GB, and M3 Max with 96GB for model porting and stress testing.

0 favorites 0 likes
← Back to home

Submit Feedback