@garrytan: Thinking Machines is impressive. In a couple hours I just fine tuned my own Qwen3.5-397B model this afternoon. Fast usa…

X AI KOLs Following News

Summary

Garry Tan tweets that he fine-tuned a Qwen3.5-397B model in a couple hours using Thinking Machines, praising its speed and usability for multimodal personal AI.

Thinking Machines is impressive. In a couple hours I just fine tuned my own Qwen3.5-397B model this afternoon. Fast usable multimodal is also going to enable very mind-blowing personal AI.
Original Article
View Cached Full Text

Cached at: 05/24/26, 04:15 AM

Thinking Machines is impressive. In a couple hours I just fine tuned my own Qwen3.5-397B model this afternoon.

Fast usable multimodal is also going to enable very mind-blowing personal AI.

Thinking Machines (@thinkymachines): People talk, listen, watch, think, and collaborate at the same time, in real time. We’ve designed an AI that works with people the same way.

We share our approach, early results, and a quick look at our model in action.

Similar Articles

@zhixianio: After receiving the new machine, I began an 'ascetic' practice of forcing myself to use local models for common tasks. I thought it would be painful, but both speed and quality greatly exceeded my expectations: Model: Qwen3.6-35B-A3B-oQ6-fp16-mtp, Running: oMLX, with N…

X AI KOLs Timeline

The author uses the Qwen3.6-35B-A3B model and oMLX tool on the new local machine for daily tasks, finding that both speed and quality far exceed expectations, even outperforming remote LLMs in PA and coding scenarios, demonstrating a significant improvement in on-device AI capabilities.

Qwen 3.7 Max

Reddit r/LocalLLaMA

Qwen 3.7 is an impressive new AI model from Chinese labs, with discussion on whether weights will be available for download.

Qwen3.7 Preview lands on Arena (1 minute read)

TLDR AI

Alibaba Qwen announces two major model releases: Qwen3-Omni, the first natively end-to-end omni-modal AI unifying text, image, audio and video, and Qwen3-Next-80B-A3B, an ultra-efficient MoE model with 3B activated parameters per token, achieving SOTA performance and 10x faster inference than Qwen3-32B.