Tag
The author uses personal experience to introduce a tutorial on architect thinking in the AI era, emphasizing that the ability to understand the underlying essence when abstraction leaks is more critical than tool usage, and shares two modes: assembly thinking and object-oriented thinking.
Sam Altman shares his belief that clarity of thinking, speed, and quality of execution are linked, using writing as a tool to clarify thoughts.
The author shares a quantization recipe for Qwen3.6 27B that makes the model use significantly fewer thinking tokens while still producing correct answers, leading to faster inference on math benchmarks.