Tag
A daily AI newsletter covering multiple stories, including warnings from central bankers about AI debt bubbles, Chinese developers buying cheap Claude access via gray-market APIs, Sakana's Fugu report, cost comparisons of Chinese vs American AI models, DeepSeek's new inference optimization, and Meta's open-source brain-to-text system.
GLM 5.2, a new version of the GLM language model, has been released, demonstrating improved performance.
According to a rumor mentioned in a Wall Street Journal article, Zhipu AI's new model, possibly GLM 5.2, matches Fable 5 in cybersecurity capabilities.
Zhipu AI has released a new model that reportedly matches the performance of Claude Mythos in identifying security vulnerabilities.
Chinese AI and semiconductor companies are driving a rebound in onshore initial public offerings, reflecting renewed investor interest in the sector.
The author highly praises Tencent's ima AI app, saying it is far superior to its American counterparts in boosting productivity.
Cheap Chinese AI models are rapidly gaining customers in the US market, signaling a significant shift in the competitive landscape.
After firing Junyang Lin, Qwen has locked down its large models and is no longer releasing open-source models, while other Chinese AI labs continue to open-source their latest models. Rumors suggest the small-model team is gone and that Qwen 3.6/3.7 may be the last open-source releases.
Z.ai (formerly Zhipu AI) has released GLM-5.2, a 744-billion parameter Mixture-of-Experts AI model designed for agentic tasks like autonomous software engineering, with a 1-million token context window, low moderation, and trained on domestic Huawei Ascend chips.
This article analyzes the reasons behind the performance leap of Zhipu GLM-5.2, suggesting that its 40B activated parameters provide greater effective capacity once fixed overhead is accounted for, which makes RL post-training more effective. It also reviews the history of Chinese AI model development and notes that the large-model approach ultimately prevailed.
Zhipu AI released GLM 5.2, a new version of their large language model, as demonstrated in a video created using the model itself.
The article reviews the early development history of large models in China, pointing out that BAAI supported the earliest Qingyuan CPM (2020) and Wudao 1.0 (2021), and corrects the claim that Huawei Pangu was the first domestic large model.
This article explains the technical principles of knowledge distillation in machine learning, pointing out that merely collecting output dialogues from ChatGPT/Claude cannot achieve effective distillation due to the lack of probability distribution information, and discusses the limitations of using generated data in SFT and pre-training.
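The distinction that summary draws can be illustrated with a minimal sketch. It assumes the classic soft-target formulation of distillation (a KL divergence between temperature-softened teacher and student distributions); the logits, the 4-token vocabulary, and the temperature below are hypothetical values chosen for illustration and do not come from the article. With only collected dialogue text, the student sees a single token per position, so the second loss is all that can be computed.

```python
import math

def softmax(logits, temperature=1.0):
    """Convert logits to probabilities, softened by a temperature."""
    scaled = [x / temperature for x in logits]
    peak = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(x - peak) for x in scaled]
    total = sum(exps)
    return [e / total for e in exps]

def kl_divergence(p, q):
    """KL(p || q) for two discrete distributions over the same vocabulary."""
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)

def cross_entropy_on_token(q, token_index):
    """Loss when only one observed token is available (no distribution)."""
    return -math.log(q[token_index])

# Hypothetical logits over a 4-token vocabulary (illustrative values only).
teacher_logits = [4.0, 3.5, 0.5, -2.0]
student_logits = [2.0, 0.0, 1.0, -1.0]

T = 2.0  # assumed distillation temperature
teacher_probs = softmax(teacher_logits, T)
student_probs = softmax(student_logits, T)

# Soft-target loss: needs the teacher's full distribution.
# The T*T factor is the conventional gradient-scale correction.
soft_loss = kl_divergence(teacher_probs, student_probs) * T * T

# Text-only setting: the teacher's output is reduced to one token,
# so the information that token 1 was almost as likely is lost.
sampled_token = max(range(len(teacher_logits)), key=teacher_logits.__getitem__)
hard_loss = cross_entropy_on_token(softmax(student_logits), sampled_token)

print(f"soft-target loss: {soft_loss:.4f}")
print(f"hard-label loss:  {hard_loss:.4f}")
```

In this toy setup the teacher rates tokens 0 and 1 as nearly equally plausible, but training on the collected text alone only tells the student that token 0 was emitted. Hosted chat interfaces generally return text rather than full probability distributions, which is the limitation the summary points to.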
Zhipu released GLM 5.2, an open-source model focused on coding that supports a 1M-token context. Tests show it approaches Claude Opus 4.8 on large engineering and coding tasks, but it lacks multimodal capabilities and is constrained by limited compute, which makes it slower. The article also mentions Anthropic shutting down Fable 5 and Mythos 5 at the request of the U.S. Department of Commerce, highlighting the contrast between open-source and closed AI.
A user observes that the Kimi K2.6 model's chain-of-thought has become shorter and more concise, improving coding performance in Kimi Code, and expresses hope for continued open-source competition with the upcoming GLM 5.2 and Fable 5.
A report on an AI-assisted research tool that connects the entire research workflow, from data computation to paper writing to PPT creation, through three Skills (scientific-toolkit, research-writing, office-academic). It supports one-click installation in Claude Code and Codex and is designed Chinese-first.
Chinese AI models like DeepSeek and Qwen deliver competitive performance at 5x–20x lower cost than Western counterparts, reshaping the economics of AI and driving multi-model deployment strategies.
Tencent Workbuddy is gradually becoming a breakout product, a development that may have significant implications.
The article observes that Tencent's Hy3 Preview open model performs surprisingly well in evaluations, narrowing the gap with top closed models, yet remains underdiscussed compared to Western AI labs.
OpenBMB releases MiniCPM5-1B, a leading 1B open-weights LLM that achieves the highest Artificial Analysis Intelligence Index score (17.9) in its size class, surpassing larger models such as Qwen3.5 2B.