Sakana in Japan released a competitor to the Mythos model, and initial impressions suggest it looks great.
A user expresses confusion about the status or behavior of the Claude Opus 4.8 AI model, prompting discussion.
GLM-5.2 has been released on the DeepSWE platform.
An example of the upcoming GPT bidirectional voice model has been shown.
Claude Sonnet 5 has been spotted, with its release expected next week.
The Vercel CEO expresses surprise at the impressive coding capabilities of the GLM-5.2 AI model.
Vik Paruchuri showcases lift, an open-source extraction model capable of pulling structured data from messy contracts.
Speculation holds that Anthropic's new model Mythos, which finished training in February this year, quietly changed the pace of R&D and drove a significant leap in AI capabilities over the past five months, with leading models now helping to train the next generation.
GLM-5.2 ranks second on the Vending Bench business simulation benchmark while costing less than half as much as Opus.
GLM-5.2 achieves state-of-the-art results on PostTrainBench, outperforming GPT-5.5 and Opus 4.8.
GLM-5.2, an open-weight model with Opus-level design capabilities, incorporates an anti-hacking module trained via RL to mitigate reward hacking and improve performance on long-running tasks.
ArgusRed is a CLI tool that uses a post-trained AI model to run security scans and penetration tests on codebases and outputs detailed Markdown reports. It offers two modes: security scan (read-only) and pen test (active exploits), with optional exploit verification.
A comprehensive guide to setting up, prompting, and using GLM-5.2, including tips and tricks.
GLM-5.2 is now free to use with Hugging Face Inference Providers for the next 6 hours, supporting open-source AI.
OpenAI is preparing to release the GPT-5.6 family, including standard, Mini, and Pro variants, with a rumored 1.5 million token context window and improved agentic coding capabilities, targeting a Tuesday launch amid a competitive landscape with Anthropic.
OpenAI announces GPT-5.5 Instant, now on par with frontier thinking models for health-related questions, available to all free users, with improvements in recognizing urgent care and explaining uncertainty.
The founder of Z.ai expresses confidence in releasing a fable-class GLM model before the end of the year.
GLM-5.2 inference is available for free on Hugging Face for the next 6 hours.
GLM-5.2 is an open-weight AI model optimized for creative writing tasks and is claimed to be the best in its category.
MiniMax released M3, an open-source AI model that leads coding benchmarks and offers a 1M token context for handling entire codebases.