Tag
Cohere officially launches North Mini Code, a coding model, with weights available on Hugging Face and deployment support for vLLM and MLX.
Cohere released its first open-source coding model, North Mini Code Small, designed for efficient agentic performance and community input.
Cohere has released an early access coding model, BLS-Mini-Code-1.0, a 30B parameter model available on Hugging Face for testing.
A monokernel approach for LLM decoding on AMD MI300X GPUs achieves up to 3,300 output tokens/s per request without speculative decoding or quantization, using memory access patterns mapped to the die topology.
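Alibaba has released Qwen 3.7 Max, a flagship coding model designed for the agent era. The model stands out in long-horizon autonomous execution, frontend generation, and 3D scene construction, matches or even surpasses top closed-source models on several benchmarks, and is a Chinese model approaching the frontier.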
A user describes their experience with Qwen 3.6 27B for local coding tasks and asks for recommendations for larger models (100B+ parameters) suited to systems with 224GB of VRAM.
OpenAI releases GPT-5.2-Codex, an advanced agentic coding model optimized for complex software engineering tasks with improved long-horizon capabilities, Windows support, and cybersecurity features. The release includes comprehensive safety documentation through a system card outlining model and product-level mitigations.
OpenAI releases GPT-5.1-Codex-Max, a frontier agentic coding model trained on software engineering tasks with native multi-context window support through compaction, designed to handle millions of tokens in a single task. The system card details comprehensive safety measures and preparedness framework evaluations across cybersecurity, biology, and AI self-improvement domains.
OpenAI introduces GPT-5.1-Codex-Max, a new agentic coding model with improved reasoning, token efficiency, and the ability to maintain coherent work across millions of tokens through a 'compaction' mechanism. The model is faster, more intelligent, and can sustain long-running tasks for hours or days, representing a significant advancement in AI-assisted software engineering.