coding-model

#coding-model

Releasing Cohere North Mini Code

Reddit r/LocalLLaMA ↗ · yesterday

Cohere officially launches North Mini Code, a coding model, with weights available on Hugging Face and deployment support for vLLM and MLX.

#coding-model

@cohere: Introducing Cohere's first open-source coding model: North Mini Code Small & efficient, designed for agentic performanc…

X AI KOLs Following ↗ · yesterday

Cohere released its first open-source coding model, North Mini Code, a small and efficient model designed for agentic performance and community input.

#coding-model

Cohere's unreleased coding model (early access for localllama)

Reddit r/LocalLLaMA ↗ · 4d ago

Cohere has released an early-access coding model, BLS-Mini-Code-1.0, a 30B-parameter model available on Hugging Face for testing.

#coding-model

Building a monokernel for LLM inference on AMD MI300X - up to 3,300 output tokens/s per request [P]

Reddit r/MachineLearning ↗ · 2026-05-29

A monokernel approach for LLM decoding on AMD MI300X GPUs achieves up to 3,300 output tokens/s per request without speculative decoding or quantization, using memory access patterns mapped to the die topology.

#coding-model

@intheworldofai: Qwen 3.7-Max is genuinely one of the most impressive agentic coding models I’ve tested in a while. I had it generate a …

X AI KOLs Timeline ↗ · 2026-05-22

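Alibaba has released Qwen 3.7 Max, a flagship coding model designed for the agent era. The model stands out in long-horizon autonomous execution, front-end generation, and 3D scene construction. On several benchmarks it matches or even surpasses top closed-source models, making it a near-frontier Chinese model.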

#coding-model

High VRAM local coding model — still Qwen 3.6 27B?

Reddit r/LocalLLaMA ↗ · 2026-05-12

The user discusses their experience with Qwen 3.6 27B for local coding tasks and asks for recommendations for larger models (100B+) suitable for systems with 224GB of VRAM.

#coding-model

Addendum to GPT-5.2 System Card: GPT-5.2-Codex

OpenAI Blog ↗ · 2025-12-18

OpenAI releases GPT-5.2-Codex, an advanced agentic coding model optimized for complex software engineering tasks with improved long-horizon capabilities, Windows support, and cybersecurity features. The release includes comprehensive safety documentation through a system card outlining model and product-level mitigations.

#coding-model

GPT-5.1-Codex-Max System Card

OpenAI Blog ↗ · 2025-11-19

OpenAI releases GPT-5.1-Codex-Max, a frontier agentic coding model trained on software engineering tasks with native multi-context window support through compaction, designed to handle millions of tokens in a single task. The system card details comprehensive safety measures and preparedness framework evaluations across cybersecurity, biology, and AI self-improvement domains.

#coding-model

Building more with GPT-5.1-Codex-Max

OpenAI Blog ↗ · 2025-11-19

OpenAI introduces GPT-5.1-Codex-Max, a new agentic coding model with improved reasoning, token efficiency, and the ability to maintain coherent work across millions of tokens through a 'compaction' mechanism. The model is faster, more intelligent, and can sustain long-running tasks for hours or days, representing a significant advancement in AI-assisted software engineering.
