Tag
MiniMax released M3, an open-weights model combining frontier coding, 1M context, and native multimodality, offering comparable performance to Opus at a fraction of the cost.
The first skill of the AI teaching tool, "3D Geometry Problem Solving," has been released, supporting text-based problem solving, image-based problem solving, and random question generation. Tests show that deepseek-v4-pro offers high cost-performance, with accuracy on some questions even surpassing GPT5.5.
Greg pr07 announces a new browser infrastructure using custom Firecracker VMs, a Chromium fork, and bare metal, offering up to 6x cheaper costs with subsecond cold starts and support for 10,000 concurrent browsers.
This paper introduces a lightweight multimodal LLM-based framework for cost-effective defect grading of power transmission equipment, using in-context learning and chain-of-thought to generate training data and fine-tuning Qwen3-VL-8B for state-of-the-art performance.
DeepSeek has made its V4-Pro API price cut of 75% permanent, with per-million cached input tokens at just $0.003625 and output tokens at $0.87, about 34 times cheaper than OpenAI's GPT-5.5. The model has 1.6 trillion parameters but requires only 49 billion active parameters, supports a 1-million-token context, and leads in coding and reasoning benchmark tests.
Composer 2.5 achieves 63.2% on CursorBench at $0.55 per task, nearly matching top models at 20x lower cost.
Google's Gemini 3.5 Flash model ranks first on Zapier's Automation Bench, outperforming other frontier models at a significantly lower cost.
Built a cheaper alternative to CodeRabbit using open source models, achieving similar or better PR review accuracy at 6x lower cost, with features like auto-fix and prompt-based bug fixing.
The author demonstrates that small vertical language models (6B-15B) can outperform top LLMs on niche benchmarks through cost-effective fine-tuning using open-source models and Codex orchestration, achieving results with a $300 dataset.
Google releases Veo 3.1 Lite, a cost-effective video generation model available on the Gemini API with 50% lower cost than Veo 3.1 Fast while maintaining the same speed. The model supports text-to-video and image-to-video generation with flexible resolutions and aspect ratios.