Kimi released the K2.7 Code model and its high-speed version, and announced API pricing. Compared to rival Mimo, it is more expensive and slower.
Samsung will introduce paid tiers for its SmartThings API starting October 2026, including a $4.99 monthly plan for individual developers, potentially impacting advanced users and Home Assistant integration.
A free RAG API using medical Wikipedia articles is now available to provide local LLMs with accurate medical facts, as demonstrated by correcting hallucinations about Lhermitte sign.
Cloudflare launched self-managed OAuth for all customers, allowing developers to create and manage OAuth clients for delegated API access, improving security and scalability of the Cloudflare app ecosystem.
The article discusses how LLM code style choices affect token consumption and costs, offering optimizations such as using Web API standards and simpler indentation to reduce output tokens.
A cloud lab for biology lets users run experiments via API in minutes, removing the need for expensive automation engineers and long setup times, so that solo founders can conduct thousands of drug screens quickly.
Databricks introduces Agent Mode API for Genie Agent, providing a new interface for building and managing AI agents on the platform.
mcpgen is a CLI tool that turns any OpenAPI 3.x spec or Postman collection into a fully functional, self-contained Python MCP server. It auto-detects authentication, has no runtime dependency, and generates deployable source code.
This paper introduces MemClaw, a governed shared memory architecture for multi-agent LLM systems, formalizing failure modes like unauthorized leakage and stale propagation, and evaluating the system via the ArgusFleet harness.
Anthropic reported and resolved elevated error rates that affected multiple Claude models and services on June 23, 2026, from 14:08 UTC to 15:33 UTC.
Postproxy is an API that allows users to publish, reply to, and analyze social media content programmatically.
The article explains the new HTTP QUERY method defined in RFC 10008, which addresses limitations of GET and POST for complex queries by providing a standard, safe, and idempotent method with a request body.
A brief prediction that in 2025 engineers will integrate LLM APIs into their test harnesses, and in 2026 they will design harnesses to work within their agents.
This article explains vLLM's weight syncing API for reinforcement learning, covering how it facilitates weight updates and KV cache recompute in RL training, with a focus on reducing complexity for training frameworks.
Stripe launched Directory, a searchable catalog of businesses in its network, designed for AI agents and developers to discover and integrate services programmatically.
The article analyzes the revenue sources of major AI chatbots including subscriptions, enterprise contracts, API usage, and cloud partnerships, and questions which company has the strongest long-term business model.
Sakana Fugu dynamically orchestrates a diverse pool of top models to tackle complex, multi-step tasks via a single API, drawing on the team's ICLR 2026 papers on learned orchestration to achieve frontier-level performance without single-vendor dependency.
A developer shares an alternative to repeatedly building the same social media API integration layer, likely a reusable tool or library.
A guide on avoiding rate limits and reducing costs when using the GLM 5.2 model, covering prompt batching, caching, free model alternatives, effort levels, context window management, and self-hosting.
StartupWiki is a free, open startup database for discovering and researching companies without accounts or subscriptions.