So... has anyone actually figured out whose model Elephant Alpha is yet?

Reddit r/singularity 04/18/26, 02:42 PM News

Summary

Community discusses the identity of 'Elephant Alpha', a 100B parameter model ranked #1 on OpenRouter with 256K context window, fast inference speed, and strong coding capabilities but poor Chinese support, speculating on which company might be behind it.

It's been sitting at #1 on OpenRouter, doing ~250 tps. It's a 100B parameter model, the context window is 256K, and the Chinese language support is notoriously bad. It's clearly heavily optimized for coding and agentic tasks (instruction following is insanely strict). Given the specs and the sheer compute required to serve it this fast for free, the list of companies that could be behind this is pretty short. It doesn't feel like a Google model (they usually share sizes), and the poor Chinese support rules out Qwen/DeepSeek. Are we looking at a new Cohere Command variant? Or maybe a highly optimized MoE from a new startup? What's the current consensus?

Original Article

Similar Articles

I guess Ling-2.6-Flash is actually the stealth model Elephant Alpha that was making waves a few days ago.

Reddit r/LocalLLaMA

Ling-2.6-Flash appears to be the previously rumored stealth model 'Elephant Alpha' that had recently gained attention.

The mysterious Hy3 LLM is topping OpenRouter Model Rankings by a large margin

Hacker News Top

A mysterious model called Hy3 from Tencent has unexpectedly topped OpenRouter's LLM rankings by token usage, despite mediocre benchmark performance and low public awareness. The article investigates the anomaly using OpenRouter's public data.

@cuisitekp: A 9B model outperforms models several times larger. The team behind OLMo/Tülu from Ai2 and the University of Washington released a new paper called Tmax, claiming it's the strongest open-source RL training recipe for 'terminal agents'. Result: A 9B model on Terminal-Be…

X AI KOLs Timeline

Ai2 and the University of Washington released a paper titled Tmax, proposing the strongest open-source terminal agent RL training recipe to date. A 9B parameter model outperforms larger models on Terminal-Bench 2.0, with the key being low-cost generation of vast amounts of verifiable training data, not model size or algorithm.

Tools: Is This a Technical Victory, or a Price War Victory?

Reddit r/artificial

Analysis of OpenRouter data shows that Chinese AI models have become the most used in Kilo Code's coding agent, accounting for 58% of token usage, challenging the dominance of Claude and GPT due to lower cost and longer context windows.

Claude Mythos, Deepseek v4, HappyHorse, Meta’s new AI, realtime video games: AI NEWS

YouTube AI Channels

Anthropic unveils a withheld Claude Mythos model that autonomously finds thousands of 0-days, ZAI open-sources the 1.5 TB GLM-5.1 that tops open-weight benchmarks, Alibaba’s unreleased HappyHorse video model hits #1 on public leaderboards, and Deepseek teases an “Expert Mode” v4 preview.