So... has anyone actually figured out whose model Elephant Alpha is yet?

Reddit r/singularity News

Summary

The community discusses the identity of 'Elephant Alpha', a 100B-parameter model ranked #1 on OpenRouter with a 256K context window, fast inference, and strong coding capabilities but poor Chinese support, and speculates about which company might be behind it.

It's been sitting at #1 on OpenRouter, doing ~250 tps. It's a 100B-parameter model with a 256K context window, and its Chinese language support is notoriously bad. It's clearly heavily optimized for coding and agentic tasks (instruction following is insanely strict).

Given the specs and the sheer compute required to serve it this fast for free, the list of companies that could be behind this is pretty short. It doesn't feel like a Google model (they usually share sizes), and the poor Chinese support rules out Qwen/DeepSeek.

Are we looking at a new Cohere Command variant? Or maybe a highly optimized MoE from a new startup? What's the current consensus?
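For a rough sense of why the speed matters here, a back-of-envelope sketch: during decoding, each generated token has to stream the active weights from memory at least once, so a single stream at ~250 tps puts a floor on memory bandwidth. Everything beyond the 100B and ~250 tps figures is an assumption for illustration: 8-bit weights, a hypothetical 10B-active MoE for comparison, and no accounting for KV cache, batching, or speculative decoding.

```python
def required_bandwidth_tb_s(active_params, bytes_per_param, tokens_per_sec):
    """Rough memory bandwidth (TB/s) needed to stream the active weights
    once per generated token for a single decoding stream."""
    return active_params * bytes_per_param * tokens_per_sec / 1e12


TOKENS_PER_SEC = 250    # ~250 tps, from the post
BYTES_PER_PARAM = 1.0   # assumption: 8-bit weights

# Dense case: all 100B parameters are read for every token.
dense = required_bandwidth_tb_s(100e9, BYTES_PER_PARAM, TOKENS_PER_SEC)

# Hypothetical MoE case: only 10B parameters active per token (made-up figure).
moe = required_bandwidth_tb_s(10e9, BYTES_PER_PARAM, TOKENS_PER_SEC)

print(f"dense 100B:      {dense:.1f} TB/s per stream")  # 25.0 TB/s
print(f"MoE, 10B active: {moe:.1f} TB/s per stream")    # 2.5 TB/s
```

Under these assumptions a dense 100B model would need the weights sharded across many accelerators just to hit that per-stream speed, while a sparse model with far fewer active parameters would not. That's at least consistent with the MoE guess, though it doesn't say anything about who built it.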