artificial-analysis

#artificial-analysis

Claude Sonnet 5 Artificial Analysis Results & Comparison

Reddit r/singularity · yesterday

Provides analysis and comparison of Claude Sonnet 5's performance across benchmarks.

The gap between open weights LLMs and closed source LLMs

Hacker News Top · 5d ago

Analyzes the gap between open weights and closed source LLMs using the Artificial Analysis Intelligence Index and other benchmarks, finding that the gap is shrinking on some metrics but stable on others.

GLM-5.2 is the new leading open weights model on Artificial Analysis

Hacker News Top · 2026-06-17

Z.ai's GLM-5.2 has become the new leading open weights model on the Artificial Analysis Intelligence Index, scoring 51 and outperforming competitors like MiniMax-M3 and DeepSeek V4 Pro. The model has 744B total parameters (40B active), an MIT license, and a 1M context window.

GLM-5.2 (max) is currently the third best model available, across both open and proprietary.

Reddit r/LocalLLaMA · 2026-06-17

GLM-5.2 (max) is currently ranked as the third best AI model overall according to Artificial Analysis' Intelligence Index, with detailed analysis of intelligence, openness, cost, and token usage.

Claude Fable 5 gets 65 on Artificial Analysis

Reddit r/singularity · 2026-06-09

Claude Fable 5 scored 65 on the Artificial Analysis Intelligence Index.

Qwen3.7 Max scored by Artificial Analysis, 27B/35B waiting room

Reddit r/LocalLLaMA · 2026-05-20

Qwen3.7 Max ranks 5th on Artificial Analysis benchmarks, matching GPT-5.4 and outperforming Gemini 3.5 Flash, while Qwen3.6 27B trails significantly.

@draecomino: Cerebras sets a new record: a one trillion parameter model @ 1,000 tokens/s

X AI KOLs Timeline · 2026-05-19

Cerebras announces it is running Kimi K2.6, a trillion parameter model, at approximately 1,000 tokens per second in enterprise trials, claiming the fastest frontier model performance ever measured by Artificial Analysis.

AA introduces Coding Agent Index - Performance Comparisons between Model & Harness Combinations

Reddit r/singularity · 2026-05-11

Artificial Analysis introduces the Coding Agent Index, a new benchmark suite combining SWE-Bench-Pro-Hard-AA, Terminal-Bench v2, and SWE-Atlas-QnA to evaluate the performance of AI coding agents across diverse tasks.

Kimi K2.6 lands at #4 on the Artificial Analysis Intelligence Index

Reddit r/singularity · 2026-04-21

Moonshot AI's Kimi K2.6 has debuted at fourth place on the Artificial Analysis Intelligence Index, marking a strong benchmark showing for the latest version of the model.
