@MaxForAI: You'd be hard-pressed to find a better eval resource library. If you're interested in eval, these are what you should read. Thanks to @xdotli for sharing.

Summary

Shares a curated AI evaluation (evals) resource library compiled by Xiangyi Li, covering high-quality blogs, podcasts, papers, and projects.

Cached at: 06/24/26, 08:29 PM

It’s hard to find a better eval resource library than this one.

If you’re interested in eval, these are what you should read.

Thanks to @xdotli for sharing.

Xiangyi Li (@xdotli): sharing my personal library on evals 1/n

i put together the highest quality blogs, podcasts, papers, and projects on evals. additions are welcome!

The Unsloth team is terrifying.

They took China’s top open-source model, GLM 5.2, compressed it with an aggressive technique called 1-bit quantization, and converted it into the lightweight GGUF format.

This means you can run GLM 5.2 entirely locally on a 256GB Mac Studio, with no internet connection or external server, at an impressive 21 tokens per second. That is well above the pace of human speech, which works out to only a few tokens per second.

And they didn’t stop there — they also livestreamed a showdown, pitting this compressed local model against the world’s most powerful and expensive paid cloud models: Claude 4.8 Opus and GPT-5.5.

What shocked developers in the comments the most was that this local model actually traded blows with multi-billion-dollar server clusters, delivering intelligent and precise answers that rivaled these closed-source giants.

The real winner today isn’t a specific model — it’s the concept of local inference:

From now on, your data stays on your device.
Your API bill is zero dollars, with answers that rivaled the closed-source giants in this showdown.

What a crazy but revolutionary approach.
