Models

@gabepereyra: Harvey partnered with @appliedcompute to train a legal agent. We optimized each part of the agent stack, including the …

X AI KOLs Following · 18h ago Cached

Harvey partnered with Applied Compute to train a legal agent, optimizing the agent stack and post-training the GLM-5.1 model using reward signals from their Legal Agent Benchmark.

an updated GPT-5.5 Cyber outperforms Mythos 5 in CyberGym

Reddit r/singularity · 19h ago

An updated GPT-5.5 Cyber model surpasses Mythos 5 in the CyberGym benchmark.

@cursor_ai: Three announcements from our keynote at Compile, including how we're training a new model with SpaceX.

X AI KOLs Following · 19h ago Cached

Cursor AI announced three updates from its Compile keynote, including training a new model in collaboration with SpaceX.

@MiaAI_lab: FYI the best Qwen 3.6 35b nvfp4 to run is the @NVIDIAAI nvfp4. Do not use unsloth nvfp4, it performs worse. https://hug…

X AI KOLs Timeline · 21h ago Cached

NVIDIA's NVFP4-quantized build of Qwen 3.6 35B is recommended over the Unsloth NVFP4 variant, which the poster says performs worse. The model is available on Hugging Face.

@SpaceTimeViking: I don’t know what to say, even I’m at a loss for words.

X AI KOLs Timeline · 21h ago Cached

A tweet reacts to a claim that AEON-7 is the best model distill; the poster says even they are at a loss for words.

@sama: We want to help all companies be secure, working with the USG and the security ecosystem. *The full version of GPT-5.5-…

X AI KOLs · 21h ago Cached

OpenAI releases the full version of GPT-5.5-Cyber, a cybersecurity-focused AI model with state-of-the-art performance on CyberGym, and announces efforts to improve security through Patch The Planet and Codex Security.

@KyleHessling1: Morning y'all! We've released Qwopus 3.6 27B-Coder-Compat with some compatibility fixes for various harnesses! This ver…

X AI KOLs Timeline · 21h ago Cached

Qwopus 3.6 27B-Coder-Compat is a new GGUF release with compatibility fixes for various harnesses, reducing looping and improving thinking stability. It can generate full HTML games and is suitable for local deployment.

Everything we currently know about GPT 5.6

Reddit r/ArtificialInteligence · 21h ago Cached

According to leaked information, OpenAI's GPT-5.6 is nearly complete. It is expected to have a 1.5-million-token context window, cost about one-third as much as Fable 5, and deliver significantly better front-end generation quality. The release may be delayed until after the 28th.

@AdinaYakup: SingGuard from Ant Group @AntLingAGI A multimodal guardrail where the safety policy is an input, not a fixed weight. - …

X AI KOLs Timeline · 23h ago Cached

SingGuard is a multimodal guardrail system from Ant Group that treats safety policy as an input, allowing dynamic adaptation via natural language. It is released under Apache 2.0 and covers text and image modalities.

@AdinaYakup: Unlimited-OCR New OCR from @PaddlePaddle It can parse hundreds of pages in a single pass while maintaining stable speed…

X AI KOLs Following · yesterday Cached

PaddlePaddle releases Unlimited-OCR, a new OCR model using Reference Sliding Window Attention (R-SWA) to maintain constant KV cache during decoding, achieving 93% on OmniDocBench and a 6% improvement over previous methods.
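
To illustrate why a sliding window keeps decoding memory flat, here is a minimal Python sketch of a generic fixed-size KV cache. It is not the R-SWA mechanism itself, whose details the summary does not give; `SlidingKVCache` and `window` are hypothetical names chosen for illustration.

```python
from collections import deque


class SlidingKVCache:
    """Generic sliding-window KV cache (illustrative only; not the actual R-SWA design)."""

    def __init__(self, window: int):
        # A deque with maxlen silently drops its oldest entry once it is full.
        self.keys = deque(maxlen=window)
        self.values = deque(maxlen=window)

    def append(self, key, value):
        self.keys.append(key)
        self.values.append(value)

    def __len__(self):
        return len(self.keys)


cache = SlidingKVCache(window=4)
sizes = []
for step in range(10):
    # Placeholder strings stand in for real key/value tensors.
    cache.append(f"k{step}", f"v{step}")
    sizes.append(len(cache))

print(sizes)  # grows up to the window size, then stays constant
```

However long the decoded sequence gets, the cache never holds more than `window` entries, which is the property the summary describes as a constant KV cache.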

NEX-N2-mini: "There is no Pareto frontier. I am Pareto". This Qwen3.5-MoE fine tune fixed 3.5 and 3.6 overthinking apparently on my tests.

Reddit r/LocalLLaMA · yesterday

A fine-tuned version of Qwen3.5-MoE called NEX-N2-mini reportedly fixes overthinking issues seen in Qwen 3.5 and 3.6 models.

Gemma4-12B-QAT Uncensored Balanced is out with MTP (~60% speed boost)!

Reddit r/LocalLLaMA · yesterday

Release of Gemma4-12B-QAT Uncensored Balanced, a fine-tuned uncensored model with a multi-token-prediction draft head for ~60% faster speculative decoding, optimized for llama.cpp and offering vision support.
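
As a rough illustration of how a draft head can speed up decoding, the following Python sketch shows greedy speculative decoding with toy stand-in models. Everything here is an assumption for illustration: `target_next` and `draft_next` are hypothetical deterministic functions, not Gemma or llama.cpp APIs, and the verification loop simulates what a real system would do in one batched forward pass.

```python
def target_next(prefix):
    # Hypothetical "large" model: a deterministic toy rule standing in for greedy decoding.
    return (sum(prefix) + len(prefix)) % 7


def draft_next(prefix):
    # Hypothetical cheap draft head: agrees with the target except
    # when the context length is a multiple of 4.
    guess = target_next(prefix)
    return (guess + 1) % 7 if len(prefix) % 4 == 0 else guess


def greedy_decode(prompt, n_new):
    out = list(prompt)
    for _ in range(n_new):
        out.append(target_next(out))
    return out


def speculative_decode(prompt, n_new, k=3):
    out = list(prompt)
    target_passes = 0
    while len(out) - len(prompt) < n_new:
        # 1) Draft k tokens cheaply.
        draft = []
        for _ in range(k):
            draft.append(draft_next(out + draft))
        # 2) Verify the drafted tokens against the target (one simulated pass).
        target_passes += 1
        accepted = []
        for tok in draft:
            expected = target_next(out + accepted)
            if tok == expected:
                accepted.append(tok)
            else:
                accepted.append(expected)  # keep the target's token at the first mismatch
                break
        else:
            # Every draft matched, so the same pass also yields one extra token.
            accepted.append(target_next(out + accepted))
        out.extend(accepted)
    return out[: len(prompt) + n_new], target_passes


spec, passes = speculative_decode([1, 2], n_new=12)
print(spec == greedy_decode([1, 2], n_new=12), passes)
```

The output matches plain greedy decoding token for token, but needs fewer target passes than tokens generated; the actual speedup depends on how often the draft agrees with the target.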

GLM5.2 @7tg on 4x3090 + 192GB on budget motherboard + cpu

Reddit r/LocalLLaMA · yesterday

Running GLM5.2 at roughly 7 tokens per second of generation (the "7tg" in the title) on a budget setup with 4x RTX 3090 GPUs, 192GB of RAM, and a low-cost motherboard and CPU.

@HuggingPapers: Ai2 just released TMax 27B on Hugging Face A 27B terminal agent that hits 42.7% on Terminal Bench 2.0, rivaling models …

X AI KOLs Following · yesterday Cached

Ai2 released TMax 27B, a 27B terminal agent that achieves 42.7% on Terminal Bench 2.0, rivaling models 40 times its size.

PP-OCRv6 on Hugging Face: 50-Language OCR from 1.5M to 34.5M Parameters

Hugging Face Blog · yesterday Cached

PP-OCRv6 is the latest generation of PaddleOCR's universal OCR model family, offering three tiers from 1.5M to 34.5M parameters, supporting 50 languages, and achieving significant accuracy improvements over previous versions.

@manateelazycat: Did a big shot come from Baidu's AI Whampoa Military Academy? The open-source Unlimited OCR, based on DeepSeek OCR, immediately drops a killer move. According to its published data, it scored 93.23 on OmniDocBench v1.5, surpassing DeepSeek OCR and...

X AI KOLs Timeline · yesterday Cached

The open-source OCR model Unlimited OCR, based on DeepSeek OCR, achieves 93.23 on OmniDocBench v1.5 with only 3B parameters, outperforming DeepSeek OCR, Gemini 2.5, and others.

Gemma 4 QAT 31B responds better to KV cache quantization too

Reddit r/LocalLLaMA · yesterday

The Gemma 4 QAT 31B model demonstrates improved behavior with KV cache quantization, suggesting enhanced inference efficiency.
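
For readers unfamiliar with KV cache quantization, here is a minimal Python sketch of symmetric int8 quantization applied to one row of cache values. The post does not say which scheme was used, so this is a generic example under that assumption; `quantize_int8`, `dequantize`, and the sample values are hypothetical.

```python
def quantize_int8(values):
    # Symmetric per-row quantization: one float scale plus integer codes in [-127, 127].
    max_abs = max(abs(v) for v in values) or 1.0  # avoid a zero scale for an all-zero row
    scale = max_abs / 127.0
    codes = [max(-127, min(127, round(v / scale))) for v in values]
    return codes, scale


def dequantize(codes, scale):
    return [c * scale for c in codes]


kv_row = [0.12, -1.5, 0.73, 2.0, -0.05]  # made-up cache values
codes, scale = quantize_int8(kv_row)
restored = dequantize(codes, scale)

# The worst-case rounding error is half a quantization step.
max_err = max(abs(a - b) for a, b in zip(kv_row, restored))
print(codes, max_err <= scale / 2 + 1e-9)
```

Storing integer codes plus one scale per row cuts cache memory relative to 16-bit floats, at the cost of a bounded rounding error; how well a model tolerates that error is what the post is reporting on.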

@VikParuchuri: Surya 2, which has 650M params and scores 83.3% on olmocr, is the most accurate small OCR model. One reason why is char…

X AI KOLs Following · yesterday Cached

Surya 2 is a 650M-parameter OCR model that scores 83.3% on olmocr and is claimed to be the most accurate small OCR model. One stated reason is character tokenization, which is said to help both accuracy and model size.

@GoSailGlobal: Current OCR processes multi-page documents page by page. Every time you turn a page, memory is reset. Today, Baidu quietly open-sourced a model on GitHub and HuggingFace called Unlimited OCR, inspired by how humans copy books: - When copying a book, you don't reread hundreds of pages every time you write a word...

X AI KOLs Timeline · yesterday Cached

Baidu has open-sourced the Unlimited OCR model, which uses a Reference Sliding Window Attention (R-SWA) mechanism to parse documents up to 32K context in a single pass, eliminating the need for page-by-page inference.

GLM 5.2 vs. Opus

Hacker News Top · yesterday Cached

GLM 5.2, a new open-weights model from Z.ai, is compared against Claude Opus on a 3D game coding task. Opus was faster and produced cleaner output, but GLM 5.2 offers compelling cost and accessibility advantages.
