Tag
Krea 2 is a 12-billion parameter text-to-image diffusion model released open-weight on Hugging Face, with Raw (base) and Turbo (post-trained) checkpoints available.
This post shows how to serve Baidu's Unlimited-OCR model as a temporary, OpenAI-compatible endpoint on Hugging Face Jobs, enabling multi-page document parsing with features like table-to-HTML and equation-to-LaTeX extraction.
A GGUF conversion of MiniMax M3's EAGLE draft model for llama.cpp is now available, enabling speculative decoding speedups on compatible hardware.
Hugging Face describes how they built a weekly release pipeline for their huggingface_hub library using AI, open-source tools, and human oversight, enabling faster and more reliable releases.
The blog post describes using local open-weight models like Gemma and Qwen in an agent harness to automatically triage issues and pull requests in the OpenClaw repository, enabling real-time notifications without relying on costly closed API models.
A tweet recommending 'The Smol Training Playbook' on Hugging Face, a resource that demystifies model training for beginners.
An uncensored version of the gemma-4-12B-coder model created using abliteration to remove refusals, intended for research and experimental use.
Liquid AI demonstrates using LFM2.5-ColBERT-350M as a filter to select only the five most relevant tools from 151 options, reducing latency and improving tool selection accuracy.
GLM-5.2 is an open-source long-context model with a solid 1M-token context, strong coding capabilities, and an MIT license, now available on Hugging Face.
Learn how to use Claude Code with GLM-5.2 via Hugging Face Inference Providers. GLM-5.2 is free for 6 hours on several providers like Together AI, Fireworks, and DeepInfra.
GLM-5.2 is now free to use with Hugging Face Inference Providers for the next 6 hours, supporting open-source AI.
Baidu releases Unlimited-OCR, a new model for one-shot long-horizon document parsing, building on Deepseek-OCR. It supports single image and multi-page/PDF parsing via Hugging Face Transformers and SGLang.
Poolside releases Laguna M.1, a 225B parameter Mixture-of-Experts model with 23B activated parameters per token, designed for agentic coding and long-horizon tasks. It achieves competitive results on SWE-bench benchmarks and is released under an Apache 2.0 license.
A tweet highlighting Joël Niklaus's HuggingFace article on the Synthetic Data Playbook, which inspired the text-albumentations library.
The Hugging Face CLI's `hf upload` command has been fully rewritten with single-pass hashing, multi-commits, and resumable uploads, available in version 1.20.0 for improved speed and cleanliness.
Explores whether LoRA is the best parameter-efficient fine-tuning technique and introduces the PEFT library's tools to compare methods.
AutoMegaKernel is an open-source agent harness that compiles any HuggingFace model into a single persistent megakernel, fusing the entire forward pass into one GPU launch to reduce overhead. It achieves up to 1.33x speedup over CUDA-graphed cuBLAS on inference-class GPUs like L4 and L40S, while proving schedules deadlock- and race-free.
GLM 5.2 GGUF quantized model is released, with 8-bit version half the size of the full model; smaller versions are coming soon.
Moonshot AI 发布了专注于编程的开放式权重模型 Kimi K2.7 Code,拥有1万亿参数和384个专家,性能在MCP工具调用上超越Opus 4.8,成本仅为十分之一。
GLM 5.2 has been released with open weights under MIT license on HuggingFace, available via API and Ollama, featuring competitive benchmarks that trail Opus 4.8 by a point and edge GPT-5.5 by one.