Tag
GLM 5.2 has been released with open weights under MIT license on HuggingFace, available via API and Ollama, featuring competitive benchmarks that trail Opus 4.8 by a point and edge GPT-5.5 by one.
Inflect-Nano-v1 is a tiny English text-to-speech model with 4.63M total inference parameters, including its vocoder, designed for local, efficient speech synthesis experiments.
Introduces Glimmer, a 10,000 parameter language model trained on 500K tokens of FineWeb-Edu with a standard Llama architecture, available on HuggingFace.
A health tracking service that reads signals like longevity, heart rate, sleep, and recovery against user baseline is now open to all, with auditable reasoning steps on Hugging Face and privacy protection via OpenMed_AI PII models.
Boogu-Image-0.1 is an Apache-2.0 open-source unified image generation and editing model family, including variants for text-to-image, fast generation, editing, and Chinese-English text rendering, released as a research project on Hugging Face.
The author shares excitement about kernel fusion and demonstrates using HuggingFace's kernels project to profile a GeGLU FFN fused Liger kernel, noting the profile's beauty.
An open-source LLM called OpenMythos was trained for cybersecurity tasks using SFT and RLVR, with datasets available on HuggingFace. The model aims to reduce hallucinations and improve precision in security-related queries.
Joel Niklaus from Hugging Face will give a live stream on synthetic data's role in advancing pretraining; the team has also published a playbook on the topic.
This post demonstrates how to fine-tune a model for free using a single prompt, leveraging the new Google Colab CLI along with Hugging Face's TRL and trackio tools, all orchestrated by an AI agent.
A user warns to download as many open models as possible from Hugging Face, suggesting that open models may be targeted next.
Discusses the fact that for MiniMax M3, sparse attention is not yet supported in GGUF format, so inference falls back to dense attention, potentially using all 428B weights each step, causing significant slowdown.
Google's Gemma 4 12B model, released last week, has already surpassed 4 million downloads on HuggingFace, making it the most popular encoder-free VLM and the first general-purpose LLM with encoder-free audio input. The model balances size and performance, enabling local laptop use with multi-step reasoning and agentic workflows.
Unsloth is uploading a GGUF quantized version of the MiniMax M3 model to Hugging Face.
Supra Labs released Supra Title, a 350M parameter model specialized for generating chat conversation titles. Built on LFM2.5, it runs on any hardware in GGUF format and requires no system prompt.
oMLX, a MLX server for local AI, now supports the standard Hugging Face cache model directory, simplifying model loading.
A new CLI tool for navigating and visualizing Hugging Face repositories, allowing users to explore storage, find outliers, and manage repos directly from the terminal.
Release of fine-tuned versions of Qwen3.5: the Nex-N2 Pro 397B and Nex-N2 Mini 35B, with strong benchmark results.
Hugging Face's Open R1 project provides a fully open reproduction pipeline for DeepSeek-R1, including distilled datasets, training scripts, and evaluation tools, with the goal of enabling anyone to replicate and build on top of R1's reasoning capabilities.
Cohere Transcribe, an open-source speech recognition model, achieved first place on Hugging Face's new Far-Field ASR benchmark.
MooreThreads releases MusaCoder-27B, a 27-billion-parameter code generation model, accompanied by a paper on arXiv.