huggingface

#huggingface

GLM 5.2 API is live, weights are on HF, and ollama has it already

Reddit r/LocalLLaMA ↗ · 2026-06-16

GLM 5.2 has been released with open weights under MIT license on HuggingFace, available via API and Ollama, featuring competitive benchmarks that trail Opus 4.8 by a point and edge GPT-5.5 by one.

0 favorites 0 likes

#huggingface

owensong/Inflect-Nano-v1

Hugging Face Models Trending ↗ · 2026-06-16 Cached

Inflect-Nano-v1 is a tiny English text-to-speech model with 4.63M total inference parameters, including its vocoder, designed for local, efficient speech synthesis experiments.

0 favorites 0 likes

#huggingface

Glimmer 1 - Glint Research. A foundational 10,000 parameter language model

Reddit r/LocalLLaMA ↗ · 2026-06-16

Introduces Glimmer, a 10,000 parameter language model trained on 500K tokens of FineWeb-Edu with a standard Llama architecture, available on HuggingFace.

0 favorites 0 likes

#huggingface

@MaziyarPanahi: 110+ people asked for early access before today. Now it's open to everyone. It reads the signals longevity tracks, hear…

X AI KOLs Following ↗ · 2026-06-16 Cached

A health tracking service that reads signals like longevity, heart rate, sleep, and recovery against user baseline is now open to all, with auditable reasoning steps on Hugging Face and privacy protection via OpenMed_AI PII models.

0 favorites 0 likes

#huggingface

Boogu/Boogu-Image-0.1-Edit

Hugging Face Models Trending ↗ · 2026-06-16 Cached

Boogu-Image-0.1 is an Apache-2.0 open-source unified image generation and editing model family, including variants for text-to-image, fast generation, editing, and Chinese-English text rendering, released as a research project on Hugging Face.

0 favorites 0 likes

#huggingface

@ariG23498: I was fascinated when I first heard about kernel fusion from @cHHillee's blog post "Making Deep Learning Go Brrrr From …

X AI KOLs Timeline ↗ · 2026-06-16 Cached

The author shares excitement about kernel fusion and demonstrates using HuggingFace's kernels project to profile a GeGLU FFN fused Liger kernel, noting the profile's beauty.

0 favorites 0 likes

#huggingface

We trained a cybersecurity-focused Mythos like LLM open weights on HuggingFace

Reddit r/LocalLLaMA ↗ · 2026-06-15

An open-source LLM called OpenMythos was trained for cybersecurity tasks using SFT and RLVR, with datasets available on HuggingFace. The model aims to reduce hallucinations and improve precision in security-related queries.

0 favorites 0 likes

#huggingface

@yacinelearning: okay folks buckle up because this thursday we have @joelniklaus from @huggingface that will join us on stream to teach …

X AI KOLs Timeline ↗ · 2026-06-15 Cached

Joel Niklaus from Hugging Face will give a live stream on synthetic data's role in advancing pretraining; the team has also published a playbook on the topic.

0 favorites 0 likes

#huggingface

@SergioPaniego: https://x.com/SergioPaniego/status/2066498136273531363

X AI KOLs Timeline ↗ · 2026-06-15 Cached

This post demonstrates how to fine-tune a model for free using a single prompt, leveraging the new Google Colab CLI along with Hugging Face's TRL and trackio tools, all orchestrated by an AI agent.

0 favorites 0 likes

#huggingface

@0xSero: If you have the NVMe Go download as many models as you think you might ever want. Now, go on Huggingface. They’re comin…

X AI KOLs Following ↗ · 2026-06-13 Cached

A user warns to download as many open models as possible from Hugging Face, suggesting that open models may be targeted next.

0 favorites 0 likes

#huggingface

"inference falls back to dense attention" for MiniMax M3 - does it mean 428B weights used at each step?

Reddit r/LocalLLaMA ↗ · 2026-06-12

Discusses the fact that for MiniMax M3, sparse attention is not yet supported in GGUF format, so inference falls back to dense attention, potentially using all 428B weights each step, causing significant slowdown.

0 favorites 0 likes

#huggingface

@AndreasPSteiner: Released last week, and already more than 4M downloads on HuggingFace alone This makes Gemma 4 12B the most popular enc…

X AI KOLs Timeline ↗ · 2026-06-12 Cached

Google's Gemma 4 12B model, released last week, has already surpassed 4 million downloads on HuggingFace, making it the most popular encoder-free VLM and the first general-purpose LLM with encoder-free audio input. The model balances size and performance, enabling local laptop use with multi-step reasoning and agentic workflows.

0 favorites 0 likes

#huggingface

Unsloth Minimax M3 GGUF

Reddit r/LocalLLaMA ↗ · 2026-06-12

Unsloth is uploading a GGUF quantized version of the MiniMax M3 model to Hugging Face.

0 favorites 0 likes

#huggingface

[NEW MODEL] Supra-Title-0.3B Just released!

Reddit r/LocalLLaMA ↗ · 2026-06-12

Supra Labs released Supra Title, a 350M parameter model specialized for generating chat conversation titles. Built on LFM2.5, it runs on any hardware in GGUF format and requires no system prompt.

0 favorites 0 likes

#huggingface

@julien_c: This is awesome news: oMLX, by @jundotkim, now supports the standard HF cache model directory Great MLX server for Loca…

X AI KOLs Following ↗ · 2026-06-12 Cached

oMLX, a MLX server for local AI, now supports the standard Hugging Face cache model directory, simplifying model loading.

0 favorites 0 likes

#huggingface

@julien_c: Explore your @huggingface repos in a whole new way Visualize storage, discover outliers, and navigate your repos direct…

X AI KOLs Following ↗ · 2026-06-11 Cached

A new CLI tool for navigating and visualizing Hugging Face repositories, allowing users to explore storage, find outliers, and manage repos directly from the terminal.

0 favorites 0 likes

#huggingface

New models released: Nex-N2 Pro 397B and Nex-N2 Mini 35B

Reddit r/LocalLLaMA ↗ · 2026-06-11

Release of fine-tuned versions of Qwen3.5: the Nex-N2 Pro 397B and Nex-N2 Mini 35B, with strong benchmark results.

0 favorites 0 likes

#huggingface

Open Reproduction of DeepSeek-R1

Hacker News Top ↗ · 2026-06-11 Cached

Hugging Face's Open R1 project provides a fully open reproduction pipeline for DeepSeek-R1, including distilled datasets, training scripts, and evaluation tools, with the goal of enabling anyone to replicate and build on top of R1's reasoning capabilities.

0 favorites 0 likes

#huggingface

@cohere: Cohere Transcribe, our open-source speech recognition model, is #1 on the new @huggingface Far-Field ASR benchmark.

X AI KOLs Following ↗ · 2026-06-10 Cached

Cohere Transcribe, an open-source speech recognition model, achieved first place on Hugging Face's new Far-Field ASR benchmark.

0 favorites 0 likes

#huggingface

MooreThreads/MusaCoder-27B • Huggingface

Reddit r/LocalLLaMA ↗ · 2026-06-10

MooreThreads releases MusaCoder-27B, a 27-billion-parameter code generation model, accompanied by a paper on arXiv.

0 favorites 0 likes

huggingface

Submit Feedback