huggingface

Tag

Cards List
#huggingface

Krea 2 released on Hugging Face

Reddit r/LocalLLaMA · 5h ago Cached

Krea 2 is a 12-billion parameter text-to-image diffusion model released open-weight on Hugging Face, with Raw (base) and Turbo (post-trained) checkpoints available.

0 favorites 0 likes
#huggingface

@vanstriendaniel: It's raining OCR models again! @Baidu_Inc's Unlimited-OCR is one of the more interesting. You can try it without much e…

X AI KOLs Following · 8h ago Cached

This post shows how to serve Baidu's Unlimited-OCR model as a temporary, OpenAI-compatible endpoint on Hugging Face Jobs, enabling multi-page document parsing with features like table-to-HTML and equation-to-LaTeX extraction.

0 favorites 0 likes
#huggingface

MiniMax-M3-EAGLE3-GGUF - Llama.cpp compatible MiniMax M3 EAGLE draft model!

Reddit r/LocalLLaMA · 17h ago

A GGUF conversion of MiniMax M3's EAGLE draft model for llama.cpp is now available, enabling speculative decoding speedups on compatible hardware.

0 favorites 0 likes
#huggingface

Shipping huggingface_hub every week with AI, open tools, and a human in the loop

Hugging Face Blog · 21h ago Cached

Hugging Face describes how they built a weekly release pipeline for their huggingface_hub library using AI, open-source tools, and human oversight, enabling faster and more reliable releases.

0 favorites 0 likes
#huggingface

We got local models to triage the OpenClaw repo for FREE!*

Hugging Face Blog · yesterday Cached

The blog post describes using local open-weight models like Gemma and Qwen in an agent harness to automatically triage issues and pull requests in the OpenClaw repository, enabling real-time notifications without relying on costly closed API models.

0 favorites 0 likes
#huggingface

@LottoLabs: If you’ve been thinking about training models or like the idea but don’t know where to start This is one of the best re…

X AI KOLs Following · 2d ago Cached

A tweet recommending 'The Smol Training Playbook' on Hugging Face, a resource that demystifies model training for beginners.

0 favorites 0 likes
#huggingface

huihui-ai/Huihui-gemma-4-12B-coder-fable5-composer2.5-v1-abliterated

Hugging Face Models Trending · 2d ago Cached

An uncensored version of the gemma-4-12B-coder model created using abliteration to remove refusals, intended for research and experimental use.

0 favorites 0 likes
#huggingface

@liquidai: Storing too many tools in your context window increases latency and can lead to wrong tool selection. In this demo, we …

X AI KOLs Following · 4d ago Cached

Liquid AI demonstrates using LFM2.5-ColBERT-350M as a filter to select only the five most relevant tools from 151 options, reducing latency and improving tool selection accuracy.

0 favorites 0 likes
#huggingface

@Thom_Wolf: To all the newcomers excited to try Opus 4.8-level models at home: welcome to OpenWeightLand! Things work a little diff…

X AI KOLs Following · 4d ago Cached

GLM-5.2 is an open-source long-context model with a solid 1M-token context, strong coding capabilities, and an MIT license, now available on Hugging Face.

0 favorites 0 likes
#huggingface

@NielsRogge: Here's how to use Claude Code with GLM-5.2 via @huggingface Inference Providers: 1. Create a token at https://huggingfa…

X AI KOLs Following · 4d ago Cached

Learn how to use Claude Code with GLM-5.2 via Hugging Face Inference Providers. GLM-5.2 is free for 6 hours on several providers like Together AI, Fireworks, and DeepInfra.

0 favorites 0 likes
#huggingface

@DJLougen: This is a huge commitment, thank you as always to @huggingface and co for their commitment to local opportunities

X AI KOLs Following · 4d ago Cached

GLM-5.2 is now free to use with Hugging Face Inference Providers for the next 6 hours, supporting open-source AI.

0 favorites 0 likes
#huggingface

baidu/Unlimited-OCR

Hugging Face Models Trending · 4d ago Cached

Baidu releases Unlimited-OCR, a new model for one-shot long-horizon document parsing, building on Deepseek-OCR. It supports single image and multi-page/PDF parsing via Hugging Face Transformers and SGLang.

0 favorites 0 likes
#huggingface

poolside/Laguna-M.1 · Hugging Face - 225B-A23B

Reddit r/LocalLLaMA · 5d ago Cached

Poolside releases Laguna M.1, a 225B parameter Mixture-of-Experts model with 23B activated parameters per token, designed for agentic coding and long-horizon tasks. It achieves competitive results on SWE-bench benchmarks and is released under an Apache 2.0 license.

0 favorites 0 likes
#huggingface

@neural_avb: Btw Joel is the author of the great huggingface article on the Synthetic Data Playbook. It's a marathon survey everyone…

X AI KOLs Timeline · 5d ago Cached

A tweet highlighting Joël Niklaus's HuggingFace article on the Synthetic Data Playbook, which inspired the text-albumentations library.

0 favorites 0 likes
#huggingface

@Wauplin: `hf upload` got a full rewrite! Single-pass hashing, multi commits, resumable uploads. Same CLI, way faster, way cleane…

X AI KOLs Following · 5d ago Cached

The Hugging Face CLI's `hf upload` command has been fully rewritten with single-pass hashing, multi-commits, and resumable uploads, available in version 1.20.0 for improved speed and cleanliness.

0 favorites 0 likes
#huggingface

Beyond LoRA: Can you beat the most popular fine-tuning technique?

Hugging Face Blog · 5d ago Cached

Explores whether LoRA is the best parameter-efficient fine-tuning technique and introduces the PEFT library's tools to compare methods.

0 favorites 0 likes
#huggingface

@Akashi203: i open-sourced automegakernel -- compiles any huggingface model into a single persistent megakernel batch-1 decode is b…

X AI KOLs Timeline · 5d ago Cached

AutoMegaKernel is an open-source agent harness that compiles any HuggingFace model into a single persistent megakernel, fusing the entire forward pass into one GPU launch to reduce overhead. It achieves up to 1.33x speedup over CUDA-graphed cuBLAS on inference-class GPUs like L4 and L40S, while proving schedules deadlock- and race-free.

0 favorites 0 likes
#huggingface

@aisearchio: GLM 5.2 GGUF is already here! 8-bit is ~half the size of the full model. Smaller versions coming soon https://huggingfa…

X AI KOLs Timeline · 6d ago Cached

GLM 5.2 GGUF quantized model is released, with 8-bit version half the size of the full model; smaller versions are coming soon.

0 favorites 0 likes
#huggingface

Kimi K2.7 Code: 1T MoE, $0.95/M tokens, MIT license, beats Opus 4.8 on MCP tool-calling

Reddit r/AI_Agents · 6d ago

Moonshot AI 发布了专注于编程的开放式权重模型 Kimi K2.7 Code,拥有1万亿参数和384个专家,性能在MCP工具调用上超越Opus 4.8,成本仅为十分之一。

0 favorites 0 likes
#huggingface

GLM 5.2 API is live, weights are on HF, and ollama has it already

Reddit r/LocalLLaMA · 2026-06-16

GLM 5.2 has been released with open weights under MIT license on HuggingFace, available via API and Ollama, featuring competitive benchmarks that trail Opus 4.8 by a point and edge GPT-5.5 by one.

0 favorites 0 likes
Next →
← Back to home

Submit Feedback