HuggingFace

Articles from HuggingFace

Cards List

Build real agentic apps using CUGA: two dozen working examples on a lightweight harness

Hugging Face Blog · 8h ago Cached

IBM introduces CUGA, an open-source agent harness that handles plumbing for state, tool calls, and orchestration, allowing developers to focus on defining tools and prompts. The article showcases two dozen single-file example apps built with CUGA, demonstrating how it eliminates repetitive framework setup.

0 favorites 0 likes

Experimenting with the proposed Cross-Origin Storage API in Transformers.js

Hugging Face Blog · 21h ago Cached

This guest post explores the proposed Cross-Origin Storage API to improve caching of AI model resources in Transformers.js, enabling efficient reuse across origins while maintaining privacy and integrity for in-browser inference.

0 favorites 0 likes

Shipping huggingface_hub every week with AI, open tools, and a human in the loop

Hugging Face Blog · 21h ago Cached

Hugging Face describes how they built a weekly release pipeline for their huggingface_hub library using AI, open-source tools, and human oversight, enabling faster and more reliable releases.

0 favorites 0 likes

PP-OCRv6 on Hugging Face: 50-Language OCR from 1.5M to 34.5M Parameters

Hugging Face Blog · yesterday Cached

PP-OCRv6 is the latest generation of PaddleOCR's universal OCR model family, offering three tiers from 1.5M to 34.5M parameters, supporting 50 languages, and achieving significant accuracy improvements over previous versions.

0 favorites 0 likes

ShotcreteDepth: A Bi-modal Dataset for Robust Robotic Depth Perception in Shotcrete Construction Environments

Hugging Face Daily Papers · yesterday Cached

ShotcreteDepth is a bi-modal dataset of stereo RGB and LiDAR data from construction environments, designed to support research in depth perception under challenging conditions. The dataset includes 11,252 samples with 220 annotated, and is accompanied by a lightweight annotation tool.

0 favorites 0 likes

TROPT: An Open Framework for Unifying and Advancing Discrete Text Optimization

Hugging Face Daily Papers · yesterday Cached

TROPT is an open-source framework that unifies discrete text-trigger optimization, standardizing development and execution across domains like LLM jailbreaking and model interpretability. It includes over 15 optimizers and 30 recipes, lowering barriers for adoption and advancement.

0 favorites 0 likes

Capable but Careless: Do Computer-Use Agents Follow Contextual Integrity?

Hugging Face Daily Papers · yesterday Cached

This paper introduces AgentCIBench, a benchmark to evaluate privacy risks in computer-use agents, finding that 11 of 15 frontier agents leak information in over 50% of scenarios.

0 favorites 0 likes

Vera: A Layered Diffusion Model for Content-Preserving Video Editing

Hugging Face Daily Papers · yesterday Cached

Vera is a layered diffusion model for video editing that preserves source content by generating edit layers and alpha mattes, using a Mixture-of-Transformers architecture.

0 favorites 0 likes

When Agents Commit Too Soon: Diagnosing Premature Commitment in LLM Agents

Hugging Face Daily Papers · yesterday Cached

This paper introduces representational commitment, a cross-run hidden-state convergence that diagnoses when an LLM agent has locked onto a trajectory prematurely. It shows that commitment predicts trajectory consistency but not correctness, and proposes monitoring to detect when an agent is confidently settled rather than assuming consistency equals trust.

0 favorites 0 likes

Arbor: Explicit Geometric Conditioning for Controllable 3D Asset Generation

Hugging Face Daily Papers · yesterday Cached

Arbor introduces explicit geometric control for 3D asset generation by using constraint meshes (hull, avoidance, touch regions) to condition latent generation, improving spatial constraint adherence without sacrificing object quality.

0 favorites 0 likes

HAKARI-Bench: A Lightweight Benchmark for Comparing Retrieval Architectures and Efficiency Settings under Unified Conditions

Hugging Face Daily Papers · yesterday Cached

HAKARI-Bench is a lightweight benchmark for comparing retrieval methods across multiple configurations and languages, enabling efficient model selection and performance analysis. It reproduces full benchmarks like MTEB at high correlation while being faster to run.

0 favorites 0 likes

MeshFlow: Mesh Generation with Equivariant Flow Matching

Hugging Face Daily Papers · yesterday Cached

MeshFlow introduces an equivariant optimal-transport flow matching model for direct triangle mesh generation, achieving state-of-the-art quality while providing approximately 18x inference speedup over autoregressive methods.

0 favorites 0 likes

Foresight: Failure Detection for Long-Horizon Robotic Manipulation with Action-Conditioned World Model Latents

Hugging Face Daily Papers · yesterday Cached

Foresight is a failure detection framework for long-horizon robotic manipulation that uses action-conditioned world model latents and functional conformal prediction to monitor trajectories, trained only with final task labels. It demonstrates state-of-the-art performance across simulation and real robot tasks.

0 favorites 0 likes

We got local models to triage the OpenClaw repo for FREE!*

Hugging Face Blog · yesterday Cached

The blog post describes using local open-weight models like Gemma and Qwen in an agent harness to automatically triage issues and pull requests in the OpenClaw repository, enabling real-time notifications without relying on costly closed API models.

0 favorites 0 likes

KaLM-Reranker-V1: Fast but Not Late Interaction for Compressed Document Reranking

Hugging Face Daily Papers · yesterday Cached

KaLM-Reranker-V1 is a fast reranker that decouples query and passage computation using an encoder-decoder architecture with Matryoshka embedding pooling and cross-attention, achieving state-of-the-art reranking performance on BEIR and competitive results on multilingual benchmarks.

0 favorites 0 likes

Safe Few-Step Generation via Velocity Editing

Hugging Face Daily Papers · yesterday Cached

VESFlow is a training-free safety method for flow matching-based text-to-image generation that edits velocity fields to ensure safe output while maintaining prompt integrity.

0 favorites 0 likes

EnterpriseClawBench: Benchmarking Agents from Real Workplace Sessions

Hugging Face Daily Papers · yesterday Cached

EnterpriseClawBench presents a benchmark for enterprise agents based on real-world workplace sessions, offering 852 reproducible tasks and comprehensive evaluation metrics beyond single performance scores.

0 favorites 0 likes

Causal Discovery in the Era of Agents

Hugging Face Daily Papers · yesterday Cached

This paper argues that language model agents should assist causal discovery workflows by providing contextual support and explanations rather than generating causal conclusions, and introduces causal-learn+ platform to demonstrate this principle.

0 favorites 0 likes

Dense Reward for Multi-View 3D Reasoning with Global Maps and Local Views

Hugging Face Daily Papers · yesterday Cached

DR-MV3D presents a map-grounded learning framework with dense rewards to improve multi-view 3D visual question answering through global map construction, view-trajectory planning, and egocentric grounding.

0 favorites 0 likes

Self-Compacting Language Model Agents

Hugging Face Daily Papers · yesterday Cached

SelfCompact is a scaffolding approach that lets language models autonomously decide when and how to compact long agent traces, achieving better performance with reduced token costs compared to fixed-interval methods.

0 favorites 0 likes
Next →
← Back to home

Submit Feedback