supervision

Tag

Cards List
#supervision

Running agents all day, I keep noticing the bottleneck is me defining "good", not the model

Reddit r/AI_Agents · 2026-06-10

The author reflects that the primary bottleneck in running AI agents is not the model's capability but the human's ability to precisely define what 'good' or 'done' means, drawing parallels to managing people.

0 favorites 0 likes
#supervision

From Sampled Outcomes to Capability Distributions: Rethinking Supervision for LLM Routing

arXiv cs.LG · 2026-06-08 Cached

This paper proposes DARS, a framework that constructs routing supervision from a distributional view of model behavior to address the unreliability of single-shot labels in LLM routing.

0 favorites 0 likes
#supervision

Trust Functions: Near-Lossless Weak-to-Strong Generalization by Learning When to Trust the Weak Teacher

Hugging Face Daily Papers · 2026-05-31 Cached

Trust functions enable near-lossless weak-to-strong generalization by identifying reliable weak labels for training, achieving performance comparable to ground-truth supervision across multiple domains.

0 favorites 0 likes
#supervision

@_akhaliq: GEM Generative Supervision Helps Embodied Intelligence

X AI KOLs Following · 2026-05-28 Cached

GEM introduces a generative supervision method to improve embodied intelligence by leveraging generative models for training.

0 favorites 0 likes
#supervision

Anyone else feel like AI agents are amazing right up until things get complicated?

Reddit r/AI_Agents · 2026-05-20

A reflection on the gap between impressive AI agent demos and dependable real-world execution, arguing that current agents excel at structured tasks but fail under unpredictable conditions, suggesting near-term AI roles will focus on narrow automation with human oversight.

0 favorites 0 likes
← Back to home

Submit Feedback