critic-model

Tag

Cards List
#critic-model

Critic-R: Improving Agentic Search using Instruction-tuned Retrievers with Natural Language Introspective Feedback

Hugging Face Daily Papers · 2026-05-30 Cached

Critic-R introduces a framework using a critic model to provide introspective feedback between the reasoning agent and retriever, improving agentic search performance at both inference and training time without requiring retraining the agent.

0 favorites 0 likes
#critic-model

Finding GPT-4’s mistakes with GPT-4

OpenAI Blog · 2024-06-27 Cached

OpenAI introduced CriticGPT, a GPT-4-based model designed to catch errors in ChatGPT's code output. When human trainers use CriticGPT for code review, they outperform those without assistance 60% of the time, addressing a fundamental limitation of RLHF as models become increasingly capable.

0 favorites 0 likes
← Back to home

Submit Feedback