specificity

Tag

Cards List
#specificity

Memora: A Harmonic Memory Representation Balancing Abstraction and Specificity

Hacker News Top · 3d ago Cached

Memora is a scalable memory system for AI agents that decouples storage from retrieval, achieving state-of-the-art performance on long-horizon tasks while using up to 98% fewer tokens. The research is published at ICML 2026.

0 favorites 0 likes
#specificity

Discretizing Reward Models

Hugging Face Daily Papers · 2026-06-19 Cached

This paper identifies oversensitivity in continuous reward models for reinforcement learning, where equally good responses receive different scores, and proposes a discretization technique using Monte Carlo dropout to reduce this oversensitivity while maintaining discriminative ability, leading to better policies and less reward hacking.

0 favorites 0 likes
← Back to home

Submit Feedback