iclr

Tag

Cards List
#iclr

Fair Reinforcement Learning

Reddit r/AI_Agents · 2026-06-02

Fair Reinforcement Learning introduces Democratic Alignment to incorporate multiple competing value sets from different agents, overcoming traditional RLHF limitations, and achieves orders of magnitude faster optimization via a black-box policy wrapper.

0 favorites 0 likes
#iclr

ICLR 2026 – Institutional Affiliations Dataset and Analysis

Hacker News Top · 2026-05-14 Cached

This article presents a dataset and analysis pipeline for ICLR 2026 accepted papers, extracting institutional affiliations from PDF title blocks to create a clean dataset and publication-ready treemap visualizations.

0 favorites 0 likes
#iclr

Teaching AI models to say “I’m not sure”

MIT News — Artificial Intelligence · 2026-04-22 Cached

MIT CSAIL researchers introduce RLCR, a method using Brier scores in reinforcement learning to train AI models to output calibrated confidence estimates, significantly reducing overconfidence without sacrificing accuracy.

0 favorites 0 likes
#iclr

Struggling to form a group at ICLR. How do you actually break into conversations?[D]

Reddit r/MachineLearning · 2026-04-22

A PhD student at ICLR seeks practical tactics to overcome social anxiety and break into existing conversation groups without generic confidence advice.

0 favorites 0 likes
#iclr

@sherryyangML: Machine learning engineering (MLE) is the new agentic frontier. I'll be sharing our work on scaling RL for MLE agents a…

X AI KOLs Following · 2026-04-21

Two ICLR 2026 papers show how small RL-trained agents outperform frontier models on machine-learning engineering tasks and how MLE-Smith automatically scales MLE workloads.

0 favorites 0 likes
#iclr

AutoFigure: Generating and Refining Publication-Ready Scientific Illustrations

Papers with Code Trending · 2026-02-03 Cached

AutoFigure is an open-source system for generating and refining editable, publication-ready scientific diagrams, accepted to ICLR 2026.

0 favorites 0 likes
← Back to home

Submit Feedback