icml-2026

#icml-2026

Off-Policy Evaluation with Strategic Agents via Local Disclosure

arXiv cs.AI ↗ · 3d ago Cached

This paper studies off-policy evaluation (OPE) when decision subjects (agents) strategically modify their covariates in response to a policy. It proposes a method that uses local disclosure via post-hoc explanations to reveal agents' pre-strategic covariates and construct a doubly robust estimator for policy value.

0 favorites 0 likes

#icml-2026

@steverab: Very excited to share that our paper "Towards a Science of AI Agent Reliability" was accepted at ICML 2026! See you in …

X AI KOLs Timeline ↗ · 6d ago Cached

A paper analyzing AI agent reliability, accepted at ICML 2026, finds that even the latest frontier models (GPT 5.5, Gemini 3.1 Pro, Claude Opus 4.7) show only marginal reliability improvements over earlier versions, with low outcome consistency and persistent issues in agent scaffolding.

0 favorites 0 likes

#icml-2026

Derivative Informed Learning of Exchange-Correlation Functionals

arXiv cs.LG ↗ · 2026-06-04 Cached

This ICML 2026 paper introduces Derivative Informed XC-Loss (DI-Loss), a training approach for machine-learned exchange-correlation functionals that incorporates first and second derivative supervision on the Grassmannian of density matrices. Across four architectures, DI-Loss reduces total-energy MAE by 66% compared to energy and density supervision alone, and improves excited-state predictions in TDDFT calculations.

0 favorites 0 likes

#icml-2026

RT-Lynx: Putting the GEMM Sparsity In a Right Way for Diffusion Models

Hugging Face Daily Papers ↗ · 2026-05-26 Cached

RT-Lynx proposes using activation sparsity instead of weight sparsity to accelerate diffusion models, achieving up to 1.55× linear-layer speedup while maintaining generation quality, and is accepted at ICML 2026.

0 favorites 0 likes

#icml-2026

@elliotchen100: Translate the work on MiroMind under Shanda. The next step of post-training might be scientific discovery itself. Simply put, it trains a model to propose research hypotheses across different disciplines. Physics, chemistry, and biology all use one method. The paper was accepted at ICML 2026, code open source...

X AI KOLs Timeline ↗ · 2026-05-19 Cached

This paper proposes a scalable supervised fine-tuning method for training language models to propose research hypotheses across disciplines. It has been accepted by ICML 2026 and the code is open source.

0 favorites 0 likes

#icml-2026

MOOSE-Star (ICML 2026): 7B model + 108K-paper dataset for scientific hypothesis discovery

Reddit r/LocalLLaMA ↗ · 2026-05-14

MOOSE-Star presents a 7B model fine-tuned from DeepSeek-R1-Distill-Qwen-7B for scientific hypothesis discovery, along with a dataset of 108K NCBI papers. The model achieves state-of-the-art inspiration retrieval accuracy, outperforming larger models like GPT-5.4 and Gemini-3 Pro.

0 favorites 0 likes

#icml-2026

@JulieKallini: Fast Byte Latent Transformer is accepted to ICML 2026! Byte-level LMs promise to free us from subword tokenizers, but d…

X AI KOLs Following ↗ · 2026-05-11 Cached

The Fast Byte Latent Transformer (BLT-D) has been accepted to ICML 2026, introducing a text diffusion method for parallel byte-level decoding to overcome the speed limitations of traditional byte-level language models.

0 favorites 0 likes

icml-2026

Off-Policy Evaluation with Strategic Agents via Local Disclosure

@steverab: Very excited to share that our paper "Towards a Science of AI Agent Reliability" was accepted at ICML 2026! See you in …

Derivative Informed Learning of Exchange-Correlation Functionals

RT-Lynx: Putting the GEMM Sparsity In a Right Way for Diffusion Models

MOOSE-Star (ICML 2026): 7B model + 108K-paper dataset for scientific hypothesis discovery

@JulieKallini: Fast Byte Latent Transformer is accepted to ICML 2026! Byte-level LMs promise to free us from subword tokenizers, but d…

Submit Feedback