Tag
Proposes ReTAMamba, a method using reliability-aware temporal aggregation with Mamba for irregular clinical time series prediction, achieving significant AUPRC gains on MIMIC-IV, eICU, and PhysioNet 2012.
This paper proposes a two-stage sampling design where LLM evaluations are used to augment, rather than replace, human ratings, and provides guidance on determining sample sizes for human and LLM reviews using a doubly robust estimator from missing data literature.
This paper identifies the problem of missing observations in inverse reinforcement learning (IRL) that can make expert actions appear suboptimal, and develops a practical algorithm to quantify the minimal perturbations needed for expert actions to appear optimal, validated on synthetic tasks, cancer treatment simulation, and ICU data.