Tag
A startup called Springboards has built Flint, an LLM trained to produce more diverse responses to overcome the groupthink problem in mainstream chatbots.
This paper introduces approach-level diversity for LLM math reasoning, showing that surface-level diversity metrics are unreliable proxies and that directly optimizing for approach diversity remains an open problem.
Proposes a multi-objective reinforcement learning framework combining semantic embeddings with Pareto-DQN to balance engagement, diversity, and fairness in recommendations, mitigating filter bubbles.
This paper introduces Spokes, a probabilistic diversification framework using the G-Vendi score to optimize diversity in pretraining data selection, achieving significant improvements in downstream task performance on FineWeb and DCLM by jointly optimizing quality and diversity.
This paper introduces a domain-agnostic multi-layered framework for unsupervised extraction of perspectives to evaluate pluralism in LLM-generated text, finding that rare perspectives are disproportionately underrepresented.
This paper introduces DiRL, a direction-aware reinforcement learning framework that distinguishes reasoning-driven diversity from memorization-driven diversity in LLM exploration. It extracts an internal reasoning-memorization direction from model representations and shapes rewards to prioritize reasoning-aligned exploration, showing improvements on math and general reasoning benchmarks.
This paper proposes an evolutionary framework inspired by parallel tempering that uses multi-temperature sampling and information exchange to improve the diversity and quality of scientific hypotheses generated by large language models, demonstrated across molecular, equation, and algorithm discovery.
This paper proposes MADS, a method that leverages neural activation states from LLMs to select diverse core sets for instruction tuning, showing that a 15% subset can outperform full-dataset fine-tuning on multiple benchmarks.
Palantir announces a Neurodivergent Fellowship targeting software developers, with compensation ranging from $110,000 to $200,000 per year.
This paper introduces Vector Policy Optimization (VPO), a reinforcement learning algorithm that trains LLMs to produce diverse solutions by optimizing across multiple reward dimensions, significantly improving test-time search performance compared to scalar RL baselines.
Introduces StructuredSemanticSearch, a model search framework that combines semantic similarity with structured table discovery to improve diversity and coverage of recommended models, evaluated on a benchmark of 597 queries.
This paper proposes ensemble monitoring for AI control, combining diverse monitors to improve detection of misaligned actions. Experiments show that diverse ensembles outperform homogeneous ones and that fine-tuned monitors add unique detection capabilities.
The author trained Qwen3.5 to jailbreak itself with reinforcement learning, using diversity rewards to surface multiple attack strategies, then improved the defender's robustness from 64% to 92% defense rate with a slight drop in benign accuracy.
This paper introduces a quantitative notion of diversity of extensions in abstract argumentation based on symmetric difference, and provides a systematic complexity classification for related reasoning tasks.
This paper introduces a validity-diversity framework attributing diversity collapse in LLMs to order and shape miscalibration during decoding, validated across 14 language models.
This paper proposes mid-training language models on self-generated diverse reasoning traces before reinforcement learning, showing improved RL performance on math benchmarks by exposing models to multiple valid solution approaches.
OpenAI Scholars 2020 program concluded with final projects investigating GPT-2 grammar representation, model interpretability, and medical applications like epileptic seizure prediction. The program provides stipends and mentorship to underrepresented groups in machine learning.
OpenAI announces the 2020 Scholars program, a remote fellowship for underrepresented groups in AI and engineering with 2+ years of software engineering experience. Applications are open to eligible candidates in US timezones with work authorization.
OpenAI Scholars 2019 is now accepting applications for a remote program targeting underrepresented groups in science and engineering. The 3-month program is open to people with US work authorization in US timezones who have basic programming skills and foundational knowledge of calculus and linear algebra.
OpenAI announces the completion of its first Scholars cohort program, with eight participants completing final projects and a Demo Day scheduled for September 20th to showcase their work and discuss their ML futures.