curriculum-learning

#curriculum-learning

Manifold Bandits: Bayesian Curriculum Learning over the Latent Geometry of Large Language Models

Hugging Face Daily Papers · 6d ago Cached

Introduces Bayesian Manifold Curriculum (BMC), an adaptive curriculum learning method for LLMs that leverages the model's latent geometry to allocate training effort across diverse problem types, improving efficiency beyond traditional difficulty-based curricula.

Pythagoras-Prover: Advancing Efficient Formal Proving via Augmented Lean Formalisation

arXiv cs.AI · 2026-06-12 Cached

Pythagoras-Prover is a compute-efficient family of Lean theorem provers that achieves strong performance using curriculum supervised fine-tuning and a novel Augmented Lean Formalisation technique. The 4B model surpasses DeepSeek-Prover-V2-671B at pass@32 on MiniF2F-Test, and the 32B model sets a new state-of-the-art among open-source provers.
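
For readers unfamiliar with the pass@32 figure quoted above: pass@k is commonly reported using the unbiased estimator 1 - C(n-c, k) / C(n, k), where n proof attempts are sampled per problem and c of them succeed. The sketch below is a minimal illustration of that general estimator. Whether Pythagoras-Prover computes its numbers this way is an assumption, not something the summary states, and the function name pass_at_k is hypothetical.

```python
from math import comb


def pass_at_k(n: int, c: int, k: int) -> float:
    """Unbiased pass@k estimate for one problem.

    n: total samples drawn, c: samples that were correct, k: attempt budget.
    Illustrative helper only; not taken from the paper.
    """
    if not (0 <= c <= n and 1 <= k <= n):
        raise ValueError("require 0 <= c <= n and 1 <= k <= n")
    # With fewer than k incorrect samples, every size-k subset
    # must contain at least one correct sample.
    if n - c < k:
        return 1.0
    # Probability that a random size-k subset contains no correct sample,
    # subtracted from 1.
    return 1.0 - comb(n - c, k) / comb(n, k)


if __name__ == "__main__":
    # Example: 64 sampled proofs, 3 verified, budget of 32 attempts.
    print(pass_at_k(n=64, c=3, k=32))
```

A benchmark-level score would then be the mean of this value over all problems in the test set.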

Representation Curriculum: Stagewise Training for Robust Ranking and Allocation

arXiv cs.LG · 2026-06-10 Cached

This paper proposes Representation Curriculum (RC), a training-time intervention that stages feature utilization to reduce over-reliance on exposure-confounded historical signals and improve cold-start generalization in ranking systems. The method is theoretically analyzed and validated on public benchmarks and large-scale eBay search experiments.

The Order Matters: Sequential Fine-Tuning of LLaMA for Coherent Automated Essay Scoring

arXiv cs.CL · 2026-06-10 Cached

This paper investigates sequential fine-tuning of LLaMA-3.1-8B for automated essay scoring using a curriculum aligned with discourse structure, showing improved coherence and performance compared to independent or randomized training.

Agentic Transformers Provably Learn to Search via Reinforcement Learning

arXiv cs.LG · 2026-06-02 Cached

This paper theoretically studies how transformer-based policies acquire search capabilities from reinforcement learning training dynamics in a stochastic tree environment. It shows that a two-head transformer can implement depth-first search and that this mechanism emerges naturally from sparse reward signals under a depth-wise curriculum.

Tailoring the Curriculum: Student-Centered Reasoning Distillation via Dynamic Data-Model Compatibility

arXiv cs.AI · 2026-05-29 Cached

Introduces the Data-Model Compatibility (DMC) metric to evaluate how well a reasoning dataset aligns with a student model during distillation. Experiments show DMC strongly correlates with distillation performance and that dynamically selecting datasets based on DMC further improves reasoning capabilities.

Micro-Macro Retrieval: Reducing Long-Form Hallucination in Large Language Models

arXiv cs.CL · 2026-05-29 Cached

This paper introduces Micro-Macro Retrieval (M2R), a retrieve-while-generate framework that reduces hallucination in long-form LLM outputs by ensuring key information stays close to generated text. It uses curriculum learning-based reinforcement learning to train retrieval and grounding skills, showing effectiveness especially in lengthy contexts.

Curriculum Learning for Safety Alignment

arXiv cs.LG · 2026-05-27 Cached

This paper proposes Staged-Competence, a curriculum learning framework for DPO-based safety alignment that organizes preference data by difficulty, improving robustness and data efficiency while preserving general capabilities.

Show, Don't TELL: Explainable AI-Generated Text Detection

Hugging Face Daily Papers · 2026-05-27 Cached

Introduces TELL, an AI-generated text detection system that provides explainable annotations alongside numerical scores, achieving a competitive AUROC of 0.927 while letting users judge authorship from highlighted textual indicators.

What Training Data Teaches RL Memory Agents: An Empirical Study of Curriculum Effects in Memory-Augmented QA

arXiv cs.CL · 2026-05-25 Cached

This paper empirically studies how the composition of training data (curriculum) affects the skills learned by RL-based memory agents in multi-session question answering. It finds that curriculum composition acts as a fine-grained lever on specialization, with mixed benchmarks yielding the best overall performance and narrow out-of-domain sets transferring targeted temporal reasoning skills.

Leveraging Self-Paced Curriculum Learning for Enhanced Modality Balance in Multimodal Conversational Emotion Recognition

arXiv cs.LG · 2026-05-22 Cached

This paper proposes a plug-and-play module using self-paced curriculum learning to enhance modality balance in multimodal conversational emotion recognition, achieving consistent F1-score improvements on IEMOCAP and MELD datasets.

I created an LLM post-training method called RPS. Preliminary results show that it improved Qwen3-8b's program synthesis reliability. [R]

Reddit r/MachineLearning · 2026-05-21

RPS is a two-stage LLM post-training method inspired by neuroscience, combining curriculum learning with learning rate decay. Preliminary results show improved program synthesis reliability on Qwen3-8b compared to equal learning rate training.

From Reasoning Chains to Verifiable Subproblems: Curriculum Reinforcement Learning Enables Credit Assignment for LLM Reasoning

Hugging Face Daily Papers · 2026-05-21 Cached

SCRL is a curriculum reinforcement learning framework that uses subproblem-level normalization and curriculum learning to improve credit assignment in LLM reasoning, outperforming baselines on mathematical reasoning benchmarks.

PROWL: Prioritized Regret-Driven Optimization for World Model Learning

arXiv cs.LG · 2026-05-20 Cached

Introduces PROWL, a prioritized regret-driven optimization framework that uses an adversarial curriculum to improve diffusion-based world model robustness by focusing on high-error trajectories, achieving better performance on out-of-distribution scenarios in MineRL.

From Seeing to Thinking: Decoupling Perception and Reasoning Improves Post-Training of Vision-Language Models

Hugging Face Daily Papers · 2026-05-19 Cached

This paper proposes a staged training approach for vision-language models that separates visual perception, visual reasoning, and textual reasoning into distinct stages. The method improves visual reasoning accuracy while reducing reasoning trace length, demonstrating that stronger perception reduces the need for excessive reasoning.

Curriculum Learning of Physics-Informed Neural Networks based on Spatial Correlation

arXiv cs.LG · 2026-05-18 Cached

This paper proposes a spatially correlated curriculum learning framework for Physics-Informed Neural Networks (PINNs) that improves training stability and solution accuracy by leveraging spatial correlations among subregions, addressing issues like high-dimensional non-convex loss landscapes and imbalanced multi-objective constraints.

VectraYX-Nano: A 42M-Parameter Spanish Cybersecurity Language Model with Curriculum Learning and Native Tool Use

arXiv cs.CL · 2026-05-15 Cached

Presents VectraYX-Nano, a 42M-parameter decoder-only language model trained from scratch in Spanish for cybersecurity, featuring curriculum learning, native tool invocation via MCP, and a 170M-token corpus. Empirical findings reveal a loss-versus-register inversion and corpus-density artifacts for tool-use capability.

@stingning: We’re releasing a 30B-A3B reasoning model that reaches gold-medal level across both physics and math Olympiad evaluatio…

X AI KOLs Timeline · 2026-05-15 Cached

Researchers release SU-01, a 30B-A3B reasoning model achieving gold-medal-level performance on physics and math Olympiad problems using a unified scaling recipe for proof search.

Avoiding Overthinking and Underthinking: Curriculum-Aware Budget Scheduling for LLMs

arXiv cs.CL · 2026-04-23 Cached

BACR introduces adaptive token budgeting and curriculum-aware scheduling to prevent LLMs from overthinking easy problems and underthinking hard ones, cutting token use by 34% while boosting accuracy by up to 8.3%.

LiFT: Does Instruction Fine-Tuning Improve In-Context Learning for Longitudinal Modelling by Large Language Models?

arXiv cs.CL · 2026-04-21 Cached

LiFT is a longitudinal instruction fine-tuning framework that unifies diverse temporal NLP tasks under a shared instruction schema with curriculum-based training. Evaluated across OLMo, LLaMA, and Qwen models, LiFT consistently outperforms base-model in-context learning, especially on out-of-distribution data and rare change events.
