meta-learning

Tag

Cards List
#meta-learning

@jaseweston: Claim: Autoresearch that moves the frontier will be about better data: we call that *Autodata*. 1/6 -- Paper is out! ht…

X AI KOLs Timeline · yesterday Cached

Introduces Autodata, a method where AI agents act as data scientists to create high-quality synthetic training data, showing gains on computer science, legal, and math reasoning tasks over classical methods.

0 favorites 0 likes
#meta-learning

Learning Dynamical Systems from Multiple Sparse Datasets: A Hierarchical Bayesian Modeling Approach

arXiv cs.LG · yesterday Cached

Proposes a hierarchical Bayesian framework for meta-learning in dynamical systems from multiple sparse, noisy datasets, using gradient-based MCMC with an embedded ODE solver for efficient posterior inference of shared and dataset-specific parameters.

0 favorites 0 likes
#meta-learning

Exploring Dualistic Meta-Learning to Enhance Domain Generalization in Open Set Scenarios

arXiv cs.LG · 2d ago Cached

Proposes a novel meta-learning strategy called MEDIC for open set domain generalization, which uses implicit gradient matching across domain and class splits to achieve better boundaries. Experiments show state-of-the-art performance.

0 favorites 0 likes
#meta-learning

Connect the Dots: Training LLMs for Long-Lifecycle Agents with Cross-Domain Generalization Via Reinforcement Learning

Hugging Face Daily Papers · 2026-06-18 Cached

This paper presents Connect the Dots (CoD), a framework for training LLMs via reinforcement learning to develop meta-capabilities for long-lifecycle agents, enabling continuous learning and cross-domain generalization.

0 favorites 0 likes
#meta-learning

Retrievable Gradients: Continual Post-Training Without Cumulative Weight Drift

arXiv cs.CL · 2026-06-16 Cached

Proposes ReGrad, a paradigm that treats gradients as retrievable units of knowledge for continual post-training, avoiding cumulative weight drift by storing document-specific gradients in a Gradient Bank and retrieving query-relevant gradients for temporary weight adaptation.

0 favorites 0 likes
#meta-learning

Fodor and Pylyshyn's Systematicity Challenge Still Stands

arXiv cs.CL · 2026-06-15 Cached

This paper argues that recent claims that neural networks have solved Fodor and Pylyshyn's systematicity challenge are premature. The authors show that the meta-learning for compositionality model fails to generalize out-of-distribution and behaves unsystematically even on in-distribution problems, concluding the challenge remains unmet.

0 favorites 0 likes
#meta-learning

Robotic Policy Adaptation via Weight-Space Meta-Learning

Hugging Face Daily Papers · 2026-06-05 Cached

Introduces WIZARD, a weight-space meta-learning framework that generates task-specific LoRA parameters for frozen VLA policies from language instructions and demonstration videos, enabling efficient task adaptation without fine-tuning.

0 favorites 0 likes
#meta-learning

When Offline Selectors Cannot Beat the Best Single Model: A Diagnostic Study on edX Dropout Prediction

arXiv cs.LG · 2026-06-04 Cached

This paper proposes a three-stage diagnostic framework to identify why offline model selectors fail to beat the best single model, applying it to dropout prediction on edX clickstream data. The study finds that the bottleneck is local representational ambiguity rather than learner choice or distribution shift, recommending state redesign or new data collection over further algorithm tuning.

0 favorites 0 likes
#meta-learning

SePO: Self-Evolving Prompt Agent for System Prompt Optimization

arXiv cs.CL · 2026-06-04 Cached

SePO (Self-Evolving Prompt Optimization) proposes a self-referential prompt agent that optimizes both task agents' system prompts and its own system prompt through an evolutionary search, outperforming Manual-CoT, TextGrad, and MetaSPO across five benchmarks including AIME'25, ARC-AGI-1, and GPQA.

0 favorites 0 likes
#meta-learning

R-APS: Compositional Reasoning and In-Context Meta-Learning for Constrained Design via Reflective Adversarial Pareto Search

arXiv cs.AI · 2026-06-04 Cached

R-APS (Reflective Adversarial Pareto Search) is a novel method for constrained design tasks that addresses three structural failures in LLM-based agentic systems—error propagation, robustness evaluation, and knowledge invalidation—through reasoning-mode decomposition across three timescales, requiring no fine-tuning. Evaluated on planar mechanism synthesis, it achieves 3.5x tighter robustness certificates, 46% faster iterations-to-first-admission, and 2.1x Chamfer-distance reduction over baselines.

0 favorites 0 likes
#meta-learning

CHAM-net: A Contrastive Hierarchical Adaptive Meta-network for Robust Global Methane Flux Prediction

arXiv cs.LG · 2026-06-02 Cached

CHAM-net introduces a contrastive hierarchical adaptive meta-network that captures site-specific and cross-year dynamics for robust global methane flux prediction, outperforming baseline methods on simulation and observational datasets.

0 favorites 0 likes
#meta-learning

Model Unlearning Objectives Vary for Distinct Language Functions

arXiv cs.CL · 2026-05-27 Cached

The paper argues that unlearning in LLMs should be goal-dependent, proposing a cosine-based meta-learned variant of RMU for dangerous knowledge and a multi-layer objective with probe directions for toxicity, achieving strong results across four 7-8B models.

0 favorites 0 likes
#meta-learning

Characterizing the Representational Capacity of Neural Processes

arXiv cs.LG · 2026-05-26 Cached

This paper theoretically characterizes the representational capacity of Neural Process (NP) architectures, proving a strict hierarchy among Conditional, Attentive, Convolutional, and Transformer NPs, and showing that finite-dimensional latent variables do not expand representational capacity beyond the encoder.

0 favorites 0 likes
#meta-learning

Three Costs of Amortizing Gaussian Process Inference with Neural Processes

arXiv cs.LG · 2026-05-22 Cached

This paper decomposes the predictive KL divergence between Gaussian process and latent neural process posteriors into three terms, providing upper bounds that characterize approximation errors and connecting representation dimension to kernel smoothness.

0 favorites 0 likes
#meta-learning

SOLAR: A Self-Optimizing Open-Ended Autonomous Agent for Lifelong Learning and Continual Adaptation

arXiv cs.AI · 2026-05-22 Cached

SOLAR proposes a self-optimizing autonomous agent that leverages parameter-level meta-learning and multi-level reinforcement learning to enable lifelong adaptation of LLMs to non-stationary data streams, outperforming baselines on reasoning tasks.

0 favorites 0 likes
#meta-learning

@arcprize: Jerry Tworek on path to AGI @MillionInt sits down with @FrancoisChauba1 to talk about what's next on the path to AGI 0:…

X AI KOLs Following · 2026-05-13 Cached

Jerry Tworek and François Chollet discuss the path to AGI, covering the definition of intelligence, the role of games, and why meta-learning is the closest approach.

0 favorites 0 likes
#meta-learning

NoiseRater: Meta-Learned Noise Valuation for Diffusion Model Training

arXiv cs.LG · 2026-05-12 Cached

This paper introduces NoiseRater, a meta-learning framework that assigns importance scores to individual noise samples during diffusion model training to improve efficiency and generation quality.

0 favorites 0 likes
#meta-learning

RubricEM: Meta-RL with Rubric-guided Policy Decomposition beyond Verifiable Rewards

Hugging Face Daily Papers · 2026-05-11 Cached

This paper introduces RubricEM, a reinforcement learning framework that uses rubric-guided policy decomposition and reflection-based meta-policy evolution to train deep research agents for long-form tasks. The resulting RubricEM-8B model demonstrates strong performance on long-form research benchmarks by leveraging stage-aware planning and denser semantic feedback.

0 favorites 0 likes
#meta-learning

Model-Agnostic Meta Learning for Class Imbalance Adaptation

arXiv cs.CL · 2026-04-22 Cached

University of Memphis researchers propose HAMR, a model-agnostic meta-learning framework that uses bi-level optimization and neighborhood-aware resampling to adaptively reweight hard examples and minority classes across six imbalanced NLP datasets.

0 favorites 0 likes
#meta-learning

FSPO: Few-Shot Optimization of Synthetic Preferences Personalizes to Real Users

arXiv cs.CL · 2026-04-20 Cached

FSPO proposes a few-shot preference optimization algorithm for LLM personalization that reframes reward modeling as meta-learning, enabling models to quickly infer personalized reward functions from limited user preferences. The method achieves 87% personalization performance on synthetic users and 70% on real users through careful synthetic preference dataset construction.

0 favorites 0 likes
Next →
← Back to home

Submit Feedback