in-context-learning

#in-context-learning

Induction in Both Directions: A Mechanistic Analysis of In-Context Learning in Masked Diffusion Language Models

arXiv cs.CL ↗ · 12h ago Cached

This paper presents a mechanistic analysis of induction in masked diffusion language models, identifying a bidirectional induction circuit and showing that these models use the global fraction of masked tokens as an implicit timestep.

0 favorites 0 likes

#in-context-learning

In-context learning of closed form solution to simple linear regression task using transformer with linear self-attention

arXiv cs.LG ↗ · 12h ago Cached

This paper constructs a transformer with linear self-attention that performs in-context learning of the closed-form least squares solution for simple linear regression, using layer normalization to approximate the analytical solution rather than gradient descent.

0 favorites 0 likes

#in-context-learning

Tabular Foundation Models for Discrete Choice Estimation

arXiv cs.LG ↗ · 4d ago Cached

This paper proposes a reformulation to apply tabular foundation models (TFMs) to discrete choice estimation, addressing the structural gap of row-independent assumptions. The best reformulation outperforms hierarchical Bayesian estimation by 8% in holdout log-likelihood and 3.6% in hit rate while running 16 times faster.

0 favorites 0 likes

#in-context-learning

Partition, Prompt, Aggregate: Statistical Self-Consistency in Language Models

Hugging Face Daily Papers ↗ · 4d ago Cached

This paper proposes a framework to test whether LLM estimates obey statistical self-consistency (law of total probability) across subpopulations, finding widespread violations and the 'macro fallacy' where fine-grained estimates align better with human data.

0 favorites 0 likes

#in-context-learning

@oneill_c: https://x.com/oneill_c/status/2077453217609453784

X AI KOLs Timeline ↗ · 4d ago Cached

A researcher discusses the challenge of continual learning in LLMs, comparing them to amnesiac interns, and explores approaches like extending context windows, building stateful memory, and compressing context into latent representations, citing their work on Still.

0 favorites 0 likes

#in-context-learning

Self-Evolving In-Context Learning for Direct Pilot-to-Beamformer Design in MU-MISO Systems

arXiv cs.LG ↗ · 5d ago Cached

This paper proposes a self-evolving in-context learning framework for direct pilot-to-beamformer design in multi-user MISO systems, integrating a Transformer backbone with a pilot encoder-decoder network and curriculum learning to handle multiple channel models without retraining.

0 favorites 0 likes

#in-context-learning

The Importance of Encoder Choice:A Tabular-Image Study

arXiv cs.LG ↗ · 2026-07-10 Cached

This paper evaluates state-of-the-art tabular models as encoders in image-tabular multimodal learning, addressing the challenge of using In-Context Learning models that require labels for embedding training and test instances.

0 favorites 0 likes

#in-context-learning

AbICL: In-Context Learning for Antigen-Specific Antibody Affinity Ranking

arXiv cs.LG ↗ · 2026-07-08 Cached

AbICL proposes an in-context learning framework for antigen-specific antibody affinity ranking, combining a pretrained structural encoder with a context ranking head to leverage labeled demonstrations for test-time adaptation without gradient updates.

0 favorites 0 likes

#in-context-learning

Demonstrating TOFFEE: A Learned System for Synthesizing Data Agent Trajectories at Scale

arXiv cs.AI ↗ · 2026-07-08 Cached

TOFFEE is a system that uses Monte Carlo Tree Search with adaptive model selection and cross-task prefix reuse to synthesize high-quality data agent trajectories at scale. These trajectories can be used for fine-tuning or in-context learning to improve data agent performance in heterogeneous enterprise environments.

0 favorites 0 likes

#in-context-learning

@AnjneyMidha: true in-context learning, both in AI and in life, remains the single most valuable capability of a system/person when I…

X AI KOLs Timeline ↗ · 2026-07-08 Cached

Anjney Midha reflects on the value of in-context learning in AI and life, recalling his investment in Anthropic's 2021 seed round as an intuitive but often misunderstood concept.

0 favorites 0 likes

#in-context-learning

Induction Heads Interpolate N-Grams

arXiv cs.LG ↗ · 2026-07-07 Cached

This paper studies transformers trained on Markov chains and identifies that induction heads implement soft context-matching and Dirichlet-style smoothing, showing that transformers regularize in-context estimation rather than simply counting n-grams.

0 favorites 0 likes

#in-context-learning

From Signals to Structure: How Memory Architecture Drives Language Emergence in LLM Agents

arXiv cs.AI ↗ · 2026-07-02 Cached

This paper studies how memory architecture affects language emergence in LLM agents playing a Lewis signaling game, finding that persistent private notebook memory outperforms stateless agents and prevents high-capacity collapse.

0 favorites 0 likes

#in-context-learning

Ghost in the Kernel: In-Context Learning with Efficient Transformers via Domain Generalization

arXiv cs.LG ↗ · 2026-07-02 Cached

This paper theoretically analyzes linear transformers for in-context learning under domain generalization, establishing dimension-independent convergence rates and proposing novel activation and loss designs for linearizing pretrained softmax LLMs.

0 favorites 0 likes

#in-context-learning

Probing Memorization of Tabular In-Context Learning

arXiv cs.LG ↗ · 2026-07-01 Cached

This paper investigates parametric memorization in tabular foundation models that use in-context learning, introducing a probing framework (IclMem) to separate context-based predictions from memorization. It finds moderate memorization signals under specific conditions but notes they largely vanish under realistic training scenarios.

0 favorites 0 likes

#in-context-learning

Transformers as Bayesian In-Context Experimenters: Smoothness-Adaptive Efficient ATE Estimation

arXiv cs.LG ↗ · 2026-07-01 Cached

该论文提出贝叶斯上下文实验者（Bayesian in-context experimenters），通过训练Transformer模仿贝叶斯后验Neyman教师策略，实现自适应平均处理效应（ATE）估计，并采用混合专家Transformer处理未知平滑性，理论证明可通过监督预训练学习该策略。

0 favorites 0 likes

#in-context-learning

Can Tabular In-Context Learners Generalize to Biomolecular Property Prediction?

arXiv cs.LG ↗ · 2026-07-01 Cached

This paper investigates whether tabular in-context learning models, pretrained on synthetic causal tables, can generalize to predict biomolecular properties from limited labeled data. The authors find that these models are competitive for protein fitness regression but that representation choice is crucial for small-molecule classification.

0 favorites 0 likes

#in-context-learning

When Reranking Hurts: Uncertainty-Based Gating for Few-Shot Reranking

arXiv cs.CL ↗ · 2026-07-01 Cached

This paper challenges the assumption that reranking always improves few-shot selection for LLMs, proposing a training-free gated reranking approach that uses model uncertainty to decide when to rerank, reducing computational costs by 15-80% while slightly improving performance.

0 favorites 0 likes

#in-context-learning

TabFM: A zero-shot foundation model for tabular data

Hacker News Top ↗ · 2026-06-30 Cached

Google Research introduces TabFM, a zero-shot foundation model for tabular data that uses in-context learning to perform classification and regression without requiring manual model training or hyperparameter tuning.

0 favorites 0 likes

#in-context-learning

LC-ICL: Label-Guided Contrastive In-Context Learning for Robust Information Extraction

arXiv cs.CL ↗ · 2026-06-30 Cached

This paper proposes LC-ICL, a novel few-shot technique that uses both correct and incorrect examples with error-cause labels to improve large language models' performance on information extraction tasks like named entity recognition and relation extraction.

0 favorites 0 likes

#in-context-learning

Can LLMs Judge Better Than They Generate? Evaluating Task Asymmetry, Mechanistic Interpretability and Transferability for In-Context QA

arXiv cs.CL ↗ · 2026-06-29 Cached

This paper tests the assumption that LLMs judge better than they generate in in-context QA, finding generation accuracy exceeds self-evaluation on most benchmarks, with evaluation attending less to context. The findings challenge core assumptions in self-evaluation pipelines.

0 favorites 0 likes

in-context-learning

Submit Feedback