knowledge-transfer

Tag

Cards List
#knowledge-transfer

Why Solve It Twice? Hierarchical Accumulation of Skills for Transfer-Efficient ML Engineering

arXiv cs.AI · 4h ago Cached

HASTE introduces a hierarchical multi-agent system for ML engineering that organizes cross-competition knowledge into three tiers, achieving 77.3% medal rate on MLE-Bench Lite while reducing compute by 52% and demonstrating that structured knowledge transfer outperforms flat memory approaches.

0 favorites 0 likes
#knowledge-transfer

Bridging Scientific Heritage: An Arabic--Russian Parallel Corpus and LLM Benchmark for Sustainable Knowledge Transfer

arXiv cs.CL · 4h ago Cached

This paper presents a benchmark for Arabic-Russian scientific translation, including a hybrid parallel corpus of 27,000 sentence pairs and fine-tuned multilingual models (mT5, NLLB, Qwen) using LoRA. The best model achieves BLEU 23.15, and the work aims to lower language barriers for scientific knowledge exchange between Arabic and Russian researchers.

0 favorites 0 likes
#knowledge-transfer

Thinking While Speaking: Inference-Time Knowledge Transfer for Responsive and Intelligent Conversational Voice Agents

Hugging Face Daily Papers · 2026-06-23 Cached

This paper introduces a conversational voice agent system that uses a lightweight on-device 'Talker' model to start responding immediately, then incorporates knowledge from a frontier LLM 'Reasoner' as it becomes available, achieving 7-19x faster time-to-first-response while approaching frontier-level performance on a laptop.

0 favorites 0 likes
#knowledge-transfer

CacheRL:Multi-Turn Tool-Calling Agents via Cached Rollouts and Hybrid Reward

arXiv cs.CL · 2026-06-15 Cached

CacheRL trains small agent foundation models for multi-step tool-calling tasks, achieving 92% process accuracy (approaching GPT-5's 94%) with 100x less compute using cached rollouts and hybrid reward shaping, with innovations in knowledge transfer, cache-aware rewards, and iterative SFT/GRPO training.

0 favorites 0 likes
#knowledge-transfer

How Endava builds an agentic organization with Codex

OpenAI Blog · 2026-05-28 Cached

Endava, a global software contracting firm, uses OpenAI's Codex to codify senior expertise into agents, enabling small teams to deliver massive value quickly and transforming how junior and senior engineers collaborate.

0 favorites 0 likes
#knowledge-transfer

EDGE-OPD: Internalizing Privileged Context with Evidence Guided On-Policy Distillation

arXiv cs.AI · 2026-05-25 Cached

This paper introduces EDGE-OPD, a modification of on-policy self-distillation for LLMs that uses guided rollouts and evidence masks to internalize privileged context without degrading general capabilities, showing success in rare-token identity settings.

0 favorites 0 likes
#knowledge-transfer

Do Factual Recall Mechanisms Carry over from Text to Speech in Multimodal Language Models?

arXiv cs.CL · 2026-05-22 Cached

This paper investigates whether factual recall mechanisms learned in text-based language models transfer to speech modalities in multimodal speech-language models. Using causal mediation analysis on SpiritLM, it finds that the mechanisms are only partially carried over, highlighting differences between text and speech processing.

0 favorites 0 likes
#knowledge-transfer

XPERT: Expert Knowledge Transfer for Effective Training of Language Models

arXiv cs.CL · 2026-05-12 Cached

The paper introduces XPERT, a framework that extracts and reuses expert knowledge from pre-trained Mixture-of-Experts (MoE) language models to improve training efficiency and performance in downstream models.

0 favorites 0 likes
#knowledge-transfer

EVOCHAMBER: Test-Time Co-evolution of Multi-Agent System at Individual, Team, and Population Scales

Hugging Face Daily Papers · 2026-05-11 Cached

EVOCHAMBER is a training-free, multi-agent test-time evolution framework that enables emergent specialization through collaborative reflection and asymmetric knowledge transfer across individual, team, and population scales, achieving significant improvements on math, code, and reasoning tasks.

0 favorites 0 likes
#knowledge-transfer

@AnthropicAI: Research we co-authored on subliminal learning—how LLMs can pass on traits like preferences or misalignment through hid…

X AI KOLs · 2026-04-15 Cached

Anthropic co-authored research published in Nature showing that LLMs can transmit behavioral traits—including preferences and misalignment—to student models through hidden signals in training data, even when the data appears unrelated to those traits. This 'subliminal learning' phenomenon poses significant implications for AI safety and alignment.

0 favorites 0 likes
#knowledge-transfer

SkillClaw: Let Skills Evolve Collectively with Agentic Evolver

Papers with Code Trending · 2026-04-09 Cached

SkillClaw introduces a framework for collective skill evolution in multi-user LLM agent systems, enabling autonomous updates and cross-user knowledge transfer by aggregating interactions and feedback to improve performance across the ecosystem.

0 favorites 0 likes
#knowledge-transfer

Semi-supervised knowledge transfer for deep learning from private training data

OpenAI Blog · 2016-10-18 Cached

OpenAI presents PATE (Private Aggregation of Teacher Ensembles), a privacy-preserving approach that trains a student model on noisy outputs from multiple teacher models trained on disjoint datasets, providing strong differential privacy guarantees without exposing sensitive training data.

0 favorites 0 likes
← Back to home

Submit Feedback