skill-injection

Tag

Cards List
#skill-injection

@noisyb0y1: SOMEONE REVERSE-ENGINEERED KIMI K2.6 AND IT KILLS THE "BIGGER MODEL = BETTER AI" NARRATIVE FOR GOOD 1 trillion paramete…

X AI KOLs Timeline · 2026-05-26 Cached

A reverse engineering analysis of Kimi K2.6 reveals that its architecture prioritizes orchestration and skill injection over raw parameter count, achieving high SWE-Bench scores through multi-agent collaboration without retraining.

0 favorites 0 likes
#skill-injection

Investigating Cross-Modal Skill Injection: Scenarios, Methods, and Hyperparameters

arXiv cs.CL · 2026-05-20 Cached

This paper systematically investigates cross-modal skill injection, where a domain-expert LLM is merged into a VLM to induce emergent multimodal capabilities. It evaluates different scenarios (instruction-following, cross-lingual, mathematical reasoning), merging methods (TA, DARE, etc.), and hyperparameters, finding that TA and DARE perform well except in mathematical reasoning.

0 favorites 0 likes
← Back to home

Submit Feedback