CodeAlchemy is a synthetic data generation framework that uses five strategies to transform publicly available code into semantically rich training data. It has produced over 500 billion tokens and enables small models to outperform much larger ones on code benchmarks.
Sumner ported 960,000 lines of Bun runtime code from Zig to Rust using Anthropic's Claude, demonstrating how much AI can speed up large-scale code rewriting.