Tag
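The Google Gemma team is sponsoring a one-day hackathon on Kaggle, with prize money on offer, to encourage the community to use Gemma 4 to build lightweight tools or advance AI innovation.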
Unsloth enables free fine-tuning of a 31B-parameter multimodal model on Kaggle using 4-bit quantization, requiring only 22-24 GB of VRAM for local runs.
Free 5-day AI Agents course on Kaggle using Gemini, covering topics from introduction to deployment.
LongDS is a benchmark for evaluating AI agents on long-horizon, multi-turn data analysis tasks derived from Kaggle notebooks; experiments show that the best models achieve only 48% accuracy, with a significant drop as the number of turns grows.
A Reddit user debunks claims by Seed IQ (AGX) that it solved the ARC-AGI-3 benchmark with a perfect score, arguing that its refusal to submit to the Kaggle leaderboard (which allows closed-source submissions) suggests a scam.
Kaggle and Google are hosting a free 5-day intensive course (June 15-19) on building AI agents, culminating in a simulated challenge called Kaggriculture.
Promotion of mlcourse.ai, an open-source machine learning course by OpenDataScience featuring theory, practice, and Kaggle competitions.
Google and Kaggle are hosting a free five-day AI Agents Vibe Coding course in June 2026, focusing on building production-ready agents using natural language workflows.
Google DeepMind and Kaggle have launched the FACTS Benchmark Suite, a comprehensive set of evaluations including parametric, search, multimodal, and grounding benchmarks to systematically measure the factuality of large language models.
OpenAI introduces MLE-bench, a benchmark of 75 Kaggle ML competitions to evaluate AI agents on real-world ML engineering tasks. The best setup, o1-preview with AIDE scaffolding, achieves at least a Kaggle bronze medal in 16.9% of competitions.