Tag
This article recommends a website called Sophon, which aggregates AI papers, models, benchmarks, leaderboards, and reinforcement learning environments. It provides real-time rankings, comparisons, and subscription features, and is hailed as the Bloomberg terminal for AI research.
This paper presents a qualitative study based on interviews with CS researchers, revealing a paradox of pragmatic skepticism where researchers distrust LLM leaderboard rankings yet continue to use them as rough guides. It finds that peer networks are primary for model selection, arena-based leaderboards are preferred, and cost transparency is the most demanded feature.
Niels from Hugging Face announces new features for the revived PapersWithCode platform, including multi-metric leaderboards, support for external papers, paper lineage, and more.
Niels from Hugging Face announces the revival of PapersWithCode as paperswithcode.co, a platform that parses high-impact AI papers at scale and automatically generates leaderboards and benchmarks, incorporating features like trending papers, domain categorization, and external paper support.
NielsRogge announces a revival of PapersWithCode, featuring SOTA per domain, leaderboards, and methods parsed at scale using AI agents.