Built a tool that maps research gaps from PDFs — beta, would love ML researchers to break it
Summary
The author introduces Papira, a beta tool that analyzes uploaded research papers to map coverage and identify gaps in machine learning and NLP subfields.
Similar Articles
Show HN: TikTok but for Scientific Papers
Papel is a new research-focused social platform that leverages AI-powered vector search and on-device RAG to help researchers discover, discuss, and quiz themselves on academic papers. It offers personalized feeds, local AI chat via Apple Intelligence or MLX, and gamified learning features.
@tom_doerr: Converts research papers into editable diagrams and slides https://github.com/OpenDCAI/Paper2Any…
Paper2Any is an open-source AI tool that converts research papers into editable diagrams, technical roadmaps, and slide decks with support for universal file formats and custom styling.
PaperBench: Evaluating AI’s Ability to Replicate AI Research
OpenAI introduces PaperBench, a benchmark that evaluates AI agents' ability to replicate state-of-the-art AI research. Agents must reproduce 20 ICML 2024 papers, broken down into 8,316 gradable tasks. The best-performing model (Claude 3.5 Sonnet) achieves a replication score of only 21%, below human PhD-level performance, highlighting current limitations in autonomous research capabilities.
@socialwithaayan: HUGGING FACE JUST OPEN-SOURCED THE ML INTERN EVERY RESEARCHER HAS DREAMED OF No more spending days reading papers and w…
Hugging Face open-sourced ml-intern, an autonomous agent that reads ML papers, discovers datasets, trains models, debugs failures, and ships production-ready models to the Hub, automating the entire post-training workflow.
@KanikaBK: CHINA JUST DROPPED A TOOL THAT WORKS 24 HOURS, NEVER SLEEPS AND NEVER COMPLAINS. It took one paper from borderline reje…
A new open-source tool automates the entire research paper refinement process, using Claude Code for execution and a separate model for evaluation to iteratively improve papers overnight. The system upgraded a borderline-rejected paper to submission-ready status through autonomous GPU experiments and narrative adjustments.