PaperMentor: A Human-Centered Multi-Agent Writing Tutor for AI Research Papers on Overleaf
Summary
PaperMentor is a human-centered multi-agent writing assistant that integrates an expert skill library with specialized agents to provide actionable inline comments on Overleaf, outperforming GPT-5.2 in usability and relevance for AI research papers.
View Cached Full Text
Cached at: 06/10/26, 09:46 PM
Paper page - PaperMentor: A Human-Centered Multi-Agent Writing Tutor for AI Research Papers on Overleaf
Source: https://huggingface.co/papers/2606.08857 Authors:
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
Abstract
A human-centered writing assistant system called PaperMentor integrates expert research advice with specialized agents to provide actionable feedback during manuscript drafting, outperforming AI baselines in usability and relevance.
Expert writing feedback from experienced researchers is critical for early-career scholars to improve their manuscripts, yet high-quality feedback often remains scarce because reviewing research papers is labor-intensive. EmergingAI-powered writing assistantslargely focus ongrammar fixesor simulating peer review with final scores, yet they fall short of providing concrete,actionable suggestionsthat help students improve their papers during drafting. We present PaperMentor, a human-centered writing assistant system that deliversactionable suggestionsasOverleaf-native inline commentswhile leaving the actual writing entirely to human authors. PaperMentor integrates anexpert skill librarycarefully curated from established researchers’ writing advice with 12specialized agentscovering different aspects of paper writing, such as formatting compliance, phrasing accuracy, and terminology consistency. In auser study(n=14), 90.6% of the generated comments were rated actionable and 67.5% were rated valid, significantly outperforming a GPT-5.2 baseline uswithout the skill library. We release PaperMentor asopen sourcefor public use. Our code is publicly available under the AGPL-3.0 license at https://github.com/jiarui-liu/overleaf
View arXiv pageView PDFProject pageGitHub6Add to collection
Get this paper in your agent:
hf papers read 2606\.08857
Don’t have the latest CLI?curl \-LsSf https://hf\.co/cli/install\.sh \| bash
Models citing this paper0
No model linking this paper
Cite arxiv.org/abs/2606.08857 in a model README.md to link it from this page.
Datasets citing this paper0
No dataset linking this paper
Cite arxiv.org/abs/2606.08857 in a dataset README.md to link it from this page.
Spaces citing this paper0
No Space linking this paper
Cite arxiv.org/abs/2606.08857 in a Space README.md to link it from this page.
Collections including this paper0
No Collection including this paper
Add this paper to acollectionto link it from this page.
Similar Articles
@HowToAI_: This might be the most unreal academic-writing upgrade I’ve ever seen. A team from NUS open-sourced PaperDebugger, a in…
A team from NUS open-sourced PaperDebugger, a multi-agent system that lives inside Overleaf, providing real-time rewriting, critique, and citation assistance with an open enhancer model (XtraGPT-7B), making Overleaf a full research environment.
@JeremyNguyenPhD: Paper Debugger: Multi-Agent System for Academic Writing
PaperDebugger is an open-source multi-agent system from NUS that integrates with Overleaf to help rewrite academic papers in real time.
I built a "Research Paper Writing Army" – 20+ autonomous agents, 6 novelty engines, and 10 adversarial reviewers
The author open-sourced Sisyphus Academica, a self-coordinating swarm of over 20 specialized agents for producing publication-ready research papers with novelty engines and adversarial review to avoid hallucinated citations and AI-typical prose.
PaperBench: Evaluating AI’s Ability to Replicate AI Research
OpenAI introduces PaperBench, a benchmark evaluating AI agents' ability to replicate state-of-the-art AI research by replicating 20 ICML 2024 papers with 8,316 gradable tasks. The best-performing model (Claude 3.5 Sonnet) achieves only 21% replication score, below human PhD-level performance, highlighting current limitations in autonomous research capabilities.
AI-written critiques help humans notice flaws
OpenAI trained language models to write critiques of text summaries, helping human evaluators spot flaws more effectively — a step toward scalable oversight of AI systems on difficult tasks. The work explores how AI-assisted feedback can improve human evaluation quality as a proof of concept for alignment research.