Tag
ChinaTextbook is a GitHub open-source project that organizes textbook PDFs from primary school to university for free download, making it convenient for parents and students to access digital textbooks.
A 277-page PDF guide revealing insights into Large Language Models, shared via a Twitter thread by Matt Dancho.
Cursor can now read documents attached in Slack threads, supporting formats like .txt, .log, .json, .zip, .pdf, and .docx.
The open-source tool book-to-skill on GitHub converts PDF/EPUB tech books into Claude Code skills, generating a table of contents, core concepts, and patterns with one click, turning dusty books into a personal on-demand consultant.
Microsoft's Markitdown tool converts PDFs to markdown, saving tokens and cost when feeding documents to AI models like Claude, but requires caution with scanned PDFs, charts, and complex tables.
Extend released Parse 2.0, a state-of-the-art document parsing API that achieves top accuracy on real-world documents, outperforming competitors on the open-source RealDoc-Bench benchmark.
A user describes how someone used an AI-powered tool to read a 134-page book in 15 minutes, generating atomic notes and flashcards without highlighting, highlighting the productivity gap from using proper infrastructure.
Abogen is an open-source tool that can convert documents like ePub and PDF into high-quality audio with one click, automatically generating synchronized subtitles. It supports a voice mixer and multiple deployment methods.
Advice on parsing tables from PDFs by converting to PNGs and using Gemini 3.1 Pro with low thinking, claiming 95% accuracy. Other tools like Extend, Reducto, Landing are poor for this task.
pdf-inspector is an open-source Rust library for intelligently classifying PDF types (text or scanned), extracting text, and converting to Markdown, avoiding unnecessary OCR to improve speed and save costs.
Jason Liu shared how to use Codex as a central hub to handle knowledge work across tools, such as slides, PDFs, spreadsheets, to maximize its value.
Recommending MinerU 3.1 document parsing tool, which perfectly converts PDF, Word, PPT etc. to Markdown, supports auto-identification of tables, formulas, images, and offers three modes (Pipeline/VLM), open-source and commercially usable.