@heynavtoor: A philosophy professor at UC Berkeley built one of the most useful tools on the internet. Not a developer. Not a startu…
Summary
This article highlights Pandoc, a free and open-source document conversion tool created by UC Berkeley philosophy professor John MacFarlane. It converts between over 50 formats locally without uploading files, saving significant time and maintaining privacy.
View Cached Full Text
Cached at: 06/25/26, 03:23 PM
A philosophy professor at UC Berkeley built one of the most useful tools on the internet.
Not a developer. Not a startup founder. Not a tech company. A philosophy professor named John MacFarlane.
He wrote it in Haskell. In 2006. He still maintains it today. Twenty years later.
It is called Pandoc. The Swiss Army knife of documents.
You give it any document in any format. It converts it to any other format. One command. Seconds.
Word to PDF. Done. Markdown to PowerPoint. Done. LaTeX to Word. Done. HTML to ePub. Done. Jupyter notebook to PDF. Done. Word to Markdown. Done. PowerPoint to Markdown. Done. Excel to Markdown. Done.
50 plus formats. Any direction. One tool.
pandoc thesis.docx -o thesis.pdf
That is it. One line. Your 300-page thesis converted. Formatting preserved.
Here is what this replaces:
A PhD student at Berkeley converting a thesis from LaTeX to Word. By hand: 6 hours. Pandoc: 3 seconds.
A novelist in Brooklyn converting a manuscript to ePub for Kindle. By hand: two days. Pandoc: one command.
A startup in Chicago migrating 200 HTML blog posts to Markdown. By hand: a full week. Pandoc: 4 minutes.
A professor at MIT turning 200 Markdown lecture notes into a PowerPoint deck. By hand: 2 hours. Pandoc: one command.
Here is what online converters charge for the same work:
Zamzar: $25 a month. CloudConvert: $8 a month. Smallpdf Pro: $15 a month. Adobe Acrobat Pro: $239.88 a year.
Every one of them uploads your documents to their servers. Your thesis. Your manuscript. Your private notes. Sitting on someone else’s machine.
Pandoc runs on your laptop. Nothing uploaded. Nothing sent anywhere.
44,997 stars on GitHub. GPL-2.0 license. Version 3.10 shipped June 4 2026.
Here is the wild part.
Pandoc powers R Markdown. Pandoc powers Jupyter Book. Pandoc powers Quarto. Every time you export a notebook to a PDF or render a research report, Pandoc is probably running underneath.
The standard of document conversion on the internet was built by one philosophy professor in his spare time.
In 2006, John wanted to write his lecture notes in a lightweight format and export them to Word, HTML, and PDF. He had never written a line of Haskell in his life. Pandoc was his first Haskell program.
He has shipped 14,505 commits since then. He co-wrote the CommonMark standard. He still teaches philosophy at Berkeley. He still ships Pandoc himself.
Microsoft did not build it. Adobe did not build it. Google did not build it.
One philosophy professor. Twenty years. Forty-five thousand stars.
Your documents. Your formats. Your machine. One command.
The week you used to lose to file conversion is back in your hands.
(Link in the comments)
Similar Articles
@0xQiYan: Brothers, have you ever encountered situations where various format conversions require a membership, and still worry about not having one? Discovered an open-source project for format conversion that Microsoft and Google couldn't achieve, but a philosophy professor managed to do in his spare time. Pandoc—the document conversion artifact, one command, a few seconds, over 50 formats freely converted. Word to PDF, ...
Introducing the open-source document conversion tool Pandoc, developed in spare time by philosophy professor John MacFarlane, supporting conversion between over 50 formats, free, open source, and fully local.
@Ryrenz: Papers, contracts, PDFs — these open-source tools cover all document work: 1. opendatalab/MinerU (68.9k) — from Shanghai AI Lab, one-click PDF/document to markdown, excellent academic paper layout restoration. https://github.c…
This tweet summarizes 6 open-source tools covering PDF to markdown, document understanding, OCR, paper translation, and automatic literature review, aiming to streamline document workflows.
Markdown (Aaron Swartz: The Weblog)
Aaron Swartz announces the release of Markdown, a lightweight text-to-HTML conversion tool co-developed with John Gruber, along with his complementary html2text converter.
@TechFlow99: BREAKING: Someone just built the exact tool Andrej Karpathy said someone should build. 48 hours after Karpathy posted h…
A new open-source tool called Graphify was built within 48 hours of Andrej Karpathy describing an LLM knowledge base workflow, enabling users to generate navigable knowledge graphs, Obsidian vaults, and wikis from any folder with 71.5x fewer tokens per query compared to reading raw files. It integrates with Claude Code and supports 13 programming languages, PDFs, images, and Markdown.
@tom_doerr: Converts images and PDFs to Markdown without OCR https://github.com/NanoNets/docext
docext is an on-premises toolkit that converts images and PDFs to markdown without OCR, leveraging vision-language models. It also introduces Nanonets-OCR-s, a compact 3B parameter model for efficient image-to-markdown conversion.