HTML table extractor

Simon Willison's Blog Tools

Summary

A tool that extracts tables from pasted HTML, rich text, or plain text and exports them to HTML, Markdown, CSV, TSV, or JSON, with added Wikipedia search and automatic table import.

No content available
Original Article
View Cached Full Text

Cached at: 06/30/26, 03:33 AM

# Tool: HTML table extractor Source: [https://simonwillison.net/2026/Jun/29/html-table-extractor/](https://simonwillison.net/2026/Jun/29/html-table-extractor/) [Tool](https://simonwillison.net/elsewhere/tool/)[HTML table extractor](https://tools.simonwillison.net/html-table-extractor)— Extract tables from pasted content and convert them to multiple formats\. Paste HTML, rich text, or plain text containing tables, and the tool automatically detects and displays each table with a preview, then allows you to export it as HTML, Markdown, CSV, TSV, or JSON\. Yet another in my growing collection of paste\-conversion tools\. This one accepts pasted rich text from browsers \(with embedded HTML tables\) and converts every detected table into HTML, Markdown, CSV, TSV, or JSON\. Try it out by selecting everything on the Wikipedia[List of cities and towns in the San Francisco Bay Area](https://en.wikipedia.org/wiki/List_of_cities_and_towns_in_the_San_Francisco_Bay_Area)page and pasting it directly into the tool: ![Screenshot of a web interface for converting table data between formats. A row of tabs labeled HTML, Markdown, CSV, TSV, and JSON sits below the bottom edge of a styled data table, with the TSV tab currently selected. The TSV tab displays the table's contents as tab-separated plain text in a monospaced font inside a bordered panel, with a "Copy" button in the upper right of that panel.](https://static.simonwillison.net/static/2026/html-table.jpg) On a similar note, I recently[rebuilt](https://github.com/simonw/tools/commit/f278e977751dbc1948baedfc2f26b6de870f60e6)my[Rich text to markdown](https://tools.simonwillison.net/rich-text-to-markdown)tool to add support for tables and generally improve the UI\. **Update**: It turns out Wikipedia has an open CORS API for retrieving the full rendered HTML content of any page \-[demo here](https://tools.simonwillison.net/cors-fetch#url=https%3A%2F%2Fen.wikipedia.org%2Fw%2Fapi.php%3Faction%3Dparse%26page%3DList_of_cities_and_towns_in_the_San_Francisco_Bay_Area%26prop%3Dtext%26format%3Djson%26origin%3D%2A)\- so I[had Codex](https://gist.github.com/simonw/f226fe96f464ec7d81d6996cb466436d)add the ability to search Wikipedia for a page and then automatically import and display any tables from that page\.

Similar Articles

@geekbb: AI-generated technical docs are often thousands of lines long, scrolling in the terminal — nobody wants to read them. md2html lets AI automatically convert those Markdown docs into HTML pages with sidebar table of contents, diagrams, timelines, cards, and callouts, all in a single file to share with the team. https://github.c…

X AI KOLs Timeline

md2html is a tool that converts AI-generated Markdown documents into polished, self-contained HTML pages with sidebar table of contents, diagrams, timelines, and callouts, making them easier to read and share.

Markdown (Aaron Swartz: The Weblog)

Lobsters Hottest

Aaron Swartz announces the release of Markdown, a lightweight text-to-HTML conversion tool co-developed with John Gruber, along with his complementary html2text converter.

MD+HTML Reader

Product Hunt

MD+HTML Reader allows users to review AI-generated Markdown and HTML in a focused workspace.