Tag
Microsoft released MarkItDown, an open-source Python library that converts a wide range of document formats to Markdown for use with LLMs.
Microsoft has launched the MarkItDown tool, which converts various document formats (such as PDF, Word, and Excel) into Markdown, making it easier for large language models and RAG systems to process enterprise data.
Microsoft open-sourced MarkItDown, a lightweight Python tool that converts PDF, Word, PPT, Excel, HTML, and images into clean, structured Markdown text in one go, ideal for AI summarization, data analysis, knowledge base construction, and more.
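As a rough illustration of the conversion described above, the sketch below wraps MarkItDown's Python API in a small helper. The `MarkItDown().convert(...)` call and the `text_content` attribute follow the project's README. The `SUPPORTED_SUFFIXES` set and both helper functions are hypothetical names introduced here, and the suffix list is only an assumption that mirrors the formats named in the summaries, not the library's authoritative list.

```python
from pathlib import Path

# Assumption: suffixes chosen to mirror the formats named in the summaries
# (PDF, Word, PowerPoint, Excel, HTML, images). Not an official list.
SUPPORTED_SUFFIXES = {".pdf", ".docx", ".pptx", ".xlsx", ".html", ".jpg", ".png"}


def is_supported(path: str) -> bool:
    """Return True if the file suffix is in the assumed supported set."""
    return Path(path).suffix.lower() in SUPPORTED_SUFFIXES


def convert_to_markdown(path: str) -> str:
    """Convert one document to Markdown text using MarkItDown."""
    # Imported lazily so is_supported() works without the package installed.
    from markitdown import MarkItDown  # pip install markitdown

    converter = MarkItDown()
    result = converter.convert(path)
    return result.text_content
```

Calling `convert_to_markdown("report.pdf")` on a local file would return the Markdown as a string, ready to pass to a summarizer or to index in a knowledge base. Depending on the installed version, some formats may need the package's optional dependencies.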
This tool lets users convert documents (DOCX, PPT, PDF, MD) to HTML and edit them with an AI agent CLI. It supports multiple output formats, is local-first, and is open source.