@KyrieCheungYep: I built a Skill: ky-markdown-rebuilder that reconstructs "visual documents" like PDFs, PPTs, whitepapers, proposals, course materials, and long screenshots into structurally reliable Markdown, because AI struggles to read those source files well. Markdown not only...

X AI KOLs Timeline Tools

Summary

KyrieCheungYep released a Skill called ky-markdown-rebuilder that specifically reconstructs visual documents such as PDFs and PPTs into structurally reliable Markdown, improving AI's ability to read and understand complex documents.

I built a Skill: ky-markdown-rebuilder It specializes in reconstructing "visual documents" like PDFs, PPTs, whitepapers, proposals, course materials, and long screenshots into structurally reliable Markdown, because AI reads these source files poorly. Markdown not only allows AI to read files better but also turns these source files into reusable and rebuildable assets. Microsoft already has MarkItDown, but anyone who has used it knows it's not great. It's better suited for "format conversion" and "text extraction." For ordinary Word documents, web pages, and linear PDFs, MarkItDown is sufficient. But when it comes to more complex reports like PPTs, sales decks, and charts, there are many problems: Left and right columns get mixed together, timelines become disordered, architectural diagrams turn into scattered text, and tables, screenshots, and annotations easily clump into a mess. When AI later reads it, it treats the jumbled text as truth. Ky-markdown-rebuilder first extracts text, then renders the original file into page-by-page screenshots, reorganizing the Markdown by comparing with the actual layout. It can do: 1. Page-by-page restoration Each page becomes its own section without merging adjacent pages. 2. Organize by visual relationships Left-right columns, cards, flowcharts, timelines, comparison matrices, and screenshot captions are reconstructed in their original structure. 3. Suitable for complex documents Handles PPTs, PDF reports, whitepapers, course materials, product decks, sales proposals, and long screenshots. 4. Supports deep visual mode For pages with high information density, it specifically checks layout, charts, and module relationships, not just extracting a few lines of text. 5. Can generate an outline If an outline is needed, it generates one based on the calibrated Markdown, rather than directly summarizing from the disordered text. I used a middle school chemistry textbook and a PPT from today's partner, created some effect examples. See the images below to visually see the results. A well-organized Markdown document can greatly improve efficiency in use. I uploaded this Skill to GitHub. Give it a try if you need it! https://github.com/KyrieCheungYep/ky-markdown-rebuilder…
Original Article
View Cached Full Text

Cached at: 07/03/26, 02:38 PM

I built a Skill: ky-markdown-rebuilder

It is specifically designed to rebuild “visual documents” like PDFs, PPTs, white papers, proposal documents, course materials, and long screenshots into structurally reliable Markdown. Since AI struggles to read these source files, Markdown not only makes them more readable for AI, but also turns these source files into reusable and reconstructable assets.

Microsoft’s official MarkItDown exists, but anyone who has used it knows it’s not very good; it’s more suited for “format conversion” and “text extraction.”

For regular Word documents, web pages, and linear PDFs, MarkItDown is sufficient. But when it comes to complex reports like PPTs, sales decks, charts, etc., problems arise:

  • Left and right columns get mixed up, timelines become disordered, architecture diagrams turn into scattered text, tables, screenshots, and annotations get jumbled together. When AI later reads it, it takes the messy text as fact.

ky-markdown-rebuilder first extracts text, then renders the original file into page-by-page screenshots, and reorganizes the Markdown against the real layout.

It can do the following:

  1. Page-by-page restoration
    Each page forms its own section, without merging adjacent pages.

  2. Organize by visual relationships
    Left/right columns, cards, flowcharts, timelines, comparison matrices, and screenshot annotations are reconstructed according to the original structure.

  3. Suitable for complex documents
    Handles PPTs, PDF reports, white papers, course materials, product decks, sales proposals, and long screenshots.

  4. Supports deep visual mode
    For pages with high information density, it specifically checks layout, charts, and module relationships, not just extracting a few lines of text.

  5. Can also generate an outline
    If an outline is needed, it generates one based on the calibrated Markdown, rather than summarizing from disordered text directly.

I used a middle school chemistry textbook and a PPT sent by a partner today, and produced some example result images. You can see the effects intuitively in the images below.

A well-structured Markdown document can greatly improve efficiency during use.

I’ve put this Skill on Github. If you need it, feel free to try it out!
https://github.com/KyrieCheungYep/ky-markdown-rebuilder


KyrieCheungYep/ky-markdown-rebuilder

Source: https://github.com/KyrieCheungYep/ky-markdown-rebuilder

KY Markdown Rebuilder

A document reconstruction Skill for Codex / Claude Code.

Purpose: To rebuild visually complex PDFs, PPTs, long screenshots, reports, proposal decks, etc., into clear, reliable, page-aligned Markdown.

Preserves page structure, tables, cards, diagram relationships, and visual reading order as much as possible.


Suitable for

  • PPT / PPTX / slide deck
  • PDF reports, white papers, course materials
  • Documents with mixed text and images
  • Materials containing tables, flowcharts, architecture diagrams, timelines, matrices
  • Long screenshots or visually dense pages

Not suitable for simple plain text files. For plain text files, a direct conversion is sufficient; this Skill is not needed.


Install to Codex

mkdir -p ~/.codex/skills
git clone https://github.com/KyrieCheungYep/ky-markdown-rebuilder.git ~/.codex/skills/ky-markdown-rebuilder

Install to Claude Code

mkdir -p ~/.claude/skills
git clone https://github.com/KyrieCheungYep/ky-markdown-rebuilder.git ~/.claude/skills/ky-markdown-rebuilder

Usage

In Codex or Claude Code, simply reference this Skill:

[$ky-markdown-rebuilder] Convert this PPTX into Markdown

You can also trigger it with natural language:

Rebuild this visually complex PDF into page-aligned Markdown

Default Output

Typically generates:

source.calibrated.md

If you request an outline or both files, it will also generate:

source.outline.md

Output Features

For complex documents, the Markdown is rebuilt as closely as possible to the original pages:

  • Each page or slide forms its own section
  • Preserves page titles and main content
  • Tables are converted to Markdown tables where possible
  • Cards, columns, flows, and matrices are organized by structure
  • Necessary descriptions are added for images, screenshots, charts, and architecture relationships
  • Avoids merging page content into a single block of messy text

English

KY Markdown Rebuilder is a document reconstruction skill for Codex / Claude Code.

It converts visually complex PDFs, PPT decks, long screenshots, reports, and proposal documents into clean, reliable, page-aligned Markdown.

It is designed for documents where plain text extraction is not enough, especially when layout, tables, diagrams, cards, timelines, screenshots, or visual hierarchy matter.

Install for Codex

mkdir -p ~/.codex/skills
git clone https://github.com/KyrieCheungYep/ky-markdown-rebuilder.git ~/.codex/skills/ky-markdown-rebuilder

Install for Claude Code

mkdir -p ~/.claude/skills
git clone https://github.com/KyrieCheungYep/ky-markdown-rebuilder.git ~/.claude/skills/ky-markdown-rebuilder

Usage

[$ky-markdown-rebuilder] Convert this PPTX into Markdown

Typical output:

source.calibrated.md

If an outline is requested:

source.outline.md

Similar Articles

@Chenzeze777: Microsoft open-sourced a document tool with 140k stars — I compiled its 5 most practical use cases. MarkItDown, a Python tool, converts PDF/Word/PPT/Excel/HTML/images into clean Markdown text with one click. What you can do with it: · P…

X AI KOLs Timeline

Microsoft open-sourced MarkItDown, a lightweight Python tool that converts PDF, Word, PPT, Excel, HTML, and images into clean, structured Markdown text in one go, ideal for AI summarization, data analysis, knowledge base construction, and more.

@KyrieCheungYep: Let me introduce this Skill. It's a Skill I developed while presenting projects to some clients, and it has been validated through my own multiple practice attempts. This is a Codex & Claude Skill specifically designed to convert visible UI reference images into high-quality HTML/CSS. It will translate the design…

X AI KOLs Timeline

KyrieCheungYep released a Codex/Claude Skill called ky-design-to-html, used to convert UI screenshots or design drafts into high-quality HTML/CSS. It reduces visual discrepancies through a workflow of deconstructing design structure, separating code from assets, canvas adaptation, and browser screenshot verification.

@KyrieCheungYep: Recommend a PPT creation Skill: ultimate-ppt-master-skill, which is closer to real-world PPT making scenarios. Its features and highlights are as follows: 1. First, clarify requirements. Many AI PPT tools start making slides immediately, resulting in nothing but empty words. PPT M…

X AI KOLs Timeline

Recommend an open-source PPT creation tool, ultimate-ppt-master-skill. It follows a workflow of clarifying requirements, organizing material structure, generating editable PPTX or Web Deck, image generation, and final review, which is close to real office scenarios. It solves common issues in AI PPT generation like empty words and lack of editability.

@Jolyne_AI: When writing technical documentation, we often use Markdown, but it falls short when dealing with complex layouts. I stumbled upon an open-source tool on GitHub: Quarkdown. Think of it as "Markdown on steroids" — it keeps the lightweight syntax but cranks up formatting, generation, and automation to the max. GitH…

X AI KOLs Timeline

Quarkdown is an enhanced typesetting system based on Markdown, supporting variables and functions, multi-format output, real-time preview and other powerful features. It is suitable for books, papers, presentations, etc. Installation requires Java 17+.