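@vintcessun: Get It turns a PDF into an interactive knowledge graph and visualization engine rather than yet another summarizer. It detects concepts one by one, generates a 3D, animated, or formula visualization for each keyword, then closes the loop with flashcards, quizzes, Feynman teaching, and other tools to assess depth of understanding. Each concept is scored on four dimensions (memory, comprehension, structure, application), and the scores can only…
Summary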
Get It turns PDFs into interactive knowledge graphs and visualizations, using concept detection and multi-format rendering to help students deeply understand material. It runs locally using the user's own ChatGPT account.
Cached: 2026/06/08 05:19
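Get It turns a PDF into an interactive knowledge graph and visualization engine rather than yet another summarizer. It detects concepts one by one, generates a 3D, animated, or formula visualization for each keyword, then closes the loop with flashcards, quizzes, Feynman teaching, and other tools to assess depth of understanding. Each concept is scored on four dimensions (memory, comprehension, structure, application), and the scores can only go up. All AI capabilities run through your own ChatGPT account (via the Codex CLI), with no extra subscription, and all data stays local.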
beltromatti/get-it
Source: https://github.com/beltromatti/get-it
Get It.
Read it. See it. Get it.
The study companion that turns a PDF into a measurable mastery map. Built around the document, not in place of it.

The problem
The student already has the PDF. They don’t need another summary. They need to see the parts a textbook refuses to draw, and they need a way to prove to themselves that they have understood. Concept by concept, not page by page.
Today’s tools measure surface area, not depth. Flashcard ratings measure recall in the moment. Mind maps measure how much you drew. Summaries measure how patient the AI was. None of them answer the only question that matters on exam day:
Would I survive a question I have not seen before?
Get It. is the layer that answers it.
How it works
Drop a PDF — a digital, text-based one. Get It. checks the file up front and turns away scans or image-only documents with a clear message, because it reads text, not pictures. Once a file clears that gate, three things start at once.
- The page tags itself. A concept-detection agent walks every page and plants inline tag pills on the words that benefit from a picture. Each tag carries a renderer choice: 3D scene, 2D animation, formula walkthrough, plotted graph, or cited source.
- The right pane fills in. Click a tag and its visualization renders — Three.js for anatomy and molecules, Canvas for physics and chemistry animations, KaTeX-clean formulas, a plot engine for functions and distributions, authoritative quotes for legal articles and named papers. Ready tags are marked so you can tell what already exists, and a setting renders every tag automatically as you read if you prefer. When a sandbox crashes, the agent reads its own error and re-emits a fix. The student sees “repairing” instead of red text.
- A knowledge graph builds itself. Six to twenty-five concept nodes, typed edges, sized by mastery, coloured by progress, clickable for the four-axis breakdown plus the evaluator’s note.
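The up-front check that turns away scans can be pictured as a model-free heuristic over the text extracted from each page. The sketch below is an illustration only: the function name `checkTextBased`, the `GateResult` shape, and both thresholds are assumptions, not taken from the Get It. source.

```typescript
// Hypothetical sketch: names and thresholds are assumptions, not Get It.'s code.
interface GateResult {
  ok: boolean;
  reason?: string;
}

// pageTexts holds the text already extracted from each page, in page order.
function checkTextBased(
  pageTexts: string[],
  minCharsPerPage = 50, // assumed: fewer non-space characters counts as "no text"
  minTextPageRatio = 0.5, // assumed: at least half the pages must carry text
): GateResult {
  if (pageTexts.length === 0) {
    return { ok: false, reason: "The PDF has no pages." };
  }
  // Count pages whose text, ignoring whitespace, reaches the minimum length.
  const pagesWithText = pageTexts.filter(
    (text) => text.replace(/\s+/g, "").length >= minCharsPerPage,
  ).length;
  if (pagesWithText / pageTexts.length < minTextPageRatio) {
    return {
      ok: false,
      reason: "This looks like a scanned or image-only PDF. A text-based PDF is required.",
    };
  }
  return { ok: true };
}
```

A gate of this kind needs no model call, so a rejected file costs nothing against the user's ChatGPT allowance.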
Then the loop closes. Four study tools feed one journal.
| Tool | What it measures |
|---|---|
| 💬 Chat | Recall references and paraphrases. Multi-turn, multi-thread, scoped to one document. |
| 🎴 Flashcards | Open-recall under self-grade. Again / Hard / Good / Easy on every card. |
| ✅ Quizzes | Forced-choice discrimination. One correct answer, three plausible distractors. |
| 💡 Feynman | The agent plays a curious eight-year-old. You teach. The strongest comprehension signal. |
After every completed session the evaluator agent reads the journal end-to-end and updates four scores per concept node on the knowledge graph: memory, comprehension, structure, application. Each scored 0 to 100. Each monotone non-decreasing by a runtime clamp. The student can only progress, never regress.
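One way to read "monotone non-decreasing by a runtime clamp" is that whatever the evaluator proposes, each axis is first bounded to 0–100 and then never allowed below its stored baseline. The sketch below assumes that reading. The four axes and the range come from the text above, while the type and function names are hypothetical.

```typescript
// Hypothetical sketch of the clamp; only the axes and the 0–100 range are from the README.
type Axis = "memory" | "comprehension" | "structure" | "application";
type Scores = Record<Axis, number>;

const AXES: Axis[] = ["memory", "comprehension", "structure", "application"];

function clampMonotone(baseline: Scores, proposed: Scores): Scores {
  const out: Scores = { memory: 0, comprehension: 0, structure: 0, application: 0 };
  for (const axis of AXES) {
    const value = proposed[axis];
    // A non-finite proposal (NaN, Infinity) is ignored and the baseline is kept.
    const bounded = isFinite(value) ? Math.min(100, Math.max(0, value)) : baseline[axis];
    // The score may rise or stay put, but never drop below the baseline.
    out[axis] = Math.max(baseline[axis], bounded);
  }
  return out;
}
```

With a baseline of `{ memory: 40, comprehension: 55, structure: 30, application: 10 }` and a proposal of `{ memory: 35, comprehension: 70, structure: 130, application: -5 }`, the result is `{ memory: 40, comprehension: 70, structure: 100, application: 10 }`: the lower memory score is rejected, and the out-of-range values are bounded.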
The four numbers are the difference between a study app and a measurement instrument.
Bring your own ChatGPT
The AI side of Get It. has no business model layered on top.
You sign in once with the ChatGPT account you already pay for (or an OpenAI API key) through the official Codex CLI. Every agent inside the app runs against your own tier. There is no Get It. server, no shared key pool, no per-message metering, no “AI credits” wallet, no second subscription, and no plan to ever ship one.
- You pay for AI once. ChatGPT Plus, Pro, Team, Enterprise, or Edu covers everything Get It. does.
- Plus is the practical floor. The free tier signs in but its Codex allowance is intentionally small. Plus and above give comfortable session headroom in the same flow.
- Your data stays yours. No backend, no upload step, no analytics. The work-context journal is a single JSON file on your disk, downloadable in one click from the right-pane menu.
- Rate limits are OpenAI’s. When you hit one, the app shows a countdown banner and resumes the background work itself once the window clears.
Other AI study apps wrap a marked-up subscription around a model API the vendor holds. Get It. wraps a study workflow around the access you already have.
Install
Get It. is a desktop app. Download the installer for your machine, double-click, sign in with the ChatGPT account you already use. Nothing else to buy.
| Platform | Installer |
|---|---|
| macOS (Apple Silicon, M1 / M2 / M3 / M4) | Get It-<version>-arm64.dmg |
| macOS (Intel) | Get It-<version>.dmg |
| Windows 10 / 11 (x64) | Get It Setup <version>.exe |
| Linux (x64) | Get It-<version>.AppImage |
Every release ships on the Releases page. The app checks for a newer build on every launch and offers a one-click update inside its own window.
First launch
The setup wizard verifies the bundled Codex CLI, walks the OAuth sign-in, and refuses to open the main window until both gates are green. Then drop a PDF, or open one of the five bundled samples (anatomy, classical mechanics, Italian constitution, calculus, organic chemistry). Tags, chats, flashcard decks, quizzes, Feynman sessions, and the knowledge graph all stay on your computer.
Gatekeeper and SmartScreen
macOS builds (both Apple Silicon and Intel) are signed with a paid Apple Developer ID Application certificate AND notarized by Apple. On a fresh download from this repo’s Releases page the OS opens the app with no Gatekeeper prompt — the stapled notarization ticket tells Gatekeeper the binary is trusted before the network is even consulted. spctl --assess reports source=Notarized Developer ID and xcrun stapler validate passes on both architectures.
Windows builds are not signed. The first launch shows a SmartScreen warning (“Windows protected your PC”); click More info → Run anyway. The warning persists because SmartScreen reputation is per-certificate and we currently don’t ship a Windows code-signing cert (Microsoft’s Trusted Signing service requires a paid Azure subscription that the project doesn’t carry).
If you ever pull a build that wasn’t notarized — a local ad-hoc build before secrets are wired up, an old release from before v1.2.1 — macOS shows the “unidentified developer” prompt instead. The bypass is System Settings → Privacy & Security → Open Anyway (macOS Sequoia 15 and macOS 26 removed the older right-click → Open shortcut). Or strip the quarantine flag in one shot from the CLI: xattr -dr com.apple.quarantine "/Applications/Get It.app".
Storage
Everything lives under one OS-native directory.
| OS | Path |
|---|---|
| macOS | ~/Library/Application Support/get-it/ |
| Windows | %APPDATA%\get-it\ |
| Linux | ~/.local/share/get-it/ |
Layout: one folder per document at docs/<docId>/ (source PDF, extracted text cache, tags, work context, knowledge graph), a docs.json index at the root, a codex-scratch/ working dir, and logs/. Deleting a doc from the Library wipes the whole folder.
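The path table and layout above can be expressed as a small resolver. This is a sketch under stated assumptions: the function names are hypothetical, the platform identifiers follow Node's `process.platform` values, and the caller is assumed to pass in the home directory and the `%APPDATA%` value. How the app itself resolves these paths is not stated here.

```typescript
// Hypothetical sketch: mirrors the documented paths, not the app's actual resolver.
type Platform = "darwin" | "win32" | "linux";

// home is the user's home directory; appData is the value of %APPDATA% on Windows.
function dataDir(platform: Platform, home: string, appData: string): string {
  switch (platform) {
    case "darwin":
      return home + "/Library/Application Support/get-it";
    case "win32":
      return appData + "\\get-it";
    case "linux":
      return home + "/.local/share/get-it";
  }
}

// Per-document folder under the root, following the docs/<docId>/ layout.
function docDir(root: string, docId: string, sep: "/" | "\\"): string {
  return [root, "docs", docId].join(sep);
}
```

For example, `docDir(dataDir("linux", "/home/ada", ""), "abc123", "/")` yields `/home/ada/.local/share/get-it/docs/abc123`, where `abc123` is a made-up document id.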
Hack on it
git clone https://github.com/beltromatti/get-it.git
cd get-it
npm install
npm run dev # builds the Next.js standalone bundle and opens it in Electron
npm run dev exercises the full path: setup wizard, embedded server, IPC bridge. Re-run after edits.
For browser-side hot reload:
npm run browser:dev # http://localhost:3000
(The Electron-internal HMR loop has a known Next 16.2.6 + Turbopack + Chromium 130 hydration glitch, so browser dev or rebuild-and-test is the cleaner inner loop.)
Local desktop builds, one or all targets:
npm run build && npm run electron:prepare
node scripts/build-electron.mjs --target=mac-arm64 # or mac-x64 / win-x64 / --all
Artefacts land in dist-electron/. Cross-arch builds pull the matching Codex platform package from npm on the fly, so you do not need an Intel Mac or a Windows VM to build for them.
Releases are tag-driven. Push a vX.Y.Z tag to main and .github/workflows/release.yml builds every target on a native runner, attaches the .dmg / .exe / .AppImage to a GitHub Release, and pins the version into Info.plist and NSIS metadata from the tag itself.
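As an illustration of how a version can be derived from the tag itself, the sketch below parses a `vX.Y.Z` tag and builds the installer file names listed in the Install table. The function names are hypothetical, and the real workflow in `release.yml` may derive these values differently.

```typescript
// Hypothetical sketch: file name patterns come from the Install table; the code does not.
function versionFromTag(tag: string): string | null {
  // Accept only tags of the exact form vX.Y.Z with numeric parts.
  const match = /^v(\d+)\.(\d+)\.(\d+)$/.exec(tag);
  return match ? `${match[1]}.${match[2]}.${match[3]}` : null;
}

interface InstallerNames {
  macArm64: string;
  macX64: string;
  winX64: string;
  linuxX64: string;
}

function installerNames(version: string): InstallerNames {
  return {
    macArm64: `Get It-${version}-arm64.dmg`,
    macX64: `Get It-${version}.dmg`,
    winX64: `Get It Setup ${version}.exe`,
    linuxX64: `Get It-${version}.AppImage`,
  };
}
```

Rejecting anything that is not a strict `vX.Y.Z` tag keeps branch names or pre-release suffixes from leaking into installer metadata. That strictness is a design choice of this sketch, not a documented behaviour.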
Architecture in one breath
upload ─► quality gate (model-free) ─► pdfjs-dist extracts text + glyph bboxes per page
            │
            ├──► visualizer pipeline
            │      ├─ batched concept-detection agent → DetectedConcept[] with anchor strings
            │      │    (≤5 pages per call, concurrency 3)   (each concept carries its page)
            │      └─ per-tag visualization-spec agent → 3d / 2d-anim / formula / graph / 2d-text spec
            │           (lazy: on click by default)   (server-side syntax preflight + client-side
            │                                          runtime repair loop on sandbox crashes)
            │
            └──► knowledge-graph pipeline
                   ├─ kg-build agent (one-shot) → 6–25 concept nodes + typed edges + global note
                   │    ◄── full document text (bounded by the 150-page upload cap)
                   └─ kg-evaluate agent (incremental) → per-node {memory, comprehension, structure,
                        ◄── current graph (baseline scores)   application} 0–100, monotone non-decreasing
                        ◄── interactions since the last pass
Nine prompts behind one auth path, nine schemas behind one shared SDK wrapper. The full design rationale, the four-axis rubric, the per-doc evaluator queue, the LLM-code sandbox, and the desktop-packaging layer are in technical-writeup.md, also rendered as PDF.
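The "≤5 pages per call, concurrency 3" note in the diagram can be sketched as a two-level chunking plan: pages are grouped into batches of at most five, and batches into waves of at most three that would run at the same time. The wave-based schedule is a simplifying assumption made here for illustration, and the function names are hypothetical. The actual scheduler may instead be a worker pool that starts a new batch as soon as one finishes.

```typescript
// Hypothetical sketch: a wave-based plan, not the app's actual scheduler.
function chunk<T>(items: T[], size: number): T[][] {
  if (size < 1) throw new Error("size must be at least 1");
  const out: T[][] = [];
  for (let i = 0; i < items.length; i += size) {
    out.push(items.slice(i, i + size));
  }
  return out;
}

// Returns waves → batches → 1-based page numbers.
function planDetection(
  pageCount: number,
  pagesPerCall = 5, // from the diagram: at most 5 pages per agent call
  concurrency = 3, // from the diagram: at most 3 calls in flight
): number[][][] {
  const pages: number[] = [];
  for (let p = 1; p <= pageCount; p++) pages.push(p);
  const batches = chunk(pages, pagesPerCall);
  return chunk(batches, concurrency);
}
```

For a 32-page document this yields seven batches (six of five pages, one of two) arranged in three waves of three, three, and one batch. The last wave holds only pages 31 and 32.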
The team
Built in 24 hours at GDG AI Hack 2026, Milan, for the Braynr challenge. The hackathon submission lived at commit 277ec43. Everything past that commit is post-hackathon polish: desktop packaging, the persistent Library, the first-launch setup wizard, the quizzes tool, the in-app auto-update flow, the server-side jobs runner, and long-document support that keeps a 100-page PDF affordable on a single ChatGPT plan. The product is the same. Only the way it gets onto a student’s laptop has changed.
- Mattia Beltrami, Politecnico di Milano
- Matteo Impieri, Politecnico di Milano
- Filippo Difronzo, Politecnico di Milano
- Luca Feggi, Università di Padova
Notice
Get It. is an independent project. It is not affiliated with, endorsed by, or sponsored by OpenAI. The app uses the official open-source Codex CLI as the transport between the local app and OpenAI’s models, signed in with the end user’s own ChatGPT or OpenAI API account. “OpenAI”, “ChatGPT”, and “Codex” are trademarks of their respective owner; we use the names only to describe what Get It. interoperates with.
Your use of OpenAI’s models through Get It. is subject to OpenAI’s own Terms of Use, Usage Policies, and Privacy Policy, and to the Codex CLI’s own license and release notes. Those documents are authoritative for what the model service permits, how data is handled on OpenAI’s side, and what each subscription tier covers.
License
Apache License 2.0. See LICENSE. Source is open. Contributions are welcome.