document-extraction

#document-extraction

Can Qwen3.6-35B-A3B on an RTX 3060 Replace Google Vision for Receipt-to-JSON Extraction?

Reddit r/LocalLLaMA ↗ · 2d ago

A developer shares their experience using a local Qwen VL model on an RTX 3060 to parse Japanese receipts into JSON, replacing Google Vision, with results showing accurate extraction of key fields at ~31 seconds per receipt.

0 favorites 0 likes

#document-extraction

@hasantoxr: Now turn messy documents into structured knowledge with one command. It's called Hyper-Extract. Most RAG tools just chu…

X AI KOLs Timeline ↗ · 4d ago Cached

Hyper-Extract is a CLI tool that transforms messy, unstructured documents into structured knowledge such as knowledge graphs, hypergraphs, temporal/spatial graphs, and Obsidian vaults, supporting local LLM inference and MCP integration.

0 favorites 0 likes

#document-extraction

Beyond Logprobs: A Multi-Signal Confidence Engine for LLM-Based Document Field Extraction

arXiv cs.CL ↗ · 5d ago Cached

ExtractConf is a confidence estimation method for LLM-based document field extraction that uses two structurally different calls (field-guided and document-guided) to derive disagreement signals, achieving 0.928 ROC AUC on DocILE invoices and enabling reliable selective prediction for high-stakes automation.

0 favorites 0 likes

#document-extraction

Agentic Document Extraction

Product Hunt ↗ · 2026-06-17

Agentic Document Extraction is a tool that uses AI agents to make documents computable by extracting structured data from unstructured documents.

0 favorites 0 likes

#document-extraction

@jerryjliu0: Every enterprise organization receives and generates a massive volume of contracts. Each contract oftentimes follows a …

X AI KOLs Following ↗ · 2026-06-15 Cached

LlamaIndex introduces an Extract feature in LlamaParse for turning unstructured contract data into structured, machine-readable metadata using layout-aware parsing and LLMs, addressing challenges like non-standard templates and cross-references.

0 favorites 0 likes

#document-extraction

@tom_doerr: Converts images and PDFs to Markdown without OCR https://github.com/NanoNets/docext

X AI KOLs Timeline ↗ · 2026-05-08 Cached

docext is an on-premises toolkit that converts images and PDFs to markdown without OCR, leveraging vision-language models. It also introduces Nanonets-OCR-s, a compact 3B parameter model for efficient image-to-markdown conversion.

0 favorites 0 likes

#document-extraction

olmOCR: Unlocking Trillions of Tokens in PDFs with Vision Language Models

Papers with Code Trending ↗ · 2025-02-25 Cached

olmOCR is an open-source toolkit using a fine-tuned vision language model to extract clean text from PDFs while preserving structure, optimized for large-scale batch processing.

0 favorites 0 likes

document-extraction

Can Qwen3.6-35B-A3B on an RTX 3060 Replace Google Vision for Receipt-to-JSON Extraction?

@hasantoxr: Now turn messy documents into structured knowledge with one command. It's called Hyper-Extract. Most RAG tools just chu…

Beyond Logprobs: A Multi-Signal Confidence Engine for LLM-Based Document Field Extraction

Agentic Document Extraction

@jerryjliu0: Every enterprise organization receives and generates a massive volume of contracts. Each contract oftentimes follows a …

@tom_doerr: Converts images and PDFs to Markdown without OCR https://github.com/NanoNets/docext

olmOCR: Unlocking Trillions of Tokens in PDFs with Vision Language Models

Submit Feedback