structured-extraction

#structured-extraction

I made a small local model (llama3.2 3B) reliably extract structured JSON from documents - the hard part wasn't the model, it was everything around it

Reddit r/AI_Agents ↗ · 6d ago

A developer shares lessons from building a local document-to-JSON extractor using llama3.2 3B on Ollama, highlighting that deterministic post-processing and schema-constrained outputs matter more than model size, while seeking feedback on hallucination and context truncation issues with long documents.

0 favorites 0 likes

#structured-extraction

@Michaelzsguo: Today I upgraded my Hermes agents with TencentDB Agent Memory. I did not connect it to a cloud LLM. Instead, I wired it…

X AI KOLs Timeline ↗ · 2026-05-24 Cached

The author upgraded their Hermes agents with TencentDB Agent Memory, using a local Qwen 3.5-4B model via llama-server for structured JSON extraction and multi-step tool use, implementing a resilient layered memory pipeline with cursor-based checkpointing.

0 favorites 0 likes

#structured-extraction

NuExtract3 released: open-weight 4B VLM for Markdown, OCR and structured extraction (self-hostable) [P]

Reddit r/MachineLearning ↗ · 2026-05-22

Numind released NuExtract3, a 4B open-weight vision-language model based on Qwen3.5-4B, designed for converting document images to Markdown, OCR, and structured data extraction. It is Apache-2.0 licensed and self-hostable with quantized versions for low VRAM.

0 favorites 0 likes

#structured-extraction

numind/NuExtract3

Hugging Face Models Trending ↗ · 2026-04-29 Cached

NuExtract3 is a 4B vision-language reasoning model for document understanding, enabling structured extraction and image-to-Markdown conversion.

0 favorites 0 likes

structured-extraction

I made a small local model (llama3.2 3B) reliably extract structured JSON from documents - the hard part wasn't the model, it was everything around it

@Michaelzsguo: Today I upgraded my Hermes agents with TencentDB Agent Memory. I did not connect it to a cloud LLM. Instead, I wired it…

NuExtract3 released: open-weight 4B VLM for Markdown, OCR and structured extraction (self-hostable) [P]

numind/NuExtract3

Submit Feedback