rag

#rag

Mistral OCR 4

Hacker News Top ↗ · 11h ago Cached

Mistral AI releases Mistral OCR 4, a compact document intelligence model that provides bounding boxes, block classification, and inline confidence scores for structured text extraction. It supports 170 languages, runs in a single container for self-hosted deployment, and integrates with the Mistral Search Toolkit for enterprise search and RAG pipelines.

0 favorites 0 likes

#rag

@aikangarooking: https://x.com/aikangarooking/status/2069325659105861926

X AI KOLs Timeline ↗ · 17h ago Cached

Introduces SAG (SQL-Retrieval Augmented Generation), a novel retrieval-augmented generation architecture based on SQL dynamic hyperedges. It is more efficient and lower cost for multi-hop reasoning compared to traditional RAG and GraphRAG. It is open-sourced on GitHub and has achieved good evaluation results.

0 favorites 0 likes

#rag

The AI agent demo always passes. Then it hits production and you realize "it works" was never the hard part.

Reddit r/AI_Agents ↗ · yesterday

This article discusses how AI agent demos often succeed while production deployment reveals critical security and authorization issues, emphasizing that model quality does not solve problems like access control, data leaks, and auditability.

0 favorites 0 likes

#rag

@antirez: I was thinking about Vector Sets and the Redis approach to this stuff in general. Now that the hype with RAG is gone, I…

X AI KOLs Timeline ↗ · yesterday

Salvatore Sanfilippo reflects on his earlier prediction that RAG would fade while raw vector search remains valuable, now that the RAG hype has subsided.

0 favorites 0 likes

#rag

@yyyole: Agent memory entrepreneurship is booming! Many teams are working on it, with several mainstream approaches: The first, most brute-force: context expansion. The second, most common: RAG/vector database route, embedding historical content into a vector store for retrieval. The third is product-oriented: memory API route, packaging memory as an API for direct calls. Each approach has its clear pros and cons!

X AI KOLs Timeline ↗ · yesterday Cached

Introduces several mainstream approaches to Agent memory entrepreneurship, recommends the EverMind team's open-source project EverOS, which provides a Markdown-sourced local memory OS supporting dual-track memory, multimodal ingestion, and self-evolution capabilities.

0 favorites 0 likes

#rag

@LTChives: Web scraping is dead. This PixelRAG in the video completely bypasses HTML parsing. It takes a screenshot of the webpage and then lets the vision model read answers from the pixels. Previously, AI reading a webpage meant first parsing the code, extracting text, and splitting paragraphs. Now it just looks at the page. 100% open source, plus it comes with Claude Code…

X AI KOLs Timeline ↗ · yesterday Cached

PixelRAG is a novel open-source tool that bypasses traditional HTML parsing by directly taking screenshots of webpages and using vision models to extract answers from the pixels. It also supports the Claude Code plugin, giving Claude visual capabilities.

0 favorites 0 likes

#rag

@BlockInsight214: Before feeding papers, contracts, or scanned documents to AI, the hardest step is often "cleaning up the PDF." These open-source projects specialize in that: converting to Markdown/JSON, ready for RAG or agents. ① MarkItDown · Microsoft, Office/PDF/images to Markdown in one click...

X AI KOLs Timeline ↗ · yesterday Cached

Introduces five open-source tools (MarkItDown, MinerU, Docling, marker, surya) that convert PDFs, Office documents, etc., into Markdown or JSON for direct use with RAG or AI agents.

0 favorites 0 likes

#rag

@Pluvio9yte: Tencent quietly open-sourced an enterprise-level knowledge platform, WeKnora. Its core concept is: turning a pile of raw documents into truly usable, inferable, and self-growing knowledge assets. It mainly does three things: RAG intelligent Q&A — fast and accurate ordinary semantic retrieval with support for hybrid search; ReAct autonomous A…

X AI KOLs Timeline ↗ · yesterday Cached

Tencent open-sourced the enterprise knowledge platform WeKnora, featuring three key capabilities: RAG intelligent Q&A, ReAct autonomous Agent, and self-maintaining Wiki + knowledge graph, transforming raw documents into inferable and growing knowledge assets.

0 favorites 0 likes

#rag

@XiaohuiAI666: Your RAG implementation is wrong! Traditional chunks lack knowledge boundaries, version information, and metadata, leading to missing retrieval context, version mixing, and difficult permission control. The author proposes a new method that replaces chunks with IdeaBlocks (Question-Answer + governance fields), achieving structured knowledge units. Without changing the retrieval algorithm,…

X AI KOLs Timeline ↗ · 2d ago Cached

The author proposes replacing traditional chunks with IdeaBlocks (Question-Answer + governance fields) to improve RAG knowledge units. The Blockify tool has been open-sourced, which can reduce corpus size by 40x, tokens by 3x, and increase relevance by 2.3x.

0 favorites 0 likes

#rag

@DeRonin_: THIS IS HOW YOU WIN AS AN AI ENGINEER IN 2026: > ship one real app per month, ugly counts > master 4 things cold: promp…

X AI KOLs Following ↗ · 2d ago Cached

A tweet from @DeRonin_ provides advice for AI engineers in 2026, emphasizing shipping real apps, mastering core skills, using cheap models, deploying widely, open-sourcing projects, and focusing on a single career lane.

0 favorites 0 likes

#rag

Effective use-cases for LLMs

Lobsters Hottest ↗ · 2d ago Cached

This article shares practical, real-world use cases for LLMs in software engineering, including searching through customer conversations via RAG, triaging API failures from logs, and shortening content. It emphasizes efficiency gains and reducing manual sifting.

0 favorites 0 likes

#rag

@akshay_pachaar: Web scraping will never be the same. (100% open-source visual search at scale) PixelRAG is a retrieval system that skip…

X AI KOLs Following ↗ · 3d ago Cached

PixelRAG is an open-source retrieval system that bypasses HTML parsing by screenshotting web pages and using vision-language models to read answers directly from pixels, claiming significant accuracy improvements over text-based RAG.

0 favorites 0 likes

#rag

@_avichawla: 8 RAG architectures for AI Engineers: (explained with usage) 1) Naive RAG - Retrieves documents purely based on vector …

X AI KOLs Timeline ↗ · 3d ago Cached

A tweet thread explaining 8 different RAG architectures (Naive, Multimodal, HyDE, Corrective, Graph, Hybrid, Adaptive, Agentic) with their use cases, and hinting at an improved indexing technique.

0 favorites 0 likes

#rag

@AYi_AInotes: Wow, Alibaba has directly open-sourced the vector database it has been using internally for years. The capability that Pinecone charges $70/month for, you can get for free with a single pip command. Billion-level vector recall in milliseconds without needing a separate service. From now on, those doing RAG and AI search no longer need to pay Pinecone $70 each month! The vector database that Alibaba has been running internally for years is open-sourced...

X AI KOLs Timeline ↗ · 4d ago Cached

Alibaba has open-sourced Zvec, a vector database used internally for years. It supports billion-scale vector retrieval in milliseconds, can be embedded into application processes without a separate service, and is completely free. It serves as a replacement for paid services like Pinecone.

0 favorites 0 likes

#rag

We built a persistent agent memory layer on Elasticsearch with 0.89 recall

Hacker News Top ↗ · 5d ago Cached

Elasticsearch blog post describes building a persistent agent memory layer with three memory types (episodic, semantic, procedural), achieving 0.89 recall on a QA eval with zero tenant leaks using hybrid recall and DLS isolation.

0 favorites 0 likes

#rag

@sairahul1: https://x.com/sairahul1/status/2067540315620405543

X AI KOLs Timeline ↗ · 5d ago Cached

A thread explaining six essential AI concepts (tokens, embeddings, vector search, etc.) for building production-ready AI systems, emphasizing that understanding them prevents costly failures like runaway API costs.

0 favorites 0 likes

#rag

@DanKornas: RAG Playlist What you will learn: - Retrieval Augmented Generation - Document Loaders in LangChain - Text Splitters in …

X AI KOLs Timeline ↗ · 5d ago Cached

Tweet announces a RAG Playlist covering topics from basic RAG to advanced techniques like CRAG and Self-RAG using LangChain and LangGraph, with a link in the comments.

0 favorites 0 likes

#rag

@QingQ77: Describe requirements in natural language, and the AI Agent automatically breaks down steps, calls tools to complete development, file operations, browser control, and other tasks, while also providing a full-fledged editor and terminal. https://github.com/Liuchun-oss/codelf-agent… Codelf is…

X AI KOLs Timeline ↗ · 5d ago Cached

Codelf is an open-source desktop AI assistant that lets you describe requirements in natural language. It automatically breaks down steps and calls tools to handle development, file operations, browser control, and more, all while providing a complete editor and terminal. It supports models like DeepSeek, Claude, and ChatGPT, works well on domestic networks, and includes local RAG knowledge base capabilities.

0 favorites 0 likes

#rag

@amitiitbhu: Agentic RAG Explained Learn here: https://youtube.com/watch?v=6nSegpuWJVw…

X AI KOLs Timeline ↗ · 5d ago Cached

Agentic RAG uses AI agents to drive the retrieval process in a loop, enabling multi-step reasoning, automatic data source selection, and query optimization, overcoming the limitations of standard RAG in handling multi-hop questions, ambiguous queries, and multiple data sources.

0 favorites 0 likes

#rag

@jerryjliu0: Agentic search has moved from fixed RAG pipelines into flexible agent harnesses with access to a set of search tools: k…

X AI KOLs Following ↗ · 6d ago Cached

LlamaIndex introduces agentic retrieval with LlamaParse Index, combining semantic search and grep for flexible agent harnesses. A webinar on June 30th will demonstrate these tools.

0 favorites 0 likes

rag

Submit Feedback