Tag
Weaviate launches Engram, a fully managed memory service for AI agents that actively maintains memory through reconciliation, deduplication, and scoped isolation, treating memory as infrastructure rather than data hoarding.
A transparent comparison of three open-source AI memory backends (Atomic Memory, Mem0, Zep) covering license, setup, provider support, and unique features like AUDN classification.
A developer shares an architecture using Neo4j knowledge graphs with typed entities and deduplication to solve the problem of AI agents forgetting entity identity across sessions, moving beyond flat files and vector stores.
Explains Cache-Augmented Generation (CAG) as a method to cache static knowledge directly in the model's KV memory, reducing latency and cost compared to traditional RAG, and shows how to combine both for optimal performance.
A detailed architectural guide for building long-running AI agents that handle changing user preferences over time by combining a vector store, graph DB, and temporal edges instead of overwriting data.
The author criticizes existing AI memory platforms for lacking multi-agent memory, poor long-term recall after many interactions, and no forgetting mechanism, and is building a new solution; asks the community for additional pain points.
turbovec is an open-source Rust vector index using Google Research's TurboQuant algorithm, achieving 16x compression and faster search than FAISS, with integrations for RAG frameworks like LangChain, LlamaIndex, and Haystack.
Introduces an open-source project with 30k stars on GitHub that achieves RAG through reasoning instead of vector databases, claiming higher accuracy and solving the problem of similarity not equating to relevance.
The author introduces an open-source MCP server running on Cloudflare Workers that provides persistent, searchable memory for AI clients like Claude, ChatGPT, and Cursor using vector embeddings and duplicate detection.
MemoryOS is an open-source, self-hosted AI agent memory tool using a temporal knowledge graph, achieving 86.2% accuracy on LongMemEval-s with fast 78ms retrieval speeds.
Introduces LlamaIndex, a mature Python open-source framework with 49k+ stars, designed to provide AI assistants with persistent memory and efficient RAG capabilities through vectorized storage and semantic search.
Caliby is an open-sourced embedded vector database co-developed by Sea-Land AI and MIT's Michael Stonebraker team, offering high-performance vector retrieval (4x faster than pgvector) with HNSW, DiskANN, and IVF+PQ indexes, designed specifically for AI Agent and RAG use cases with a simple pip install.
Oracle has released an AI Developer Hub repository on GitHub containing technical resources, reference applications, and Jupyter notebooks to help developers build AI applications, agents, and systems using Oracle AI Database and OCI services. The hub includes complete working examples such as agentic RAG systems, finance AI agents, and full-stack AI applications.
Anthropic introduces Contextual Retrieval, a technique combining contextual embeddings and BM25 to significantly improve RAG accuracy by reducing failed retrievals.
Zilliz releases Claude Context, an open-source MCP plugin that adds semantic code search to Claude Code and other AI coding agents, enabling cost-effective deep context from entire codebases via vector search.