Tag
This paper identifies and analyzes 'location leakage' in LLMs, where geographic conditioning causes models to over-index on location metadata even in location-agnostic prompts, revealing structural conditioning effects beyond content.
A user shares how they used Codex and DeepSeek Flash to automate organizing a large ROM collection, including pulling cover art, assets, and metadata, and storing it in RomM.
This research explores methods to determine the source table and column for each result column in arbitrary SQLite queries, using SQLite's internal column metadata API accessed via Python's apsw library or a ctypes bridge, with applications for tools like Datasette.
This paper identifies 'vector search dilution' in RAG systems that scale to large, heterogeneous document collections; in one real-world deployment, accuracy dropped from 75% to 40%. The proposed MASDR-RAG method scopes retrieval to a domain using organizational metadata before the vector search runs, improving precision at 10 (P@10) from 0.77 to 0.86 with low cost and easy deployment.
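As a rough illustration of the idea in this summary, the sketch below narrows the candidate set by a metadata field before ranking by cosine similarity, then scores the result with precision at k. It is not the paper's MASDR-RAG implementation: the `Doc` type, the `domain` field, the toy two-dimensional vectors, and the function names are all assumptions made for illustration.

```python
import math
from dataclasses import dataclass


@dataclass(frozen=True)
class Doc:
    doc_id: str
    domain: str      # hypothetical organizational metadata field
    vector: tuple    # toy embedding; a real system would use an embedding model


def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def scoped_search(docs, query_vector, domain=None, k=10):
    """Filter by domain first (if given), then rank the survivors by similarity."""
    pool = [d for d in docs if domain is None or d.domain == domain]
    ranked = sorted(pool, key=lambda d: cosine(d.vector, query_vector), reverse=True)
    return [d.doc_id for d in ranked[:k]]


def precision_at_k(retrieved_ids, relevant_ids, k):
    """Fraction of the top-k retrieved ids that are relevant (P@k)."""
    top = retrieved_ids[:k]
    return sum(1 for doc_id in top if doc_id in relevant_ids) / k


docs = [
    Doc("hr-1", "hr", (1.0, 0.0)),
    Doc("hr-2", "hr", (0.9, 0.1)),
    Doc("legal-1", "legal", (1.0, 0.05)),  # similar vector from an unrelated domain
    Doc("legal-2", "legal", (0.0, 1.0)),
]
query = (1.0, 0.0)
relevant = {"hr-1", "hr-2"}

unscoped = scoped_search(docs, query, k=2)
scoped = scoped_search(docs, query, domain="hr", k=2)
```

In this toy data, the unscoped search lets a near-identical vector from another domain crowd out a relevant document, while the scoped search cannot return it at all. How a real system would choose the domain for a given query is outside what this sketch covers.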
mixedbread.ai introduces a Metadata explorer feature that allows users to browse metadata fields and values across their store, enabling enhanced filtering during search.
A detailed technical guide explaining Exif metadata structure, how it is stored in JPEG and other image formats, and practical considerations for developers working with image pixel data.
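For readers unfamiliar with the layout this guide covers: in a JPEG file, Exif data normally lives in an APP1 segment (marker `0xFFE1`) whose payload begins with `Exif\0\0`, followed by a TIFF-structured block. The following is a minimal sketch of locating that block with only the standard library. The function name and the synthetic sample bytes are made up for illustration, and the walker deliberately ignores edge cases such as fill bytes and Exif data split across multiple segments.

```python
import struct


def find_exif_segment(jpeg_bytes):
    """Return the TIFF payload of the first Exif APP1 segment, or None."""
    if jpeg_bytes[:2] != b"\xff\xd8":            # SOI marker must come first
        raise ValueError("not a JPEG stream")
    pos = 2
    while pos + 4 <= len(jpeg_bytes):
        if jpeg_bytes[pos] != 0xFF:
            break                                 # lost sync; give up in this sketch
        marker = jpeg_bytes[pos + 1]
        if marker in (0xDA, 0xD9):                # SOS or EOI: header segments are over
            break
        # The 2-byte big-endian length counts itself but not the marker bytes.
        (length,) = struct.unpack(">H", jpeg_bytes[pos + 2:pos + 4])
        payload = jpeg_bytes[pos + 4:pos + 2 + length]
        if marker == 0xE1 and payload.startswith(b"Exif\x00\x00"):
            return payload[6:]                    # strip the "Exif\0\0" identifier
        pos += 2 + length
    return None


# Synthetic stream: SOI, a JFIF-style APP0, an Exif APP1, then EOI.
tiff = b"II*\x00\x08\x00\x00\x00"                 # little-endian TIFF header
app0_payload = b"JFIF\x00"
app1_payload = b"Exif\x00\x00" + tiff
sample = (
    b"\xff\xd8"
    + b"\xff\xe0" + struct.pack(">H", len(app0_payload) + 2) + app0_payload
    + b"\xff\xe1" + struct.pack(">H", len(app1_payload) + 2) + app1_payload
    + b"\xff\xd9"
)
exif = find_exif_segment(sample)
```

The returned bytes start with the TIFF byte-order mark (`II` or `MM`), which is where parsing of the actual tag directories would begin.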
A developer rants about the poor design of custom attributes in .NET at the binary metadata level, explaining their storage and why they cause problems.
This extended paper revisits Semantic Web Services insights for Knowledge Graphs, proposing a four-dimensional formal framework and an Agentic Affordance Profile (AAP) to enable principled KG selection, composition, and failure diagnosis at agent planning time.
Supercut is a permission-aware AI tool that provides AI agents with access to recordings and metadata, aiming to streamline agentic workflows.