Tag
chDB is a complete ClickHouse OLAP engine compiled to WebAssembly, enabling SQL queries directly in the browser via a SQL shell at wasm.chdb.io.
GreptimeDB improved distributed query performance by pushing remote dynamic filters down to datanode scans at runtime and by running the optimizer before MergeScan wraps remote plans, so that filters reach the data. JSON v2 columns now support type hints.
A blog post discussing how performance improvements that fail to cross the 10-second attention threshold may not yield user experience benefits, using examples from database query optimization and process automation.
A GitHub project open-sources a database of 1,324 fitness exercises, each with images, GIF animations, bilingual instructions, target muscles, and equipment type. It ships with a locally runnable webpage and developer tools, and suits uses such as fitness app prototypes and AI motion recognition.
A project to build a faithful implementation of the 1976 MUMPS standard, including a parser, interpreter, and integrated hierarchical database, to study the original architecture and run historical software from the era.
The TiDB Cloud team built drive9.ai, a cloud drive for AI agents based on TiDB. It aims to give agents a queryable, programmable, and governable file system layer, addressing engineering pain points in AI scenarios such as file and metadata consistency, object storage latency, and file understanding.
pg_plan_advice is a PostgreSQL module that allows users to influence and stabilize query plan choices by specifying plan advice using a mini-language. It helps in controlling join order, scan methods, and other planner decisions, but overriding planner defaults can backfire if data distribution changes.
This paper introduces logical physical clocks and methods for consistent snapshots in globally distributed databases, addressing causality and consistency challenges.
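For readers who want a feel for the mechanism, here is a minimal sketch of a hybrid logical clock, following the update rules as they are commonly described for this kind of clock. The class name, method names, and the injected `physical_time` callable are illustrative assumptions, not an API from the paper.

```python
from dataclasses import dataclass
from typing import Callable, Tuple

Timestamp = Tuple[int, int]  # (l, c): logical-physical part, then a counter


@dataclass
class HybridLogicalClock:
    # Hypothetical injection point for the node's physical clock reading.
    physical_time: Callable[[], int]
    l: int = 0  # largest physical time seen so far
    c: int = 0  # counter that orders events sharing the same l

    def now(self) -> Timestamp:
        """Timestamp a local or send event."""
        prev_l = self.l
        self.l = max(prev_l, self.physical_time())
        # If l did not advance, bump the counter; otherwise reset it.
        self.c = self.c + 1 if self.l == prev_l else 0
        return (self.l, self.c)

    def update(self, msg: Timestamp) -> Timestamp:
        """Timestamp a receive event, merging the sender's timestamp."""
        msg_l, msg_c = msg
        prev_l = self.l
        self.l = max(prev_l, msg_l, self.physical_time())
        if self.l == prev_l and self.l == msg_l:
            self.c = max(self.c, msg_c) + 1
        elif self.l == prev_l:
            self.c += 1
        elif self.l == msg_l:
            self.c = msg_c + 1
        else:
            self.c = 0
        return (self.l, self.c)


# Two nodes with skewed (here: frozen) physical clocks.
node_a = HybridLogicalClock(physical_time=lambda: 10)
node_b = HybridLogicalClock(physical_time=lambda: 5)

sent = node_a.now()             # event on A
received = node_b.update(sent)  # B receives A's message
later = node_b.now()            # a later local event on B
print(sent, received, later)
```

Even though node B's physical clock lags behind node A's, the timestamps compare in causal order when treated as tuples.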
This article from SQLite's official documentation explains the various ways an SQLite database can become corrupted, such as file overwrites by rogue processes, misuse of file descriptors, and unsafe backup procedures, along with mitigation strategies.
This article demonstrates that pre-sorting random data before inserting into SQLite can improve insert performance by 2-3x, by leveraging the B+ Tree's ordered nature and reducing page splits.
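A rough way to try this locally is sketched below, assuming an in-memory database, an `INTEGER PRIMARY KEY` table, and a single transaction per batch. The table name `kv`, the helper `timed_insert`, and the batch size are made up for illustration, and the article's own benchmark setup may differ. Timings vary by machine, and an on-disk database may show a different gap than an in-memory one.

```python
import random
import sqlite3
import time


def timed_insert(keys):
    """Insert keys into a fresh in-memory table; return (seconds, row_count)."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE kv (k INTEGER PRIMARY KEY, v TEXT)")
    start = time.perf_counter()
    with conn:  # wrap the whole batch in one transaction
        conn.executemany(
            "INSERT INTO kv (k, v) VALUES (?, ?)",
            ((k, "x") for k in keys),
        )
    elapsed = time.perf_counter() - start
    count = conn.execute("SELECT COUNT(*) FROM kv").fetchone()[0]
    conn.close()
    return elapsed, count


random.seed(0)
keys = random.sample(range(10_000_000), 100_000)  # unique keys in random order

random_time, random_count = timed_insert(keys)
sorted_time, sorted_count = timed_insert(sorted(keys))  # same keys, pre-sorted

print(f"random order: {random_time:.3f}s for {random_count} rows")
print(f"sorted order: {sorted_time:.3f}s for {sorted_count} rows")
```

With sorted keys, each new row lands at the right-hand edge of the tree, so pages fill in order instead of being split in the middle.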
A detailed technical guide explaining how PgBouncer works as a PostgreSQL connection pooler, covering its pooling modes, production deployment, and common pitfalls.
A detailed guide on using PostgreSQL as a single database to handle all aspects of a financial application, including schema design, state machines, triggers, and performance optimization.
QuestDB introduces a dedicated WINDOW JOIN operator that is parallelized and vectorized, achieving up to 25x speedup over alternative databases for time-series aggregations around event timestamps.
Cross Canon offers a Bible as a RAG database, allowing users to query scripture using retrieval-augmented generation techniques.
Simon Willison created a tool to convert Mozilla's browser compatibility data into a downloadable SQLite database, hosted with open CORS headers for easy exploration via Datasette Lite.
WebBase-III is a full recreation of dBASE III that runs in the browser, featuring its own W3Script interpreter, BROWSE grid, form engine, indexing, and persistent storage via SQLite.
The paper presents simdjson, the first validating JSON parser capable of processing gigabytes per second on a single core using SIMD instructions, achieving substantial speedups over existing parsers like RapidJSON.
Fluree DB is an open-source, temporal graph database with git-like branching, integrated vector/text/geo search, fine-grained access control, and support for SPARQL, JSON-LD, and Open Cypher. It is optimized for AI agent memory and achieves high performance on billion-scale graphs.
The Datasette 1.0a35 alpha release introduces a new create table interface and an alter table JSON API, along with stable template context documentation.
Discusses how British Columbia's permanent switch to Pacific Daylight Time in 2026 affects PostgreSQL timestamp storage, and provides best practices to avoid time zone shift errors using a dual-column pattern.
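The summary does not spell out the dual-column pattern, so the sketch below assumes one common reading: store the local wall-clock time the user meant alongside the zone name, and derive UTC from the current rules when needed. The rule tables, the `to_utc` helper, and the sample row are hypothetical stand-ins for a real time zone database and a real table.

```python
from datetime import datetime, timedelta, timezone

# Hypothetical UTC offsets for a January date under two rule sets.
OLD_RULES = {"America/Vancouver": timedelta(hours=-8)}  # standard time in winter
NEW_RULES = {"America/Vancouver": timedelta(hours=-7)}  # permanent daylight time


def to_utc(local_wall_time, zone_name, rules):
    """Derive the UTC instant from the stored wall time and zone name."""
    offset = rules[zone_name]
    return (local_wall_time - offset).replace(tzinfo=timezone.utc)


# The two stored columns: what the user meant, and where.
row = {"local_time": datetime(2027, 1, 15, 9, 0), "zone": "America/Vancouver"}

utc_old = to_utc(row["local_time"], row["zone"], OLD_RULES)
utc_new = to_utc(row["local_time"], row["zone"], NEW_RULES)

print("under old rules:", utc_old.isoformat())
print("under new rules:", utc_new.isoformat())
```

A future 9:00 appointment stays at 9:00 local time because the wall time and zone are the source of truth. An instant precomputed under the old rules would drift by an hour once the rules change.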