Tag
A Hugging Face Space allows running SQL queries over 2.19 billion web pages from Common Crawl without downloading, using DuckDB to read directly from Hugging Face storage buckets.
Datasette Agent is a new extensible AI assistant for Datasette that lets users query their data conversationally and generate charts via plugins. It supports local models and cloud APIs like Gemini 3.1 Flash-Lite.
Pao announced the launch of Handinger, a managed cloud agent for automating business tasks such as email workflows, reporting, and data analysis.
The article analyzes the concept of 'model half-life' by compiling release dates of major AI models from frontier labs, finding that while release cadence has increased, the notion of a continuously halving release time is misleading. The author provides a TSV dataset and a prediction method.
A practical guide to six SQL patterns for detecting transaction fraud in financial data, including velocity checks, impossible travel detection, and other methods. The author shares real-world examples and tuning advice.
A compilation of 1,058 GTME job listings, 1,167 practitioners, and 867 hiring companies, sorted into eight archetypes showing how the role is evolving.
OpenAgents is an open platform for using and hosting language agents in everyday life, featuring agents for data analysis, plugins, and web browsing, with open code and a demo.
The author details the process of designing a custom query language tailored for non-technical analysts to filter vehicle maintenance data, outlining user needs, data schema, and specific use cases.
The article discusses analyzing OpenAI's open roles using ChatGPT to infer corporate strategy, concluding that the findings were not particularly new but the method worked well.
A survey analyzing over 300,000 web feeds on the top 500k sites reveals that while feeds remain prevalent, most are abandoned or low quality due to automatic CMS generation. The author used AI agents to process Common Crawl data and calls for better feed management practices.
This paper introduces AIDA, an autonomous agent framework designed to transform fragmented enterprise data into actionable business insights by leveraging reinforcement learning and a proprietary Domain-Specific Language for SQL execution.
Bruin is an AI data agent designed to collaborate with teams, likely assisting with data analysis and automation tasks.
The author demonstrates how to collaborate using Codex, HyperFrames, and Remotion tools to produce a Chinese educational video about declassified UFO files. Additionally, it introduces a Claude Code skills repository on GitHub that automates the organization and analysis of publicly declassified UAP/UFO government documents.
Skopx is a conversational AI analytics platform that lets users ask business questions in plain English, automatically generating insights from connected data sources without SQL. It provides transparent reasoning, role-based access, and integrates with existing tools.
The author uses an AI agent to analyze 8 years of his mother's hypertension records, identifying morning surges and drug interactions that were missed during brief hospital visits, highlighting AI's role in bridging gaps in chronic care continuity.
An analysis of prediction markets like Polymarket and Kalshi, examining whether their massive trading volume actually produces valuable forecasting information or merely serves as gambling, referencing historical academic support and current data.
ggsql is an alpha-release tool that brings grammar of graphics visualization capabilities to SQL, allowing users to create structured, modular visualizations using SQL syntax across Quarto, Jupyter, Positron, and VS Code.
Machine Learning at Berkeley collaborated with LatchBio to benchmark their AI agent's performance on spatial transcriptomics workflows, evaluating its ability to automate complex bioinformatics tasks.
OpenAI Academy publishes a guide on using ChatGPT for data analysis, enabling users to upload files and ask natural language questions to explore, clean, and visualize data without requiring formula or dashboard expertise.
OpenAI has developed an internal research assistant that combines dashboards with a conversational GPT-5 interface to help teams analyze millions of support tickets and generate insights in minutes instead of weeks. The tool democratizes data analysis across teams, allowing non-technical users to ask questions in plain language and get actionable reports on product feedback, customer sentiment, and trends.