@DSPyOSS: “In a way, Machine Learning asks how a system can improve from data when we have a precise objective to optimize. Machi…
Summary
A twitter thread by @DSPyOSS and Jacob X. Li contrasts Machine Learning (optimizing from data with a precise objective) with 'Machine Studying' (learning from a declarative corpus without a downstream task), highlighting the urgent need for AI systems to develop expertise from unstructured documents.
View Cached Full Text
Cached at: 06/18/26, 04:09 PM
“In a way, Machine Learning asks how a system can improve from data when we have a precise objective to optimize. Machine Studying asks what an agent should do when it’s given a declarative corpus and no downstream task.”
Jacob X. Li (@jacobli99): Continual learning is widely discussed right now, but mostly as improving on the job or avoiding catastrophic forgetting. But it has a different, difficult, and already urgent form:
Given nothing but a corpus of documents, how should AI systems develop expertise in a new,
Similar Articles
@DSPyOSS: a crisper operationalization of continual learning that matches problems that are inaccurately treated as "RAG" or "RL"…
Introduces 'Machine Studying' as a new formulation of continual learning where AI systems autonomously develop expertise from a corpus, and presents StudyBench for evaluation.
@jacobli99: Continual learning is widely discussed right now, but mostly as improving on the job or avoiding catastrophic forgettin…
Jacob Li introduces the concept of 'Machine Studying' as a distinct and urgent form of continual learning, where AI systems must develop expertise in a new domain from a corpus of documents alone.
@jacobli99: To compare procedures for machine studying, we start by defining expertise. The corpus is always available at test time…
Jacob Li introduces 'Machine Studying' as a new problem in continual learning: how AI systems develop expertise in an unfamiliar domain given only a corpus of documents, distinct from avoiding catastrophic forgetting.
@lateinteraction: Been extremely excited about this work by @jacobli99! We're disappointed in the current ways our agents develop experti…
A discussion on the challenge of enabling AI systems to develop deep expertise from documents, akin to humans learning from textbooks, highlighting a form of continual learning.
@lateinteraction: putting the link here for those that want to jump right into the long form: https://jacobxli.com/blog/2026/machine-stud…
Introduces 'Machine Studying' as a problem where AI agents must autonomously develop expertise from a corpus, beyond RAG or long-context, and presents the StudyBench benchmark for evaluation.