Experiments in Agentic AI for Science

arXiv cs.AI 05/27/26, 04:00 AM Papers

Summary

This paper presents two agentic AI frameworks, DeepTS/DeepCollector and DeepScribe, that automate scientific workflows including time-series data curation and conversion of physics lectures into structured reports, using a hybrid local-cloud architecture with LLMs.

arXiv:2605.26305v1 Announce Type: new Abstract: This paper details two novel frameworks for developing autonomous, agentic AI in scientific workflows. Both systems leverage a hybrid Local Body, Remote Brain architecture via Google Colab, utilizing Python-based local orchestrators to invoke large language model (LLM) cloud backends. The first agent, DeepTS/DeepCollector, automates the large-scale curation, extraction, and deduplication of time-series datasets. The second, DeepScribe, is an autonomous presentation analyzer that converts visually dense, mathematically complex physics lectures into structured scientific reports. Through practical systems engineering-such as granular attribute extraction (Cellular RAG), remote data inspection, and distributed concurrency controls-we demonstrate how agentic AI can overcome the context and reasoning limitations of current state-of-the-art systems to rigorously support scientific workflows. Finally, we outline a generalization of DeepTS to support deep knowledge graphs and discuss the application of this conceptual approach to high-energy physics (DeepQCD).

Original Article

View Cached Full Text

Cached at: 05/27/26, 09:03 AM

# Experiments in Agentic AI for Science
Source: [https://arxiv.org/abs/2605.26305](https://arxiv.org/abs/2605.26305)
[View PDF](https://arxiv.org/pdf/2605.26305)

> Abstract:This paper details two novel frameworks for developing autonomous, agentic AI in scientific workflows\. Both systems leverage a hybrid Local Body, Remote Brain architecture via Google Colab, utilizing Python\-based local orchestrators to invoke large language model \(LLM\) cloud backends\. The first agent, DeepTS/DeepCollector, automates the large\-scale curation, extraction, and deduplication of time\-series datasets\. The second, DeepScribe, is an autonomous presentation analyzer that converts visually dense, mathematically complex physics lectures into structured scientific reports\. Through practical systems engineering\-such as granular attribute extraction \(Cellular RAG\), remote data inspection, and distributed concurrency controls\-we demonstrate how agentic AI can overcome the context and reasoning limitations of current state\-of\-the\-art systems to rigorously support scientific workflows\. Finally, we outline a generalization of DeepTS to support deep knowledge graphs and discuss the application of this conceptual approach to high\-energy physics \(DeepQCD\)\.

## Submission history

From: Geoffrey Fox \[[view email](https://arxiv.org/show-email/85017892/2605.26305)\] **\[v1\]**Mon, 25 May 2026 19:57:57 UTC \(1,028 KB\)

Experiments in Agentic AI for Science

Similar Articles

Most “agentic AI” conversations feel too abstract. Here is how my agentic research system looks like

Neurodata Without Boredom: Benchmarking Agentic AI for Data Reuse

@dair_ai: https://x.com/dair_ai/status/2061104052818108476

AutoSci: A Memory-Centric Agentic System for the Full Scientific Research Lifecycle

Position: Agentic AI System Is a Foreseeable Pathway to AGI

Submit Feedback

Similar Articles

Most “agentic AI” conversations feel too abstract. Here is how my agentic research system looks like

Neurodata Without Boredom: Benchmarking Agentic AI for Data Reuse

@dair_ai: https://x.com/dair_ai/status/2061104052818108476

AutoSci: A Memory-Centric Agentic System for the Full Scientific Research Lifecycle

Position: Agentic AI System Is a Foreseeable Pathway to AGI