Trends in AI and Human-AI Interaction in Clinical Trials -- A Hybrid Human-AI Exploration
Summary
This paper analyzes temporal and geographical trends in AI terminology in clinical trials from clinicaltrials.gov, using a hybrid human-AI approach with GPT-5.5. It finds increasing AI-related trials, with China and the US leading, and discusses the viability of hybrid screening for human-AI interaction classification.
View Cached Full Text
Cached at: 05/29/26, 09:13 AM
# Trends in AI and Human-AI Interaction in Clinical Trials -- A Hybrid Human-AI Exploration Source: [https://arxiv.org/abs/2605.29096](https://arxiv.org/abs/2605.29096) [View PDF](https://arxiv.org/pdf/2605.29096) > Abstract:This paper examines records retrieved from the[this http URL](http://clinicaltrials.gov/)registry to characterize temporal trends in AI terminology and the geographical distribution of AI trials\. The work also reports on an exploratory hybrid human\-AI approach to analyzing human\-AI interaction trends in registered clinical trials\. The hybrid workflow comprised a frontier generative AI model \(GPT\-5\.5\) and human review to screen and categorize records returned by an AI\-focused search\. The findings indicate a marked increase in AI\-related trials over time, with recent growth in references to machine learning, deep learning, chatbots, GPTs, and large language models\. Geographically, China and the United States accounted for the largest numbers of AI\-related trials, with notable recent increases in several other countries including Italy, France, Spain, the UK and Turkey \(Türkiye\)\. In a random sample of 100 records, human and AI classifiers showed good agreement in identifying studies not substantively using AI, but lower agreement in classifying human\-AI interaction, particularly where health professional interaction was ambiguous or insufficiently described\. Overall, the results suggest that hybrid human\-AI screening of clinical trial records is potentially viable, but clearer trial reporting and more precise interaction definitions will benefit the process\. ## Submission history From: Tim Collins \[[view email](https://arxiv.org/show-email/a622cc52/2605.29096)\] **\[v1\]**Wed, 27 May 2026 20:56:36 UTC \(472 KB\)
Similar Articles
Using AI to improve patient access to clinical trials
Paradigm leverages GPT-4's natural language understanding to dramatically improve patient screening for clinical trials, enabling evaluation of hundreds of patients per minute compared to manual review of ~50 per day, reducing clinician burden and improving patient access to treatments.
Imperfectly Cooperative Human-AI Interactions: Comparing the Impacts of Human and AI Attributes in Simulated and User Studies
This research paper investigates how human personality traits and AI design characteristics jointly impact human-AI interactions in imperfectly cooperative scenarios using both simulated datasets (2,000 simulations) and human subjects experiments (290 participants). The study finds significant divergences between simulation and real-world interactions, with AI transparency emerging as a critical factor in actual human-AI encounters.
AI-assisted Protocol Information Extraction For Improved Accuracy and Efficiency in Clinical Trial Workflows
Researchers from Banting Health AI present an AI system using generative LLMs with Retrieval-Augmented Generation (RAG) for automated clinical trial protocol information extraction, achieving 89% accuracy compared to 62.6% for standalone LLMs, with AI-assisted workflows completing tasks 40% faster and reducing cognitive demand.
Rehumanizing global health care with agentic AI
Healthcare providers are turning to agentic AI to automate complex tasks, reduce clinician burnout, and improve patient outcomes, as demonstrated by HSS's use of AI agents for insurance claims and patient triage.
Measuring What Matters: Benchmarking Generative, Multimodal, and Agentic AI in Healthcare
This paper presents a structured framework for benchmarking generative, multimodal, and agentic AI in healthcare, addressing the gap between high benchmark scores and real-world clinical reliability, safety, and relevance.