Tag
Anthropic research reveals that AI agents struggle with biology databases, producing highly variable answers for the same query (e.g., Ebola sequence counts ranging from 5 to 106 vs. expected 266), but adding a repeatable retrieval tool significantly improves consistency and accuracy.