scientific-literature

#scientific-literature

PlantMarkerBench: A Multi-Species Benchmark for Evidence-Grounded Plant Marker Reasoning

Hugging Face Daily Papers ↗ · 2026-05-11 Cached

This paper introduces PlantMarkerBench, a multi-species benchmark for evaluating language models' ability to interpret evidence for plant marker genes from scientific literature across four species. It highlights that while frontier models perform well on direct evidence, they struggle with functional and indirect evidence types.

0 favorites 0 likes

#scientific-literature

Consensus accelerates research with GPT-5 and Responses API

OpenAI Blog ↗ · 2025-10-23 Cached

Consensus, a research assistant with 8 million users, has launched Scholar Agent—a multi-agent system built on GPT-5 and OpenAI's Responses API—that can synthesize peer-reviewed literature across 220 million papers in minutes. The system uses coordinated Planning, Search, Reading, and Analysis agents to mirror how human researchers work, reducing hallucinations and improving reliability over previous approaches.

0 favorites 0 likes

scientific-literature

PlantMarkerBench: A Multi-Species Benchmark for Evidence-Grounded Plant Marker Reasoning

Consensus accelerates research with GPT-5 and Responses API

Submit Feedback