Tag
The paper introduces GeoNatureAgent Benchmark, the first benchmark for evaluating LLM agents on environmental geospatial analysis tasks via structured tool calls. It evaluates seven models on 93 tasks across 18 categories and finds Claude Sonnet 4 achieves highest accuracy at 60.8%, while open-weight models like DeepSeek V3.2 offer strong cost-performance tradeoffs.
A study evaluating the Prithvi-EO-2.0 foundation model for satellite-based flood mapping across 19 diverse global flood events, finding that detection accuracy is jointly governed by land cover and flood type.
Niantic Spatial's Visual Positioning System (VPS) has been deemed 'Awardable' on the U.S. Department of War's Tradewinds Solutions Marketplace, enabling government agencies to procure the GPS-independent, deep learning-based localization technology for mission-critical environments.
This paper evaluates whether geospatial foundation model embeddings like Prithvi-EO improve cross-country crop yield prediction in Sub-Saharan Africa compared to traditional Sentinel-2 features. The study finds that frozen embeddings do not significantly outperform spectral medians under rigorous Leave-One-Country-Out validation, suggesting country-level distribution shift is the primary bottleneck rather than feature representation quality.
Analyzes the 64-D embedding manifold of Google AlphaEarth across 12.1M U.S. samples, shows non-Euclidean structure and poor vector arithmetic, then builds an agentic system with geometry-aware tools that outperforms parametric baselines on environmental queries.
Google DeepMind introduces AlphaEarth Foundations, an AI model that integrates petabytes of Earth observation data into unified embeddings to map and monitor the planet at 10x10 meter resolution. The model's compact representations enable efficient planetary-scale analysis for applications in food security, deforestation tracking, and environmental monitoring.