Decision-Driven Geosteering Under Uncertainty: A Unified Framework for Sequential Decision Optimization
Summary
Presents an uncertainty-aware geosteering framework integrating particle filtering for probabilistic subsurface interpretation with reinforcement learning for sequential decision-making, evaluated on an industrial simulator.
Source: [https://arxiv.org/abs/2606.17331](https://arxiv.org/abs/2606.17331) [View PDF](https://arxiv.org/pdf/2606.17331)

> Abstract: Geosteering requires navigating a well trajectory through an unknown geological configuration, while sequentially updating decisions based on indirect measurements acquired during drilling. This work presents an uncertainty-aware geosteering framework that tightly integrates particle filtering for probabilistic subsurface interpretation with value-based reinforcement learning for sequential decision-making. Geological uncertainty ahead of the drill bit is represented explicitly through a particle filter (PF), enabling belief-informed control rather than deterministic trajectory correction. The framework couples PF belief updates with belief-informed decision policies and evaluates three decision-making options that operate under identical uncertainty representations: an interpretable Approximate Dynamic Programming (ADP) scheme, a Deep Q-learning baseline, and a Dual Deep Reinforcement Learning (Dual DRL) architecture trained with a target Q-network scheme for stability, using a dueling (value/advantage) decomposition for Q-value parameterization. Beyond final placement performance, we assess policy behavior using stability-oriented metrics that quantify steering smoothness over time, providing additional operational insight into how decision policies respond as uncertainty evolves. The framework is integrated with an API for validation within an industrial geosteering simulator under realistic measurement noise and drilling constraints. Using identical geological realizations, operational limits, and reward definitions across methods, the experiments provide a controlled and high-fidelity evaluation of how alternative decision policies behave throughout the drilling process, rather than evaluating performance solely from the final well trajectory.

## Submission history

From: Hibat Errahmen Djecta

**[v1]** Mon, 15 Jun 2026 22:22:13 UTC (12,968 KB)
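
The abstract names two building blocks: a particle-filter belief update and a dueling (value/advantage) decomposition of Q-values. The sketch below is a generic textbook illustration of both, not the paper's implementation. The function names `pf_update` and `dueling_q`, the scalar "boundary depth" state, the Gaussian measurement model, and the resampling threshold are all assumptions made here for illustration.

```python
import numpy as np


def pf_update(particles, weights, observation, noise_std, rng):
    """One bootstrap particle-filter step: reweight, then resample if needed.

    particles   : (N,) array of hypothetical boundary depths (assumed state)
    weights     : (N,) array of normalized, strictly positive weights
    observation : scalar noisy measurement of the depth (assumed model)
    """
    # Gaussian log-likelihood of the observation under each particle.
    log_lik = -0.5 * ((observation - particles) / noise_std) ** 2
    log_w = np.log(weights) + log_lik
    log_w -= log_w.max()  # shift for numerical stability before exp
    w = np.exp(log_w)
    w /= w.sum()

    # Multinomial resampling when the effective sample size drops below N/2
    # (a common heuristic, chosen here as an assumption).
    n = len(particles)
    ess = 1.0 / np.sum(w ** 2)
    if ess < n / 2:
        idx = rng.choice(n, size=n, p=w)
        particles = particles[idx]
        w = np.full(n, 1.0 / n)
    return particles, w


def dueling_q(value, advantages):
    """Dueling aggregation: Q(s, a) = V(s) + A(s, a) - mean over a of A(s, a)."""
    advantages = np.asarray(advantages, dtype=float)
    return value + advantages - advantages.mean()


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    particles = rng.normal(0.0, 5.0, size=500)  # broad prior belief
    weights = np.full(500, 1.0 / 500)
    particles, weights = pf_update(particles, weights, 2.0, 1.0, rng)
    print("posterior mean depth:", np.sum(particles * weights))
    print("Q-values:", dueling_q(1.0, [1.0, 2.0, 3.0]))
```

Subtracting the mean advantage makes the split between the value and advantage terms identifiable, so the mean of the resulting Q-values equals the state value. In a belief-informed policy, summary statistics of the particle set (for example its weighted mean and spread) would plausibly form part of the network input, though the abstract does not specify the exact encoding.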
Similar Articles
Optimizing Lithium Production Decisions under Geological, Demand, and Pricing Uncertainties: A POMDP Framework for Multi-Objective Decision Making
This paper proposes a POMDP framework for multi-objective decision making in lithium production, addressing geological, demand, and pricing uncertainties to optimize mine opening and extraction method selection. The approach outperforms human-inspired heuristics by dynamically adapting to shifting price regimes through belief state planning.
Uncertainty-Aware and Temporally Regulated Expert Advice in Reinforcement Learning for Autonomous Driving
This paper proposes an uncertainty-aware reinforcement learning framework for autonomous driving that uses expert advice guided by adaptive uncertainty thresholds and a commitment-cooldown strategy to improve safety and efficiency. Experiments in the CARLA simulator show a 5-7% success improvement over the IQN baseline.
Generative Auto-Bidding with Unified Modeling and Exploration
This paper introduces Guide, a framework that combines a Decision Transformer with Q-value guidance and an inverse dynamics module to balance exploration and safety in automated bidding for digital advertising, demonstrating effectiveness on public datasets and simulated auctions.
Exploring Agentic Tool-Calling Decisions via Uncertainty-Aligned Reinforcement Learning
This paper proposes TRUST, a method that integrates uncertainty quantification into reinforcement learning reward design to improve LLM agents' tool-calling decisions, enhancing decision quality and maintaining reliable uncertainty estimates.
Uncertainty-Aware LLM-Guided Policy Shaping for Sparse-Reward Reinforcement Learning
Proposes ULPS, a framework integrating a calibrated LLM into RL training with uncertainty-modulated guidance and A*-based symbolic trajectories, achieving improved success rate and sample efficiency on MiniGrid-UnlockPickup.