Pixel-Level Pavement Distress Assessment Using Instance Segmentation
Summary
This paper presents a vision-based pavement distress analysis system using Mask R-CNN instance segmentation, achieving high precision and recall for crack detection and quantification on a custom dataset.
View Cached Full Text
Cached at: 05/26/26, 06:42 AM
Paper page - Pixel-Level Pavement Distress Assessment Using Instance Segmentation
Source: https://huggingface.co/papers/2605.26095
Abstract
A vision-based pavement distress analysis system using Mask R-CNN instance segmentation demonstrates superior performance for crack detection and quantification compared to object detection approaches, achieving high precision and recall metrics on a custom field-collected dataset.
Automated pavement distress assessment requires more than image-level classification or coarse bounding box detection, demanding precise localization of thin, branching, and irregular cracks to achieve the geometric precision necessary for maintenance-relevant quantification. This paper presents a vision-based pavement distress analysis system based onMask R-CNNinstance segmentationand evaluates it on UWGB-StreetCrack, a custom field-collected roadway image dataset acquired with a vehicle-mounted smartphone and manually annotated withpolygon labelsfor longitudinal cracks, transverse cracks, alligator cracks, and potholes. FiveDetectron2-basedMask R-CNNbackbone variants were considered under a consistent fine-tuning protocol. The best-performing model,Mask R-CNNwith aResNet-101 FPNbackbone, achieved 84.23% precision, 90.04% recall, and anF1 scoreof 87.04% under the project-specificbounding-box matching protocol. The same model produced an aggregate predictedcrack-area fractionof 2.164%, closely matching the 2.170% ground-truthcrack-area fraction. To contextualize the segmentation system against a detector-oriented alternative, aCSPDarknet53-basedYOLOdetector was also adapted and retrained on the dataset, reaching 27.5% precision and 20.7% recall on the validation protocol. The results show thatinstance segmentationis a practical direction for field pavement imagery and aggregate crack-area estimation, while also exposing open challenges in annotation consistency, class imbalance, confounder rejection, and mask-level benchmarking.
View arXiv pageView PDFAdd to collection
Get this paper in your agent:
hf papers read 2605\.26095
Don’t have the latest CLI?curl \-LsSf https://hf\.co/cli/install\.sh \| bash
Models citing this paper0
No model linking this paper
Cite arxiv.org/abs/2605.26095 in a model README.md to link it from this page.
Datasets citing this paper0
No dataset linking this paper
Cite arxiv.org/abs/2605.26095 in a dataset README.md to link it from this page.
Spaces citing this paper0
No Space linking this paper
Cite arxiv.org/abs/2605.26095 in a Space README.md to link it from this page.
Collections including this paper0
No Collection including this paper
Add this paper to acollectionto link it from this page.
Similar Articles
Unification of Closed-Open Industrial Detection Scenarios: New Large-Scale Benchmarks,Challenges and Baselines
Introduces MMIOC-1M, a large-scale multi-modal benchmark for industrial defect detection, and proposes RTVPNet, a refined text-visual prompt network achieving state-of-the-art performance.
Toward Parking Spot Occupancy Recognition: A Self-Supervised Approach
This paper presents a self-supervised transfer learning approach for parking spot occupancy recognition that achieves high accuracy (up to 97.8%) with minimal labeled data using a two-stage training strategy with SimCLR and ResNet-50.
Semantic Feature Segmentation for Interpretable Predictive Maintenance in Complex Systems
This paper proposes a semantic feature segmentation framework for predictive maintenance that decomposes monitoring signals into canonical and residual components to improve interpretability while maintaining predictive performance.
Binary Road Surface Classification Using Machine Learning on Production Vehicle Signals During Cruising
This paper presents machine learning frameworks for binary classification of road surface conditions (grip vs. slip) using production vehicle signals during cruising, addressing the limitations of traditional friction estimation methods that fail under low-slip conditions.
Intelligent CCTV for Urban Design: AI-Based Analysis of Soft Infrastructure at Intersections
This academic paper introduces an AI-enabled analytics framework using existing CCTV infrastructure to evaluate the impact of soft traffic interventions on vehicle speed and safety at urban intersections.