Machine learning prediction of obstructive coronary artery disease using opportunistic coronary calcium and epicardial fat assessments from CT calcium scoring scans

arXiv cs.LG 05/22/26, 04:00 AM Papers

machine-learning medical-imaging coronary-artery-disease calcium-scoring catboost shap healthcare-ai

Summary

This paper presents a machine learning framework using CatBoost and SHAP to predict obstructive coronary artery disease from CT calcium scoring scans, achieving high accuracy by combining calcium-omics and epicardial fat features.

arXiv:2605.21762v1 Announce Type: new Abstract: Non-contrast computed tomography calcium scoring (CTCS) is a cost-effective imaging modality widely used to detect coronary artery calcifications. This study aimed to develop an advanced machine learning framework that utilizes quantitative analyses of coronary calcium and epicardial fat from CTCS images to predict obstructive coronary artery disease (CAD). The study population consisted of 1,324 patients from the SCOT-HEART clinical trial who underwent both CTCS and coronary CT angiography. We extracted and analyzed a broad range of features, including 24 clinical variables, 189 calcium-omics, and 211 epicardial fat-omics features from the CTCS images. Feature selection was conducted using the CatBoost algorithm combined with SHapley Additive exPlanation (SHAP) values. Predictive modeling utilized the CatBoost gradient boosting method, focusing on the most informative features. From an initial set of 424 candidate features, 14 were identified as most predictive through the CatBoost-SHAP method. The top two predictive features originated from fat-omics, with the remaining 12 features derived from calcium-omics. The optimized model achieved robust predictive capabilities, demonstrating a sensitivity of 83.1+/-4.6%, specificity of 93.8+/-1.7%, accuracy of 85.3+/-2.0%, and an F1 score of 73.9+/-3.3%. Inclusion of calcium-omics and fat-omics data significantly improved predictive performance. Notably, the model also showed reliable predictive accuracy in patients with diverse coronary calcium scores, including cases with obstructive CAD despite a zero-calcium score. This innovative approach holds promise for improving clinical decision-making and potentially reducing dependence on contrast-enhanced or invasive diagnostic procedures, particularly within low-to intermediate-risk patient groups.

Original Article

View Cached Full Text

Cached at: 05/22/26, 08:51 AM

# Machine learning prediction of obstructive coronary artery disease using opportunistic coronary calcium and epicardial fat assessments from CT calcium scoring scans
Source: [https://arxiv.org/abs/2605.21762](https://arxiv.org/abs/2605.21762)
[View PDF](https://arxiv.org/pdf/2605.21762)

> Abstract:Non\-contrast computed tomography calcium scoring \(CTCS\) is a cost\-effective imaging modality widely used to detect coronary artery calcifications\. This study aimed to develop an advanced machine learning framework that utilizes quantitative analyses of coronary calcium and epicardial fat from CTCS images to predict obstructive coronary artery disease \(CAD\)\. The study population consisted of 1,324 patients from the SCOT\-HEART clinical trial who underwent both CTCS and coronary CT angiography\. We extracted and analyzed a broad range of features, including 24 clinical variables, 189 calcium\-omics, and 211 epicardial fat\-omics features from the CTCS images\. Feature selection was conducted using the CatBoost algorithm combined with SHapley Additive exPlanation \(SHAP\) values\. Predictive modeling utilized the CatBoost gradient boosting method, focusing on the most informative features\. From an initial set of 424 candidate features, 14 were identified as most predictive through the CatBoost\-SHAP method\. The top two predictive features originated from fat\-omics, with the remaining 12 features derived from calcium\-omics\. The optimized model achieved robust predictive capabilities, demonstrating a sensitivity of 83\.1\+/\-4\.6%, specificity of 93\.8\+/\-1\.7%, accuracy of 85\.3\+/\-2\.0%, and an F1 score of 73\.9\+/\-3\.3%\. Inclusion of calcium\-omics and fat\-omics data significantly improved predictive performance\. Notably, the model also showed reliable predictive accuracy in patients with diverse coronary calcium scores, including cases with obstructive CAD despite a zero\-calcium score\. This innovative approach holds promise for improving clinical decision\-making and potentially reducing dependence on contrast\-enhanced or invasive diagnostic procedures, particularly within low\-to intermediate\-risk patient groups\.

## Submission history

From: Juhwan Lee \[[view email](https://arxiv.org/show-email/cfe5b101/2605.21762)\] **\[v1\]**Wed, 20 May 2026 21:47:36 UTC \(2,484 KB\)

Machine learning prediction of obstructive coronary artery disease using opportunistic coronary calcium and epicardial fat assessments from CT calcium scoring scans

Similar Articles

Quantitative coronary calcification analysis for prediction of myocardial ischemia using non-contrast CT calcium scoring

LLMs for Cardiovascular Risk Prediction from Structured Clinical Data

Cross-Modal Contrastive Learning of ECG and Angiography Representations for Severe Stenosis Classification

Can AI help predict which heart-failure patients will worsen within a year?

Calibration, Uncertainty Communication, and Deployment Readiness in CKD Risk Prediction: A Framework Evaluation Study

Submit Feedback

Similar Articles

Quantitative coronary calcification analysis for prediction of myocardial ischemia using non-contrast CT calcium scoring

LLMs for Cardiovascular Risk Prediction from Structured Clinical Data

Cross-Modal Contrastive Learning of ECG and Angiography Representations for Severe Stenosis Classification

Can AI help predict which heart-failure patients will worsen within a year?

Calibration, Uncertainty Communication, and Deployment Readiness in CKD Risk Prediction: A Framework Evaluation Study