Machine learning prediction of obstructive coronary artery disease using opportunistic coronary calcium and epicardial fat assessments from CT calcium scoring scans

arXiv cs.LG Papers

Summary

This paper presents a machine learning framework using CatBoost and SHAP to predict obstructive coronary artery disease from CT calcium scoring scans, achieving high accuracy by combining calcium-omics and epicardial fat features.

arXiv:2605.21762v1 Announce Type: new Abstract: Non-contrast computed tomography calcium scoring (CTCS) is a cost-effective imaging modality widely used to detect coronary artery calcifications. This study aimed to develop an advanced machine learning framework that utilizes quantitative analyses of coronary calcium and epicardial fat from CTCS images to predict obstructive coronary artery disease (CAD). The study population consisted of 1,324 patients from the SCOT-HEART clinical trial who underwent both CTCS and coronary CT angiography. We extracted and analyzed a broad range of features, including 24 clinical variables, 189 calcium-omics, and 211 epicardial fat-omics features from the CTCS images. Feature selection was conducted using the CatBoost algorithm combined with SHapley Additive exPlanation (SHAP) values. Predictive modeling utilized the CatBoost gradient boosting method, focusing on the most informative features. From an initial set of 424 candidate features, 14 were identified as most predictive through the CatBoost-SHAP method. The top two predictive features originated from fat-omics, with the remaining 12 features derived from calcium-omics. The optimized model achieved robust predictive capabilities, demonstrating a sensitivity of 83.1+/-4.6%, specificity of 93.8+/-1.7%, accuracy of 85.3+/-2.0%, and an F1 score of 73.9+/-3.3%. Inclusion of calcium-omics and fat-omics data significantly improved predictive performance. Notably, the model also showed reliable predictive accuracy in patients with diverse coronary calcium scores, including cases with obstructive CAD despite a zero-calcium score. This innovative approach holds promise for improving clinical decision-making and potentially reducing dependence on contrast-enhanced or invasive diagnostic procedures, particularly within low-to intermediate-risk patient groups.
Original Article
View Cached Full Text

Cached at: 05/22/26, 08:51 AM

# Machine learning prediction of obstructive coronary artery disease using opportunistic coronary calcium and epicardial fat assessments from CT calcium scoring scans
Source: [https://arxiv.org/abs/2605.21762](https://arxiv.org/abs/2605.21762)
[View PDF](https://arxiv.org/pdf/2605.21762)

> Abstract:Non\-contrast computed tomography calcium scoring \(CTCS\) is a cost\-effective imaging modality widely used to detect coronary artery calcifications\. This study aimed to develop an advanced machine learning framework that utilizes quantitative analyses of coronary calcium and epicardial fat from CTCS images to predict obstructive coronary artery disease \(CAD\)\. The study population consisted of 1,324 patients from the SCOT\-HEART clinical trial who underwent both CTCS and coronary CT angiography\. We extracted and analyzed a broad range of features, including 24 clinical variables, 189 calcium\-omics, and 211 epicardial fat\-omics features from the CTCS images\. Feature selection was conducted using the CatBoost algorithm combined with SHapley Additive exPlanation \(SHAP\) values\. Predictive modeling utilized the CatBoost gradient boosting method, focusing on the most informative features\. From an initial set of 424 candidate features, 14 were identified as most predictive through the CatBoost\-SHAP method\. The top two predictive features originated from fat\-omics, with the remaining 12 features derived from calcium\-omics\. The optimized model achieved robust predictive capabilities, demonstrating a sensitivity of 83\.1\+/\-4\.6%, specificity of 93\.8\+/\-1\.7%, accuracy of 85\.3\+/\-2\.0%, and an F1 score of 73\.9\+/\-3\.3%\. Inclusion of calcium\-omics and fat\-omics data significantly improved predictive performance\. Notably, the model also showed reliable predictive accuracy in patients with diverse coronary calcium scores, including cases with obstructive CAD despite a zero\-calcium score\. This innovative approach holds promise for improving clinical decision\-making and potentially reducing dependence on contrast\-enhanced or invasive diagnostic procedures, particularly within low\-to intermediate\-risk patient groups\.

## Submission history

From: Juhwan Lee \[[view email](https://arxiv.org/show-email/cfe5b101/2605.21762)\] **\[v1\]**Wed, 20 May 2026 21:47:36 UTC \(2,484 KB\)

Similar Articles

LLMs for Cardiovascular Risk Prediction from Structured Clinical Data

arXiv cs.CL

This paper presents a hybrid framework that combines structured clinical data with LLM-generated narratives for coronary artery disease prediction, achieving high fidelity in variable extraction and comparing ML models with LLM-based zero-shot and few-shot classification.

Can AI help predict which heart-failure patients will worsen within a year?

MIT News — Artificial Intelligence

MIT researchers have developed PULSE-HF, a deep learning model that predicts whether heart failure patients will experience worsening left ventricular ejection fraction within a year using electrocardiograms. The model, published in Lancet eClinical Medicine, could help clinicians prioritize high-risk patients and reduce unnecessary hospital visits in both well-resourced and low-resource clinical settings.