Brain Vascular Age Prediction Using Cerebral Blood Flow Velocity and Machine Learning Algorithms

arXiv cs.AI Papers

Summary

This paper uses machine learning models trained on transcranial Doppler features to predict brain vascular age, finding accelerated aging in subjects with brain diseases such as stroke and Alzheimer's.

arXiv:2605.16969v1 Announce Type: new Abstract: Defining vascular age in terms of physiological function has become one focal point of the extensive studies to categorize and track chronological age. Transcranial Doppler (TCD) is a method by which cerebral blood flow velocity is measured along the major arteries feeding the human brain. This study aims to use features extracted from TCD to estimate chronological age and assess accelerated aging in subjects with various brain diseases. We predict subjects with various brain diseases to present with accelerated cerebrovascular aging when tested on various regression models trained by healthy subjects. 168 healthy subjects and 277 diseased subjects with bilateral TCD recordings of the middle cerebral artery were analyzed using the Morphological Analysis and Clustering of Intracranial Pressure (MOCAIP) algorithm. MOCAIP-generated features and heart rate variability features were used as input features for regression models to predict the brain vascular age. 66 subjects with acute stroke, 27 subjects with post stroke, 26 subjects with Alzheimer's disease, 23 subjects with mild cognitive impairment, and 135 established subjects were tested against the machine learning model to assess for accelerated cerebrovascular age. The trained model, on average, predicted healthy subjects' cerebrovascular age to be 3.69 years above their chronological age. Subjects with different disease conditions exhibited varying levels of age acceleration. The differences in healthy and diseased subjects' performances suggest that features generated using TCD may be relevant when evaluating accelerated cerebrovascular aging. Moreover, imbalanced datasets have been observed to affect the performance of machine-learning-based brain age prediction models.
Original Article
View Cached Full Text

Cached at: 05/19/26, 06:38 AM

# Brain vascular age prediction using cerebral blood flow velocity and machine learning algorithms
Source: [https://arxiv.org/html/2605.16969](https://arxiv.org/html/2605.16969)
Anni Zhao Center for Data Science Nell Hodgson Woodruff School of Nursing Emory University Atlanta, GA 30322, USA anni\.zhao@emory\.edu& Alex Bateh Division of Nephrology Department of Medicine University of Alabama at Birmingham Birmingham, AL 35294 USA abateh@uab\.edu&Tyler Baldridge Department of Neurology School of Medicine University of Kansas Medical Center Kansas City, KS 66103, USA tbaldridge@kumc\.edu&Sandra Billinger Department of Neurology School of Medicine University of Kansas Medical Center Kansas City, KS 66103, USA sbillinger@kumc\.edu&Xiao Hu Center for Data Science Nell Hodgson Woodruff School of Nursing Emory University Atlanta, GA 30322, USA xiao\.hu@emory\.edu

###### Abstract

Defining vascular age in terms of physiological function has become one focal point of the extensive studies to categorize and track chronological age\. Transcranial Doppler \(TCD\) is a method by which cerebral blood flow velocity is measured along the major arteries feeding the human brain\. This study aims to use features extracted from TCD to estimate chronological age and assess accelerated aging in subjects with various brain diseases\. We predict subjects with various brain diseases to present with accelerated cerebrovascular aging when tested on various regression models trained by healthy subjects\. 168 healthy subjects and 277 diseased subjects with bilateral TCD recordings of the middle cerebral artery were analyzed using the Morphological Analysis and Clustering of Intracranial Pressure \(MOCAIP\) algorithm\. MOCAIP\-generated features and heart rate variability features were used as input features for regression models to predict the brain vascular age\. 66 subjects with acute stroke, 27 subjects with post stroke, 26 subjects with Alzheimer’s disease, 23 subjects with mild cognitive impairment, and 135 established subjects were tested against the machine learning model to assess for accelerated cerebrovascular age\. The trained model, on average, predicted healthy subjects’ cerebrovascular age to be 3\.69 years above their chronological age\. Subjects with different disease conditions exhibited varying levels of age acceleration\. The differences in healthy and diseased subjects’ performances suggest that features generated using TCD may be relevant when evaluating accelerated cerebrovascular aging\. Moreover, imbalanced datasets have been observed to affect the performance of machine\-learning\-based brain age prediction models\.

*Keywords*Brain vascular age prediction⋅\\cdotCerebral blood flow velocity⋅\\cdotMorphological analysis and clustering of intracranial pressure⋅\\cdotCerebrovascular aging

## 1Introduction

Brain vascular aging reflects the progressive structural and functional changes in the cerebral vasculature over time, and it is closely associated with cognitive decline, stroke risk, and other neurological disorders\. Accurate prediction of brain vascular age can therefore provide a meaningful biomarker for assessing cerebrovascular health beyond chronological age\. Furthermore, a reliable machine learning model plays an important role in the assessment of therapeutic outcomes\. In recent years, machine learning algorithms have emerged as a powerful tool for brain vascular age prediction because they can capture complex, nonlinear relationships from physiological\[[21](https://arxiv.org/html/2605.16969#bib.bib18),[34](https://arxiv.org/html/2605.16969#bib.bib19)\], imaging\[[11](https://arxiv.org/html/2605.16969#bib.bib17)\], and multimodal datasets\[[4](https://arxiv.org/html/2605.16969#bib.bib20)\]\. By learning patterns associated with vascular aging, these methods offer the potential to identify individuals with accelerated vascular aging at an early stage, support personalized risk assessment, and improve preventive interventions\. As a result, brain vascular age prediction using machine learning has become a promising direction in precision medicine and computational neuroscience\. Various algorithms and morphologies have been adopted for brain age prediction, including convolutional neural network\[[31](https://arxiv.org/html/2605.16969#bib.bib22)\], cortical structure\[[25](https://arxiv.org/html/2605.16969#bib.bib23)\], and Hidden Markov Model\[[37](https://arxiv.org/html/2605.16969#bib.bib24)\]\. Furthermore, there are some algorithm specialities adopted for brain age prediction, such as the decentralized algorithm\[[3](https://arxiv.org/html/2605.16969#bib.bib25)\]and extreme learning machine framework\[[16](https://arxiv.org/html/2605.16969#bib.bib26)\]\. However, there is a large amount of heterogeneity in model performance reported between studies using machine learning models\. Mainly, the machine learning model will be trained first on the labeled data from healthy subjects, and implemented for brain age prediction for patients with brain diseases\. A comprehensive review of the brain vascular age prediction using machine learning models from 2013 to 2024 can be found in\[[18](https://arxiv.org/html/2605.16969#bib.bib16)\]\. Another review focusing on the UK Biobank for brain age prediction can be found in\[[19](https://arxiv.org/html/2605.16969#bib.bib21)\]\.

Most of the existing research focused on using MRI\-based features for brain vascular age prediction\. MRI\-based features have shown a significant relationship with the brain vascular age\. A well\-developed brain vascular age prediction model, DeepBrainNet, has been trained using a large set of MRI scans\[[2](https://arxiv.org/html/2605.16969#bib.bib27)\]\. Features extracted from various physiological signals have also been adopted for brain vascular age prediction using machine learning algorithms\. Recent studies suggest that age\-related vascular and neurophysiological changes can be quantified from noninvasive physiological signals, including transcranial Doppler cerebral blood flow velocity\[[27](https://arxiv.org/html/2605.16969#bib.bib30)\], near\-infrared spectroscopy–derived cerebral pulsation, photoplethysmography\[[28](https://arxiv.org/html/2605.16969#bib.bib28)\], and EEG\[[10](https://arxiv.org/html/2605.16969#bib.bib31),[14](https://arxiv.org/html/2605.16969#bib.bib32)\]\. These findings support the feasibility of developing machine\-learning models to estimate brain vascular age from cerebral hemodynamic signal features\. Photoplethysmogram \(PPG\) was adopted for brain vascular age prediction using deep convolutional neural networks in\[[34](https://arxiv.org/html/2605.16969#bib.bib19)\]\. It has been studied that the artificial intelligence\-derived photoplethysmography \(AI\-PPG\) age can serve as a biomarker for cardiovascular health\[[28](https://arxiv.org/html/2605.16969#bib.bib28)\]\. A distribution\-aware loos function was first designed to reduce bias from the imbalanced age distribution\. However, limited attention has been given to the imbalanced distribution in brain vascular age prediction\. In this paper, the cerebral blood flow velocity measured from TCD and the extracted features are adopted for brain vascular age prediction\. It has been investigated that cerebral blood flow velocity is an important aging\-related physiological signal\. The morphological features extracted from the cerebral blood flow velocity are closely related to age and neuropsychological performance\[[29](https://arxiv.org/html/2605.16969#bib.bib29),[27](https://arxiv.org/html/2605.16969#bib.bib30)\]\. Furthermore, the influence of dataset imbalance has been investigated to highlight the importance of carefully selecting training and testing datasets for brain vascular age prediction\.

The paper is organized as follows\. Section[2](https://arxiv.org/html/2605.16969#S2)explains the datasets that were adopted for the training of the machine learning model\. Section[3](https://arxiv.org/html/2605.16969#S3)shows the feature extraction and machine learning algorithms\. Section[4](https://arxiv.org/html/2605.16969#S4)shows the results of the brain vascular age prediction using various machine learning algorithms\. Section[5](https://arxiv.org/html/2605.16969#S5)concludes the paper\.

## 2Datasets

Currently, we are mainly using the datasets from the University of Kansas, including 168 healthy subjects and 277 subjects with various diseases\. There are 26 patients with Alzheimer’s disease \(AD\), 23 patients with mild cognitive impairment \(MCI\), 66 patients with acute stroke, 27 patients with post stroke, and 135 patients are established subjects\. Established subjects refer to individuals with a family history of brain disease who may have a higher risk of developing brain disease in the future\. Currently, the established subjects do not show any apparent pattern associated with brain diseases\. The distribution of the healthy subjects and disease subjects are shown from Fig\.[1](https://arxiv.org/html/2605.16969#S2.F1)to[3](https://arxiv.org/html/2605.16969#S2.F3)\.

![Refer to caption](https://arxiv.org/html/2605.16969v1/Figures/FigDataDistributionHealthyDiseasedGroup1.png)Figure 1:Data distribution of healthy subjects and diseased subjects with Alzheimer’s and MCI diseases\.![Refer to caption](https://arxiv.org/html/2605.16969v1/Figures/FigDataDistributionHealthyDiseasedGroup2.png)Figure 2:Data distribution of healthy subjects and diseased subjects with acute stroke and post stroke diseases\.![Refer to caption](https://arxiv.org/html/2605.16969v1/Figures/FigDataDistributionHealthyDiseasedGroup3.png)Figure 3:Data distribution of healthy subjects and established subjects\.As observed from Fig\.1, we can see that for healthy subjects, most of the datasets are distributed in the age range \[20, 30\] and \[60, 80\], and for various diseased subjects, most of the datasets are distributed above age 50\. It has been observed that the age distribution for healthy and diseased subjects is highly imbalanced\. In this case, the property of imbalancing may influence the performance of the machine learning model\. To further improve model performance and ensure age\-distribution consistency, healthy subjects over 50 years old were selected as the training dataset because most diseased subjects were also older than 50\. The distribution of the split training and testing healthy subjects is shown in Fig\.[4](https://arxiv.org/html/2605.16969#S2.F4)\. 75% of the healthy subjects are adopted as the training datasets, 25% of the healthy subjects are adopted as the testing datasets\.

![Refer to caption](https://arxiv.org/html/2605.16969v1/Figures/FigDataDistributionHealthySubjects.png)Figure 4:Data distribution of the training and testing healthy subjects for machine learning algorithms\.
## 3Algorithm

In this section, the feature extraction algorithm and machine learning algorithms adopted in this paper are illustrated in detail\. The Morphological Clustering and Analysis of the Continuous Intracranial Pressure \(MOCAIP\) algorithm is adopted to extract the morphological features from the cerebral blood velocity\[[15](https://arxiv.org/html/2605.16969#bib.bib1)\]\. After feature extraction, the extracted features are served as inputs for the machine learning model, and the age is adopted as the output for the model\.

### 3\.1Feature extraction algorithm

#### 3\.1\.1Pre\-analysis

MOCAIP has the ability to extract the dominant pulse from the pulsatile signals\. A dominant pulse in the MOCAIP toolbox is the representative pulse waveform selected from a group of pulses in a time window because it best reflects the stable, typical pulse morphology while reducing noise and artifacts\. Here, we analyze and compare the averaged dominant pulse across various age groups for healthy subjects as shown in Fig\.[5\(a\)](https://arxiv.org/html/2605.16969#S3.F5.sf1)\. It has been observed that as age increases, the amplitude of the dominant pulse decreases with increased waveform complexity\. This could be due to the vascular stiffening, wave reflection, and reduced cerebral compliance\[[5](https://arxiv.org/html/2605.16969#bib.bib33)\]\.

Furthermore, the averaged dominant pulse comparisons across the same age group \[20, 40\] for healthy subjects and diseased subjects are shown from Fig\.[5\(b\)](https://arxiv.org/html/2605.16969#S3.F5.sf2)to[5\(d\)](https://arxiv.org/html/2605.16969#S3.F5.sf4)\. Fig\.[5\(b\)](https://arxiv.org/html/2605.16969#S3.F5.sf2)shows the averaged dominant pulse comparisons in the age group \[20, 40\] for healthy and acute stroke subjects\. Fig\.[5\(c\)](https://arxiv.org/html/2605.16969#S3.F5.sf3)shows the comparisons between the healthy and post stroke subjects\. Fig\.[5\(d\)](https://arxiv.org/html/2605.16969#S3.F5.sf4)shows the comparisons between the healthy subjects and established subjects\. Higher CBv amplitude and a greater number of waveform peaks were observed in the acute stroke and established groups\. These waveform alterations may reflect abnormal cerebrovascular hemodynamics, as TCD\-derived CBv waveforms have been shown to provide information related to cerebrovascular occlusion, stenosis, and pathological waveform morphology\[[35](https://arxiv.org/html/2605.16969#bib.bib34)\]\. Potential physiological mechanisms include impaired or heterogeneous cerebral autoregulation after stroke, compensatory changes in cerebral perfusion, and increased flow through collateral pathways\[[24](https://arxiv.org/html/2605.16969#bib.bib35)\]\. Moreover, vascular narrowing, disturbed or turbulent flow, and abnormal wave propagation near stenotic vessels may contribute to additional peaks in pulsatile CBv signals\[[17](https://arxiv.org/html/2605.16969#bib.bib36)\]\.

![Refer to caption](https://arxiv.org/html/2605.16969v1/x1.png)\(a\)Averaged dominant pulse comparisons across various age groups for healthy subjects\.
![Refer to caption](https://arxiv.org/html/2605.16969v1/x2.png)\(b\)Averaged dominant pulse comparisons in age group \[20,40\] for healthy and acute stroke subjects\.
![Refer to caption](https://arxiv.org/html/2605.16969v1/x3.png)\(c\)Averaged dominant pulse comparisons in age group \[20,40\] for healthy and post stroke subjects\.
![Refer to caption](https://arxiv.org/html/2605.16969v1/x4.png)\(d\)Averaged dominant pulse comparisons in age group \[20,40\] for healthy and established subjects\.

Figure 5:Averaged dominant pulse comparisons in age group \[20,40\] for healthy subjects and different disease groups\.
#### 3\.1\.2Feature extraction

Fig\.[6](https://arxiv.org/html/2605.16969#S3.F6)shows the data processing in general, including the data selection, signal segmentation, and feature extraction\. The 445 entries were selected using the following procedures: \(1\) data completeness and quality check, and \(2\) feature validity check\. After the data completeness and quality check, signals from each recording were segmented into entries of 360 beats using the MOCAIP toolbox\. Each recording is at least 360 beats long\. Each entry was then manually reviewed by verifying the validity of the extracted MOCAIP features\. There are 128 features in total, and some representative features are shown on the right\-hand side of Fig\.[6](https://arxiv.org/html/2605.16969#S3.F6)\. All datasets, either non\-invasive or invasive, are collected synchronously and resampled at 400 Hz\. For entries with ECG, the R\-R interval is computed using a general biomedical signal processing toolbox for QRS detection\. The process of signal segmentation is shown in the middle diagram\. The figure and table on the right\-hand side show the feature extraction procedure for the MOCAIP toolbox\. Other than the 128 features extracted from the MOCAIP toolbox, the body mass index \(BMI\) and heart rate variability features extracted from the ECG signal are adopted together as a feature vector for the machine learning algorithm\. In total, there are 137 numerical features extracted from the TCD and ECG signals\. There are various mathematical metrics that have been calculated in MOCAIP\. The right\-hand side of Fig\.[6](https://arxiv.org/html/2605.16969#S3.F6)shows some representative features, including the amplitude of landmarks from the minimum point, the slope of each rising edge, the mean absolute curvature of the pulse, and the absolute curvature of landmarks\. After applying MOCAIP to the healthy and diseased subjects, the top 10 features with the largest group difference are obtained using the variations from the mean value of the features\. The features with the largest group difference are computed following the procedures below:

Assume the original feature matrix is

X=\[x11x12⋯x1​px21x22⋯x2​p⋮⋮⋱⋮xN​1xN​2⋯xN​p\],X=\\begin\{bmatrix\}x\_\{11\}&x\_\{12\}&\\cdots&x\_\{1p\}\\\\ x\_\{21\}&x\_\{22\}&\\cdots&x\_\{2p\}\\\\ \\vdots&\\vdots&\\ddots&\\vdots\\\\ x\_\{N1\}&x\_\{N2\}&\\cdots&x\_\{Np\}\\end\{bmatrix\},whereNNis the total number of samples andppis the number of features\. The elementxi​jx\_\{ij\}represents the value of thejj\-th feature for theii\-th sample\.

If standard scaling is applied, each feature is normalized as

zi​j=xi​j−μjσj,z\_\{ij\}=\\frac\{x\_\{ij\}\-\\mu\_\{j\}\}\{\\sigma\_\{j\}\},whereμj\\mu\_\{j\}andσj\\sigma\_\{j\}are the mean and standard deviation of thejj\-th feature, respectively\. They are defined as

μj=1N​∑i=1Nxi​j,\\mu\_\{j\}=\\frac\{1\}\{N\}\\sum\_\{i=1\}^\{N\}x\_\{ij\},and

σj=1N−1​∑i=1N\(xi​j−μj\)2\.\\sigma\_\{j\}=\\sqrt\{\\frac\{1\}\{N\-1\}\\sum\_\{i=1\}^\{N\}\\left\(x\_\{ij\}\-\\mu\_\{j\}\\right\)^\{2\}\}\.
Therefore, the scaled feature matrix can be written as

Xscaled=\[z11z12⋯z1​pz21z22⋯z2​p⋮⋮⋱⋮zN​1zN​2⋯zN​p\]\.X\_\{\\text\{scaled\}\}=\\begin\{bmatrix\}z\_\{11\}&z\_\{12\}&\\cdots&z\_\{1p\}\\\\ z\_\{21\}&z\_\{22\}&\\cdots&z\_\{2p\}\\\\ \\vdots&\\vdots&\\ddots&\\vdots\\\\ z\_\{N1\}&z\_\{N2\}&\\cdots&z\_\{Np\}\\end\{bmatrix\}\.
Let each sample belong to one ofGGgroups\. The group label of theii\-th sample is denoted as

gi∈\{1,2,…,G\}\.g\_\{i\}\\in\\\{1,2,\\ldots,G\\\}\.
For thegg\-th group and thejj\-th feature, the group\-wise mean of the scaled feature values is computed as

z¯g​j=1ng​∑i:gi=gzi​j,\\bar\{z\}\_\{gj\}=\\frac\{1\}\{n\_\{g\}\}\\sum\_\{i:g\_\{i\}=g\}z\_\{ij\},wherengn\_\{g\}is the number of samples in groupgg\.

The group\-wise feature mean matrix is then given by

M=\[z¯11z¯12⋯z¯1​pz¯21z¯22⋯z¯2​p⋮⋮⋱⋮z¯G​1z¯G​2⋯z¯G​p\]\.M=\\begin\{bmatrix\}\\bar\{z\}\_\{11\}&\\bar\{z\}\_\{12\}&\\cdots&\\bar\{z\}\_\{1p\}\\\\ \\bar\{z\}\_\{21\}&\\bar\{z\}\_\{22\}&\\cdots&\\bar\{z\}\_\{2p\}\\\\ \\vdots&\\vdots&\\ddots&\\vdots\\\\ \\bar\{z\}\_\{G1\}&\\bar\{z\}\_\{G2\}&\\cdots&\\bar\{z\}\_\{Gp\}\\end\{bmatrix\}\.
To quantify how much each feature differs across groups, the variance of the group means is calculated for each feature\. For thejj\-th feature, this variance is defined as

Vj=1G−1​∑g=1G\(z¯g​j−z¯⋅j\)2,V\_\{j\}=\\frac\{1\}\{G\-1\}\\sum\_\{g=1\}^\{G\}\\left\(\\bar\{z\}\_\{gj\}\-\\bar\{z\}\_\{\\cdot j\}\\right\)^\{2\},where

z¯⋅j=1G​∑g=1Gz¯g​j\\bar\{z\}\_\{\\cdot j\}=\\frac\{1\}\{G\}\\sum\_\{g=1\}^\{G\}\\bar\{z\}\_\{gj\}is the average of the group means for thejj\-th feature\.

Finally, the features are ranked according to their variance values:

Vj1≥Vj2≥⋯≥Vjp\.V\_\{j\_\{1\}\}\\geq V\_\{j\_\{2\}\}\\geq\\cdots\\geq V\_\{j\_\{p\}\}\.
The topKKfeatures are selected as

ℱtop=\{j1,j2,…,jK\},\\mathcal\{F\}\_\{\\text\{top\}\}=\\\{j\_\{1\},j\_\{2\},\\ldots,j\_\{K\}\\\},whereK=10K=10in this code\. Thus, the selected features are those with the largest variation in group\-wise mean values across different groups\. The top 10 features obtained are RLp1v2Lp1p2, RLTLp1p3, RLTLv1p2, RLTLp1p2, RLTLv1p3, Lv1p3, LT, RLTLp2p3, RLTLv2p2, and RLv1p1Lv1p3 as defined in MOCAIP\. The mathematical definitions\[[1](https://arxiv.org/html/2605.16969#bib.bib37)\]of the top 10 features can be found in Table[1](https://arxiv.org/html/2605.16969#S3.T1)\. From Table[1](https://arxiv.org/html/2605.16969#S3.T1)we can see that there is a large group difference in terms of the ratio among time delays, time delay among landmarks, and the time delay between the peaks\. A group\-level visualization of the standardized features differences can be found in Fig\.[7](https://arxiv.org/html/2605.16969#S3.F7)\. From Fig\.[7](https://arxiv.org/html/2605.16969#S3.F7), it is observed that the acute stroke patients exhibit significant feature differences compared with the other subject groups\. It is interesting to notice that for most of the features there are no large differences between the healthy subjects and the subjects with other diseases\. There is minor difference in terms of the time delay among landmarks Lv1p3 for various subjects\.

Table 1:Top 10 MOCAIP features with the largest group difference\.![Refer to caption](https://arxiv.org/html/2605.16969v1/Figures/Fig4_FeatureExtraction.png)Figure 6:Data processing and feature extraction procedures for brain vascular age prediction using CBv\. Left: Data selection procedure from the database\. Middle: Signal segmentation for signals with and without ECG\. Right: Feature extraction for CBv pulse using MOCAIP\.![Refer to caption](https://arxiv.org/html/2605.16969v1/Figures/FigGroupLevelFeatureDifferences.png)Figure 7:Group\-level standardized feature differences for MOCAIP features extracted from the healthy and the diseased subjects\.

### 3\.2Machine learning algorithms

Various machine learning algorithms are adopted for the brain vascular age prediction, including the eXtreme Gradient Boosting \(XGBoost\), CatBoost, Random Forest, Gaussian process regressor, and the Tabular foundation model \(TabPFN\)\. XGBoost, CatBoost, and Random Forest are typical tree\-based machine learning algorithms\. Tree\-based ensemble algorithms have been extensively applied in healthcare outcomes prediction, especially for structured clinical data such as EHRs, mortality prediction, readmission, and risk stratification\[[9](https://arxiv.org/html/2605.16969#bib.bib6)\]\. XGBoost has been applied for predicting 30\-days mortality for MIMIC\-III patients with sepsis\-3\[[13](https://arxiv.org/html/2605.16969#bib.bib7)\]and hospital readmission\[[38](https://arxiv.org/html/2605.16969#bib.bib8)\]\. Multiple ensemble tree\-based algorithms have been adopted for readmission evaluation for stroke patients in\[[7](https://arxiv.org/html/2605.16969#bib.bib9)\]\. CatBoost has also been extensively applied for dementia risk prediction\[[8](https://arxiv.org/html/2605.16969#bib.bib10)\], ICU mortality prediction\[[33](https://arxiv.org/html/2605.16969#bib.bib11)\], and carotid sonographic features analysis for the prediction of recurrent stroke\[[22](https://arxiv.org/html/2605.16969#bib.bib12)\]\. Random Forest is not as popular as XGBoost and CatBoost\. However, it has been applied for dementia\-related neuropsychiatric symptoms prediction\[[26](https://arxiv.org/html/2605.16969#bib.bib14)\], incident delirium\[[6](https://arxiv.org/html/2605.16969#bib.bib13)\], and intensive care unit readmission\[[32](https://arxiv.org/html/2605.16969#bib.bib15)\]\.

Tabular foundation model \(TabPFN\)\[[12](https://arxiv.org/html/2605.16969#bib.bib2)\]is a specific foundation model trained using millions of synthetic tabular datasets\. It outperforms the traditional algorithms, such as gradient\-boosted decision trees, especially when dealing with datasets with a heterogeneity property\. TabPFN has been extensively applied in healthcare\[[20](https://arxiv.org/html/2605.16969#bib.bib3),[23](https://arxiv.org/html/2605.16969#bib.bib4),[36](https://arxiv.org/html/2605.16969#bib.bib5)\]\.\[[23](https://arxiv.org/html/2605.16969#bib.bib4)\]proposed an integrated multimodal engine based on TabPFN for image data processing\. Recently, TabPFN was also adopted in\[[20](https://arxiv.org/html/2605.16969#bib.bib3)\]for lymphvascular invasion prediction in invasive breast cancer\. A hybrid LighGBM\-TabPFN model was proposed in\[[36](https://arxiv.org/html/2605.16969#bib.bib5)\]for dementia prediction in Parkinson’s disease\. TabPFN shows significant performance improvement when dealing with smaller tabular datasets and missing values\. In this study, the XGBoost, CatBoost, Random Forest, Gaussian process regressor, and TabPFN are adopted for brain vascular age prediction\. The MOCAIP and heart rate variability features extracted from the healthy subjects are adopted as inputs, and the age is adopted as output for the machine learning model\. After the machine learning model is trained, it is applied to the diseased subjects for early brain disease detection\. This approach is based on the assumption that the physiological features extracted from healthy and diseased subjects exhibit distinct characteristics\. The healthy subjects are split into training and testing datasets\. Furthermore, several resampling strategies were evaluated to address group imbalance; however, they did not consistently improve prediction performance\. It has been observed that using healthy datasets above difference age threshold as training datasets yield different results\. To guarantee the estimation accuracy, the final model was trained using the healthy subject distributed above age 50 with subgroup\-level performance evaluation to assess potential bias across cohorts\.

## 4Results

The mean absolute errors of the predicted brain vascular age from healthy and diseased subjects for various algorithms are summarized in Table[2](https://arxiv.org/html/2605.16969#S4.T2)\. From the results, we can see that the TabPFN yields the smallest mean absolute error of 3\.36 for the healthy testing subjects\. Moreover, the results from XGBoost, CatBoost, and Random Forest \(RF\) with balanced sample weighting\[[30](https://arxiv.org/html/2605.16969#bib.bib38)\]are included in Table[2](https://arxiv.org/html/2605.16969#S4.T2)\. It has been observed that there are minor improvements through incorporating the sample weighting algorithm\. The XGBoost with sample reweighting yields the smallest mean absolute error for AD and MCI subjects\. The RF with sample reweighting yields the smallest mean absolute error for post stroke and established subjects\. Table[3](https://arxiv.org/html/2605.16969#S4.T3)shows the MAE difference between the healthy subjects and the diseased subjects\. It is observed that the XGBoost shows the largest MAE difference between the healthy subjects and the subjects with stroke and established diseases\. TabPFN shows the largest MAE difference between the healthy subjects and the subjects with MCI diseases\. CatBoost with sample weighting shows the largest MAE difference between the healthy subjects and the subjects with AD and post stroke\. On average, AD subjects showed 3\.57 years of age acceleration\. Patients with MCI showed 1\.72 years of acceleration\. Stroke patients exhibited the largest age acceleration, with 6\.12 years\. Post\-stroke patients showed 5\.51 years of acceleration, while established subjects showed 2\.75 years of acceleration\. The scatter plot for the predicted brain vascular age from healthy and diseased subjects is shown from Fig\.[8](https://arxiv.org/html/2605.16969#S4.F8)to Fig\.[10](https://arxiv.org/html/2605.16969#S4.F10)\. Based on the results from Fig\.[7](https://arxiv.org/html/2605.16969#S3.F7), we can see that there are no large feature differences between the subjects with various diseases, which is aligned with the mean absolute error shown in Table[2](https://arxiv.org/html/2605.16969#S4.T2)\. The mean absolute errors from the trained machine learning models show no large difference between the subjects with AD, MCI, stroke, post stroke, and established diseases\. Table[4](https://arxiv.org/html/2605.16969#S4.T4)shows the summary of the proportional, standard deviation, and mean of positive bran vascular gaps across various machine learning algorithms\. The positive brain vascular age gap is defined as the predicted age minus the chronological age\. From Table[4](https://arxiv.org/html/2605.16969#S4.T4)we can see that the stroke patients show the largest proportional, mean values, and standard deviation for positive brain vascular age gaps\. Furthermore, the distribution of the positive predicted age gap and the negative predicted age gap is shown in Fig\.[11](https://arxiv.org/html/2605.16969#S4.F11)\. From the figure we can see that there are 168 subjects show positive predicted gap and 151 subjects show negative predicted gap\. Moreover, most of the positive predicted age gap is distributed in chronological age above 65\. Most of the negative predicted age gap is distributed in chronological age below 65\. Figs\.[12](https://arxiv.org/html/2605.16969#S4.F12)to[14](https://arxiv.org/html/2605.16969#S4.F14)shows the chronological age distribution with positive predicted gap and negative predicted gap across different groups\. It is observed that for different group, they are following the same distribution \- most of the positive predicted age gap is distributed below age 65∼\\sim70 and most of the negative predicted age gap is distributed above age 70\. This finding may suggest a tendency toward accelerated brain vascular aging among patients older than 65 years\.

Table 2:Mean absolute error summary of brain vascular age prediction for healthy and diseased subjects from various machine learning algorithms\. The minimum mean absolute error for each machine learning algorithm is highlighted in blue\.Table 3:Mean absolute error difference summary of brain vascular age prediction between the healthy and diseased subjects from various machine learning algorithms\. The maximum mean error difference for each machine learning algorithm is highlighted in blue\.Table 4:Summary of the proportion, standard deviation, and mean of positive brain vascular age gaps from Tabular foundation model\. The groups with the largest proportion, standard deviation, and mean of the brain vascular age gap are highlighted in blue\.![Refer to caption](https://arxiv.org/html/2605.16969v1/Figures/FigPredictionResults_0.png)Figure 8:Brain vascular prediction results from healthy and AD subjects\.![Refer to caption](https://arxiv.org/html/2605.16969v1/Figures/FigPredictionResults_1.png)Figure 9:Brain vascular prediction results from subjects with MCI and acute stroke\.![Refer to caption](https://arxiv.org/html/2605.16969v1/Figures/FigPredictionResults_2.png)Figure 10:Brain vascular prediction results from post stroke and established subjects\.![Refer to caption](https://arxiv.org/html/2605.16969v1/Figures/Fig_PredictedAgeGap_AllSubjects.png)Figure 11:The positive and negative predicted brain vascular prediction gap for all subjects, including the healthy and diseased subjects\.![Refer to caption](https://arxiv.org/html/2605.16969v1/Figures/Fig_PredictedAgeGroup_GapType1.png)Figure 12:The positive and negative predicted brain vascular age gap for AD and Established subjects\.![Refer to caption](https://arxiv.org/html/2605.16969v1/Figures/Fig_PredictedAgeGroup_GapType2.png)Figure 13:The positive and negative predicted brain vascular age gap for healthy and MCI subjects\.![Refer to caption](https://arxiv.org/html/2605.16969v1/Figures/Fig_PredictedAgeGroup_GapType3.png)Figure 14:The positive and negative predicted brain vascular age gap for post stroke and stroke subjects\.
## 5Conclusions

In this paper, the TCD signal is adopted for brain vascular age prediction\. Mathematical features extracted using MOCAIP and heart rate variability toolbox are adopted as inputs for the machine learning algorithm\. Various machine learning algorithms, including XGBoost, CatBoost, Random Forest, and Tabular foundation model, are adopted for brain vascular age prediction\. It has been observed that the Tabular foundation model yields the smallest MAE for testing healthy subjects\. XGBoost shows the largest MAE difference for stroke and established subjects, CatBoost shows the largest MAE difference for subjects with AD and post stroke\. However, it has been observed that brain vascular age prediction depends heavily on the distribution of the age\. In general, the age distribution for healthy subjects is highly imbalanced, and there are limited approaches existing for dealing with imbalanced datasets for brain age prediction\. Although balanced sample weighting was adopted in this study to address the imbalanced age distribution, only limited performance improvement was observed\. In the future, it is worth investigating a more effective resampling approach for brain vascular age prediction with imbalanced datasets\. Furthermore, the unequal sample sizes among diseased groups could introduce bias into the machine learning model predictions\. Demographic information, including gender and BMI, along with measurement\-related factors such as estimation position, should be considered during the modeling and evaluation of machine learning models\.

## References

- \[1\]S\. Asgari, N\. Gonzalez, A\. W\. Subudhi, R\. Hamilton, P\. Vespa, M\. Bergsneider, R\. C\. Roach, and X\. Hu\(2012\)Continuous detection of cerebral vasodilatation and vasoconstriction using intracranial pulse morphological template matching\.Plos one7\(11\),pp\. e50795\.External Links:ISSN 1932\-6203Cited by:[§3\.1\.2](https://arxiv.org/html/2605.16969#S3.SS1.SSS2.p10.2)\.
- \[2\]V\. M\. Bashyam, G\. Erus, J\. Doshi, M\. Habes, I\. M\. Nasrallah, M\. Truelove\-Hill, D\. Srinivasan, L\. Mamourian, R\. Pomponio, and Y\. Fan\(2020\)MRI signatures of brain age and disease over the lifespan based on a deep brain network and 14 468 individuals worldwide\.Brain143\(7\),pp\. 2312–2324\.External Links:ISSN 0006\-8950Cited by:[§1](https://arxiv.org/html/2605.16969#S1.p2.1)\.
- \[3\]S\. Basodi, R\. Raja, B\. Ray, H\. Gazula, J\. Liu, E\. Verner, and V\. D\. Calhoun\(2021\)Federation of brain age estimation in structural neuroimaging data\.Conference ProceedingsIn2021 43rd Annual International Conference of the IEEE Engineering in Medicine & Biology Society \(EMBC\),pp\. 3854–3857\.External Links:ISBN 172811179XCited by:[§1](https://arxiv.org/html/2605.16969#S1.p1.1)\.
- \[4\]T\. Chattopadhyay, P\. Senthilkumar, R\. H\. Ankarath, C\. Patterson, E\. J\. Gleave, S\. I\. Thomopoulos, H\. Huang, L\. Shen, L\. You, and D\. Zhi\(2026\)Deep learning to predict future cognitive decline: a multimodal approach using brain mri and clinical data\.Frontiers in Neuroimaging5,pp\. 1726037\.Cited by:[§1](https://arxiv.org/html/2605.16969#S1.p1.1)\.
- \[5\]A\. Chien, F\. Vinuela, V\. Szeder, G\. Colby, R\. Jahan, A\. Wang, S\. Tateshima, G\. Duckwiler, and N\. Salamon\(2025\)Cerebrovascular longitudinal atlas: changes in cerebral arteries in unruptured intracranial aneurysm patients followed with mra\.NeuroImage: Clinical46,pp\. 103766\.External Links:ISSN 2213\-1582Cited by:[§3\.1\.1](https://arxiv.org/html/2605.16969#S3.SS1.SSS1.p1.1)\.
- \[6\]J\. P\. Corradi, S\. Thompson, J\. F\. Mather, C\. M\. Waszynski, and R\. S\. Dicks\(2018\)Prediction of incident delirium using a random forest classifier\.Journal of medical systems42\(12\),pp\. 261\.External Links:ISSN 0148\-5598Cited by:[§3\.2](https://arxiv.org/html/2605.16969#S3.SS2.p1.1)\.
- \[7\]N\. Darabi, N\. Hosseinichimeh, A\. Noto, R\. Zand, and V\. Abedi\(2021\)Machine learning\-enabled 30\-day readmission model for stroke patients\.Frontiers in neurology12,pp\. 638267\.External Links:ISSN 1664\-2295Cited by:[§3\.2](https://arxiv.org/html/2605.16969#S3.SS2.p1.1)\.
- \[8\]H\. Ding, A\. Mandapati, A\. P\. Hamel, C\. Karjadi, T\. F\. Ang, W\. Xia, R\. Au, and H\. Lin\(2023\)Multimodal machine learning for 10\-year dementia risk prediction: the framingham heart study\.Journal of Alzheimer’s Disease96\(1\),pp\. 277–286\.External Links:ISSN 1387\-2877Cited by:[§3\.2](https://arxiv.org/html/2605.16969#S3.SS2.p1.1)\.
- \[9\]P\. Doupe, J\. Faghmous, and S\. Basu\(2019\)Machine learning for health services researchers\.Value in Health22\(7\),pp\. 808–815\.External Links:ISSN 1098\-3015Cited by:[§3\.2](https://arxiv.org/html/2605.16969#S3.SS2.p1.1)\.
- \[10\]D\. A\. Engemann, A\. Mellot, R\. Höchenberger, H\. Banville, D\. Sabbagh, L\. Gemein, T\. Ball, and A\. Gramfort\(2022\)A reusable benchmark of brain\-age prediction from m/eeg resting\-state signals\.Neuroimage262,pp\. 119521\.External Links:ISSN 1053\-8119Cited by:[§1](https://arxiv.org/html/2605.16969#S1.p2.1)\.
- \[11\]H\. Han, S\. Ge, and H\. Wang\(2023\)Prediction of brain age based on the community structure of functional networks\.Biomedical Signal Processing and Control79,pp\. 104151\.External Links:ISSN 1746\-8094Cited by:[§1](https://arxiv.org/html/2605.16969#S1.p1.1)\.
- \[12\]N\. Hollmann, S\. Müller, L\. Purucker, A\. Krishnakumar, M\. Körfer, S\. B\. Hoo, R\. T\. Schirrmeister, and F\. Hutter\(2025\)Accurate predictions on small data with a tabular foundation model\.Nature637\(8045\),pp\. 319–326\.External Links:ISSN 0028\-0836Cited by:[§3\.2](https://arxiv.org/html/2605.16969#S3.SS2.p2.1)\.
- \[13\]N\. Hou, M\. Li, L\. He, B\. Xie, L\. Wang, R\. Zhang, Y\. Yu, X\. Sun, Z\. Pan, and K\. Wang\(2020\)Predicting 30\-days mortality for mimic\-iii patients with sepsis\-3: a machine learning approach using xgboost\.Journal of translational medicine18\(1\),pp\. 462\.External Links:ISSN 1479\-5876Cited by:[§3\.2](https://arxiv.org/html/2605.16969#S3.SS2.p1.1)\.
- \[14\]S\. Hu, X\. Xiang, X\. Huang, Y\. Lu, X\. Zhang, D\. Yao, and P\. A\. Valdes\-Sosa\(2025\)Lifespan brain age prediction based on multiple eeg oscillatory features and sparse group lasso\.Frontiers in Aging Neuroscience17,pp\. 1559067\.External Links:ISSN 1663\-4365Cited by:[§1](https://arxiv.org/html/2605.16969#S1.p2.1)\.
- \[15\]X\. Hu, P\. Xu, F\. Scalzo, P\. Vespa, and M\. Bergsneider\(2008\)Morphological clustering and analysis of continuous intracranial pressure\.IEEE Transactions on Biomedical Engineering56\(3\),pp\. 696–705\.Cited by:[§3](https://arxiv.org/html/2605.16969#S3.p1.1)\.
- \[16\]P\. H\. Kassani, A\. Gossmann, and Y\. Wang\(2019\)Multimodal sparse classifier for adolescent brain age prediction\.IEEE journal of biomedical and health informatics24\(2\),pp\. 336–344\.External Links:ISSN 2168\-2194Cited by:[§1](https://arxiv.org/html/2605.16969#S1.p1.1)\.
- \[17\]J\. Kim\(2019\)Pictorial essay: transcranial doppler findings of the intracranial and extracranial diseases\.Journal of Neurosonology and Neuroimaging11\(1\),pp\. 2–21\.External Links:ISSN 2635\-425XCited by:[§3\.1\.1](https://arxiv.org/html/2605.16969#S3.SS1.SSS1.p2.1)\.
- \[18\]L\. S\. Kumari and R\. Sundarrajan\(2024\)A review on brain age prediction models\.Brain Research1823,pp\. 148668\.External Links:ISSN 0006\-8993Cited by:[§1](https://arxiv.org/html/2605.16969#S1.p1.1)\.
- \[19\]Y\. Li, H\. Gao, L\. Lin, Y\. Wu, and X\. Zhu\(2026\)UK biobank\-centric advances in brain age prediction: a comprehensive review\.Reviews in the Neurosciences37\(1\),pp\. 19–42\.External Links:ISSN 0334\-1763Cited by:[§1](https://arxiv.org/html/2605.16969#S1.p1.1)\.
- \[20\]Y\. Li, J\. Yang, P\. Xiao, H\. Liu, Y\. Zhou, X\. Yang, G\. Chen, and Z\. Zuo\(2025\)MRI delta\-radiomics and morphological feature\-driven tabpfn model for preoperative prediction of lymphovascular invasion in invasive breast cancer\.Technology in Cancer Research & Treatment24,pp\. 15330338251362050\.External Links:ISSN 1533\-0346Cited by:[§3\.2](https://arxiv.org/html/2605.16969#S3.SS2.p2.1)\.
- \[21\]J\. Libiseller\-Egger, J\. E\. Phelan, Z\. I\. Attia, E\. D\. Benavente, S\. Campino, P\. A\. Friedman, F\. Lopez\-Jimenez, D\. A\. Leon, and T\. G\. Clark\(2022\)Deep learning\-derived cardiovascular age shares a genetic basis with other cardiac phenotypes\.Scientific reports12\(1\),pp\. 22625\.External Links:ISSN 2045\-2322Cited by:[§1](https://arxiv.org/html/2605.16969#S1.p1.1)\.
- \[22\]S\. Lin, K\. Law, Y\. Yeh, K\. Wu, J\. Lai, C\. Lin, W\. Hsu, C\. Lin, and C\. Kao\(2022\)Applying machine learning to carotid sonographic features for recurrent stroke in patients with acute stroke\.Frontiers in Cardiovascular Medicine9,pp\. 804410\.External Links:ISSN 2297\-055XCited by:[§3\.2](https://arxiv.org/html/2605.16969#S3.SS2.p1.1)\.
- \[23\]J\. Luo, Y\. Yuan, and S\. Xu\(2025\)TIME: tabpfn\-integrated multimodal engine for robust tabular\-image learning\.arXiv preprint arXiv:2506\.00813\.Cited by:[§3\.2](https://arxiv.org/html/2605.16969#S3.SS2.p2.1)\.
- \[24\]H\. Ma, Z\. Guo, H\. Jin, X\. Yan, J\. Liu, S\. Lv, P\. Zhang, X\. Sun, and Y\. Yang\(2018\)Preliminary study of dynamic cerebral autoregulation in acute ischemic stroke: association with clinical factors\.Frontiers in Neurology9,pp\. 1006\.External Links:ISSN 1664\-2295Cited by:[§3\.1\.1](https://arxiv.org/html/2605.16969#S3.SS1.SSS1.p2.1)\.
- \[25\]C\. R\. Madan and E\. A\. Kensinger\(2018\)Predicting age from cortical structure across the lifespan\.European Journal of Neuroscience47\(5\),pp\. 399–416\.External Links:ISSN 0953\-816XCited by:[§1](https://arxiv.org/html/2605.16969#S1.p1.1)\.
- \[26\]J\. Mar, A\. Gorostiza, O\. Ibarrondo, C\. Cernuda, A\. Arrospide, Á\. Iruin, I\. Larrañaga, M\. Tainta, E\. Ezpeleta, and A\. Alberdi\(2020\)Validation of random forest machine learning models to predict dementia\-related neuropsychiatric symptoms in real\-world data\.Journal of Alzheimer’s Disease77\(2\),pp\. 855–864\.External Links:ISSN 1387\-2877Cited by:[§3\.2](https://arxiv.org/html/2605.16969#S3.SS2.p1.1)\.
- \[27\]M\. L\. Miller, P\. Ghisletta, B\. S\. Jacobs, C\. L\. Dahle, and N\. Raz\(2021\)Changes in cerebral arterial pulsatility and hippocampal volume: a transcranial doppler ultrasonography study\.Neurobiology of Aging108,pp\. 110–121\.External Links:ISSN 0197\-4580Cited by:[§1](https://arxiv.org/html/2605.16969#S1.p2.1)\.
- \[28\]G\. Nie, Q\. Zhao, G\. Tang, Y\. Li, and S\. Hong\(2025\)Artificial intelligence\-derived photoplethysmography age as a digital biomarker for cardiovascular health\.Communications Medicine5\(1\),pp\. 481\.External Links:ISSN 2730\-664XCited by:[§1](https://arxiv.org/html/2605.16969#S1.p2.1)\.
- \[29\]M\. P\. Pase, N\. A\. Grima, C\. Stough, A\. Scholey, and A\. Pipingas\(2014\)Association of pulsatile and mean cerebral blood flow velocity with age and neuropsychological performance\.Physiology & behavior130,pp\. 23–27\.External Links:ISSN 0031\-9384Cited by:[§1](https://arxiv.org/html/2605.16969#S1.p2.1)\.
- \[30\]F\. Pedregosa, G\. Varoquaux, A\. Gramfort, V\. Michel, B\. Thirion, O\. Grisel, M\. Blondel, P\. Prettenhofer, R\. Weiss, and V\. Dubourg\(2011\)Scikit\-learn: machine learning in python\.the Journal of machine Learning research12,pp\. 2825–2830\.External Links:ISSN 1532\-4435Cited by:[§4](https://arxiv.org/html/2605.16969#S4.p1.1)\.
- \[31\]K\. M\. Poloni, R\. J\. Ferrari, and A\. D\. N\. Initiative\(2022\)A deep ensemble hippocampal cnn model for brain age estimation applied to alzheimer’s diagnosis\.Expert Systems with Applications195,pp\. 116622\.External Links:ISSN 0957\-4174Cited by:[§1](https://arxiv.org/html/2605.16969#S1.p1.1)\.
- \[32\]J\. C\. Rojas, K\. A\. Carey, D\. P\. Edelson, L\. R\. Venable, M\. D\. Howell, and M\. M\. Churpek\(2018\)Predicting intensive care unit readmission with machine learning using electronic health record data\.Annals of the American Thoracic Society15\(7\),pp\. 846–853\.External Links:ISSN 2329\-6933Cited by:[§3\.2](https://arxiv.org/html/2605.16969#S3.SS2.p1.1)\.
- \[33\]N\. Safaei, B\. Safaei, S\. Seyedekrami, M\. Talafidaryani, A\. Masoud, S\. Wang, Q\. Li, and M\. Moqri\(2022\)E\-catboost: an efficient machine learning framework for predicting icu mortality using the eicu collaborative research database\.Plos one17\(5\),pp\. e0262895\.External Links:ISSN 1932\-6203Cited by:[§3\.2](https://arxiv.org/html/2605.16969#S3.SS2.p1.1)\.
- \[34\]H\. Shin, G\. Noh, and B\. Choi\(2022\)Photoplethysmogram based vascular aging assessment using the deep convolutional neural network\.Scientific Reports12\(1\),pp\. 11377\.External Links:ISSN 2045\-2322Cited by:[§1](https://arxiv.org/html/2605.16969#S1.p1.1),[§1](https://arxiv.org/html/2605.16969#S1.p2.1)\.
- \[35\]S\. G\. Thorpe, C\. M\. Thibeault, N\. Canac, K\. Jalaleddini, A\. Dorn, S\. J\. Wilk, T\. Devlin, F\. Scalzo, and R\. B\. Hamilton\(2020\)Toward automated classification of pathological transcranial doppler waveform morphology via spectral clustering\.PloS one15\(2\),pp\. e0228642\.External Links:ISSN 1932\-6203Cited by:[§3\.1\.1](https://arxiv.org/html/2605.16969#S3.SS1.SSS1.p2.1)\.
- \[36\]V\. Q\. Tran and H\. Byeon\(2024\)Predicting dementia in parkinson’s disease on a small tabular dataset using hybrid lightgbm–tabpfn and shap\.Digital Health10,pp\. 20552076241272585\.External Links:ISSN 2055\-2076Cited by:[§3\.2](https://arxiv.org/html/2605.16969#S3.SS2.p2.1)\.
- \[37\]B\. Wang and T\. D\. Pham\(2011\)MRI\-based age prediction using hidden markov models\.Journal of neuroscience methods199\(1\),pp\. 140–145\.External Links:ISSN 0165\-0270Cited by:[§1](https://arxiv.org/html/2605.16969#S1.p1.1)\.
- \[38\]P\. Zhao, I\. Yoo, and S\. H\. Naqvi\(2021\)Early prediction of unplanned 30\-day hospital readmission: model development and retrospective data analysis\.JMIR medical informatics9\(3\),pp\. e16306\.Cited by:[§3\.2](https://arxiv.org/html/2605.16969#S3.SS2.p1.1)\.

Similar Articles

MIT FINGERS-7B: First Multi-Omics AI Model for Alzheimer’s Prevention

Reddit r/singularity

MIT released FINGERS-7B, a 7-billion-parameter multi-omics foundation model trained on data from 30,000 individuals to predict Alzheimer's risk years in advance. The model is accessible via the AD Workbench and is accompanied by a research paper on OpenReview.

Teaching the Brain to Discover Itself

ML at Berkeley

This blog post explores the intersection of machine learning and neuroscience, specifically focusing on using multivariate classification techniques on neuroimaging data to understand brain function and behavior.