A novel hybrid architecture and feature-selection synergy was developed to advance tabular stroke modeling.
The machine-learning framework predicted strokes using demographic, lifestyle, and clinical variables with a high accuracy rate of 97.2% and an F1-score of 97.15%.
The framework involved detailed exploratory data analysis, rigorous data preprocessing, feature selection, and optimization of algorithms like Random Forest, XGBoost, LightGBM, and support-vector classifier.
The study highlights the potential of converting low-cost tabular data into a clinical-grade stroke-risk assessment tool through advanced modeling techniques.