Identifying mortality risk factors amongst acute coronary syndrome patients admitted to Arabian Gulf hospitals using machine-learning methods

Syed Asif Raza*, Lukman Thalib, Jassim Al Suwaidi, Kadhim Sulaiman, Wael Almahmeed, Haitham Amin, Khalid F. AlHabib

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

2 Citations (Scopus)

Abstract

Acute coronary syndrome (ACS) is a leading cause of mortality and morbidity in the Arabian Gulf. In this study, the in-hospital mortality amongst patients admitted with ACS to Arabian Gulf hospitals is predicted using a comprehensive modelling framework that combines powerful machine-learning methods such as support-vector machine (SVM), Naïve Bayes (NB), artificial neural networks (NN), and decision trees (DT). The performance of the machine-learning methods is compared with that of the performance of a commonly used statistical method, namely, logistic regression (LR). The study follows the current practise of computing mortality risk using risk scores such as the Global Registry of Acute Coronary Events (GRACE) score, which has not been validated for Arabian Gulf patients. Cardiac registry data of 7,000 patients from 65 hospitals located in Arabian Gulf countries are used for the study. This study is unique as it uses a contemporary data analytics framework. A k-fold (k = 10) cross-validation is utilized to generate training and validation samples from the GRACE dataset. The machine-learning-based predictive models often incur prejudgments for imbalanced training data patterns. To mitigate the data imbalance due to scarce observations for in-hospital mortalities, we have utilized specialized methods such as random undersampling (RUS) and synthetic minority over sampling technique (SMOTE). A detailed simulation experimentation is carried out to build models with each of the five predictive methods (LR, NN, NB, SVM, and DT) for the each of the three datasets k-fold subsamples generated. The predictive models are developed under three schemes of the k-fold samples that include no data imbalance, RUS, and SMOTE. We have implemented an information fusion method rooted in computing weighted impact scores obtain for an individual medical history attributes from each of the predictive models simulated for a collective recommendation based on an impact score specific to a predictor. Finally, we grouped the predictors using fuzzy c-mean clustering method into three categories, high-, medium-, and low-risk factors for in-hospital mortality due to ACS. Our study revealed that patients with medical history related to the presences of peripheral artery disease, congestive heart failure, cardiovascular transient ischemic attack valvular disease, and coronary artery bypass grafting amongst others have the most risk for in-hospital mortality.

Original languageEnglish
Article numbere12413
JournalExpert Systems
Volume36
Issue number4
DOIs
Publication statusPublished - 2019

Keywords

  • Global Registry of Acute Coronary Events (GRACE) risk score
  • Naïve Bayes
  • acute coronary syndrome (ACS)
  • decision tree
  • fuzzy c-mean clustering
  • imbalance data
  • information fusion
  • logistic regression
  • machine learning
  • mortality
  • neural networks
  • predictive analytics
  • support-vector machine

ASJC Scopus subject areas

  • Control and Systems Engineering
  • Theoretical Computer Science
  • Computational Theory and Mathematics
  • Artificial Intelligence

Cite this