Supervised Learning Based Classification of Cardiovascular Diseases

  • Arif Hussain Department of Computer Science, National College of Business Administration and Economics, Multan, Pakistan
  • Hassaan Malik Department of Computer Science, National College of Business Administration and Economics, Multan, Pakistan; Department of Computer Science, University of Management and Technology, Lahore, Pakistan
  • Muhammad Umar Chaudhry Department of Computer Science, National College of Business Administration and Economics, Multan, Pakistan; Department of Electrical and Computer Engineering, Sungkyunkwan University, Suwon, Korea; AiHawks, Multan, Pakistan
Keywords: cardiovascular disease, machine learning, artificial intelligence


Detecting cardiovascular disease (CVD) in the early stage is a difficult and crucial process. The objective of this study is to test the capability of machine learning (ML) methods for accurately diagnosing the CVD outcomes. For this study, the efficiency and effectiveness of four well renowned ML classifiers, i.e., support vector machine (SVM), logistics regression (LR), naive Bayes (NB), and decision tree (J48), are measured in terms of precision, sensitivity, specificity, accuracy, Matthews correlation coefficient (MCC), correctly and incorrectly classified instances, and model building time. These ML classifiers are applied on publically available CVD dataset. In accordance with the measured result, J48 performs better than its competitor classifiers, providing significant assistance to the cardiologists.


J. Mackay, G. A. Mensah, S. Mendis, and K. Greenlund, The Atlas of Heart Disease and Stroke, Geneva: World Health Organization, 2004.

Z. S. Wong, J. Zhou, and Q. Zhang, “Artificial Intelligence for Infectious Disease Big Data Analytics,” Infection, Disease, and Health, vol. 24, no. 1, pp. 44-48, February 2019.

S. Schneeweiss, “Learning from Big Health Care Data,” The New England Journal of Medicine, vol. 370, no. 23, pp. 2161-2163, June 2014.

A. Blumenthal, “Artificial Intelligence to Fight the Spread of Infectious Diseases,”, February 20, 2018.

K. E. Goodman, J. Lessler, S. E. Cosgrove, A. D. Harris, E. Lautenbach, J. H. Han, et al., “A Clinical Decision Tree to Predict Whether a Bacteremic Patient is Infected with an Extended-Spectrum β-Lactamase-Producing Organism,” Clinical Infectious Diseases, vol. 63, no. 7, pp. 896-903, October 2016.

A. U. Haq, J. P. Li, M. H. Memon, S. Nazir, and R. Sun, “A Hybrid Intelligent System Framework for the Prediction of Heart Disease Using Machine Learning Algorithms,” Mobile Information Systems, vol. 2018, 3860146, December 2018.

M. Elsisi, K. Mahmoud, M. Lehtonen, and M. M. Darwish, “Reliable Industry 4.0 Based on Machine Learning and IoT for Analyzing, Monitoring, and Securing Smart meters,” Sensors, vol. 21, no. 2, 487, January 2021.

M. Q. Tran, M. K. Liu, and M. Elsisi, “Effective Multi-Sensor Data Fusion for Chatter Detection in Milling Process,” The International Society of Automation Transactions, in press.

M. Elsisi, M. Q. Tran, K. Mahmoud, D. E. A. Mansour, M. Lehtonen, and M. M. Darwish, “Towards Secured Online Monitoring for Digitalized GIS Against Cyber-Attacks Based on IoT and Machine Learning,” Institute of Electrical and Electronics Engineers Access, vol. 9, pp. 78415-78427, June 2021.

J. P. Li, A. U. Haq, S. U. Din, J. Khan, A. Khan, and A. Saboor, “Heart Disease Identification Method Using Machine Learning Classification in E-Healthcare,” Institute of Electrical and Electronics Engineers Access, vol. 8, pp. 107562-107582, June 2020.

C. B. C. Latha and S. C. Jeeva, “Improving the Accuracy of Prediction of Heart Disease Risk Based on Ensemble Classification Techniques,” Informatics in Medicine Unlocked, vol. 16, 100203, July 2019.

K. W. Johnson, J. Torres Soto, B. S. Glicksberg, K. Shameer, R. Miotto, M. Ali, et al., “Artificial Intelligence in Cardiology,” Journal of the American College of Cardiology, vol. 71, no. 23, pp. 2668-2679, June 2018.

A. Raza, A. Mehmood, S. Ullah, M. Ahmad, G. S. Choi, and B. W. On, “Heartbeat Sound Signal Classification Using Deep Learning,” Sensors, vol. 19, no. 21, 4819, January 2019.

R. Spencer, F. Thabtah, N. Abdelhamid, and M. Thompson, “Exploring Feature Selection and Classification Methods for Predicting Heart Disease,” Digital Health, vol. 6, pp. 1-10, March 2020.

K. Shameer, K. W. Johnson, B. S. Glicksberg, J. T. Dudley, and P. P. Sengupta, “Machine Learning in Cardiovascular Medicine: Are We There Yet?” Heart, vol. 104, no. 14, pp. 1156-1164, July 2018.

A. M. Alaa, T. Bolton, E. Di Angelantonio, J. H. Rudd, and M. van der Schaar, “Cardiovascular Disease Risk Prediction Using Automated Machine Learning: A Prospective Study of 423,604 UK Biobank Participants,” Public Library of Science One, vol. 14, no. 5, e0213653, May 2019.

V. D. Soni, “Detection of Heart Disease Using Machine Learning Techniques,” International Journal of Scientific and Technology Research, vol. 9, no. 8, pp. 285-288, August 2020.

N. L. Fitriyani, M. Syafrudin, G. Alfian, and J. Rhee, “HDPM: An Effective Heart Disease Prediction Model for a Clinical Decision Support System,” Institute of Electrical and Electronics Engineers Access, vol. 8, pp. 133034-133050, July 2020.

H. Malik, M. S. Farooq, A. Khelifi, A. Abid, J. N. Qureshi, and M. Hussain, “A Comparison of Transfer Learning Performance Versus Health Experts in Disease Diagnosis from Medical Imaging,” Institute of Electrical and Electronics Engineers Access, vol. 8, pp. 139367-139386, June 2020.

B. Alić, L. Gurbeta, and A. Badnjević, “Machine Learning Techniques for Classification of Diabetes and Cardiovascular Diseases,” 6th Mediterranean Conference on Embedded Computing, pp. 1-4, June 2017.

X. Zhai and C. Tin, “Automated ECG Classification Using Dual Heartbeat Coupling Based on Convolutional Neural Network,” Institute of Electrical and Electronics Engineers Access, vol. 6, pp. 27465-27472, May 2018.

S. R. Saufi, Z. A. B. Ahmad, M. S. Leong, and M. H. Lim, “Challenges and Opportunities of Deep Learning Models for Machinery Fault Detection and Diagnosis: A Review,” Institute of Electrical and Electronics Engineers Access, vol. 7, pp. 122644-122662, August 2019.

M. Mohammed, M. B. Khan, and E. B. M. Bashier, Machine Learning: Algorithms and Applications, Boca Raton: Chemical Rubber Company Press, 2017.

R. Y. Goh and L. S. Lee, “Credit Scoring: A Review on Support Vector Machines and Metaheuristic Approaches,” Advances in Operations Research, vol. 2019, 1974794, March 2019.

N. Friedman, D. Geiger, and M. Goldszmidt, “Bayesian Network Classifiers,” Machine Learning, vol. 29, pp. 131-163, November 1997.

M. Mohri, A. Rostamizadeh, and A. Talwalkar, Foundations of Machine Learning, Cambridge: Massachusetts Institute of Technology Press, 2018.

R. R. Isnanto, A. F. Rochim, D. Eridani, and G. D. Cahyono, “Multi-Object Face Recognition Using Local Binary Pattern Histogram and Haar Cascade Classifier on Low-Resolution Images,” International Journal of Engineering and Technology Innovation, vol. 11, no. 1, pp. 45-58, January 2021.

How to Cite
A. Hussain, H. Malik, and M. U. Chaudhry, “Supervised Learning Based Classification of Cardiovascular Diseases”, Proc. eng. technol. innov., vol. 20, pp. 24-34, Sep. 2021.