Performance Evaluation of Machine Learning Algorithm in Various Datasets

https://doi.org/10.55529/jaimlnn.32.14.32

Authors

  • Md. Siraj-Ud-Doulah Associate Professor, Department of Statistics, Begum Rokeya University, Rangpur, Bangladesh
  • Md. Nazmul Islam M.Sc, Department of Statistics, Begum Rokeya University, Rangpur, Bangladesh

Keywords:

Machine Learning, Classification, Confusion Matrix, Performance Measures.

Abstract

Machine learning is one of the fast-growing areas of computer science, with far-reaching applications. There are several applications for machine learning. The most significant of which is supervised learning. Supervised learning is common in classification problems. In this study, frequently used twelve machine learning algorithms are considered: NB, LDA, LR, ANN, SVM, K-NN, HT, DT, C4.5, CART, RF and BB. We apply these algorithms on seven datasets. The main goal of this study was to evaluate the performance of the machine learning algorithms on both binary and multiple classification problems using a variety of performance metrics: accuracy, kappa statistic, precision, recall, specificity, F-measure, MAE, RMSE and MCC. Here, we found that RF algorithm proved to have the best performance in three out of seven datasets. But the other four algorithms: NN, NB, BB and LR also performed well.

Published

2023-02-21

How to Cite

Md. Siraj-Ud-Doulah, & Md. Nazmul Islam. (2023). Performance Evaluation of Machine Learning Algorithm in Various Datasets. Journal of Artificial Intelligence,Machine Learning and Neural Network (JAIMLNN) ISSN: 2799-1172, 3(02), 14–32. https://doi.org/10.55529/jaimlnn.32.14.32