Download PDFOpen PDF in browserOn relationships between imbalance and overlapping of datasets10 pages•Published: March 9, 2020AbstractThe paper deals with problems that imbalanced and overlapping datasets often en- counter. Performance indicators as accuracy, precision and recall of imbalanced data sets, both with and without overlapping, are discussed and compared with the same performance indicators of balanced datasets with overlapping. Three popular classification algorithms, namely, Decision Tree, KNN (k-Nearest Neighbors) and SVM (Support Vector Machines) classifiers are analyzed and compared.Keyphrases: classification algorithms, imbalance data, machine learning, overlapping classes, oversampling algorithms, undersampling algorithms In: Gordon Lee and Ying Jin (editors). Proceedings of 35th International Conference on Computers and Their Applications, vol 69, pages 141-150.
|