Examining Techniques to Solving Imbalanced Datasets in Educational Data Mining Systems

نتاج البحث: المساهمة في مجلةArticleمراجعة النظراء

4 اقتباسات (Scopus)

ملخص

The educational data mining research attempts have contributed in developing policies to improve student learning in different levels of educational institutions. One of the common challenges to building accurate classification and prediction systems is the imbalanced distribution of classes in the data collected. This study investigates data-level techniques and algorithm-level techniques. Six classifiers from each technique are used to explore their effectiveness to handle the imbalanced data problem while predicting students’ graduation grade based on their performance at the first stage. The classifiers are tested using the k-fold cross-validation approach before and after applying the data-level and algorithm-level techniques. For the purpose of evaluation, various evaluation metrics have been used such as accuracy, precision, recall, and f1-score. The results showed that the classifiers do not perform well with imbalanced dataset, and the performance could be improved by using these techniques. As for the level of improvement, it varies from one technique to another.

اللغة الأصليةEnglish
الصفحات (من إلى)205-213
عدد الصفحات9
دوريةInternational Journal of Computing
مستوى الصوت21
رقم الإصدار2
المعرِّفات الرقمية للأشياء
حالة النشرPublished - يونيو 30 2022

ASJC Scopus subject areas

  • ???subjectarea.asjc.1700.1701???
  • ???subjectarea.asjc.1700.1712???
  • ???subjectarea.asjc.1700.1710???
  • ???subjectarea.asjc.1700.1708???
  • ???subjectarea.asjc.1700.1705???

قم بذكر هذا