PENERAPAN METODE ENSEMBLE UNTUK MENINGKATKAN KINERJA ALGORITME KLASIFIKASI PADA IMBALANCED DATASET

Yoga Pristyanto

Abstract


Pada bidang data mining sering kali para peneliti tidak memperhatikan keseimbangan distribusi kelas pada dataset. Hal ini dapat menimbulkan kesulitan yang cukup serius pada algoritme klasifikasi. karena secara teori mayoritas classifier mengasumsikan distribusi yang relatif seimbang, sehingga menyebabkan kinerja suatu algoritme klasifikasi menjadi kurang maksimal. Oleh karena itu, pada penelitian ini diterapkan metode ensemble dengan penambahan adaptive boosting untuk menyelesaikan permasalahan tersebut. Dari hasil pengujian yang dilakukan pada penelitian ini, metode ensemble dengan penambahan adaptive boosting dapat meningkatkan nilai kinerja algoritme klasifikasi. Nilai kinerja algoritme Naive Bayes dengan Adaptive Boosting akurasi yang dihasilkan sebesar 91.98%, sensitifitas sebesar 91.98%, spesifisitas sebesar 96.49%, dan g-mean sebesar 94.21%. Nilai kinerja algoritme Support Vector Machine dengan Adaptive Boosting akurasi yang dihasilkan sebesar 91.52%, sensitifitas sebesar 91.52%, spesifisitas sebesar 96.29%, dan g-mean sebesar 93.88%. Sedangkan Nilai kinerja algoritme Decision Tree dengan Adaptive Boosting akurasi yang dihasilkan sebesar 94.37%, sensitifitas sebesar 94.37%, spesifisitas sebesar 97.73%, dan g-mean sebesar 96.03%. Hal ini menunjukkan bahwa metode ensemble dengan Adaptive Boosting dapat menjadi solusi untuk meningkatkan kinerja algoritme pada imbalanced dataset.

Kata Kunci: adaptive boosting, data mining, ensemble, ketidakseimbangan kelas, klasifikasi.


Full Text:

PDF

References


Ian H. Wilten & Eibe Frank, Data Mining Practical Machine Learning Tools and Techniques, Second Edi. San Francisco: Morgan Kaufmann Publishers, 2005.

Jiawei Han and Micheline Kamber, Jiawei Han & Micheline Kamber, Second Edi. San Francisco: Morgan Kaufmann Publishers, 2006.

Y. Pristyanto, N. A. Setiawan, and I. Ardiyanto, “Hybrid Resampling to Handle Imbalanced Class on Classification of Student Performance in Classroom,” in The First International Conference on Informatics and Computational Sciences (ICICoS 2017), 2017, pp. 215–220.

T. M. Christian and M. Ayub, “Exploration of classification using NBTree for predicting students’ performance,” in Proceedings of 2014 International Conference on Data and Software Engineering, 2014, pp. 1–5.

G. Gray, C. McGuinness, and P. Owende, “An application of classification models to predict learner progression in tertiary education,” 2014 4th IEEE Int. Adv. Comput. Conf. IACC 2014, pp. 549–554, 2014.

S. A. Kumar, M. N. Vijayalakshmi, and D. V. M. N. S.Anupama Kumar, “Appraising the Significance of Self Regulated Learning in Higher Education Using Neural Networks,” Int. J. Eng. Res. Dev., vol. Volume 1, no. Issue 1, pp. 9–15, 2012.

M. Mayilvaganan and D. Kalpanadevi, “Comparison of classification techniques for predicting the performance of students academic environment,” Commun. Netw. Technol. (ICCNT), 2014 Int. Conf. Comput. Intell. Comput. Res., pp. 113–118, 2014.

R. S. Wahono, N. S. Herman, and S. Ahmad, “Neural network parameter optimization based on genetic algorithm for software defect prediction,” Adv. Sci. Lett., vol. 20, no. 10–12, pp. 1951–1955, 2014.

S. Aries and R. S. Wahono, “Pendekatan Level Data untuk Menangani Ketidakseimbangan Kelas pada Prediksi Cacat Software,” J. Softw. Eng., vol. 1, no. 2, pp. 76–85, 2015.

Y. Sun, M. S. Kamel, A. K. C. Wong, and Y. Wang, “Cost-sensitive boosting for classification of imbalanced data,” Pattern Recognit., vol. 40, no. 12, pp. 3358–3378, 2007.

R. Longadge and S. Dongre, “Class Imbalance Problem in Data Mining Review,” Int. J. Comput. Sci. Netw., vol. 2, no. 1, pp. 83–87, 2013.

G. Hu, T. Xi, F. Mohammed, and H. Miao, “Classification of wine quality with imbalanced data,” Proc. IEEE Int. Conf. Ind. Technol., pp. 1712–1717, 2016.

S. T. Jishan, R. I. Rashu, N. Haque, and R. M. Rahman, “Improving accuracy of students’ final grade prediction model using optimal equal width binning and synthetic minority over-sampling technique,” Decis. Anal., vol. 2, no. 1, pp. 1–25, 2015.

R. I. Rashu, N. Haq, and R. M. Rahman, “Data mining approaches to predict final grade by overcoming class imbalance problem,” 2014 17th Int. Conf. Comput. Inf. Technol. ICCIT 2014, pp. 14–19, 2014.

A. R. Naufal, R. Satria Wahono, and A. Syukur, “Penerapan Bootstrapping dan Weighted Information Gain untuk Optimasi Parameter pada Algoritma Support Vector Machine untuk Prediksi Loyalitas Pelanggan oleh :,” J. Intell. Syst., vol. 1, no. 2, pp. 98–108, 2015.

O. N. Pratiwi, “Predicting student placement class using data mining,” in Proceedings of 2013 IEEE International Conference on Teaching, Assessment and Learning for Engineering, TALE 2013, 2013, no. August, pp. 618–621.

H. Liu, H. Tian, Y. Li, and L. Zhang, “Comparison of four Adaboost algorithm based artificial neural networks in wind speed predictions,” Energy Convers. Manag., vol. 92, pp. 67–81, 2015.

A. S. Nugroho, A. B. Witarto, and D. Handoko, “Support Vector Machine,” Proceeding Indones. Sci. Meeiting Cent. Japan, 2003.

Kusrini and E. Taufiq Luthfi, Algoritma Data Mining, Edisi Pert. Yogyakarta: Penerbit Andi, 2009.

B. Max, Principles of Data Mining. London: Springer, 2007.

M. Han, J., & Kamber, Data Mining: Concepts and Techniques Second, Second Edi., vol. 12. San Fransisco: Morgan Kauffman, 2006.

D. M. W. Powers, “Evaluation: From Precision, Recall And F-Measure To ROC, Informedness, Markedness & Correlation,” vol. 2, no. 1, pp. 37–63, 2011.

Y. Pristyanto, I. Pratama, and A. F. Nugraha, “Data level approach for imbalanced class handling on educational data mining multiclass classification,” in 2018 International Conference on Information and Communications Technology, ICOIACT 2018, 2018, pp. 310–314.




DOI: https://doi.org/10.33365/jti.v13i1.184

Refbacks

  • There are currently no refbacks.


Copyright (c) 2021 Yoga Pristyanto

Creative Commons License
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.


JURNAL TEKNOINFO
Published by Universitas Teknokrat Indonesia
Organized by Prodi S1 Informatika FTIK Universitas Teknokrat Indonesia

W: http://ejurnal.teknokrat.ac.id/index.php/teknoinfo/index
E : teknoinfo@teknokrat.ac.id.
Jl. Zainal Abidin Pagaralam, No.9-11, Labuhan Ratu, Bandarlampung

Creative Commons License
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.
Jumlah Pengunjung : View Teknoinfo StatsCounter

Flag Counter