Pengembangan Model Uplift Menggunakan F Filter dan K-Means SMOTE Pada Algoritma Machine Learning untuk Prediksi Turnover Karyawan

Risyda Miftahur Rahmah

Risyda Miftahur Rahmah, Dr. Sigit Priyanta, S.Si., M.Kom.

2025 | Tesis | MAGISTER KECERDASAN ARTIFISIAL

Abstrak
File Pdf

Tingginya tingkat turnover karyawan menjadi tantangan bagi perusahaan. Model uplift menawarkan pendekatan analitik preskriptif yang mampu merekomendasikan program retensi yang tepat guna mengurangi tingkat turnover karyawan. Namun, performa model uplift sebelumnya masih belum optimal akibat keberadaan fitur yang kurang relevan serta distribusi kelas yang tidak seimbang.

Penelitian ini bertujuan untuk mengembangkan model prediksi turnover karyawan dengan pendekatan Lai’s Generalized Weighted Uplift Method (LGWUM). Pengembangan model dilakukan melalui seleksi fitur menggunakan metode F filter serta penanganan ketidakseimbangan kelas dengan metode K-Means SMOTE. Selanjutnya, dilakukan klasifikasi menggunakan algoritma Logistic Regression (LR), Extreme Gradient Boosting (XGBoost), Random Forest (RForest), Support Vector Machine (SVM), dan Balanced Random Forest (BRF). Penelitian ini menggunakan tiga dataset HR yang dilakukan feature engineering untuk menambahkan empat kelas uplift yaitu CN, CR, TN, and TR.

Hasil penelitian menunjukkan bahwa pengembangan model uplift dengan seleksi fitur F-Filter dan oversampling K-Means SMOTE terbukti efektif meningkatkan performa model, ditunjukkan oleh peningkatan koefisien qini pada dataset 2 dan 3 menjadi 0,0680 dan 0,0870. Berdasarkan analisis probabilitas pada top desile, algoritma XGBoost menunjukkan konsistensi terbaik dalam membedakan individu yang merespon positif dan individu yang merespon negatif terhadap treatment pada top desile, sehingga direkomendasikan sebagai model yang paling andal dalam penelitian ini.

High employee turnover rates pose a significant challenge for companies. Uplift model offers a prescriptive analytics approach capable of recommending appropriate retention programs to reduce employee turnover. However, the performance of previous uplift model remains suboptimal due to the presence of irrelevant features and imbalanced class distributions.

This study aims to develop an employee turnover prediction model using Lai’s Generalized Weighted Uplift Method (LGWUM). The model is enhanced through feature selection using the F-Filter method and class imbalance handling with the K-Means SMOTE technique. Classification is conducted using Logistic Regression (LR), Extreme Gradient Boosting (XGBoost), Random Forest (RForest), Support Vector Machine (SVM), and Balanced Random Forest (BRF). Three HR datasets are used, with feature engineering applied to introduce four uplift classes: CN, CR, TN, and TR.

The results indicate that the proposed model development approach, combining F-Filter feature selection and K-Means SMOTE oversampling, is effective in improving model performance, as evidenced by increased Qini coefficients on datasets 2 and 3, reaching 0.0680 and 0.0870, respectively. Based on the probability distribution analysis in the top decile, XGBoost consistently demonstrated the best capability in distinguishing between individuals who respond positively and negatively to treatment, making it the most reliable uplift model in this study.

Kata Kunci : Model Uplift, Turnover Karyawan, Metode F filter, K-Means SMOTE, Machine Learning

S2-2025-530697-abstract.pdf
S2-2025-530697-bibliography.pdf
S2-2025-530697-tableofcontent.pdf
S2-2025-530697-title.pdf

LAYANAN

E-Resources

Quick Access