Featuring Machine Learning Models to Evaluate Employee Attrition: A Comparative Analysis of Workforce StabilityRelating Factors

Abstract
Employee attrition is a problem for most organizations as it affects morale, productivity, and business continuity. In addressing this, the study made use of machine learning techniques such as Clear AI, Random Forest, and logistic regression in designing a prediction model to predict who is the next to leave within an organization. The HR data relating to demographics, performance metrics, job roles, and conditions of work was sourced from publicly available website Kaggle.com for the study. Data preprocessing included scaling, outlier detection, and balancing the dataset using SMOTE. Multiple machine learning models were trained and evaluated by checking on accuracy, F1-score, and the ROC-AUC curve. The best model that was tested was Random Forest, which gave an accuracy of 85.71%. Additional insights from feature importance highlighted the significant effect of overtime, marital status, and stock options on attrition. Among the remaining key drivers are workload, work-life balance, and financial incentives. These findings suggest the need for focused HR strategies, such as reduction of overtime, mentorship programs, and career development opportunities, to reduce attrition rates and improve employee satisfaction. This study provides a robust methodology in predicting attrition and delivers actionable insights into designing interventions that improve workforce stability and organizational efficiency.
Keywords: Employee Attrition, Features Importance, Human Resources, Machine Learning Models, Organizations

Author(s): Mustafizul Haque, Tejasvini Alok Paralkar, Sudhir Rajguru, Adheer A Goyal, Tanaya Patil, Kamal Upreti*
Volume: 6 Issue: 2 Pages: 862-873
DOI: https://doi.org/10.47857/irjms.2025.v06i02.03512