Data Eng Projects
Oct 1, 2022
R was used to predict employee attrition from IBM employee data. The report details data cleansing, classification (Chi-Square, Pearson correlation), feature selection (Random Forest), and prediction (Support Vector Machines, Logistic Regression, Random Forest, Neural Networks, Decision Trees). Model performance was measured using AUC (area under the ROC curve), which summarizes the trade-off between sensitivity and specificity across classification thresholds.
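To make the AUC metric concrete, here is a minimal sketch of how it relates to sensitivity and specificity. It is written in plain Python for illustration and is not the R code from the report. The function names `roc_auc` and `sensitivity_specificity`, the 0.5 threshold, and the toy labels and scores are all assumptions made up for this example. It relies on the standard interpretation of AUC as the probability that a randomly chosen positive case (an employee who left) receives a higher score than a randomly chosen negative case.

```python
def roc_auc(labels, scores):
    """AUC as the fraction of (positive, negative) pairs ranked correctly.

    Ties between a positive and a negative score count as half a win.
    """
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative example")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


def sensitivity_specificity(labels, scores, threshold):
    """Sensitivity and specificity when predicting 1 for score >= threshold."""
    pairs = list(zip(labels, scores))
    tp = sum(1 for y, s in pairs if y == 1 and s >= threshold)
    fn = sum(1 for y, s in pairs if y == 1 and s < threshold)
    tn = sum(1 for y, s in pairs if y == 0 and s < threshold)
    fp = sum(1 for y, s in pairs if y == 0 and s >= threshold)
    return tp / (tp + fn), tn / (tn + fp)


# Hypothetical toy data (not from the IBM dataset):
# 1 = employee left, 0 = employee stayed.
labels = [1, 0, 1, 0, 0, 1]
scores = [0.9, 0.4, 0.35, 0.2, 0.6, 0.8]

auc = roc_auc(labels, scores)
sens, spec = sensitivity_specificity(labels, scores, threshold=0.5)
print(f"AUC={auc:.3f} sensitivity={sens:.3f} specificity={spec:.3f}")
```

Sensitivity and specificity each depend on the chosen threshold, while AUC does not. That makes AUC a convenient single number for comparing models such as the ones listed above, especially when the attrition classes are imbalanced.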