Data Eng Projects

Oct 1, 2022 · 1 min read

R was used to predict employee attrition from IBM employee data. The report details data cleansing, classification (Chi-Square, Pearson Correlation), feature selection (Random Forest), and prediction (Support Vector Machines, Logistic Regression, Random Forest, Neural Networks, Decision Trees). Model performance was measured using AUC (area under the ROC curve), which summarizes the trade-off between sensitivity and specificity across all classification thresholds.
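To make the AUC metric concrete, here is a minimal sketch of how it can be computed from sensitivity and specificity. The project itself was done in R; this illustration uses plain Python, and the function names (`roc_points`, `auc`) and the toy labels and scores are assumptions made up for the example, not taken from the report.

```python
from typing import List, Tuple


def roc_points(labels: List[int], scores: List[float]) -> List[Tuple[float, float]]:
    """Return (1 - specificity, sensitivity) pairs, one per score threshold.

    labels: 1 = employee left (positive class), 0 = employee stayed.
    scores: model scores, where higher means "more likely to leave".
    """
    positives = sum(labels)
    negatives = len(labels) - positives
    if positives == 0 or negatives == 0:
        raise ValueError("need at least one positive and one negative label")

    points = [(0.0, 0.0)]  # threshold above every score: nothing flagged
    # Sweep the threshold from the highest score down to the lowest.
    for threshold in sorted(set(scores), reverse=True):
        tp = sum(1 for y, s in zip(labels, scores) if s >= threshold and y == 1)
        fp = sum(1 for y, s in zip(labels, scores) if s >= threshold and y == 0)
        sensitivity = tp / positives        # true positive rate
        specificity = 1 - fp / negatives    # true negative rate
        points.append((1 - specificity, sensitivity))
    return points


def auc(labels: List[int], scores: List[float]) -> float:
    """Area under the ROC curve, integrated with the trapezoidal rule."""
    pts = roc_points(labels, scores)
    area = 0.0
    for (x0, y0), (x1, y1) in zip(pts, pts[1:]):
        area += (x1 - x0) * (y0 + y1) / 2
    return area


if __name__ == "__main__":
    # Hypothetical toy data, for illustration only.
    toy_labels = [1, 0, 1, 0, 0, 1]
    toy_scores = [0.9, 0.8, 0.7, 0.4, 0.3, 0.2]
    print(auc(toy_labels, toy_scores))
```

An AUC of 1.0 means the scores separate the two classes perfectly, while 0.5 is what uninformative scores would give. Because the value is integrated over every threshold, it does not depend on picking a single cutoff, which is one reason it suits comparing several classifiers on the same data.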