Free — no account required

IBM HR Employee Attrition Drivers In Minutes

Upload your data and get a complete ibm hr employee attrition drivers report. Free.

24,000+ analyses run
Encrypted & deleted in 7 days
PDF & citation included

Drop your CSV here

or click to browse · max 3 MB

📊
-
Rows
-
Columns
-
Numeric

Running ibm hr employee attrition drivers analysis...

Running ibm hr employee attrition drivers analysis...

Your report is ready

Sent to — interactive charts, statistical results, R code, and AI insights.

Analyze another file
Sample Output

Every report includes interactive charts, tables, and AI insights

Upload your data to get your own report

View all case studies See all free tools

How it works

Dual-model approach combining logistic regression (interpretable odds ratios with 95% CI) and XGBoost with gain-based feature importance as a SHAP proxy, applied to 20 employee attributes spanning compensation, workload, satisfaction, tenure, and demographics

Use this when you have HR employee records with a binary attrition outcome and want to identify and rank the factors driving voluntary turnover using both interpretable statistics and non-linear machine learning

Do not use if your dataset lacks a clear binary attrition column, has fewer than 100 employee records, or if you need causal inference rather than predictive association

Built for: HR Business Partners, People Analytics Managers, Compensation & Benefits Specialists, Talent Acquisition Directors, Chief Human Resources Officers

Typical data source: Employee HR dataset with attrition outcome (Yes/No), demographics (age, department, job role), compensation fields (monthly income, job level, stock options), satisfaction ratings (job, environment, work-life balance), and tenure fields (years at company, years in current role, years since last promotion)

technologyfinancial serviceshealthcaremanufacturing

What data do you need?

Dataset with 20 columns

attrition (categorical) age (numeric) department (categorical) job_role (categorical) job_level (numeric) monthly_income (numeric) years_at_company (numeric) years_in_current_role (numeric) overtime (categorical) business_travel (categorical) job_satisfaction (numeric) environment_satisfaction (numeric) work_life_balance (numeric) distance_from_home (numeric) total_working_years (numeric) num_companies_worked (numeric) years_since_last_promotion (numeric) stock_option_level (numeric) training_times_last_year (numeric) years_with_curr_manager (numeric)

Minimum 100 rows

What's in the report?

Binary classification: Attrition (Yes/No, converted to 1/0). The IBM HR dataset has 1,470 employees × 35 attributes. Baseline attrition rate ~16.1%; top driver in published Kaggle notebooks is OverTime, followed by MonthlyIncome, JobRole, YearsAtCompany, and BusinessTravel. Analysis compares a logistic regression (directional, interpretable) against xgboost with SHAP values (non-linear, ranked importance).

📋

Attrition Rate by Segment

Attrition rates broken down by department, overtime status, business travel frequency, and job level to identify highest-risk segments

📊

Attrition Rate by Job Role

Job role attrition rates ranked to identify which titles face disproportionate voluntary turnover

📊

Logistic Regression: Odds Ratios

Logistic regression odds ratios (forest plot) showing which predictors significantly increase or decrease attrition odds after controlling for all other factors

📋

Logistic Regression: Full Coefficient Table

Complete logistic regression coefficient table with p-values and 95% confidence intervals for every predictor

📊

XGBoost SHAP Feature Importance

XGBoost gain-based feature importance ranked by mean absolute contribution to attrition prediction

🟧

Overtime × Department Attrition Heatmap

Overtime-by-department cross-tab of attrition rates showing whether overtime-driven attrition is uniform or concentrated in specific departments

📈

Employee Retention Survival Curve

Kaplan-Meier retention probability over tenure years highlighting at which milestones attrition risk peaks

🔵

Model Discrimination: ROC Curve

ROC curve showing logistic regression discrimination quality between attritors and stayers

🤖

AI Insights

Plain-English interpretation — what the numbers mean, what's significant, and what to do next.

Related tools

Need something simpler? Tf038 Live Ttest — When you only need to test whether a single metric like monthly income or job satisfaction differs significantly between employees who left vs. stayed

Need more power? Anova Factorial — When you need to analyze interaction effects between multiple categorical HR factors such as department and overtime and job level on attrition rates

Similar: Churn Drivers

The Question This Answers

Identify employees at highest risk of quitting

Identify employees at highest risk of quitting

Questions?

See our FAQ for details on pricing, data privacy, and how the analysis works. Every report includes a Methodology section showing the statistical test, assumptions checked, and diagnostics run.

Your data has more stories to tell

Run any analysis on your own data — validated R analyses, interactive reports, AI insights, and PDF export.

Try Free — No Credit Card
Powered by MCP Analytics