User 136 · Social · Countries · Happiness Regression
Executive Summary

Executive Summary

Key regression findings including model fit and strongest predictor

Countries Analysed
156
R-Squared
0.779
Adjusted R-Squared
0.77
Strongest Predictor
social_support
Model F-Statistic
87.6
The six-factor regression explains 77.9% of the variation in national happiness scores (R² = 0.779, adjusted R² = 0.77) across 156 countries. After controlling for all other predictors, social_support emerges as the strongest independent driver of happiness. Residual diagnostics confirm that the model assumptions are broadly satisfied.
Interpretation

The six-factor regression explains 77.9% of the variation in national happiness scores (R² = 0.779, adjusted R² = 0.77) across 156 countries. After controlling for all other predictors, social_support emerges as the strongest independent driver of happiness. Residual diagnostics confirm that the model assumptions are broadly satisfied.

Data Table

Predictor Descriptive Statistics

Mean, SD, min, and max for each happiness predictor across all countries

VariableMeanSDMinMax
gdp_per_capita0.90510.398401.684
social_support1.2090.299201.624
healthy_life_expectancy0.72520.242101.141
freedom0.39260.143300.631
generosity0.18480.095300.566
corruption0.11060.094500.453
Interpretation

Across the 156 countries analysed, GDP per capita shows the widest relative spread, while generosity and corruption have the smallest absolute ranges. Social support and healthy life expectancy have the highest mean values among the six predictors, suggesting these factors are prevalent in most countries. The outcome variable (happiness score) has a mean of 5.407.

Visualization

Predictor Correlation Matrix

Pairwise Pearson correlations among happiness score and all six predictors

Interpretation

The correlation matrix covers happiness score and all six predictors. Among the predictors, gdp_per_capita shows the strongest bivariate correlation with happiness score (r = 0.794). High inter-predictor correlations (r > 0.7) could signal multicollinearity — the VIF table below provides the definitive diagnostic.

Visualization

Standardized Regression Coefficients

Beta weights ranking each predictor's independent effect on happiness

Interpretation

Standardized beta coefficients rank the six predictors by their independent contribution to happiness after controlling for all others. social_support has the largest beta (0.302), meaning a one-SD increase in this factor is associated with a 0.302-SD change in happiness, holding all else constant. Predictors with betas close to zero have little unique explanatory power once the other five factors are accounted for.

Visualization

Top 10 Happiest Countries

Countries with the highest happiness scores

Interpretation

Finland tops the rankings with a happiness score of 7.769. The gap between the happiest country and the 10th-ranked country is 0.523 points, showing that the top tier is relatively tightly clustered. All top-10 countries score well above the global mean of 5.407.

Visualization

Bottom 10 Least Happy Countries

Countries with the lowest happiness scores

Interpretation

South Sudan has the lowest happiness score in the dataset at 2.853. The gap between the least happy country and the happiest is 4.916 points — a range of 4.916 points spanning the full dataset. Countries at the bottom tend to share low scores across multiple predictors, reinforcing the multidimensional nature of wellbeing.

Data Table

VIF Multicollinearity Diagnostics

VIF scores indicating collinearity between happiness predictors

PredictorVif Value
gdp_per_capita4.116
healthy_life_expectancy3.573
social_support2.736
freedom1.575
corruption1.432
generosity1.224
Interpretation

Variance Inflation Factors assess how much each predictor's coefficient variance is inflated by correlations with other predictors. All VIF values are below 5 (maximum: 4.12 for gdp_per_capita), indicating acceptable multicollinearity. Predictors with VIF < 5 can be interpreted with confidence; those above 5 should be treated with caution when drawing causal conclusions.

Visualization

Residuals vs Fitted Values

Scatter plot of regression residuals against fitted values to check homoskedasticity

Interpretation

The residuals vs fitted plot assesses whether OLS assumption of homoskedasticity holds: residuals should be randomly scattered around zero with no systematic pattern. Here residuals have a standard deviation of 0.5231 and a maximum absolute value of 1.753. Any funnel shape or curve in this plot would indicate heteroskedasticity or non-linearity requiring model refinement.

Visualization

Residual Distribution

Histogram of regression residuals to validate the normality assumption

Interpretation

The histogram of residuals tests the normality assumption underpinning OLS inference. A bell-shaped, symmetric distribution centred near zero supports valid p-values and confidence intervals. The approximate skewness of the residuals is -0.493; values near zero indicate a symmetric distribution consistent with the normality assumption.

Your data has more stories to tell. Run any analysis on your own data — 60+ validated R modules, interactive reports, AI insights, and PDF export. 2,000 free credits on signup.
Try Free — No Signup Sign Up Free

Report an Issue

Tell us what's wrong. You'll get a free re-run of this analysis so you can try again with different parameters. If the re-run still doesn't meet your expectations, we'll refund your credits.

Want to run this analysis on your own data? Upload CSV — Free Analysis See Pricing