Executive Summary
Key findings from the multi-channel advertising regression
The multi-channel advertising model explains 91.5% of sales variance (R² = 0.9149, Adjusted R² = 0.9136), with an overall fit that is statistically significant (F = 702.1, p = 0). The channel with the largest standardized effect on sales is tv_spend, meaning a one-standard-deviation increase in that channel's budget produces the greatest lift relative to its own variability. Prediction error is 1.49 units of sales on average, giving decision-makers a reliable baseline for budget allocation.
Spend Distribution by Channel
Box plot of spend values for each advertising channel
The box plot shows the spend distribution for each advertising channel across 200 budget periods. Medians per channel: tv_spend (med= 164.9 ); radio_spend (med= 22.4 ); newspaper_spend (med= 52.8 ). Channels with wider interquartile ranges have more variable budgets, which gives the regression more statistical power to estimate their effect. Outlier periods (extreme spends) can exert undue influence on coefficients and should be reviewed before finalising budget recommendations.
Spend & Sales Correlation Matrix
Pairwise correlations between advertising channels and sales
The heatmap shows pairwise Pearson correlations between all spend channels and sales. The channel most strongly correlated with sales is tv_spend (r = 0.757). Correlations between spend channels reveal multicollinearity risk: values above 0.7 indicate that two channels move together in the budget, which can inflate standard errors and make individual coefficient estimates less reliable.
Channel Effect Sizes (Standardized Coefficients)
Standardized regression coefficients by advertising channel
Standardized coefficients (beta weights) measure each channel's effect in standard-deviation units, making them directly comparable regardless of how dollars are scaled. The dominant channel by standardized effect is tv_spend (β = 0.742). Channels with beta weights near zero contribute negligible independent lift to sales after accounting for the other channels in the model.
Regression Coefficients Table
Coefficients, standard errors, and confidence intervals per channel
| channel | coefficient | std_error | t_value | p_value | ci_lower | ci_upper |
|---|---|---|---|---|---|---|
| tv_spend | 0.0439 | 0.0012 | 35.39 | 0 | 0.0414 | 0.0463 |
| radio_spend | 0.2051 | 0.0073 | 27.95 | 0 | 0.1907 | 0.2196 |
| newspaper_spend | 0.0055 | 0.0032 | 1.721 | 0.0868 | -0.0008 | 0.0118 |
The regression table shows raw coefficients (sales change per unit of spend), standard errors, t-statistics, p-values, and 95% confidence intervals for each channel. 2 of 3 channels are statistically significant (p < 0.05). The largest raw effect is radio_spend (coefficient = 0.2051), meaning each additional unit of radio_spend spend is associated with 0.2051 additional sales units, holding other channels constant.
Actual vs Predicted Sales
Scatter of observed vs model-predicted sales
Each point compares the model's predicted sales to the actual observed sales for one budget period. Points on the 45-degree diagonal indicate perfect predictions; scatter around the line represents unexplained variation. The model achieves R² = 91.5%, explaining 91.5% of total sales variance, with a root mean squared error of 1.49 units. Systematic curves or funnels in this chart would suggest the linear model needs transformation or additional variables.
Residuals vs Fitted Values
Diagnostic check for OLS assumption validity
Residuals (actual minus predicted) plotted against fitted values check whether the OLS assumptions hold. A random horizontal band around zero indicates valid homoscedasticity (constant variance). Mean residual = 0 (should be near 0); maximum absolute residual = 4.43. A funnel shape (widening variance at higher fitted values) would suggest log-transforming the sales outcome, while a curve would indicate a non-linear relationship that a polynomial or interaction term might address.
Per-Channel Sales ROI Estimate
Regression coefficients as sales-per-unit-spend ranked by channel
ROI estimates are the raw regression coefficients: sales units generated per additional unit of spend in each channel, holding all other channels constant. The highest-ROI channel is radio_spend with an estimated 0.2051 sales units per unit of spend. Note that these estimates assume the current spend mix and may not hold at dramatically different budget levels due to diminishing marginal returns — channels ranking near zero or negative are candidates for budget reallocation.