User 136 · Research · Multiple Comparisons · Holm Bonferroni

Executive Summary

Holm-Bonferroni correction results across all pairwise group comparisons

Total Students

1000

Education Groups

6

Total Pairwise Tests

45

Significant After Correction

19

Pairs Significant All Outcomes

5

19 of 45 pairwise comparisons (42%) remain statistically significant after Holm-Bonferroni correction at alpha = 0.05. The analysis compared 6 parental education levels across three student outcomes (math, reading, and writing scores), testing 15 unique group pairs per outcome. 5 group pair(s) show significance consistently across all outcomes, indicating robust education-level gaps that survive rigorous family-wise error control.

Interpretation

19 of 45 pairwise comparisons (42%) remain statistically significant after Holm-Bonferroni correction at alpha = 0.05. The analysis compared 6 parental education levels across three student outcomes (math, reading, and writing scores), testing 15 unique group pairs per outcome. 5 group pair(s) show significance consistently across all outcomes, indicating robust education-level gaps that survive rigorous family-wise error control.

Data Table

Group Descriptive Statistics

Mean scores and sample sizes per parental education level

N	Group	Outcome 1 Mean	Outcome 2 Mean	Outcome 3 Mean
222	associate's degree	67.9	70.9	69.9
118	bachelor's degree	69.4	73	73.4
196	high school	62.1	64.7	62.4
59	master's degree	69.7	75.4	75.7
226	some college	67.1	69.5	68.8
179	some high school	63.5	66.9	64.9

Interpretation

Across 6 parental education groups, the highest-scoring group on the first outcome is 'master's degree' and the lowest is 'high school', a raw gap of 7.6 points before significance testing. Sample sizes are roughly balanced across groups, supporting reliable t-test inference. Groups higher in educational attainment (e.g., master's or bachelor's degree) tend to have the largest mean scores, while 'some high school' or 'high school' groups typically score lowest.

Visualization

Mean Scores by Parental Education Level

Grouped mean scores for each education level across all outcomes

Interpretation

'master's degree' students score highest and 'high school' students score lowest on average across outcomes. The education-level ranking is highly consistent across all three outcomes. These raw differences motivate the pairwise testing: we need Holm-Bonferroni correction to determine which gaps are statistically reliable rather than sampling noise.

Visualization

Score Distributions by Education Level

Box plots of score distributions showing within-group spread

Interpretation

Each education group shows substantial within-group variability: the median interquartile range on the first outcome is approximately 19.6 points. This overlap in distributions means many pairwise differences are not detectable at the individual level, which is why formal significance testing — and then Holm-Bonferroni correction — is needed to avoid over-claiming. Groups with wider distributions require larger true mean differences to achieve significance.

Data Table

Pairwise Comparison Results (All 45 Tests)

All pairwise Welch t-test results with Holm-adjusted p-values (method = 'holm')

Group 1	Group 2	Adj Pvalue	Raw Pvalue	Significant	Outcome Name
bachelor's degree	high school	0	0	Yes	Outcome 2
high school	master's degree	0	0	Yes	Outcome 2
associate's degree	high school	0	0	Yes	Outcome 3
bachelor's degree	high school	0	0	Yes	Outcome 3
high school	master's degree	0	0	Yes	Outcome 3
bachelor's degree	some high school	0.0001	0	Yes	Outcome 3
master's degree	some high school	0.0001	0	Yes	Outcome 3
associate's degree	high school	0.0003	0	Yes	Outcome 2
high school	some college	0.0003	0	Yes	Outcome 3
bachelor's degree	high school	0.0013	0	Yes	Outcome 1
associate's degree	high school	0.0031	0.0001	Yes	Outcome 1
master's degree	some high school	0.0047	0.0001	Yes	Outcome 2
high school	some college	0.0146	0.0004	Yes	Outcome 1
high school	some college	0.0193	0.0006	Yes	Outcome 2
bachelor's degree	some high school	0.0195	0.0006	Yes	Outcome 2
high school	master's degree	0.0287	0.001	Yes	Outcome 1
associate's degree	some high school	0.031	0.0011	Yes	Outcome 3
master's degree	some college	0.0334	0.0012	Yes	Outcome 3
bachelor's degree	some high school	0.0366	0.0014	Yes	Outcome 1
master's degree	some college	0.1133	0.0044	No	Outcome 2
associate's degree	some high school	0.1327	0.0053	No	Outcome 1
associate's degree	master's degree	0.1327	0.0054	No	Outcome 3
master's degree	some high school	0.1725	0.0078	No	Outcome 1
associate's degree	some high school	0.1725	0.0076	No	Outcome 2
bachelor's degree	some college	0.1725	0.0075	No	Outcome 3
some college	some high school	0.2162	0.0108	No	Outcome 3
some college	some high school	0.337	0.0177	No	Outcome 1
associate's degree	master's degree	0.5263	0.0303	No	Outcome 2
bachelor's degree	some college	0.5263	0.0292	No	Outcome 2
associate's degree	bachelor's degree	0.5926	0.037	No	Outcome 3
associate's degree	bachelor's degree	1	0.3788	No	Outcome 1
associate's degree	master's degree	1	0.4032	No	Outcome 1
associate's degree	some college	1	0.5878	No	Outcome 1
bachelor's degree	master's degree	1	0.8826	No	Outcome 1
bachelor's degree	some college	1	0.1778	No	Outcome 1
high school	some high school	1	0.3901	No	Outcome 1
master's degree	some college	1	0.2354	No	Outcome 1
associate's degree	bachelor's degree	1	0.1997	No	Outcome 2
associate's degree	some college	1	0.2665	No	Outcome 2
bachelor's degree	master's degree	1	0.2881	No	Outcome 2
high school	some high school	1	0.1465	No	Outcome 2
some college	some high school	1	0.0909	No	Outcome 2
associate's degree	some college	1	0.4465	No	Outcome 3
bachelor's degree	master's degree	1	0.308	No	Outcome 3
high school	some high school	1	0.1159	No	Outcome 3

Interpretation

19 of 45 pairwise comparisons survive Holm-Bonferroni correction at alpha = 0.05. The smallest adjusted p-value is 0, indicating the most robustly significant group difference. The largest raw p-value is 0.883. Comparing raw vs adjusted p-values reveals how much the correction inflates thresholds for lower-ranked tests — pairs near the boundary are most affected.

Visualization

Significant Pairwise Differences After Correction

Mean score gaps for group pairs that survive Holm-Bonferroni correction

Interpretation

19 pairwise comparisons remain significant after Holm-Bonferroni correction. The largest gap is between 'high school vs master's degree' at 13.23 points — a practically meaningful education-level effect. Bars are ordered by absolute mean difference so the most impactful gaps appear first, distinguishing statistical significance from effect magnitude.

Visualization

Adjusted P-Value Significance Map

Average Holm-adjusted p-value per group pair across all outcomes (cells below 0.05 are significant)

Interpretation

5 of 15 unique group pairs show an average adjusted p-value below the 0.05 threshold across all outcomes. Darker cells (lower p-values) indicate pairs where education-level differences are most consistently detectable. Pairs involving the highest and lowest education levels tend to produce the strongest signal, while adjacent levels often do not differ significantly after correction.

What's wrong with this card?

Executive Summary

Group Descriptive Statistics

Mean Scores by Parental Education Level

Score Distributions by Education Level

Pairwise Comparison Results (All 45 Tests)

Significant Pairwise Differences After Correction

Adjusted P-Value Significance Map

Report an Issue