User 136 · Entertainment · Movies · Revenue Drivers
Executive Summary

Executive Summary

Key findings from the log-log revenue regression across TMDB films

Budget Elasticity
0.461
Model R-Squared
0.652
Films Analyzed
2999
Top Genre by Revenue
Animation
Top Genre by ROI
Horror
Across 2999 TMDB films with valid financials, log-log OLS regression explains 65.2% of variance in box office revenue (R² = 0.652). The estimated budget elasticity is 0.461, meaning a 1% increase in production budget is associated with a 0.461% increase in revenue. Animation generates the highest median absolute revenue, while Horror delivers the strongest median return on investment.
Interpretation

Across 2999 TMDB films with valid financials, log-log OLS regression explains 65.2% of variance in box office revenue (R² = 0.652). The estimated budget elasticity is 0.461, meaning a 1% increase in production budget is associated with a 0.461% increase in revenue. Animation generates the highest median absolute revenue, while Horror delivers the strongest median return on investment.

Visualization

Budget vs Revenue (Log-Log Scale)

Each point is one film; axes are log-transformed to linearise the relationship

Interpretation

The log-log scatter across 2999 films shows a clear positive relationship between production budget and box office revenue. The OLS slope (budget elasticity) is 0.461 — slightly below 1, indicating diminishing returns: doubling the budget does not quite double revenue. Genre coloring reveals that Action and Adventure films cluster at the high end, while Drama and Horror span a wide revenue range.

Visualization

Median Revenue by Genre

Median box office revenue for genres with at least 5 films in the filtered dataset

Interpretation

Median box office revenue varies substantially across 18 genres with at least 5 qualifying films. Animation leads with a median revenue of $246M. Genres with lower budgets (e.g., Horror, Documentary) tend to sit at the lower end of median revenue but may still offer attractive ROI — see the ROI by Genre card.

Visualization

Regression Coefficients

OLS coefficient estimates; positive values increase log-revenue, negative values decrease it

Interpretation

The OLS model (R² = 0.652) identifies log-budget as the dominant predictor with a coefficient of 0.461 — a 1% increase in budget predicts a 0.461% increase in revenue, holding other factors constant. Log vote_count and log popularity also contribute positively, capturing the combined effect of audience visibility and social buzz on commercial performance.

Visualization

Return on Investment by Genre

Median revenue-to-budget ratio per genre; values above 1 indicate profitable genres on average

Interpretation

Median ROI (revenue / budget) differs considerably from median absolute revenue across 18 genres. Horror leads with a median ROI of 3.7x, meaning films in this genre typically return 3.7 times their production cost. High-budget genres like Action may have lower ROI despite higher absolute revenue, because their massive budgets eat into proportional returns.

Data Table

Top Highest-Grossing Films

Top 20 films by total box office revenue with budget and ROI

ROITitleBudgetRevenuePrimary Genre
11.76Avatar2370000002787965087Action
9.23Titanic2000000001845034188Drama
6.91The Avengers2200000001519557910Science Fiction
10.09Jurassic World1500000001513528810Action
7.93Furious 71900000001506249360Action
5.02Avengers: Age of Ultron2800000001405403694Action
8.49Frozen1500000001274219009Animation
6.08Iron Man 32000000001215439994Action
15.63Minions740000001156730962Family
4.61Captain America: Civil War2500000001153304495Adventure
5.76Transformers: Dark of the Moon1950000001123746996Action
11.9The Lord of the Rings: The Return of the King940000001118888979Adventure
5.54Skyfall2000000001108561013Action
5.2Transformers: Age of Extinction2100000001091405097Science Fiction
4.34The Dark Knight Rises2500000001084939099Action
5.33Toy Story 32000000001066969703Animation
5.33Pirates of the Caribbean: Dead Man's Chest2000000001065659812Adventure
2.75Pirates of the Caribbean: On Stranger Tides3800000001045713802Adventure
5.13Alice in Wonderland2000000001025491110Family
4.08The Hobbit: An Unexpected Journey2500000001021103568Adventure
Interpretation

The top 20 films by box office revenue are shown, led by Avatar with $2.79B in revenue. Note that ROI varies even among these blockbusters — high absolute revenue does not guarantee high ROI, as the largest-grossing films also tend to have the largest budgets. Primary genre is shown to identify whether certain genres dominate the top tier.

Visualization

Vote Average vs Revenue

Each point is one film; x-axis is audience vote average (0-10), y-axis is log-revenue

Interpretation

Across 2999 films, the Pearson correlation between audience vote average and log-revenue is 0.176. This weak-to-moderate relationship suggests that while better-rated films do tend to earn more, ratings alone are a poor predictor of box office success. Many commercially dominant blockbusters score 6–7 on vote average, while critically acclaimed films can underperform at the box office. Genre coloring shows that genre moderates this relationship.

Visualization

Average Revenue by Release Year

Mean box office revenue per release year, for years with at least 5 qualifying films

Interpretation

Average box office revenue is shown across 51 release years (1961–2016) with at least 5 qualifying films each. 2016 produced the highest average revenue at $201M per film. The trend reflects a combination of industry growth, blockbuster cycles, and the increasing nominal scale of major studio productions over the decades.

Your data has more stories to tell. Run any analysis on your own data — 60+ validated R modules, interactive reports, AI insights, and PDF export. 2,000 free credits on signup.
Try Free — No Signup Sign Up Free

Report an Issue

Tell us what's wrong. You'll get a free re-run of this analysis so you can try again with different parameters. If the re-run still doesn't meet your expectations, we'll refund your credits.

Want to run this analysis on your own data? Upload CSV — Free Analysis See Pricing