Free — no account required

Medical Insurance Cost Prediction In Minutes

Upload your data and get a complete medical insurance cost prediction report. Free.

24,000+ analyses run
Encrypted & deleted in 7 days
PDF & citation included

Drop your CSV here

or click to browse · max 3 MB

📊
-
Rows
-
Columns
-
Numeric

Running medical insurance cost prediction analysis...

Running medical insurance cost prediction analysis...

Your report is ready

Sent to — interactive charts, statistical results, R code, and AI insights.

Analyze another file
Sample Output

Every report includes interactive charts, tables, and AI insights

Upload your data to get your own report

View all case studies See all free tools

How it works

Generalized Linear Model (GLM) with Gamma distribution and log link, including a smoker-by-BMI interaction term, fitted to predict individual annual medical insurance charges from demographic and lifestyle factors

Use when predicting medical costs or insurance premiums with right-skewed positive outcome variables and suspected multiplicative interaction effects between lifestyle factors

Do not use if outcome variable can be negative, if sample size is very small (under 100), or if you need exact prediction intervals rather than mean cost estimates

Built for: Actuaries, insurance pricing analysts, underwriters, health benefits managers, data scientists in insurance

Typical data source: Policyholder records with age, BMI, smoking status, region, number of dependents, and annual medical charges billed

health insurancelife and health insuranceemployee benefitshealthcare consulting

What data do you need?

Dataset with 7 columns

age (numeric) sex (categorical) bmi (numeric) children (numeric) smoker (categorical) region (categorical) charges (numeric)

Minimum 30 rows

What's in the report?

Cornerstone #15 — GLM with interaction effects on insurance medical cost (3,178 votes)

📉

Distribution of Medical Charges

Distribution of medical charges showing right-skew

📊

Average Charges by Smoker Status

Average charges comparison between smokers and non-smokers

🔵

BMI vs Charges by Smoking Status

BMI vs charges scatter showing smoker interaction effect

📊

GLM Predictor Effects

GLM coefficient magnitudes showing predictor importance

📊

Average Charges by Region

Average medical costs by US geographic region

🔵

Actual vs Predicted Charges

Actual vs predicted charges showing model fit quality

🔵

Residuals vs Fitted Values

Residuals vs fitted values for GLM diagnostic assessment

📋

Descriptive Statistics

Descriptive statistics for all numeric variables

🤖

AI Insights

Plain-English interpretation — what the numbers mean, what's significant, and what to do next.

Related tools

Need something simpler? Diabetes Risk Drivers — When you need to identify which health and demographic factors are associated with disease risk rather than predict a continuous cost amount

Need more power? Cancer Classification — When you need to classify policyholders into discrete high/medium/low risk tiers using a classification model rather than predict their continuous cost

Similar: Price Drivers Geo, Happiness Regression

The Question This Answers

Actuarial Premium Pricing

Insurers upload policyholder demographics to build a transparent, auditable GLM that justifies premium tiers based on quantified risk factors — smoker status, BMI, age, and region — without black-box complexity.

Questions?

See our FAQ for details on pricing, data privacy, and how the analysis works. Every report includes a Methodology section showing the statistical test, assumptions checked, and diagnostics run.

Your data has more stories to tell

Run any analysis on your own data — validated R analyses, interactive reports, AI insights, and PDF export.

Try Free — No Credit Card
Powered by MCP Analytics