Upload your data and get a complete red wine quality drivers report. Free.
or click to browse · max 3 MB
Running red wine quality drivers analysis...
Sent to — interactive charts, statistical results, R code, and AI insights.
Analyze another fileRandom forest feature importance combined with multiple linear regression to identify and rank physicochemical drivers of red wine quality scores
Use when you have a numeric quality or rating outcome and want to rank predictors by importance and understand their directional effect
Do not use for time series data, categorical outcomes with more than a few levels, or datasets with fewer than 100 rows
Built for: Enologists and winemakers, food scientists, quality control managers, beverage R&D analysts, wine researchers and viticulture academics
Typical data source: CSV with 12 physicochemical measurements per wine sample (fixed acidity, volatile acidity, citric acid, residual sugar, chlorides, free/total SO2, density, pH, sulphates, alcohol) plus expert quality scores (3-8)
Dataset with 12 columns
Minimum 30 rows
Cornerstone #14 — regression + classification on red wine quality (3,237 votes)
Distribution of wine quality scores from 3 to 8, showing whether mid-range scores dominate
Pairwise correlation matrix showing which features correlate most strongly with quality and with each other
Random forest feature importance ranking all 11 physicochemical features by predictive power
Box plots of alcohol content by quality score, showing whether higher alcohol links to higher quality
Box plots of volatile acidity by quality score, confirming whether it is a negative quality driver
Standardized linear regression coefficients showing direction and magnitude of each feature effect on quality
Scatter plot of alcohol vs volatile acidity colored by quality tier, showing visual separation between high and low quality wines
Per-quality-score summary table with mean alcohol, volatile acidity, sulphates, and citric acid
Plain-English interpretation — what the numbers mean, what's significant, and what to do next.
Need something simpler? Happiness Regression — When you only need linear regression coefficients without the random forest layer - simpler for smaller datasets or when the relationship between variables and outcome is expected to be linear
Need more power? Cancer Classification — When you need a full binary classification model with probability outputs, ROC curves, and precision/recall metrics rather than a regression on a continuous quality score
Similar: Diabetes Risk Drivers, Churn Drivers, Attrition Drivers
Upload your wine chemistry dataset with quality scores to get a ranked feature importance chart from random forest and standardized regression coefficients, showing which of the 11 properties matter most and in which direction.
See our FAQ for details on pricing, data privacy, and how the analysis works. Every report includes a Methodology section showing the statistical test, assumptions checked, and diagnostics run.
Run any analysis on your own data — validated R analyses, interactive reports, AI insights, and PDF export.
Try Free — No Credit Card