Ecommerce · Customers · Rfm Segmentation P1778698833
Executive Summary

Executive Summary

Key metrics from customer RFM segmentation

Number of Customers
941
Number of Clusters
4
Average Silhouette Score
0.2885
Clusters Found
4
Largest Segment Size
406
Smallest Segment Size
65
Overview

Analysis Overview

Analysis configuration and dataset characteristics

Interpretation

Analysis: Customer RFM Segmentation using K-Means clustering. Dataset contains 2000 transactions analyzed from 941 unique customers. Method: K-Means clustering on Recency, Frequency, and Monetary (RFM) metrics with 4 segments. Log transformation: enabled. Minimum transaction threshold: 1. RFM aggregation groups transactions by customer_id to compute purchase recency (days since last purchase), frequency (transaction count), and monetary value (total spending).

Data Preparation

Data Quality

Data filtering and quality metrics

Interpretation

Started with 2000 transaction records. Quality filtering removed 1059 rows (53.0%) with missing customer_id, missing dates, negative quantities, or zero unit prices, leaving 1444 clean transactions. Customer aggregation resulted in 941 unique customers with at least 1 transaction(s). Final dataset for clustering: 941 customers across 4 unique segments with silhouette-based quality validation.

Data Table

RFM Statistics

Central tendency and spread of recency, frequency, and monetary metrics across all customers

MetricMinMaxMeanMedianStd Dev
Recency (Days)1374142.9108111.7
Frequency (Transactions)1271.39511.573
Monetary Value0.19179031.2116.3587.86
Visualization

Cluster Size Distribution

Number and proportion of customers in each segment

Interpretation

Customers are distributed across 4 segments with 235 individuals per segment on average. Segment sizes range from 65 to 406 customers (6.9% to 43.1%). The distribution is skewed, requiring sized-specific strategies for targeted marketing campaigns.

Data Table

Segment Profiles

Average RFM characteristics and cluster quality (silhouette coefficient) for each segment

ClusterN CustomersMean RecencyMean MonetaryMean FrequencySilhouette Coeff
Segment 1406195.339.421.160.3084
Segment 26547.17124.74.740.232
Segment 3266175.95.851.060.3141
Segment 420426.0818.171.230.2335
Interpretation

Segment 1 has the highest recency (195.3 days), while Segment 4 shows the lowest recency (26.1 days), indicating most recent purchases. Monetary values range from 5.85 to 124.71 across segments. Cluster cohesion (silhouette: 0.232 to 0.314) suggests Segment 3 has the most cohesive customers, while Segment 2 contains more heterogeneous points.

Visualization

Frequency vs Monetary (by Segment)

Customer distribution in frequency-monetary space, colored by cluster membership

Interpretation

The scatter shows clear separation of segments in frequency-monetary space. Segment 2 contains the highest-spending customers (average monetary: 124.71), while other segments show lower spending profiles. Most customers cluster in the low-to-moderate frequency and monetary ranges, with occasional high-value outliers representing premium customers.

Visualization

Cluster Quality (Silhouette Scores)

Cohesion measure for each cluster (1=perfect separation, 0=borderline, <0=misclassified)

Interpretation

Silhouette scores range from 0.232 to 0.314, with an overall average of 0.272. 0 of 4 clusters show strong cohesion (>0.5), while 2 clusters contain borderline or overlapping points. Scores near 1 indicate tight, well-separated clusters suitable for targeting, while lower scores suggest segment boundaries may be ambiguous.

Data Table

Top Customers by Monetary Value

Highest-spending customers and their RFM characteristics by segment

ClusterCustomer IDRecency DaysMonetary ValueFrequency Count
Segment 1158386517901
Segment 2174507111554
Segment 2146463867.98
Segment 2149113686.127
Segment 214156335865
Segment 1163331245401
Segment 1130822553901
Segment 2127485294.826
Segment 21340818270.44
Segment 115061212244.81
Segment 214298532444
Segment 2140965218.210
Segment 112415144217.42
Segment 114608302041
Segment 112590212203.41
Segment 115189205194.12
Segment 116684310183.61
Segment 2124331180.94
Segment 113798941791
Segment 116003360175.21
Interpretation

Top 20 customers by spending are dominated by Segment 1 (11 of top 20), which likely represents your most valuable and retention-critical segment. These customers show spending from 175.20 to 1790.00 with varying recency (1 to 360 days) and transaction frequency (1 to 27). Prioritizing retention and upsell strategies for this segment can maximize customer lifetime value.

Visualization

Recency Distribution by Segment

Distribution of days since last purchase for each customer segment

Interpretation

Segment 4 shows the most recent purchase behavior (median: 24 days), while Segment 1 contains customers at higher risk of churn with longer recency (median: 193 days). Recency variation within segments (visible in box widths) reveals customer engagement patterns—tight boxes indicate consistent behavior, wide boxes suggest mixed engagement levels requiring segment-specific retention strategies.

Your data has more stories to tell. Run any analysis on your own data — 60+ validated R modules, interactive reports, AI insights, and PDF export. 2,000 free credits on signup.
Try Free — No Signup Sign Up Free

Report an Issue

Tell us what's wrong. You'll get a free re-run of this analysis so you can try again with different parameters. If the re-run still doesn't meet your expectations, we'll refund your credits.

Want to run this analysis on your own data? Upload CSV — Free Analysis See Pricing