Overview

Analysis Overview

RFM Segmentation Configuration

Analysis overview and configuration

Configuration

Analysis TypeRfm
CompanyDemo E-Commerce Store
ObjectiveSegment customers by purchase behavior to prioritize marketing spend
Analysis Date2026-03-14
Processing Idrfm_test_20260314_215957
Total Observations4045

Module Parameters

ParameterValue_row
analysis_date2024-01-01analysis_date
n_bins5n_bins
top_n_customers20top_n_customers
Rfm analysis for Demo E-Commerce Store

Interpretation

Purpose

This RFM (Recency, Frequency, Monetary) segmentation analysis evaluates 300 customers from a demo e-commerce store to identify purchase behavior patterns and segment them for targeted marketing investment. The analysis processes 4,045 transaction records with perfect data retention, enabling the business to prioritize marketing spend toward high-value customer groups.

Key Findings

  • Champions Segment: 25.7% of customers generate 69.1% of total revenue ($443,747), demonstrating extreme concentration of value in a small, highly engaged cohort
  • At-Risk Customers: 15.3% of the base (46 customers) show declining engagement with average recency of 538 days, representing significant churn risk
  • Average Customer Frequency: 6.75 purchases per customer indicates moderate repeat purchase behavior across the base
  • Score Distribution: Perfectly balanced quintile distribution (20% in each score tier) confirms proper segmentation methodology

Interpretation

The analysis reveals a highly skewed customer value distribution typical of e-commerce: one-quarter of customers drive nearly 70% of revenue. The 10-segment taxonomy (Champions, Loyal Customers, Lost, Potential Loyalists, Need Attention, At Risk, New Customers, Promising, About to Sleep, Hibernating) provides granular targeting capability. The balanced R

Data Preparation

Data Preprocessing

Transaction Data Quality

Data preprocessing and column mapping

Data Quality

Initial Rows4045
Final Rows4045
Rows Removed0
Retention Rate100

Data Quality

MetricValue
Initial Rows4,045
Final Rows4,045
Rows Removed0
Retention Rate100%
Processed 4,045 observations, retained 4,045 (100.0%) after cleaning

Interpretation

Purpose

This section documents the data cleaning and preparation phase for the RFM segmentation analysis. Perfect retention (100%) indicates that all 4,045 transaction records passed validation checks and were successfully processed into the final 300-customer dataset used for segmentation.

Key Findings

  • Retention Rate: 100% (4,045 rows preserved) - No records were excluded due to missing customer IDs, dates, or invalid revenue values, suggesting high data quality in the source dataset
  • Rows Removed: 0 - The absence of data loss indicates either exceptionally clean input data or lenient validation thresholds
  • No Train/Test Split: Analysis uses the complete dataset for segmentation rather than holdout validation, appropriate for descriptive RFM clustering but limits predictive reliability assessment

Interpretation

The perfect retention rate supports confidence in the segmentation results, as no systematic data quality issues forced exclusion of customer records. However, the lack of train/test validation means model performance cannot be independently verified. The 4,045 raw transactions were aggregated into 300 unique customers with RFM metrics, enabling the quintile-based scoring that produced the 10 customer segments.

Context

RFM analysis is inherently descriptive rather than predictive, so absence of train/test splits is standard practice. The assumption that all retained data accurately represents customer behavior depends

Executive Summary

Executive Summary

Key Findings & Recommendations

Key Metrics

n_customers
300
champions_pct
25.7
at_risk_pct
15.3
total_revenue
642448.48
n_segments_found
10

Key Findings

findingvalue
Total Customers Analyzed300
Champion Customers77 (25.7%)
At-Risk Customers46 (15.3%)
Lost / Hibernating56 (18.7%)
Champion Revenue Share69.1% of $642,448
Average Customer Spend$2,141
Average Purchase Frequency6.8
Segments Identified10

Summary

Bottom Line: 300 customers segmented into 10 groups. Champions (25.7% of customers) generate 69.1% of revenue — protecting this segment is the highest-priority action.

Priority Actions:
Champions (77 customers): Launch referral program, offer VIP perks, early access to new products
At Risk (46 customers): Win-back campaign with 15-20% discount, personalized email within 7 days
New Customers (14 customers): Onboarding sequence to encourage 2nd purchase within 30 days
Lost / Hibernating (56 customers): Low-cost re-engagement or sunset campaign

Recommendation: Focus marketing budget on protecting Champions and converting At Risk customers. A 10% improvement in Champion retention protects approximately $44,375 in annual revenue.

Interpretation

Executive Summary: RFM Customer Segmentation Analysis

Purpose

This section synthesizes the RFM segmentation results to assess whether the analysis achieved its objective of segmenting customers by purchase behavior to prioritize marketing spend. The findings reveal a highly concentrated revenue distribution that directly informs budget allocation strategy.

Key Findings

  • Champion Concentration: 25.7% of customers (77 individuals) generate 69.1% of total revenue ($443,747)—a 2.7x revenue-to-customer ratio advantage
  • At-Risk Segment: 15.3% of customers (46 individuals) represent $51,898 in revenue but show declining engagement (avg. recency: 538 days)
  • Revenue Disparity: The top 20 customers alone account for approximately $239,000+ in lifetime value, while 12.3% of the base (37 hibernating customers) contributes only 2.3% of revenue
  • Segment Diversity: 10 distinct segments identified with RFM scores ranging from 3.95 (hibernating) to 14.06 (champions), enabling precise targeting

Interpretation

The analysis successfully achieved its segmentation objective, revealing extreme customer value concentration. Champions demonstrate recent, frequent, and high-value purchasing patterns (avg. 14.79 purchases, $5

Figure 4

RFM Score Heatmap

Recency x Frequency Colored by Average Spend

Recency x Frequency score heatmap colored by average monetary value — the canonical RFM visualization showing where high-value customers cluster

Interpretation

Purpose

This heatmap reveals where high-value customers concentrate across recency and frequency dimensions. It identifies your Champions (top-right: recent + frequent buyers) and dormant high-spenders (bottom-left: infrequent + distant), enabling targeted retention and reactivation strategies aligned with your segmentation objective.

Key Findings

  • Champion Concentration (R5×F5): $8,095 average spend with 43 customers—the brightest cell, representing your most valuable segment
  • Recency-Frequency Correlation: Spending increases sharply along the diagonal from bottom-left to top-right, confirming that recent and frequent buyers spend significantly more
  • Dormant High-Spenders Gap: R1×F5 cell shows zero customers, indicating no truly lapsed frequent buyers—a positive sign of retention
  • Volume Distribution: Highest customer density (20) in R1×F1 ($282 avg), showing many inactive, infrequent buyers requiring reactivation

Interpretation

The heatmap confirms that recency and frequency act synergistically: customers scoring high on both dimensions generate 28× more revenue ($8,095 vs. $282) than low scorers. The absence of high-frequency, low-recency customers suggests your business successfully retains engaged buyers. However, the large R

Figure 5

Customer Segment Distribution

Customer Count per Named Segment

Horizontal bar chart showing customer count per named segment, ordered by average RFM score

Interpretation

Purpose

This section reveals how the 300-customer base is distributed across 10 distinct behavioral segments, with emphasis on identifying high-value and at-risk populations. Understanding segment distribution is critical for prioritizing marketing spend and resource allocation—the core objective of this RFM analysis.

Key Findings

  • Champions Dominance: 77 customers (25.7%) represent the elite segment with the highest RFM scores (14.06 avg) and generate 69.1% of total revenue ($443.7K), demonstrating extreme concentration of value
  • At-Risk Population: 46 customers (15.3%) show concerning recency decay (537.9 avg days) despite moderate frequency and spending, representing significant revenue leakage risk
  • Lost Segment: 56 customers (18.7%) combined across "Lost" and "Hibernating" categories indicate substantial customer attrition, with minimal recent engagement and low monetary contribution

Interpretation

The distribution reveals a classic Pareto pattern: one-quarter of customers generate nearly 70% of revenue. The 15.3% at-risk segment is particularly significant because these customers historically demonstrated strong purchase behavior but have become dormant—they represent recoverable revenue. Conversely, the 18.7% lost/hibernating segment reflects customers who have already churned and require different engagement strategies than those still showing

Figure 6

Customer Value Map

Recency vs Frequency, Bubble Size = Spend

Scatter plot of individual customers by Recency Score vs Frequency Score, with bubble size proportional to total spend

Interpretation

Purpose

This scatter plot visualizes the distribution of 300 individual customers across the RFM space, revealing where each customer sits relative to recency and purchase frequency. It enables identification of customer concentration patterns and highlights which segments occupy high-value positions (top-right quadrant) versus at-risk zones (bottom-left), directly supporting the objective to prioritize marketing spend by customer behavior.

Key Findings

  • Average Recency: 321.1 days since last purchase—indicating a customer base with moderate engagement gaps, with significant variation across the portfolio
  • Average Frequency: 6.75 orders—reflecting moderate repeat purchase behavior, though distribution is wide (range 1–30 orders)
  • Average Monetary Value: $2,141.49—heavily right-skewed (median $998.50), showing a small number of high-value customers drive disproportionate revenue
  • Segment Concentration: Champions (25.7%) cluster in the top-right with large bubbles; At Risk (15.3%) and Hibernating (12.3%) occupy lower quadrants with smaller bubbles

Interpretation

The bubble chart reveals a classic Pareto distribution: a concentrated elite of high-frequency, recent purchasers (Champions) generates 69.1% of revenue despite representing only 25.7% of customers. The median monetary value ($998

Figure 7

Revenue by Segment

Proportional Revenue Contribution per Segment

Treemap showing revenue contribution of each customer segment - larger rectangles represent segments generating more revenue

Interpretation

Purpose

This treemap visualizes revenue concentration across customer segments, answering which segments drive the most business value. It reveals the critical insight that customer value is highly skewed—a small proportion of customers generates the majority of revenue. Understanding this distribution is essential for prioritizing marketing spend and retention efforts toward maximum financial impact.

Key Findings

  • Champion Revenue Concentration: $443,747 (69.1% of total) generated by only 77 customers (25.7% of base) — a 2.7x revenue-to-customer ratio advantage
  • At Risk Segment: 46 customers (15.3%) contribute only $51,898 (8.1%), indicating significant revenue leakage risk
  • Long Tail Effect: The remaining 7 segments collectively represent 22.9% of revenue across 177 customers, showing diminishing returns below Champions tier

Interpretation

The data demonstrates classic Pareto distribution in customer value. Champions are 2.7× more valuable per capita than the average customer, while segments like Hibernating and New Customers contribute minimally despite non-trivial customer counts. This concentration explains why retention strategies focused on Champions yield higher ROI than acquisition or reactivation efforts targeting lower-value segments. The At Risk segment represents the second-priority opportunity—protecting existing mid-tier revenue before it deteriorates.

Context

This

Table 8

Top Customers

Highest RFM Score Customers

Top customers ranked by combined RFM score, showing individual recency, frequency, and monetary metrics

customer_idrecency_daysfrequencymonetary_valuerecency_scorefrequency_scoremonetary_scorerfm_scoresegment
CUST_018741301.547e+0455515Champions
CUST_013412261.457e+0455515Champions
CUST_001671281.399e+0455515Champions
CUST_022461271.37e+0455515Champions
CUST_018113291.305e+0455515Champions
CUST_002422271.239e+0455515Champions
CUST_012827271.227e+0455515Champions
CUST_002348301.204e+0455515Champions
CUST_004914261.189e+0455515Champions
CUST_000129271.18e+0455515Champions
CUST_000210231.166e+0455515Champions
CUST_001354241.166e+0455515Champions
CUST_020757281.153e+0455515Champions
CUST_014847281.116e+0455515Champions
CUST_001727231.105e+0455515Champions
CUST_004463231.104e+0455515Champions
CUST_010526221.099e+0455515Champions
CUST_026960261.041e+0455515Champions
CUST_00393226963555515Champions
CUST_01064722961355515Champions

Interpretation

Purpose

This section identifies the 20 individual customers with the highest combined RFM scores (perfect score of 15), representing the most valuable segment within your customer base. These customers drive disproportionate revenue and engagement, making them critical for understanding where your business value concentrates and informing targeted retention strategies.

Key Findings

  • Top 20 Customers: All 20 highest-scoring individuals are classified as Champions (RFM score = 15), representing 26% of the 77-customer Champion segment
  • Revenue Concentration: These 20 customers contribute significantly to the 69.1% of total revenue generated by all Champions combined
  • Perfect RFM Profile: Each top customer scores 5 across all three dimensions—recency (recent purchases), frequency (22–30 transactions), and monetary value ($9,613–$15,467)
  • Engagement Pattern: Average recency of ~38 days indicates consistent, recent purchasing behavior across the entire top-20 cohort

Interpretation

The concentration of perfect RFM scores among these 20 customers demonstrates that your highest-value segment exhibits uniform excellence across all behavioral dimensions. This uniformity—rather than variation—suggests these customers represent a stable, predictable revenue base with minimal churn risk. Their recent activity and high transaction frequency indicate strong ongoing engagement, validating the RF

Table 9

Score Distribution

R, F, M Score Distribution Across Customers

Distribution of Recency, Frequency, and Monetary scores across all customers

scorecount_rcount_fcount_mpct_rpct_fpct_m
1606060202020
2606060202020
3606060202020
4606060202020
5606060202020

Interpretation

Purpose

This section reveals how Recency, Frequency, and Monetary scores distribute across your 300-customer base. A perfectly balanced quintile distribution would show 20% of customers in each score tier (1-5). This distribution is foundational to understanding whether your customer base skews toward recent, frequent, high-value purchasers or contains significant proportions of dormant or low-engagement segments.

Key Findings

  • Perfect Quintile Balance: The score_distribution table shows exactly 60 customers (20%) in each score tier for Recency, Frequency, and Monetary dimensions, indicating a mathematically balanced segmentation
  • No Skew Detected: Equal distribution across all five tiers means the RFM algorithm successfully partitioned customers into equally-sized engagement groups
  • Alignment with Segment Data: This uniform distribution supports the validity of the 10 segments identified, where Champions (25.7%) and At Risk (15.3%) emerge from the underlying score combinations rather than from imbalanced raw distributions

Interpretation

The perfectly balanced distribution confirms the quintile-based RFM methodology is functioning as designed. Rather than revealing natural clustering in customer behavior, this equal split reflects the algorithmic approach of ranking customers into fixed percentile groups. The segment diversity (Champions through Hibernating) therefore emerges from combinations of R, F, and M scores

Want to run this analysis on your own data? Upload CSV — Free Analysis See Pricing