SaaS Customer Lifetime Value

The CAC:LTV ratio is the most cited unit economics metric in SaaS. Yet most founders calculate LTV as "average revenue per customer times average lifespan" -- a formula that treats every customer as identical. Some customers pay for years. Others already placed their last payment and you do not know it yet. This analysis uses probabilistic modeling to predict what each individual customer is worth, which ones are still active, and where to focus your retention spend.

Why the Simple LTV Formula Fails

The back-of-napkin formula -- ARPU divided by churn rate -- gives you a single number for your entire business. If your ARPU is $80/month and monthly churn is 4%, you get LTV = $80 / 0.04 = $2,000. That number is fine for a pitch deck. It is useless for deciding which customers to invest in.

The problem is variation. Your $2,000 average hides a customer who pays $199/month and has been active for 2 years (actual value: $4,776 and counting) and a customer who paid $29 once and vanished (actual value: $29). Treating them the same leads to blanket retention campaigns that overspend on low-value accounts and underspend on high-value ones.

The 2026 LTV:CAC benchmark for B2B SaaS is 3.2:1 median, with elite companies targeting 4:1 or higher (Optifai, 2025). But that ratio is meaningless if your LTV number is an average that hides massive customer-level variation. You need per-customer predictions to know whether your best accounts justify more acquisition spend and whether your worst accounts are even worth retaining.

How Per-Customer CLV Prediction Works

This analysis uses the BG/NBD (Beta-Geometric / Negative Binomial Distribution) model -- the academic standard for customer lifetime value prediction. The idea is simple even if the math is not: the model looks at each customer's purchase history and estimates two things.

First, is the customer still "alive"? Not in the literal sense, but in the buying sense -- are they still in their purchasing cycle, or have they silently churned? A customer who bought 5 times last month is almost certainly alive. A customer who bought 5 times over 3 years and has not returned in 6 months is probably gone. The model quantifies this as a P(alive) score between 0 and 1.

Second, how many future transactions should you expect? The model weighs recency (when did they last pay?) against frequency (how often do they pay?) to predict future behavior. A companion model called Gamma-Gamma then estimates the average monetary value of each customer's transactions. Multiply expected future transactions by expected order value, and you get a predicted CLV for every person in your database.

The result is not one LTV number for the business. It is a ranked list of every customer with their individual predicted value, their probability of still being active, and their expected future transactions over the next 12 months.

What the Report Shows You

P(alive) probability matrix -- arguably the most actionable output. A heatmap showing which frequency-recency combinations correspond to active customers and which correspond to likely churners. Customers with P(alive) between 0.3 and 0.7 are uncertain -- they are the ones most worth a re-engagement email or a check-in call. Below 0.3, they are probably gone. Above 0.7, they are probably fine.

Top customers by predicted CLV -- a ranked list of your highest-value accounts by predicted future revenue. Each entry shows the customer identifier, predicted CLV, P(alive) score, and historical purchase stats. This is the list your retention team should work from. Losing any of these customers has an outsized impact on future revenue.

CLV distribution -- shows how predicted value is distributed across your customer base. In most businesses, the distribution is heavily right-skewed. Your top 20% of customers often account for 60-80% of predicted future revenue. The chart quantifies exactly how concentrated your value is and helps you decide how many customers are worth individual attention versus segment-level campaigns.

Customer segment summary -- groups customers into actionable categories based on model scores: Champions (high frequency, high recency, high CLV), At Risk (historically valuable but declining P(alive)), Hibernating (low recent activity), and New (recent first purchase, not enough data yet). Each segment shows size, average CLV, average P(alive), and total predicted revenue. Design a different retention action for each segment.

Expected transactions matrix -- a heatmap showing expected future transactions by frequency and recency. Customers in the upper-right (high frequency, recent purchase) are expected to keep buying. Customers in the lower-left are expected to buy rarely or never.

Model validation -- the data is split into a calibration period and a holdout period. The validation table shows predicted vs. actual transactions for each frequency group. When they track closely, you can trust the forward-looking estimates.

When to Use CLV Modeling

What Data Do You Need?

A CSV of your transaction or billing history with three required columns:

Required columns

Optional columns

For SaaS, each monthly subscription payment is a transaction row. If your billing system only records subscription start/end dates (not individual payments), generate a row per payment month before uploading.

Minimum: 50 unique customers with some repeat transactions and at least 90 days of history. Best results with 200+ customers and 6+ months. The model needs repeat purchase behavior -- datasets with mostly one-time buyers produce estimates with high uncertainty.

Common Mistakes

When to Use Something Else

References