
With some free time on my hands in the midst of the COVID-19 pandemic, I decided to do some pro bono consulting work, helping a few e-commerce companies analyze their customer data. A common theme I encountered was that the companies were mostly interested in getting a list of their best customers so they could run a marketing campaign to boost revenue. My recommendation has always been that every customer segment has some valuable insight to offer and that the best customers depend on the company's immediate goal. In other words, the best customers are not always the ones we call high-value customers. In this article, I will highlight the various insights RFM segmentation can provide.

Segmentation using RFM Analysis

RFM analysis (RFM stands for Recency, Frequency and Monetary) is probably one of the easiest ways to segment customers, given that companies usually have customer purchase information readily available.


Here is a sample of the customer purchase history data needed for RFM analysis. With this data, each customer is scored on the RFM attributes on a scale of 1–5 (or 1–4 or 1–3, depending on how granular a view of purchase behavior you want), with 1 being the lowest and 5 the best score.

Sample Customer Purchase History Data for RFM Analysis

For example, a customer with a most recent purchase will have a score of 5 on Recency, whereas a customer who hasn’t purchased in a while will have a score of 1.
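Before any scoring can happen, the purchase history has to be rolled up into one row per customer. The sketch below is one minimal way to do that, not the exact process used for this article: the records, the `rfm_raw` helper and the `as_of` reference date are all made up for illustration.

```python
from collections import defaultdict
from datetime import date

# Hypothetical purchase history: (customer_id, purchase_date, amount).
purchases = [
    ("C1", date(2020, 1, 15), 40.0),
    ("C1", date(2020, 5, 2), 60.0),
    ("C2", date(2019, 8, 20), 25.0),
    ("C3", date(2020, 5, 28), 120.0),
    ("C3", date(2020, 4, 10), 80.0),
    ("C3", date(2020, 2, 1), 50.0),
]

def rfm_raw(purchases, as_of):
    """Return {customer_id: (days_since_last_purchase, n_purchases, total_spend)}."""
    last, count, spend = {}, defaultdict(int), defaultdict(float)
    for cust, day, amount in purchases:
        # Keep the most recent purchase date seen so far for this customer.
        last[cust] = max(last.get(cust, day), day)
        count[cust] += 1
        spend[cust] += amount
    return {c: ((as_of - last[c]).days, count[c], spend[c]) for c in last}

# as_of is an assumed analysis date; in practice it is the day the data was pulled.
raw = rfm_raw(purchases, as_of=date(2020, 6, 1))
```

Each raw value (days since last purchase, number of purchases, total spend) is then converted into a 1–5 score using criteria that fit the business.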


The important thing to note here is that the scoring criteria vary from business to business, and understanding customer purchase cycles helps us come up with them. For this sample data, I used recency scoring based on 2-month purchase periods. Once customers are scored on all three attributes, we can create an RFM score (the sum of the three scores) and an RFM category (when R=4, F=3, M=4, the RFM category is 434). You might have seen cases where the RFM score was used to segment customers. This method has some serious pitfalls, and it's better to use the RFM category instead.
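To make the difference between the RFM score and the RFM category concrete, here is a small sketch. The 60-day bin width is my stand-in for a "2-month purchase period", and the function names are illustrative only; real cut-offs should come from the business's own purchase cycle.

```python
def recency_score(days_since_last_purchase, period_days=60):
    """Score 5 for a purchase in the latest period, one point less per older period, minimum 1."""
    periods_ago = days_since_last_purchase // period_days
    return max(1, 5 - periods_ago)

def rfm_score(r, f, m):
    """Sum of the three attribute scores."""
    return r + f + m

def rfm_category(r, f, m):
    """Concatenation of the three scores, e.g. R=4, F=3, M=4 gives '434'."""
    return f"{r}{f}{m}"

# Two very different customers: a recent one-time big spender and a
# lapsed but formerly frequent big spender.
recent_one_timer = (5, 1, 5)
lapsed_regular = (1, 5, 5)
```

One likely pitfall of the summed score shows up here: both customers above get an RFM score of 11, while their RFM categories (515 and 155) keep the two behaviors apart.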

Using a 5-point scale could lead to as many as 125 RFM categories. The next step is to group these RFM categories into segments. Standard RFM analysis doesn't take into account other data points, such as the length of a customer's relationship with the business, but this is the time to look at the purchase data holistically when assigning segments to the RFM categories. For example, a customer with a long relationship with the business, recent and frequent purchases, but low monetary value has probably used promotional offers more often than a customer with similar recency and monetary value but fewer purchases and a shorter relationship with us. The first could be segmented as a Deal Seeker and the second as a Newcomer.

Customer Data after RFM Segmentation

In the above example, I have identified 6 segments with uniquely distinguishable purchase behaviors.


Dropouts: In other words, these are the lost customers. They became our customers through an initial promotional offer and either didn't come back or made only 1–2 subsequent purchases.

Early Enthusiasts: These customers stayed with the company longer than Dropouts, made frequent purchases and spent a lot of money. However, they were lost along the way. These will be the best customers if the objective is to reactivate lost customers.

Newcomers: As the name suggests, this segment consists of customers who are relatively new and active. Marketing efforts for this segment could aim to move them into loyalty programs or offer promotions that encourage them to try other products.

Deal Seekers: These customers have a relatively long relationship with the company and are currently active. They make frequent purchases but are usually on the lookout for promotional offers. These will be the best customers if you are running a campaign to boost sales.

Potential High Value Customers: These are newer customers than Deal Seekers but are less price-sensitive or less keen on promotional offers. In a shorter period of time, they have made frequent purchases and spent a lot of money with the company. Nudge them towards becoming High-Value Customers and sign them up for the loyalty program.

High-Value Customers: These are the most loyal customers. They are active, purchase frequently and have high monetary value. They could be the brand evangelists, and the company should focus on serving them well. They could also be the best customers to ask for feedback on new product launches, or to serve as early adopters and promoters.
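The six descriptions above can be turned into a rough rule-based assignment. The sketch below is only an illustration: the thresholds (a score of 4 or more counts as "high", 12 months counts as a "long" relationship) and the `assign_segment` function are my assumptions for this example, not the rules used to produce the segmented data shown earlier.

```python
def assign_segment(r, f, m, tenure_months):
    """Map R, F, M scores (1-5) and relationship length to one of the six segments.

    All thresholds are hypothetical and should be tuned to the business.
    """
    active = r >= 4
    frequent = f >= 4
    big_spender = m >= 4
    long_tenure = tenure_months >= 12

    if not active:
        # Lost customers: heavy past buyers versus those who barely got started.
        return "Early Enthusiasts" if frequent and big_spender else "Dropouts"
    if frequent and big_spender:
        return "High-Value Customers" if long_tenure else "Potential High Value Customers"
    if long_tenure:
        # Long-standing, active and frequent but low spend suggests promotion-driven buying.
        # Other long-tenure patterns are not covered by the six descriptions.
        return "Deal Seekers" if frequent else "Unassigned"
    return "Newcomers"
```

Combinations that fall outside the six descriptions are returned as "Unassigned" so they can be reviewed by hand rather than forced into a segment.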

Getting Insights from Visualization

Once the customer segments have been identified using RFM analysis, it is always a good idea to visualize these segments not only for a sanity check but also to derive actionable insights. I used Tableau to visualize the segments.

Average Revenue Per User (ARPU) Matrix for R&F Scores

The Recency and Frequency score matrix is usually a good chart for a sanity check. In general, monetary value increases as the R and F scores increase. A higher R score alone, however, may not lead to higher monetary value, because the business may have recently acquired new customers.
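I built the matrix in Tableau, but the underlying calculation is simple enough to sketch in a few lines. The customer tuples and the `arpu_matrix` helper below are invented for illustration; the idea is just to average revenue per customer within each (R, F) cell.

```python
from collections import defaultdict

# Hypothetical scored customers: (recency_score, frequency_score, total_revenue).
customers = [
    (5, 5, 900.0), (5, 5, 700.0), (5, 1, 60.0),
    (1, 1, 40.0), (1, 5, 500.0), (3, 3, 200.0), (3, 3, 300.0),
]

def arpu_matrix(customers):
    """Return {(r, f): average revenue per customer in that cell}."""
    totals, counts = defaultdict(float), defaultdict(int)
    for r, f, revenue in customers:
        totals[(r, f)] += revenue
        counts[(r, f)] += 1
    return {cell: totals[cell] / counts[cell] for cell in totals}

matrix = arpu_matrix(customers)

# Print the grid with F on the rows (high to low) and R on the columns (low to high).
# Empty cells are shown as 0.
for f in range(5, 0, -1):
    print(f"F={f}", [round(matrix.get((r, f), 0.0)) for r in range(1, 6)])
```

In this toy data, the (R=5, F=1) cell has low average revenue despite the top recency score, which is the new-customer effect described above.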

Treemap of Total Revenue per Customer Segment

The treemap shows how each customer segment is contributing to the total revenue and how much each customer within a given segment is contributing to the total revenue. In a way, this helps predict how consistent revenues will be given how many customers have churned and how active other segments are.
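The numbers behind a treemap like this are just revenue totals and shares per segment. As a rough sketch (the revenue figures and the `segment_revenue_share` helper are made up for this example):

```python
from collections import defaultdict

# Hypothetical (segment, customer_revenue) pairs, one per customer.
revenue_by_customer = [
    ("High-Value Customers", 900.0), ("High-Value Customers", 700.0),
    ("Deal Seekers", 200.0), ("Newcomers", 120.0), ("Dropouts", 80.0),
]

def segment_revenue_share(rows):
    """Return {segment: (total_revenue, share_of_overall_revenue)}."""
    totals = defaultdict(float)
    for segment, revenue in rows:
        totals[segment] += revenue
    grand_total = sum(totals.values())
    return {seg: (total, total / grand_total) for seg, total in totals.items()}

shares = segment_revenue_share(revenue_by_customer)
```

A business where most revenue sits in one small segment, as in this toy data, is more exposed to churn in that segment than one with revenue spread across several active segments.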

When there is geographic information for the customers, plotting them on a map will give some unique insights. One of the interesting cases I have seen is for an e-commerce company with some retail presence. Mapping the customer geographic data helped explain clusters of online shoppers around geographies where the company has retail store presence. Interestingly enough, these clusters of online customers were acquired organically. Accordingly, you can evaluate strategies to expand retail presence that could potentially boost e-commerce business.

Geographical Dispersion of the Customers by Segment (bubble size represents total sales by the customer)
Channel Performance in terms of Total Revenue and No. of Customers

Channel performance is another area where segmentation helps. By looking at the customer lifetime value (LTV) per segment, the portion of each segment attributable to a channel, and the acquisition cost associated with that channel, we can evaluate channel performance more accurately than we could without segmentation.
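Here is a rough sketch of that evaluation. The channel names, LTV figures, per-customer acquisition costs and the `channel_report` helper are all hypothetical; the point is only to show segment mix, average LTV and acquisition cost side by side for each channel.

```python
from collections import defaultdict

# Hypothetical customers: (acquisition_channel, segment, lifetime_value).
customers = [
    ("paid_search", "Dropouts", 30.0),
    ("paid_search", "Dropouts", 50.0),
    ("paid_search", "Deal Seekers", 200.0),
    ("referral", "High-Value Customers", 900.0),
    ("referral", "Newcomers", 120.0),
]
# Hypothetical acquisition cost per customer for each channel.
acquisition_cost = {"paid_search": 60.0, "referral": 20.0}

def channel_report(customers, acquisition_cost):
    """Return per-channel segment mix, average LTV and LTV-to-acquisition-cost ratio."""
    ltv = defaultdict(list)
    mix = defaultdict(lambda: defaultdict(int))
    for channel, segment, value in customers:
        ltv[channel].append(value)
        mix[channel][segment] += 1
    report = {}
    for channel, values in ltv.items():
        avg_ltv = sum(values) / len(values)
        report[channel] = {
            "segment_mix": dict(mix[channel]),
            "avg_ltv": avg_ltv,
            "ltv_to_cac": avg_ltv / acquisition_cost[channel],
        }
    return report

report = channel_report(customers, acquisition_cost)
```

A channel that brings in many customers but mostly Dropouts can look healthy on customer counts alone; the segment mix and the LTV-to-cost ratio together give a fairer picture.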

In Conclusion

Startups usually tend to be growth-focused rather than retention-focused. Even though they are resource-constrained, startups are better off investing some of their limited resources in understanding their existing customers so they can devise effective retention strategies that could, in turn, help them acquire new customers. After all, it may end up being much cheaper to retain an existing customer than to acquire a new one. Word of mouth is still a major customer acquisition channel, and you want your customers to be the evangelists for your product.

Bullseye chart of Customer Segments

To return to the quest for the best customers: it may seem obvious that high-value customers are the best customers. But when there is a specific goal at hand (say, running a marketing campaign to boost revenue), we should dig a little deeper to find out who the best customers are for that goal, that is, which customer segment is at the bullseye.

Translated from: https://towardsdatascience.com/how-to-identify-the-best-customers-using-rfm-based-segmentation-a0a16c34a859



