Uplift Modeling: Making Predictive Models Actionable

Mike Thurber

April 14, 2017


Predictive models typically estimate the likelihood of future events, such as whether it will rain tomorrow or which customers are most likely to “churn” by cancelling their phone contract. In the case of the weather, we do not expect to change it; we just want to know how to adapt. However, the goal for most use cases is to be more proactive: we want to understand what action to take to change the outcome in a favorable way. In these cases prescriptive, not just predictive, analytics is required. The return on investment comes directly from knowing the impact of alternative treatments. By knowing the impact of each treatment, resources can be targeted where they will be most effective and withheld where they will have a negligible or, worse, a negative effect. This great objective of data science, to intelligently drive day-to-day business decisions based on data, is the purview of uplift modeling. This blog post explains what uplift modeling is and why it can be much better than directly modeling the outcome.

There are many applications of predictive modeling where the prediction serves only as advice to a human decision maker, and no action is taken automatically from the model result. An example is workload prioritization for customer service or fraud investigation teams. But where we can, we aim to influence the outcome one way or another. Will a live agent offering a phone customer a contract upgrade decrease their likelihood to churn? Will mailing a flyer to a fundraising prospect improve their chances of making a donation? Will offering a moving bonus increase the likelihood that a desirable candidate will accept our employment offer?

[Figure: Uplift modeling examples]

Uplift Is Not Directly Measurable

Uplift modeling is also known as incremental modeling, treatment effects modeling, true lift modeling, or net modeling. Uplift is the increase in the likelihood of the outcome with the treatment as compared to without the treatment. We can’t observe this difference, or causal effect, directly for any individual, because each individual either receives the treatment or does not; we must infer it from an experiment. It is very helpful to visualize a 2x2 matrix with four categories of people (say) to be classified: (a) Persuadable, (b) Sure Thing, (c) Do-Not-Disturb, and (d) Lost Cause, as shown in the figure below. Persuadables respond favorably only if treated, Sure Things respond favorably either way, Do-Not-Disturbs respond favorably only if left alone, and Lost Causes do not respond favorably either way.

[Figure: Uplift modeling 2x2 classification matrix]

Uplift modeling’s objective is to find Persuadables. Then, you can target resources on the cases that are likely to be positively impacted by the treatment.
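As a minimal sketch of how uplift can be inferred from an experiment, the Python below estimates it per customer segment as the outcome rate in the treated group minus the outcome rate in the control group. It assumes treatment was assigned at random. The segment names, the counts, and the helper names `estimate_uplift` and `make_records` are made up for illustration and do not come from any real study or library.

```python
from collections import defaultdict


def estimate_uplift(records):
    """Estimate uplift per segment from randomized-experiment records.

    Each record is (segment, treated, outcome). Uplift is the outcome rate
    in the treated group minus the outcome rate in the control group.
    """
    # counts[segment][treated] = [number of positive outcomes, group size]
    counts = defaultdict(lambda: {True: [0, 0], False: [0, 0]})
    for segment, treated, outcome in records:
        cell = counts[segment][bool(treated)]
        cell[0] += int(outcome)
        cell[1] += 1

    uplift = {}
    for segment, cells in counts.items():
        treated_pos, treated_n = cells[True]
        control_pos, control_n = cells[False]
        if treated_n == 0 or control_n == 0:
            continue  # no comparison possible without both groups
        uplift[segment] = treated_pos / treated_n - control_pos / control_n
    return uplift


def make_records(segment, treated, n_positive, n_total):
    """Build illustrative records: n_positive successes out of n_total."""
    return [(segment, treated, i < n_positive) for i in range(n_total)]


# Hypothetical experiment: outcome = customer stayed (did not churn).
records = (
    make_records("month_to_month", True, 60, 100)
    + make_records("month_to_month", False, 40, 100)
    + make_records("annual", True, 85, 100)
    + make_records("annual", False, 90, 100)
)

uplift_by_segment = estimate_uplift(records)
for segment, value in sorted(uplift_by_segment.items()):
    print(f"{segment}: estimated uplift = {value:+.2f}")
```

In this made-up data the treatment appears to help the month-to-month segment and to slightly hurt the annual segment. A real uplift model would estimate the same difference at a finer grain, typically from many predictors rather than a single segment label.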

Uplift Modeling Benefits

Uplift modeling can apply to any modeled outcome, human or not, such as fertilizer on crop yield, a drug on a patient’s health, retail loyalty programs on profit, or messages in political campaigns. Whenever treatment resources are limited or there is a possibility of a negative treatment effect, uplift analysis is an effective tool. Uplift analysis models the effect of treatment, rather than the outcome directly. If we know how likely something is already, and how much a treatment is likely to change that likelihood, we can classify prospects as either “sure things”, “persuadables”, “lost causes”, or “do not disturbs”. This is extremely valuable as a way to get the most out of one’s analytics investment.
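The classification described above can be sketched as follows, assuming we already have two estimated probabilities for each prospect: the likelihood of the favorable outcome without treatment and with treatment. The function name `classify_prospect` and the thresholds `min_effect` and `high_baseline` are hypothetical choices for illustration; in practice they would be set from the cost of the treatment and the value of the outcome.

```python
def classify_prospect(p_control, p_treated, min_effect=0.05, high_baseline=0.5):
    """Assign a prospect to one of the four uplift categories.

    p_control: estimated probability of the favorable outcome without treatment.
    p_treated: estimated probability of the favorable outcome with treatment.
    min_effect and high_baseline are illustrative thresholds (assumptions).
    """
    uplift = p_treated - p_control
    if uplift >= min_effect:
        return "persuadable"      # treatment meaningfully helps
    if uplift <= -min_effect:
        return "do not disturb"   # treatment meaningfully hurts
    # Treatment makes little difference; the baseline decides the label.
    return "sure thing" if p_control >= high_baseline else "lost cause"


# Hypothetical prospects: (name, p_control, p_treated)
prospects = [
    ("A", 0.20, 0.50),
    ("B", 0.90, 0.91),
    ("C", 0.60, 0.30),
    ("D", 0.05, 0.06),
]

labels = {name: classify_prospect(pc, pt) for name, pc, pt in prospects}
for name, label in labels.items():
    print(name, label)
```

Only the prospects labeled "persuadable" justify spending treatment resources under these assumptions; the "do not disturb" group is the one to actively leave alone.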

Previously published on Predictive Analytics Times.


About the Author

Mike Thurber is an analysis professional who listens carefully to partners to master an organization’s objectives and challenges, and he has a passion for extracting relevant and valuable insights from available data in a collaborative setting. As a trusted data science consultant, he clearly communicates deep analytical insights to managers and leaders regarding decision alternatives to help them improve key outcomes. He has 20 years of experience modeling causal relationships between potential actions and desired outcomes. He has 30 years of experience procuring and transforming historical data for descriptive analysis, statistical testing, predictive modeling, and deep learning. As a Principal Scientist at Elder Research, a highly regarded data science consultancy, he has delivered a broad range of advanced analytic solutions across many industries, as well as training, mentoring, and leading other data scientists. Mike’s work has ranged from estimating the profitability, risk, and responsiveness of credit card prospects, to identifying which infants will be negatively impacted by a Cesarean delivery. He has gleaned insights on how complex consumer choices impact sales, modeled individual healthcare providers’ rank in achieving desired patient outcomes, and calculated fraud risk and identified emerging fraud types. His projects have shown how call center interactions affect customer retention, measured the effect of targeted messages in political campaigns, forecasted debt recoveries at the account-holder level, modeled maintenance events on natural gas wells, and predicted propensity of past benefactors to make voluntary donations. Finally, especially in the last five years, Mike has been teaching principles and best practices of data science to a broad professional audience of emerging and experienced data scientists, with an emphasis on predictive and prescriptive modeling in an AI setting.