Blog Header 1.jpg


Sort by Topic

Monte Carlo Simulation - a Venerable History

Peter Bruce

November 16, 2018

BLOG_Monte Carlo Simulation

One of the most consequential and valuable analytical tools in business is simulation, which helps us make decisions in the face of uncertainty.

Read More

Simple, Attractive, and Wrong: An Introduction to Linearity Bias

Will Goodrum, Ph.D.

November 2, 2018

BLOG_Introduction to Linearity Bias

This blog is 3rd in a series of short posts where we explore common biases that can impair analytics projects.

Read More

Top 3 Objectives Before Starting an Analytics Project

Gerhard Pilcher

October 26, 2018

BLOG_Top 3 Objectives Before Starting an Analytics ProjectUnderstanding the organization’s business objectives and requirements, converting this knowledge into a definition of a problem, and developing a preliminary plan to solve that problem is crucial to the successful application of analytics and machine learning. In order to construct a successful model, the data scientist must understand how the business functions and how it will use the data. Even the most technologically advanced analytics model will produce trivial and possibly misleading results if it is disconnected from the purposes and goals of the business.

Read More

Machine Learning: It is a Mistake to Lack Relevant Data

John Elder, Ph.D.

October 19, 2018

BLOG_It's a Mistake to Lack Relevant DataIn his Top 10 Data Mining Mistakes Dr. John Elder shares lessons learned from more than 20 years of data science consulting experience. Avoiding these mistakes are cornerstones to any successful analytics project. In this blog about Mistake #0 you will learn that the less probable the interesting event is, the more data it takes to obtain enough to generalize a model to unseen cases, and why some projects probably should not proceed until enough critical data is gathered to make them worthwhile.

Read More

Finding Fraud When No Cases are Known

Carlos Blancarte

October 12, 2018

BLOG_Finding Fraud When No Cases are Known

Fraud detection is about finding needles in haystacks and requires reliably labeled instances of fraudulent (needle) and non-fraudulent (straw) behavior. A predictive model can be trained using these labels to learn the underlying patterns in the input variables that best separate fraud from non-fraud cases, and thereby estimate the fraud-likeness of any future case. Typically, the interesting cases are very scarce, in which case we might have to carefully up-sample the rare class and/or down-sample the abundant class to help the model pay enough attention to the rare class to be useful. But what do we do when labels are not just rare, but are completely absent?

Read More

Hiring a Data Analytics Consultant

Jeff Deal

October 5, 2018

BLOG_Hiring Data Analytics Consultants-1In the earliest days of data analytics, our new clients would typically say, “Solve this problem for us.” As they saw the enduring power of analytics, their request then became, “We want to launch our own data analytics capability. Will you help us set it up?” However, growing an analytics capability from scratch is a huge challenge, and today more companies appreciate its difficulty. 

Read More

Who Invented the Null Hypothesis?

Peter Bruce

September 28, 2018

BLOG_Who Invented the Null Hypothesis

For many students, statistics is a troublesome subject, and the root of that trouble can be traced to the concept of the null hypothesis. In these days of big data, machine learning, and predictive analytics, formal hypothesis testing has receded in relative importance. Nonetheless, it retains considerable inertia and ability to cause difficulty - even in data science circles.

Read More

Workshops, Workshops, Workshops!

Paul Derstine

September 14, 2018

BLOG_PAWgov Workshops

Predictive Analytics World for Government, the premier analytics and AI conference for government, starts next week. Keynote speakers include David Williams, USPS Board of Governors, Tom Davenport, a world-renowned analytics thought leader, author, and industry expert, and Dr. John Elder, Chairman and Founder of Elder Research, author and analytics practitioner. Elder Research will teach three workshops at the conference, offering insight on Machine Learning Methods, Deadly Analytics Mistakes, and Data Science for Managers.

Read More

Fluency in The Language of Data Models

Michael Lieberman

September 7, 2018

BLOG_Fluency in The Language of Data Models

My job as a data scientist and research strategist is getting easier. Over the past 50 years, statisticians have developed a number of practical models that are highly effective to explain consumer patterns and predict consumer behavior. As new forms of computing power and information technology provide every increasing descriptions of individual-level purchasing tendencies, these models offer great value for business managers.

Read More

Data Science, Statistics, and the "Method of Moments"

Peter Bruce

August 31, 2018


BLOG_Data Science, Statistics, and the Method of Moments

I got my introduction to statistics via resampling, working with Julian Simon, an early resampling pioneer. Demonstrating this "brute force" computer method to my father, I saw that he was vaguely offended by its inelegance.

Read More

Are We Using Machine Learning?

Gerhard Pilcher

August 24, 2018

BLOG_Are We Using Machine Learning

In the midst of a recent engagement an executive suddenly asked, “Are we using Machine Learning?”. This caught us off-guard; working in the field for many years, we use the “learning sciences” virtually every day to solve hard problems. Machine Learning (ML), Data Science (DS) and Artificial Intelligence (AI) are exciting and very powerful; still, we’re happy to use conventional techniques whenever they’re the best choice to solve the client’s challenge. 

Read More

Using Machine Learning to Predict Parkinson’s Disease

Jennifer Schaff, PhD

August 17, 2018

BLOG_Parkinson’s Test Recommendation Engine

Recent research supported by the Michael J. Fox Foundation (MJFF) (and other benefactors) collected multifaceted data sets from patients with Parkinson’s Disease. They wanted to determine which medical test, or combination of tests, best predicts Parkinson’s disease.

Read More

It is a Mistake to Ask the Wrong Questions

John Elder, Ph.D.

August 10, 2018

BLOG_It is a Mistake to Ask the Wrong QuestionsIn his Top 10 Data Mining Mistakes Dr. John Elder shares lessons learned from more than 20 years of data science consulting experience. Avoiding these mistakes are cornerstones to any successful analytics project. In this blog about Mistake #3 you will learn why it is very important to have the right project goal; that is, to aim at the right target; and even with the right project goal it is essential to also have an appropriate model goal.

Read More

Automating Demand Forecasting with Machine Learning

Will Goodrum, Ph.D.

August 3, 2018

BLOG_Automating Demand Forecasting

Elder Research implemented an automated framework for time-series forecasting at a major logistics company. Our system, combining R and Apache Spark™, produces 35 million forecasts in under one hour, and selects the optimal time-series forecast algorithm in each of three forecasting windows. Forecast results from our framework were 88% accurate at a four-week horizon.

Read More

3 Myths About the Normal Distribution

Peter Bruce

July 27, 2018

BLOG_3 Myths About the Normal Distribution

Is the Normal Curve Normal?

I saw an article recently that referred to the normal curve as the data scientist's best friend, and it is certainly true that the normal distribution is ubiquitous in classical statistical theory. Still, it's overrated.

Read More