If you need to use Machine Learning, keep it simple!

It often amuses me that in the era of ChatGPT (and despite the countless books I have in my library on AI/ML), I've never used any Deep Learning model for any of my client engagements or prior advanced analytics leadership roles.

For 99.95% of data problems in traditional (non-tech) companies, Deep Learning (and any of its derivations like LSTMs, GANs, CNNs, etc.) are overkill at best and a complete waste of time (or not applicable) at worst.

In Revenue Growth Analytics, we help answer the following types of questions to drive more Sales, Profit, or both:

  1. What are my optimal prices, and what is the price-value trade-off for my products? 

  2. What new markets/customer segments should we expand to?

  3. What product whitespace opportunities exist based on detailed market analysis?

  4. How can we improve Net Revenues and Gross Profits through Customer and Product Mix Management?

  5. Where are my most significant Customer retention, upsell and cross-sell opportunities to drive volume growth?

  6. What is the ROI of my Marketing Spend, and how can I optimize my Marketing Mix to drive greater Sales and Profit?

  7. Which prospective customers have the highest propensity to drive high Customer Lifetime Values?

  8. How should we adjust our sales and marketing efforts based on a segmentation of customer purchase patterns (to maximize profits)?

  9. How else can I leverage my data assets to drive incremental value for my Sales Teams?

  10. What internal analytics tools can I modify (or newly build) for our Customers to help them optimize their assortment, pricing, or purchasing, to drive customer stickiness and incremental sales?

To solve these problems, I mostly stick to the basics (think data munging / diagnostic analytics in Excel / R / Tableau / Power BI, mainly using middle-school math and plenty of domain expertise).

For more advanced work, I typically rely on the following:

  1. Linear Regression: for quick & dirty predictions (either as a baseline to compare things to or if the data is simple and I have enough domain expertise to know the relationship is primarily linear).

  2. Random Forest: for most prediction problems, including customer churn and price elasticity estimations. RF is always a good choice for me when my data is pretty complete, I have lots of features to work with, and when multi-collinearity is a problem (which is almost always the case for commercial analytics problems). Despite its computational intensity (although ideal for parallelization even on standard laptops), Random Forest is also straightforward to explain to executive stakeholders in a way they can conceptualize and understand.

  3. GBM (typically XGBoost): for churn, likelihood to purchase, and price elasticity problems. Use it less frequently than RF, typically when my data is sparse and prediction errors are substantially more costly to the business (also, when I have considerably more time to build the models). LightGBM is a faster but less accurate option. Also, much more work to explain to your stakeholders.

  4. Market Mix Modeling: to help companies understand the return on their marketing spend and help optimize budgets for greater Sales or Gross Profit return.

  5. Affinity Analysis (or Market Basket Analysis): to help with optimal B2B cross-selling and to understand the best pricing & promotional bundling opportunities.

  6. Cohort Analysis: to track customer behavior over time on dimensions such as retention, repeat purchase, or customer lifetime value. 

  7. Time Series Modeling (mostly the "prophet" package): using simple forms of TS to visualize seasonality or obtain optimal Fourier terms (continuous predictors in linear regressions instead of using countless dummy variables). TS modeling is also helpful for basic revenue or volume projections as inputs to Revenue or Profitability scenario analyses.


To recap, for most business pain points in traditional companies, more advanced ML is not needed, and foundational ML techniques like Linear Regression, Random Forest, Market Mix Modeling, Affinity Analysis, and some Time Series Analysis are more than enough. These methods are easy to explain to stakeholders and provide accurate enough output to make a decision, measure impact and move on.

Most of these techniques have been around for decades (and centuries in the case of linear Regression), and I suspect we'll still predominantly use these simple tools to solve critical business pain points for another few decades.

Subscribe to
Revology Analytics Insider

Want to stay abreast of the latest
Revenue Growth Analytics thought leadership by Revology?


Use the form below to subscribe to our newsletter.

Previous
Previous

Unlocking the Power of Revenue Growth Analytics for Sustainable Growth

Next
Next

The Business Analyst of Tomorrow