1. Gradient boosting (#XGBoost or LGBM) is state of the art in the real world. I doubt they put much effort into tuning their benchmark models, so don't believe the claims of higher accuracy.
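For reference, this is roughly the level of tuning I'd expect from a fair boosting baseline. A rough sketch only: the search space and the synthetic dataset are my own placeholders, not anything from the paper.

```python
from lightgbm import LGBMClassifier
from scipy.stats import randint, uniform
from sklearn.datasets import make_classification
from sklearn.model_selection import RandomizedSearchCV

# Synthetic stand-in for a churn dataset (~15% churn rate)
X, y = make_classification(n_samples=20_000, n_features=20,
                           weights=[0.85], random_state=0)

# Search space is my own rough choice, not the paper's
param_dist = {
    "num_leaves": randint(16, 256),
    "learning_rate": uniform(0.01, 0.2),
    "n_estimators": randint(100, 1000),
    "min_child_samples": randint(5, 100),
    "subsample": uniform(0.6, 0.4),
    "colsample_bytree": uniform(0.6, 0.4),
}

# Even a modest random search over these knobs is more "tuning"
# than most deep-learning papers give their boosting baselines
search = RandomizedSearchCV(LGBMClassifier(), param_dist, n_iter=50,
                            scoring="roc_auc", cv=5, random_state=0)
search.fit(X, y)
print(search.best_params_, search.best_score_)
```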
2. #Churn models should *not* be evaluated with precision/recall but rather with AUC: true/false churn predictions are NEVER used in practice, only risk rankings. (Always use predict_proba for churn, never predict.)
Importantly, the precision/recall metrics they show in their results are sensitive to the classification thresholds, which are not detailed, and that's a tricky issue for imbalanced data. This is another reason not to believe the supposed accuracy improvement.
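A minimal sketch of what I mean, with synthetic data standing in for a churn set (all names here are my own placeholders, not the paper's):

```python
from lightgbm import LGBMClassifier
from sklearn.datasets import make_classification
from sklearn.metrics import precision_score, recall_score, roc_auc_score
from sklearn.model_selection import train_test_split

# Synthetic stand-in for a churn dataset (~15% churn rate)
X, y = make_classification(n_samples=20_000, n_features=20,
                           weights=[0.85], random_state=0)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, stratify=y, test_size=0.2, random_state=0)

model = LGBMClassifier().fit(X_train, y_train)

# Risk ranking: what actually gets used downstream
churn_risk = model.predict_proba(X_test)[:, 1]
print("AUC:", roc_auc_score(y_test, churn_risk))

# Precision/recall swing with the (usually unreported) threshold
for threshold in (0.3, 0.5, 0.7):
    hard_preds = (churn_risk >= threshold).astype(int)
    print(f"t={threshold}: precision={precision_score(y_test, hard_preds):.2f} "
          f"recall={recall_score(y_test, hard_preds):.2f}")
```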
3. #Gradientboosting is VERY interpretable with the #SHAPley method. It is totally misleading to say their deep neural network is more interpretable and boosting is not. They are apparently ignorant of these important advances in interpretability, which are more than 5 years old now.
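A minimal SHAP example on a boosted model, assuming the shap package; the dataset and model are placeholders of my own:

```python
import shap
from sklearn.datasets import make_classification
from xgboost import XGBClassifier

# Synthetic stand-in for a churn dataset (~15% churn rate)
X, y = make_classification(n_samples=5_000, n_features=10,
                           weights=[0.85], random_state=0)
model = XGBClassifier(n_estimators=200, max_depth=4).fit(X, y)

# TreeExplainer computes exact SHAP values for tree ensembles, fast
explainer = shap.TreeExplainer(model)
shap_values = explainer.shap_values(X)

# Global feature importance and per-customer attributions in one plot
shap.summary_plot(shap_values, X)
```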
4. Despite a lot of talk about class imbalance, the churn datasets are not very imbalanced: 10-20% churn rates. Really imbalanced data means low single-digit churn rates.
Incentives are to present something new and flashy even if the real-world use case is small or nonexistent. Time series forecasting with deep learning has been a classic example of this problem: lots of "SOTA" models benchmarked against clearly untuned and inappropriate baselines.