✨ Preprint (#AIstats): survival analysis and competing risks with scalable stochastic solvers that recover outcome probabilities well.
TL;DR: using stochastic optimization, our approach gives survival models that are both faster and better at predicting event probabilities.
🧵
1/7
Comments
We proposed something very similar a while ago ( https://jmlr.org/papers/volume23/19-450/19-450.pdf ).
I toyed at the time with the idea of jointly estimating the weights and the survival estimator in an iterative fashion, but it wasn't stable.
Did you have any divergence problems?
No, we did not have any divergence problems. The solver is actually very stable.
Try it out! The code is linked in the manuscript.
For some "censored" individuals, the observation window was too short to observe the outcome (common in marketing, insurance, and medicine).
2/
Censoring must be compensated for: dropping individuals with unobserved outcomes leads to bias, because it neglects cases with long event times.
Existing solutions typically do not scale, so practitioners often fall back on linear models.
3/7
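The bias is easy to see on toy data: keeping only individuals whose event was observed underestimates event times, because the long times are exactly the ones that get censored. A minimal numpy illustration (hypothetical data, not from the paper):

```python
import numpy as np

rng = np.random.default_rng(42)
n = 100_000
t_event = rng.exponential(2.0, size=n)  # true event times, mean 2.0
t_cens = rng.exponential(2.0, size=n)   # censoring times
observed = np.minimum(t_event, t_cens)  # we only see the earlier of the two
is_event = t_event <= t_cens            # whether the event was observed

# Naive: drop censored individuals and average the observed event times.
naive_mean = observed[is_event].mean()
print(naive_mean)  # ≈ 1.0, well below the true mean of 2.0
```

With both clocks exponential with mean 2, the naive estimate converges to 1.0 instead of 2.0: long event times are systematically missing from the kept sample.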
Our loss is a sum over individuals, so it can be optimized with stochastic methods.
It needs no gradients along the time axis, and can thus be used with tree-based models.
4/7
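Because the loss decomposes as a sum over individuals, any minibatch gives an unbiased estimate of it. As a sketch of what such a per-individual term can look like, here is an IPCW-weighted Brier score at a fixed horizon (the paper's exact loss differs; all names here are illustrative):

```python
import numpy as np

def ipcw_brier_minibatch(pred_surv, times, events, horizon, G_time, G_horizon):
    """IPCW-weighted Brier score for P(T > horizon) on one minibatch.

    pred_surv: predicted survival probability at `horizon`, per individual
    times:     observed time (event or censoring)
    events:    1 if the event was observed, 0 if censored
    G_time:    estimated censoring survival G(T_i) for each individual
    G_horizon: estimated censoring survival G(horizon)

    Individuals censored before the horizon get weight 0; the inverse-G
    weights on the others compensate, keeping the estimate unbiased.
    """
    had_event = (times <= horizon) & (events == 1)  # target is 0
    at_risk = times > horizon                       # target is 1
    w = (had_event / np.maximum(G_time, 1e-8)
         + at_risk / np.maximum(G_horizon, 1e-8))
    target = at_risk.astype(float)
    return np.mean(w * (target - pred_surv) ** 2)
```

Since the mean runs over the minibatch only, an SGD step or boosting round touches a subset of individuals at a time, which is what makes the approach scalable.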
This loss can be plugged into any stochastic solver, e.g. neural networks or gradient-boosted trees, with an alternating optimization of the censoring model.
5/7
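A toy sketch of what an alternating scheme can look like: re-estimate the censoring distribution G with a Kaplan-Meier estimator, take stochastic gradient steps on an IPCW-weighted Brier loss, and repeat. The "model" here is a single probability p rather than a GBT or a neural network, and everything (data, loss, schedule) is illustrative, not the paper's algorithm:

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy censored data (illustrative): exponential event and censoring times
n = 200
t_event = rng.exponential(2.0, size=n)
t_cens = rng.exponential(4.0, size=n)
times = np.minimum(t_event, t_cens)
events = (t_event <= t_cens).astype(int)
horizon = 1.0

def km_survival(times, flags, t):
    """Kaplan-Meier estimate of P(time > t), treating flags == 1 as events."""
    surv = 1.0
    for u in np.unique(times[times <= t]):
        at_risk = np.sum(times >= u)
        d = np.sum((times == u) & (flags == 1))
        surv *= 1.0 - d / at_risk
    return surv

p = 0.5  # stand-in model: a single probability P(T > horizon)
for _ in range(5):
    # (a) re-estimate the censoring survival G (censoring = 1 - events)
    G = np.array([km_survival(times, 1 - events, min(t, horizon)) for t in times])
    # (b) SGD steps on the IPCW-weighted Brier loss at `horizon`
    for _ in range(200):
        idx = rng.integers(0, n, size=32)
        had_event = (times[idx] <= horizon) & (events[idx] == 1)
        at_risk = times[idx] > horizon
        w = (had_event | at_risk) / np.maximum(G[idx], 1e-8)
        grad = np.mean(2.0 * w * (p - at_risk))  # d/dp of w * (at_risk - p)^2
        p -= 0.05 * grad
# p approaches the true S(horizon) = exp(-0.5) ≈ 0.61
```

In a real implementation, step (b) updates the parameters of a flexible model instead of a scalar, and step (a) can itself use a learned model of censoring; the instability the comment above asks about would show up in this outer loop.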
On simulated data, 4 real-life datasets, and a variety of metrics, SurvivalBoost is a markedly better predictor and faster than alternatives, sometimes much faster.
It really enables easy survival analysis and competing risks on large data, such as electronic health records.
Read it here:
https://arxiv.org/abs/2410.16765
6/7
Are these 2 separate boosting trees, or can they share parameters?