Can deep learning finally compete with boosted trees on tabular data? 🌲 In our NeurIPS 2024 paper, we introduce RealMLP, a NN with improvements in all areas and meta-learned default parameters. Some insights about RealMLP and other models on large benchmarks (>200 datasets): 🧵 - ThreadSky

dholzmueller.bsky.social • 101 days ago

Can deep learning finally compete with boosted trees on tabular data? 🌲
In our NeurIPS 2024 paper, we introduce RealMLP, a NN with improvements in all areas and meta-learned default parameters.
Some insights about RealMLP and other models on large benchmarks (>200 datasets): 🧵

Comments

dholzmueller.bsky.social•101 days ago

Coauthors: Léo Grinsztajn (@leogrin.bsky.social) and Ingo Steinwart
Paper: https://arxiv.org/abs/2407.04491
Code: https://github.com/dholzmueller/pytabkit

Our library is pip-installable and contains easy-to-use and configurable scikit-learn interfaces (including baselines). 2/

dholzmueller.bsky.social•101 days ago

Our new methods and default parameters are tuned on a meta-train benchmark and then also evaluated on
- a disjoint meta-test benchmark including large and high-dimensional datasets
- the smaller Grinsztajn et al. benchmark (with more baselines). 3/

dholzmueller.bsky.social•101 days ago

RealMLP can be used with tuned defaults (TD) or hyperparameter optimization (HPO).
We tuned defaults and our “bag of tricks” only on meta-train. Still, RealMLP outperforms the MLP-PLR baseline with numerical embeddings on all benchmarks. 4/

dholzmueller.bsky.social•101 days ago

To test if our “bag of tricks” transfers to other architectures, we try some of the tricks on the retrieval-based TabR-S-D, with much less tuning than for RealMLP-TD.
The resulting RealTabR-D performs much better than the default parameters from the original paper. 5/

dholzmueller.bsky.social•101 days ago

For boosted trees, our tuned defaults (TD) outperform the library defaults (D) in our standard metrics, though they do not match hyperparameter optimization (HPO) on meta-test, and the results are more mixed on other metrics/benchmarks. 6/

dholzmueller.bsky.social•101 days ago

Depending on the benchmark and metrics/aggregation, RealMLP is sometimes a bit better than boosted trees and sometimes a bit worse.
Generally, taking the best TD model (Best-TD) on each dataset typically has a better time-accuracy trade-off than 50 steps of random search HPO . 7/

Comments

Posting Rules

Reply