This might be the first time in 10 years that boosted trees are not the best default choice when working with data in tables.
Instead, a pre-trained neural network is: the new TabPFN, as we just published in Nature.
Not only that: it needs just a few seconds to train on new data, whereas the baselines are tuned for 4 hours.
We train a neural network that natively handles tables, using attention across rows and columns, on millions of artificial tabular datasets from a meticulously designed data generator. It then performs in-context learning to make predictions on unseen data.
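To make "attention across rows and columns" a bit more concrete, here is a minimal NumPy sketch of one such layer. It is not the actual TabPFN code: the function names, the identity projections (no learned weights), the single head, and the rule that every row attends only to the labelled training rows are all assumptions made for illustration.

```python
import numpy as np


def softmax(x, axis=-1):
    # Numerically stable softmax along the given axis.
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)


def self_attention(tokens, mask=None):
    """Single-head scaled dot-product attention with identity projections.

    tokens: array of shape (batch, n, d).
    mask:   optional (n, n) boolean array; True means query i may attend to key j.
    """
    d = tokens.shape[-1]
    scores = tokens @ np.swapaxes(tokens, -1, -2) / np.sqrt(d)
    if mask is not None:
        scores = np.where(mask, scores, -np.inf)
    return softmax(scores, axis=-1) @ tokens


def two_way_attention_layer(cells, n_train):
    """Hypothetical layer: one embedding per table cell, shape (n_rows, n_cols, d).

    The first n_train rows play the role of the training set (the "context");
    the remaining rows are the ones we want predictions for.
    """
    # 1) Attention across columns: within each row, cells attend to each other.
    cells = cells + self_attention(cells)

    # 2) Attention across rows: within each column, cells attend to training rows.
    by_col = np.swapaxes(cells, 0, 1)          # (n_cols, n_rows, d)
    n_rows = by_col.shape[1]
    mask = np.zeros((n_rows, n_rows), dtype=bool)
    mask[:, :n_train] = True                   # assumption: keys are training rows only
    by_col = by_col + self_attention(by_col, mask)
    return np.swapaxes(by_col, 0, 1)           # back to (n_rows, n_cols, d)


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    table = rng.normal(size=(5, 3, 4))         # 5 rows, 3 columns, 4-dim embeddings
    out = two_way_attention_layer(table, n_train=3)
    print(out.shape)
```

Under the masking assumed here, the representation of a test row depends only on that row and on the training rows. That is the in-context learning idea in miniature: new data enters as context in a forward pass rather than through weight updates.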
Nature article: https://nature.com/articles/s41586-024-08328-6
Try on free cloud: https://github.com/PriorLabs/tabpfn-client/
Try locally (GPU recommended): https://github.com/PriorLabs/TabPFN