The original was posted on /r/machinelearning by /u/_puhsu on 2024-11-12 12:29:00+00:00.


What DL architecture to try on tabular data?

Hi Reddit! Today, my colleagues announced TabM, a new answer to the above question. TabM leads on the benchmarks while remaining simple, practical, and scalable to large datasets. Technically, TabM efficiently imitates an ensemble of MLPs, as illustrated below. TabM is also one of the first projects to use our new TabReD benchmark, a collection of eight real-world industrial datasets with time-based splits and feature engineering.
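
For readers who prefer code to figures, here is a minimal NumPy sketch of what "efficiently imitating an ensemble of MLPs" can look like. It assumes a BatchEnsemble-style scheme: all k members share the weight matrices, and each member only owns cheap elementwise scaling vectors and biases. The names (`init_params`, `forward`, `predict`, `r1`, `s1`) and the two-layer shape are made up for illustration. This is not the authors' code and may differ from TabM's actual design, so check the paper for the real architecture.

```python
import numpy as np


def init_params(rng, d_in, d_hidden, k):
    """Hypothetical parameter layout: shared weights plus per-member adapters."""
    return {
        # Shared by all k members (the expensive part).
        "W1": rng.normal(0.0, 1.0 / np.sqrt(d_in), (d_in, d_hidden)),
        "W2": rng.normal(0.0, 1.0 / np.sqrt(d_hidden), (d_hidden, 1)),
        # Owned by each member (the cheap part): random +-1 scalings and biases.
        "r1": rng.choice([-1.0, 1.0], (k, d_in)),
        "s1": rng.choice([-1.0, 1.0], (k, d_hidden)),
        "b1": np.zeros((k, d_hidden)),
        "b2": np.zeros((k, 1)),
    }


def forward(params, x):
    """Map x of shape (batch, d_in) to (batch, k): one prediction per member."""
    h = x[:, None, :] * params["r1"]         # (batch, k, d_in): member-specific input scaling
    h = h @ params["W1"]                     # (batch, k, d_hidden): one shared matmul
    h = np.maximum(h * params["s1"] + params["b1"], 0.0)  # member-specific scaling, bias, ReLU
    out = h @ params["W2"] + params["b2"]    # (batch, k, 1): shared output weights
    return out[..., 0]


def predict(params, x):
    """Ensemble prediction: average the k member outputs."""
    return forward(params, x).mean(axis=1)
```

Each member i behaves like an ordinary MLP whose first-layer weight matrix is `W1` with its rows scaled by `r1[i]` and its columns scaled by `s1[i]`, but all k members are evaluated in a single batched pass over the shared matrices.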

For a quick overview of TabM, you can check the following parts of the paper:

  • The abstract
  • The model illustration in Figure 1 (and in the post below)
  • The main results on page 7

TabM links:

TabReD links:

The model illustration