Tabular data

Tabular data involves two-dimensional tables with objects (rows) and features (columns), which are used in numerous applied tasks such as classification, regression, ranking and many others.

Area 19. Tabular data.svg

Posts

Publications

  • TabPack: Efficient Hyperparameter Ensembles for Tabular Deep Learning

    Tabular data
    Yury Gorishniy
    Akim Kotelnikov
    Ivan Rubachev
    Artem Babenko
    ICML, 2026

    Deep learning models for supervised learning on tabular data are rapidly improving. Notably, ensembles (mixtures of multiple models) often play an important role in achieving top performance, which motivates designing ensemble-first systems rather than treating ensembling as an ad hoc trick. In this work, we present TabPack — a new ensembling approach that packs many base model–optimizer pairs with different hyperparameters into a single neural network and a single optimizer. The base model and optimizer hyperparameters are sampled randomly, after which all base models are trained in parallel, and the final ensemble is built on the fly during training. As a result, TabPack produces powerful ensembles in a single run, with substantial efficiency gains over traditional approaches. With its remarkable efficiency, strong performance on medium-to-large datasets, and reduced reliance on traditional hyperparameter tuning, TabPack is an appealing solution for practitioners and researchers that makes tabular DL more accessible on consumer-grade hardware and suggests a new avenue for designing better tabular deep learning systems.

  • Unveiling the Role of Data Uncertainty in Tabular Deep Learning

    Tabular dataUncertainty estimation
    Nikolay Kartashev
    Ivan Rubachev
    Artem Babenko
    ICML, 2026

    Recent advancements in tabular deep learning have demonstrated exceptional practical performance, yet the field often lacks a clear understanding of why these techniques actually succeed. To address this gap, our paper highlights the importance of the concept of data (aleatoric) uncertainty for explaining the effectiveness of recent tabular DL methods. While data uncertainty leads to irreducible prediction errors on test samples, it also introduces stochasticity into the training signal that can impede effective learning. We demonstrate that tabular methods differ significantly in their ability to cope with this optimization challenge. Specifically, we reveal that the success of many beneficial design choices in tabular DL, such as numerical feature embeddings, advanced ensembling strategies, retrieval-augmented models, and tabular Prior-Fitted Networks, can be partially attributed to their respective implicit mechanisms for performing well under high data uncertainty. By dissecting these varied mechanisms, we provide a unifying understanding of recent performance improvements. Furthermore, leveraging insights from this perspective, we design a novel, more effective numerical feature embedding method as an immediate practical outcome of our analysis. Overall, our work paves the way toward a principled understanding of the benefits introduced by modern tabular methods that results in the concrete advancements of existing techniques and outlines future research directions for tabular DL.

  • TabM: Advancing Tabular Deep Learning with Parameter-Efficient Ensembling

    Tabular data
    Yury Gorishniy
    Akim Kotelnikov
    Artem Babenko
    ICLR, 2025

    Deep learning architectures for supervised learning on tabular data range from simple multilayer perceptrons (MLP) to sophisticated Transformers and retrieval-augmented methods. This study highlights a major, yet so far overlooked opportunity for substantially improving tabular MLPs; namely, parameter-efficient ensembling -- a paradigm for imitating an ensemble of models with just one model. We start by describing TabM -- a simple model based on MLP and BatchEnsemble (an existing technique), improved with our custom modifications. Then, we perform a large scale evaluation of tabular DL architectures on public benchmarks in terms of both task performance and efficiency, which renders the landscape of tabular DL in a new light. In particular, we find that TabM outperforms prior tabular DL models, while the complexity of attention- and retrieval-based methods does not pay off. Lastly, we conduct a detailed empirical analysis, that sheds some light on the high performance of TabM. For example, we show that parameter-efficient ensembling is not an arbitrary trick, but rather a highly effective way to reduce overfitting and improve optimization dynamics of tabular MLPs. Overall, our work brings an impactful technique to tabular DL, analyses its behaviour, and advances the performance-efficiency tradeoff with TabM -- a simple and powerful baseline for researchers and practitioners.

Datasets

  • TabReD

    Tabular data
    Ivan Rubachev
    Nikolay Kartashev
    Yury Gorishniy
    Artem Babenko

    TabReD is a benchmark for evaluating tabular machine learning models under conditions representative of real-world deployments. It comprises eight datasets from production ML systems at Yandex and Kaggle competitions. TabReD addresses two gaps in existing benchmarks: (1) all datasets use time-based train/validation/test splits to evaluate models under temporal distribution drift, and (2) datasets are feature-rich (median 261 features vs. 13-23 in prior benchmarks) with extensive feature engineering, reflecting real ML pipelines. Experiments on TabReD demonstrate that methods successful on standard benchmarks may underperform on TabReD, making it a critical testbed for assessing whether tabular ML approaches generalize to industrial settings.

  • Shifts Dataset

    Distributional shiftUncertainty estimation Tabular dataMachine translationNatural language processing
    Andrey Malinin
    Neil Band
    Yarin Gal
    Mark J. F. Gales
    Alexander Ganshin
    German Chesnokov
    Alexey Noskov
    Andrey Ploskonosov
    Liudmila Prokhorenkova
    Ivan Provilkov
    Vatsal Raina
    Vyas Raina
    Denis Roginskiy
    Mariya Shmatova
    Panos Tigas
    Boris Yangel

    The Shifts Dataset contains curated and labeled examples of real, 'in-the-wild' distributional shifts across three large-scale tasks. Specifically, it contains tabular weather prediction, machine translation, and vehicle motion prediction tasks' data used in Shifts Challenge 2021. Dataset shift is ubiquitous in all of these tasks and modalities.