Transformers for tabular data representation: A survey of models and applications

Badaro, Gilbert; Saeed, Mohammed; Papotti, Paolo
Transactions of the Association for Computational Linguistics, January 2023, Vol.11

In the last few years, the natural language processing community has witnessed advances in neural representations of free texts with transformer-based language models (LMs). Given the importance of knowledge available in tabular data, recent research efforts extend LMs by developing neural representations for structured data. In this article, we present a survey that analyzes these efforts. We first abstract the different systems according to a traditional machine learning pipeline in terms of training data, input representation, model training, and supported downstream tasks. For each aspect, we characterize and compare the proposed solutions. Finally, we discuss future work directions.


DOI
HAL
Type:
Journal
Date:
2023-01-15
Department:
Data Science
Eurecom Ref:
7123
Copyright:
Copyright ACL. Personal use of this material is permitted. The definitive version of this paper was published in Transactions of the Association for Computational Linguistics, January 2023, Vol.11 and is available at : https://doi.org/10.1162/tacl_a_00544

PERMALINK : https://www.eurecom.fr/publication/7123