Fine-grained attention in hierarchical transformers for tabular time-series

Azorin, Raphael; Ben Houidi, Zied; Gallo, Massimo; Finamore, Alessandro; Michiardi, Pietro

SIGKDD 2024, 10th ACM Mining and Learning from Time Series Workshop (MiLeTS), August 26, 2024, Barcelona, Spain

Tabular data is ubiquitous in many real-life systems. In particular, time-dependent tabular data, where rows are chronologically related, is typically used for recording historical events, e.g., financial transactions, healthcare records, or stock history. Recently, hierarchical variants of the attention mechanism of transformer architectures have been used to model tabular time-series data. At first, rows (or columns) are encoded separately by computing attention between their fields. Subsequently, encoded rows (or columns) are attended to one another to model the entire tabular time-series. While efficient, this approach constrains the attention granularity and limits its ability to learn patterns at the field-level across separate rows, or columns. We take a first step to address this gap by proposing Fieldy, a fine-grained hierarchical model that contextualizes fields at both the row and column levels. We compare our proposal against state of the art models on regression and classification tasks using public tabular time-series datasets. Our results show that combining row-wise and column-wise attention improves performance without increasing model size. Code and data are available at https://github.com/raphaaal/fieldy

Detail

Document

ARXIV

BIBTEX

Type:

Conference

City:

Barcelona

Date:

2024-08-26

Department:

Data Science

Eurecom Ref:

7798

© ACM, 2024. This is the author's version of the work. It is posted here by permission of ACM for your personal use. Not for redistribution. The definitive version was published in SIGKDD 2024, 10th ACM Mining and Learning from Time Series Workshop (MiLeTS), August 26, 2024, Barcelona, Spain