Time Series Analysis (TSA) and Natural Language Processing (NLP) are two domains of research that have seen a surge of interest in recent years. NLP focuses mainly on enabling computers to manipulate and generate human language, whereas TSA identifies patterns or components in time-dependent data. Given their different purposes, there has been limited exploration of combining them. In this study, we present an approach to convert text into time series to exploit TSA for exploring text properties and to make NLP approaches interpretable for humans. We formalize our Text to Time Series framework as a feature extraction and aggregation process, proposing a set of different conversion alternatives for each step. We experiment with our approach on several textual datasets, showing the conversion approach’s performance and applying it to the field of interpretable time series classification.

Text to Time Series Representations: Towards Interpretable Predictive Models

Spinnato, Francesco
;
2023

Abstract

Time Series Analysis (TSA) and Natural Language Processing (NLP) are two domains of research that have seen a surge of interest in recent years. NLP focuses mainly on enabling computers to manipulate and generate human language, whereas TSA identifies patterns or components in time-dependent data. Given their different purposes, there has been limited exploration of combining them. In this study, we present an approach to convert text into time series to exploit TSA for exploring text properties and to make NLP approaches interpretable for humans. We formalize our Text to Time Series framework as a feature extraction and aggregation process, proposing a set of different conversion alternatives for each step. We experiment with our approach on several textual datasets, showing the conversion approach’s performance and applying it to the field of interpretable time series classification.
2023
Settore INF/01 - Informatica
26th International Conference on Discovery Science, DS 2023
prt
2023
Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Springer Science and Business Media Deutschland GmbH
978-3-031-45274-1
978-3-031-45275-8
Explainable AI; Interpretable Machine Learning; Natural Language Processing; Time Series Classification
File in questo prodotto:
File Dimensione Formato  
DS2023___Text2TimeSeries (5).pdf

Accesso chiuso

Tipologia: Submitted version (pre-print)
Licenza: Non pubblico
Dimensione 556.21 kB
Formato Adobe PDF
556.21 kB Adobe PDF   Richiedi una copia
P6 - 978-3-031-45275-8_16 (1).pdf

Accesso chiuso

Tipologia: Published version
Licenza: Non pubblico
Dimensione 916.64 kB
Formato Adobe PDF
916.64 kB Adobe PDF   Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11384/137186
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 0
  • ???jsp.display-item.citation.isi??? ND
social impact