Journal article
Compact representations of event sequences
Date
2018
Recorded in:
Data Compression Conference Proceedings, 2018
10680314
10.1109/DCC.2018.00032
Authors
Brisaboa, Nieves
De Bernardo, Guillermo
Navarro, Gonzalo
Rodeiro, Tirso
Seco, Diego
Institution
Abstract
We introduce a new technique for the efficient management of large sequences of multidimensional data, which takes advantage of regularities that arise in real-world datasets and supports different types of aggregation queries. More importantly, our representation is flexible in the sense that the relevant dimensions and queries may be used to guide the construction process, easily providing a space-time tradeoff depending on the relevant queries in the domain. We provide two alternative representations for sequences of multidimensional data and describe the techniques to efficiently store the datasets and to perform aggregation queries over the compressed representation. We perform an experimental evaluation on realistic datasets, showing the space efficiency and query capabilities of our proposal.