Artículos de revistas
Similarity joins and beyond: an extended set of binary operators with order
Fecha
2015Registro en:
Lecture Notes in Computer Science, Cham, v.9371, p.29-41, 2015
0302-9743
10.1007/978-3-319-25087-8_3
Autor
Carvalho, Luiz Olmes
Santos, Lúcio Fernandes Dutra
Oliveira, Willian Dener de
Traina, Agma Juci Machado
Traina Junior, Caetano
Institución
Resumen
Similarity joins are troublesome database operators that often produce results much larger than the user really needs or expects. In order to return the similar elements, similarity joins also require sorting during the retrieval process, although order is a concept not supported in the relational model. This paper proposes a solution to solve those two issues extending the similarity join concept to a broader set of binary operators, which aims at retrieving the most similar pairs and embedding the sorting operation only as an internal processing step, so as to comply with the relational theory. Additionally, our extension allows to explore another useful condition not previously considered in the similarity retrieval: the negation of predicates. Experiments performed on real and synthetic data show that our operators are fast enough to be used in real applications and scale well both for multidimensional and non-dimensional metric data.