Optimizing response time in large scale similarity searches

Rafael Martins de Souza

dc.contributor	Renato Antônio Celso Ferreira
dc.contributor	http://lattes.cnpq.br/3446817929796674
dc.contributor	George Luiz Medeiros Teodoro
dc.contributor	Wagner Meira Júnior
dc.contributor	William Robson Schwartz
dc.contributor	Eduardo Alves do Valle Junior
dc.creator	Rafael Martins de Souza
dc.date.accessioned	2021-10-18T02:03:48Z
dc.date.accessioned	2022-10-03T22:40:23Z
dc.date.available	2021-10-18T02:03:48Z
dc.date.available	2022-10-03T22:40:23Z
dc.date.created	2021-10-18T02:03:48Z
dc.date.issued	2020-03-13
dc.identifier	http://hdl.handle.net/1843/38399
dc.identifier.uri	http://repositorioslatinoamericanos.uchile.cl/handle/2250/3807991
dc.description.abstract	Similarity search is a core operation found in several online multimedia services. These services have to handle very large databases while, at the same time, minimizing the query response times observed by users. This is especially complex because those services deal with fluctuating query workloads (rates). Consequently, they must adapt at run-time to minimize the response times as the load varies. In this dissertation, we address the aforementioned challenges with a distributed memory parallelization of the product quantization nearest neighbor search, also known as IVFADC, for hybrid CPU-GPU machines. Our parallel IVFADC also implements an out-of-core scheme to use the GPU for databases in which the index does not fit in its memory, which is crucial for searching in very large databases. The careful use of CPU and GPU with work-stealing led to an average reduction of the response time of 1.6× as compared to using the GPU only. Also, our approach to adapt the system to fluctuating loads, called Dynamic Query Processing Policy (DQPP), attained an average response time reduction of 7× vs. the greedy policy. Finally, in all settings, the system has been shown to attain high query processing rates and near-linear scalability. We have executed our system in an environment with up to 256 NVIDIA V100 GPUs and a database of 256 billion SIFT feature vectors.
dc.publisher	Universidade Federal de Minas Gerais
dc.publisher	Brasil
dc.publisher	ICX - DEPARTAMENTO DE CIÊNCIA DA COMPUTAÇÃO
dc.publisher	Programa de Pós-Graduação em Ciência da Computação
dc.publisher	UFMG
dc.rights	Open Access
dc.subject	Computation
dc.subject	Distributed Systems
dc.subject	Similarity Search
dc.title	Optimizing response time in large scale similarity searches
dc.type	Dissertation

This item belongs to the following institution

Universidade Federal de Minas Gerais (Brasil)