dc.creatorScabora, Lucas C.
dc.creatorBrito, Jaqueline J.
dc.creatorCiferri, Ricardo Rodrigues
dc.creatorCiferri, Cristina Dutra de Aguiar
dc.date.accessioned2016-10-20T17:36:29Z
dc.date.accessioned2018-07-04T17:12:22Z
dc.date.available2016-10-20T17:36:29Z
dc.date.available2018-07-04T17:12:22Z
dc.date.created2016-10-20T17:36:29Z
dc.date.issued2016-04
dc.identifierInternational Conference on Enterprise Information Systems, XVIII, 2016, Rome.
dc.identifier9789897581878
dc.identifierhttp://www.producao.usp.br/handle/BDPI/51025
dc.identifierhttp://dx.doi.org/10.5220/0005815901110118
dc.identifier.urihttp://repositorioslatinoamericanos.uchile.cl/handle/2250/1646098
dc.description.abstractNowadays, data warehousing and online analytical processing (OLAP) are core technologies in business intelligence and therefore have drawn much interest by researchers in the last decade. However, these technologies have been mainly developed for relational database systems in centralized environments. In other words, these technologies have not been designed to be applied in scalable systems such as NoSQL databases. Adapting a data warehousing environment to NoSQL databases introduces several advantages, such as scalability and flexibility. This paper investigates three physical data warehouse designs to adapt the Star Schema Benchmark for its use in NoSQL databases. In particular, our main investigation refers to the OLAP query processing over column-oriented databases using the MapReduce framework. We analyze the impact of distributing attributes among column-families in HBase on the OLAP query performance. Our experiments showed how processing time of OLAP queries was impacted by a physical data warehouse design regarding the number of dimensions accessed and the data volume. We conclude that using distinct distributions of attributes among column-families can improve OLAP query performance in HBase and consequently make the benchmark more suitable for OLAP over NoSQL databases.
dc.languageeng
dc.publisherInstitute for Systems and Technologies of Information, Control and Communication - INSTICC
dc.publisherScience and Technology Press – SciTePress
dc.publisherRome
dc.relationInternational Conference on Enterprise Information Systems, XVIII
dc.rightsCopyright SCITEPRESS
dc.rightsclosedAccess
dc.subjectData Warehousing
dc.subjectPhysical Design
dc.subjectNoSQL
dc.subjectOLAP Query Processing
dc.subjectHBase
dc.subjectStar Schema Benchmark
dc.titlePhysical data warehouse design on NoSQL databases OLAP query processing over HBase
dc.typeActas de congresos


Este ítem pertenece a la siguiente institución