info:eu-repo/semantics/article
A family of experiments to validate measures for UML activity diagrams of ETL processes in data warehouses
Fecha
11/01/201011/01/2010
Autor
Muñoz, Lilia
Mazón, Jose Norberto
Trujillo, Juan
Institución
Resumen
In data warehousing, Extract, Transform, and Load (ETL) processes are in charge of extracting the data from the data sources that will be contained in the data warehouse. Their design and maintenance is thus a cornerstone in any data warehouse development project. Due to their relevance, the quality of these processes should be formally assessed early in the development in order to avoid populating the data warehouse with incorrect data. To this end, this paper presents a set of measures with which to evaluate the structural complexity of ETL process models at the conceptual level. This study is, moreover, accompanied by the application of formal frameworks and a family of experiments whose aim is to theoretical and empirically validate the proposed measures, respectively. Our experiments show that the use of these measures can aid designers to predict the effort associated with the maintenance tasks of ETL processes and to make ETL process models more usable. Our work is based on Unified Modeling Language (UML) activity diagrams for modeling ETL processes, and on the Framework for the Modeling and Evaluation of Software Processes (FMESP) framework for the definition and validation of the measures.
Materias
Ítems relacionados
Mostrando ítems relacionados por Título, autor o materia.
-
Designing relational data warehouses through schema-transformation primitives
Marotta, Adriana (UR. FI – INCO., 2001)A Data Warehouse (DW) is a database that stores information oriented to satisfy decision-making request. It is a database with some particular features concerning the data it contains and its utilisation. The features od ... -
Designing relational data warehouses through schema-transformation primitives : prototype
Gutiérrez, Alejandro; Marotta, Adriana (UR. FI – INCO., 2001)The logical design of a Data Warehouse (DW) is a task that requires the application of techniques and strategies that are specific of DW context. In [Mar00] we present a mechanism for designing DWs. Based in this mechanism ... -
Análise de desempenho de consultas OLAP espaçotemporais em função da ordem de processamento dos predicados convencional, espacial e temporal
Joaquim Neto, Cesar (Universidade Federal de São CarlosUFSCarPrograma de Pós-Graduação em Ciência da Computação - PPGCCCâmpus São Carlos, 2016-03-08)By providing ever-growing processing capabilities, many database technologies have been becoming important support tools to enterprises and institutions. The need to include (and control) new data types to the existing ...