Actas de congresos
A Provenance-based Approach To Manage Long Term Preservation Of Scientific Data
Registro en:
9781479934805
Proceedings - International Conference On Data Engineering. Ieee Computer Society, v. , n. , p. 126 - 133, 2014.
10844627
10.1109/ICDEW.2014.6818316
2-s2.0-84901751595
Autor
Sousa R.B.
Cugler D.C.
Malaverri J.E.G.
Medeiros C.B.
Institución
Resumen
Long term preservation of scientific data goes beyond the data, and extends to metadata preservation and curation. While several researchers emphasize curation processes, our work is geared towards assessing the quality of scientific (meta)data. The rationale behind this strategy is that scientific data are often accessible via metadata - and thus ensuring metadata quality is a means to provide long term accessibility. This paper discusses our quality assessment architecture, presenting a case study on animal sound recording metadata. Our case study is an example of the importance of periodically assessing (meta)data quality, since knowledge about the world may evolve, and quality decrease with time, hampering long term preservation. © 2014 IEEE.
126 133 (2012) Status Report of the DPHEP Study Group: Towards a Global Effort for Sustainable Data Preservation in High Energy Physics, , www.dphep.Org, DPHEP-Tech. Rep. DPHEP-2012-001Z A. et al http://www.csa.com/discoveryguides/cyber/overview.php, The Domesday Project 1986, accessed in November 2013Malaverri, J.E.G., (2013) Supporting Data Quality Assessment in Escience: A Provenance Based Aproach, , Ph. D. dissertation, Universidade Estadual de Campinas, Campinas, SÃco Paulo Conway, E., Giaretta, D., Lambert, S., Matthews, B., Curating scientific research data for the long term: A preservation analysis method in context (2011) The International Journal of Digital Curation, 6 (2), pp. 38-52 Trimble, L., Marks, S., Supporting data access and reuse in ontario: Scholars portals initiatives (2013) Proc. World Social Sciences Forum, , Montreal, Canada extended abstract http://http://ocul.on.ca/, Ontario council of university libraries 2010 accessed in November 2013http://http://rds-sdr.cisti-icist.nrccnrc.gc.ca/eng/, Research data canada 2011, accessed in november 2013(2000) Digital Preservation, , http://www.digitalpreservation.gov/, accessed in november 2013Infrastructure N.D.I. Program P Bechhofer, S., Buchan, I., De Roure, D., Missier, P., Ainsworth, J., Bhagat, J., Couch, P., Goble, C., Why linked data is not enough for scientists (2013) Future Generation Computer Systems, 29 (2), pp. 599-611. , Feb Bizer, C., Heath, T., Berners-Lee, T., Linked data\-The story so far (2009) International Journal on Semantic Web and Information Systems (IJSWIS), 5 (3), pp. 1-22 Wang, R.Y., Strong, D.M., Beyond accuracy: What data quality means to data consumers (1996) Journal of Management Information Systems, pp. 5-33 Wand, Y., Wang, R.Y., Anchoring data quality dimensions in ontological foundations (1996) Communications of the ACM, 39 (11), pp. 86-95 Parssian, A., Managerial decision support with knowledge of accuracy and completeness of the relational aggregate functions (2006) Decision Support Systems, 42 (3), pp. 1494-1502 Scholten, H., Udinkten Cate, A.J., Quality assessment of the simulation modeling process (1999) Computers and Electronics in Agriculture, 22 (2), pp. 199-208 Gamble, M., Goble, C., Quality, trust, and utility of scientific data on the web: Towards a joint model (2011) Web Science Trust Chiang, F., Miller, R.J., Active repair of data quality rules (2011) Proceedings of the 16th International Conference on Information Quality (ICIQ) Lemos, F., (2013) Infrastructure and Algorithms for Information Quality Analysis and Process Discovery, , Ph. D. dissertation, University of Versailles, France Etcheverry, L., Peralta, V., Bouzeghoub, M., Qbox-foundation: A metadata platform for quality measurement (2008) Proceeding of the 4th Workshop on Data and Knowledge Quality (QDC2008) Naim, A., Crawl, D., Indrawan, M., Altintas, I., Sun, S., Monitoring data quality in kepler (2010) Proceedings of the 19th ACM International Symposium on High Performance Distributed Computing. ACM, pp. 560-564 Bowers, S., Kudo, J., Cao, H., Schildhauer, M.P., Obsdb: A system for uniformly storing and querying heterogeneous observational data (2010) IEEE Sixth International Conference on E-Science, pp. 261-268 (2012) The Cornell Lab of Ornithology, , http://www.allaboutbirds.Org, Cornell accessed in December Frommolt, K.-H., Bardeli, R., Kurth, F., Clausen, M., The animal sound archive at the humboldt-university of berlin: Current activities in conservation and improving access for bioacoustic research (2006) Advances in Bioacoustics II, pp. 139-144 Cobos, M., Lopez, J., Listen up\-The present and future of audio signal processing (2010) IEEE Potentials, 29 (4), pp. 40-44 Bardeli, R., Similarity search in animal sound databases (2009) IEEE Transactions on Multimedia, 11 (1), pp. 68-76 Malaverri, J.E., Santanche, A., Medeiros, C.B., A provenancebased approach to evaluate data quality in escience (2013) Int. J. Metadata, Semantics and Ontologies, , accepted for publication Moreau, L., Clifford, B., Freire, J., Futrelle, J., Gil, Y., Groth, P., Kwasnikowska, N., Myers, J., The open provenance model core specification (v1. 1) (2011) Future Generation Computer Systems, 27 (6), pp. 743-756 Cugler, D.C., Medeiros, C.B., Toledo, L.F., Managing animal sounds-some challenges and research directions (2011) Proceedings v Brazilian EScience Workshop, , July Ranft, R., Natural sound archives: Past, present and future (2004) Anais da Academia Brasileira de Cincias, 76 (2), pp. 456-460 http://www.britannica.com/EBchecked/topic/409141/Neot-ropicalregion, Encyclopedia britannica-academic edition, accessed in November 2011http://proj.lis.ic.unicamp.br/fnjv, FNJV Online animal sound collection-Fonoteca Neotropical Jacques Vielliard, accessed in June 2013Toledo, L., Haddad, C., Reproductive biology of scinax fuscomarginatus (anura, hylidae) in south-eastern brazil (2005) Journal of Natural History, 39 (32), pp. 3029-3037 Caramaschi, U., Notes on the taxonomic status of Elachistocleis ovalis (schneider, 1799) and description of five new species of Elachistocleis Parker, 1927 (amphibia, anura, microhylidae) (2010) Boletim Do Museu Nacional. Nova Serie, Zoologia, 527, pp. 1-30 Cugler, D.C., Medeiros, C.B., Toledo, L.F., An architecture for retrieval of animal sound recordings based on context variables (2012) Concurrency and Computation: Practice and Experience, , June Cugler, D., Medeiros, C.B., Shekhar, S., Toledo, F., A geographical approach for metadata quality improvement in biological observation databases (2013) Proc. 9th IEEE International E-Science Conference http://www.catalogueoflife.org, Catalogue of life, accessed in October 2013Hull, D., Wolstencroft, K., Stevens, R., Goble, C., Pocock, M.R., Li, P., Oinn, T., Taverna: A tool for building and running workflows of services (2006) Nucleic Acids Research, 34 (SUPPL. 2), pp. W729-W732 Mota, M.S., Medeiros, C.B., Introducing shadows: Flexible document representation and annotation on the web (2013) Proc International Workshop on Data Engineering Meets the Semantic Web (DESWEB), , Brisbane, co-located with 29th ICDE conference Zhao, J., Gomez-Perez, J.M., Belhajjame, K., Klyne, G., Garcia-Cuesta, E., Garrido, A., Hettne, K., Goble, C., Why workflows breakâA T Understanding and combating decay in Taverna workflows (2012) E-Science (E-Science), 2012 IEEE 8th International Conference On. IEEE, pp. 1-9