dc.creatorLópez, Paula
dc.creatorHasperué, Waldo
dc.creatorQuiroga, Facundo Manuel
dc.creatorRonchetti, Franco
dc.date2021-10
dc.date2021
dc.date2022-02-02T17:26:01Z
dc.date.accessioned2023-07-15T05:22:15Z
dc.date.available2023-07-15T05:22:15Z
dc.identifierhttp://sedici.unlp.edu.ar/handle/10915/130340
dc.identifierisbn:978-987-633-574-4
dc.identifier.urihttps://repositorioslatinoamericanos.uchile.cl/handle/2250/7473106
dc.descriptionProgeny analyses are useful in the biological sciences for various purposes, such as improving individuals in new generations or carrying out molecular analysis of the transmission of genetic characteristics. Analyzing these data by comparing the individuals of a generation with their offspring is not a trivial task, and it grows in complexity as more generations are incorporated. In this article, we present TreeSpark, an open source tool for progeny analysis that provides simple access to information about individuals and their relations, both as progenitors and as descendants. The tool is developed as a Python module and inherits the distributed processing features of Spark, allowing it to process large volumes of progeny information. TreeSpark is compared with other similar tools and found to be much simpler to use.
dc.descriptionWorkshop: WBDMD - Base de Datos y Minería de Datos
dc.descriptionRed de Universidades con Carreras en Informática
dc.formatapplication/pdf
dc.format251-260
dc.languageen
dc.rightshttp://creativecommons.org/licenses/by-nc-sa/4.0/
dc.rightsCreative Commons Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0)
dc.subjectComputer Science
dc.subjectSpark
dc.subjectBig data
dc.subjectProgeny analysis
dc.subjectGenealogy
dc.subjectAnalytics
dc.titleTreeSpark: A Distributed Tool for Progeny Analysis based on Spark
dc.typeConference object