dc.creatorXamena, Eduardo
dc.creatorMarmanillo, Walter Gabriel
dc.creatorMechaca, Ana Lidia
dc.date2019-09
dc.date2019-12-20T14:31:19Z
dc.date.accessioned2023-07-14T17:53:39Z
dc.date.available2023-07-14T17:53:39Z
dc.identifierhttp://sedici.unlp.edu.ar/handle/10915/87809
dc.identifierissn:2683-8966
dc.identifier.urihttps://repositorioslatinoamericanos.uchile.cl/handle/2250/7429258
dc.descriptionLarge amounts of ancient documents have become available in the last years, regarding Argentinian history. This fact turns possible to find interesting and useful aggregated information. This work proposes the application of Natural Language Processing, Text Mining and Visualization tools over Argentinian ancient document repositories. Conceptual maps and entity networks make up the first target of this preliminary paper. The first step is the normalization of OCR acquired books of General G¨uemes. Exploratory analyses reveal the presence of manifold spelling errors, due to the OCR acquisition process of the volumes. We propose smart automatic ways for overcoming this issue in the process of normalization. Besides, a first topic landscape of a subset of volumes is obtained and analysed, via Topic Modelling tools.
dc.descriptionSociedad Argentina de Informática e Investigación Operativa
dc.formatapplication/pdf
dc.format28-37
dc.languageen
dc.rightshttp://creativecommons.org/licenses/by-nc-sa/3.0/
dc.rightsCreative Commons Attribution-NonCommercial-ShareAlike 3.0 Unported (CC BY-NC-SA 3.0)
dc.subjectCiencias Informáticas
dc.subjectArgentinian history
dc.subjectNatural language processing
dc.subjectTextMining
dc.subjectVisualization
dc.subjectBig document repositories
dc.titleRebuilding the Story of a Hero: Information Extraction in Ancient Argentinian Texts
dc.typeObjeto de conferencia
dc.typeObjeto de conferencia


Este ítem pertenece a la siguiente institución