dc.contributor | Saquicela Galarza, Victor Hugo | |
dc.creator | Ochoa Arevalo, Kevin Ismael | |
dc.creator | Quituisaca Suconota, Lucia Carolina | |
dc.date.accessioned | 2023-07-27T16:01:20Z | |
dc.date.accessioned | 2023-08-10T15:18:43Z | |
dc.date.available | 2023-07-27T16:01:20Z | |
dc.date.available | 2023-08-10T15:18:43Z | |
dc.date.created | 2023-07-27T16:01:20Z | |
dc.date.issued | 2023-07-26 | |
dc.identifier | http://dspace.ucuenca.edu.ec/handle/123456789/42509 | |
dc.identifier.uri | https://repositorioslatinoamericanos.uchile.cl/handle/2250/8152255 | |
dc.description.abstract | Around the world, projects are being carried out to digitize historical documents
with the aim of preserving the information contained in them. Many of these projects
use Optical Character Recognition (OCR). However, there are currently no such projects in Ecuador. During the digitization process, challenges arise that affect the quality of the information obtained through OCR, due to problems directly related to the
image, such as stains, folds, lighting, among others. Therefore, it is necessary to find
solutions to counteract these problems and obtain a better quality of information.
In this research work we propose to analyze image processing techniques to improve OCR processes with images of old newspapers from Ecuador. A process of
comparison and analysis of the data obtained from OCR is carried out, focusing on
the number of words correctly recognized in the images that were treated and untreated, with the objective of identifying improvements in the results. The processing
techniques, for ease of analysis, are divided into three groups: traditional techniques,
segmentation techniques and super-resolution techniques.
The results demonstrate that super-resolution processes, in particular the LAPSRN
technique, show a significant improvement in OCR results. These findings have important implications for the field of preservation and access to historical information
in Ecuador. | |
dc.language | spa | |
dc.publisher | Universidad de Cuenca | |
dc.relation | TS;309 | |
dc.rights | http://creativecommons.org/licenses/by-nc-nd/4.0/ | |
dc.rights | openAccess | |
dc.rights | Attribution-NonCommercial-NoDerivatives 4.0 Internacional | |
dc.subject | Ingeniería de Sistemas | |
dc.subject | Reconocimiento óptico | |
dc.subject | Preservación documental | |
dc.subject | Digitalización de documentos | |
dc.title | Analizar y aplicar técnicas de tratamiento de imágenes de periódicos antiguos del Ecuador para mejoras en el proceso de reconocimiento de textos (OCR). | |
dc.type | bachelorThesis | |