dc.contributorUniversidade Estadual Paulista (UNESP)
dc.date.accessioned2022-05-01T11:54:06Z
dc.date.accessioned2022-12-20T03:46:34Z
dc.date.available2022-05-01T11:54:06Z
dc.date.available2022-12-20T03:46:34Z
dc.date.created2022-05-01T11:54:06Z
dc.date.issued2021-01-01
dc.identifierLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), v. 13074 LNAI, p. 616-627.
dc.identifier1611-3349
dc.identifier0302-9743
dc.identifierhttp://hdl.handle.net/11449/233938
dc.identifier10.1007/978-3-030-91699-2_42
dc.identifier2-s2.0-85121798857
dc.identifier.urihttps://repositorioslatinoamericanos.uchile.cl/handle/2250/5414038
dc.description.abstractIn the different areas of knowledge, textual data are important sources of information. This way, Information Extraction methods have been developed to identify and structure information present in textual documents. In particular there is the Named Entity Recognition (NER) task, which consists of using methods to identify Named Entities, such as Person, Place, among others, in texts, using techniques from Natural Language Processing and Machine Learning. Recent works explored the use of external sources of knowledge to boost the Machine Learning models with sets of domain specific relevant information for the NER task. This work aims to evaluate the aggregation of external knowledge, in the form of Gazetter and Knowledge Graphs, for NER task. Our approach is composed of two steps: i) generation of embeddings, ii) definition and training of the Machine Learning methods. The experiments were conducted on four English datasets, and their results show that the applied strategies for external knowledge integration did not bring great gains to the models, as expressed by F1-Score metric. In the performed experiments, there was an F1-score increase in 17 of the 32 cases where external knowledge was used, but in most cases the gains were lesser than 0.5% in F1-score. In some scenarios the aggregated external knowledge does not capture relevant content, thus not being necessarily beneficial to the methodology.
dc.languageeng
dc.relationLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
dc.sourceScopus
dc.subjectInformation extraction
dc.subjectKnowledge embeddings
dc.subjectNamed entity recognition
dc.titleWhen External Knowledge Does Not Aggregate in Named Entity Recognition
dc.typeActas de congresos


Este ítem pertenece a la siguiente institución