When External Knowledge Does Not Aggregate in Named Entity Recognition

dc.contributor	Universidade Estadual Paulista (UNESP)
dc.date.accessioned	2022-05-01T11:54:06Z
dc.date.accessioned	2022-12-20T03:46:34Z
dc.date.available	2022-05-01T11:54:06Z
dc.date.available	2022-12-20T03:46:34Z
dc.date.created	2022-05-01T11:54:06Z
dc.date.issued	2021-01-01
dc.identifier	Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), v. 13074 LNAI, p. 616-627.
dc.identifier	1611-3349
dc.identifier	0302-9743
dc.identifier	http://hdl.handle.net/11449/233938
dc.identifier	10.1007/978-3-030-91699-2_42
dc.identifier	2-s2.0-85121798857
dc.identifier.uri	https://repositorioslatinoamericanos.uchile.cl/handle/2250/5414038
dc.description.abstract	In the different areas of knowledge, textual data are important sources of information. This way, Information Extraction methods have been developed to identify and structure information present in textual documents. In particular there is the Named Entity Recognition (NER) task, which consists of using methods to identify Named Entities, such as Person, Place, among others, in texts, using techniques from Natural Language Processing and Machine Learning. Recent works explored the use of external sources of knowledge to boost the Machine Learning models with sets of domain specific relevant information for the NER task. This work aims to evaluate the aggregation of external knowledge, in the form of Gazetter and Knowledge Graphs, for NER task. Our approach is composed of two steps: i) generation of embeddings, ii) definition and training of the Machine Learning methods. The experiments were conducted on four English datasets, and their results show that the applied strategies for external knowledge integration did not bring great gains to the models, as expressed by F1-Score metric. In the performed experiments, there was an F1-score increase in 17 of the 32 cases where external knowledge was used, but in most cases the gains were lesser than 0.5% in F1-score. In some scenarios the aggregated external knowledge does not capture relevant content, thus not being necessarily beneficial to the methodology.
dc.language	eng
dc.relation	Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
dc.source	Scopus
dc.subject	Information extraction
dc.subject	Knowledge embeddings
dc.subject	Named entity recognition
dc.title	When External Knowledge Does Not Aggregate in Named Entity Recognition
dc.type	Actas de congresos

Este ítem pertenece a la siguiente institución

Universidade Estadual Paulista (Brasil)