dc.contributorCaseli, Helena de Medeiros
dc.contributorhttp://lattes.cnpq.br/6608582057810385
dc.contributorhttp://lattes.cnpq.br/8077169220013752
dc.creatorSilva, Jéssica Rodrigues da
dc.date.accessioned2019-09-06T19:06:09Z
dc.date.accessioned2022-10-10T21:29:00Z
dc.date.available2019-09-06T19:06:09Z
dc.date.available2022-10-10T21:29:00Z
dc.date.created2019-09-06T19:06:09Z
dc.date.issued2019-07-03
dc.identifierSILVA, Jéssica Rodrigues da. Geração de vetores de sentido para o português. 2019. Dissertação (Mestrado em Ciência da Computação) – Universidade Federal de São Carlos, São Carlos, 2019. Disponível em: https://repositorio.ufscar.br/handle/ufscar/11792.
dc.identifierhttps://repositorio.ufscar.br/handle/ufscar/11792
dc.identifier.urihttp://repositorioslatinoamericanos.uchile.cl/handle/2250/4042307
dc.description.abstractNumerical vector representations are able to represent from words to meanings, in a low-dimensional continuous space. These representations are based on distributional modeling, where the context in which the word occurs is taken into account for vector generation. The word representations, known as word embeddings or word vectors (Word2vec, FastText, Wang2vec and Glove), which have been widely used until now, have an important limitation: they produce a single vector representation for each word, ignoring the fact that ambiguous words can represent different meanings (different contexts). This mixture of meanings can be a problem for many applications. For example, in a language comprehension task, using the vector of an ambiguous word as "bank", all possible meanings --such as financial institution, blood bank, or furniture item --will be mixed into a single numerical vector, causing an erroneous semantic interpretation of the sentence in which it occurs. Over the last few years, representations of meanings, known as sense embeddings or sense vectors, have proven to be able to model syntactic and semantic knowledge and have been used in NLP applications. By being able to transform the various meanings of an ambiguous word into numerical vectors, sense vectors can be applied to Word Sense Disambiguation (WSD). Thus, this work generated and evaluated sense vectors for Portuguese (PT-BR and PT-EU), and showed that they overcome traditional vectors in intrinsic and extrinsic NLP tasks, since they are capable of dealing with lexical ambiguity. To the best of our knowledge, this is the first work to address the geneation and evaluation of sense vectors for Portuguese.
dc.languagepor
dc.publisherUniversidade Federal de São Carlos
dc.publisherUFSCar
dc.publisherPrograma de Pós-Graduação em Ciência da Computação - PPGCC
dc.publisherCâmpus São Carlos
dc.rightsAcesso aberto
dc.subjectVetores de sentido
dc.subjectDesambiguação lexical de sentidos
dc.subjectVetores de palavra
dc.subjectModelagem distribucional
dc.subjectSense embeddings
dc.subjectSense vectors
dc.subjectWord sense disambiguation
dc.subjectWord embeddings
dc.subjectWord vectors
dc.subjectDistributional modeling
dc.titleGeração de vetores de sentido para o português
dc.typeTesis


Este ítem pertenece a la siguiente institución