dc.creatorArgerich, Luis
dc.creatorCano, Matías J.
dc.creatorTorre Zaffaroni, Joaquín
dc.date2016-09
dc.date2016-11-22
dc.date2016-11-22T16:15:23Z
dc.identifierhttp://sedici.unlp.edu.ar/handle/10915/56977
dc.identifierhttp://45jaiio.sadio.org.ar/sites/default/files/ASAI-10_0.pdf
dc.identifierissn:2451-7585
dc.descriptionIn this paper we propose the application of feature hashing to create word embeddings for natural language processing. Feature hashing has been used successfully to create document vectors in related tasks like document classification. In this work we show that feature hashing can be applied to obtain word embeddings in linear time with the size of the data. The results show that this algorithm, that does not need training, is able to capture the semantic meaning of words.We compare the results against GloVe showing that they are similar. As far as we know this is the first application of feature hashing to the word embeddings problem and the results indicate this is a scalable technique with practical results for NLP applications.
dc.descriptionSociedad Argentina de Informática e Investigación Operativa (SADIO)
dc.formatapplication/pdf
dc.format33-40
dc.languageen
dc.rightshttp://creativecommons.org/licenses/by-sa/3.0/
dc.rightsCreative Commons Attribution-ShareAlike 3.0 Unported (CC BY-SA 3.0)
dc.subjectCiencias Informáticas
dc.subjectfeature hashing
dc.subjectword embedding
dc.subjectNatural Language Processing
dc.titleHash2Vec: Feature Hashing for Word Embeddings
dc.typeObjeto de conferencia
dc.typeObjeto de conferencia


Este ítem pertenece a la siguiente institución