Actas de congresos
Towards a phonetic Brazilian Portuguese spell checker
Fecha
2014-10Registro en:
International Conference on Computational Processing of the Portuguese Language, 11th; Workshop on Tools and Resources for Automatically Processing Portuguese and Spanish, 1st, 2014, São Carlos.
Autor
Avanço, Lucas Vinicius
Duran, Magali Sanches
Nunes, Maria das Graças Volpe
Institución
Resumen
Spell checking is no longer considered a big challenge for natural language pro-cessing, at least regarding the task of correcting documents during edition. Nevertheless, without human interaction, it is necessary to automatically choose the word that will more likely correct the misspelled word. Also, there is a further difficulty for spell checking: new types of errors on the web material have emerged due to the increasing participation of gen-eral public, especially when expressing opinions, feelings and requests, which take many characteristics from the spoken language. This paper presents the first efforts towards a new Brazilian Portuguese (BP) spell checker to deal with the challenges that emerged in the au-tomatic processing of a web corpus, including a new phonetic algorithm to specifically ad-dress spelling correction in BP. The speller proposed here is able to correct 16% more words than Aspell, in a web corpus composed of reviews of products.