dc.creator | Novais, Eder Miranda de | |
dc.creator | Paraboni, Ivandre | |
dc.date.accessioned | 2014-09-04T14:01:03Z | |
dc.date.accessioned | 2018-07-04T16:51:19Z | |
dc.date.available | 2014-09-04T14:01:03Z | |
dc.date.available | 2018-07-04T16:51:19Z | |
dc.date.created | 2014-09-04T14:01:03Z | |
dc.date.issued | 2011 | |
dc.identifier | Journal of the Brazilian Computer Society, Guildford, v. 19, n. 2, p. 135–146, jun. 2013 | |
dc.identifier | 0104-6500 | |
dc.identifier | http://www.producao.usp.br/handle/BDPI/46085 | |
dc.identifier | 10.1007/s13173-012-0095-1 | |
dc.identifier | http://download.springer.com/static/pdf/70/art%253A10.1007%252Fs13173-012-0095-1.pdf?auth66=1406990203_10bc76a17386ac45f79d93e8d9293943&ext=.pdf | |
dc.identifier.uri | http://repositorioslatinoamericanos.uchile.cl/handle/2250/1641285 | |
dc.description.abstract | As in many other natural language processing (NLP) fields, the use of statistical methods is now part of mainstream natural language generation (NLG). In the development of systems of this kind, however, there is the issue of data sparseness, a problem that is particularly evident in the case of morphologically-rich languages such as Portuguese. This work presents a shallow surface realisation system that makes use of factored language models (FLMs) of Portuguese to overcome some of these difficulties. The system combines FLMs trained on a large corpus with a number of NLP resources that have been made publicly available by the Brazilian NLP research community in recent years, such as corpora, dictionaries, thesauri and others. Our FLM-based approach to surface realisation has been successfully applied to the generation of Brazilian newspapers headlines, and the results are shown to outperform a number of statistical and non-statistical baseline systems alike | |
dc.language | eng | |
dc.publisher | Springer | |
dc.publisher | Guildford | |
dc.relation | Journal of the Brazilian Computer Society | |
dc.rights | Copyright The Brazilian Computer Society | |
dc.rights | restrictedAccess | |
dc.subject | Natural language generation | |
dc.subject | Text generation | |
dc.subject | Surface realisation | |
dc.title | Portuguese text generation using factored language models | |
dc.type | Artículos de revistas | |