FakeRecogna: A New Brazilian Corpus for Fake News Detection

Conference proceedings

Date

2022-01-01

Record in:

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), v. 13208 LNAI, p. 57-67.

1611-3349

0302-9743

http://hdl.handle.net/11449/234317

10.1007/978-3-030-98305-5_6

2-s2.0-85127101959

https://repositorioslatinoamericanos.uchile.cl/handle/2250/5414418

Author

Universidade Estadual Paulista (UNESP)

Institution

Universidade Estadual Paulista (Brazil)

Abstract

Fake news has become a research topic of great importance in Natural Language Processing due to its negative impact on society. Despite its relevance, few datasets are available in Brazilian Portuguese, and most of them comprise few samples. Therefore, this paper proposes a new fake news dataset named FakeRecogna, which contains a greater number of samples, includes more up-to-date news, and covers a few of the most important categories. We perform a toy evaluation on the created dataset using traditional classifiers such as Naive Bayes, Optimum-Path Forest, and Support Vector Machines. A Convolutional Neural Network is also evaluated for fake news detection on the proposed dataset.

Subjects

Corpus

Fake news

Portuguese
