Actas de congresos
FakeRecogna: A New Brazilian Corpus for Fake News Detection
Fecha
2022-01-01Registro en:
Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), v. 13208 LNAI, p. 57-67.
1611-3349
0302-9743
10.1007/978-3-030-98305-5_6
2-s2.0-85127101959
Autor
Universidade Estadual Paulista (UNESP)
Institución
Resumen
Fake news has become a research topic of great importance in Natural Language Processing due to its negative impact on our society. Although its pertinence, there are few datasets available in Brazilian Portuguese and mostly comprise few samples. Therefore, this paper proposes creating a new fake news dataset named FakeRecogna that contains a greater number of samples, more up-to-date news, and covering a few of the most important categories. We perform a toy evaluation over the created dataset using traditional classifiers such as Naive Bayes, Optimum-Path Forest, and Support Vector Machines. A Convolutional Neural Network is also evaluated in the context of fake news detection in the proposed dataset.