Article
PlaM-DeP: una plataforma modular para el desarrollo y evaluación de algoritmos de detección de plagio académico
Fecha
2014-06Autor
Fajardo Heras, Hernán
Barrera Maura, Manuel
Robles Bykbaev, Vladimir
Timbi Sisalima, Cristian
Calle Ortiz, Eduardo
Institución
Resumen
In this paper we present a software platform model to
develop and evaluate plagiarism detection algorithms.
The platform is based in a scalable modular design,
and implements several services to perform automatically
the following tasks: syntactic and semantic
analysis through WordNet and FreeLing, automatic
text extraction of multiple file formats (PDF, Word
and text), web page content extraction (using some
search engines like Google, Yandex, Yahoo, Bing),
and storage, load and use of plagiarism detection
algorithms. These services allow a programmer to
develop a code focusing the effort on the design of
the algorithm and the mathematical/statistical basis.
The platform was tested using several text queries
(n-grams), and currently the performance results are
promising.