Autores: | Alberto Barrón-Cedeño, Paolo Rosso, Sobha Lalitha Devi, Paul Clough, Mark Stevenson |
URL: | http://users.dsic.upv.es/grupos/nle/fire-workshop-clitr.html |
Contacto: | - |
Descripción
The corpus contains a set of potential source documents D, written in English, and set of suspicious documents S, written in Hindi. In the corpus you will find plain text files encoded in UTF-8. The source documents are taken from English Wikipedia. The source documents include Wiki-mark up.
Funcionalidad
-
Tecnología
-
Requisitos técnicos
-
Módulos
-
Innovación
-
Desarrollo
-
Publicaciones
-