Autores: | Yassine Benajiba and Lahsen Abouenour (Ph.D. students) and Paolo Rosso. |
URL: | http://www.dsic.upv.es/grupos/nle/ |
Contacto: | Yassine Benajiba <benajibayassine |
Descripción
This corpus includes Spanish journalistic texts, more precisely, it is a collection of news extracted from El Periódico de Catalunya. It has been manually annotated at a syntactic ( phrases and syntactic function) and semantic level (semantic roles, semantic constructions and sense disambiguation). The corpus has approximately 700.000 words. It contains sentences with the 250 more frequent verbs in Spanish.
Funcionalidad
The interface (http://grial.uab.es/search) allows simple and advanced searches on the corpus by different fields, including the negative search. The XML corpus can be downloaded.
Tecnología
Documents: from the web, Questions and answers: manually built.
Requisitos técnicos
In order to use these data, it is required to have an Arabic Question system. It is also possible to use these data to test only some modules of the QA system (I.e. the QA system is not required to be fully built).
Módulos
None.
Innovación
To our knowledge, these data is the only freely available Arabic test-platform for Arabic Question Answering.
Desarrollo
Developed as part of Yassine Benajiba’s AECI Ph.D. and the MiDES CICYT TIN2006-15265-C06-04 research project, co-funded by the AECI-PCI A01031707 and A706706 projects.
Publicaciones
- Benajiba Y., Rosso P., Gómez J.M. Adapting JIRS Passage Retrieval System to the Arabic. In: Proc. 8th Int. Conf. on Comput. Linguistics and Intelligent Text Processing, CICLing-2007, Springer-Verlag, LNCS(4394), pp. 530-541, 2007.
- Abouenour L., Bouzoubaa K., Rosso P. Improving Q/A using Arabic WordNet. In: Proc. Int. Arab Conf. on Information Technology, ACIT-2008, Hammamet, Tunisia, December, 2008.
- Abouenour L., Bouzoubaa K., Rosso P. Construction de l’ontologie Amine Arabic WordNet dans le cadre des systèmes Q/A. (in French) In: Proc. 2nd JOurnées Scientifiques en Technologies de l’Information et de la Communication JOSTIC-2008, Rabat, Marroco, October, 2008.
- Abouenour L., Bouzoubaa K., Rosso P. Towards an Arabic Q/A system using a conceptual/lexical ontology. (in Arabic) In: Proc. Proc. 5th Conf. on Scientific Research Outlook & Technology Development in the Arab world, SROV, Fez, Marroco, October, 11-16, 2008