Autores: | Omar Trigui and Lamia Hadrich Belguith / University of Sfax (Tunisia) |
URL: | https://sites.google.com/site/anlprg/outils-et-corpus-realises |
Contacto: | Omar Trigui <omar.trigui |
Descripción
ADQA Corpus – Arabic Definition Question Answering corpus. This corpus is constituted of a list of 50 definition questions (ArabicListDefQuest), a set of 50 files containing snippets collected from Wikipedia search engine (ArabicCorpusWikipedia), a set of 50 files containing snippets from Google search engine (ArabicCorpusGoogle) and a set of 50 files which each file contains a question with their answers (ArabicListDefAnsw from -Google+Wikipedia-).
Funcionalidad
Tecnología
Requisitos técnicos
No special hardware/software is required. Disk space required: 235 Kbytes.
Módulos
Innovación
A first corpus corpus collected from the web and a set of definition questions with their answers. Development: MICINN research project TEXT-ENTERPRISE 2.0 TIN2009-13391-C04-03 (Plan I+D+i). This corpus was generated as part of the Ph.D. work of Omar Trigui under the supervision of Lamia Hadrich Belguith and Paolo Rosso.
Desarrollo
- MICINN research project TEXT-ENTERPRISE 2.0 TIN2009-13391-C04-03 (Plan I+D+i).
- This corpus was generated as part of the Ph.D. work of Diego Ingaramo under the supervision of Marcelo Errecalde (external researcher of TEXT-ENTERPRISE 2.0) and Paolo Rosso.
Publicaciones
- Trigui O., Hadrich-Belguith L., Rosso P. An Automatic Definition Extraction in Arabic Language. In: Proc. 15th Int. Conf. on Applications of Natural Language to Information Systems, NLDB-2010, Springer-Verlag, LNCS(6177), pp. 240-247, 2010.
- Trigui O., Hadrich-Belguith L., Rosso P. DefArabicQA: Arabic Definition Question Answering System. In: Proc. Workshop on LR & HLT for Semitic Languages, 7th Int. Conf. on Language Resources and Evaluation, LREC-2010, Malta, May 17-23, pp. 40-44, 2010.