Autores: | Musa Alkhalifa, Manuel Bertran, William J. Black, Sabri Elkateb, Javier Farreres, David Farwell, Christiane Fellbaum, James Kirk, M. Antònia Martí, Adam Pease, Horacio Rodríguez, Piek Vossen |
URL: | http://www.globalwordnet.org/AWN |
Contacto: | Horacio Rodrígez <horacio |
Descripción
The Arabic WordNet (AWN) is a lexical database of the Arabic language following the development process of Princeton English WordNet and Euro WordNet. It utilizes the Suggested Upper Merged Ontology as an interlingua to link Arabic WordNet to previously developed wordnets. Christiane Fellbaum at Princeton was the project lead. The project was sponsored by DOI/REFLEX. From http://www.globalwordnet.org/AWN/DataSpec.html you can get the XML data exchange specifications of the database. Roughly AWN contains 11,000 synsets (including 1,000 NE). Figures of the current coverage of AWN can be obtained from: http://www.lsi.upc.edu/~mbertran/arabic/awn/query/sug_statistics.php There are several different ways for accessing the database: 1) The browser package (available at http://sourceforge.net/projects/awnbrowser) includes the AWN data and PWN2.0 mappings in a relational database. You can use the export facilities to export the data as XML or CSV and do what you will with it in textual form. 2) The browser source code contains methods written in Java which access the DB in order to retrieve the data structures the browser displays. The AWN code has been placed on Sourceforge at http://awnbrowser.cvs.sourceforge.net/awnbrowser/source/awnbrowser. There’s 2-page document about the source code there too. 3) The database can also be downloaded in xml format (linked to pwn 2.0) from http://www.lsi.upc.edu/~mbertran/arabic/bd/get_bd.php. If you are interested on data connected with other English WN versions you can get it from http://www.lsi.upc.edu/~mbertran/arabic/bd/get_bd.php?ver={20,21,30,awn}. 4) A set of basic python functions for accessing the database can be obtained from: http://www.lsi.upc.edu/~horacio/varios/AWNDatabaseManagement.py.gz In addition to the AWN database and the AWN browser some other complementary software has been produced including: 1) The AWN Lexicographer’s Web Interface (Barcelona) http://www.lsi.upc.edu/~mbertran/arabic/awn/update/synset_browse.php . 2) The AWN User’s Web Interface (Barcelona) http://www.lsi.upc.edu/~mbertran/arabic/awn/index.html . 3) The AWN word spotter can be accessed at: http://www.lsi.upc.edu/~mbertran/arabic/wwwWn7/.
Funcionalidad
AWN Browser: Browsing the database . AWN can be downloaded in XML format and access its content be directly accessed.
Tecnología
Java, Perl, MySQL
Requisitos técnicos
Módulos
Innovación
One of the most important lexical resources for Arabic language.
Desarrollo
Publicaciones
- Christiane Fellbaum, Musa Alkhalifa, William J. Black, Sabri Elkateb, Adam Pease, Horacio Rodríguez, Piek Vossen (2006). Introducing the Arabic WordNet project. Proceedings of the 3rd Global Wordnet Conference, Jeju Island, Korea, January, 2006.
- Christiane Fellbaum, Musa Alkhalifa, William J. Black, Sabri Elkateb, Adam Pease, Horacio Rodríguez, Piek Vossen (2006). Building a WordNet for Arabic. Proceedings of the the 5th Conference on Language Resources and Evaluation LREC2006, May, 2006.
- Horacio Rodríguez, David Farwell, Javi Farreres, Manuel Bertran, Musa Alkhalifa, M. Antonia Martí, William Black, Sabri Elkateb, James Kirk, Adam Pease, Piek Vossen, and Christiane Fellbaum. Arabic WordNet: Current State and Future Extensions in: Proceedings of the Fourth International GlobalWordNet Conference – GWC 2008, Szeged, Hungary, January 22-25, 2008.
- Horacio Rodríguez, David Farwell, Javi Farreres, Manuel Bertran, Musa Alkhalifa, M. Antonia Martí (2008). Arabic WordNet: Semi-automatic Extensions using Bayesian Inference. Proceedings of the the 6th Conference on Language Resources and Evaluation LREC2008. Marrakech (Morocco), May 2008.