Authors: | IXA group and Asier Gabiola |
URL: | http://ixa3.si.ehu.es/wsd-demo |
Contact: | Iñaki Alegria |
Description
The word sense disambiguation (WSD) system is based on the well-known Support Vector Machine (SVM) algorithm. It has been trained on the EuSemCor corpus, the only semantically tagged Basque corpus. Because of the corpus's small size, the WSD system has been trained for 402 polysemous nouns.
Functionality
A Perl CGI script runs the raw input text through the Eustagger Basque lemmatizer to extract features. The resulting feature vector is then classified by the SVM-based WSD system. Finally, the CGI script combines the classifier and lemmatizer output and displays it in a readable format.
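The three stages described above (lemmatize, extract features, classify, then format) can be sketched as follows. This is a minimal illustration in Python, not the actual Perl CGI code: the `lemmatize` function is a toy stand-in for Eustagger, the context-window features are an assumed feature set, and `TOY_MODELS` holds made-up linear weights in place of trained SVM models. The example words and sense labels are invented for the sketch.

```python
from typing import Dict, List, Optional, Tuple

# Hypothetical stand-in for Eustagger: a toy token -> lemma lookup.
TOY_LEMMAS: Dict[str, str] = {"bankuan": "banku", "dirua": "diru"}

# Hypothetical per-noun linear models (sense -> feature weights).
# A trained SVM would supply these; the values here are made up.
TOY_MODELS: Dict[str, Dict[str, Dict[str, float]]] = {
    "banku": {
        "banku#finance": {"ctx=diru": 1.0},
        "banku#seat": {"ctx=parke": 1.0},
    }
}


def lemmatize(text: str) -> List[Tuple[str, str]]:
    """Return (token, lemma) pairs; unknown tokens are lowercased as-is."""
    return [(tok, TOY_LEMMAS.get(tok.lower(), tok.lower())) for tok in text.split()]


def extract_features(lemmas: List[str], index: int, window: int = 2) -> Dict[str, float]:
    """Bag of lemmas within +/- window positions of the target word."""
    lo, hi = max(0, index - window), min(len(lemmas), index + window + 1)
    return {"ctx=" + lemmas[i]: 1.0 for i in range(lo, hi) if i != index}


def classify(lemma: str, feats: Dict[str, float]) -> Optional[str]:
    """Pick the highest-scoring sense, or None if the lemma has no model."""
    model = TOY_MODELS.get(lemma)
    if model is None:
        return None
    scores = {
        sense: sum(weights.get(name, 0.0) * value for name, value in feats.items())
        for sense, weights in model.items()
    }
    return max(scores, key=scores.get)


def disambiguate(text: str) -> str:
    """Run the whole pipeline and format the result as token/sense."""
    pairs = lemmatize(text)
    lemmas = [lemma for _, lemma in pairs]
    out = []
    for i, (tok, lemma) in enumerate(pairs):
        sense = classify(lemma, extract_features(lemmas, i))
        out.append(f"{tok}/{sense}" if sense else tok)
    return " ".join(out)


print(disambiguate("dirua bankuan"))
```

Only words that have a trained model receive a sense tag; every other token passes through unchanged, which mirrors the restriction to a fixed set of polysemous nouns.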
Technology
C, C++, Perl.
Technical requirements
-
Modules
Perl CGI script, EuSemCor database (MySQL), Eustagger, SVM-light.
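Because SVM-light reads examples as text lines of the form `<target> <feature>:<value> ...`, with integer feature ids in ascending order, the hand-off from the feature extractor to the classifier could look roughly like the sketch below. The function name `to_svmlight_line`, the feature names, and the use of a +1/-1 target per sense are assumptions made for illustration; the source does not describe how the system encodes its examples.

```python
from typing import Dict


def to_svmlight_line(target: int, feats: Dict[str, float], index: Dict[str, int]) -> str:
    """Serialize one example as '<target> <id>:<value> ...' with ids ascending.

    `index` maps feature names to integer ids and is updated in place,
    so the same mapping can be reused across examples.
    """
    pairs = []
    for name, value in feats.items():
        # Unseen feature names get the next free id, starting at 1.
        fid = index.setdefault(name, len(index) + 1)
        pairs.append((fid, value))
    pairs.sort()  # SVM-light expects feature ids in increasing order
    body = " ".join(f"{fid}:{value:g}" for fid, value in pairs)
    return f"{target:+d} {body}".rstrip()


feature_index: Dict[str, int] = {}
print(to_svmlight_line(1, {"ctx=diru": 1.0, "ctx=parke": 0.5}, feature_index))
print(to_svmlight_line(-1, {"ctx=parke": 1.0}, feature_index))
```

Sharing one `feature_index` between training and classification keeps the feature ids consistent, which is required for the trained model to be applied to new input.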
Innovation
First online WSD system for Basque.
Development
-
Publications
- Agirre E. and Martinez D. 2004. The Basque Country University system: English and Basque tasks. Proceedings of the 3rd ACL workshop on the Evaluation of Systems for the Semantic
- Martinez D. 2005. Supervised Word Sense Disambiguation: Facing Current Challenges. SEPLN Journal. Vol. 34, pp. 125-126.