Autores: | Mihai Surdeanu |
URL: | http://www.surdeanu.name/mihai/bios/ |
Contacto: | Mihai Surdeanu <mihaisurdeanu.name> |
Descripción
Suite of Syntactico-Semantic Analyzers. Includes a named-entity recognizer, a syntactic chunker, a POS tagger, and a “smart” tokenizer. All processors are learned using the MiLL machine learning library (see below).
Funcionalidad
-
Tecnología
Java
Requisitos técnicos
MiLL machine learning library, TnT tagger, YamChA.
Módulos
Smart tokenizer that recognizes abbreviations, SGML tags etc.; Part-of-speech (POS) tagger. The POS tagger is implemented as a a wrapper around the TNT tagger by Thorsten Brants; Syntactic chunking using the labels promoted by the CoNLL chunking evaluations; Named-Entity Recognition and Classification (NERC) for the CoNLL entity types plus an additional 11 numerical entity types.
Innovación
-
Desarrollo
-
Publicaciones
-