Autores: | Fermín L. Cruz Mata, José Troyano Jiménez, Pablo Montoya |
URL: | http://www.lsi.us.es/~fermin/TBOD.tar.gz |
Contacto: | Fermín L. Cruz , University of Seville, <fcruzus.es> |
Descripción
This dataset contains annotated reviews for three different domains: cars, headphones and hotels. Opinions are annotated at the feature level, with the following fields:
Required:
- polarity: positive (+) or negative (-).
- feature: a feature from the feature taxonomy.
- opWords: opinion words. The minimum set of words from the sentence from which you can decide the polarity of this opinion.
Optional:
- featWords: feature words. A set of words from the sentence naming the feature.
- potency: potency words. A set of words from the sentence which affect the strength of the opinion.
The feature taxonomy for each domain is defined in an xml file (featureTaxonomy.xml). For each feature, a set of feature words is included.
Funcionalidad
-
Tecnología
-
Requisitos técnicos
-
Módulos
-
Innovación
-
Desarrollo
Partially funded by Ministerio de Educación y Ciencia (HUM2007-6607-C04-04).
Publicaciones
- F. L. Cruz, J. A. Troyano, F. Enríquez, J. Ortega, and C. G.Vallejo. Knowledge-rich approach to feature-based opinion extraction from product reviews In Proceedings of the 2nd International Workshop on Search and Mining User-Generated Contents, pages 130. ACM, 2010.
- Fermín L. Cruz, José A. Troyano, Fernando Enríquez, F. Javier Ortega, Carlos G. Vallejo: ‘Long autonomy or long delay?’ The importance of domain in opinion mining. Expert Syst. Appl. 40(8): 3174-3184 (2013)