Authors: | Arkaitz Zubiaga |
URL: | http://nlp.uned.es/social-tagging/socialodp2k9/ |
Contact: | Arkaitz Zubiaga <azubiaga@lsi.uned.es> |
Description
Social-ODP-2k9 is a dataset created during December 2008 and January 2009 with data retrieved from the social bookmarking sites Delicious and StumbleUpon, the Open Directory Project (ODP), and the Web. It comprises 12,616 unique web documents, along with their corresponding social annotations and their classification data according to the ODP.
Functionality
Web page classification, analysis of social annotations, etc.
Technology
Data stored in XML format.
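Because the data is distributed as XML, it can be read with any standard XML parser. The following is a minimal Python sketch of that idea, not the dataset's documented layout: the element and attribute names (`documents`, `document`, `url`, `category`, `tag`, `count`) and the helper `load_documents` are hypothetical and should be replaced with the names used in the actual files.

```python
import xml.etree.ElementTree as ET

# Hypothetical sample: the element and attribute names below are
# illustrative assumptions, not the dataset's documented schema.
SAMPLE_XML = """\
<documents>
  <document url="http://example.org/">
    <category>Computers</category>
    <tags>
      <tag count="3">programming</tag>
      <tag count="1">reference</tag>
    </tags>
  </document>
</documents>
"""


def load_documents(xml_text):
    """Return one dict per <document> with its URL, category, and tag counts."""
    root = ET.fromstring(xml_text)
    records = []
    for doc in root.iter("document"):
        # Map each tag string to the number of users who assigned it
        # (defaulting to 1 when no count attribute is present).
        tags = {tag.text: int(tag.get("count", "1")) for tag in doc.iter("tag")}
        records.append({
            "url": doc.get("url"),
            "category": doc.findtext("category"),
            "tags": tags,
        })
    return records


docs = load_documents(SAMPLE_XML)
```

Records in this shape (URL, category label, tag counts) are a convenient starting point for the classification and annotation-analysis uses listed under Functionality.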
Technical requirements
Modules
Innovation
To the best of our knowledge, this is the only dataset including social tagging data along with crawled documents and category data.
Development
The generation of this dataset was partially funded by the Regional Government of Madrid under the Research Network MAVIR (S-0505/TIC-0267), by the Regional Ministry of Education of the Community of Madrid, and by the Spanish Ministry of Science and Innovation under the project QEAVis-Catiex (TIN2007-67581-C02-01).
Publications
Arkaitz Zubiaga, Raquel Martínez, and Víctor Fresno. Getting the Most Out of Social Annotations for Web Page Classification. Proceedings of DocEng 2009, the 9th ACM Symposium on Document Engineering, pp. 74-83, Munich, Germany. 2009.