A List of Social Tagging Datasets Made Available for Research
This list is not exhaustive - help expand it!
Social Tagging Systems | Research Group | Source | Year Obtained | Availability | Contact | References |
CiteULike | Oversity Ltd. | Primary | Daily Snapshots | Via Download after Email (link) | Richard Cameron | |
Bibsonomy | KDE | Primary | Periodical Snapshots every half year | Available after signed license agreement | Andreas Hotho | [Hotho 2006] |
MovieLens | GroupLens | Primary | 2009 | Via Download (link) | GroupLens Info | [Sen 2006] |
GiveALink | NaN Group | Primary | Current information via API | Via API | Filippo Menczer | [Markines 2009] |
ESP Game | Luis von Ahn | Primary | 2006 | Via Download (link) | Luis von Ahn | [VonAhn 2004] |
Delicious | DAI Labor | Secondary | 2007/2008 | Via Email Request | Alan Said | [Wetzker 2006] |
Delicious, Stumble Upon & Wikipedia | NLP and Information Retrieval Group | Secondary | 2008/2009 | Via Download (link) | Arkaitz Zubiaga | [Zubiaga 2009a] [Zubiaga 2009b] [Zubiaga 2009c] |
Delicious, Flickr, Last.fm, zexe.net | TAGora | Secondary | 2006, 2007, 2008 | Via Download (link) | Vittorio Loreto | |
Delicious, Flickr, Diigo, Bibsonomy and others | Agents and Social Computation | Secondary | 2009 | Via Email Request | Markus Strohmaier | [Grahsl 2010] |
In case you are aware of other available datasets, please let me know by leaving a comment on a corresponding blog post.
Page updated and maintained by Markus Strohmaier.
References
[Grahsl 2010] H.P. Grahsl, C. Körner, M. Strohmaier. A Collection of Tagging Datasets Containing Complete Personomies From Heterogeneous Sources. Technical Report, Knowledge Management Institute, Graz University of Technology. To be published in 2010
[Hotho 2006] A. Hotho, R. Jäschke, C. Schmitz, and G. Stumme. BibSonomy: A Social Bookmark and Publication Sharing System. In Aldo de Moor, Simon Polovina, and Harry Delugach, editors, Proceedings of the Conceptual Structures Tool Interoperability Workshop at the 14th International Conference on Conceptual Structures, Aalborg, Denmark
[Markines 2009] B. Markines and F. Menczer. A Scalable, Collaborative Similarity Measure for Social Annotation Systems. Proc. 20th ACM Conf. on Hypertext and Hypermedia (HT).
[Sen 2006] S. Sen, S. K. Lam, A. M. Rashid, D. Cosley, D. Frankowski, J. Osterhouse, F. M. Harper, and J. Riedl. tagging, communities, vocabulary, evolution. In CSCW '06: Proceedings of the 2006 20th Anniversary Conference on Computer Supported Cooperative Work, pages 181-190, New York, NY, USA, 2006. ACM.
[VonAhn 2004] L. von Ahn and L. Dabbish. Labeling Images with a Computer Game. ACM Conference on Human Factors in Computing Systems, CHI 2004. pp 319-326.
[Wetzker 2008] R. Wetzker, C. Zimmermann, and C. Bauckhage. Analyzing Social Bookmarking Systems: A Delicious cookbook. In Mining Social Data (MSoDa) Workshop Proceedings, pp. 26-30. ECAI 2008, (July 2008).
[Zubiaga 2009a] A. Zubiaga, R. Mart穩nez, and V. Fresno. Getting the Most Out of Social Annotations for Web Page Classification. Proceedings of DocEng 2009, the 9th ACM Symposium on Document Engineering, pp. 74-83, Munich, Germany. 2009.
[Zubiaga 2009b] A. Zubiaga, A. P. Garc穩a-Plaza, V. Fresno, and R. Mart穩nez. Content-based Clustering for Tag Cloud Visualization. Proceedings of ASONAM 2009, International Conference on Advances in Social Networks Analysis and Mining. 2009.
[Zubiaga 2009c] A. Zubiaga. Enhancing Navigation on Wikipedia with Social Tags. Wikimania 2009. Buenos Aires, Argentina. 2009.
Last edited on December 7, 2009 (Christian Körner, Markus Strohmaier)
http://www.markusstrohmaier.info/datasets/
另外:http://www.tagora-project.eu/data/
A List of Social Tagging Datasets Made Available for Research的更多相关文章
- 近年Recsys论文
2015年~2017年SIGIR,SIGKDD,ICML三大会议的Recsys论文: [转载请注明出处:https://www.cnblogs.com/shenxiaolin/p/8321722.ht ...
- Install SharePoint 2013 on Windows Server 2012 without a domain
Any setup of Team Foundation Server is not complete until you have at least tried t work with ShareP ...
- Use of Deep Learning in Modern Recommendation System: A Summary of Recent Works(笔记)
注意:论文中,很多的地方出现baseline,可以理解为参照物的意思,但是在论文中,我们还是直接将它称之为基线,也 就是对照物,参照物. 这片论文中,作者没有去做实际的实验,但是却做了一件很有意义的事 ...
- 关于LDA的文章
转:http://www.zhizhihu.com/html/y2011/3228.html l Theory n Introduction u Unsupervised learning by ...
- Link-based Classification相关数据集
Link-based Classification相关数据集 Datasets Document Classification Datasets: CiteSeer: The CiteSeer dat ...
- Open Data for Deep Learning
Open Data for Deep Learning Here you’ll find an organized list of interesting, high-quality datasets ...
- SharePoint 2010 搜索结果没有显示部分文件
Why SharePoint 2010 search does not show some results? SharePoint 2010 search is better than ever ...
- 论文翻译——Character-level Convolutional Networks for Text Classification
论文地址 Abstract Open-text semantic parsers are designed to interpret any statement in natural language ...
- paper 118:计算机视觉、模式识别、机器学习常用牛人主页链接
牛人主页(主页有很多论文代码) Serge Belongie at UC San Diego Antonio Torralba at MIT Alexei Ffros at CMU Ce Liu at ...
随机推荐
- 快速解析超大XML不占用太大内存
import xml.etree.ElementTree as ET def parse_res(xml_file): res_dic = {} tmp_lst_lev1 = [] tmp_lst_l ...
- Spring JDBC
转载:博客主页:http://blog.csdn.NET/chszs 一.概述 在Spring JDBC模块中,所有的类可以被分到四个单独的包:1)core即核心包,它包含了JDBC的核心功能.此包内 ...
- iScroll-5 API 中文版
http://wiki.jikexueyuan.com/project/iscroll-5/ http://www.mamicode.com/info-detail-331827.html IScro ...
- springmvc 学习笔记_1
一.回顾JavaWeb中的MVC设计模式 1)MVC这种设计模式,不光运用于Web领域,而且也能用于非Web领域 2)今天说的MVC特指一种表现层设计模式,不限于Java语言 二.回顾struts2+ ...
- SVN和CVS的区别
对版本控制就有了一定的理解,同时也应该知道SVN与CVS是比较流行的两款SCM工具.那么到底这两款工具有什么区别呢? 1.版本编号方面 例如,我们的版本库为A,其中有文件a,b,c. 在SVN中,新版 ...
- 转:jsp页面显示中文乱码解决方案
jsp页面显示中文乱码: jsp页面的编码方式有两个地方需要设置: <%@ page language="java" import="java.util.*&quo ...
- 图片javascript缩小
function SetSize(obj, width, height) { myImage = new Image(); myImage.src = obj.src; if (myImage.wid ...
- 2-6 Working with Lambdas
在C++中使用匿名函数,格式如下:[] () {}; Using a Lambda to Print array Values #include <algorithm> #include ...
- Ext.Net TreePanel 修改Icon图标
分类: Ext.Net2012-09-24 13:44 1779人阅读 评论(0) 收藏 举报 webformserverextassemblyxhtmlobject 1.默认icon 2.自定义ic ...
- My安卓知识3--多个activity之前共享数据的方法
在网上搜这个问题的时候看到了有一篇文章说有五种方法: 1.基于消息的通信机制 Intent ---boudle ,extra 数据类型有限,比如遇到不可序列化的数据Bitmap,InputStrea ...