学习机器学习 数据处理时 找到的这些链接 可以在上面下载到开源的研究数据数据
2013-12-07 14:40
501 查看
美国政府数据 http://www.data.gov/
Movies Recommendation:
MovieLens - Movie Recommendation Data Sets http://www.grouplens.org/node/73
Yahoo! - Movie, Music, and Images Ratings Data Sets http://webscope.sandbox.yahoo.com/catalog.php?datatype=r
Jester - Movie Ratings Data Sets (Collaborative Filtering Dataset) http://www.ieor.berkeley.edu/~goldberg/jester-data/
Cornell University - Movie-review data for use in sentiment-analysis experiments http://www.cs.cornell.edu/people/pabo/movie-review-data/
Music Recommendation:
Last.fm - Music Recommendation Data Sets http://www.dtic.upf.edu/~ocelma/MusicRecommendationDataset/index.html
Yahoo! - Movie, Music, and Images Ratings Data Sets http://webscope.sandbox.yahoo.com/catalog.php?datatype=r
Audioscrobbler - Music Recommendation Data Sets http://www-etud.iro.umontreal.ca/~bergstrj/audioscrobbler_data.html
Amazon - Audio CD recommendations http://131.193.40.52/data/
Books Recommendation:
Institut für Informatik, Universitt Freiburg - Book Ratings Data Sets http://www.informatik.uni-freiburg.de/~cziegler/BX/
Food Recommendation:
Chicago Entree - Food Ratings Data Sets http://archive.ics.uci.edu/ml/datasets/Entree+Chicago+Recommendation+Data
Merchandise Recommendation:
Amazon - Product Recommendation Data Sets http://131.193.40.52/data/
Healthcare Recommendation:
Nursing Home - Provider Ratings Data Set http://data.medicare.gov/dataset/Nursing-Home-Compare-Provider-Ratings/mufm-vy8d
Hospital Ratings - Survey of Patients Hospital Experiences http://data.medicare.gov/dataset/Survey-of-Patients-Hospital-Experiences-HCAHPS-/rj76-22dk
Dating Recommendation:
www.libimseti.cz - Dating website recommendation (collaborative filtering) http://www.occamslab.com/petricek/data/
Scholarly Paper Recommendation:
National University of Singapore - Scholarly Paper Recommendation http://www.comp.nus.edu.sg/~sugiyama/SchPaperRecData.html
proximity DBLP http://kdl.cs.umass.edu/data/dblp/dblp-info.html
DBLP-Citation-Network http://arnetminer.org/citation
KDD-2011 http://www.cs.uiuc.edu/~hbdeng/data/kdd2011.htm
CiteSeer (hardly) http://csxstatic.ist.psu.edu/about/data
CiteSeer dumped http://martinharrigan.blogspot.com/2008/07/citeseers-dataset.html
Cora (hardly) http://people.cs.umass.edu/~mccallum/data.html
IMDB http://www.imdb.com/interfaces/
Stanford class resources http://snap.stanford.edu/na09/resources.html
ICWSM twitter dataset: http://twitter.mpi-sws.org/data-icwsm2010.html
EBSN - Event-based social network dataset: http://www.largenetwork.org/ebsn
Other social network dataset: Slashdot, Enron email, Mit mobile, Epinions reviews.
Bing Liu's homepage
Movie Review http://www.cs.cornell.edu/people/pabo/movie-review-data/
Lee's homepage
twitter sentiment: http://www.sananalytics.com/lab/twitter-sentiment/
index2: http://mobblog.cs.ucl.ac.uk/datasets/
Million song dataset http://labrosa.ee.columbia.edu/millionsong/
Several graph dataset http://law.di.unimi.it/datasets.php
Delicious/Flikr/Last.FM etc http://www.tagora-project.eu/data/
A small dataset about links http://www.cs.umd.edu/projects/linqs/projects/lbc/index.html
A small dataset including citeseerx/imdb http://komarix.org/ac/ds/
Amazon
Both user-user and user-object
single-type user netwrok
Flickr, Youtube, twitter
signed user network
Epinion, Slashdot, Ciao
Multi-type user network
Facebook, Google plus
Movies Recommendation:
MovieLens - Movie Recommendation Data Sets http://www.grouplens.org/node/73
Yahoo! - Movie, Music, and Images Ratings Data Sets http://webscope.sandbox.yahoo.com/catalog.php?datatype=r
Jester - Movie Ratings Data Sets (Collaborative Filtering Dataset) http://www.ieor.berkeley.edu/~goldberg/jester-data/
Cornell University - Movie-review data for use in sentiment-analysis experiments http://www.cs.cornell.edu/people/pabo/movie-review-data/
Music Recommendation:
Last.fm - Music Recommendation Data Sets http://www.dtic.upf.edu/~ocelma/MusicRecommendationDataset/index.html
Yahoo! - Movie, Music, and Images Ratings Data Sets http://webscope.sandbox.yahoo.com/catalog.php?datatype=r
Audioscrobbler - Music Recommendation Data Sets http://www-etud.iro.umontreal.ca/~bergstrj/audioscrobbler_data.html
Amazon - Audio CD recommendations http://131.193.40.52/data/
Books Recommendation:
Institut für Informatik, Universitt Freiburg - Book Ratings Data Sets http://www.informatik.uni-freiburg.de/~cziegler/BX/
Food Recommendation:
Chicago Entree - Food Ratings Data Sets http://archive.ics.uci.edu/ml/datasets/Entree+Chicago+Recommendation+Data
Merchandise Recommendation:
Amazon - Product Recommendation Data Sets http://131.193.40.52/data/
Healthcare Recommendation:
Nursing Home - Provider Ratings Data Set http://data.medicare.gov/dataset/Nursing-Home-Compare-Provider-Ratings/mufm-vy8d
Hospital Ratings - Survey of Patients Hospital Experiences http://data.medicare.gov/dataset/Survey-of-Patients-Hospital-Experiences-HCAHPS-/rj76-22dk
Dating Recommendation:
www.libimseti.cz - Dating website recommendation (collaborative filtering) http://www.occamslab.com/petricek/data/
Scholarly Paper Recommendation:
National University of Singapore - Scholarly Paper Recommendation http://www.comp.nus.edu.sg/~sugiyama/SchPaperRecData.html
Information Network
DBLP http://www.informatik.uni-trier.de/~ley/db/proximity DBLP http://kdl.cs.umass.edu/data/dblp/dblp-info.html
DBLP-Citation-Network http://arnetminer.org/citation
KDD-2011 http://www.cs.uiuc.edu/~hbdeng/data/kdd2011.htm
CiteSeer (hardly) http://csxstatic.ist.psu.edu/about/data
CiteSeer dumped http://martinharrigan.blogspot.com/2008/07/citeseers-dataset.html
Cora (hardly) http://people.cs.umass.edu/~mccallum/data.html
IMDB http://www.imdb.com/interfaces/
Social Network
Stanford large network dataset (contains lots of network dataset): http://snap.stanford.edu/data/Stanford class resources http://snap.stanford.edu/na09/resources.html
ICWSM twitter dataset: http://twitter.mpi-sws.org/data-icwsm2010.html
EBSN - Event-based social network dataset: http://www.largenetwork.org/ebsn
Other social network dataset: Slashdot, Enron email, Mit mobile, Epinions reviews.
Sentiment and Option Mining
MPQA http://www.cs.pitt.edu/mpqa/index.htmlBing Liu's homepage
Movie Review http://www.cs.cornell.edu/people/pabo/movie-review-data/
Lee's homepage
twitter sentiment: http://www.sananalytics.com/lab/twitter-sentiment/
Recommendation
index1: https://gist.github.com/1653794index2: http://mobblog.cs.ucl.ac.uk/datasets/
Machine Learning
UCI dataset http://archive.ics.uci.edu/ml/datasets.htmlAudio Retrieval
CAL-500: http://twitterdata.org/Million song dataset http://labrosa.ee.columbia.edu/millionsong/
Miscellaneous1
A lot graph dataset including several cups, twitter etc http://graphlab.org/downloads/datasets/Several graph dataset http://law.di.unimi.it/datasets.php
Delicious/Flikr/Last.FM etc http://www.tagora-project.eu/data/
A small dataset about links http://www.cs.umd.edu/projects/linqs/projects/lbc/index.html
A small dataset including citeseerx/imdb http://komarix.org/ac/ds/
Miscellaneous2
Only user-objectAmazon
Both user-user and user-object
single-type user netwrok
Flickr, Youtube, twitter
signed user network
Epinion, Slashdot, Ciao
Multi-type user network
Facebook, Google plus
相关文章推荐
- 学习机器学习 数据处理时 找到的这些链接 可以在上面下载到开源的研究数据数据
- ASP.NET MVC WebApi 返回数据类型序列化控制(json,xml) 用javascript在客户端删除某一个cookie键值对 input点击链接另一个页面,各种操作。 C# 往线程里传参数的方法总结 TCP/IP 协议 用C#+Selenium+ChromeDriver 生成我的咕咚跑步路线地图 (转)值得学习百度开源70+项目
- 研究了两套经典源码,翻译了其中的注释,有学习需要的朋友可以下载
- 大数据学习笔记文档下载链接
- Google研究 | 联合学习:无需集中存储训练数据的协同机器学习
- Mono v1.2.51 - 开源版本的.NET框架,Mono,Mono下载,Mono框架开发,Mono学习,Mono是什么,Mono浅谈,Mono研究
- 跟着教程学习MP3播放器编写,遇到奇怪事,扩展名为MP3、jpg、gif的都可以下载,唯独lrc的下载总失败,终于找到原因!
- 大数据学习笔记文档下载链接
- 一组可以学习,可以考虑移植到PlayBook上面的开源Adobe AIR应用(hosted by google code)
- 研究发现:开源可以帮你找到工作
- 机器学习模式识别数据挖掘数据集下载链接
- 机器学习模式识别数据挖掘数据集下载链接
- USGS官方的各种卫星数据产品的详细说明(很多英文缩写都可在这里找到详细信息,如ETM+和TM的意思,也可以下载)
- JPA学习笔记---JPA实体Bean的建立---链接上一个博文:对实体Bean中属性进行操作:保存日期类型,设置字段的长度,名字,是否为空,可以声明枚举字段;可以存放二进制数据,可以存放
- 找到一个可以下载开放式基金历史数据的网站
- C#学习笔记之从FTP服务器上传和下载数据(一)
- (一〇一)第七章编程练习(附①至⑦章学习笔记下载链接)
- 分享最近在学习的 TensorFlow 教程 | 提供资源下载链接
- 机器学习与深度学习的数据集
- 有学生提到,在大学选课的时候,可以写一个“刷课机”的程序,利用学校选课系统的弱点或漏洞,帮助某些人选到某些课程。或者帮助用户刷购票网站,先买到火车票。这些软件合法么?符合道德规范么?是在“软件工程”的研究范围么?