您的位置:首页 > 其它

学习机器学习 数据处理时 找到的这些链接 可以在上面下载到开源的研究数据数据

2013-12-07 14:40 501 查看
美国政府数据 http://www.data.gov/

Movies Recommendation:
MovieLens - Movie Recommendation Data Sets http://www.grouplens.org/node/73

Yahoo! - Movie, Music, and Images Ratings Data Sets http://webscope.sandbox.yahoo.com/catalog.php?datatype=r

Jester - Movie Ratings Data Sets (Collaborative Filtering Dataset) http://www.ieor.berkeley.edu/~goldberg/jester-data/

Cornell University - Movie-review data for use in sentiment-analysis experiments http://www.cs.cornell.edu/people/pabo/movie-review-data/

Music Recommendation:
Last.fm - Music Recommendation Data Sets http://www.dtic.upf.edu/~ocelma/MusicRecommendationDataset/index.html

Yahoo! - Movie, Music, and Images Ratings Data Sets http://webscope.sandbox.yahoo.com/catalog.php?datatype=r

Audioscrobbler - Music Recommendation Data Sets http://www-etud.iro.umontreal.ca/~bergstrj/audioscrobbler_data.html

Amazon - Audio CD recommendations http://131.193.40.52/data/

Books Recommendation:
Institut für Informatik, Universitt Freiburg - Book Ratings Data Sets http://www.informatik.uni-freiburg.de/~cziegler/BX/

Food Recommendation:
Chicago Entree - Food Ratings Data Sets http://archive.ics.uci.edu/ml/datasets/Entree+Chicago+Recommendation+Data

Merchandise Recommendation:
Amazon - Product Recommendation Data Sets http://131.193.40.52/data/

Healthcare Recommendation:
Nursing Home - Provider Ratings Data Set http://data.medicare.gov/dataset/Nursing-Home-Compare-Provider-Ratings/mufm-vy8d

Hospital Ratings - Survey of Patients Hospital Experiences http://data.medicare.gov/dataset/Survey-of-Patients-Hospital-Experiences-HCAHPS-/rj76-22dk

Dating Recommendation:
www.libimseti.cz - Dating website recommendation (collaborative filtering) http://www.occamslab.com/petricek/data/

Scholarly Paper Recommendation:
National University of Singapore - Scholarly Paper Recommendation http://www.comp.nus.edu.sg/~sugiyama/SchPaperRecData.html

Information Network

DBLP http://www.informatik.uni-trier.de/~ley/db/

proximity DBLP http://kdl.cs.umass.edu/data/dblp/dblp-info.html

DBLP-Citation-Network http://arnetminer.org/citation

KDD-2011 http://www.cs.uiuc.edu/~hbdeng/data/kdd2011.htm

CiteSeer (hardly) http://csxstatic.ist.psu.edu/about/data

CiteSeer dumped http://martinharrigan.blogspot.com/2008/07/citeseers-dataset.html

Cora (hardly) http://people.cs.umass.edu/~mccallum/data.html

IMDB http://www.imdb.com/interfaces/

Social Network

Stanford large network dataset (contains lots of network dataset): http://snap.stanford.edu/data/

Stanford class resources http://snap.stanford.edu/na09/resources.html

ICWSM twitter dataset: http://twitter.mpi-sws.org/data-icwsm2010.html

EBSN - Event-based social network dataset: http://www.largenetwork.org/ebsn

Other social network dataset: Slashdot, Enron email, Mit mobile, Epinions reviews.

Sentiment and Option Mining

MPQA http://www.cs.pitt.edu/mpqa/index.html

Bing Liu's homepage

Movie Review http://www.cs.cornell.edu/people/pabo/movie-review-data/

Lee's homepage

twitter sentiment: http://www.sananalytics.com/lab/twitter-sentiment/

Recommendation

index1: https://gist.github.com/1653794

index2: http://mobblog.cs.ucl.ac.uk/datasets/

Machine Learning

UCI dataset http://archive.ics.uci.edu/ml/datasets.html

Audio Retrieval

CAL-500: http://twitterdata.org/

Million song dataset http://labrosa.ee.columbia.edu/millionsong/

Miscellaneous1

A lot graph dataset including several cups, twitter etc http://graphlab.org/downloads/datasets/

Several graph dataset http://law.di.unimi.it/datasets.php

Delicious/Flikr/Last.FM etc http://www.tagora-project.eu/data/

A small dataset about links http://www.cs.umd.edu/projects/linqs/projects/lbc/index.html

A small dataset including citeseerx/imdb http://komarix.org/ac/ds/

Miscellaneous2

Only user-object
Amazon

Both user-user and user-object
single-type user netwrok
Flickr, Youtube, twitter

signed user network
Epinion, Slashdot, Ciao

Multi-type user network
Facebook, Google plus
内容来自用户分享和网络整理,不保证内容的准确性,如有侵权内容,可联系管理员处理 点击这里给我发消息
相关文章推荐