List of Datasets 2015 Fall¶
Note that these are some of students projects from one of Big Data courses. These are reference only.
| Dataset | Type | Source |
|---|---|---|
| Amazon Movie Data | Recommendation | link |
| ATP Tennis Data | link; link2 | |
| bts.gov | link | |
| CDC | link | |
| census.gov | ||
| China Foundations | link | |
| Chinese Statistical Yearbook | Public State/City/County/Government | link |
| City of Chicago | Public State/City/County/Government | link; link2 |
| Crime Statistics - City of Houston Texas | ||
| Crossfit | Sports | link |
| CRSP US Stock | Research | link |
| edmunds.com | Commercial | link |
| GDP | Public State/City/County/Government | link |
| Hubway Bike data | link; link2 | |
| Indiana Lidar from SDSC | link | |
| Kaggle.com | www.kaggle.com/c/titanic/data>`_ | |
| Lahman’s data | Sports | link |
| LIBOR Rates from St. Louis | link | |
| Movie reviews | link | |
| Movie Reviews - Rotten Tomatos | link | |
| noaa.gov | link | |
| PITCHfx | Sports | link |
| PubMed | link | |
| Retrosheet | Sports | link |
| SF OpenData - SFPD Incidents | Public State/City/County/Government | link |
| SNAP - Stanford Network Analysis Project | link | |
| State of Washington | Public State/City/County/Government | link |
| Statistical Computing | Public | link |
| Titanic survival data | link | |
| UCI Machine Learning Repository | link | |
| United Nation Population Division | Public State/City/County/Government | link |
| WRDS CRSP data | ||
| Yelp Dataset | Recommendation | link |