List of Projects, Fall 2015 (In Progress)
Note that these are some of the student projects from one of the Big Data courses. They are provided for reference only.
Title | Data set | Software | Category |
---|---|---|---|
NIST Fingerprint (a subset of): NFIQ PCASYS MINDTCT BOZORTH3 NFSEG SIVV | NIST Special Database 27A [4GB] | NIST Biometric Image Software (NBIS) v5.0 [userguide] | Batch Data Analytics |
Hadoop Benchmark (each) - TeraSort Suite | Teragen | hadoop-examples.jar | Batch Data Analytics |
Hadoop Benchmark (each) - DFSIO (HDFS Performance) | | hadoop-mapreduce-client-jobclient | Batch Data Analytics |
Hadoop Benchmark (each) - NNBench (NameNode Perf.) | | hadoop-mapreduce-client-jobclient | Batch Data Analytics |
Hadoop Benchmark (each) - MRBench (MapReduce Perf.) | | src/test/org/apache/hadoop/mapred/MRBench.java | Batch Data Analytics |
Stock Data Analysis with MPI | CRSP | Stock Analysis | Streaming Data Analytics |

Id | Title | Technology* | User Interface Language | Backend Environment** | Dataset |
---|---|---|---|---|---|
1 | Time series visualization of stock data | MPI | Java | MPI, SLURM | CRSP US Stock from WRDS |
2 | San Francisco’s Most Dangerous Crime Areas | scikit-learn | Python | SQL (pandasql) | SF OpenData - SFPD Incidents |
3 | Houston, TX Crime Data Analysis 2014 | PostgreSQL | Python | SQL (Postgres) | Crime Statistics - City of Houston, Texas |
4 | Twitter Live Feed Analysis and Storage in MongoDB | IPython Notebook; seaborn | Python | NoSQL (MongoDB) | |
5 | Twitter Sentiment Analysis of US Presidential Election | Hadoop; Datumbox; Tableau | Python | Hadoop; HBase | |
6 | Amazon movie reviews | IPython Notebook; Spark | Python | Spark | Amazon Movie Data |
7 | Stock Market Analysis | scikit-learn; Hadoop | Python | Hadoop | WRDS CRSP data |
8 | Twitter User Data Analysis | MongoDB; D3; jQuery | JavaScript | NoSQL (MongoDB) | |
9 | Twitter Dataset Analysis and Modeling | MongoDB; pymongo | Python | NoSQL (MongoDB) | |