List of Project 2015 Fall (In Progress)

Note that these are some of students projects from one of Big Data courses. These are reference only.

2015 Fall Suggested Projects
Title Data set Software Category
NIST Fingerprint (a subset of): NFIQ PCASYS MINDTCT BOZORTH3 NFSEG SIVV NIST Special Database 27A [4GB] NIST Biometric Image Software (NBIS) v5.0 [userguide] Batch Data Analytics
Hadoop Benchmark (each) - TeraSort Suite Teragen hadoop-examples.jar Batch Data Analytics
Hadoop Benchmark (each) - DFSIO (HDFS Performance)   hadoop-mapreduce-client-jobclient Batch Data Analytics
Hadoop Benchmark (each) - NNBench (NameNode Perf.)   hadoop-mapreduce-client-jobclient Batch Data Analytics
Hadoop Benchmark (each) - MRBench (MapReduce Perf.)   src/test/org/apache/hadoop/mapred/MRBench.java Batch Data Analytics
Stock Data Analysis with MPI CRSP Stock Analysis Streaming Data Analytics
2015 Fall
Id Title Technology* User Interface Language Backend Environment** Dataset
1 Time series visualization of stock data MPI Java MPI, SLURM CRSP US Stock from WRDS
2 San Francisco’s Most Dangerous Crime Areas scikit-learn Python SQL (pandasql) SF OpenData - SFPD Incidents
3 Houston, TX Crime Data Analysis 2014 Postgresql Python SQL (Postgres) Crime Statistics - City of Houston, Texas
4 Twitter Live Feed Analysis and Storage in MongoDB IPython Notebook; seaborn Python NoSQL (MongoDB) Twitter
5 Twitter Sentiment Analysis of US Presidential Election Hadoop; Datumbox; Tableau Python Hadoop; HBase Twitter
6 Amazon movie reviews IPython Notebook; Spark Python Spark Amazon Movie Data
7 Stock Market Analysis scikit-learn;hadoop Python hadoop WRDS CRSP data
8 Twitter User Data Analysis MongoDB;D3;jQuery javascript NoSQL (MongoDB) Twitter
9 Twitter Dataset Analysis and Modeling MongoDB;pymongo Python NoSQL (MongoDB) Twitter