Paper Submitted to BIG DATA 2016 for Review: Source Code and Dataset For Public Release