This paper provides an overview of how to use “big data” for social science research (with an emphasis on economics and finance). We investigate the performance and ease of use of different Spark applications running on a distributed file system, which enable the handling and analysis of data sets that were previously not usable due to their size. More specifically, we explain how to (i) explore big data sets that exceed the memory size of retail-grade computers...