Apache Spark is an open-source cluster computing system that aims to make data analytics fast, both to run and to write. BDAS, the Berkeley Data Analytics Stack, is an open-source software stack that integrates software components being bu…