SparkContext: Putting Data in Business Context​

Apache Spark is an in-memory streaming and data-analytics framework which has taken the big data world by storm. In this talk we’ll cover what it is, its history, where its headed, and how to use it. Specifically we’ll hit on the following topics:

  • Spark basics: Data processing, Analytics, and Streaming
  • Using it to ingest, integrate, and query data
  • Test the Spark job against a larger data size
  • Set up a Spark job to run in scheduled, production fashion
  • Demos of real production use cases in the transportation industry​