Skip to content

Latest commit

 

History

History
40 lines (31 loc) · 1.43 KB

README.md

File metadata and controls

40 lines (31 loc) · 1.43 KB

Code repository for O'reilly course : 'Integrating Hadoop and Spark'

Getting Started

You can clone this repository as follows

    $   git   clone   [email protected]:elephantscale/hadoop-spark.git

Lab Order

  1. Dev environment setup
  2. Hadoop setup
  3. Spark Shell
  4. RDDs
  5. Dataframes
  6. Hive and Spark
  7. Spark and YARN
  8. Spark Applications

Resources

Books

Sites

Vendors