Skip to content

Latest commit

 

History

History
27 lines (20 loc) · 1.29 KB

README.md

File metadata and controls

27 lines (20 loc) · 1.29 KB

Docker-Spark-HDFS-Kafka

Repository containing docker-compose files for quickly setting up a docker cluster consisting of Spark, HDFS, Kafka.

Requirement

Docker Desktop must be preinstalled.

How to deploy

Navigate to the folder containing docker-compose.yml file, type docker-compose up -d.

Services available at:

Connect Scrapy Item Pipeline to Kafka Cluster

Please refer to os-scrapy-kafka-pipeline.

Default kafka brokers' addresses are ["192.168.1.5:9092","192.168.1.5:9093","192.168.1.5:9094"] (configured in docker-compose.yml).

Credits

  1. big-data-europe/docker-hadoop
  2. wurstmeister/kafka-docker
  3. cluster-apps-on-docker/spark-standalone-cluster-on-docker
  4. elasticsearch

Other Dockers

docker-compose.yml files for deploying elasticsearch & kibana cluster, as well as other individual clusters are also provided.