Skip to content

Latest commit

 

History

History
43 lines (29 loc) · 1.11 KB

README.adoc

File metadata and controls

43 lines (29 loc) · 1.11 KB

k8s-spark-py

Build spark-py image for Kubernetes

Pre-requisites

# Retrieve the code
git clone https://github.com/astrolabsoftware/k8s-spark-py
cd k8s-spark-py

Eventually edit conf.sh to fine-tune the configuration

# Download and unzip Spark binaries
./prereq-install.sh

Build spark-py image for k8s

./build.sh

Push image to IN2P3 registry

# Log in IN2P3 registry
docker login gitlab-registry.in2p3.fr
./push-image.sh

Customize image

The goal is to remain closest from the standard build procedure documented here: https://spark.apache.org/docs/latest/running-on-kubernetes.html#docker-images

However, it is possible to customize the build by adding files inside the custom/ directory. This file will be copied to SPARK_HOME just before building the container images. Currenly avro, hbase and kafka java libraries are added in order to support fink-broker.

Automated build

The CI will automatically build and push spark-py container image inside IN2P3 registry for each commit to the git repository.