Purpose

This docker image provides a local spark installation with zeppelin and a running spark-history-server. It is uploaded in dockerhub in a public repository.

I use it to evaluate independently spark code in a more convenient way then a spark-shell.

Components

Spark version="2.4.3"
Zeppelin version="0.8.1"
Hadoop version="2.7"

Start the container

  docker run -it -p 18080:18080 -p 8088:8080 -d mirkoprescha/spark-zeppelin-docker

Open Zeppelin and Spark History Server

In your local browser

Zeppelin: http://localhost:8088/#/
Spark History Server: http://localhost:18080/?showIncomplete=true

Probably, you have to wait roughly 10 second until zeppelin daemon has been started, right after starting the container.

Spark-App

Copy your spark jar to docker container

Start another shell session and copy the jar-file into the docker container. Following command copies it into your latest started container.

docker cp <your-jar-file.jar> $(docker ps  -l -q):/work/

Run spark job

Go back to container session. You should be connected as root in the docker container:

cd /work
spark-submit   --class <your-class-name-with-package> \
      <your-jar-file.jar> \
      [<your-program-parameters>]

Changes in dockerfile

After changes in Dockerfile goto project home dir and run

docker build  -t mirkoprescha/spark-zeppelin-docker .

This repo is connected to an automated build in docker hub, so the following no push to docker hub is not required.

docker push  mirkoprescha/spark-zeppelin-docker

Misc

if container session is lost

docker start CONTAINER_ID
docker attach CONTAINER_ID

run as background session

add - d

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
Dockerfile		Dockerfile
README.md		README.md
spark-defaults.conf		spark-defaults.conf

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Purpose

Components

Start the container

Open Zeppelin and Spark History Server

Spark-App

Copy your spark jar to docker container

Run spark job

Changes in dockerfile

Misc

if container session is lost

run as background session

About

Releases

Packages

Languages

mirkoprescha/spark-zeppelin-docker

Folders and files

Latest commit

History

Repository files navigation

Purpose

Components

Start the container

Open Zeppelin and Spark History Server

Spark-App

Copy your spark jar to docker container

Run spark job

Changes in dockerfile

Misc

if container session is lost

run as background session

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages