Back to Charts

⚠️ Repo Archive Notice

stable/zeppelin/README.md

latest3.9 KB
Original Source

⚠️ Repo Archive Notice

As of Nov 13, 2020, charts in this repo will no longer be updated. For more information, see the Helm Charts Deprecation and Archive Notice, and Update.

Zeppelin Chart

Zeppelin is a web based notebook for interactive data analytics with Spark, SQL and Scala.

DEPRECATION NOTICE

This chart is deprecated and no longer supported.

Chart Details

Installing the Chart

To install the chart:

$ helm install stable/zeppelin

Configuration

The following table lists the configurable parameters of the Zeppelin chart and their default values.

ParameterDescriptionDefault
zeppelin.imageZeppelin imagedylanmei/zeppelin:{VERSION}
zeppelin.resourcesResource limits and requestslimits.memory=4096Mi, limits.cpu=2000m
spark.driverMemoryMemory used by Spark driver (Java notation)1g
spark.executorMemoryMemory used by Spark executors (Java notation)1g
spark.numExecutorsNumber of Spark executors2
hadoop.useConfigMapUse external Hadoop configuration for Spark executorsfalse
hadoop.configMapNameName of the hadoop config map to use (must be in same namespace)hadoop-config
hadoop.configPathPath in the Zeppelin image where the Hadoop config is mounted/usr/hadoop-2.7.3/etc/hadoop
ingress.enabledEnable ingressfalse
ingress.annotationsIngress annotations{}
ingress.hostsIngress Hostnames["zeppelin.local"]
ingress.pathPath within the URL structure/
ingress.tlsIngress TLS configuration[]
nodeSelecorNode selector for the Zeppelin deployment{}

The Hadoop chart can be used to create a YARN cluster where Spark jobs are executed:

helm install -n hadoop stable/hadoop
helm install --set hadoop.useConfigMap=true,hadoop.configMapName=hadoop-hadoop stable/zeppelin

Note that you may also want to set the spark.numExecutors value to match the number of yarn NodeManager replicas and the executorMemory value to half of the NodeManager memory limit.