integration/metrics/README.md
This directory contains some docker files to help you set up metrics tracking on an alluxio cluster. The scripts use OpenTelemetry(OTEL)'s auto instrumentation java agent to generate tracing information for GRPC and S3 calls, which are visualized using Jaeger and Prometheus.
For reference, please refer to documentations listed at opentelemetry GitHub page
docker-compose -f docker-compose-master.yaml up -d
This will run four services on this node, a Jaeger service to visualize traces, a Prometheus service to visualize metrics such as counters, an OTEL agent to collect traces and counters, and an OTEL collector to aggregate traces and counters from this node and other nodes.
MASTER_IP=xxx.xx.xx.xx docker-compose -f docker-compose-worker.yaml up -d
alluxio/conf directory. The jar can be found athttps://github.com/open-telemetry/opentelemetry-java-instrumentation/releases/latest/download/opentelemetry-javaagent-all.jar
conf/alluxio-env.sh and addALLUXIO_MASTER_JAVA_OPTS+=" -javaagent:./conf/opentelemetry-javaagent-all.jar \
-Dotel.resource.attributes=service.name=AlluxioMaster \
"
ALLUXIO_WORKER_JAVA_OPTS+=" -javaagent:./conf/opentelemetry-javaagent-all.jar \
-Dotel.resource.attributes=service.name=AlluxioWorker \
"
alluxio/conf dir to all nodes using alluxio utility.bin/alluxio copyDir conf
MASTER_IP:16686 for tracing and MASTER_IP:9090 for metricsdocker-compose -f docker-compose-master.yaml down
docker-compose -f docker-compose-worker.yaml down