doc/source/cluster/kubernetes/k8s-ecosystem/scheduler-plugins.md
(kuberay-scheduler-plugins)=
The kubernetes-sigs/scheduler-plugins repository provides out-of-tree scheduler plugins based on the scheduler framework.
Starting with KubeRay v1.4.0, KubeRay integrates with the PodGroup API provided by scheduler plugins to support gang scheduling for RayCluster custom resources.
kind create cluster --image=kindest/node:v1.26.0
Follow the installation guide in the scheduler-plugins repository to install the scheduler plugins.
:::{note}
There are two modes for installing the scheduler plugins: single scheduler mode and second scheduler mode.
KubeRay v1.4.0 only supports the single scheduler mode. You need to have the access to configure Kubernetes control plane to replace the default scheduler with the scheduler plugins.
:::
KubeRay v1.4.0 and later versions support scheduler plugins.
helm install kuberay-operator kuberay/kuberay-operator --version 1.5.1 --set batchScheduler.name=scheduler-plugins
# Configure the RayCluster with label `ray.io/gang-scheduling-enabled: "true"`
# to enable gang scheduling.
kubectl apply -f https://raw.githubusercontent.com/ray-project/kuberay/release-1.4/ray-operator/config/samples/ray-cluster.scheduler-plugins.yaml
Note that if you use "second scheduler mode," which KubeRay currently doesn't support, the following commands still show similar results. However, the Ray Pods don't get scheduled in a gang scheduling manner. Make sure to use "single scheduler mode" to enable gang scheduling.
kubectl get podgroups.scheduling.x-k8s.io
# NAME PHASE MINMEMBER RUNNING SUCCEEDED FAILED AGE
# test-podgroup-0 Running 3 3 2m25s
# All Ray Pods (1 head and 2 workers) belong to the same PodGroup.
kubectl get pods -L scheduling.x-k8s.io/pod-group
# NAME READY STATUS RESTARTS AGE POD-GROUP
# test-podgroup-0-head 1/1 Running 0 3m30s test-podgroup-0
# test-podgroup-0-worker-worker-4vc6j 1/1 Running 0 3m30s test-podgroup-0
# test-podgroup-0-worker-worker-ntm9f 1/1 Running 0 3m30s test-podgroup-0