doc/source/cluster/kubernetes/examples/mobilenet-rayservice.md
(kuberay-mobilenet-rayservice-example)=
Note: The Python files for the Ray Serve application and its client are in the repository ray-project/serve_config_examples.
kind create cluster --image=kindest/node:v1.26.0
Follow this document to install the latest stable KubeRay operator from the Helm repository.
Note that the YAML file in this example uses serveConfigV2. You need KubeRay version v0.6.0 or later to use this feature.
# Create a RayService
kubectl apply -f https://raw.githubusercontent.com/ray-project/kuberay/v1.5.1/ray-operator/config/samples/ray-service.mobilenet.yaml
tensorflow as a dependency. Hence, the YAML file uses rayproject/ray-ml image instead of rayproject/ray image.starlette.requests.form() needs python-multipart, so the YAML file includes python-multipart in the runtime environment.# Wait for the RayService to be ready to serve requests
kubectl describe rayservice/rayservice-mobilenet
# Conditions:
# Last Transition Time: 2025-02-13T02:29:26Z
# Message: Number of serve endpoints is greater than 0
# Observed Generation: 1
# Reason: NonZeroServeEndpoints
# Status: True
# Type: Ready
# Forward the port for Ray Serve service
kubectl port-forward svc/rayservice-mobilenet-serve-svc 8000
image_path in mobilenet_req.pyImageClassifier.
python mobilenet_req.py
# sample output: {"prediction":["n02099601","golden_retriever",0.17944198846817017]}