doc/source/serve/llm/examples.md
Production examples for deploying LLMs with Ray Serve.
Complete end-to-end tutorials for deploying different types of LLMs:
Deploy a small-sized LLM <../../_collections/serve/tutorials/deployment-serve-llm/small-size-llm/README>Deploy a medium-sized LLM <../../_collections/serve/tutorials/deployment-serve-llm/medium-size-llm/README>Deploy a large-sized LLM <../../_collections/serve/tutorials/deployment-serve-llm/large-size-llm/README>Deploy a vision LLM <../../_collections/serve/tutorials/deployment-serve-llm/vision-llm/README>Deploy a reasoning LLM <../../_collections/serve/tutorials/deployment-serve-llm/reasoning-llm/README>Deploy a hybrid reasoning LLM <../../_collections/serve/tutorials/deployment-serve-llm/hybrid-reasoning-llm/README>Deploy gpt-oss <../../_collections/serve/tutorials/deployment-serve-llm/gpt-oss/README>:hidden:
../../_collections/serve/tutorials/deployment-serve-llm/small-size-llm/README
../../_collections/serve/tutorials/deployment-serve-llm/medium-size-llm/README
../../_collections/serve/tutorials/deployment-serve-llm/large-size-llm/README
../../_collections/serve/tutorials/deployment-serve-llm/vision-llm/README
../../_collections/serve/tutorials/deployment-serve-llm/reasoning-llm/README
../../_collections/serve/tutorials/deployment-serve-llm/hybrid-reasoning-llm/README
../../_collections/serve/tutorials/deployment-serve-llm/gpt-oss/README