docs/source/en/main_classes/continuous_batching.md
This page documents the classes behind continuous batching inference: submitting prompts, configuring scheduling and memory limits, and retrieving results.
For usage examples, see the Continuous batching guide. For how scheduling and memory interact, see the Continuous batching architecture doc.
[[autodoc]] ContinuousMixin.generate_batch
[[autodoc]] ContinuousBatchingManager
[[autodoc]] ContinuousBatchingConfig - call