.. Licensed to the Apache Software Foundation (ASF) under one or more contributor license agreements. See the NOTICE file distributed with this work for additional information regarding copyright ownership. The ASF licenses this file to you under the Apache License, Version 2.0 (the "License"); you may not use this file except in compliance with the License. You may obtain a copy of the License at

.. http://www.apache.org/licenses/LICENSE-2.0

.. Unless required by applicable law or agreed to in writing, software distributed under the License is distributed on an "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. See the License for the specific language governing permissions and limitations under the License.

Autoscaling with KEDA

KEDA stands for Kubernetes Event Driven Autoscaling. KEDA <https://github.com/kedacore/keda>__ is a custom controller that allows users to create custom bindings to the Kubernetes Horizontal Pod Autoscaler <https://kubernetes.io/docs/tasks/run-application/horizontal-pod-autoscale/>__. The autoscaler will adjust the number of active Celery workers based on the number of tasks in queued or running state.

One advantage of KEDA is that it allows you to scale your application to/from 0 workers, meaning no workers are idle when there are no tasks.

KEDA Installation and usage

To install KEDA in your Kubernetes cluster, run the following commands:

.. code-block:: bash

helm repo add kedacore https://kedacore.github.io/charts helm repo update kubectl create namespace keda helm install keda kedacore/keda
--namespace keda
--version "v2.0.0"

To enable KEDA for the Airflow instance, it has to be enabled by setting workers.celery.keda.enabled=true in your Helm command or in the values.yaml like:

.. code-block:: bash

kubectl create namespace airflow helm repo add apache-airflow https://airflow.apache.org helm install airflow apache-airflow/airflow
--namespace airflow
--set executor=CeleryExecutor
--set workers.celery.keda.enabled=true

.. note::

Make sure values.yaml shows that either KEDA or HPA is enabled, but not both. It is recommended not to use both KEDA and HPA to scale the same workload. They will compete with each other resulting in odd scaling behavior.

After installation, the KEDA ScaledObject and an HPA will be created in the Airflow namespace.

In the default configuration, KEDA will derive the desired number of Celery workers by querying Airflow metadata database with following SQL statement:

.. code-block:: none

SELECT ceil(COUNT(*)::decimal / {{ .Values.config.celery.worker_concurrency }}) FROM task_instance WHERE (state='running' OR state='queued') AND queue IN <queue names>

where <queue names> is a list of queue names used by Celery worker queues <https://airflow.apache.org/docs/apache-airflow-providers-celery/stable/celery_executor.html#queues>_ mechanism (with default configuration it has one element default).

.. note::

Set Celery worker concurrency through the Helm Chart value config.celery.worker_concurrency (e.g. instead of airflow.cfg or environment variables), so that the KEDA trigger will be consistent with the worker concurrency setting.

Triggers (aka Scalers)

Triggers refer to the metrics (or formulae) that KEDA should refer to when scaling workers.

It is recommended to use multiple triggers within a ScaledObject, rather than creating different objects for different triggers. This keeps all your rules and formulae in one place, and it avoids multiple ScaledObjects being created by the same workload.

ScaledObject

To configure KEDA's triggers and scaling behaviors, you need to create a ScaledObject. Below ScaledObject parameters:

cooldownPeriod specifies the number of seconds to wait before downscaling to 0 workers, does not apply to downscaling to n workers while n >= 1.
idleReplicaCount can be set to any number less than minReplicaCount, but it must be set to 0, otherwise KEDA will not work. Change minReplicaCount to n > 0 if you need idle workers.

Triggerers value targetQueryValue is used as TargetValue of workers, which must be between ScaledObject minReplicaCount and maxReplicaCount values.

.. note::

To avoid strange behavior, best practice is to set cooldownPeriod to an integer slightly larger than terminationGracePeriodSeconds so that your cluster does not downscale to 0 workers before cleanup is finished.

Metrics

The HPA controller, refreshes metrics defined in triggers every --horizontal-pod-autoscaler-sync-period and the values are routed to KEDA Metrics Server directly. To reduce the load on the KEDA Scaler, you can set useCachedMetrics to true, to enabling reading metrics from cache first. Cache is updated periodically every pollingInterval.

.. note::

When number of workers = 0, KEDA will still poll for metrics using pollingInterval. When number of workers >= 1, both KEDA and the HPA will poll your defined triggers.

KEDA offers two metricTypes that provide more granular scaling control than the standard HPA Target metric:

AverageValue (default) controls a per-worker average.
Value controls total system load.