Back to Airflow

INSTALLING

INSTALLING.md

3.3.0b24.9 KB
Original Source
<!-- Licensed to the Apache Software Foundation (ASF) under one or more contributor license agreements. See the NOTICE file distributed with this work for additional information regarding copyright ownership. The ASF licenses this file to you under the Apache License, Version 2.0 (the "License"); you may not use this file except in compliance with the License. You may obtain a copy of the License at http://www.apache.org/licenses/LICENSE-2.0 Unless required by applicable law or agreed to in writing, software distributed under the License is distributed on an "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. See the License for the specific language governing permissions and limitations under the License. -->

Local Development Setup

This section outlines a recommended approach for setting up a local development environment for Apache Airflow on macOS and Linux, primarily using PyEnv for Python version management.

⚠️ Avoid using either system-installed Python or Python from Homebrew, as these versions are often labeled --externally-managed resulting in restricted dependency installation.

You can use other ways to install Python and Airflow. The Airflow development setup requires uv. If you want to set up a development environment, uv is the only supported local development environment setup because Airflow uses uv workspace extensively. See local virtualenv setup in contributing docs for details.

If you are just installing Airflow to run it locally, you can use other ways to set up your Python and virtual environment: uv is one option (refer to the uv documentation), but you can also use more traditional tools, such as pyenv. Note that installing Airflow with constraints is recommended, at least initially, because it makes the installation reproducible. See Installation from PyPI for more details.

✅ Setup using pyenv:

  1. Install pyenv (macOS and Linux):
bash
brew install pyenv

(Note: Homebrew is the recommended method on macOS. For Linux, you can typically install pyenv using the pyenv-installer script as detailed in the official documentation: https://github.com/pyenv/pyenv#installation.)

  1. Install Python:
bash
pyenv install 3.11.9
pyenv global 3.11.9
  1. Check Python version:
bash
python --version
  1. Create and Activate a Virtual Environment: Since Apache Airflow requires multiple dependencies, it's a good practice to isolate these dependencies in a virtual environment.
  • Create a virtual environment:
bash
python -m venv airflow_venv
  • Activate the virtual environment:
bash
source airflow_venv/bin/activate
  1. Install Apache Airflow: Apache Airflow is available on PyPI. To install it, you can use the following command in your terminal:
bash
pip install apache-airflow==3.1.8 --constraint "https://raw.githubusercontent.com/apache/airflow/constraints-3.1.8/constraints-3.11.txt"

Note that installing with constraints - at least initially - is recommended for reproducible installation. It might sometimes happen that 3rd-party distributions are released and their latest versions break airflow. Using constraints makes the installation reproducible with versions of dependencies that were "frozen" at the time of releasing airflow. Note you have to specify both - Airflow version and Python version you are using.

You can also specify additional extras - when you want to install airflow with additional providers:

bash
pip install apache-airflow[amazon,google]==3.1.8 --constraint "https://raw.githubusercontent.com/apache/airflow/constraints-3.1.8/constraints-3.11.txt"
  1. Set the AIRFLOW_HOME Environment Variable: Apache Airflow requires a directory to store configuration files, logs, and other data. Set the AIRFLOW_HOME variable to specify this directory.
  • Set the Airflow home directory:
bash
export AIRFLOW_HOME=~/airflow

Note: This command sets AIRFLOW_HOME for the current shell session only. To make it persistent, add the line to your shell profile, such as ~/.bashrc or ~/.zshrc.

  1. Run Airflow in standalone mode: Apache Airflow runs several components, such as the scheduler, web server, triggerer, and API server, to manage workflows and display the UI.
  • To run Airflow in standalone mode (which will automatically start the required components):
bash
airflow standalone
  1. Access the Airflow Web UI: Once the components are up and running, you can access the Airflow UI through your browser:
  • Open your browser and go to:
text
http://localhost:8080

Note: The airflow standalone command prints the generated username and password in the terminal on first run. Use these credentials to log in to the Airflow UI.