Back to Datahub

DataHub Init CLI - Agent Context

metadata-ingestion/src/datahub/cli/resources/INIT_AGENT_CONTEXT.md

1.6.03.2 KB
Original Source

DataHub Init CLI - Agent Context

Best practices for AI agents using datahub init to set up authentication.

What it does

Writes ~/.datahubenv with the GMS URL and an access token. Run once before using any other CLI commands that require authentication.

Quickstart (local instance)

bash
# Default credentials on localhost — no --host, no --force needed
datahub init --username datahub --password datahub

Non-interactive signals (credentials via flags or env vars) trigger all agent-friendly defaults automatically — no extra flags required.

Key defaults

SituationDefault behaviour
--host omitted + credentials providedsilently uses http://localhost:8080
--token-duration omitted, localhostONE_MONTH
--token-duration omitted, remote hostONE_HOUR
Config file already exists, non-TTYsilently overwrites (no prompt)
Config file already exists, TTYprompts for confirmation

Common scenarios

bash
# Local instance — minimal form
datahub init --username datahub --password datahub

# Local instance — explicit duration override
datahub init --username datahub --password datahub --token-duration NO_EXPIRY

# Remote instance — always pass --host explicitly
datahub init --host https://your-instance.acryl.io/gms --username alice --password secret

# Already have a token — skip credential exchange
datahub init --host https://your-instance.acryl.io/gms --token <your-token>

# CI/CD — fully non-interactive via env vars
export DATAHUB_GMS_URL=https://prod.example.com/gms
export DATAHUB_GMS_TOKEN=<your-token>
datahub init

SSO browser login

For DataHub instances using SSO (OIDC/SAML), use --sso to authenticate via browser:

bash
# Opens browser — complete SSO, CLI captures session and generates token
datahub init --sso --host https://your-instance.example.com/gms

# Custom token duration
datahub init --sso --host https://your-instance.example.com/gms --token-duration ONE_MONTH

Prerequisites (one-time setup):

bash
pip install 'acryl-datahub[sso]'   # or: uv pip install 'acryl-datahub[sso]'
playwright install chromium

--sso is mutually exclusive with --token, --username, and --password. If Playwright is not installed, the command prints step-by-step install instructions and exits.

Support login (DataHub Cloud)

For the support team debugging customer instances, add --support to use the /support/authenticate login path:

bash
datahub init --sso --support --host https://customer.acryl.io/gms

--support requires --sso.

Environment variables

VariableCLI equivalent
DATAHUB_GMS_URL--host
DATAHUB_GMS_TOKEN--token
DATAHUB_USERNAME--username
DATAHUB_PASSWORD--password

CLI flags take precedence over environment variables.

Available token durations

ONE_HOUR, ONE_DAY, ONE_WEEK, ONE_MONTH, THREE_MONTHS, SIX_MONTHS, ONE_YEAR, NO_EXPIRY