metadata-ingestion/docs/sources/fivetran/fivetran_pre.md
The fivetran module ingests metadata from Fivetran into DataHub. It is intended for production ingestion workflows; module-specific capabilities are documented below.
This source extracts the following:
Prerequisites:
Before running ingestion, ensure network connectivity to the source, valid authentication credentials, and read permissions for metadata APIs required by this module.
To use the Fivetran REST API integration, you need:
Required API Permissions: GET /v1/connections/{connection_id}
The Fivetran REST API configuration is required for Google Sheets connectors and optional for other use cases. It provides access to connection details that aren't available in the Platform Connector logs.
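To check that a key and secret can read connection details, one option is to call the endpoint named above directly. The following is a minimal sketch, not the code the Fivetran source itself runs: the function name `build_connection_request` and the connection ID are hypothetical, and it assumes the Fivetran REST API accepts HTTP Basic authentication built from the API key and secret. The request is only constructed here, not sent.

```python
import base64
import urllib.request


def build_connection_request(api_key, api_secret, connection_id,
                             base_url="https://api.fivetran.com"):
    """Build (but do not send) a GET request for one connection's details.

    Hypothetical helper for illustration; not part of DataHub or Fivetran.
    """
    # Assumption: HTTP Basic auth, i.e. base64("<api_key>:<api_secret>").
    raw_credentials = f"{api_key}:{api_secret}".encode("utf-8")
    token = base64.b64encode(raw_credentials).decode("ascii")

    # Endpoint taken from the required permission listed above.
    url = f"{base_url.rstrip('/')}/v1/connections/{connection_id}"

    request = urllib.request.Request(url, method="GET")
    request.add_header("Authorization", f"Basic {token}")
    request.add_header("Accept", "application/json")
    return request


# "my_connection_id" is a placeholder, not a real connection.
request = build_connection_request("your_api_key", "your_api_secret", "my_connection_id")
print(request.full_url)
```

Sending the request with `urllib.request.urlopen(request, timeout=30)` and receiving a successful response would suggest the credentials have the read access this module needs; an HTTP error would point to a credential or permission problem.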
To obtain API credentials:
api_config:
  api_key: "your_api_key"
  api_secret: "your_api_secret"
  base_url: "https://api.fivetran.com" # Optional, defaults to this
  request_timeout_sec: 30 # Optional, defaults to 30 seconds
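The required and optional fields of `api_config` can be pictured as a small typed structure. The sketch below is an illustration only: `ApiConfigSketch` and `load_api_config` are hypothetical names, not DataHub's own configuration classes, and the positive-timeout check is an added assumption. The two defaults are the ones stated in the configuration comments above.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ApiConfigSketch:
    """Hypothetical stand-in for the api_config block; not DataHub's class."""

    api_key: str
    api_secret: str
    base_url: str = "https://api.fivetran.com"  # documented default
    request_timeout_sec: int = 30               # documented default

    def __post_init__(self):
        # Both credentials are required by the documentation above.
        if not self.api_key or not self.api_secret:
            raise ValueError("api_key and api_secret are both required")
        # Assumption for this sketch: a timeout must be a positive number.
        if self.request_timeout_sec <= 0:
            raise ValueError("request_timeout_sec must be positive")


def load_api_config(recipe_fragment):
    """Build the sketch from a dict shaped like the parsed api_config YAML."""
    return ApiConfigSketch(**recipe_fragment)


config = load_api_config({"api_key": "your_api_key", "api_secret": "your_api_secret"})
print(config.base_url, config.request_timeout_sec)
```

Omitting `base_url` and `request_timeout_sec`, as in the usage line, falls back to the documented defaults.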
Google Sheets connectors require special handling because Google Sheets is not yet natively supported as a DataHub source. As a workaround, the Fivetran source creates Dataset entities for Google Sheets and includes them in the lineage.
The Fivetran REST API configuration (api_config) is required for Google Sheets connectors.
For each Google Sheets connector, two Dataset entities are created:
Google Sheet Dataset: Represents the entire Google Sheet
  Platform: google_sheets
  Subtype: GOOGLE_SHEETS
Named Range Dataset: Represents the specific named range being synced
  Platform: google_sheets
  Subtype: GOOGLE_SHEETS_NAMED_RANGE
source:
  type: fivetran
  config:
    # Required for Google Sheets connectors
    api_config:
      api_key: "your_api_key"
      api_secret: "your_api_secret"
    # ... other configuration ...
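The two Dataset entities described for each Google Sheets connector could be pictured as a pair of DataHub dataset URNs on the `google_sheets` platform. This is a rough sketch under stated assumptions: the helper names, the sheet ID, the named range, and especially the way the dataset names are composed are hypothetical, since this document does not say how the Fivetran source names them. Only the platform, the two subtypes, and the general DataHub dataset URN shape are taken as given.

```python
def make_dataset_urn(platform, name, env="PROD"):
    """Format a DataHub dataset URN from its three parts."""
    return f"urn:li:dataset:(urn:li:dataPlatform:{platform},{name},{env})"


def google_sheets_entities(sheet_id, named_range):
    """Return the two illustrative entities for one Google Sheets connector.

    Hypothetical naming: the sheet uses its ID as the dataset name and the
    named range appends the range name. The real source may differ.
    """
    sheet_name = sheet_id
    range_name = f"{sheet_id}.{named_range}"
    return [
        {
            "urn": make_dataset_urn("google_sheets", sheet_name),
            "subtype": "GOOGLE_SHEETS",
        },
        {
            "urn": make_dataset_urn("google_sheets", range_name),
            "subtype": "GOOGLE_SHEETS_NAMED_RANGE",
        },
    ]


# Placeholder identifiers, not real sheet or range names.
for entity in google_sheets_entities("example_sheet_id", "example_range"):
    print(entity["subtype"], entity["urn"])
```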