metadata-ingestion/docs/dev_guides/sql_profiles.md
SQL Profiling collects table level and column level statistics. The SQL-based profiler does not run alone, but rather can be enabled for other SQL-based sources. Enabling profiling will slow down ingestion runs.
:::caution
Running profiling against many tables or over many rows can run up significant costs. While we've done our best to limit the expensiveness of the queries the profiler runs, you should be prudent about the set of tables profiling is enabled on or the frequency of the profiling runs.
:::
Extracts:
SQL profiling is supported for all SQL sources. Check the individual source page to verify if it supports profiling.
DataHub is transitioning from Great Expectations (GE) based profiling to a custom SQLAlchemy profiler.
Two profiler implementations are available:
The SQLAlchemy profiler can be enabled via profile.method = "sqlalchemy":
source:
config:
profiling:
enabled: true
method: sqlalchemy