The `cassandra` module ingests metadata from Cassandra into DataHub. It is intended for production ingestion workflows; its module-specific capabilities are documented below.
Before running ingestion, ensure network connectivity to the source, valid authentication credentials, and read permissions for metadata APIs required by this module.
The module supports DataStax Astra DB and Cassandra Enterprise Edition (EE).
Set Up User Credentials:

Permissions:

The account used for ingestion needs `SELECT` permissions that allow it to:

- Read metadata in the system keyspaces (for example, `system_schema`) to retrieve information about keyspaces, tables, columns, and views.
- Run `SELECT` operations on the data tables if data profiling is enabled.

Verify Database Access:
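One way to confirm the read access described above is to run a cheap query against each schema table before starting ingestion. The following is a minimal sketch, not part of the DataHub module: the function names are hypothetical, and `session` is assumed to be any object with an `execute(query)` method, such as a `Session` from the DataStax Python driver. The table names assume Cassandra 3.0 or later, where schema metadata lives in `system_schema`.

```python
from typing import Any, Dict, List

# Schema tables in the `system_schema` keyspace (assumes Cassandra 3.0+).
SCHEMA_TABLES = ["keyspaces", "tables", "columns", "views"]


def metadata_check_queries() -> List[str]:
    """Return one single-row read query per schema table."""
    return [f"SELECT * FROM system_schema.{name} LIMIT 1" for name in SCHEMA_TABLES]


def check_metadata_access(session: Any) -> Dict[str, str]:
    """Run each check query and record "ok" or the failure message.

    `session` only needs an `execute(query)` method; it is an assumption
    of this sketch that you pass in an already connected driver session.
    """
    results: Dict[str, str] = {}
    for query in metadata_check_queries():
        try:
            session.execute(query)
            results[query] = "ok"
        except Exception as exc:  # e.g. an authorization error from the driver
            results[query] = f"failed: {exc}"
    return results
```

Any entry that is not `"ok"` points at a schema table the ingestion account cannot read.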
:::caution
When enabling profiling, make sure to set a limit on the number of rows to sample. Profiling large tables without a limit may lead to excessive resource consumption and slow performance.
:::
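To illustrate why a row limit keeps profiling bounded, the sketch below builds a sampling query with an explicit `LIMIT`. It is an illustration only and does not show how the module builds its own queries; `build_sample_query` and its parameters are hypothetical names, and the identifier check assumes unquoted CQL identifiers.

```python
import re

# Unquoted CQL identifiers: a letter followed by letters, digits, or underscores.
_IDENTIFIER = re.compile(r"^[A-Za-z][A-Za-z0-9_]*$")


def build_sample_query(keyspace: str, table: str, max_rows: int) -> str:
    """Build a SELECT that reads at most `max_rows` rows from one table."""
    if max_rows <= 0:
        raise ValueError("max_rows must be a positive integer")
    for name in (keyspace, table):
        # Reject anything that is not a plain identifier to avoid CQL injection.
        if not _IDENTIFIER.match(name):
            raise ValueError(f"unsupported identifier: {name!r}")
    return f"SELECT * FROM {keyspace}.{table} LIMIT {max_rows}"
```

With a limit in place, the cost of sampling a table is capped by `max_rows` rather than growing with the table size.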
:::note
For cloud configuration with Astra DB, you must specify the Secure Connect Bundle path in the configuration. For that reason, use the CLI to ingest metadata into DataHub.
:::
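Because the Astra DB setup depends on a local bundle file, it can help to check the path before invoking the CLI. The sketch below assumes the Secure Connect Bundle is the zip archive downloaded from the Astra DB console and that the recipe is run with `datahub ingest -c`. The helper names and the example paths are hypothetical.

```python
import zipfile
from pathlib import Path
from typing import List


def check_bundle(bundle_path: str) -> Path:
    """Return the bundle path if it exists and is a zip archive."""
    path = Path(bundle_path).expanduser()
    if not path.is_file():
        raise FileNotFoundError(f"Secure Connect Bundle not found: {path}")
    if not zipfile.is_zipfile(path):
        raise ValueError(f"not a zip archive: {path}")
    return path


def build_ingest_command(recipe_path: str) -> List[str]:
    """Build the DataHub CLI invocation for a recipe file."""
    return ["datahub", "ingest", "-c", recipe_path]
```

A typical use would be to call `check_bundle("~/secure-connect-mydb.zip")` and then pass `build_ingest_command("recipe.yml")` to `subprocess.run(..., check=True)`; both paths here are placeholders.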