README

metadata-ingestion/docs/sources/starrocks/README.md

Overview

StarRocks is a high-performance analytical database that supports real-time, multi-dimensional analytics. It features a multi-catalog architecture that enables federated queries across internal tables and external data sources such as Hive, Iceberg, Hudi, and Delta Lake.

This integration extracts metadata for databases, tables, and views across all catalogs (internal and external). It also supports optional data profiling and stateful ingestion for automatic stale entity removal.
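As a rough illustration of how the optional features above might be switched on, the sketch below assembles a recipe as a plain Python dict. Every key name here is an assumption: `host_port`, `profiling.enabled`, and `stateful_ingestion.enabled` follow the conventions shared by DataHub's SQL-based sources, and the source type `starrocks` is inferred from the directory name. None of them are confirmed by this README, so check the source's configuration reference before relying on them. The host, credentials, and pipeline name are placeholders.

```python
from typing import Any, Dict


def build_recipe(
    host_port: str, username: str, password: str, profile: bool = False
) -> Dict[str, Any]:
    """Return a recipe dict. All key names are assumptions, not confirmed options."""
    return {
        # Stateful ingestion in DataHub tracks state per pipeline name.
        "pipeline_name": "starrocks_example",  # hypothetical name
        "source": {
            "type": "starrocks",  # assumed source type
            "config": {
                "host_port": host_port,
                "username": username,
                "password": password,
                # Optional data profiling (assumed key layout).
                "profiling": {"enabled": profile},
                # Stale entity removal (assumed key layout).
                "stateful_ingestion": {
                    "enabled": True,
                    "remove_stale_metadata": True,
                },
            },
        },
    }


# 9030 is the default MySQL-protocol query port of a StarRocks frontend.
recipe = build_recipe("localhost:9030", "datahub", "example-password", profile=True)
```

The dict has the same shape as a YAML recipe file, so it can be dumped to YAML or adapted once the real option names are verified.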

Concept Mapping

This ingestion source maps the following Source System Concepts to DataHub Concepts:

| Source Concept | DataHub Concept | Notes |
| --- | --- | --- |
| StarRocks | Data Platform | |
| Catalog | Container | Subtype Catalog |
| Database | Container | Subtype Database, child of Catalog |
| Table | Dataset | Subtype Table |
| View | Dataset | Subtype View |
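The mapping can be pictured with a small, self-contained sketch that takes a StarRocks object and derives its container chain, dataset subtype, and dataset URN. This is an illustration only, not the connector's code: `TableRef` and the helper functions are hypothetical, and the `catalog.database.table` dataset name is an assumption. Only the general `urn:li:dataset:(urn:li:dataPlatform:...,name,env)` shape is standard DataHub.

```python
from dataclasses import dataclass
from typing import List, Tuple

PLATFORM = "starrocks"  # the Data Platform row of the mapping


@dataclass(frozen=True)
class TableRef:
    """Hypothetical reference to a StarRocks table or view."""
    catalog: str
    database: str
    name: str
    is_view: bool = False


def container_path(ref: TableRef) -> List[Tuple[str, str]]:
    """Containers from outermost to innermost, as (subtype, qualified name)."""
    return [
        ("Catalog", ref.catalog),
        ("Database", f"{ref.catalog}.{ref.database}"),  # child of the Catalog
    ]


def dataset_subtype(ref: TableRef) -> str:
    """Tables and views both become Datasets, told apart by subtype."""
    return "View" if ref.is_view else "Table"


def dataset_urn(ref: TableRef, env: str = "PROD") -> str:
    """Build a dataset URN; the three-part name is an assumed convention."""
    name = f"{ref.catalog}.{ref.database}.{ref.name}"
    return f"urn:li:dataset:(urn:li:dataPlatform:{PLATFORM},{name},{env})"


# default_catalog is the name of StarRocks' built-in internal catalog.
orders = TableRef("default_catalog", "sales", "orders")
urn = dataset_urn(orders)
```

For `orders`, `container_path` yields a Catalog container followed by a Database container, matching the parent-child relationship in the Notes column.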