datahub-actions/src/datahub_actions/plugin/action/term/README.md
The Glossary Term Propagation Action allows you to propagate glossary terms from your assets into downstream entities.
Enable classification of datasets or field of datasets to propagate metadata to downstream datasets with minimum manual work.
Propagation can be controlled via a specified list of terms or a specified list of term groups.
Classification.Confidential (the default), if you apply Classification.Confidential term to a dataset (at the dataset level or a field-level), this action will find all the downstream datasets and apply the Classification.Confidential tag to them at the dataset level. Note that downstream application is only at the dataset level, regardless of whether the primary application was at the field level or the dataset level.PersonalInformation.Email inherits Classification.Confidential (the target term), and if you apply the PersonalInformation.Email term to a dataset (or a field in the dataset), it will be picked up by the action, and the PersonalInformation.Email term will be applied at the dataset level to all the downstream entities.The action supports propagation of term additions and removals.
You can control what the target term should be. Linkage to the target term is controlled through your business glossary which is completely under your control.
name: "term_propagation"
source:
type: "kafka"
config:
connection:
bootstrap: ${KAFKA_BOOTSTRAP_SERVER:-localhost:9092}
schema_registry_url: ${SCHEMA_REGISTRY_URL:-http://localhost:8081}
filter:
event_type: "EntityChangeEvent_v1"
action:
type: "term_propagation"
config:
target_terms:
- Classification
term_groups:
- "Personal Information"
datahub:
server: "http://localhost:8080"