docs/automations/ai-term-suggestion.md
import FeatureAvailability from '@site/src/components/FeatureAvailability';
:::info
This feature is currently in closed beta. Reach out to your DataHub Cloud representative to get access.
:::
The AI Glossary Term Suggestion automation uses LLMs to suggest Glossary Terms for tables and columns in your data.
This is useful for improving coverage of glossary terms across your organization, which is important for compliance and governance efforts.
This automation can:
As of v0.3.15, you can customize how classification recommendations are generated by providing custom instructions. These are passed to the underlying AI model whenever it makes a classification decision for a Table or Column. This is useful if you want AI-generated classifications to follow specific guidelines or standards set by your organization.
To provide custom instructions, enter them in the Custom Instruction input when creating the AI classification automation.
Note that after updating instructions, it may take up to 5 minutes for the new instructions to take effect.
The automation scans every dataset matched by the configured filters and generates suggestions for each one. If new entities are added that match the configured filters, they are also classified within 24 hours.
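The scanning behavior can be pictured as a repeated pass over the catalog. The sketch below is purely illustrative and is not DataHub's implementation: `Dataset`, `scan`, `matches_filter`, `suggest_terms`, and the `already_processed` set are hypothetical names, the dataset identifiers are stand-ins, and skipping previously processed datasets is an assumption made so that the second pass shows only the newly added entity.

```python
from dataclasses import dataclass
from typing import Callable, Dict, Iterable, List, Set


@dataclass(frozen=True)
class Dataset:
    """Hypothetical stand-in for a dataset entity in the catalog."""
    name: str
    platform: str


def scan(
    datasets: Iterable[Dataset],
    matches_filter: Callable[[Dataset], bool],
    suggest_terms: Callable[[Dataset], List[str]],
    already_processed: Set[str],
) -> Dict[str, List[str]]:
    """Run one pass: generate suggestions for each matching dataset not yet seen."""
    suggestions: Dict[str, List[str]] = {}
    for ds in datasets:
        # Skip datasets outside the configured filters.
        if not matches_filter(ds):
            continue
        # Assumption: datasets handled in an earlier pass are not revisited.
        if ds.name in already_processed:
            continue
        suggestions[ds.name] = suggest_terms(ds)
        already_processed.add(ds.name)
    return suggestions


if __name__ == "__main__":
    catalog = [Dataset("orders", "snowflake"), Dataset("logs", "kafka")]
    seen: Set[str] = set()

    def is_snowflake(ds: Dataset) -> bool:
        return ds.platform == "snowflake"

    def stub_suggester(ds: Dataset) -> List[str]:
        # Placeholder for the model call; returns a fixed term for illustration.
        return ["PlaceholderTerm"]

    # First pass covers the datasets that currently match the filter.
    print(scan(catalog, is_snowflake, stub_suggester, seen))

    # A matching dataset added later is picked up by the next pass.
    catalog.append(Dataset("customers", "snowflake"))
    print(scan(catalog, is_snowflake, stub_suggester, seen))
```

In this sketch, each call to `scan` stands in for one scheduled run. The documented guarantee is only that new matching entities are classified within 24 hours, so the actual scheduling interval is not something the sketch tries to model.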
We take into account the following metadata when generating suggestions:
Data privacy: Your metadata is not sent to any third-party LLMs. We use Amazon Bedrock internally, which means all metadata remains within the DataHub Cloud AWS account. We do not fine-tune on customer data.