Back to Genai Toolbox

dataplex-get-data-profile

docs/en/integrations/knowledge-catalog/tools/knowledge-catalog-get-data-profile.md

1.5.02.5 KB
Original Source

About

A dataplex-get-data-profile tool retrieves the results of a completed Data Profile scan.

WARNING: You must verify the execution run has succeeded (via dataplex-get-run-status) before calling this tool, otherwise the results will be empty. CRITICAL: Access the results via the nested public fields dataProfileResult.profile.fields inside the returned DataScan.

Compatible Sources

{{< compatible-sources >}}

Requirements

IAM Permissions

Knowledge Catalog uses Identity and Access Management (IAM) to control user and group access to Knowledge Catalog resources. Toolbox will use your Application Default Credentials (ADC) to authorize and authenticate when interacting with [Knowledge Catalog][dataplex-docs].

In addition to setting the ADC for your server, you need to ensure the IAM identity has been given the correct IAM permissions for the tasks you intend to perform. See Knowledge Catalog IAM permissions and Knowledge Catalog IAM roles for more information on applying IAM permissions and roles to an identity.

Parameters

The dataplex-get-data-profile tool accepts the following parameters:

fieldtyperequireddescription
scanIdstringtrueThe unique ID of the Dataplex profile scan (e.g. nq-prof-12345).
locationstringtrueThe Google Cloud region where the scan was created (e.g. us-central1).

Example

yaml
kind: tool
name: get_data_profile
type: dataplex-get-data-profile
source: my-dataplex-source
description: Fetch results of a completed data profile scan.

Reference

fieldtyperequireddescription
typestringtrueMust be "dataplex-get-data-profile".
sourcestringtrueName of the source the tool should execute on.
descriptionstringtrueDescription of the tool that is passed to the LLM.