Dataproc Clusters Source

About

The Dataproc Clusters source allows Toolbox to interact with Dataproc Clusters hosted on Google Cloud.

Available Tools

{{< list-tools >}}

Requirements

IAM Permissions

Dataproc uses Identity and Access Management (IAM) to control user and group access to Dataproc resources.

Toolbox uses your Application Default Credentials (ADC) to authenticate and authorize requests to Dataproc. Make sure the IAM identity associated with your ADC has the permissions required for the actions you intend to perform. Common roles include roles/dataproc.editor and roles/dataproc.viewer. Follow the Google Cloud guide to set up your ADC.
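
To check which local credential file ADC would pick up before starting Toolbox, a small script along these lines may help. It is a minimal sketch, not part of Toolbox: the `find_adc_file` helper is hypothetical, and it only mirrors the first two steps of the ADC search order (the `GOOGLE_APPLICATION_CREDENTIALS` environment variable, then the file written by `gcloud auth application-default login`). It does not probe the metadata server that supplies credentials on Google Cloud compute resources, and it does not verify IAM roles.

```python
import os
from pathlib import Path
from typing import Mapping, Optional


def find_adc_file(env: Mapping[str, str] = os.environ,
                  home: Optional[Path] = None) -> Optional[Path]:
    """Return the local ADC credential file that would be used, if any.

    Hypothetical helper for illustration. A None result does not mean ADC
    is unavailable: on Google Cloud the attached service account is the
    next fallback, and this sketch does not check it.
    """
    # Step 1: an explicit credential file named by the environment.
    explicit = env.get("GOOGLE_APPLICATION_CREDENTIALS")
    if explicit:
        path = Path(explicit)
        # Simplification: report a missing file as None instead of raising.
        return path if path.is_file() else None

    # Step 2: the file written by `gcloud auth application-default login`.
    if os.name == "nt":
        base = Path(env.get("APPDATA", "")) / "gcloud"
    else:
        base = (home or Path.home()) / ".config" / "gcloud"
    candidate = base / "application_default_credentials.json"
    return candidate if candidate.is_file() else None


if __name__ == "__main__":
    found = find_adc_file()
    print(found or "No local ADC file found; "
                   "try `gcloud auth application-default login`.")
```

Finding a file only shows where the credentials come from. The identity behind them still needs a suitable Dataproc role on the target project.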

Example

```yaml
kind: source
name: my-dataproc-source
type: dataproc
project: my-project
region: us-central1
```

Reference

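| field   | type   | required | description                                   |
|---------|--------|----------|-----------------------------------------------|
| type    | string | true     | Must be "dataproc".                           |
| project | string | true     | ID of the GCP project with Dataproc resources. |
| region  | string | true     | Region containing Dataproc resources.         |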
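
The field rules in this reference can also be expressed as a quick pre-flight check on a parsed configuration. The sketch below assumes the source entry has already been loaded into a Python dict; `validate_dataproc_source` is a hypothetical helper that covers only the three fields listed here, and it is not the validation Toolbox itself performs.

```python
from typing import Any, Dict, List

# Fields the reference lists as required strings.
REQUIRED_FIELDS = ("type", "project", "region")


def validate_dataproc_source(cfg: Dict[str, Any]) -> List[str]:
    """Return a list of problems found in a Dataproc source entry.

    Hypothetical helper for illustration; an empty list means the entry
    satisfies the rules in the reference.
    """
    errors: List[str] = []
    for field in REQUIRED_FIELDS:
        value = cfg.get(field)
        if not isinstance(value, str) or not value:
            errors.append(f"{field}: required non-empty string")

    # The type field has a single allowed value.
    source_type = cfg.get("type")
    if isinstance(source_type, str) and source_type and source_type != "dataproc":
        errors.append('type: must be "dataproc"')
    return errors


if __name__ == "__main__":
    example = {
        "kind": "source",
        "name": "my-dataproc-source",
        "type": "dataproc",
        "project": "my-project",
        "region": "us-central1",
    }
    print(validate_dataproc_source(example) or "config looks valid")
```

Returning a list of messages instead of raising on the first problem lets a caller report every misconfigured field in one pass.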