Back to Genai Toolbox

Serverless for Apache Spark Source

docs/en/integrations/serverless-spark/source.md

1.1.01.8 KB
Original Source

About

The Serverless for Apache Spark source allows Toolbox to interact with Spark batches hosted on Google Cloud Serverless for Apache Spark.

Available Tools

{{< list-tools >}}

Requirements

IAM Permissions

Serverless for Apache Spark uses Identity and Access Management (IAM) to control user and group access to serverless Spark resources like batches and sessions.

Toolbox will use your Application Default Credentials (ADC) to authorize and authenticate when interacting with Google Cloud Serverless for Apache Spark. When using this method, you need to ensure the IAM identity associated with your ADC has the correct permissions for the actions you intend to perform. Common roles include roles/dataproc.serverlessEditor (which includes permissions to run batches) or roles/dataproc.serverlessViewer. Follow this guide to set up your ADC.

Example

yaml
kind: source
name: my-serverless-spark-source
type: serverless-spark
project: my-project-id
location: us-central1

Reference

fieldtyperequireddescription
typestringtrueMust be "serverless-spark".
projectstringtrueID of the GCP project with Serverless for Apache Spark resources.
locationstringtrueLocation containing Serverless for Apache Spark resources.