docs/provider-config/gcp-vertex-ai.mdx
GCP Vertex AI:
A fully managed service that provides access to leading generative AI models-such as Anthropic's Claude 3.5 Sonnet v2-through Google Cloud.
Learn more about GCP Vertex AI.
This guide is tailored for organizations with established GCP environments (leveraging IAM roles, service accounts, and best practices in resource management) to ensure secure and compliant usage.
Assign Required Roles:
roles/aiplatform.user)roles/aiplatform.serviceAgent) to enable certain operationsCross-Project Resource Access:
Vertex AI supports multiple regions. Select a region that meets your latency, compliance, and capacity needs. Examples include:
The Global endpoint may offer higher availability and reduce resource exhausted errors. Only Gemini models are supported.
us-east5)Install the Google Cloud CLI:
Follow the installation guide
Initialize and Authenticate:
gcloud init
gcloud auth application-default login
Restart VS Code:
Ensure VS Code is restarted so that the Cline extension picks up the new credentials
Create a Service Account:
Assign Roles:
roles/aiplatform.user)roles/aiplatform.serviceAgent)Generate a JSON Key:
Set the Environment Variable:
export GOOGLE_APPLICATION_CREDENTIALS="/path/to/your/service-account-key.json"
Restart VS Code:
Launch VS Code from a terminal where the GOOGLE_APPLICATION_CREDENTIALS variable is set
Model Observability Dashboard:
Quota Management:
Service Agents:
Be aware of the different service agents:
Cross-Project Access:
For resources in other projects (e.g., BigQuery, Cloud Storage), ensure that the appropriate roles (BigQuery Data Viewer, Storage Object Viewer) are assigned
By following these steps, your enterprise team can securely integrate GCP Vertex AI with the Cline VS Code extension to harness the power of Claude 3.5 Sonnet v2:
gcloud auth application-default login) or a service account with a JSON keyFor further details, please consult the GCP Vertex AI Documentation and your internal security policies.
Happy coding!
This guide will be updated as GCP Vertex AI and Cline evolve. Always refer to the latest documentation for current practices.