documentation/docs/advanced/gcp-vertex.md
:::info This is only helpful for self-hosted users. If you're using Khoj Cloud, you can directly use any of the pre-configured AI models. :::
Khoj can use Google's Gemini and Anthropic's Claude family of AI models from Vertex AI on Google Cloud. Explore Anthropic and Gemini AI models available on Vertex AI's Model Garden.
base64 -i <service_account_credentials_keyfile.json>Google Vertex (or whatever friendly name you prefer).base64 encoded json keyfile from step 2.https://{MODEL_GCP_REGION}-aiplatform.googleapis.com/v1/projects/{YOUR_GCP_PROJECT_ID}
us-east5 works for Claude.claude-3-7-sonnet@20250219. Any Claude or Gemini model on Vertex's Model Garden should work.Anthropic or Google60000 (replace with the max prompt size of your model)Permission Denied?
Ensure your service account has the Vertex AI User role and that the API is enabled in your GCP project.
Region Errors?
Double-check that the model you're trying to use is supported in your selected region. Some Claude or Gemini models are restricted to specific zones like us-east5 or us-central1.
Prompt Size Limitations The "Max prompt size" should align with the limits defined in the model documentation. Exceeding it can silently fail or truncate inputs.
Testing the API Key Before adding it to Khoj, you can verify that your key works by making a simple curl request to Vertex AI. This helps debug auth issues early.
Use Environment Variables For better security, consider using environment variables to manage sensitive keys and inject them at runtime during base64 encoding.
If you encounter any issues, the Khoj Discord is a great place to ask for help!