Using Google Gemini with AutoGPT

This guide covers integrating Google Gemini models with AutoGPT using OpenRouter.

Prerequisites

Make sure you have completed the AutoGPT Setup Guide and have AutoGPT running locally at http://localhost:3000.
You have an OpenRouter API key from OpenRouter.

Getting Your API Key

AutoGPT routes all Gemini models through OpenRouter. You need an OpenRouter API key:

Visit OpenRouter Keys
Sign in or create an account
Click "Create Key"
Copy the generated key for use in AutoGPT

Setup Steps

1. Start AutoGPT Locally

Follow the official guide: AutoGPT Getting Started Guide

Ensure AutoGPT is running and accessible at: http://localhost:3000

2. Open the Visual Builder

Open your browser and navigate to: http://localhost:3000/build

Or click "Build" in the navigation bar.

3. Add an AI Text Generator Block

Click the "Blocks" button on the left sidebar.
In the search bar, type AI Text Generator.
Drag the block into the canvas.

4. Select a Gemini Model

Click the AI Text Generator block to configure it.

In the LLM Model dropdown, select one of the available Gemini models:

Model	Description	Best For
`google/gemini-3-pro-preview`	Latest Gemini model	Complex reasoning, coding, multimodal tasks
`google/gemini-2.5-pro-preview-03-25`	High capability model	Complex reasoning, coding, multimodal tasks
`google/gemini-2.5-flash`	Fast, efficient performance	Quick responses, high-volume tasks
`google/gemini-2.5-flash-lite-preview-06-17`	Lightweight preview	Simple tasks, low latency requirements
`google/gemini-2.0-flash-001`	Balanced speed and quality	General-purpose applications
`google/gemini-2.0-flash-lite-001`	Lightweight, cost-effective	Simple tasks, low latency requirements

Select the models prefixed with google/ in the dropdown.

5. Configure Your Credentials

Inside the AI Text Generator block:

API Key: Enter your OpenRouter API key
Prompt: Enter your desired prompt text

Get your API key from: https://openrouter.ai/keys

Save your API key as a credential in AutoGPT for easy reuse across multiple blocks.

6. Save Your Agent

Click the "Save" button at the top-right of the builder interface:

Give your agent a descriptive name (e.g., gemini_research_agent)
Click "Save Agent" to confirm

7. Run Your Agent

From the workspace:

Click "Run" next to your saved agent
The request will be sent to the selected Gemini model

8. View the Output

Scroll to the AI Text Generator block
Check the Output panel below it
Copy, export, or pass the result to additional blocks

Gemini-Specific Features

Multimodal Capabilities

Gemini models support multiple input types:

Text: Standard text prompts and completions
Images: Upload and analyze images
Code: Programming and technical reasoning
Long Context: Large context windows for document analysis

Expand Your Agent

Enhance your workflow with additional blocks:

Tools – Fetch URLs, call APIs, scrape data
Memory – Retain context across interactions
Document Processing – Analyze PDFs, text files
Web Search – Combine with real-time information
Chains – Create multi-step reasoning pipelines

Pricing

Gemini models are priced through OpenRouter. Check current rates at: OpenRouter Google Models

Pricing varies by model tier and usage volume.

Troubleshooting

API Key Issues

Ensure you're using an OpenRouter API key, not a Google AI Studio key
Verify the key has sufficient credits
Check that the key is entered correctly without extra spaces

Model Not Available

Gemini models are accessed through OpenRouter
Ensure you've selected a model with the google/ prefix in the dropdown

Rate Limiting

Free tier has request limits per minute
Upgrade to paid tier for production usage
Consider using google/gemini-2.0-flash-lite for cost-effective high-volume tasks

Context Length Errors

Each Gemini model has a maximum context window
Split large tasks across multiple blocks for very long documents

Additional Resources

You are now set up to use Google Gemini models in AutoGPT.