packages/docs/plugin-registry/image-generation.md
The Image Generation plugin enables Eliza agents to generate images using AI image models, including OpenAI's DALL-E, Stable Diffusion via FAL, and other providers.
On-demand plugin. This plugin is resolved from the remote elizaOS plugin registry and auto-installs when configured. It is not included in Eliza's bundled
plugins.jsonindex.
Package: @elizaos/plugin-image-generation
The Image Generation plugin registers an IMAGE model handler and a set of actions that allow agents to generate images from text descriptions, edit existing images, and create variations.
eliza plugins install @elizaos/plugin-image-generation
{
"features": {
"imageGen": true
}
}
The plugin supports multiple image generation backends. Configure via the media.image section:
{
"media": {
"image": {
"enabled": true,
"mode": "own-key",
"provider": "openai",
"model": "dall-e-3"
}
}
}
| Environment Variable | Required | Description |
|---|---|---|
OPENAI_API_KEY | Yes | OpenAI API key |
{
"media": {
"image": {
"provider": "openai",
"model": "dall-e-3",
"size": "1024x1024",
"quality": "standard"
}
}
}
| Environment Variable | Required | Description |
|---|---|---|
FAL_API_KEY | Yes | FAL API key from fal.ai |
{
"media": {
"image": {
"provider": "fal",
"model": "fal-ai/flux/schnell"
}
}
}
| Model | Description |
|---|---|
dall-e-3 | Highest quality, 1024×1024 to 1792×1024 |
dall-e-2 | Legacy, 256×256 to 1024×1024 |
| Model ID | Description |
|---|---|
fal-ai/flux/schnell | FLUX Schnell — fast, high quality |
fal-ai/flux/dev | FLUX Dev — more detailed |
fal-ai/flux-pro | FLUX Pro — best quality |
fal-ai/stable-diffusion-v3-medium | SD3 Medium |
fal-ai/stable-video-diffusion | Video generation |
| Action | Description |
|---|---|
GENERATE_MEDIA | Generate an image from a text description using mediaType: "image" |
EDIT_IMAGE | Edit an existing image with a prompt |
IMAGE_VARIATION | Create variations of an existing image |
DESCRIBE_IMAGE | Analyze and describe an image (uses vision) |
After the plugin is loaded:
"Draw a cozy coffee shop on a rainy afternoon, watercolor style"
"Generate an image of a robot reading a book in a library"
"Create a logo for an AI company called Eliza"
The agent generates the image and can share it in the conversation or save it to the workspace.
Generated images are:
| Size | Aspect Ratio |
|---|---|
1024x1024 | Square (default) |
1792x1024 | Landscape |
1024x1792 | Portrait |
| Quality | Description |
|---|---|
standard | Faster, lower cost |
hd | Higher detail, slower, higher cost |
For FAL auto-enable, set in media.image:
{
"media": {
"image": {
"enabled": true,
"mode": "own-key",
"provider": "fal"
}
}
}
This triggers the FAL plugin to load automatically.