website/docs/integrations/index.md
Hermes Agent connects to external systems for AI inference, tool servers, IDE workflows, programmatic access, and more. These integrations extend what Hermes can do and where it can run.
:::tip Start here If you only have time to set up one integration, set up Nous Portal — a single OAuth login covers 300+ models plus the four Tool Gateway tools (web search, image generation, TTS, and browser automation). :::
Hermes supports multiple AI inference providers out of the box. Use hermes model to configure interactively, or set them in config.yaml.
The web_search and web_extract tools support eight backend providers, configured via config.yaml or hermes tools:
| Backend | Env Var | Search | Extract | Crawl |
|---|---|---|---|---|
| Firecrawl (default) | FIRECRAWL_API_KEY | ✔ | ✔ | ✔ |
| SearXNG | SEARXNG_URL | ✔ | — | — |
| Brave (free tier) | BRAVE_SEARCH_API_KEY | ✔ | — | — |
| DuckDuckGo (ddgs) | (none) | ✔ | — | — |
| Tavily | TAVILY_API_KEY | ✔ | ✔ | ✔ |
| Exa | EXA_API_KEY | ✔ | ✔ | — |
| Parallel | PARALLEL_API_KEY | ✔ | ✔ | — |
| xAI | XAI_API_KEY | ✔ | — | — |
Quick setup example:
web:
backend: firecrawl # firecrawl | searxng | brave-free | ddgs | tavily | exa | parallel | xai
If web.backend is not set, the backend is auto-detected from whichever API key is available. Self-hosted Firecrawl is also supported via FIRECRAWL_API_URL.
Hermes includes full browser automation with multiple backend options for navigating websites, filling forms, and extracting information:
/browser connectagent-browser CLISee Browser Automation for setup and usage.
Text-to-speech and speech-to-text across all messaging platforms:
| Provider | Quality | Cost | API Key |
|---|---|---|---|
| Edge TTS (default) | Good | Free | None needed |
| ElevenLabs | Excellent | Paid | ELEVENLABS_API_KEY |
| OpenAI TTS | Good | Paid | VOICE_TOOLS_OPENAI_KEY |
| MiniMax | Good | Paid | MINIMAX_API_KEY |
| xAI TTS | Good | Paid | XAI_API_KEY |
| NeuTTS | Good | Free | None needed |
Speech-to-text supports six providers: local faster-whisper (free, runs on-device), a local command wrapper, Groq, OpenAI Whisper API, Mistral, and xAI. Voice message transcription works across Telegram, Discord, WhatsApp, and other messaging platforms. See Voice & TTS and Voice Mode for details.
MEMORY.md and USER.md files. The agent maintains bounded stores of personal notes and user profile data that survive across sessions.Hermes runs as a gateway bot on 27+ messaging platforms, all configured through the same gateway subsystem:
See the Messaging Gateway overview for the platform comparison table and setup guide.
ha_list_entities, ha_get_state, ha_list_services, ha_call_service). The Home Assistant toolset activates automatically when HASS_TOKEN is configured.~/.hermes/plugins/, project-local .hermes/plugins/, and pip-installed entry points.