docs/netdata-ai/mcp/mcp-clients/ai-devops-copilot.md
Model Context Protocol (MCP) clients like Claude Desktop, Cursor, Visual Studio Code, JetBrains IDEs, Netdata Web Client, Claude Code, and Gemini CLI can connect to Netdata’s MCP server to bring real observability data into your AI workflows. This enables natural‑language analysis with context from your infrastructure and, for CLI tools, optional automation.
Observability‑driven operations
System configuration management
Automated troubleshooting
When MCP clients have access to Netdata, they can make informed decisions about system changes:
Database Performance Tuning:
PostgreSQL is showing high query response times. Check the metrics and optimize
the configuration.
The AI analyzes connection counts, query performance, and resource usage to adjust connection pools, memory settings, and query optimization parameters.
Resource Management:
This Kubernetes cluster is experiencing frequent pod restarts. Investigate and
fix the resource allocation.
The AI examines CPU, memory, and network metrics to identify resource constraints and adjust limits, requests, and HPA configurations.
Storage Optimization:
Disk usage is growing rapidly on our log servers. Implement appropriate
retention policies.
The AI analyzes disk growth patterns, identifies log volume trends, and configures rotation, compression, and cleanup policies.
Network Performance:
API response times are inconsistent. Check network metrics and optimize the
load balancer configuration.
The AI examines network latency, connection distribution, and backend health to adjust load balancing algorithms and connection settings.
Monitoring Setup:
This server runs Redis but we're not monitoring it properly. Please configure
comprehensive monitoring.
The AI detects the Redis installation, configures appropriate collectors, sets up alerting thresholds, and verifies metric collection.
Auto-scaling Configuration:
Set up intelligent auto-scaling based on current usage patterns I'm seeing.
The AI analyzes historical resource utilization to configure scaling policies, thresholds, and cooldown periods that match actual workload patterns.
Complex Test Environment Setup:
I need a complete test environment that mirrors our production setup: a
multi-tier application with PostgreSQL primary/replica, Redis cluster, message
queues, and load balancers. Set up everything with a Netdata monitoring
everything and realistic test data.
The AI leverages its deep knowledge of application architectures and Netdata's monitoring capabilities to:
Keep in mind however, that usually this prompt should be split into multiple smaller prompts, so that the LLM can focus on completing a smaller task at a time.
This showcases how AI can combine application expertise, infrastructure knowledge, and observability best practices to create sophisticated testing environments that would typically require weeks of manual setup and deep domain expertise.
LLMs Are Not Infallible:
System Impact Awareness:
External LLM Provider Exposure:
Network and System Information:
1. Analysis-First Approach:
Instead of: Fix the high CPU usage on server X
Try: Analyze the CPU metrics on server X and explain what might be causing
high usage and what solutions you recommend
2. Review and Validation:
3. Data Sanitization:
4. Graduated Permissions:
5. Environment Separation:
See dedicated configuration guides for each client:
Claude Desktop
Cursor
Visual Studio Code
JetBrains IDEs
Netdata Web Client
Claude Code
Gemini CLI
Train team members on safe AI usage practices
Establish clear guidelines for when AI assistance is appropriate
Create approval processes for AI-suggested changes
Share lessons learned and best practices across teams
CLI-based AI assistants represent the beginning of a transformation in infrastructure management. As these tools mature, they will likely become central to:
However, the human element remains crucial for oversight, validation, and strategic decision-making. The most successful implementations will be those that thoughtfully balance AI capabilities with human judgment and appropriate safety measures.