Duo Workflow Package

This package implements Workhorse's support for AI-assisted features through integration with the Duo Workflow Service. It provides WebSocket proxying, gRPC communication, and action handling for AI workflows.

Overview

The duoworkflow package enables GitLab's AI-assisted features (Duo Chat, Duo Agent) by:

Managing WebSocket connections between clients and Workhorse
Establishing gRPC streams to the Duo Workflow Service
Handling bidirectional message exchange
Executing actions (HTTP requests, MCP tool calls) on behalf of the Duo Workflow Service
Supporting various deployment scenarios (GitLab.com, self-managed, self-hosted)

Package structure

handler.go: HTTP handler for WebSocket connections and graceful shutdown management
runner.go: Main orchestrator that manages the lifecycle of a single workflow execution
stream_manager.go: Manages gRPC streams to the Duo Workflow Service (primary and optional cloud-tracking stream for self-hosted deployments)
client.go: gRPC client for communicating with the Duo Workflow Service
actions.go: Handler for executing HTTP action requests from the Duo Workflow Service
mcp.go: Model Context Protocol (MCP) client for tool execution
lock.go: Distributed workflow locking using Redis
metrics.go: Prometheus metrics

Core components

Handler

The Handler manages WebSocket connections and provides graceful shutdown:

type Handler struct {
    rails    *api.API
    rdb      *redis.Client
    backend  http.Handler
    upgrader websocket.Upgrader
    runners  sync.Map // map[*runner]bool
}

Key responsibilities:

Accepts HTTP requests and upgrades them to WebSocket connections
Pre-authorizes requests with GitLab Rails
Creates and tracks runner instances for each connection
Gracefully shuts down all active runners during server shutdown

Usage:

handler := NewHandler(rails, rdb, backend)
http.Handle("/ai/duoworkflow", handler.Build())

Client

The Client manages gRPC communication with the Duo Workflow Service:

type Client struct {
    grpcConn   *grpc.ClientConn
    grpcClient pb.DuoWorkflowClient
    headers    map[string]string
}

Key features:

Creates gRPC connections with keepalive parameters
Maintains bidirectional streams for message exchange
Implements retry logic with exponential backoff
Handles connection failures gracefully

Configuration:

Maximum message size: 4MB
Keepalive time: 20 seconds
Retry attempts: 4 with exponential backoff

Runner

The runner orchestrates a single workflow execution:

type runner struct {
    rails              *api.API
    backend            http.Handler
    token              string
    originalReq        *http.Request
    conn               websocketConn
    streamManager      *streamManager
    lockManager        *workflowLockManager
    workflowID         string
    mutex              *redsync.Mutex
    lockFlow           bool
    serverCapabilities []string
    mcpManager         mcpManager
    workflowDefinition string
}

Responsibilities:

Handles WebSocket messages from clients
Handles gRPC actions from the Duo Workflow Service
Manages message serialization/deserialization
Coordinates HTTP request execution and MCP tool calls
Manages workflow lifecycle and graceful shutdown

Message handling flow:

WebSocket messages → Unmarshal JSON to Protocol Buffer → Send to gRPC stream
gRPC actions → Process action type → Execute action → Send response back to gRPC stream
WebSocket closure → Send StopWorkflow request → Wait for acknowledgment

Stream manager

The streamManager manages gRPC streams to the Duo Workflow Service:

type streamManager struct {
    wf                 workflowStream
    client             *Client
    cloudServiceClient *Client
    cloudServiceStream selfHostedWorkflowStream
    originalReq        *http.Request
    sendMu             sync.Mutex
}

Responsibilities:

Opens and owns the primary ExecuteWorkflow gRPC stream
Optionally opens a secondary TrackSelfHostedExecuteWorkflow stream for self-hosted deployments
Provides mutex-protected Send to allow concurrent goroutines to write safely
Translates io.EOF and quota-exceeded gRPC errors into sentinel errors on Recv

Action handlers

HTTP request execution

The runHTTPActionHandler executes HTTP requests to the GitLab API:

type runHTTPActionHandler struct {
    rails       *api.API
    backend     http.Handler
    token       string
    originalReq *http.Request
    action      *pb.Action
}

Process:

Parse the action's path and method
Construct an HTTP request to the GitLab API
Add authentication headers (OAuth token)
Add client IP information (X-Forwarded-For)
Execute the request through the backend handler
Capture response body and status code
Enforce maximum response size (4MB)
Return response as Protocol Buffer ActionResponse

Security features:

Uses OAuth tokens from the original request
Validates request paths
Limits response body size to prevent memory exhaustion
Preserves X-Forwarded-For headers

MCP tool execution

The MCP manager handles communication with Model Context Protocol servers:

type mcpManager interface {
    Tools() []*pb.McpTool
    PreApprovedTools() []string
    HasTool(name string) bool
    CallTool(ctx context.Context, action *pb.Action) (*pb.ClientEvent, error)
    Close() error
}

Features:

Initializes connections to configured MCP servers
Discovers available tools from each server
Filters tools by name and pre-approved status
Executes tool calls and returns results
Handles tool execution errors gracefully

Workflow locking

For self-managed instances, distributed workflow locking prevents concurrent execution:

type workflowLockManager struct {
    rdb *redis.Client
}

Process:

Acquire a distributed lock when workflow starts
Release the lock when workflow ends
Return specific error if lock cannot be acquired
Requires Redis configuration

Message flow

Client to Duo Workflow Service

WebSocket ClientEvent
    ↓
Unmarshal JSON to Protocol Buffer
    ↓
streamManager.Send
    ↓
Duo Workflow Service receives ClientEvent

Duo Workflow Service to client

gRPC Action received (streamManager.Recv)
    ↓
Determine action type
    ↓
Execute action (HTTP request or MCP tool)
    ↓
Create ActionResponse
    ↓
streamManager.Send
    ↓
Receive response in gRPC stream
    ↓
Marshal to JSON
    ↓
Send to WebSocket
    ↓
Client receives action response

Metrics

The package exposes four Prometheus counters.

`gitlab_workhorse_duo_workflow_connections_total`

Incremented for every inbound request that passes pre-authorization, before the WebSocket upgrade is attempted. This includes requests that subsequently fail to upgrade.

`gitlab_workhorse_duo_workflow_connection_errors_total`

Incremented whenever a connection fails at any stage, labelled by error_type:

Stage	Trigger	`error_type`
WebSocket upgrade	`websocket.Upgrader.Upgrade` returns an error	`other`
Runner initialisation	`newRunner` / `newStreamManager` returns an error	`other`
Runner execution	Usage quota exceeded	`quota_exceeded`
Runner execution	Workflow lock cannot be acquired	`locked`
Runner execution	Any other `runner.Execute` error	`other`

The ratio connection_errors_total / connections_total gives the connection error rate.

Example query to break down errors by type:

promql

rate(gitlab_workhorse_duo_workflow_connection_errors_total[5m]) by (error_type)

`gitlab_workhorse_duo_workflow_sessions_total`

Incremented each time a gRPC ExecuteWorkflow stream is successfully opened to the Duo Workflow Service (inside newStreamManager).

`gitlab_workhorse_duo_workflow_session_errors_total`

Incremented for every non-EOF error received on the ExecuteWorkflow stream, labelled by the gRPC status code string (e.g. "Internal", "Unavailable", "ResourceExhausted"). io.EOF is the normal end-of-stream signal and does not increment this counter.

Example query:

promql

rate(gitlab_workhorse_duo_workflow_session_errors_total[5m]) by (grpc_code)

Configuration

From GitLab Rails

Workhorse receives configuration during pre-authorization:

type DuoWorkflow struct {
    Service                  *DuoWorkflowServiceConfig // Primary gRPC service
    CloudServiceForSelfHosted *DuoWorkflowServiceConfig // Optional cloud tracking service
    LockConcurrentFlow       bool                      // Enable workflow locking
    McpServers               map[string]*McpServerConfig
}

type DuoWorkflowServiceConfig struct {
    URI     string            // gRPC service URI (e.g., "localhost:50052")
    Headers map[string]string // Headers for gRPC requests (e.g., OAuth token)
    Secure  bool              // Use TLS for gRPC connection
}

MCP server configuration

MCP servers are configured in GitLab Rails and passed to Workhorse:

ruby

{
  gitlab: {
    # URL is automatically resolved in Workhorse
    Headers: { "Authorization" => "Bearer token" },
    Tools: ["tool1", "tool2"], # Empty means all tools
    PreApprovedTools: ["tool1"]
  },
  external_server: {
    URL: "https://mcp-server.example.com",
    Headers: { "Authorization" => "Bearer token" },
    Tools: []
  }
}

Error handling

Connection errors

Duo Workflow Service unavailable: Returns ErrServerUnavailable and closes WebSocket
MCP server unavailable: Logs error and continues without MCP tools
Network errors: Implements gRPC retry logic with exponential backoff
Usage quota exceeded: gRPC RESOURCE_EXHAUSTED with USAGE_QUOTA_EXCEEDED message is translated to errUsageQuotaExceededError; the WebSocket is closed with ClosePolicyViolation

Message handling errors

Invalid messages: Logs error and closes connection
Oversized responses: Truncates response bodies to maximum size
Serialization errors: Returns error response to Duo Workflow Service

Graceful shutdown

During server shutdown:

Initiates graceful shutdown of all active runners
Sends StopWorkflow requests to Duo Workflow Service
Waits for workflows to complete within timeout
Forcefully terminates connections if needed

Security

Authentication and authorization

Pre-authorization with GitLab Rails before WebSocket upgrade
OAuth tokens from original request used for API calls
Secure token propagation through Workhorse

Testing

Test files are located in the same directory:

handler_test.go: WebSocket connection handling and pre-authorization
client_test.go: gRPC client creation and connection management
runner_test.go: Message handling and workflow execution
actions_test.go: HTTP request execution and response handling
mcp_test.go: MCP server communication
lock_test.go: Distributed workflow locking
metrics_test.go: Prometheus counter instrumentation

Running tests

bash

go test ./internal/ai_assist/duoworkflow/... -v

Constants and limits

MaxMessageSize: 4MB — maximum size of gRPC messages
ActionResponseBodyLimit: ~4MB — maximum response body size
wsWriteDeadline: 60 seconds — WebSocket write timeout
wsCloseTimeout: 5 seconds — WebSocket close timeout
wsStopWorkflowTimeout: 10 seconds — workflow stop request timeout
gRPC keepalive time: 20 seconds — keepalive ping interval
gRPC retry attempts: 4 — maximum retry attempts for failed requests

Duo Workflow Package

Duo Workflow Package

Overview

Package structure

Core components

Handler

Client

Runner

Stream manager

Action handlers

HTTP request execution

MCP tool execution

Workflow locking

Message flow

Client to Duo Workflow Service

Duo Workflow Service to client

Metrics

gitlab_workhorse_duo_workflow_connections_total

gitlab_workhorse_duo_workflow_connection_errors_total

gitlab_workhorse_duo_workflow_sessions_total

gitlab_workhorse_duo_workflow_session_errors_total

Configuration

From GitLab Rails

MCP server configuration

Error handling

Connection errors

Message handling errors

Graceful shutdown

Security

Authentication and authorization

Testing

Running tests

Constants and limits

Related documentation

`gitlab_workhorse_duo_workflow_connections_total`

`gitlab_workhorse_duo_workflow_connection_errors_total`

`gitlab_workhorse_duo_workflow_sessions_total`

`gitlab_workhorse_duo_workflow_session_errors_total`