docs/en/guides/ai-core-architecture.md
Version: v4.0 (ToolFactory + providerToolPlugin unified tool injection)
Updated: 2026-03-20
Applicable to: Cherry Studio v1.8.1+
This document describes the complete data flow and architectural design from user interaction to AI SDK calls in Cherry Studio. It serves as the key documentation for understanding the application's core functionality.
Cherry Studio's AI calls follow a clear layered architecture:
┌─────────────────────────────────────────────────────────────┐
│ UI Layer │
│ (React Components, Redux Store, User Interactions) │
└────────────────────────┬────────────────────────────────────┘
│
▼
┌─────────────────────────────────────────────────────────────┐
│ Service Layer │
│ src/renderer/src/services/ │
│ ┌────────────────────────────────────────────────────┐ │
│ │ ApiService.ts │ │
│ │ - transformMessagesAndFetch() │ │
│ │ - fetchChatCompletion() │ │
│ │ - fetchMessagesSummary() │ │
│ └────────────────────────────────────────────────────┘ │
└────────────────────────┬────────────────────────────────────┘
│
▼
┌─────────────────────────────────────────────────────────────┐
│ AI Provider Layer │
│ src/renderer/src/aiCore/ │
│ ┌────────────────────────────────────────────────────┐ │
│ │ AiProvider (AiProvider.ts) │ │
│ │ - completions() │ │
│ │ - modernCompletions() │ │
│ │ - _completionsForTrace() │ │
│ └────────────────────────────────────────────────────┘ │
│ ┌────────────────────────────────────────────────────┐ │
│ │ Provider Config & Adaptation │ │
│ │ - providerConfig.ts │ │
│ │ - providerToAiSdkConfig() │ │
│ └────────────────────────────────────────────────────┘ │
└────────────────────────┬────────────────────────────────────┘
│
▼
┌─────────────────────────────────────────────────────────────┐
│ Core Package Layer │
│ packages/aiCore/ (@cherrystudio/ai-core) │
│ ┌────────────────────────────────────────────────────┐ │
│ │ RuntimeExecutor │ │
│ │ - streamText() │ │
│ │ - generateText() │ │
│ │ - generateImage() │ │
│ └────────────────────────────────────────────────────┘ │
│ ┌────────────────────────────────────────────────────┐ │
│ │ Provider Extension System │ │
│ │ - ProviderExtension (LRU Cache) │ │
│ │ - ExtensionRegistry │ │
│ │ - OpenAI/Anthropic/Google Extensions │ │
│ └────────────────────────────────────────────────────┘ │
│ ┌────────────────────────────────────────────────────┐ │
│ │ Plugin Engine │ │
│ │ - PluginManager │ │
│ │ - AiPlugin Lifecycle Hooks │ │
│ └────────────────────────────────────────────────────┘ │
└────────────────────────┬────────────────────────────────────┘
│
▼
┌─────────────────────────────────────────────────────────────┐
│ AI SDK Layer │
│ Vercel AI SDK v6.x (@ai-sdk/*) │
│ ┌─────────────────────────────────────────────────────┐ │
│ │ Provider Implementations │ │
│ │ - @ai-sdk/openai │ │
│ │ - @ai-sdk/anthropic │ │
│ │ - @ai-sdk/google-generative-ai │ │
│ │ - @ai-sdk/mistral │ │
│ └─────────────────────────────────────────────────────┘ │
│ ┌─────────────────────────────────────────────────────┐ │
│ │ Core Functions │ │
│ │ - streamText() │ │
│ │ - generateText() │ │
│ └─────────────────────────────────────────────────────┘ │
└────────────────────────┬────────────────────────────────────┘
│
▼
┌─────────────────────────────────────────────────────────────┐
│ LLM Provider API │
│ (OpenAI, Anthropic, Google, etc.) │
└─────────────────────────────────────────────────────────────┘
User Input (UI)
│
▼
┌─────────────────────────────────────────────────────────────┐
│ 1. UI Event Handler │
│ - ChatView/MessageInput Component │
│ - Redux dispatch action │
└─────────────────────────┬───────────────────────────────────┘
│
▼
┌─────────────────────────────────────────────────────────────┐
│ 2. ApiService.transformMessagesAndFetch() │
│ Location: src/renderer/src/services/ApiService.ts:92 │
│ │
│ Step 2.1: ConversationService.prepareMessagesForModel() │
│ ├─ Message format conversion (UI Message → Model Message) │
│ ├─ Process image/file attachments │
│ └─ Apply message filtering rules │
│ │
│ Step 2.2: replacePromptVariables() │
│ └─ Replace variables in system prompt │
│ │
│ Step 2.3: injectUserMessageWithKnowledgeSearchPrompt() │
│ └─ Inject knowledge base search prompt (if enabled) │
│ │
│ Step 2.4: fetchChatCompletion() ────────────────────────► │
└─────────────────────────────────────────────────────────────┘
│
▼
┌─────────────────────────────────────────────────────────────┐
│ 3. ApiService.fetchChatCompletion() │
│ Location: src/renderer/src/services/ApiService.ts:139 │
│ │
│ Step 3.1: getProviderByModel() + API Key Rotation │
│ ├─ Get provider configuration │
│ ├─ Apply API key rotation (multi-key load balancing) │
│ └─ Create providerWithRotatedKey │
│ │
│ Step 3.2: new AiProvider(model, provider) │
│ └─ Initialize AI Provider instance │
│ │
│ Step 3.3: buildStreamTextParams() │
│ ├─ Build AI SDK parameters │
│ ├─ Process MCP tools │
│ ├─ Process Web Search configuration │
│ └─ Return aiSdkParams + capabilities │
│ │
│ Step 3.4: buildPlugins(middlewareConfig) │
│ └─ Build plugin array based on capabilities │
│ │
│ Step 3.5: AI.completions(modelId, params, config) ──────► │
└─────────────────────────────────────────────────────────────┘
│
▼
┌─────────────────────────────────────────────────────────────┐
│ 4. AiProvider.completions() │
│ Location: src/renderer/src/aiCore/index_new.ts:116 │
│ │
│ Step 4.1: providerToAiSdkConfig() │
│ ├─ Convert Cherry Provider → AI SDK Config │
│ ├─ Set providerId ('openai', 'anthropic', etc.) │
│ └─ Set providerSettings (apiKey, baseURL, etc.) │
│ │
│ Step 4.2: Claude Code OAuth special handling │
│ └─ Inject Claude Code system message (if OAuth) │
│ │
│ Step 4.3: Routing selection │
│ ├─ If trace enabled → _completionsForTrace() │
│ └─ Otherwise → _completionsOrImageGeneration() │
└─────────────────────────┬───────────────────────────────────┘
│
▼
┌─────────────────────────────────────────────────────────────┐
│ 5. AiProvider._completionsOrImageGeneration() │
│ Location: src/renderer/src/aiCore/index_new.ts:167 │
│ │
│ Decision: │
│ ├─ Image generation endpoint → legacyProvider.completions()│
│ └─ Text generation → modernCompletions() ───────────────► │
└─────────────────────────────────────────────────────────────┘
│
▼
┌─────────────────────────────────────────────────────────────┐
│ 6. AiProvider.modernCompletions() │
│ Location: src/renderer/src/aiCore/index_new.ts:284 │
│ │
│ Step 6.1: buildPlugins(config) │
│ └─ Build plugin array (Reasoning, ToolUse, WebSearch, etc.)│
│ │
│ Step 6.2: createExecutor() ─────────────────────────────► │
│ └─ Create RuntimeExecutor instance │
└─────────────────────────────────────────────────────────────┘
│
▼
┌─────────────────────────────────────────────────────────────┐
│ 7. packages/aiCore: createExecutor() │
│ Location: packages/aiCore/src/core/runtime/index.ts:25 │
│ │
│ Step 7.1: extensionRegistry.createProvider() │
│ ├─ Parse providerId (supports aliases and variants) │
│ ├─ Get ProviderExtension instance │
│ ├─ Compute settings hash │
│ ├─ LRU cache lookup │
│ │ ├─ Cache hit → Return cached instance │
│ │ └─ Cache miss → Create new instance │
│ └─ Return ProviderV3 instance │
│ │
│ Step 7.2: RuntimeExecutor.create() │
│ ├─ Create RuntimeExecutor instance │
│ ├─ Inject provider reference │
│ └─ Initialize PluginEngine │
│ │
│ Return: RuntimeExecutor<T> instance ────────────────────► │
└─────────────────────────────────────────────────────────────┘
│
▼
┌─────────────────────────────────────────────────────────────┐
│ 8. RuntimeExecutor.streamText() │
│ Location: packages/aiCore/src/core/runtime/executor.ts │
│ │
│ Step 8.1: Plugin lifecycle - onRequestStart │
│ └─ Execute all plugins' onRequestStart hooks │
│ │
│ Step 8.2: Internal _resolveModel plugin │
│ └─ Resolve model string → LanguageModel via AI SDK │
│ providerRegistry (no separate ModelResolver class) │
│ │
│ Step 8.3: Plugin transform - transformParams │
│ └─ Chain execute all plugins' parameter transformations │
│ │
│ Step 8.4: Apply middlewares from context │
│ └─ wrapLanguageModel with collected middlewares │
│ │
│ Step 8.5: Call AI SDK streamText() ─────────────────────►│
│ └─ Pass resolved model and transformed params │
└─────────────────────────────────────────────────────────────┘
│
▼
┌─────────────────────────────────────────────────────────────┐
│ 9. AI SDK: streamText() │
│ Location: node_modules/ai/core/generate-text/stream-text │
│ │
│ Step 9.1: Parameter validation │
│ Step 9.2: Call provider.doStream() │
│ Step 9.3: Return StreamTextResult │
│ └─ textStream, fullStream, usage, etc. │
└─────────────────────────┬───────────────────────────────────┘
│
▼
┌─────────────────────────────────────────────────────────────┐
│ 10. Stream Data Processing │
│ Location: src/renderer/src/aiCore/chunk/ │
│ │
│ Step 10.1: AiSdkToChunkAdapter.processStream() │
│ ├─ Listen to AI SDK's textStream │
│ ├─ Convert to Cherry Chunk format │
│ ├─ Process tool calls │
│ ├─ Process reasoning blocks │
│ └─ Send chunk to onChunkReceived callback │
│ │
│ Step 10.2: StreamProcessingService │
│ └─ Process different chunk types and update UI │
└─────────────────────────┬───────────────────────────────────┘
│
▼
┌─────────────────────────────────────────────────────────────┐
│ 11. Plugin Lifecycle - Completion Phase │
│ │
│ Step 11.1: transformResult │
│ └─ Plugins can modify final result │
│ │
│ Step 11.2: onRequestEnd │
│ └─ Execute all plugins' completion hooks │
└─────────────────────────┬───────────────────────────────────┘
│
▼
┌─────────────────────────────────────────────────────────────┐
│ 12. UI Update │
│ - Redux state update │
│ - React component re-render │
│ - Display complete response │
└─────────────────────────────────────────────────────────────┘
// Scenario 1: First OpenAI request (Cache Miss)
const executor1 = await createExecutor("openai", { apiKey: "sk-xxx" });
// → extensionRegistry.createProvider('openai', { apiKey: 'sk-xxx' })
// → Compute hash: "abc123"
// → LRU cache miss
// → OpenAIExtension.factory() creates new provider
// → Store in LRU: cache.set("abc123", provider)
// Scenario 2: Second request with same config (Cache Hit)
const executor2 = await createExecutor("openai", { apiKey: "sk-xxx" });
// → Compute hash: "abc123" (same)
// → LRU cache hit!
// → Return cached provider directly
// → executor1 and executor2 share the same provider instance
// Scenario 3: Different config (Cache Miss + New Instance)
const executor3 = await createExecutor("openai", {
apiKey: "sk-yyy", // different key
baseURL: "https://custom.com/v1",
});
// → Compute hash: "def456" (different)
// → LRU cache miss
// → Create new independent provider instance
// → Store in LRU: cache.set("def456", provider2)
// Example: Reasoning + ToolUse + WebSearch enabled
plugins = [ReasoningPlugin, ToolUsePlugin, WebSearchPlugin]
// Execution order:
// 1. onRequestStart:  Reasoning → ToolUse → WebSearch
// 2. transformParams:  Reasoning → ToolUse → WebSearch (chained)
// 3. [AI SDK call]
// 4. transformResult:  WebSearch → ToolUse → Reasoning (reverse)
// 5. onRequestEnd:     WebSearch → ToolUse → Reasoning (reverse)
File: src/renderer/src/services/ApiService.ts
transformMessagesAndFetch()
Signature:
async function transformMessagesAndFetch(
request: {
messages: Message[];
assistant: Assistant;
blockManager: BlockManager;
assistantMsgId: string;
callbacks: StreamProcessorCallbacks;
topicId?: string;
options: {
signal?: AbortSignal;
timeout?: number;
headers?: Record<string, string>;
};
},
onChunkReceived: (chunk: Chunk) => void,
): Promise<void>;
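A hypothetical call site; everything except the parameter names (which follow the signature above) is assumed to come from the surrounding UI code:

```typescript
// Sketch of a caller; topicMessages, blockManager, streamProcessorCallbacks,
// and handleChunk are assumptions for illustration
const abortController = new AbortController()

await transformMessagesAndFetch(
  {
    messages: topicMessages,
    assistant,
    blockManager,
    assistantMsgId,
    callbacks: streamProcessorCallbacks,
    topicId: topic.id,
    options: { signal: abortController.signal, timeout: 60_000 }
  },
  (chunk) => handleChunk(chunk) // ultimately feeds StreamProcessingService (step 10.2)
)
```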
Execution Flow:
// Step 1: Message preparation
const { modelMessages, uiMessages } =
await ConversationService.prepareMessagesForModel(messages, assistant);
// modelMessages: Converted to LLM-understandable format
// uiMessages: Original UI messages (for special scenarios)
// Step 2: Replace prompt variables
assistant.prompt = await replacePromptVariables(
assistant.prompt,
assistant.model?.name,
);
// e.g.: "{model_name}" → "GPT-4"
// Step 3: Inject knowledge base search
await injectUserMessageWithKnowledgeSearchPrompt({
modelMessages,
assistant,
assistantMsgId,
topicId,
blockManager,
setCitationBlockId,
});
// Step 4: Make actual request
await fetchChatCompletion({
messages: modelMessages,
assistant,
topicId,
requestOptions,
uiMessages,
onChunkReceived,
});
fetchChatCompletion()
Key Code Analysis:
export async function fetchChatCompletion({
messages,
assistant,
requestOptions,
onChunkReceived,
topicId,
uiMessages,
}: FetchChatCompletionParams) {
// 1. Provider preparation + API Key rotation
const baseProvider = getProviderByModel(assistant.model || getDefaultModel());
const providerWithRotatedKey = {
...baseProvider,
apiKey: getRotatedApiKey(baseProvider), // ✅ Multi-key load balancing
};
// 2. Create AI Provider instance
const AI = new AiProvider(
assistant.model || getDefaultModel(),
providerWithRotatedKey,
);
// 3. Get MCP tools
const mcpTools: MCPTool[] = [];
if (isPromptToolUse(assistant) || isSupportedToolUse(assistant)) {
mcpTools.push(...(await fetchMcpTools(assistant)));
}
// 4. Build AI SDK parameters
const {
params: aiSdkParams,
modelId,
capabilities,
webSearchPluginConfig,
  } = await buildStreamTextParams(messages, assistant, providerWithRotatedKey, {
mcpTools,
webSearchProviderId: assistant.webSearchProviderId,
requestOptions,
});
// 5. Build middleware configuration
const middlewareConfig: AiSdkMiddlewareConfig = {
streamOutput: assistant.settings?.streamOutput ?? true,
onChunk: onChunkReceived,
model: assistant.model,
enableReasoning: capabilities.enableReasoning,
    isPromptToolUse: isPromptToolUse(assistant),
isSupportedToolUse: isSupportedToolUse(assistant),
webSearchPluginConfig,
enableWebSearch: capabilities.enableWebSearch,
enableGenerateImage: capabilities.enableGenerateImage,
enableUrlContext: capabilities.enableUrlContext,
mcpTools,
uiMessages,
knowledgeRecognition: assistant.knowledgeRecognition,
};
// 6. Call AI.completions()
await AI.completions(modelId, aiSdkParams, {
...middlewareConfig,
assistant,
topicId,
callType: "chat",
uiMessages,
});
}
API Key Rotation Mechanism:
function getRotatedApiKey(provider: Provider): string {
const keys = provider.apiKey
.split(",")
.map((k) => k.trim())
.filter(Boolean);
if (keys.length === 1) return keys[0];
const keyName = `provider:${provider.id}:last_used_key`;
const lastUsedKey = window.keyv.get(keyName);
const currentIndex = keys.indexOf(lastUsedKey);
const nextIndex = (currentIndex + 1) % keys.length;
const nextKey = keys[nextIndex];
window.keyv.set(keyName, nextKey);
return nextKey;
}
// Usage scenario:
// provider.apiKey = "sk-key1,sk-key2,sk-key3"
// Request 1 → use sk-key1
// Request 2 → use sk-key2
// Request 3 → use sk-key3
// Request 4 → use sk-key1 (cycle)
File: src/renderer/src/aiCore/index_new.ts
constructor(modelOrProvider: Model | Provider, provider?: Provider) {
if (this.isModel(modelOrProvider)) {
// Case 1: new AiProvider(model, provider)
this.model = modelOrProvider
this.actualProvider = provider
? adaptProvider({ provider, model: modelOrProvider })
: getActualProvider(modelOrProvider)
// Sync or async config creation
const configOrPromise = providerToAiSdkConfig(
this.actualProvider,
modelOrProvider
)
this.config = configOrPromise instanceof Promise
? undefined
: configOrPromise
} else {
// Case 2: new AiProvider(provider)
this.actualProvider = adaptProvider({ provider: modelOrProvider })
}
this.legacyProvider = new LegacyAiProvider(this.actualProvider)
}
public async completions(
modelId: string,
params: StreamTextParams,
providerConfig: AiProviderConfig
) {
// 1. Ensure config is ready
if (!this.config) {
this.config = await Promise.resolve(
providerToAiSdkConfig(this.actualProvider, this.model!)
)
}
// 2. Claude Code OAuth special handling
if (this.actualProvider.id === 'anthropic' &&
this.actualProvider.authType === 'oauth') {
const claudeCodeSystemMessage = buildClaudeCodeSystemModelMessage(
params.system
)
params.system = undefined
params.messages = [...claudeCodeSystemMessage, ...(params.messages || [])]
}
// 3. Routing selection
if (providerConfig.topicId && getEnableDeveloperMode()) {
return await this._completionsForTrace(modelId, params, {
...providerConfig,
topicId: providerConfig.topicId
})
} else {
return await this._completionsOrImageGeneration(modelId, params, providerConfig)
}
}
private async modernCompletions(
modelId: string,
params: StreamTextParams,
config: AiProviderConfig
): Promise<CompletionsResult> {
// 1. Build plugins
const plugins = buildPlugins(config)
// 2. Create RuntimeExecutor
const executor = await createExecutor(
this.config!.providerId,
this.config!.providerSettings,
plugins
)
// 3. Streaming call
if (config.onChunk) {
const accumulate = this.model!.supported_text_delta !== false
const adapter = new AiSdkToChunkAdapter(
config.onChunk,
config.mcpTools,
accumulate,
config.enableWebSearch
)
const streamResult = await executor.streamText({
...params,
model: modelId,
experimental_context: { onChunk: config.onChunk }
})
const finalText = await adapter.processStream(streamResult)
return { getText: () => finalText }
} else {
// Non-streaming call
const streamResult = await executor.streamText({
...params,
model: modelId
})
await streamResult?.consumeStream()
const finalText = await streamResult.text
return { getText: () => finalText }
}
}
File: src/renderer/src/aiCore/provider/providerConfig.ts
export function providerToAiSdkConfig(
provider: Provider,
model?: Model,
): ProviderConfig | Promise<ProviderConfig> {
// 1. Route to specific implementation based on provider.id
switch (provider.id) {
case "openai":
return {
providerId: "openai",
providerSettings: {
apiKey: provider.apiKey,
baseURL: provider.apiHost,
organization: provider.apiOrganization,
headers: provider.apiHeaders,
},
};
case "anthropic":
return {
providerId: "anthropic",
providerSettings: {
apiKey: provider.apiKey,
baseURL: provider.apiHost,
},
};
case "openai-compatible":
return {
providerId: "openai-compatible",
providerSettings: {
baseURL: provider.apiHost,
apiKey: provider.apiKey,
name: provider.name,
},
};
case "gateway":
// Special handling: gateway requires async creation
return createGatewayConfig(provider, model);
// ... other providers
}
}
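Because the gateway case returns a Promise, callers normalize the union before use. AiProvider.completions() does exactly this (shown earlier); sketched here in isolation, with `plugins` assumed in scope:

```typescript
// Normalize the sync/async union before touching the config
const config = await Promise.resolve(providerToAiSdkConfig(provider, model))
// config.providerId       → 'openai' | 'anthropic' | 'openai-compatible' | ...
// config.providerSettings → handed to createExecutor() unchanged
const executor = await createExecutor(config.providerId, config.providerSettings, plugins)
```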
File: packages/aiCore/src/core/providers/core/ProviderExtension.ts
export class ProviderExtension<
TSettings = any,
TStorage extends ExtensionStorage = ExtensionStorage,
TProvider extends ProviderV3 = ProviderV3,
TConfig extends ProviderExtensionConfig<TSettings, TStorage, TProvider> =
ProviderExtensionConfig<TSettings, TStorage, TProvider>,
> {
// 1. LRU cache (settings hash → provider instance)
private instances: LRUCache<string, TProvider>;
constructor(public readonly config: TConfig) {
this.instances = new LRUCache<string, TProvider>({
max: 10, // Cache up to 10 instances
updateAgeOnGet: true, // LRU behavior
});
}
// 2. Create provider (with caching)
async createProvider(
settings?: TSettings,
variantSuffix?: string,
): Promise<TProvider> {
// 2.1 Merge default configuration
const mergedSettings = this.mergeSettings(settings);
// 2.2 Compute hash (including variantSuffix)
const hash = this.computeHash(mergedSettings, variantSuffix);
// 2.3 LRU cache lookup
const cachedInstance = this.instances.get(hash);
if (cachedInstance) {
return cachedInstance;
}
// 2.4 Cache miss, create new instance
const provider = await this.factory(mergedSettings, variantSuffix);
// 2.5 Execute lifecycle hooks
await this.lifecycle.onCreate?.(provider, mergedSettings);
// 2.6 Store in LRU cache
this.instances.set(hash, provider);
return provider;
}
// 3. Hash computation (ensures same config gets same hash)
private computeHash(settings?: TSettings, variantSuffix?: string): string {
const baseHash = (() => {
if (settings === undefined || settings === null) {
return "default";
}
// Stable serialization (sort object keys)
const stableStringify = (obj: any): string => {
if (obj === null || obj === undefined) return "null";
if (typeof obj !== "object") return JSON.stringify(obj);
if (Array.isArray(obj))
return `[${obj.map(stableStringify).join(",")}]`;
const keys = Object.keys(obj).sort();
const pairs = keys.map(
(key) => `${JSON.stringify(key)}:${stableStringify(obj[key])}`,
);
return `{${pairs.join(",")}}`;
};
const serialized = stableStringify(settings);
// Simple hash function
let hash = 0;
for (let i = 0; i < serialized.length; i++) {
const char = serialized.charCodeAt(i);
hash = (hash << 5) - hash + char;
hash = hash & hash;
}
return `${Math.abs(hash).toString(36)}`;
})();
// Append variantSuffix
return variantSuffix ? `${baseHash}:${variantSuffix}` : baseHash;
}
}
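Because computeHash() sorts object keys before serializing, field order in the settings object never affects cache identity. A small sketch, assuming an `openAIExtension` instance:

```typescript
// Same settings in a different key order hash identically → LRU cache hit
const a = await openAIExtension.createProvider({ apiKey: 'sk-xxx', baseURL: 'https://api.openai.com/v1' })
const b = await openAIExtension.createProvider({ baseURL: 'https://api.openai.com/v1', apiKey: 'sk-xxx' })
console.assert(a === b) // stable stringify → identical hash → shared cached instance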
Each Extension can declare toolFactories, internalizing AI SDK tool capabilities (web search, URL context, etc.) into the Provider instance. Plugins only query the registry — no need to know specific SDK tool names.
// toolFactory.ts — Core types
type ToolCapability = 'webSearch' | 'fileSearch' | 'codeExecution' | 'urlContext'
interface ToolFactoryPatch {
tools?: ToolSet // Tools to merge into params.tools
providerOptions?: Record<string, any> // Options to merge into params.providerOptions
}
// Factory: provider instance → config → patch
type ToolFactory<TProvider> = (provider: TProvider) => (...args: any[]) => ToolFactoryPatch
type ToolFactoryMap<TProvider> = {
[K in ToolCapability]?: ToolFactory<TProvider>
}
Design points:
- `ToolFactoryPatch` instead of a single `Tool`, supporting multi-tool (xAI's webSearch + xSearch) and non-tool (OpenRouter's providerOptions) cases
- `ToolFactory` uses `...args: any[]` instead of `config: Record<string, any>`, preserving concrete config types with `as const satisfies`
- `ExtractToolConfig<TExt, K>` extracts config types from declarations; `WebSearchToolConfigMap` is auto-generated from `coreExtensions`

// packages/aiCore/src/core/providers/core/initialization.ts
const OpenAIExtension = ProviderExtension.create({
name: 'openai',
aliases: ['openai-response'] as const,
create: createOpenAI,
// Tool capability declaration — config types inferred from SDK by TypeScript
toolFactories: {
webSearch: (p: OpenAIProvider) =>
(config: NonNullable<Parameters<OpenAIProvider['tools']['webSearch']>[0]>) => ({
tools: { webSearch: p.tools.webSearch(config) }
})
},
variants: [{
suffix: 'chat',
name: 'OpenAI Chat',
transform: (provider: OpenAIProvider) => customProvider({
fallbackProvider: {
...provider,
languageModel: (modelId) => provider.chat(modelId)
}
}),
// Variants can override base toolFactories
toolFactories: {
webSearch: (p: OpenAIProvider) =>
(config) => ({ tools: { webSearch: p.tools.webSearchPreview(config) } })
}
}] as const
} as const satisfies ProviderExtensionConfig<OpenAIProviderSettings, ExtensionStorage, OpenAIProvider, 'openai'>)
// Extract webSearch config type from extension declaration
type ExtractToolConfig<TExt, K extends string> = TExt extends {
config: { toolFactories?: { [P in K]?: (provider: any) => (config: infer C) => any } }
} ? C : never
// Auto-generate { openai?: OpenAISearchConfig, anthropic?: ..., ... } from coreExtensions
type WebSearchToolConfigMap = ExtractToolConfigMap<(typeof coreExtensions)[number], 'webSearch'>
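The generated map then type-checks per-provider tool configs at the call site. A hedged sketch; the `searchContextSize` field is assumed from the AI SDK's OpenAI web search options and may differ across SDK versions:

```typescript
// Config object keyed by provider id, typed by the auto-generated map
const webSearchPluginConfig: WebSearchToolConfigMap = {
  openai: { searchContextSize: 'high' } // field name assumed from @ai-sdk/openai
}
plugins.push(providerToolPlugin('webSearch', webSearchPluginConfig))
```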
File: packages/aiCore/src/core/providers/core/ExtensionRegistry.ts
In addition to registration, lookup, and provider creation, the registry now handles tool capability resolution:
export class ExtensionRegistry {
// ... register(), get(), createProvider(), parseProviderId() unchanged
// ==================== Tool Capability Resolution ====================
/**
* Get tool factory for a specific provider
* Variants check their own toolFactories first, then fall back to base
*/
getToolFactory(providerId: string, capability: ToolCapability): ToolFactory | undefined
/**
* Resolve tool capability — the single entry point for plugins
*
* 1. Direct: provider has its own toolFactories → use cached provider instance
* 2. Aggregator fallback: resolve real provider from model.provider segments
* e.g., "aihubmix.google" → "google" → Google extension
*
* For aggregator providers, internally creates tool-only provider (descriptors don't make network calls)
*/
async resolveToolCapability(
providerId: string,
capability: ToolCapability,
modelProvider?: string
): Promise<{ factory: ToolFactory; provider: ProviderV3 } | undefined>
/**
* Get tool-only provider instance (internal)
* Prefers existing cached instance, otherwise creates dummy (descriptors don't need real API key)
*/
private async getToolProvider(providerId: string): Promise<ProviderV3 | undefined>
}
User request: aihubmix + claude-opus-4-6
│
├─ model.provider = "aihubmix.anthropic" (set by aihubmix provider)
│
├─ resolveToolCapability('aihubmix', 'webSearch', 'aihubmix.anthropic')
│ ├─ Direct: getToolFactory('aihubmix', 'webSearch') → undefined (no toolFactories)
│ └─ Fallback: split "aihubmix.anthropic" → try "anthropic"
│ ├─ getToolFactory('anthropic', 'webSearch') → found!
│ └─ getToolProvider('anthropic') → create/cache Anthropic provider
│
└─ factory(anthropicProvider)(config) → { tools: { webSearch: ... } }
File: packages/aiCore/src/core/plugins/types.ts
export interface AiPlugin {
/** Plugin name */
name: string;
/** Before request starts */
onRequestStart?: (context: PluginContext) => void | Promise<void>;
/** Transform parameters (chained call) */
transformParams?: (params: any, context: PluginContext) => any | Promise<any>;
/** Transform result */
transformResult?: (result: any, context: PluginContext) => any | Promise<any>;
/** After request ends */
onRequestEnd?: (context: PluginContext) => void | Promise<void>;
/** Error handling */
onError?: (error: Error, context: PluginContext) => void | Promise<void>;
}
export interface PluginContext {
providerId: string;
model?: string;
messages?: any[];
tools?: any;
// Custom data from experimental_context
[key: string]: any;
}
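A minimal custom plugin sketched against this interface (the timing logic is illustrative, not a built-in plugin):

```typescript
const timingPlugin: AiPlugin = {
  name: 'timing',
  onRequestStart: (context) => {
    context.startedAt = Date.now() // PluginContext permits custom keys
  },
  transformParams: (params) => params, // pass-through; defaults could be injected here
  onRequestEnd: (context) => {
    console.log(`[timing] ${context.providerId} finished in ${Date.now() - context.startedAt}ms`)
  },
  onError: (error, context) => {
    console.error(`[timing] ${context.providerId} failed: ${error.message}`)
  }
}
```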
File: packages/aiCore/src/core/plugins/PluginEngine.ts
export class PluginEngine {
constructor(
private providerId: string,
private plugins: AiPlugin[],
) {}
// 1. Execute onRequestStart
async executeOnRequestStart(params: any): Promise<void> {
const context = this.createContext(params);
for (const plugin of this.plugins) {
if (plugin.onRequestStart) {
await plugin.onRequestStart(context);
}
}
}
// 2. Chain execute transformParams
async executeTransformParams(params: any): Promise<any> {
let transformedParams = params;
const context = this.createContext(params);
for (const plugin of this.plugins) {
if (plugin.transformParams) {
transformedParams = await plugin.transformParams(
transformedParams,
context,
);
}
}
return transformedParams;
}
// 3. Execute transformResult
async executeTransformResult(result: any, params: any): Promise<any> {
let transformedResult = result;
const context = this.createContext(params);
// Execute in reverse order
for (let i = this.plugins.length - 1; i >= 0; i--) {
const plugin = this.plugins[i];
if (plugin.transformResult) {
transformedResult = await plugin.transformResult(
transformedResult,
context,
);
}
}
return transformedResult;
}
// 4. Execute onRequestEnd
async executeOnRequestEnd(params: any): Promise<void> {
const context = this.createContext(params);
// Execute in reverse order
for (let i = this.plugins.length - 1; i >= 0; i--) {
const plugin = this.plugins[i];
if (plugin.onRequestEnd) {
await plugin.onRequestEnd(context);
}
}
}
}
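How the four phases line up at a call site; a sketch only, since RuntimeExecutor drives these calls internally (step 8), and the plugin variables and `callAiSdk` are placeholders:

```typescript
const engine = new PluginEngine('openai', [reasoningPlugin, toolUsePlugin, webSearchPlugin])

await engine.executeOnRequestStart(params)                      // forward order
const finalParams = await engine.executeTransformParams(params) // forward, chained
const result = await callAiSdk(finalParams)                     // placeholder for the AI SDK call
const finalResult = await engine.executeTransformResult(result, finalParams) // reverse order
await engine.executeOnRequestEnd(finalParams)                   // reverse order
```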
File: packages/aiCore/src/core/plugins/built-in/providerToolPlugin.ts
All provider-defined tool injection (webSearch, urlContext, etc.) is handled by the unified providerToolPlugin. It is pure orchestration — query the registry, get the factory, apply the patch:
export const providerToolPlugin = (capability: ToolCapability, config: Record<string, any> = {}) =>
definePlugin({
name: capability,
enforce: 'pre',
transformParams: async (params: any, context) => {
const { providerId } = context
// Get model.provider from context.model (for aggregator provider fallback)
const modelProvider =
context.model && typeof context.model !== 'string' && 'provider' in context.model
? (context.model.provider as string)
: undefined
// Registry handles everything: direct lookup + aggregator fallback + provider acquisition
const resolved = await extensionRegistry.resolveToolCapability(providerId, capability, modelProvider)
if (!resolved) return params
const userConfig = config[providerId] ?? {}
const patch = resolved.factory(resolved.provider)(userConfig)
// Unified merge — one if, no provider-specific branches
if (patch.tools) params.tools = { ...params.tools, ...patch.tools }
if (patch.providerOptions) params.providerOptions = mergeProviderOptions(...)
return params
}
})
Usage (in PluginBuilder):
// webSearch and urlContext are both specializations of providerToolPlugin
if (config.enableWebSearch) {
plugins.push(providerToolPlugin('webSearch', config.webSearchPluginConfig))
}
if (config.enableUrlContext) {
plugins.push(providerToolPlugin('urlContext'))
}
Other built-in plugins remain unchanged. See:
- packages/aiCore/src/core/plugins/built-in/reasoning/ — Reasoning mode
- packages/aiCore/src/core/plugins/built-in/toolUsePlugin/ — Prompt Tool Use
- packages/aiCore/src/core/plugins/built-in/logging.ts — Request logging

File: src/renderer/src/services/ConversationService.ts
export class ConversationService {
/**
* Prepare messages for LLM call
*
* @returns {
* modelMessages: AI SDK format messages
* uiMessages: Original UI messages (for special scenarios)
* }
*/
static async prepareMessagesForModel(
messages: Message[],
assistant: Assistant,
): Promise<{
modelMessages: CoreMessage[];
uiMessages: Message[];
}> {
// 1. Filter messages
let filteredMessages = messages
.filter((m) => !m.isDeleted)
.filter((m) => m.role !== "system");
// 2. Apply context window limit
const contextLimit = assistant.settings?.contextLimit || 10;
if (contextLimit > 0) {
filteredMessages = takeRight(filteredMessages, contextLimit);
}
// 3. Convert to AI SDK format
const modelMessages: CoreMessage[] = [];
for (const msg of filteredMessages) {
const converted = await this.convertMessageToAiSdk(msg, assistant);
if (converted) {
modelMessages.push(converted);
}
}
// 4. Add system message
if (assistant.prompt) {
modelMessages.unshift({
role: "system",
content: assistant.prompt,
});
}
return {
modelMessages,
uiMessages: filteredMessages,
};
}
}
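A worked example of the filtering rules above, with simplified message shapes (all values assumed):

```typescript
import { takeRight } from 'lodash'

const messages = [
  { role: 'user', content: 'q1', isDeleted: false },
  { role: 'assistant', content: 'a1', isDeleted: true },  // dropped: deleted
  { role: 'system', content: 'old', isDeleted: false },   // dropped: system role
  { role: 'user', content: 'q2', isDeleted: false },
  { role: 'assistant', content: 'a2', isDeleted: false },
  { role: 'user', content: 'q3', isDeleted: false }
]
const contextLimit = 3
const kept = takeRight(
  messages.filter((m) => !m.isDeleted).filter((m) => m.role !== 'system'),
  contextLimit
)
// kept → q2, a2, q3; the system prompt is unshifted afterwards,
// so it never counts against contextLimit
```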
File: src/renderer/src/aiCore/chunk/AiSdkToChunkAdapter.ts
export default class AiSdkToChunkAdapter {
constructor(
private onChunk: (chunk: Chunk) => void,
private mcpTools?: MCPTool[],
private accumulate: boolean = true,
private enableWebSearch: boolean = false,
) {}
/**
* Process AI SDK streaming result
*/
async processStream(streamResult: StreamTextResult<any>): Promise<string> {
const startTime = Date.now();
let fullText = "";
let firstTokenTime = 0;
try {
// 1. Listen to textStream
for await (const textDelta of streamResult.textStream) {
if (!firstTokenTime) {
firstTokenTime = Date.now();
}
if (this.accumulate) {
fullText += textDelta;
// Send text delta chunk
this.onChunk({
type: ChunkType.TEXT_DELTA,
text: textDelta,
});
} else {
// Don't accumulate, send complete text
this.onChunk({
type: ChunkType.TEXT,
text: textDelta,
});
}
}
// 2. Process tool calls
const toolCalls = streamResult.toolCalls;
if (toolCalls && toolCalls.length > 0) {
for (const toolCall of toolCalls) {
await this.handleToolCall(toolCall);
}
}
// 3. Process reasoning/thinking
const reasoning = streamResult.experimental_providerMetadata?.reasoning;
if (reasoning) {
this.onChunk({
type: ChunkType.REASONING,
content: reasoning,
});
}
// 4. Send completion chunk
const usage = await streamResult.usage;
const finishReason = await streamResult.finishReason;
this.onChunk({
type: ChunkType.BLOCK_COMPLETE,
response: {
usage: {
prompt_tokens: usage.promptTokens,
completion_tokens: usage.completionTokens,
total_tokens: usage.totalTokens,
},
metrics: {
completion_tokens: usage.completionTokens,
time_first_token_millsec: firstTokenTime - startTime,
time_completion_millsec: Date.now() - startTime,
},
finish_reason: finishReason,
},
});
return fullText;
} catch (error) {
this.onChunk({
type: ChunkType.ERROR,
error: error as Error,
});
throw error;
}
}
}
File: packages/aiCore/src/core/providers/types/index.ts
/**
* Extract only string keys from an object type
* Uses Extract for clean type inference
* @example StringKeys<{ foo: 1, 0: 2 }> = 'foo'
*/
export type StringKeys<T> = Extract<keyof T, string>;
// Usage in generic constraints:
export interface RuntimeConfig<
TSettingsMap extends Record<string, any> = CoreProviderSettingsMap,
T extends StringKeys<TSettingsMap> = StringKeys<TSettingsMap>,
> {
providerId: T;
providerSettings: TSettingsMap[T];
}
The appProviderIds constant provides type-safe provider ID resolution with different behavior for aliases vs variants:
// Alias → Base Name (normalization)
appProviderIds["claude"]; // → 'anthropic'
appProviderIds["vertexai"]; // → 'google-vertex'
// Variant → Self (reflexive mapping)
appProviderIds["openai-chat"]; // → 'openai-chat'
appProviderIds["azure-responses"]; // → 'azure-responses'
Design Rationale:
| Type | Semantics | Mapping Behavior |
|---|---|---|
| Alias | Another name for the same thing | Normalize to base name ✓ |
| Variant | Different mode of the same provider | Self-mapping (reflexive) ✓ |
Type Definition:
// Helper type to extract variant IDs
type ExtractVariantIds<TConfig, TName extends string> = TConfig extends {
variants: readonly { suffix: infer TSuffix extends string }[];
}
? `${TName}-${TSuffix}`
: never;
// Map type with conditional self-mapping for variants
export type ExtensionConfigToIdResolutionMap<TConfig> = TConfig extends {
name: infer TName extends string;
}
? {
readonly [K in
| TName
| (TConfig extends { aliases: readonly (infer TAlias extends string)[] }
? TAlias
: never)
| ExtractVariantIds<TConfig, TName>]: K extends ExtractVariantIds<
TConfig,
TName
>
? K // Variant → Self
: TName; // Base name and aliases → TName
}
: never;
File: packages/aiCore/src/core/providers/types/index.ts
/**
* Core Provider Settings Map
* Automatically extracts types from Extensions
*/
export type CoreProviderSettingsMap = UnionToIntersection<
ExtensionToSettingsMap<(typeof coreExtensions)[number]>
>;
/**
* Result type (example):
* {
* openai: OpenAIProviderSettings
* 'openai-chat': OpenAIProviderSettings
* anthropic: AnthropicProviderSettings
* google: GoogleProviderSettings
* ...
* }
*/
// 1. Known provider (type-safe)
const executor = await createExecutor("openai", {
apiKey: "sk-xxx", // ✅ Type inferred as string
baseURL: "https://...", // ✅ Type inferred as string | undefined
// wrongField: 123 // ❌ Compile error: unknown field
});
// 2. Dynamic provider (any)
const executor = await createExecutor("custom-provider", {
anyField: "value", // ✅ any type
});
File: src/renderer/src/services/SpanManagerService.ts
export function addSpan(params: StartSpanParams): Span | null {
const { name, tag, topicId, modelName, inputs } = params;
// 1. Get or create tracer
const tracer = getTracer(topicId);
if (!tracer) return null;
// 2. Create span
const span = tracer.startSpan(name, {
kind: SpanKind.CLIENT,
attributes: {
"llm.tag": tag,
"llm.model": modelName,
"llm.topic_id": topicId,
"llm.input_messages": JSON.stringify(inputs.messages),
"llm.temperature": inputs.temperature,
"llm.max_tokens": inputs.maxTokens,
},
});
// 3. Set span context as active
context.with(trace.setSpan(context.active(), span), () => {
// Subsequent AI SDK calls will automatically inherit this span
});
return span;
}
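A hypothetical caller around the completions call; the `inputs` shape follows StartSpanParams above, and SpanStatusCode comes from @opentelemetry/api. Variable names are assumptions:

```typescript
import { SpanStatusCode } from '@opentelemetry/api'

const span = addSpan({
  name: 'fetchChatCompletion',
  tag: 'chat',
  topicId,
  modelName: assistant.model?.name,
  inputs: { messages: modelMessages, temperature: 0.7, maxTokens: 4096 }
})
try {
  await AI.completions(modelId, aiSdkParams, config)
  span?.setStatus({ code: SpanStatusCode.OK })
} catch (error) {
  span?.recordException(error as Error)
  span?.setStatus({ code: SpanStatusCode.ERROR })
  throw error
} finally {
  span?.end() // child spans created while this span is active nest under it
}
```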
Parent Span: fetchChatCompletion
│
├─ Child Span: prepareMessagesForModel
│ └─ attributes: message_count, filters_applied
│
├─ Child Span: buildStreamTextParams
│ └─ attributes: tools_count, web_search_enabled
│
├─ Child Span: AI.completions (created in _completionsForTrace)
│ │
│ ├─ Child Span: buildPlugins
│ │ └─ attributes: plugin_names
│ │
│ ├─ Child Span: createExecutor
│ │ └─ attributes: provider_id, cache_hit
│ │
│ └─ Child Span: executor.streamText
│ │
│ ├─ Child Span: AI SDK doStream (auto-created)
│ │ └─ attributes: model, temperature, tokens
│ │
│ └─ Child Span: Tool Execution (if tool calls exist)
│ ├─ attributes: tool_name, args
│ └─ attributes: result, latency
│
└─ attributes: total_duration, final_token_count
// 1. Base Error
export class ProviderError extends Error {
constructor(
message: string,
public providerId: string,
public code?: string,
public cause?: Error,
) {
super(message);
this.name = "ProviderError";
}
}
// 2. Provider Creation Error
export class ProviderCreationError extends ProviderError {
constructor(message: string, providerId: string, cause: Error) {
super(message, providerId, "PROVIDER_CREATION_FAILED", cause);
this.name = "ProviderCreationError";
}
}
// 3. Model Resolution Error
export class ModelResolutionError extends ProviderError {
constructor(
message: string,
public modelId: string,
providerId: string,
) {
super(message, providerId, "MODEL_RESOLUTION_FAILED");
this.name = "ModelResolutionError";
}
}
// 4. API Error
export class ApiError extends ProviderError {
constructor(
message: string,
providerId: string,
public statusCode?: number,
public response?: any,
) {
super(message, providerId, "API_REQUEST_FAILED");
this.name = "ApiError";
}
}
Advantages:
- A single ProviderError base lets callers catch one type and branch on instanceof or code
- providerId and code travel with every error, keeping failures machine-distinguishable across providers
- cause preserves the original underlying error for debugging
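A sketch of discriminating these errors at a call site:

```typescript
try {
  await executor.streamText({ model: modelId, messages })
} catch (error) {
  if (error instanceof ModelResolutionError) {
    console.error(`Model ${error.modelId} not available on ${error.providerId}`)
  } else if (error instanceof ApiError && error.statusCode === 429) {
    // rate limited; a caller could rotate to the next API key here
  } else if (error instanceof ProviderError) {
    console.error(`[${error.providerId}] ${error.code}: ${error.message}`, error.cause)
  } else {
    throw error // not a provider-layer error
  }
}
```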
Performance Metrics:
Cache Hit: <1ms (direct Map retrieval)
Cache Miss: ~50ms (create new AI SDK provider)
// ❌ Sequential execution (slow)
const mcpTools = await fetchMcpTools(assistant)
const params = await buildStreamTextParams(...)
const plugins = buildPlugins(config)
// ✅ Parallel execution (fast)
const [mcpTools, params, plugins] = await Promise.all([
fetchMcpTools(assistant),
buildStreamTextParams(...),
Promise.resolve(buildPlugins(config))
])
// 1. Use textStream instead of fullStream
for await (const textDelta of streamResult.textStream) {
onChunk({ type: ChunkType.TEXT_DELTA, text: textDelta });
}
// 2. Batch send chunks (reduce IPC overhead)
const chunkBuffer: Chunk[] = [];
for await (const textDelta of streamResult.textStream) {
  chunkBuffer.push({ type: ChunkType.TEXT_DELTA, text: textDelta });
  if (chunkBuffer.length >= 10) {
    // splice(0) hands over a copy, so clearing the buffer can't mutate the sent batch
    onChunk({ type: ChunkType.BATCH, chunks: chunkBuffer.splice(0) });
  }
}
// Flush whatever is left in the buffer once the stream ends
if (chunkBuffer.length > 0) {
  onChunk({ type: ChunkType.BATCH, chunks: chunkBuffer });
}
@cherrystudio/ai-core provides a complete set of testing utilities:
// packages/aiCore/test_utils/helpers/model.ts
// Create complete mock provider (methods are vi.fn() spies)
export function createMockProviderV3(overrides?: {
provider?: string;
languageModel?: (modelId: string) => LanguageModelV3;
imageModel?: (modelId: string) => ImageModelV3;
embeddingModel?: (modelId: string) => EmbeddingModelV3;
}): ProviderV3;
// Create mock language model (with complete doGenerate/doStream implementation)
export function createMockLanguageModel(
overrides?: Partial<LanguageModelV3>,
): LanguageModelV3;
// Create mock image model
export function createMockImageModel(
overrides?: Partial<ImageModelV3>,
): ImageModelV3;
// Create mock embedding model
export function createMockEmbeddingModel(
overrides?: Partial<EmbeddingModelV3>,
): EmbeddingModelV3;
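A sketch of how the helpers compose in a Vitest test; the import path and the wiring into an extension or executor are assumptions that vary by test:

```typescript
import { describe, expect, it } from 'vitest'
import { createMockLanguageModel, createMockProviderV3 } from '../test_utils/helpers/model'

describe('createMockProviderV3', () => {
  it('exposes spy-able model factories', () => {
    const provider = createMockProviderV3({
      languageModel: () => createMockLanguageModel()
    })
    const model = provider.languageModel('gpt-4')
    // the helper ships complete doGenerate/doStream implementations
    expect(model.doStream).toBeDefined()
  })
})
```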
Key integration tests cover the following scenarios:
// packages/aiCore/src/core/providers/__tests__/ExtensionRegistry.test.ts
describe("ExtensionRegistry", () => {
describe("Provider Creation", () => {
it("should create providers through registered extensions");
it("should resolve aliases to base provider");
it("should resolve variants with correct suffix");
it("should leverage LRU cache for identical settings");
});
describe("Error Handling", () => {
it("should throw error for unregistered provider");
it("should handle concurrent creation requests");
});
});
// packages/aiCore/src/core/providers/__tests__/ProviderExtension.test.ts
describe("ProviderExtension", () => {
describe("LRU Cache", () => {
it("should cache provider instances by settings hash");
it("should create new instances for different settings");
it("should deduplicate concurrent creation of same settings");
});
describe("Variants", () => {
it("should create variant providers with transform");
it("should cache variants independently");
});
});
Current test coverage:
- src/renderer/src/services/ApiService.ts - Main API service
- src/renderer/src/services/ConversationService.ts - Message preparation
- src/renderer/src/services/SpanManagerService.ts - Trace management
- src/renderer/src/aiCore/index_new.ts - AiProvider
- src/renderer/src/aiCore/provider/providerConfig.ts - Provider configuration
- src/renderer/src/aiCore/chunk/AiSdkToChunkAdapter.ts - Stream adaptation
- src/renderer/src/aiCore/plugins/PluginBuilder.ts - Plugin building
- packages/aiCore/src/core/runtime/executor.ts - RuntimeExecutor
- packages/aiCore/src/core/runtime/index.ts - createExecutor
- packages/aiCore/src/core/providers/core/ProviderExtension.ts - Extension base class
- packages/aiCore/src/core/providers/core/ExtensionRegistry.ts - Registry
- packages/aiCore/src/core/providers/core/initialization.ts - Core provider registrations
- packages/aiCore/src/core/plugins/PluginEngine.ts - Plugin engine
- src/renderer/src/aiCore/provider/extensions/index.ts - App-level provider extensions
- src/renderer/src/aiCore/types/merged.ts - Merged types (core + app extensions)
- packages/aiCore/test_utils/helpers/model.ts - Mock model creation utilities
- packages/aiCore/test_utils/helpers/provider.ts - Provider test helpers
- packages/aiCore/test_utils/mocks/providers.ts - Mock Provider instances

Q: Why cache provider instances in an LRU cache?
A: To avoid recreating providers with the same configuration, while automatically bounding memory (max 10 instances per extension).
Q: When is the legacy provider still used?
A: Only for image generation endpoints when not using gateway, as it requires advanced features like image editing.
Q: How do I add a new provider?
A:
- Add an extension under packages/aiCore/src/core/providers/extensions/
- Register it in the coreExtensions array
- Add its config mapping in providerConfig.ts

Document Version: v4.0
Last Updated: 2026-03-20
Maintainer: Cherry Studio Team