Abliteration

Abliteration is a third-party service that hosts "abliterated" models - open-weight LLMs where the refusal direction has been removed from the residual stream so the model no longer declines requests it would ordinarily refuse. It exposes an OpenAI-compatible chat completions API, and Promptfoo ships a thin abliteration: wrapper around the OpenAI provider for it.

:::warning Safety

Abliterated models intentionally bypass the safety training of their base models. They are primarily useful for red-teaming, jailbreak evaluation, and safety research - not for production traffic. You are responsible for how outputs are used and for complying with the model licenses and laws that apply in your jurisdiction.

:::

Setup

Obtain an API key from Abliteration.
Set the ABLIT_KEY environment variable, or pass apiKey in your provider config.

Environment Variables

Variable	Description
`ABLIT_KEY`	API key sent as the bearer token. Required unless `apiKey` is set in the provider config.
`ABLIT_API_BASE_URL`	Override for the chat-completions base URL. Defaults to `https://api.abliteration.ai/v1`.

Provider config values take precedence over environment variables.

Basic Configuration

yaml

# yaml-language-server: $schema=https://promptfoo.dev/config-schema.json
providers:
  - id: abliteration:abliterated-model
    config:
      temperature: 0.2
      max_tokens: 512

The examples use Abliteration's abliterated-model. Replace it if your account should target a different model. abliteration:<model> is the default syntax; abliteration:chat:<model> is also supported.

OpenAI Compatibility

Abliteration speaks the OpenAI chat-completions protocol, so most options from the OpenAI provider work here too, including sampling options, structured output, and multimodal messages.

Abliteration responses can include reasoning_content. Promptfoo hides this thinking output by default for this provider. Set showThinking: true in the provider config if you want it included in eval outputs.

Multimodal Example

json

[
  {
    "role": "user",
    "content": [
      { "type": "text", "text": "{{question}}" },
      {
        "type": "image_url",
        "image_url": { "url": "https://abliteration.ai/stonehenge.jpg" }
      }
    ]
  }
]