Pyroscope doesn't include an authentication layer. Operators should use an authenticating reverse proxy for security.
In multi-tenant mode, Pyroscope requires the X-Scope-OrgID HTTP header set to a string identifying the tenant. The authenticating reverse proxy handles this responsibility. For more information, refer to the multi-tenancy documentation.
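For clients that talk to a multi-tenant Pyroscope directly (for example, during local testing), the header can be set per request. A minimal Python sketch, assuming a hypothetical tenant ID team-a; the request is prepared but not sent:

```python
import requests

# Hypothetical tenant ID; in production an authenticating
# reverse proxy would normally inject this header.
tenant_headers = {
    "X-Scope-OrgID": "team-a",
    "Content-Type": "application/json",
}

# Prepare the request without sending it, to inspect the final headers.
req = requests.Request(
    "POST",
    "http://localhost:4040/querier.v1.QuerierService/ProfileTypes",
    headers=tenant_headers,
    json={},
).prepare()
print(req.headers["X-Scope-OrgID"])
```

Calling requests.Session().send(req) would then submit it against a running server.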
The Pyroscope Connect API uses the Connect protocol, which provides a unified approach to building APIs that work across multiple protocols and formats.
The API definitions are available in the api/ directory of the Pyroscope repository, with protobuf schemas organized by service.
Pyroscope APIs are categorized into two scopes:
- Public (scope/public): These APIs are considered stable. Breaking changes will be communicated in advance and include migration paths.
- Internal (scope/internal): These APIs are used for internal communication between Pyroscope components. They may change without notice and should not be used by external clients.

### /push.v1.PusherService/Push

A request body with the following fields is required:
| Field | Description | Example |
|---|---|---|
series[].labels[].name | Label name | service_name |
series[].labels[].value | Label value | my_service |
series[].samples[].ID | UUID of the profile | 734FD599-6865-419E-9475-932762D8F469 |
series[].samples[].rawProfile | Raw bytes of the pprof profile (base64-encoded in JSON requests) | PROFILE_BASE64 |
{{< code >}}
curl \
-H "Content-Type: application/json" \
-d '{
"series": [
{
"labels": [
{
"name": "__name__",
"value": "process_cpu"
},
{
"name": "service_name",
"value": "my_service"
}
],
"samples": [
{
"ID": "734FD599-6865-419E-9475-932762D8F469",
"rawProfile": "'$(cat cpu.pb.gz| base64 -w 0)'"
}
]
}
]
}' \
http://localhost:4040/push.v1.PusherService/Push
import requests
import base64
body = {
"series": [
{
"labels": [
{
"name": "__name__",
"value": "process_cpu"
},
{
"name": "service_name",
"value": "my_service"
}
],
"samples": [
{
"ID": "734FD599-6865-419E-9475-932762D8F469",
"rawProfile": base64.b64encode(open('cpu.pb.gz', 'rb').read()).decode('ascii')
}
]
}
]
}
url = 'http://localhost:4040/push.v1.PusherService/Push'
resp = requests.post(url, json=body)
print(resp)
print(resp.content)
{{< /code >}}
### /querier.v1.QuerierService/Diff

Diff returns a diff of two profiles.
A request body with the following fields is required:
| Field | Description | Example |
|---|---|---|
left.start | Milliseconds since epoch. | 1676282400000 |
left.end | Milliseconds since epoch. | 1676289600000 |
left.format | Possible values: PROFILE_FORMAT_UNSPECIFIED, PROFILE_FORMAT_FLAMEGRAPH, PROFILE_FORMAT_TREE, PROFILE_FORMAT_DOT | |
left.labelSelector | Label selector string | {namespace="my-namespace"} |
left.maxNodes | Limit the number of nodes returned, keeping the nodes with the biggest totals | |
left.profileIdSelector | List of Profile UUIDs to query | ["7c9e6679-7425-40de-944b-e07fc1f90ae7"] |
left.profileTypeID | Profile Type ID string in the form <name>:<type>:<unit>:<period_type>:<period_unit>. | process_cpu:cpu:nanoseconds:cpu:nanoseconds |
left.stackTraceSelector.callSite[].name | ||
left.stackTraceSelector.goPgo.aggregateCallees | Aggregate callees causes the leaf location line number to be ignored, thus aggregating all callee samples (but not callers). | |
left.stackTraceSelector.goPgo.keepLocations | Specifies the number of leaf locations to keep. | |
right.start | Milliseconds since epoch. | 1676282400000 |
right.end | Milliseconds since epoch. | 1676289600000 |
right.format | Possible values: PROFILE_FORMAT_UNSPECIFIED, PROFILE_FORMAT_FLAMEGRAPH, PROFILE_FORMAT_TREE, PROFILE_FORMAT_DOT | |
right.labelSelector | Label selector string | {namespace="my-namespace"} |
right.maxNodes | Limit the number of nodes returned, keeping the nodes with the biggest totals | |
right.profileIdSelector | List of Profile UUIDs to query | ["7c9e6679-7425-40de-944b-e07fc1f90ae7"] |
right.profileTypeID | Profile Type ID string in the form <name>:<type>:<unit>:<period_type>:<period_unit>. | process_cpu:cpu:nanoseconds:cpu:nanoseconds |
right.stackTraceSelector.callSite[].name | ||
right.stackTraceSelector.goPgo.aggregateCallees | Aggregate callees causes the leaf location line number to be ignored, thus aggregating all callee samples (but not callers). | |
right.stackTraceSelector.goPgo.keepLocations | Specifies the number of leaf locations to keep. |
{{< code >}}
curl \
-H "Content-Type: application/json" \
-d '{
"left": {
"end": '$(date +%s)000',
"labelSelector": "{namespace=\"my-namespace\"}",
"profileIdSelector": [
"7c9e6679-7425-40de-944b-e07fc1f90ae7"
],
"profileTypeID": "process_cpu:cpu:nanoseconds:cpu:nanoseconds",
"start": '$(expr $(date +%s) - 3600 )000'
},
"right": {
"end": '$(date +%s)000',
"labelSelector": "{namespace=\"my-namespace\"}",
"profileIdSelector": [
"7c9e6679-7425-40de-944b-e07fc1f90ae7"
],
"profileTypeID": "process_cpu:cpu:nanoseconds:cpu:nanoseconds",
"start": '$(expr $(date +%s) - 3600 )000'
}
}' \
http://localhost:4040/querier.v1.QuerierService/Diff
import requests
import datetime
body = {
"left": {
"end": int(datetime.datetime.now().timestamp() * 1000),
"labelSelector": "{namespace=\"my-namespace\"}",
"profileIdSelector": [
"7c9e6679-7425-40de-944b-e07fc1f90ae7"
],
"profileTypeID": "process_cpu:cpu:nanoseconds:cpu:nanoseconds",
"start": int((datetime.datetime.now()- datetime.timedelta(hours = 1)).timestamp() * 1000)
},
"right": {
"end": int(datetime.datetime.now().timestamp() * 1000),
"labelSelector": "{namespace=\"my-namespace\"}",
"profileIdSelector": [
"7c9e6679-7425-40de-944b-e07fc1f90ae7"
],
"profileTypeID": "process_cpu:cpu:nanoseconds:cpu:nanoseconds",
"start": int((datetime.datetime.now()- datetime.timedelta(hours = 1)).timestamp() * 1000)
}
}
url = 'http://localhost:4040/querier.v1.QuerierService/Diff'
resp = requests.post(url, json=body)
print(resp)
print(resp.content)
{{< /code >}}
### /querier.v1.QuerierService/LabelNames

LabelNames returns a list of the existing label names.
A request body with the following fields is required:
| Field | Description | Example |
|---|---|---|
start | Query from this point in time, given in Milliseconds since epoch. | 1676282400000 |
end | Query to this point in time, given in Milliseconds since epoch. | 1676289600000 |
matchers | List of label selectors | |
{{< code >}}
curl \
-H "Content-Type: application/json" \
-d '{
"end": '$(date +%s)000',
"start": '$(expr $(date +%s) - 3600 )000'
}' \
http://localhost:4040/querier.v1.QuerierService/LabelNames
import requests
import datetime
body = {
"end": int(datetime.datetime.now().timestamp() * 1000),
"start": int((datetime.datetime.now()- datetime.timedelta(hours = 1)).timestamp() * 1000)
}
url = 'http://localhost:4040/querier.v1.QuerierService/LabelNames'
resp = requests.post(url, json=body)
print(resp)
print(resp.content)
{{< /code >}}
### /querier.v1.QuerierService/LabelValues

LabelValues returns the existing label values for the provided label names.
A request body with the following fields is required:
| Field | Description | Example |
|---|---|---|
start | Query from this point in time, given in Milliseconds since epoch. | 1676282400000 |
end | Query to this point in time, given in Milliseconds since epoch. | 1676289600000 |
matchers | List of Label selectors | |
name | Name of the label | service_name |
{{< code >}}
curl \
-H "Content-Type: application/json" \
-d '{
"end": '$(date +%s)000',
"name": "service_name",
"start": '$(expr $(date +%s) - 3600 )000'
}' \
http://localhost:4040/querier.v1.QuerierService/LabelValues
import requests
import datetime
body = {
"end": int(datetime.datetime.now().timestamp() * 1000),
"name": "service_name",
"start": int((datetime.datetime.now()- datetime.timedelta(hours = 1)).timestamp() * 1000)
}
url = 'http://localhost:4040/querier.v1.QuerierService/LabelValues'
resp = requests.post(url, json=body)
print(resp)
print(resp.content)
{{< /code >}}
### /querier.v1.QuerierService/ProfileTypes

ProfileTypes returns a list of the existing profile types.
A request body with the following fields is required:
| Field | Description | Example |
|---|---|---|
start | Milliseconds since epoch. If missing or zero, only the ingesters will be queried. | 1676282400000 |
end | Milliseconds since epoch. If missing or zero, only the ingesters will be queried. | 1676289600000 |
{{< code >}}
curl \
-H "Content-Type: application/json" \
-d '{
"end": '$(date +%s)000',
"start": '$(expr $(date +%s) - 3600 )000'
}' \
http://localhost:4040/querier.v1.QuerierService/ProfileTypes
import requests
import datetime
body = {
"end": int(datetime.datetime.now().timestamp() * 1000),
"start": int((datetime.datetime.now()- datetime.timedelta(hours = 1)).timestamp() * 1000)
}
url = 'http://localhost:4040/querier.v1.QuerierService/ProfileTypes'
resp = requests.post(url, json=body)
print(resp)
print(resp.content)
{{< /code >}}
### /querier.v1.QuerierService/SelectHeatmap

SelectHeatmap returns a heatmap visualization for the requested profiles. Note: This endpoint is only available in the v2 storage layer.
A request body with the following fields is required:
| Field | Description | Example |
|---|---|---|
start | Milliseconds since epoch. | 1676282400000 |
end | Milliseconds since epoch. | 1676289600000 |
exemplarType | Type of exemplars to include in the response. Needs to match queryType or be NONE. Possible values: EXEMPLAR_TYPE_UNSPECIFIED, EXEMPLAR_TYPE_NONE, EXEMPLAR_TYPE_INDIVIDUAL, EXEMPLAR_TYPE_SPAN | EXEMPLAR_TYPE_SPAN |
groupBy | Group by labels | ["pod"] |
labelSelector | Label selector string | {namespace="my-namespace"} |
limit | Select the top N series by total value. | |
profileTypeID | Profile Type ID string in the form <name>:<type>:<unit>:<period_type>:<period_unit>. | process_cpu:cpu:nanoseconds:cpu:nanoseconds |
queryType | Query type: individual profiles or span profiles. Possible values: HEATMAP_QUERY_TYPE_UNSPECIFIED, HEATMAP_QUERY_TYPE_INDIVIDUAL, HEATMAP_QUERY_TYPE_SPAN | HEATMAP_QUERY_TYPE_SPAN |
step | Query resolution step width in seconds |
{{< code >}}
curl \
-H "Content-Type: application/json" \
-d '{
"end": '$(date +%s)000',
"exemplarType": "EXEMPLAR_TYPE_SPAN",
"groupBy": [
"pod"
],
"labelSelector": "{namespace=\"my-namespace\"}",
"profileTypeID": "process_cpu:cpu:nanoseconds:cpu:nanoseconds",
"queryType": "HEATMAP_QUERY_TYPE_SPAN",
"start": '$(expr $(date +%s) - 3600 )000'
}' \
http://localhost:4040/querier.v1.QuerierService/SelectHeatmap
import requests
import datetime
body = {
"end": int(datetime.datetime.now().timestamp() * 1000),
"exemplarType": "EXEMPLAR_TYPE_SPAN",
"groupBy": [
"pod"
],
"labelSelector": "{namespace=\"my-namespace\"}",
"profileTypeID": "process_cpu:cpu:nanoseconds:cpu:nanoseconds",
"queryType": "HEATMAP_QUERY_TYPE_SPAN",
"start": int((datetime.datetime.now()- datetime.timedelta(hours = 1)).timestamp() * 1000)
}
url = 'http://localhost:4040/querier.v1.QuerierService/SelectHeatmap'
resp = requests.post(url, json=body)
print(resp)
print(resp.content)
{{< /code >}}
### /querier.v1.QuerierService/SelectMergeProfile

SelectMergeProfile returns matching profiles aggregated in pprof format. It contains all stored information, including filenames and line numbers, if ingested.
A request body with the following fields is required:
| Field | Description | Example |
|---|---|---|
start | Milliseconds since epoch. | 1676282400000 |
end | Milliseconds since epoch. | 1676289600000 |
labelSelector | Label selector string | {namespace="my-namespace"} |
maxNodes | Limit the number of nodes returned, keeping the nodes with the biggest totals | |
profileIdSelector | List of Profile UUIDs to query | ["7c9e6679-7425-40de-944b-e07fc1f90ae7"] |
profileTypeID | Profile Type ID string in the form <name>:<type>:<unit>:<period_type>:<period_unit>. | process_cpu:cpu:nanoseconds:cpu:nanoseconds |
stackTraceSelector.callSite[].name | ||
stackTraceSelector.goPgo.aggregateCallees | Aggregate callees causes the leaf location line number to be ignored, thus aggregating all callee samples (but not callers). | |
stackTraceSelector.goPgo.keepLocations | Specifies the number of leaf locations to keep. |
{{< code >}}
curl \
-H "Content-Type: application/json" \
-d '{
"end": '$(date +%s)000',
"labelSelector": "{namespace=\"my-namespace\"}",
"profileIdSelector": [
"7c9e6679-7425-40de-944b-e07fc1f90ae7"
],
"profileTypeID": "process_cpu:cpu:nanoseconds:cpu:nanoseconds",
"start": '$(expr $(date +%s) - 3600 )000'
}' \
http://localhost:4040/querier.v1.QuerierService/SelectMergeProfile
import requests
import datetime
body = {
"end": int(datetime.datetime.now().timestamp() * 1000),
"labelSelector": "{namespace=\"my-namespace\"}",
"profileIdSelector": [
"7c9e6679-7425-40de-944b-e07fc1f90ae7"
],
"profileTypeID": "process_cpu:cpu:nanoseconds:cpu:nanoseconds",
"start": int((datetime.datetime.now()- datetime.timedelta(hours = 1)).timestamp() * 1000)
}
url = 'http://localhost:4040/querier.v1.QuerierService/SelectMergeProfile'
resp = requests.post(url, json=body)
print(resp)
print(resp.content)
{{< /code >}}
### /querier.v1.QuerierService/SelectMergeSpanProfile

SelectMergeSpanProfile returns matching profiles aggregated in a flamegraph format. It will combine samples from within the same callstack, with each element being grouped by its function name.
A request body with the following fields is required:
| Field | Description | Example |
|---|---|---|
start | Milliseconds since epoch. | 1676282400000 |
end | Milliseconds since epoch. | 1676289600000 |
format | Specifies the format of the profile to be returned. If not specified, the profile is returned in flame graph format. Possible values: PROFILE_FORMAT_UNSPECIFIED, PROFILE_FORMAT_FLAMEGRAPH, PROFILE_FORMAT_TREE, PROFILE_FORMAT_DOT | |
labelSelector | Label selector string | {namespace="my-namespace"} |
maxNodes | Limit the number of nodes returned, keeping the nodes with the biggest totals | |
profileTypeID | Profile Type ID string in the form <name>:<type>:<unit>:<period_type>:<period_unit>. | process_cpu:cpu:nanoseconds:cpu:nanoseconds |
spanSelector | List of Span IDs to query | ["9a517183f26a089d","5a4fe264a9c987fe"] |
{{< code >}}
curl \
-H "Content-Type: application/json" \
-d '{
"end": '$(date +%s)000',
"labelSelector": "{namespace=\"my-namespace\"}",
"profileTypeID": "process_cpu:cpu:nanoseconds:cpu:nanoseconds",
"spanSelector": [
"9a517183f26a089d",
"5a4fe264a9c987fe"
],
"start": '$(expr $(date +%s) - 3600 )000'
}' \
http://localhost:4040/querier.v1.QuerierService/SelectMergeSpanProfile
import requests
import datetime
body = {
"end": int(datetime.datetime.now().timestamp() * 1000),
"labelSelector": "{namespace=\"my-namespace\"}",
"profileTypeID": "process_cpu:cpu:nanoseconds:cpu:nanoseconds",
"spanSelector": [
"9a517183f26a089d",
"5a4fe264a9c987fe"
],
"start": int((datetime.datetime.now()- datetime.timedelta(hours = 1)).timestamp() * 1000)
}
url = 'http://localhost:4040/querier.v1.QuerierService/SelectMergeSpanProfile'
resp = requests.post(url, json=body)
print(resp)
print(resp.content)
{{< /code >}}
### /querier.v1.QuerierService/SelectMergeStacktraces

SelectMergeStacktraces returns matching profiles aggregated in a flamegraph format. It will combine samples from within the same callstack, with each element being grouped by its function name.
A request body with the following fields is required:
| Field | Description | Example |
|---|---|---|
start | Milliseconds since epoch. | 1676282400000 |
end | Milliseconds since epoch. | 1676289600000 |
format | Specifies the format of the profile to be returned. If not specified, the profile is returned in flame graph format. Possible values: PROFILE_FORMAT_UNSPECIFIED, PROFILE_FORMAT_FLAMEGRAPH, PROFILE_FORMAT_TREE, PROFILE_FORMAT_DOT | |
labelSelector | Label selector string | {namespace="my-namespace"} |
maxNodes | Limit the number of nodes returned, keeping the nodes with the biggest totals | |
profileIdSelector | List of Profile UUIDs to query | ["7c9e6679-7425-40de-944b-e07fc1f90ae7"] |
profileTypeID | Profile Type ID string in the form <name>:<type>:<unit>:<period_type>:<period_unit>. | process_cpu:cpu:nanoseconds:cpu:nanoseconds |
stackTraceSelector.callSite[].name | ||
stackTraceSelector.goPgo.aggregateCallees | Aggregate callees causes the leaf location line number to be ignored, thus aggregating all callee samples (but not callers). | |
stackTraceSelector.goPgo.keepLocations | Specifies the number of leaf locations to keep. |
{{< code >}}
curl \
-H "Content-Type: application/json" \
-d '{
"end": '$(date +%s)000',
"labelSelector": "{namespace=\"my-namespace\"}",
"profileIdSelector": [
"7c9e6679-7425-40de-944b-e07fc1f90ae7"
],
"profileTypeID": "process_cpu:cpu:nanoseconds:cpu:nanoseconds",
"start": '$(expr $(date +%s) - 3600 )000'
}' \
http://localhost:4040/querier.v1.QuerierService/SelectMergeStacktraces
import requests
import datetime
body = {
"end": int(datetime.datetime.now().timestamp() * 1000),
"labelSelector": "{namespace=\"my-namespace\"}",
"profileIdSelector": [
"7c9e6679-7425-40de-944b-e07fc1f90ae7"
],
"profileTypeID": "process_cpu:cpu:nanoseconds:cpu:nanoseconds",
"start": int((datetime.datetime.now()- datetime.timedelta(hours = 1)).timestamp() * 1000)
}
url = 'http://localhost:4040/querier.v1.QuerierService/SelectMergeStacktraces'
resp = requests.post(url, json=body)
print(resp)
print(resp.content)
{{< /code >}}
### /querier.v1.QuerierService/SelectSeries

SelectSeries returns a time series for the total sum of the requested profiles.
A request body with the following fields is required:
| Field | Description | Example |
|---|---|---|
start | Milliseconds since epoch. | 1676282400000 |
end | Milliseconds since epoch. | 1676289600000 |
aggregation | Aggregation type for the time series. Possible values: TIME_SERIES_AGGREGATION_TYPE_SUM, TIME_SERIES_AGGREGATION_TYPE_AVERAGE | |
exemplarType | Type of exemplars to include in the response. Possible values: EXEMPLAR_TYPE_UNSPECIFIED, EXEMPLAR_TYPE_NONE, EXEMPLAR_TYPE_INDIVIDUAL, EXEMPLAR_TYPE_SPAN | |
groupBy | Group by labels | ["pod"] |
labelSelector | Label selector string | {namespace="my-namespace"} |
limit | Select the top N series by total value. | |
profileTypeID | Profile Type ID string in the form <name>:<type>:<unit>:<period_type>:<period_unit>. | process_cpu:cpu:nanoseconds:cpu:nanoseconds |
stackTraceSelector.callSite[].name | ||
stackTraceSelector.goPgo.aggregateCallees | Aggregate callees causes the leaf location line number to be ignored, thus aggregating all callee samples (but not callers). | |
stackTraceSelector.goPgo.keepLocations | Specifies the number of leaf locations to keep. | |
step | Query resolution step width in seconds | |
{{< code >}}
curl \
-H "Content-Type: application/json" \
-d '{
"end": '$(date +%s)000',
"groupBy": [
"pod"
],
"labelSelector": "{namespace=\"my-namespace\"}",
"profileTypeID": "process_cpu:cpu:nanoseconds:cpu:nanoseconds",
"start": '$(expr $(date +%s) - 3600 )000'
}' \
http://localhost:4040/querier.v1.QuerierService/SelectSeries
import requests
import datetime
body = {
"end": int(datetime.datetime.now().timestamp() * 1000),
"groupBy": [
"pod"
],
"labelSelector": "{namespace=\"my-namespace\"}",
"profileTypeID": "process_cpu:cpu:nanoseconds:cpu:nanoseconds",
"start": int((datetime.datetime.now()- datetime.timedelta(hours = 1)).timestamp() * 1000)
}
url = 'http://localhost:4040/querier.v1.QuerierService/SelectSeries'
resp = requests.post(url, json=body)
print(resp)
print(resp.content)
{{< /code >}}
### /querier.v1.QuerierService/Series

Series returns profile series matching the request. A series is a unique label set.
A request body with the following fields is required:
| Field | Description | Example |
|---|---|---|
start | Milliseconds since epoch. If missing or zero, only the ingesters will be queried. | 1676282400000 |
end | Milliseconds since epoch. If missing or zero, only the ingesters will be queried. | 1676289600000 |
labelNames | List of label_names to request. If empty will return all label names in the result. | |
matchers | List of label selector to apply to the result. | ["{namespace=\"my-namespace\"}"] |
{{< code >}}
curl \
-H "Content-Type: application/json" \
-d '{
"end": '$(date +%s)000',
"matchers": [
"{namespace=\"my-namespace\"}"
],
"start": '$(expr $(date +%s) - 3600 )000'
}' \
http://localhost:4040/querier.v1.QuerierService/Series
import requests
import datetime
body = {
"end": int(datetime.datetime.now().timestamp() * 1000),
"matchers": [
"{namespace=\"my-namespace\"}"
],
"start": int((datetime.datetime.now()- datetime.timedelta(hours = 1)).timestamp() * 1000)
}
url = 'http://localhost:4040/querier.v1.QuerierService/Series'
resp = requests.post(url, json=body)
print(resp)
print(resp.content)
{{< /code >}}
Grafana Pyroscope exposes an HTTP API for querying profiling data and ingesting profiling data from other sources.
There is one primary endpoint: POST /ingest.
It accepts profile data in the request body and metadata as query parameters.
The following query parameters are accepted:
| Name | Description | Notes |
|---|---|---|
name | application name | required |
from | UNIX time of when the profiling started | required |
until | UNIX time of when the profiling stopped | required |
format | format of the profiling data | optional (default is folded) |
sampleRate | sample rate used in Hz | optional (default is 100 Hz) |
spyName | name of the spy used | optional |
units | name of the profiling data unit | optional (default is samples) |
aggregationType | type of aggregation to merge profiles | optional (default is sum) |
name specifies application name. For example:
my.awesome.app.cpu{env=staging,region=us-west-1}
The request body contains profiling data, and the Content-Type header may be used alongside format to determine the data format.
Some of the query parameters depend on the format of profiling data. Pyroscope currently supports three major ingestion formats.
These formats handle simple ingestion of profiling data, such as cpu samples, and typically don't support metadata (for example, labels) within the format.
All necessary metadata is derived from query parameters, and the format is specified by the format query parameter.
Supported formats:
- folded (also known as collapsed): this is the default format. Each line contains a stacktrace followed by the sample count for that stacktrace. For example:

  foo;bar 100
  foo;baz 200

- lines: similar to folded, but it represents each sample as a separate line rather than aggregating samples per stacktrace. For example:

  foo;bar
  foo;bar
  foo;baz
  foo;bar
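A folded payload like the one above can be assembled from aggregated stack counts; a minimal sketch (the stack names are illustrative):

```python
# Aggregated sample counts keyed by semicolon-separated stacktrace.
stacks = {"foo;bar": 100, "foo;baz": 200}

# One line per stacktrace: "<stack> <count>".
payload = "\n".join(f"{stack} {count}" for stack, count in stacks.items())
print(payload)
```

The resulting string is what you would send as the request body to /ingest with the default format.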
### pprof format

The pprof format is a widely used binary profiling data format, particularly prevalent in the Go ecosystem.
When using this format, certain query parameters have specific behaviors:
- format should be set to pprof.
- name contains the prefix of the application name. Since a single request may contain multiple profile types, the final application name is created by concatenating this prefix and the profile type. For example, if you send cpu profiling data and set name to my-app{}, it is displayed in Pyroscope as my-app.cpu{}.

The Pyroscope server inherently supports standard Go profile types such as cpu, inuse_objects, inuse_space, alloc_objects, and alloc_space. When dealing with software that generates data in pprof format, you may need to supply a custom sample type configuration for Pyroscope to interpret the data correctly.
For an example of ingesting a pprof file with a custom sample type configuration, see this Python script.
To ingest pprof data with custom sample type configuration, modify your requests as follows:
- Set the Content-Type header to multipart/form-data.
- Upload the profile in a form file field called profile.
- Upload the sample type configuration in a form file field called sample_type_config.

A sample type configuration is a JSON object formatted like this:
{
"inuse_space": {
"units": "bytes",
"aggregation": "average",
"display-name": "inuse_space_bytes",
"sampled": false
},
"alloc_objects": {
"units": "objects",
"aggregation": "sum",
"display-name": "alloc_objects_count",
"sampled": true
},
"cpu": {
"units": "samples",
"aggregation": "sum",
"display-name": "cpu_samples",
"sampled": true
},
// pprof supports multiple profiles types in one file,
// so there can be multiple of these objects
}
Explanation of sample type configuration fields:
- units: Possible values: samples, objects, bytes. samples = CPU samples, objects = objects in RAM, bytes = bytes in RAM.
- display-name: Name under which the profile type is displayed; for example, inuse_space_bytes results in my-app.inuse_space_bytes.
- aggregation: Possible values: sum, average. Use sum for data to be summed over time (e.g., CPU samples, memory allocations), and average for data to be averaged over time (e.g., memory in-use objects).
- sampled: Possible values: true, false. Use true for sampled events (e.g., CPU samples), and false for memory profiles.

This configuration allows for customized visualization and analysis of various profile types within Pyroscope.
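To illustrate the multipart layout, the following sketch assembles (but does not send) the two form file fields; the file names and config values are assumptions for the example:

```python
import json

# Hypothetical custom sample type configuration (see the fields above).
sample_type_config = {
    "inuse_space": {
        "units": "bytes",
        "aggregation": "average",
        "display-name": "inuse_space_bytes",
        "sampled": False,
    }
}

# Form file fields expected by /ingest: the raw pprof payload plus the config.
files = {
    "profile": ("profile.pb.gz", b"<raw pprof bytes>"),
    "sample_type_config": ("config.json", json.dumps(sample_type_config)),
}
# requests.post("http://localhost:4040/ingest?name=my-app", files=files)
# would perform the upload against a running server.
print(sorted(files))
```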
### JFR format

This is the Java Flight Recorder format, typically used by JVM-based profilers, also supported by our Java integration.
When this format is used, some of the query parameters behave slightly differently:

- format should be set to jfr.
- name contains the prefix of the application name. Since a single request may contain multiple profile types, the final application name is created by concatenating this prefix and the profile type. For example, if you send cpu profiling data and set name to my-app{}, it will appear in Pyroscope as my-app.cpu{}.
- units is ignored, and the actual units depend on the profile types available in the data.
- aggregationType is ignored, and the actual aggregation type depends on the profile types available in the data.

JFR ingestion support uses the profile metadata to determine which profile types are included, which depend on the kind of profiling being done. Currently supported profile types include:

- cpu samples, which include only profiling data from runnable threads.
- itimer samples, similar to cpu profiling.
- wall samples, which include samples from any threads independently of their state.
- alloc_in_new_tlab_objects, which indicates the number of new TLAB objects created.
- alloc_in_new_tlab_bytes, which indicates the size in bytes of new TLAB objects created.
- alloc_outside_tlab_objects, which indicates the number of newly allocated objects outside any TLAB.
- alloc_outside_tlab_bytes, which indicates the size in bytes of newly allocated objects outside any TLAB.

In order to ingest JFR data with dynamic labels, you have to make the following changes to your requests:
- Use a multipart/form-data Content-Type.
- Send the JFR data in a form file field called jfr.
- Send a LabelsSnapshot protobuf message in a form file field called labels.

message Context {
// string_id -> string_id
map<int64, int64> labels = 1;
}
message LabelsSnapshot {
// context_id -> Context
map<int64, Context> contexts = 1;
// string_id -> string
map<int64, string> strings = 2;
}
Where context_id is a parameter set in async-profiler.
Here's sample code that uploads a very simple profile to Pyroscope:
{{< code >}}
printf "foo;bar 100\nfoo;baz 200" | curl \
-X POST \
--data-binary @- \
'http://localhost:4040/ingest?name=curl-test-app&from=1615709120&until=1615709130'
import requests
import urllib.parse
from datetime import datetime
now = round(datetime.now().timestamp() / 10) * 10
params = {'from': f'{now - 10}', 'name': 'python.example{foo=bar}'}
url = f'http://localhost:4040/ingest?{urllib.parse.urlencode(params)}'
data = "foo;bar 100\n" \
"foo;baz 200"
requests.post(url, data = data)
{{< /code >}}
Here's sample code that uploads a JFR profile with labels to Pyroscope:
{{< code >}}
curl -X POST \
-F [email protected] \
-F [email protected] \
"http://localhost:4040/ingest?name=curl-test-app&units=samples&aggregationType=sum&sampleRate=100&from=1655834200&until=1655834210&spyName=javaspy&format=jfr"
{{< /code >}}
There is one primary endpoint for querying profile data: GET /pyroscope/render.
The search input is provided via query parameters. The output is typically a JSON object containing one or more time series and a flame graph.
Here is an overview of the accepted query parameters:
| Name | Description | Notes |
|---|---|---|
query | contains the profile type and label selectors | required |
from | UNIX time for the start of the search window | required |
until | UNIX time for the end of the search window | optional (default is now) |
format | format of the profiling data | optional (default is json) |
maxNodes | the maximum number of nodes the resulting flame graph will contain | optional (default is max_flamegraph_nodes_default) |
groupBy | one or more label names to group the time series by (doesn't apply to the flame graph) | optional (default is no grouping) |
#### query

The query parameter is the only required search input. It carries the profile type and any labels you want to use to narrow down the output.
The format for this parameter is similar to that of a PromQL query and can be defined as:
<profile_type>{<label_name>="<label_value>", <label_name>="<label_value>", ...}
Here is a specific example:
process_cpu:cpu:nanoseconds:cpu:nanoseconds{service_name="my_application_name"}
In a Kubernetes environment, a query could also look like:
process_cpu:cpu:nanoseconds:cpu:nanoseconds{namespace="dev", container="my_application_name"}
{{% admonition type="note" %}} Refer to the profiling types documentation for more information and profile-metrics.json for a list of valid profile types. {{% /admonition %}}
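A selector in this shape can also be assembled programmatically; a small sketch using the label values from the Kubernetes example above:

```python
# Build a render query from a profile type and label matchers.
profile_type = "process_cpu:cpu:nanoseconds:cpu:nanoseconds"
labels = {"namespace": "dev", "container": "my_application_name"}

selector = ",".join(f'{key}="{value}"' for key, value in labels.items())
query = f"{profile_type}{{{selector}}}"
print(query)
```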
#### from and until

The from and until parameters determine the start and end of the time period for the query.
They can be provided in absolute and relative form.
Absolute time
This table details the options for passing absolute values.
| Option | Example | Notes |
|---|---|---|
| Date | 20231223 | Format: YYYYMMDD |
| Unix Time seconds | 1577836800 | |
| Unix Time milliseconds | 1577836800000 | |
| Unix Time microseconds | 1577836800000000 | |
| Unix Time nanoseconds | 1577836800000000000 | |
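The Unix time precisions in the table differ only by factors of 1000. For example, 2020-01-01T00:00:00Z in each accepted unit, using Python's standard library:

```python
from datetime import datetime, timezone

# 2020-01-01T00:00:00Z expressed in the accepted precisions.
t = datetime(2020, 1, 1, tzinfo=timezone.utc)
seconds = int(t.timestamp())
milliseconds = seconds * 1_000
nanoseconds = seconds * 1_000_000_000
print(seconds, milliseconds, nanoseconds)
```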
Relative time
Relative values are always expressed as offsets from now.
| Option | Example |
|---|---|
| 3 hours ago | now-3h |
| 30 minutes ago | now-30m |
| 2 days ago | now-2d |
| 1 week ago | now-7d or now-1w |
Note that a single offset has to be provided; values such as now-3h30m will not work.
Validation
The from and until parameters are subject to validation rules related to max_query_lookback and max_query_length server parameters.
You can find more details on these parameters in the limits section of the server configuration docs.
- If max_query_lookback is configured and from is before now - max_query_lookback, from will be set to now - max_query_lookback.
- If max_query_lookback is configured and until is before now - max_query_lookback, the query will not be executed.
- If max_query_length is configured and the query interval is longer than this configuration, the query will not be executed.

#### format

The format can either be:
- json, in which case the response contains a JSON object
- dot, in which case the response is text containing a DOT representation of the profile

See the Query output section for more information on the response structure.
#### maxNodes

The maxNodes parameter truncates the number of elements in the profile response, to allow tools (for example, a frontend) to render large profiles efficiently.
This is typically used for profiles that are known to have large stack traces.
When no value is provided, the default is taken from the max_flamegraph_nodes_default configuration parameter.
When a value is provided, it is capped to the max_flamegraph_nodes_max configuration parameter.
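The default-and-cap behavior can be sketched as follows; the numeric values 8192 and 16384 are hypothetical stand-ins for the max_flamegraph_nodes_default and max_flamegraph_nodes_max configuration parameters:

```python
def effective_max_nodes(requested, default=8192, maximum=16384):
    """Resolve maxNodes: fall back to the configured default when absent,
    and cap explicit values at the configured maximum.
    (8192 and 16384 are hypothetical configuration values.)"""
    if not requested:
        return default
    return min(requested, maximum)

print(effective_max_nodes(0), effective_max_nodes(500), effective_max_nodes(100000))
```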
#### groupBy

The groupBy parameter impacts the output for the time series portion of the response.
When a valid label is provided, the response contains as many series as there are label values for the given label.
{{% admonition type="note" %}} Pyroscope supports a single label for the group by functionality. {{% /admonition %}}
The output of the /pyroscope/render endpoint is a JSON object based on the following schema:
type FlamebearerProfileV1 struct {
Flamebearer FlamebearerV1 `json:"flamebearer"`
Metadata FlamebearerMetadataV1 `json:"metadata"`
Timeline *FlamebearerTimelineV1 `json:"timeline"`
Groups map[string]*FlamebearerTimelineV1 `json:"groups"`
}
#### flamebearer

The flamebearer field contains data in a form suitable for rendering a flame graph.
Data within the flamebearer is organized in separate arrays containing the profile symbols and the sample values.
#### metadata

The metadata field contains additional information that is helpful to interpret the flamebearer data, such as the unit (nanoseconds, bytes), sample rate, and more.
#### timeline

The timeline field represents the time series for the profile.
Pyroscope pre-computes the step interval (resolution) of the timeline using the query interval (from and until). The minimum step interval is 10 seconds.
The raw profile sample data is down-sampled to the step interval (resolution) using an aggregation function. Currently only sum is supported.
A timeline contains a start time, a list of sample values and the step interval:
{
"timeline": {
"startTime": 1577836800,
"samples": [
100,
200,
400
],
"durationDelta": 10
}
}
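A timeline like the one above can be expanded into (timestamp, value) pairs by stepping startTime by durationDelta; a minimal sketch:

```python
timeline = {
    "startTime": 1577836800,   # timestamp of the first sample (seconds)
    "samples": [100, 200, 400],
    "durationDelta": 10,       # step interval in seconds
}

points = [
    (timeline["startTime"] + i * timeline["durationDelta"], value)
    for i, value in enumerate(timeline["samples"])
]
print(points)
```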
#### groups

The groups field is only populated when grouping is requested by the groupBy query parameter.
When this is the case, the groups field has an entry for every label value found for the query.
This example groups by a cluster:
{
"groups": {
"eu-west-2": { "startTime": 1577836800, "samples": [ 200, 300, 500 ] },
"us-east-1": { "startTime": 1577836800, "samples": [ 100, 200, 400 ] }
}
}
When the format query parameter is dot, the endpoint responds with DOT-formatted data representing the queried profile.
This can be used to create an alternative visualization of the profile.
This example queries a local Pyroscope server for a CPU profile from the pyroscope service for the last hour.
curl \
'http://localhost:4040/pyroscope/render?query=process_cpu%3Acpu%3Ananoseconds%3Acpu%3Ananoseconds%7Bservice_name%3D%22pyroscope%22%7D&from=now-1h'
Here is the same query made more readable:
curl --get \
--data-urlencode "query=process_cpu:cpu:nanoseconds:cpu:nanoseconds{service_name=\"pyroscope\"}" \
--data-urlencode "from=now-1h" \
http://localhost:4040/pyroscope/render
Here is the same example in Python:
import requests
application_name = 'my_application_name'
query = f'process_cpu:cpu:nanoseconds:cpu:nanoseconds{{service_name="{application_name}"}}'
query_from = 'now-1h'
url = 'http://localhost:4040/pyroscope/render'
requests.get(url, params={'query': query, 'from': query_from})
See this Python script for a complete example.
The profilecli tool can also be used to interact with the Pyroscope server API.
The tool supports operations such as ingesting profiles, querying for existing profiles, and more.
Refer to the Profile CLI page for more information.