GraphQL and General-Purpose Indexer

See GraphQL and General-Purpose Indexer for more information on the stack.

Indexer setup

The indexer consists of multiple pipelines that each read, transform, and write checkpoint data to a table. Multiple instances of the indexer can run in parallel, each configured by its own TOML file.

Hardware requirements

CPU: 2 cores per instance
Memory: 4GB per instance

Storage requirements

The general-purpose indexer writes to a Postgres database. The storage estimates below reflect the network as of early 2026 and will change as the network grows. Treat these numbers as directional rather than exact.

The bulk of the storage is consumed by obj_versions at 8.2 TB. A pruning strategy is in development.

A 30-day retention adds 1.8 TB on top, while a 90-day retention contributes up to an additional 3.1 TB.

30-day retention:

Table	Heap (GB)	Idx (GB)
tx_affected_objects	64–70	276–397
tx_calls	27–30	239–366
ev_struct_inst	15–19	174–267
tx_affected_addresses	17–18	63–92
ev_emit_mod	8–9	54–85
tx_balance_changes	18–20	5–6
tx_digests	8	21–25
tx_kinds	5	14–16
cp_sequence_numbers	10	5
TOTAL	444–450	1,827–1,842

90-day retention:

Table	Heap (GB)	Idx (GB)
tx_affected_objects	188–202	580–752
tx_calls	82–87	560–715
ev_struct_inst	45–50	432–531
tx_affected_addresses	50–54	151–183
ev_emit_mod	22–24	129–167
tx_balance_changes	57–63	10–16
tx_digests	24–26	45–55
tx_kinds	15–16	30–36
cp_sequence_numbers	10	5
TOTAL	755–842	2,440–3,181

Run `sui-indexer-alt`

Run an indexer instance using this command for each of the configuration files. The command varies based on whether the pipeline is prunable or unprunable:

For unprunable pipelines (must start from genesis)

$ sui-indexer-alt indexer \
    --config <CONFIG_FILE> \
    --database-url <DATABASE_URL> \
    --remote-store-url <REMOTE_STORE_URL>

For prunable pipelines with retention period

$ sui-indexer-alt indexer \
    --config <CONFIG_FILE> \
    --database-url <DATABASE_URL> \
    --remote-store-url <REMOTE_STORE_URL> \
    --first-checkpoint <CHECKPOINT_NUMBER>

For prunable pipelines, calculate the first-checkpoint based on your retention period:

30-day retention: Start from checkpoint current_checkpoint - 10368000
90-day retention: Start from checkpoint current_checkpoint - 31104000
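
The offsets above work out to roughly 4 checkpoints per second (10368000 / (30 × 86400) = 4). As a minimal sketch under that assumption, the following helper derives a starting checkpoint for any retention period. The function name `first_checkpoint` and the fixed rate are illustrative only and are not part of `sui-indexer-alt`; the real checkpoint rate varies over time.

```shell
#!/usr/bin/env bash
# Sketch: derive a --first-checkpoint value from a retention period.
# ASSUMPTION: 4 checkpoints per second, inferred from the offsets above.
CHECKPOINTS_PER_SECOND=4

# first_checkpoint <current_checkpoint> <retention_days>
first_checkpoint() {
    local current="$1" days="$2"
    local offset=$(( days * 86400 * CHECKPOINTS_PER_SECOND ))
    local first=$(( current - offset ))
    # Clamp to genesis if the network is younger than the retention period.
    if [ "$first" -lt 0 ]; then
        first=0
    fi
    echo "$first"
}

first_checkpoint 100000000 30   # prints 89632000
first_checkpoint 100000000 90   # prints 68896000
```

The printed value can then be passed to `--first-checkpoint` when starting a prunable pipeline.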

CLI param	Description
`<CONFIG_FILE>`	Path to indexer configuration file.
`<DATABASE_URL>`	Postgres database connection string.
`<REMOTE_STORE_URL>`	URL of a checkpoint bucket to index from, one of multiple possible data sources.
`<CHECKPOINT_NUMBER>`	(Optional) For prunable pipelines only - the checkpoint to start indexing from based on retention requirements.

Examples

Unprunable pipeline (from genesis):

$ sui-indexer-alt indexer \
    --config unpruned.toml \
    --database-url postgres://username:password@localhost:5432/database \
    --remote-store-url https://checkpoints.mainnet.sui.io

Prunable pipeline with 30-day retention (assuming current checkpoint is 100,000,000):

$ sui-indexer-alt indexer \
    --config events.toml \
    --database-url postgres://username:password@localhost:5432/database \
    --remote-store-url https://checkpoints.mainnet.sui.io \
    --first-checkpoint 89632000

Run config recommendations {#indexer-run-config}

Use the TOML files below; they are grouped by pipeline speed. All pipelines in an instance are limited by the slowest pipeline in that instance so these files each contain pipelines that run at approximately the same speed.

:::info

When backfilling, you should set the ingest-concurrency to a higher value, e.g. 200, then reduce it to 20 for normal operation at network tip.

:::

<table> <thead> <tr> <th>Config TOML</th> <th>Type</th> <th>Description</th> <th>Pipelines</th> <th>Backfill time</th> <th>Data retention</th> <th>Start checkpoint</th> </tr> </thead> <tbody> <tr> <td>`events.toml`</td> <td>**Prunable**</td> <td>Lightweight event tables.</td> <td> <ul className="pl-4"> <li>`ev_emit_mod`</li> <li>`ev_struct_inst`</li> </ul> </td> <td>1-2 days</td> <td>Configurable (e.g., 30 or 90 days)</td> <td>Based on retention period</td> </tr> <tr> <td colspan={7}> <details> <summary>`events.toml`</summary> <ImportContent source="examples/prod-config/indexer/events.toml" mode="code" /> </details> </td> </tr> <tr> <td>`obj_versions.toml`</td> <td>**Unprunable**</td> <td>Object versions table containing complete object version to checkpoint mappings.</td> <td>`obj_versions`</td> <td>10-14 days</td> <td>Must retain all data</td> <td>Genesis (checkpoint 0)</td> </tr> <tr> <td colspan={7}> <details> <summary>`obj_versions.toml`</summary> <ImportContent source="examples/prod-config/indexer/obj_versions.toml" mode="code" /> </details> </td> </tr> <tr> <td>`tx_affected_addresses.toml`</td> <td>**Prunable**</td> <td>Midweight transaction table.</td> <td>`tx_affected_addresses`</td> <td>1-2 days</td> <td>Configurable (e.g., 30 or 90 days)</td> <td>Based on retention period</td> </tr> <tr> <td colspan={7}> <details> <summary>`tx_affected_addresses.toml`</summary> <ImportContent source="examples/prod-config/indexer/tx_affected_addresses.toml" mode="code" /> </details> </td> </tr> <tr> <td>`tx_affected_objects.toml`</td> <td>**Prunable**</td> <td>Midweight transaction table.</td> <td>`tx_affected_objects`</td> <td>1-2 days</td> <td>Configurable (e.g., 30 or 90 days)</td> <td>Based on retention period</td> </tr> <tr> <td colspan={7}> <details> <summary>`tx_affected_objects.toml`</summary> <ImportContent source="examples/prod-config/indexer/tx_affected_objects.toml" mode="code" /> </details> </td> </tr> <tr> <td>`tx_calls.toml`</td> <td>**Prunable**</td> 
<td>Midweight transaction table.</td> <td>`tx_calls`</td> <td>1-2 days</td> <td>Configurable (e.g., 30 or 90 days)</td> <td>Based on retention period</td> </tr> <tr> <td colspan={7}> <details> <summary>`tx_calls.toml`</summary> <ImportContent source="examples/prod-config/indexer/tx_calls.toml" mode="code" /> </details> </td> </tr> <tr> <td>`tx_kinds.toml`</td> <td>**Prunable**</td> <td>Midweight transaction table.</td> <td>`tx_kinds`</td> <td>1-2 days</td> <td>Configurable (e.g., 30 or 90 days)</td> <td>Based on retention period</td> </tr> <tr> <td colspan={7}> <details> <summary>`tx_kinds.toml`</summary> <ImportContent source="examples/prod-config/indexer/tx_kinds.toml" mode="code" /> </details> </td> </tr> <tr> <td>`unpruned.toml`</td> <td>**Unprunable**</td> <td>Foundational reference data that other queries depend on.</td> <td> <ul className="pl-4"> <li>`cp_sequence_numbers`</li> <li>`kv_epoch_ends`</li> <li>`kv_epoch_starts`</li> <li>`kv_feature_flags`</li> <li>`kv_packages`</li> <li>`kv_protocol_configs`</li> <li>`sum_displays`</li> <li>`tx_balance_changes`</li> <li>`tx_digests`</li> </ul> </td> <td>2-4 days</td> <td>Must retain all data</td> <td>Genesis (checkpoint 0)</td> </tr> <tr> <td colspan={7}> <details> <summary>`unpruned.toml`</summary> <ImportContent source="examples/prod-config/indexer/unpruned.toml" mode="code" /> </details> </td> </tr> </tbody> </table>

Consistent store setup

All consistent store pipelines run in the same instance based on a single configuration file. Like the indexer, the pipelines run in parallel and throughput is limited by the slowest pipeline.

Hardware requirements

CPU: 8 cores
Memory: 32GB

Restore command

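Restores one or more pipelines from a snapshot. The snapshot source can be an Azure, GCS, or S3 bucket containing managed snapshots, or a formal snapshot HTTP endpoint.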

$ sui-indexer-alt-consistent-store restore \
    --azure <AZURE_BUCKET> \
    --database-path <DATABASE_PATH> \
    --gcs <GCS_BUCKET> \
    --http <HTTP_ENDPOINT> \
    --object-file-concurrency <OBJECT_FILE_CONCURRENCY> \
    --pipeline <PIPELINE_NAME> \
    --remote-store-url <REMOTE_STORE_URL> \
    --s3 <S3_BUCKET>

CLI parameter	Description
`<AZURE_BUCKET>` *	Name or URL of Azure bucket containing managed snapshots.
`<DATABASE_PATH>`	Path to RocksDB database.
`<GCS_BUCKET>` *	Name or URL of GCS bucket containing managed snapshots.
`<HTTP_ENDPOINT>` *	URL of formal snapshot API.
`<OBJECT_FILE_CONCURRENCY>`	Number of object files to process concurrently during the restore.
`<PIPELINE_NAME>`	Name of pipeline to restore. Can be set multiple times; once per pipeline.
`<REMOTE_STORE_URL>`	URL of a checkpoint bucket to index from, one of multiple possible data sources.
`<S3_BUCKET>` *	Name or URL of AWS S3 bucket containing managed snapshots.

* Must specify one of <AZURE_BUCKET>, <GCS_BUCKET>, <HTTP_ENDPOINT>, or <S3_BUCKET>.

Example:

$ sui-indexer-alt-consistent-store restore \
    --database-path /path/to/rocksdb \
    --http https://formal-snapshot.mainnet.sui.io \
    --object-file-concurrency 5 \
    --pipeline balances \
    --pipeline object_by_owner \
    --pipeline object_by_type \
    --remote-store-url https://checkpoints.mainnet.sui.io

Run command

Run a consistent store instance using this command for the configuration file that follows:

$ sui-indexer-alt-consistent-store run \
    --config <CONFIG_FILE> \
    --database-path <DATABASE_PATH> \
    --remote-store-url <REMOTE_STORE_URL>

CLI param	Description
`<CONFIG_FILE>`	Path to consistent store configuration file.
`<DATABASE_PATH>`	Path to RocksDB database.
`<REMOTE_STORE_URL>`	URL of a checkpoint bucket to index from, one of multiple possible data sources.

Example:

$ sui-indexer-alt-consistent-store run \
    --config consistent-store.toml \
    --database-path /path/to/rocksdb \
    --remote-store-url https://checkpoints.mainnet.sui.io

Run config recommendations

<table> <thead> <tr> <th>Config TOML</th> <th>Description</th> <th>Pipelines</th> <th>Backfill time</th> <th>Data retention</th> </tr> </thead> <tbody> <tr> <td>`consistent-store.toml`</td> <td>Consistent store API configuration and event tables.</td> <td> <ul className="pl-4"> <li>`balances`</li> <li>`object_by_owner`</li> <li>`object_by_type`</li> </ul> </td> <td>1-2 hours</td> <td>Must retain all data</td> </tr> </tbody> </table> <details> <summary>`consistent-store.toml`</summary> <ImportContent source="examples/prod-config/consistent_store.toml" mode="code" /> </details>

GraphQL RPC server setup

The GraphQL RPC server reads data from the general-purpose indexer's database (Postgres), the consistent store, and the archival service.

Ensure that all unprunable indexer pipelines (obj_versions.toml and unpruned.toml) have fully caught up to the network tip before starting the GraphQL RPC server. The GraphQL service will only operate normally once these pipelines are complete.
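
One way to gate startup on this condition is a small check that compares a pipeline's latest indexed checkpoint with the network tip. The sketch below assumes you already have both numbers; how you obtain them depends on your deployment and is not shown. The `caught_up` helper and its lag tolerance are hypothetical and not part of any Sui tool.

```shell
#!/usr/bin/env bash
# Hypothetical helper: succeeds when a pipeline is within <max_lag>
# checkpoints of the network tip.
# caught_up <pipeline_checkpoint> <tip_checkpoint> <max_lag>
caught_up() {
    local indexed="$1" tip="$2" max_lag="$3"
    [ $(( tip - indexed )) -le "$max_lag" ]
}

# Example with made-up numbers: 40 checkpoints behind, tolerance of 100.
if caught_up 99999960 100000000 100; then
    echo "caught up"
else
    echo "still backfilling"
fi
```

Run the check once for each unprunable pipeline and start the GraphQL RPC server only when every check succeeds.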

Hardware requirements

CPU: 2 cores per instance
Memory: 4GB per instance

Scale the number of nodes based on the desired read throughput requirements of your client applications.

GraphQL RPC server dependencies

The GraphQL RPC server relies on multiple backend services to fulfill different types of queries:

Archival service (--ledger-grpc-url) provides historical data for most queries involving checkpoints, objects, and transactions.
Consistent store (--consistent-store-url) serves live data for queries related to current object and balance ownership.
Postgres database (--database-url) is the primary store for most queries, except for direct object and transaction lookups handled by the Archival service.
Fullnode RPC (--fullnode-rpc-url) powers transaction simulation and execution.

Set the appropriate service URLs in your run command based on the query types your GraphQL RPC server needs to support.

Run `sui-indexer-alt-graphql`

:::info

If you use the Sui Foundation–hosted public good archival service on Testnet or Mainnet, you may encounter performance issues. The team will address these before the GraphQL RPC and Archival Service reach general availability.

:::

Use the following command to run a GraphQL RPC server node:

$ sui-indexer-alt-graphql rpc \
    --config <PATH_TO_GRAPHQL_CONFIG_FILE> \
    --indexer-config <PATH_TO_INDEXER_CONFIG_FILE_1> \
    --indexer-config <PATH_TO_INDEXER_CONFIG_FILE_2> \
    --indexer-config <PATH_TO_INDEXER_CONFIG_FILE_3> \
    --ledger-grpc-url <LEDGER_GRPC_URL> \
    --consistent-store-url <CONSISTENT_STORE_URL> \
    --database-url <DATABASE_URL> \
    --fullnode-rpc-url <FULLNODE_RPC_URL>

Multiple --indexer-config parameters can be provided, one for each general-purpose indexer instance.
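
When there are many indexer instances, the repeated flag can be generated from a list of config files rather than typed by hand. This is a minimal sketch; `indexer_config_flags` is a hypothetical helper name, and it assumes the file names contain no spaces.

```shell
#!/usr/bin/env bash
# Hypothetical helper: emit one "--indexer-config <file>" pair per argument.
# ASSUMPTION: file names contain no whitespace.
# indexer_config_flags <file>...
indexer_config_flags() {
    local flags="" f
    for f in "$@"; do
        # Append a separating space only after the first pair.
        flags="${flags:+$flags }--indexer-config $f"
    done
    printf '%s\n' "$flags"
}

indexer_config_flags events.toml obj_versions.toml unpruned.toml
# prints: --indexer-config events.toml --indexer-config obj_versions.toml --indexer-config unpruned.toml
```

The output can be spliced into the `sui-indexer-alt-graphql rpc` command line with command substitution.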

CLI parameter	Description
`<PATH_TO_GRAPHQL_CONFIG_FILE>`	Path to the optional GraphQL RPC server configuration file
`<PATH_TO_INDEXER_CONFIG_FILE_N>`	Path to a general-purpose indexer configuration file; can be set multiple times, once per indexer instance
`<LEDGER_GRPC_URL>`	URL to Archival service's `LedgerService` gRPC API
`<CONSISTENT_STORE_URL>`	URL to Consistent store API
`<DATABASE_URL>`	Postgres database connection string
`<FULLNODE_RPC_URL>`	URL to full node RPC

Example:

$ sui-indexer-alt-graphql rpc \
    --config graphql.toml \
    --indexer-config events.toml \
    --indexer-config obj_versions.toml \
    --indexer-config tx_affected_addresses.toml \
    --indexer-config tx_affected_objects.toml \
    --indexer-config tx_calls.toml \
    --indexer-config tx_kinds.toml \
    --indexer-config unpruned.toml \
    --ledger-grpc-url https://archive.mainnet.sui.io:443 \
    --consistent-store-url https://localhost:7001 \
    --database-url postgres://username:password@localhost:5432/database \
    --fullnode-rpc-url https://localhost:9000

Generating Configuration

You can run the GraphQL RPC server without a configuration file, which will use default values. To customize settings, generate a config file using the command below and edit it as needed:


$ sui-indexer-alt-graphql generate-config > <PATH_TO_GRAPHQL_CONFIG_FILE>

It will produce output similar to the following:

<details> <summary>`graphql.toml`</summary> <ImportContent source="examples/prod-config/graphql.toml" mode="code" /> </details>

Indexer/GraphQL Postgres-compatible Database Setup

Both the indexer and GraphQL server require a Postgres-compatible database shared between them.

The recommended specs below were load tested at the following GraphQL request throughputs:

500 requests per second when the indexer is backfilling from genesis or running a restore
1000 requests per second when the indexer is indexing from the network tip (~4.25 checkpoints per second)

AlloyDB Omni

AlloyDB Omni recommends 8GB RAM per vCPU. Allocating less than this results in the database closing Indexer and GraphQL connections during load testing.

Hardware requirements

CPU: 6 cores
Memory: 48GB

Vanilla Postgres

Hardware requirements

CPU: 6 cores
Memory: 48GB

Adding a new pipeline

Adding a new pipeline to an existing indexer currently requires these steps:

1. We recommend starting a new pipeline in its own indexer instance, optionally with the --first-checkpoint <checkpoint> flag set if you want to start from a checkpoint after genesis.
   - Note: --first-checkpoint is only respected if no watermark record exists for the pipeline(s) (the pipeline has not been run before). The watermark record must be manually removed if you want to run the pipeline with a different value of --first-checkpoint or it will be ignored.
2. After the new pipeline has caught up to the tip of the network, you can optionally merge it into another indexer instance.

Reducing indexer pipeline table and index bloat

Bloat is the difference between the size of the data in a table or index and the amount of space it takes up on disk. Autovacuum prevents bloat when approximately the same number of rows is continually inserted and deleted, as is the case when pipeline pruning is enabled. However, autovacuum does not handle the case where more rows are deleted than inserted. This can occur in two cases:

Enabling pruning on an unpruned pipeline.
Reducing the pruning retention period on a pruned table.
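
Using the definition of bloat above, a rough bloat percentage can be computed from two sizes: the space a table or index occupies on disk and an estimate of its live data. This is a minimal sketch with made-up numbers; `bloat_percent` is a hypothetical helper, and how you estimate the live data size is left to your own tooling.

```shell
#!/usr/bin/env bash
# Hypothetical helper: bloat as a percentage of on-disk size.
# Both sizes must use the same unit (for example, GB).
# bloat_percent <on_disk_size> <live_data_size>
bloat_percent() {
    local on_disk="$1" live="$2"
    # Avoid dividing by zero for empty tables.
    if [ "$on_disk" -le 0 ]; then
        echo 0
        return
    fi
    echo $(( (on_disk - live) * 100 / on_disk ))
}

# Example: 400 GB on disk holding 300 GB of live data.
bloat_percent 400 300   # prints 25
```

A high percentage after enabling pruning or shortening retention suggests that one of the tools below is worth running.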

These tools can be used to reduce bloat:

Tool	Type	`ACCESS EXCLUSIVE` locking	Schedulable	Link
`VACUUM FULL`	Built-in	Entire operation	No	https://www.postgresql.org/docs/current/sql-vacuum.html
pg_repack	Extension	Briefly during initial and final step	No	https://reorg.github.io/pg_repack/
pg_squeeze	Extension	Briefly during final step	Yes	https://github.com/cybertec-postgresql/pg_squeeze
