Handsontable Performance Tests

Automated performance measurement suite for Handsontable using Playwright and Chrome DevTools Protocol (CDP) traces.

The system traces real user interactions (scrolling, filtering, sorting, editing), parses the CDP trace into the same categories shown by the DevTools Performance panel, and produces a compact markdown PR comment plus a self-contained interactive HTML report comparing against a golden baseline from develop.

Prerequisites

Node.js 22 (see .nvmrc in the repo root)
pnpm 10.30.2 (corepack enable && corepack prepare [email protected] --activate)

This package is standalone -- it is not part of the pnpm workspace. It has its own package.json and node_modules. However, it depends on the workspace being installed (for cross-env-shell, webpack, and other build tools) and on Handsontable being built (for the UMD bundle).

Quick start

The orchestrator script handles everything -- installing dependencies, building Handsontable, copying build artifacts, installing Playwright, and running all scenarios:

bash

cd performance-tests
node scripts/run.mjs

This takes approximately 3-4 minutes and produces:

Trace JSON files in output/<scenario>/iteration-{1,2,3}.json
A compact markdown report at output/result.md
An interactive HTML report at output/report.html

Running a single scenario

If Handsontable is already built and fixtures are in place:

bash

npx playwright test --grep "scroll-down"

Running with golden baseline

bash

# Save a golden baseline
PERF_MODE=golden node scripts/run.mjs

# Compare current code against the golden baseline
PERF_MODE=compare node scripts/run.mjs

Linting and type checking

bash

npm run lint
npm run typecheck

Scenarios

Each scenario measures a specific user interaction pattern:

Scenario	Grid size	Action	Notes
scroll-down	5000 x 10	`mouse.wheel(0, 350)` x 500	Vertical scroll from top
scroll-up	5000 x 10	`mouse.wheel(0, -350)` x 500	Pre-scrolls to bottom, then scrolls up
scroll-right	10 x 5000	`mouse.wheel(350, 0)` x 500	Horizontal scroll from left
scroll-left	10 x 5000	`mouse.wheel(-350, 0)` x 500	Pre-scrolls to right, then scrolls left
filtering	1000 x 1000	`filters.addCondition()` + `filter()`	Hook timing: beforeFilter -> afterFilter
sorting	1000 x 1000	`columnSorting.sort()`	Hook timing: beforeColumnSort -> afterColumnSort
cell-editing	5000 x 10	selectCell + Enter + type + Enter x 20	Sequential cell edits

Each scenario runs 1 warmup iteration (discarded) followed by 3 measured iterations with CDP tracing.

Project structure

performance-tests/
  scripts/
    run.mjs                   # Orchestrator (install, build, copy, run)
  playwright.config.ts         # Sequential execution, 1 worker, 5 min timeout
  trace-parser.mjs             # CDP trace -> DevTools category breakdown
  .eslintrc.js                 # ESLint config (extends root)
  lib/
    trace-runner.mjs           # CDP Tracing.start/stop + warmup/iteration loop
    hook-timing.mjs            # performance.now() on before/after hook pairs + save
    snapshot-store.mjs         # Golden baseline save/load/compare
    thresholds.mjs             # Shared classification (regression/improvement %)
    chart-generator.mjs        # Inline SVG bar charts for reports
    report-builder.mjs         # Compact markdown PR comment
    html-report-builder.mjs    # Self-contained interactive HTML report
    build-history-index.mjs    # gh-pages history listing for develop runs
    teardown.mjs               # Playwright globalTeardown: traces -> report
    fs-utils.mjs               # Shared filesystem helpers (exists)
    scroll-utils.mjs           # Scroll-and-wait helpers for scroll scenarios
  scenarios/
    <name>/
      scenario.config.mjs      # { name, warmupRuns, iterations }
      fixture.html              # Standalone HTML loading HOT UMD
      <name>.spec.ts            # Playwright test using runTracedScenario()
  fixtures/                     # Built JS/CSS (gitignored, copied by run.mjs)
  golden/                       # Golden snapshots (gitignored, from gh-pages)
  output/                       # Trace JSONs + result.md + report.html (gitignored)

How it works

Trace pipeline

The spec file calls runTracedScenario(), which starts CDP tracing (Tracing.start), executes the action, stops tracing (Tracing.end), and writes the raw JSON per iteration.
The globalTeardown (lib/teardown.mjs) discovers all output/*/iteration-*.json files and feeds them to the trace parser.
The trace parser (trace-parser.mjs) categorizes every trace event into DevTools-equivalent categories: scripting, rendering, painting, loading, system (other), and idle. It computes the auto-zoomed window (matching DevTools' MainThreadActivity.calculateWindow), synthesizes ProfileCall scripting from CPU profile data, and extracts UpdateCounters (heap, nodes, listeners).
Results are averaged across iterations with per-iteration values retained for CV% (coefficient of variation) calculation.
Two report builders produce output:
- Markdown (report-builder.mjs): compact summary table with regression callouts, posted as a sticky PR comment.
- HTML (html-report-builder.mjs): self-contained interactive page with inline SVG bar charts, sortable tables, and scenario detail views. Deployed to GitHub Pages per branch.

Golden baseline workflow

The CI workflow (.github/workflows/performance-tests.yml) operates in two modes:

On push to develop (PERF_MODE=golden): Runs all scenarios, saves the averaged results as golden/snapshots.json, and deploys them to the gh-pages branch under performance-reports/develop/<timestamp>/. A latest.json pointer is updated for PR comparisons. A history index page lists all past runs.
On pull request (PERF_MODE=compare): Fetches latest.json from the gh-pages branch, runs all scenarios, and generates a delta report. The markdown summary is posted as a sticky PR comment; the full HTML report is deployed to GitHub Pages at performance-reports/<branch-slug>/.

If no golden baseline exists (first run, or gh-pages branch not yet created), the report shows raw metrics in self-compare mode.

Metrics

The report includes these categories (matching the DevTools Performance panel):

Category	What it measures
Scripting	JavaScript execution, event handlers, timers, GC
Rendering	Style recalculation, layout, layer updates
Painting	Paint, rasterization, compositing
Loading	HTML parsing, resource loading
System	Internal browser overhead (RunTask, etc.)
Idle	Time between tasks

Additional metrics from UpdateCounters:

JS heap size (min/max)
DOM node count (min/max)
Event listener count (min/max)

CV% (coefficient of variation) is shown per metric. Values above 15% are flagged with !!! -- these indicate unstable measurements that may not be reliable for comparison.

Adding a new scenario

Create a new directory under scenarios/ with three files:

1. `scenario.config.mjs`

export default {
  name: 'my-scenario',   // Must match the directory name
  warmupRuns: 1,
  iterations: 3,
};

2. `fixture.html`

html

<!DOCTYPE html>
<html>
<head>
  <meta charset="utf-8">
  <title>My Scenario</title>
  <link rel="stylesheet" href="../../fixtures/handsontable.css">
  <script src="../../fixtures/handsontable.full.js"></script>
</head>
<body>
  <div id="hot"></div>
  <script>
    const hot = new Handsontable(document.getElementById('hot'), {
      data: Handsontable.helper.createSpreadsheetData(5000, 10),
      rowHeaders: true,
      colHeaders: true,
      width: 1280,
      height: 600,
      autoRowSize: false,
      autoColumnSize: false,
      licenseKey: 'non-commercial-and-evaluation',
    });
    window.__hot = hot;
  </script>
</body>
</html>

Important:

Always set autoRowSize: false and autoColumnSize: false -- these async plugins interfere with trace measurements.
The CSS file is handsontable.css (not handsontable.full.css).
Always expose the instance as window.__hot.

3. `<name>.spec.ts`

import { test } from '@playwright/test';
import path from 'node:path';
import { runTracedScenario } from '../../lib/trace-runner.mjs';
import config from './scenario.config.mjs';

const fixturePath = path.resolve(import.meta.dirname, 'fixture.html');

test(config.name, async({ page }) => {
  await page.goto(`file://${fixturePath}`);
  await page.waitForFunction(() => (window as any).__hot);

  await runTracedScenario({
    page,
    warmupRuns: config.warmupRuns,
    iterations: config.iterations,
    outputDir: path.resolve('output', config.name),
    actionFn: async() => {
      // The measured action
    },
    // Optional: resetFn to restore state between iterations
  });
});

Adding hook timing

For scenarios that measure a specific Handsontable hook pair (like filtering or sorting):

Call injectHookTimer(page, beforeHook, afterHook) once before runTracedScenario. It is idempotent -- safe to call again in resetFn to reset the timer store.
Inside actionFn, call getHookTiming(page, beforeHook, afterHook) and push timing.deltaMs to a deltas array.
After runTracedScenario, call saveHookTimings(outputDir, deltas) to persist the data.

All three functions are exported from lib/hook-timing.mjs. See the filtering or sorting scenario specs for the complete pattern.

Scroll helpers

For scenarios that need to scroll the grid before or between iterations, use scrollToRow(page, row) and scrollToColumn(page, col) from lib/scroll-utils.mjs. These combine scrollViewportTo with a deterministic wait (no waitForTimeout) that verifies the target index is renderable.

Trace parser

The trace-parser.mjs module can also be used standalone to analyze any Chrome trace JSON:

bash

# Parse a single trace
node trace-parser.mjs output/scroll-down/iteration-1.json

# Average multiple traces
node trace-parser.mjs output/scroll-down/iteration-*.json

# Show all categories including zero values
node trace-parser.mjs trace.json --full

# Show debug info (thread IDs, window range, event counts)
node trace-parser.mjs trace.json --debug

Sample report output

When running in compare mode with a golden baseline, the PR comment shows a compact summary:

## ⚡ Performance Results

| Scenario    | Scripting | Rendering | Painting | Total  | Δ         |
|-------------|-----------|-----------|----------|--------|-----------|
| Scroll Down | 3472 ms   | 1721 ms   | 89 ms    | 5282 ms | +8.0% 🟡 |
| Filtering   | 245 ms    | 112 ms    | 15 ms    | 372 ms  | -1.2% 🔵 |

### Regressions

> ⚠️ **Scroll Down** regressed +8.0%
> Scripting +12.0% slower, Rendering -2.1% faster, Painting +5.0% slower

📊 **[Full interactive report →](https://handsontable.github.io/handsontable/performance-reports/my-branch/)**

The HTML report (linked from the PR comment) includes inline SVG bar charts, sortable detail tables, CV% stability indicators, and per-category breakdowns.

Without a golden baseline, the report shows raw metrics in self-compare mode.

Handsontable Performance Tests

Handsontable Performance Tests

Prerequisites

Quick start

Running a single scenario

Running with golden baseline

Linting and type checking

Scenarios

Project structure

How it works

Trace pipeline

Golden baseline workflow

Metrics

Adding a new scenario

1. scenario.config.mjs

2. fixture.html

3. <name>.spec.ts

Adding hook timing

Scroll helpers

Trace parser

Sample report output

1. `scenario.config.mjs`

2. `fixture.html`

3. `<name>.spec.ts`