DOCS-TESTING.md
This guide covers all testing procedures for the InfluxData documentation, including code block testing, link validation, and style linting.
yarn to install all dependenciesdocker build -t influxdata/docs-pytest:latest -f Dockerfile.pytest .| Test Type | Purpose | Command |
|---|---|---|
| Code blocks | Validate shell/Python code examples | yarn test:codeblocks:all |
| Link validation | Check internal/external links | yarn test:links |
| Style linting | Enforce writing standards | .ci/vale/vale.sh |
| Markdown generation | Generate LLM-friendly Markdown | yarn build:md |
| E2E tests | UI and functionality testing | yarn test:e2e |
Code block testing validates that shell commands and Python scripts in documentation work correctly using pytest-codeblocks.
# Test all code blocks
yarn test:codeblocks:all
# Test specific products
yarn test:codeblocks:cloud
yarn test:codeblocks:v2
yarn test:codeblocks:telegraf
chmod +x ./test/src/*.sh
Create databases, buckets, and tokens for the product(s) you're testing. If you don't have access to a Clustered instance, you can use your Cloud Dedicated instance for testing in most cases.
Copy the ./test/env.test.example file into each product directory and rename as .env.test:
# Example locations
./content/influxdb/cloud-dedicated/.env.test
./content/influxdb3/clustered/.env.test
Inside each product's .env.test file, assign your InfluxDB credentials:
INFLUX_ environment variablescloud-dedicated/.env.test and clustered/.env.test, also define:
ACCOUNT_ID, CLUSTER_ID: Found in your influxctl config.tomlMANAGEMENT_TOKEN: Generate with influxctl management createSee ./test/src/prepare-content.sh for the full list of variables you may need.
For influxctl commands to run in tests, move or copy your config.toml file to the ./test directory.
[!Warning]
- The database you configure in
.env.testand any written data may be deleted during test runs- Don't add your
.env.testfiles to Git. Git is configured to ignore.env*files to prevent accidentally committing credentials
print("Hello, world!")
Hello, world!
For commands that require TTY interaction (like influxctl authentication), wrap the command in a subshell and redirect output:
# Test the preceding command outside of the code block.
# influxctl authentication requires TTY interaction--
# output the auth URL to a file that the host can open.
script -c "influxctl user list " \
/dev/null > /shared/urls.txt
To hide test blocks from users, wrap them in HTML comments. pytest-codeblocks will still collect and run them.
pytest-codeblocks has features for skipping tests and marking blocks as failed. See the pytest-codeblocks README for details.
Potential causes:
pytest.inipython (not py) for Python code block language identifiers:
# This works
# This is ignored
The documentation includes tooling to generate LLM-friendly Markdown versions of documentation pages, both locally via CLI and on-demand via Lambda@Edge in production.
# Prerequisites (run once)
yarn install
yarn build:ts
npx hugo --quiet
# Generate Markdown
node scripts/html-to-markdown.js --path influxdb3/core/get-started --limit 10
# Validate generated Markdown
node cypress/support/run-e2e-specs.js \
--spec "cypress/e2e/content/markdown-content-validation.cy.js"
For complete documentation including prerequisites, usage examples, output formats, frontmatter structure, troubleshooting, and architecture details, see the inline documentation:
# Or view the first 150 lines in terminal
head -150 scripts/html-to-markdown.js
The script documentation includes:
scripts/html-to-markdown.js - Comprehensive inline documentationscripts/lib/markdown-converter.js - Shared conversion librarydeploy/llm-markdown/lambda-edge/markdown-generator/index.js - Production deploymentdeploy/llm-markdown/README.md - Deployment guidecypress/e2e/content/markdown-content-validation.cy.js - Validation testsAll generated markdown files include structured YAML frontmatter:
---
title: Page Title
description: Page description for SEO
url: /influxdb3/core/get-started/
product: InfluxDB 3 Core
version: core
date: 2024-01-15T00:00:00Z
lastmod: 2024-11-20T00:00:00Z
type: page
estimated_tokens: 2500
---
Section pages include additional fields:
---
type: section
pages: 4
child_pages:
- title: Set up InfluxDB 3 Core
url: /influxdb3/core/get-started/setup/
- title: Write data
url: /influxdb3/core/get-started/write/
---
# Generate markdown with verbose output
node scripts/html-to-markdown.js --path influxdb3/core/get-started --limit 2 --verbose
# Check files were created
ls -la public/influxdb3/core/get-started/*.md
# View generated content
cat public/influxdb3/core/get-started/index.md
# Check frontmatter
head -20 public/influxdb3/core/get-started/index.md
The repository includes comprehensive Cypress tests for markdown validation:
# Run all markdown validation tests
node cypress/support/run-e2e-specs.js --spec "cypress/e2e/content/markdown-content-validation.cy.js"
# Test specific content file
node cypress/support/run-e2e-specs.js \
--spec "cypress/e2e/content/markdown-content-validation.cy.js" \
content/influxdb3/core/query-data/execute-queries/_index.md
The Cypress tests validate:
{{< >}} or {{% %}})Cause: Page doesn't have <article class="article--content"> element (common for index/list pages)
Solution: This is normal behavior. The converter skips pages without article content. To verify:
# Check HTML structure
grep -l 'article--content' public/path/to/page/index.html
Cause: TypeScript not compiled (product-mappings.js missing)
Solution: Build TypeScript first:
yarn build:ts
ls -la dist/utils/product-mappings.js
Cause: Attempting to process thousands of pages at once
Solution: Use --limit flag to process in batches:
# Process 1000 files at a time
node scripts/html-to-markdown.js --limit 1000
Cause: Product mappings not up to date or path doesn't match known patterns
Solution:
yarn build:tsassets/js/utils/product-mappings.tsBefore committing markdown generation changes:
yarn build:tsnpx hugo --quiet{{<, {{%)<!--, -->)The markdown generation uses a shared library architecture:
docs-v2/
├── scripts/
│ ├── html-to-markdown.js # CLI wrapper (filesystem operations)
│ └── lib/
│ └── markdown-converter.js # Core conversion logic (shared library)
├── dist/
│ └── utils/
│ └── product-mappings.js # Product detection (compiled from TS)
└── public/ # Generated HTML + Markdown files
The shared library (scripts/lib/markdown-converter.js) is:
For deployment details, see deploy/lambda-edge/markdown-generator/README.md.
Link validation uses the link-checker tool to validate internal and external links in documentation files.
Option 1: Build from source (macOS/local development)
For local development on macOS, build the link-checker from source:
# Clone and build link-checker
git clone https://github.com/influxdata/docs-tooling.git
cd docs-tooling/link-checker
cargo build --release
# Copy binary to your PATH or use directly
cp target/release/link-checker /usr/local/bin/
# OR use directly: ./target/release/link-checker
Option 2: Download pre-built binary (GitHub Actions/Linux)
The link-checker binary is distributed via docs-v2 releases for reliable access from GitHub Actions workflows:
# Download Linux binary from docs-v2 releases
curl -L -o link-checker \
https://github.com/influxdata/docs-v2/releases/download/link-checker-v1.0.0/link-checker-linux-x86_64
chmod +x link-checker
# Verify installation
./link-checker --version
[!Note] Pre-built binaries are currently Linux x86_64 only. For macOS development, use Option 1 to build from source.
# Clone and build link-checker
git clone https://github.com/influxdata/docs-tooling.git
cd docs-tooling/link-checker
cargo build --release
# Copy binary to your PATH or use directly
cp target/release/link-checker /usr/local/bin/
For maintainers: To create a new link-checker release in docs-v2:
Create release in docs-tooling (builds and releases binary automatically):
cd docs-tooling
git tag link-checker-v1.2.x
git push origin link-checker-v1.2.x
Manually distribute to docs-v2 (required due to private repository access):
# Download binary from docs-tooling release
curl -L -H "Authorization: Bearer $(gh auth token)" \
-o link-checker-linux-x86_64 \
"https://github.com/influxdata/docs-tooling/releases/download/link-checker-v1.2.x/link-checker-linux-x86_64"
curl -L -H "Authorization: Bearer $(gh auth token)" \
-o checksums.txt \
"https://github.com/influxdata/docs-tooling/releases/download/link-checker-v1.2.x/checksums.txt"
# Create docs-v2 release
gh release create \
--repo influxdata/docs-v2 \
--title "Link Checker Binary v1.2.x" \
--notes "Link validation tooling binary for docs-v2 GitHub Actions workflows." \
link-checker-v1.2.x \
link-checker-linux-x86_64 \
checksums.txt
Update workflow reference (if needed):
# Update .github/workflows/pr-link-check.yml line 98 to use new version
sed -i 's/link-checker-v[0-9.]*/link-checker-v1.2.x/' .github/workflows/pr-link-check.yml
[!Note] The manual distribution is required because docs-tooling is a private repository and the default GitHub token doesn't have cross-repository access for private repos.
# Map content files to public HTML files
link-checker map content/path/to/file.md
# Check links in HTML files
link-checker check public/path/to/file.html
# Generate configuration file
link-checker config
The link-checker automatically handles relative link resolution based on the input type:
Local Files → Local Resolution
# When checking local files, relative links resolve to the local filesystem
link-checker check public/influxdb3/core/admin/scale-cluster/index.html
# Relative link /influxdb3/clustered/tags/kubernetes/ becomes:
# → /path/to/public/influxdb3/clustered/tags/kubernetes/index.html
URLs → Production Resolution
# When checking URLs, relative links resolve to the production site
link-checker check https://docs.influxdata.com/influxdb3/core/admin/scale-cluster/
# Relative link /influxdb3/clustered/tags/kubernetes/ becomes:
# → https://docs.influxdata.com/influxdb3/clustered/tags/kubernetes/
Why This Matters
# Map Markdown files to HTML
link-checker map content/influxdb3/core/get-started/
# Check links in mapped HTML files
link-checker check public/influxdb3/core/get-started/
# Map shared content files
link-checker map content/shared/influxdb3-cli/
# Check the mapped output files
# (link-checker map outputs the HTML file paths)
link-checker map content/shared/influxdb3-cli/ | \
xargs link-checker check
# Check HTML files directly without mapping
link-checker check public/influxdb3/core/get-started/
# Check only files changed in the last commit
git diff --name-only HEAD~1 HEAD | grep '\.md$' | \
xargs link-checker map | \
xargs link-checker check
# Uses default settings or test.lycherc.toml if present
link-checker check public/influxdb3/core/get-started/
# Use production configuration with comprehensive exclusions
link-checker check \
--config .ci/link-checker/production.lycherc.toml \
public/influxdb3/core/get-started/
Automated Integration (docs-v2)
The docs-v2 repository includes automated link checking for pull requests:
The workflow automatically:
Manual Integration (other repositories)
For other repositories, you can integrate link checking manually:
name: Link Check
on:
pull_request:
paths:
- 'content/**/*.md'
jobs:
link-check:
runs-on: ubuntu-latest
steps:
- uses: actions/checkout@v4
- name: Download link-checker
run: |
curl -L -o link-checker \
https://github.com/influxdata/docs-tooling/releases/latest/download/link-checker-linux-x86_64
chmod +x link-checker
cp target/release/link-checker ../../link-checker
cd ../..
- name: Build Hugo site
run: |
npm install
npx hugo --minify
- name: Check changed files
run: |
git diff --name-only origin/main HEAD | \
grep '\.md$' | \
xargs ./link-checker map | \
xargs ./link-checker check \
--config .ci/link-checker/production.lycherc.toml
Style linting uses Vale to enforce documentation writing standards, branding guidelines, and vocabulary consistency.
brew install vale (or see Vale installation guide).ci/vale/vale.sh wrapper falls back to a pinned Docker image if vale isn't installed locally.# Lint specific files
.ci/vale/vale.sh content/influxdb3/core/**/*.md
# With product config and alert level
.ci/vale/vale.sh --config=content/influxdb/cloud-dedicated/.vale.ini --minAlertLevel=error content/influxdb/cloud-dedicated/write-data/**/*.md
Vale:Vale CLI:Path to vale (or the full path to the binary).Vale can raise different alert levels:
.ci/vale/styles/ contains configuration for the custom InfluxDataDocs style.ci/vale/styles/config/vocabulariescontent/influxdb/cloud-dedicated/.vale.iniFor more configuration details, see Vale configuration.
docs-v2 uses Lefthook to manage Git hooks that run automatically during pre-commit and pre-push.
When you run git commit, Git runs:
We strongly recommend running linting and tests, but you can skip them:
# Skip with --no-verify flag
git commit -m "<COMMIT_MESSAGE>" --no-verify
# Skip with environment variable
LEFTHOOK=0 git commit
# Run all E2E tests
yarn test:e2e
# Run specific E2E specs
node cypress/support/run-e2e-specs.js --spec "cypress/e2e/content/index.cy.js"
For JavaScript code in the documentation UI (assets/js):
assets/js/ns-hugo-imp: namespaceImport debug helpers in your JavaScript module:
import { debugLog, debugBreak, debugInspect } from './utils/debug-helpers.js';
Insert debug statements:
const data = debugInspect(someData, 'Data');
debugLog('Processing data', 'myFunction');
debugBreak(); // Add breakpoint
Start Hugo: yarn hugo server
In VS Code, select "Debug JS (debug-helpers)" configuration
Remember to remove debug statements before committing.
Available test services:
# All code block tests
docker compose --profile test up
# Individual product tests
docker compose run --rm cloud-pytest
docker compose run --rm v2-pytest
docker compose run --rm telegraf-pytest
# Stop monitoring services
yarn test:codeblocks:stop-monitors
--option vs -o)npx hugo --quietyarn build:ts--limit flag--verbose flag to debug conversion issues{{< or {{% in output)index.md) and section pages (index.section.md)pytest.ini, cypress.config.js, lefthook.ymlcompose.yaml, Dockerfile.pytest.github/scripts/ directory./test/ directory.ci/vale/styles/scripts/html-to-markdown.js - CLI wrapperscripts/lib/markdown-converter.js - Core conversion librarydeploy/lambda-edge/markdown-generator/ - Lambda deploymentcypress/e2e/content/markdown-content-validation.cy.js - Validation tests