apps/docs/memorybench/quickstart.mdx
bun run src/index.ts run -p supermemory -b longmemeval -j gpt-4o -r my-first-run
bun run src/index.ts serve
Open http://localhost:3000 to see results visually.
# Check run status
bun run src/index.ts status -r my-first-run
# View failed questions for debugging
bun run src/index.ts show-failures -r my-first-run
Run the same benchmark across multiple providers:
bun run src/index.ts compare -p supermemory,mem0,zep -b locomo -j gpt-4o
Results are saved to data/runs/{runId}/report.json.
{
"accuracy": 0.72,
"accuracyByType": {
"single-hop": 0.85,
"multi-hop": 0.65,
"temporal": 0.70,
"adversarial": 0.68
},
"avgLatency": 1250,
"totalQuestions": 50
}
Head to CLI Reference to play around with all the commands, or check out Architecture to understand how MemoryBench works under the hood.