packages/tar-parser/README.md
Streaming tar archive parsing for JavaScript. tar-parser handles POSIX/GNU/PAX archives incrementally so large tar files can be processed without buffering the full payload.
tar-parser works with fetch() streams. Install it from npm:
npm i remix
The main parser interface is the parseTar(archive, handler) function:
import { parseTar } from 'remix/tar-parser'
let response = await fetch('https://github.com/remix-run/remix/archive/refs/heads/main.tar.gz')
await parseTar(response.body.pipeThrough(new DecompressionStream('gzip')), (entry) => {
  console.log(entry.name, entry.size)
})
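For context on where values like entry.name and entry.size come from, here is a minimal sketch of reading one 512-byte ustar header block. It is not tar-parser's implementation: the helper names readString and readHeader are hypothetical, and the sketch ignores the ustar prefix field and the PAX/GNU extensions that a real parser also handles.

```typescript
// Simplified sketch of the classic ustar header layout (not tar-parser's code).
const BLOCK_SIZE = 512

// Hypothetical helper: decode a NUL-padded text field from a header block
function readString(block: Uint8Array, offset: number, length: number): string {
  let field = block.subarray(offset, offset + length)
  let end = field.indexOf(0) // fields are NUL-terminated or NUL-padded
  return new TextDecoder().decode(end === -1 ? field : field.subarray(0, end))
}

// Hypothetical helper: pull the name and size out of one header block
function readHeader(block: Uint8Array): { name: string; size: number } {
  if (block.length < BLOCK_SIZE) throw new Error('Incomplete header block')
  let name = readString(block, 0, 100) // name field: bytes 0-99
  let size = parseInt(readString(block, 124, 12).trim(), 8) // size field: bytes 124-135, octal ASCII
  return { name, size }
}

// Build an illustrative header for an 11-byte file named "hello.txt"
let block = new Uint8Array(BLOCK_SIZE)
let encoder = new TextEncoder()
block.set(encoder.encode('hello.txt'), 0)
block.set(encoder.encode('00000000013'), 124) // 11 written in octal

let header = readHeader(block)
// File data follows the header, padded up to a multiple of 512 bytes
let paddedSize = Math.ceil(header.size / BLOCK_SIZE) * BLOCK_SIZE
console.log(header.name, header.size, paddedSize)
```

Because each header states the size of the data that follows, a parser can stream or skip an entry's bytes without holding the whole archive in memory.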
If the filenames in an archive use an encoding other than UTF-8, pass the filenameEncoding option:
let response = await fetch(/* ... */)
await parseTar(response.body, { filenameEncoding: 'latin1' }, (entry) => {
  console.log(entry.name, entry.size)
})
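To see why the encoding matters, the sketch below decodes the same filename bytes two ways using the standard TextDecoder API. The byte sequence is an illustrative example, not taken from a real archive, and the sketch does not call tar-parser itself.

```typescript
// Bytes for "café.txt" as a Latin-1 archive would store them (é is the single byte 0xE9)
let rawName = new Uint8Array([0x63, 0x61, 0x66, 0xe9, 0x2e, 0x74, 0x78, 0x74])

// Decoding as UTF-8 fails on 0xE9 and substitutes the replacement character U+FFFD
let asUtf8 = new TextDecoder('utf-8').decode(rawName)

// Decoding as latin1 recovers the intended name
let asLatin1 = new TextDecoder('latin1').decode(rawName)

console.log(asUtf8)
console.log(asLatin1)
```

A mangled name like the first result is usually the sign that an archive needs a non-default filenameEncoding.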
tar-parser performs on par with other popular tar parsing libraries on Node.js. The benchmark below compares it with tar-stream and node-tar on the lodash npm package:
> @remix-run/tar-parser bench /Users/michael/Projects/remix-the-web/packages/tar-parser
> node ./bench/runner.ts
Platform: Darwin (24.0.0)
CPU: Apple M1 Pro
Date: 12/6/2024, 11:00:55 AM
Node.js v22.8.0
┌────────────┬────────────────────┐
│ (index)    │ lodash npm package │
├────────────┼────────────────────┤
│ tar-parser │ '6.23 ms ± 0.58'   │
│ tar-stream │ '6.72 ms ± 2.24'   │
│ node-tar   │ '6.49 ms ± 0.44'   │
└────────────┴────────────────────┘
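The source does not show how bench/runner.ts produces these figures. As a rough sketch only, and assuming the ± value is a sample standard deviation (the README does not say), timings in this format could be gathered and summarized as follows. Both bench and summarize are hypothetical helpers, not part of tar-parser.

```typescript
// Hypothetical helper: format timing samples (in ms) as "mean ms ± stddev".
// Needs at least two samples for the sample standard deviation.
function summarize(samples: number[]): string {
  let mean = samples.reduce((sum, x) => sum + x, 0) / samples.length
  let variance = samples.reduce((sum, x) => sum + (x - mean) ** 2, 0) / (samples.length - 1)
  return `${mean.toFixed(2)} ms ± ${Math.sqrt(variance).toFixed(2)}`
}

// Hypothetical helper: run an async task several times and time each run
async function bench(task: () => Promise<void>, runs = 10): Promise<string> {
  let samples: number[] = []
  for (let i = 0; i < runs; i++) {
    let start = performance.now()
    await task()
    samples.push(performance.now() - start)
  }
  return summarize(samples)
}

// Illustrative samples, not real measurements
let summary = summarize([6, 7, 8])
console.log(summary)
```

To time a parser with this sketch, the task passed to bench would wrap one full parse of the archive.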
Related package: multipart-parser - Fast, streaming multipart parser for JavaScript
tar-parser is based on the excellent tar-stream package (MIT license) and adopts the same core parsing algorithm, utility functions, and many test cases.
See LICENSE