docs/tools/pdf-to-pdfa.md
PDF/A is an ISO-standardized subset of PDF designed for long-term digital preservation. This tool converts your standard PDFs into PDF/A-compliant documents using Ghostscript running directly in your browser.
| Version | Restrictions | Best For |
|---|---|---|
| PDF/A-1b | Strictest. No transparency, no layers. | Maximum compatibility with older archival systems |
| PDF/A-2b | Allows transparency and JPEG2000. | General-purpose archiving (recommended) |
| PDF/A-3b | Allows embedded attachments of any format. | Archiving documents with source data files |
PDF/A-2b is selected by default and works for the vast majority of archiving needs.
Some PDFs contain features that prevent clean PDF/A conversion -- unusual font encodings, complex transparency stacks, or broken internal structures. Enabling Pre-flatten rasterizes the PDF at 300 DPI before conversion, which guarantees compliance at the cost of making text non-selectable. Use it as a fallback when the standard conversion produces errors.
Pre-flattening requires the PyMuPDF WASM engine.