Preview

Page previews will appear here after extraction.

What Gets Extracted

The page extracts PDF.js full-document text, per-page text, and page previews, then scans indirect PDF streams for image XObjects, font files, ICC profiles, XML/text streams, Illustrator private roundtrip data, and unknown binary payloads. JPEG, JPEG 2000, JBIG2, PNG, TIFF, TrueType, OpenType, CFF/CIDFontType0C, ZIP, Zstandard, XML, text, and many raw streams are identified from magic bytes and PDF dictionary hints. Flate, ASCIIHex, ASCII85, RunLength, and recognized Illustrator Zstandard private-data streams are decoded when possible. Files that are not real images, fonts, or extracted text are grouped under Other Files and placed in a separate ZIP folder.

Drop PDF file

Extraction runs locally in this browser. No upload is performed.