Preview
What Gets Extracted
The page extracts PDF.js text and page previews, then scans indirect PDF streams for image XObjects, font files, ICC profiles, XML/text streams, and unknown binary payloads. JPEG, JPEG 2000, JBIG2, PNG, TIFF, font, ZIP, XML, text, and many raw streams are identified from magic bytes and PDF dictionary hints. Flate, ASCIIHex, ASCII85, and RunLength streams are decoded when possible.