How to Reduce PDF Image File Size
Downsampling and Recompression in PDF Optimizer
Images are the single largest contributor to PDF file size. A document containing a handful of press-quality photographs can easily reach 20 to 50 MB, most of that weight sitting in image data at resolutions far higher than the output medium requires. PDF Optimizer addresses this through two operations: downsampling, which reduces image resolution, and recompression, which re-encodes image data using a more efficient algorithm. Both are configurable, both apply separately to different image types, and together they typically account for the majority of file size reduction in any optimization pass.
Images Are the Biggest Driver of PDF File Size
To understand why, consider what happens when a document is created for print. A professional print workflow typically requires images at 300 DPI or higher. At that resolution, a single full-page image at standard print dimensions can contain 25 million pixels or more. That image data is embedded in the PDF in full.
If that PDF is then distributed digitally, the image resolution is wildly in excess of what any screen can display. A standard monitor renders at approximately 96 DPI. A high-resolution display might render at 150 to 200 DPI. The press-quality image in the PDF is carrying three to nine times more data than the output device can use. That excess data adds nothing to the viewing experience and costs significant storage and bandwidth for every copy of the document.
Downsampling reduces the image to the resolution actually needed for the intended output. The visual result is indistinguishable on screen. The file size reduction is substantial.
Image Downsampling: How It Works
Downsampling reduces the resolution of images in a PDF by removing pixels. You specify a target resolution in your JSON profile, and PDF Optimizer reduces any image exceeding that target down to the specified DPI.
The target resolution should match the intended output medium:
• 72 to 96 DPI for screen-only documents and web PDFs where bandwidth or storage is the priority
• 150 DPI for general digital distribution where moderate quality is acceptable
• 300 DPI for print-ready output where image fidelity must be preserved
PDF Optimizer applies downsampling settings separately to color images, grayscale images, and monochrome (black-and-white) images. This matters because different image types in the same document often have different optimal resolution targets. A full-color photograph and a black-and-white scanned form in the same PDF can each be downsampled to the resolution appropriate for their content type.
Recompression: How It Works
When PDF Optimizer downsamples an image, the image data is automatically recompressed. You can also apply recompression independently to images that meet your resolution target but are using an inefficient compression algorithm or an unnecessarily high quality setting.
Recompression decompresses the existing image data and re-encodes it using the algorithm and quality value you specify. PDF Optimizer supports the following compression algorithms:
• JPEG: Lossy compression for photographs and continuous-tone color or grayscale images. Efficient at high compression ratios with minimal perceptible quality loss at moderate quality settings.
• ZIP (Deflate): Lossless compression. No quality loss. Best suited for images with large areas of flat color, diagrams, charts, and screenshots.
• Flate: An alias for ZIP/Deflate. Lossless compression appropriate for the same image types.
• JBIG2: Highly efficient lossy or lossless compression for monochrome (black-and-white) images. Produces significantly smaller files than older bi-level compression methods like CCITT. Ideal for scanned documents and forms with monochrome content.
When to Downsample and When Not To
Downsampling is appropriate when the destination resolution of the document is lower than the resolution of the images it contains. For web PDFs, screen-distributed reports, email attachments, and document portals serving end users on standard displays, downsampling to 150 DPI or lower is almost always appropriate.
Downsampling is not appropriate when:
• The PDF will be used for professional printing and image fidelity must be preserved at full resolution
• The document will be re-exported or processed downstream and the full-resolution source data needs to be retained
• The images are already at or below your target resolution, in which case downsampling has no effect
For workflows that require both a print-quality version and a screen-distribution version of the same document, the correct approach is to run two separate profiles: a printing.json profile for the high-resolution version and a compressionHigh.json or similar profile for the screen version. Because profiles are applied consistently, both versions are produced reliably at any scale.
Configuring Downsampling and Recompression in a JSON Profile
Downsampling and recompression settings are specified per image type in the JSON profile: color, grayscale, and monochrome each have independent settings for resolution target, compression algorithm, and quality value.
A profile targeting screen distribution might downsample color and grayscale images to 150 DPI with JPEG compression at quality 75, and downsample monochrome images to 300 DPI with JBIG2 compression. A print profile would leave resolution unchanged and apply lossless compression only.
Like all PDF Optimizer settings, these options are off by default. You specify only the operations that apply to your use case.
Try PDF Optimizer for free to see how well it reduces file size without losing image quality.
— Start Your Trial at datalogics.com
Frequently Asked Questions
What is image downsampling in a PDF?
Image downsampling reduces the resolution of images embedded in a PDF by removing pixels, reducing the amount of image data in the file. You specify a target DPI in the PDF Optimizer JSON profile, and any image exceeding that resolution is reduced to the target. Downsampling is the most effective single operation for reducing PDF file size when the document contains photographs or other high-resolution images.
What compression algorithms does PDF Optimizer support for images?
PDF Optimizer supports JPEG for continuous-tone color and grayscale images, ZIP and Flate (both lossless, both suitable for flat-color images and diagrams), and JBIG2 for monochrome images. You can specify the algorithm and quality value independently for color, grayscale, and monochrome images in your JSON profile.
What is the difference between downsampling and recompression?
Downsampling reduces image resolution by removing pixels. Recompression re-encodes existing image data using a different compression algorithm or quality setting without necessarily changing the resolution. Both operations reduce file size and can be applied together or independently. Downsampling automatically triggers recompression in PDF Optimizer, but you can also apply recompression alone to images that are already at an appropriate resolution but using an inefficient compression method.