Troubleshooting Common PDF Problems and How to Fix Them
Practical solutions for the most frequent PDF issues including rendering errors, font problems, broken links, and corruption recovery.
Font Display Issues
Font problems are among the most common PDF issues. When a PDF displays with incorrect fonts, garbled characters, or missing text, the root cause is almost always related to font embedding. A PDF that does not embed its fonts relies on the viewer's system having the same fonts installed. If the fonts are not available, the viewer substitutes a different font, which may have different metrics (character widths and heights), causing text to overlap, overflow text boxes, or display incorrectly.
To diagnose font issues, check the document's font embedding status. In Adobe Acrobat, go to File > Properties > Fonts to see a list of all fonts used and whether each is embedded, subset embedded, or not embedded. Fonts listed as "Not Embedded" are the likely culprits. If the PDF must be fixed, re-embedding the fonts is the solution. This requires access to the font files and a tool that can embed them (Acrobat's Preflight tool, Ghostscript, or a PDF library).
Character encoding issues produce a different symptom: text that looks correct on screen but copies as garbage characters. This happens when the PDF's font does not include a proper ToUnicode mapping, which tells the viewer which character each glyph code represents. Without this mapping, the viewer can render the glyph (because it has the font's drawing instructions) but cannot determine the character for text selection and copying. Re-creating the PDF from the source document with a modern PDF producer usually resolves this issue.
Rendering Differences Across Viewers
A PDF that looks perfect in one viewer may display differently in another. While the PDF specification is detailed, different viewers implement it differently, leading to rendering variations. Common areas of inconsistency include transparency handling (older viewers may not support transparency correctly), color management (viewers without ICC color management may display colors differently), and font rendering (hinting and anti-aliasing vary across viewers and operating systems).
Transparency is a frequent source of rendering differences. PDF 1.4 introduced transparency (alpha blending), and while modern viewers handle it well, some older or simpler viewers render transparency incorrectly or ignore it entirely. Documents with transparency effects (drop shadows, semi-transparent watermarks, blending modes) may look different in different viewers. Flattening transparency, which converts transparent elements to opaque ones that look the same, resolves these issues at the cost of editability.
To minimize rendering differences, avoid advanced features when broad compatibility is needed. Use embedded fonts exclusively. Avoid transparency if the document may be viewed in basic viewers. Specify an output intent (ICC color profile) for consistent color. Test the PDF in multiple viewers: Adobe Acrobat (the reference implementation), a web browser's built-in PDF viewer (Chrome uses PDFium, Firefox uses PDF.js), and at least one other viewer (Foxit, Sumatra, or Preview on macOS). If a specific rendering issue is found, determine whether it is a viewer bug or a document structure issue, and adjust accordingly.
Broken Links and Navigation
Broken hyperlinks and navigation issues degrade the usefulness of interactive PDFs. Internal links (pointing to other pages within the same document) break when pages are added, removed, or reordered. External links (pointing to URLs) break when the target web page moves or is deleted. Both are common in documents that have undergone editing after initial creation.
To fix broken internal links, you need to identify them first. Adobe Acrobat can list all links in a document through Edit > Preferences > Page Display > Show All Links, or by using the link editing tool on each page. Programmatic approaches using libraries like PyPDF or pdf-lib can extract all link annotations and their destinations, flagging any that reference non-existent pages or invalid destinations.
Broken external links are harder to detect without actually testing each URL. A script can extract all URLs from a PDF and test each one using HTTP requests, flagging any that return 404 errors, redirects, or connection failures. For documents that will be published, running this link check before publication prevents users from encountering broken links. For long-lived documents, consider using stable URLs (DOIs for academic references, permanent links for government resources) rather than direct URLs that may change. When broken links are found, update them in the PDF using a link editing tool or replace them with archived versions from the Wayback Machine.
Corrupt and Damaged PDFs
PDF corruption can range from minor structural issues to complete unreadability. Common causes include incomplete downloads (the file was not fully transferred), disk errors (data corruption on the storage medium), software crashes during saving (the file was not properly closed), and email system modifications (some systems alter binary attachments). The symptoms range from an error message when opening the file to missing pages, blank pages, or garbled content.
PDF repair tools attempt to reconstruct the file structure from the surviving data. Adobe Acrobat automatically attempts repair when opening a damaged file and will report what it was able to recover. QPDF can attempt to repair structural issues using its "--replace-input" mode with error recovery. Ghostscript can re-process a damaged PDF, effectively creating a new file from whatever content it can read from the damaged one.
For severely corrupted files where automated repair fails, manual recovery techniques may salvage some content. PDF files contain cross-reference tables that map object locations; if these are damaged but the objects themselves are intact, tools like mutool (from MuPDF) can attempt to reconstruct the cross-reference table. If only specific pages are corrupted, extracting the good pages into a new document preserves the recoverable content. As prevention, maintain backups of important PDFs, verify file integrity after transfers using checksums, and avoid saving PDFs to unreliable storage media.
Printing Problems
PDF printing issues include blank pages, incorrect scaling, missing graphics, color shifts, and slow printing. Blank pages when printing typically indicate a transparency or font issue that the print driver cannot handle. Try flattening the document (converting all text to outlines and flattening transparency) before printing. If specific pages are blank, they may contain content in a color space that the printer does not support.
Scaling issues are common: the document prints too large, too small, or with content cut off at the edges. Most PDF viewers default to scaling the page to fit the paper, which reduces the output if the PDF page size is larger than the paper. For documents where exact size matters (architectural drawings, templates with precise dimensions), print at 100% scale ("Actual Size" in the print dialog) and verify that the paper size matches or exceeds the PDF page size.
Slow printing often indicates that the PDF contains complex vector graphics or high-resolution images that overwhelm the printer's processor. Rasterizing the PDF to a fixed resolution (300 DPI for standard printing) before sending it to the printer can dramatically speed up printing by shifting the rendering workload from the printer to the computer. In Adobe Acrobat's print dialog, the "Print As Image" option does this automatically. For recurring print performance issues with specific documents, re-saving the PDF with optimized graphics and downsampled images may be a more permanent solution.
File Size Issues
Unexpectedly large PDF files are a frequent concern. A 10-page document should not be 100 MB, but various factors can cause file bloat. The most common cause is embedded high-resolution images. A single uncompressed 300 DPI photograph can be 25+ MB. Check the images in the document using File > Properties > Fonts (which also lists image resources in some viewers) or by extracting images with pdfimages.
Other causes of large file size include embedded fonts (particularly multiple full font embeddings rather than subsets), duplicate objects (the same image embedded multiple times rather than referenced), incremental save data (accumulated edit history), and embedded files or attachments. A PDF that has been through many rounds of editing can accumulate significant overhead from incremental saves, where each save appends new data without removing the old.
To reduce file size, use the optimization techniques described in the compression and optimization articles: compress images, subset fonts, remove duplicate objects, and save as a new file to eliminate incremental save overhead. Adobe Acrobat's "Reduce File Size" feature and the more detailed "PDF Optimizer" apply these techniques with configurable settings. For a quick reduction, saving with "Save As" (rather than "Save") in most PDF tools creates a clean file without incremental update overhead. For maximum reduction, re-process the file through Ghostscript with appropriate quality settings.
Form and Interactive Element Problems
Interactive PDF elements frequently cause issues across different viewers and platforms. Form fields that work in Adobe Acrobat may not function in web browsers' built-in PDF viewers, which have limited form support. JavaScript calculations and validations may not execute in non-Adobe viewers. Dropdown menus may display differently, and date fields may behave inconsistently across platforms.
To maximize form compatibility, use only basic form features: text fields, checkboxes, radio buttons, and simple dropdowns. Avoid JavaScript-dependent validation and calculations if the form will be filled in non-Adobe viewers. Test the form in the intended filling environment before distributing it. If you must use advanced form features, specify in the form instructions that Adobe Acrobat or Adobe Reader is required.
Digital signature validation problems are common. A signature may show as "Unknown" or "Invalid" in a viewer that does not trust the signing certificate's Certificate Authority. This does not necessarily mean the signature is fraudulent; it may simply mean the viewer does not have the CA in its trust store. To resolve, the recipient can manually add the CA to their trust store, or the signer can use a certificate from a widely trusted CA (one on the Adobe Approved Trust List). Signatures may also show as invalid after any document modification, even permitted modifications like filling other form fields. Understanding the difference between a signature that is cryptographically invalid (the document was tampered with) and one that is unverifiable (the certificate is not trusted) is important for proper handling.