1. PDF Incremental Updates Retain Revision History
When a PDF is saved with incremental updates, new versions of changed objects are appended to the file and the old ones are left in place, so earlier revisions can often be recovered.
"a PDF is mostly just a bunch of objects... update these objects to overwrite them by appending a new 'generation' of an object" (aidos).
"These PDFs apparently used the 'incremental update' feature of PDF, where edits... are merely appended" (layer8).
2. Tools for PDF Inspection and Recovery
Users recommend tools like pdfresurrect, mutool, and qpdf to extract revisions and analyze structure.
"The PDF format allows for previous changes to be retained... This tool extracts all previous revisions" (flotzam, quoting pdfresurrect).
"I recommend mutool for decompressing the PDF... mutool clean -d in.pdf out.pdf" (aidos).
"There needs to be better tooling... using qpdf to export QDF" (alhirzel).
3. Redaction Failures and Secure Sharing Methods
Poorly executed redactions by journalists leaked data. Suggested fixes include rasterizing pages or printing and rescanning them to strip hidden content and metadata, with the caveat that color printers add tracking dots to the printout.
"the journalists did the redactions. The metadata timestamps... show that the versions were created three weeks before" (libroot).
"The 'print and scan physical papers back to a PDF of images' technique... is looking better" (password4321).
"Note that all (edit: color-/ink-) printers have 'invisible... yellow dotcodes'" (cookiengineer).