Someone files a GDPR Article 17 request — the right to erasure. You pull up a PDF that contains their personal data alongside a dozen other people's, and now you have to remove their information without destroying the rest of the document. Or you're producing documents in response to a subpoena and need to strip personal data you're not authorized to disclose. Or a report is heading to an outside vendor and needs to go out without employee names attached.
These are the everyday moments where GDPR redaction actually happens. This guide walks through which fields to remove, how to make the removal stand up to an audit, and the common mistakes that turn a good-faith redaction into a reportable data breach.
What GDPR Actually Requires
The General Data Protection Regulation, and specifically Article 17 (right to erasure, "right to be forgotten"), grants data subjects the right to have their personal data deleted in certain circumstances. Relevant to redaction, the law expects that when personal data must be removed — whether in response to an erasure request, a data minimisation exercise, or third-party disclosure — the removal is actually effective.
What counts as "personal data" under GDPR is broader than most people realise. It includes direct identifiers like names, email addresses, phone numbers, and national ID numbers. But it also covers indirect identifiers that can be linked back to an individual: IP addresses, device identifiers, employee numbers, photographs, handwritten signatures, and even pseudonymised identifiers if the key to re-identify them still exists. For health, financial, or criminal-history data, additional "special category" rules apply (Article 9), and the bar for protection is higher still.
A redaction that leaves the original data recoverable inside the PDF file fails the test. Supervisory authorities across the EU have consistently treated recoverable-redaction incidents as personal data breaches, triggering notification obligations and, in some cases, administrative fines.
Which Fields to Redact
Before opening the PDF, decide what to remove. A good starting checklist for a GDPR-oriented redaction pass:
- Names — full names, preferred names, and signature blocks at the bottom of pages
- Contact details — email addresses, phone numbers, postal addresses, including partial addresses like "via Roma 12"
- National and internal identifiers — tax IDs (codice fiscale, NIE, SSN equivalents), passport numbers, employee IDs, customer numbers, case numbers
- Dates that narrow to an individual — date of birth, date of hire, date of a specific medical encounter; avoid the instinct to keep "just the year" if it combined with another attribute re-identifies the person
- Online identifiers — IP addresses, device UUIDs, session tokens, usernames
- Health, financial, and special-category data — diagnoses, prescriptions, account numbers, salary figures, religious affiliation, union membership
- Free-text fields — notes, comments, email subjects, handwritten annotations on scanned forms. These leak more personal data than any structured field.
- Metadata and headers — running headers that repeat a name on every page, the document author field, and timestamps that tie back to a specific person
When in doubt, err toward redacting. A redaction you did not need costs you nothing; a redaction you failed to apply is reportable.
How to Redact the PDF Correctly
The single most important property of a GDPR redaction is that the removed content must be unrecoverable from the exported file. Not hidden — removed. Here is the workflow:
-
Work on a copy. Keep the original under access control. Your redaction work happens on a duplicate so you always have a reference for the audit log.
-
Upload to a redaction tool that permanently removes content. Open the document in RedaktPDF. Avoid any "PDF markup" or "annotation" tool that places a black rectangle on top of the content — those can usually be peeled off in seconds. What you need is a tool that physically strips the underlying text and image objects from the PDF.
-
Whiteout every occurrence. Walk the document page by page. Pay special attention to: running headers and footers (easy to miss), form fields with pre-filled values, table rows, signature blocks, and any scanned attachment pages. For image-based scans, enable OCR so the tool can find text inside the image and let you target it precisely.
-
Export and verify. Download the redacted PDF and open it in a separate viewer. Try three things: (a) copy-paste from the redacted region — you should get nothing; (b) text-search for a redacted term — it should not match; (c) if you have a PDF inspector, look at the content stream and confirm the objects are gone. These three checks take under a minute and are the difference between a defensible redaction and a liability.
-
Log the redaction. Record what was redacted, from which document, when, and under what legal basis (e.g. "Article 17 erasure request REF-2026-0142"). If challenged, you will be asked for this record.
For especially sensitive documents, consider uploading with a signed-in account so the file is end-to-end encrypted in your browser before reaching our servers — the plaintext never leaves your device.
Common Mistakes That Turn a Redaction into a Breach
A few patterns account for almost every failed GDPR redaction.
Black-box overlays that are not actually redactions. A rectangle painted on top of text in a PDF annotation tool is not redaction. The underlying text stays in the file's content stream and can be recovered with a copy-paste. We cover this in more depth here. If your workflow ever produced a document using "draw a shape over the text," assume every redaction you made this way is recoverable.
Forgetting metadata. Document metadata (author, title, subject, keywords) travels with the file. An otherwise-perfect redaction that leaves "Author: Jane Doe" in the metadata has not removed Jane Doe from the file. Clear metadata fields as part of the export, or use a tool that does it for you.
Leaving running headers and footers untouched. Many documents carry the subject's name at the top or bottom of every page. It is easy to redact the big, bold name on page 1 and miss the small header repeating it on pages 2 through 40.
Over-reliance on search-and-destroy. Running a search for "John Smith" and redacting every hit misses handwritten signatures, initials ("JS"), and name variants. Always do a visual page-by-page pass after any automated sweep.
Keeping partial identifiers "because they're harmless." A birth date, a postal code, and an employer together can re-identify most individuals. Under GDPR, that combination is personal data. Treat it as such.
Working on the final copy, not a duplicate. If the redaction corrupts the file or you realise a field was missed after export, you want the original intact.
Making the Redaction Defensible
When a supervisory authority reviews how your organisation handled an erasure request, they care about two things: did the personal data actually get removed, and can you show that you followed a deliberate process. Three habits cover both:
- Use a tool whose redaction is permanent and documented. You should be able to point to a statement that the tool removes content at the file-object level, not visually.
- Verify every redaction before release. The three-check process above (copy-paste, search, inspect) should be a standing step in your workflow.
- Keep a short audit log. Document, redacted fields, legal basis, date, operator. One row per redaction. When a DPO or regulator asks, you have the answer.
For more detail on the underlying redaction mechanics, see how to redact sensitive information from a PDF and PDF redaction vs. black-box overlay. When you are ready, open the RedaktPDF redaction tool and start with a duplicate of the file.
Ready to try RedaktPDF?
Edit, redact, and annotate PDFs directly in your browser — free and encrypted.
Get startedRelated tools
Redakt PDF
Permanently redact PDFs online free. Whiteout strips sensitive data at the file level — not just a black overlay. E2E encrypted. GDPR-ready. No sign-up.
PDF Whiteout
Cover text or images in a PDF with whiteout boxes online free. Choose any fill color for redaction or correction. Files deleted after 2 hours. No sign-up.
Secure Editor
Edit PDFs with AES-256 end-to-end encryption. Encrypted before upload — servers never see plaintext. Passkey and MFA supported. Zero-knowledge architecture.