PDF Metadata: Find, Change and Remove Hidden PDF Data

Shadow of a hacker accessing metadata in a computer monitor
Photo by Mika Baumeister on Unsplash

Interested in learning more about PDF metadata? Read on to learn how to find and edit PDF metadata, and how to remove third-party metadata from PDFs using PDFpen.

PDF metadata is data about a PDF. Applications automatically add metadata to PDF files using information the user provides, such as their name, and data gathered in the process of creating and saving documents. PDF creators also add metadata to their documents manually. They might add keywords to make their PDF files easier to catalog, index and search; to comply with accessibility standards; or to optimize them for search engines.

How to find metadata in PDF files

1. Open any PDF file in PDFpen and click the Inspector icon in the top-right corner of the toolbar. You can also access the Inspector by choosing Window > Inspector or using the keyboard shortcut ⌘-Option-I.

2. A dialog box will appear. Choose the Document Information tab. The information you see is the most easily accessible metadata. Here’s what it looks like:

PDFmetadata_2
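
For readers curious about where these fields live in the file itself, here is a minimal Python sketch that looks for them in the raw bytes of a PDF. It rests on several assumptions: the document information dictionary is stored uncompressed and unencrypted, and its values are simple literal strings in parentheses. Many real PDFs don't meet those conditions, and keys such as /Title can also appear in bookmarks, so treat this as an illustration and not a parser. The function name read_info_fields is hypothetical, and the mapping from the standard PDF keys to the Inspector's labels is our guess.

```python
import re

# Standard PDF information-dictionary keys, mapped to the labels shown in
# the Inspector (this mapping is an assumption made for illustration).
INFO_KEYS = {
    b"Title": "Title",
    b"Author": "Author",
    b"Subject": "Subject",
    b"Keywords": "Keywords",
    b"Creator": "Application",
    b"Producer": "Producer",
    b"CreationDate": "Created",
    b"ModDate": "Modified",
}

# Matches "/Key (value)" where value is a literal string. Escaped characters
# such as \) are allowed; nested unescaped parentheses are not handled.
_PATTERN = re.compile(
    rb"/(" + b"|".join(INFO_KEYS) + rb")\s*\(((?:\\.|[^\\()])*)\)",
    re.DOTALL,
)


def read_info_fields(pdf_bytes: bytes) -> dict[str, str]:
    """Return the first literal-string value found for each known key."""
    fields: dict[str, str] = {}
    for key, raw in _PATTERN.findall(pdf_bytes):
        label = INFO_KEYS[key]
        if label not in fields:
            # Latin-1 never fails to decode; real PDFs may use other
            # encodings, and escape sequences are left as-is here.
            fields[label] = raw.decode("latin-1")
    return fields


if __name__ == "__main__":
    # Made-up sample bytes standing in for part of a PDF file.
    sample = (
        b"1 0 obj\n<< /Title (Quarterly report) /Author (J. Doe) "
        b"/Producer (ExampleWriter 1.0) >>\nendobj\n"
    )
    print(read_info_fields(sample))
```

To try it on a real file, read the file in binary mode and pass the bytes to read_info_fields. An empty result doesn't mean the PDF has no metadata, only that none was stored in the simple form this sketch looks for.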

How to edit metadata

To add or edit metadata, open the Inspector and choose Document Information. You can manually enter and edit these fields: Title, Author, Subject and Keywords. You can’t edit the Created, Modified, File Size, Page Count, Page Size, Application and Producer fields; these are automatically generated.

Hidden metadata

Some metadata isn’t easy to see. This hidden data is also known as third-party metadata. In PDF files, it may include the name of the program used to convert the file to PDF format and the metadata creation date. In JPEG files, it might include thumbnails.

Finding this hidden data requires some knowledge of computing, but it isn’t very hard to do. Someone who is computer-savvy could use a program such as the Unix strings utility (see the Wikipedia article on strings (Unix) for more information) to dig for it.
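
To give a sense of how little effort this takes, here is a rough Python sketch of the idea behind strings: scan a file's bytes and keep every run of printable ASCII characters that is at least four characters long (four is the utility's default minimum). The function name printable_strings and the sample bytes are made up for this example, and the sketch only imitates the basic behavior of the real tool.

```python
import re


def printable_strings(data: bytes, min_length: int = 4) -> list[str]:
    """Return runs of printable ASCII at least min_length bytes long."""
    # 0x20-0x7E covers the space character through the tilde.
    pattern = re.compile(rb"[\x20-\x7e]{%d,}" % min_length)
    return [match.decode("ascii") for match in pattern.findall(data)]


if __name__ == "__main__":
    # Made-up bytes standing in for a fragment of a PDF file.
    sample = b"\x00\x01%PDF-1.7\x00\xff/Producer (ExampleWriter 1.0)\x02ab\x03"
    for text in printable_strings(sample):
        print(text)
```

On a real PDF, you would read the file in binary mode, pass the bytes to printable_strings, and then filter the results for words such as Producer or Creator. Data stored in compressed parts of the file won't show up this way.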

Removing third-party metadata from PDFs

PDFpen enables you to remove third-party metadata from PDF files with just a few clicks. Here’s how:

  1. Open the PDF file.
  2. Choose File > Create Optimized PDF. The Optimize Images dialog box below will appear.
  3. Check the “Remove third-party metadata” box at the bottom. 
  4. Click Create to create an optimized copy of your original PDF with no third-party metadata.
PDFmetadata_3

Why remove third-party metadata

Third-party metadata contained in PDF documents can reveal sensitive information about you, your clients, and the organization you work for. To avoid exposing details that could compromise confidentiality, remove both visible and hidden sensitive information from PDF documents, especially if you’re a legal professional. As a bonus, removing third-party metadata also reduces PDF file sizes.

Third-party metadata removal brings added security

PDFpen has supported third-party metadata removal since version 12. Combined with redaction and password protection, the ability to remove third-party metadata makes PDFpen an even more effective tool for securing PDF files.