Overview
The Get PDF Metadata operation extracts metadata from a PDF file. The PDF content can be supplied in one of three ways: as binary data from a previous node, as a Base64-encoded string, or via a URL pointing to the PDF file.
Typical use cases include:
- Automatically retrieving document properties such as author, title, creation date, and other embedded metadata.
- Integrating PDF metadata extraction into automated workflows for document management, auditing, or indexing.
- Validating PDF files before further processing by checking their metadata.
For example, you might use this node to extract metadata from invoices received as PDFs in an email workflow or to gather document details from PDFs stored on a web server.
Properties
| Name | Meaning |
|---|---|
| Input Data Type | How the PDF file is provided: Binary Data (from a previous node), Base64 String, or URL |
| Input Binary Field | Name of the binary property containing the PDF file (used only if Input Data Type is Binary Data) |
| Base64 PDF Content | Base64-encoded string of the PDF content (used only if Input Data Type is Base64 String) |
| PDF URL | URL to the PDF file to extract metadata from (used only if Input Data Type is URL) |
| Output File Name | Filename for the output metadata JSON file (default: pdf_metadata.json) |
| Async | Enable asynchronous processing (boolean flag) |
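The table implies that exactly one of the three content fields matters, depending on Input Data Type. A minimal sketch of that rule follows. The type and field names (`GetPdfMetadataParams`, `inputDataType`, `binaryPropertyName`, and so on) are hypothetical and chosen for illustration; they are not the node's internal identifiers.

```typescript
// Hypothetical parameter shape mirroring the Properties table.
// These names are illustrative assumptions, not the node's real identifiers.
type InputDataType = "binaryData" | "base64" | "url";

interface GetPdfMetadataParams {
  inputDataType: InputDataType;
  binaryPropertyName?: string; // "Input Binary Field"
  base64Content?: string;      // "Base64 PDF Content"
  pdfUrl?: string;             // "PDF URL"
  outputFileName?: string;     // "Output File Name" (default: pdf_metadata.json)
  async?: boolean;             // "Async"
}

// Returns a list of configuration problems. An empty list means the field
// that matches the selected input type has been filled in.
function validateParams(p: GetPdfMetadataParams): string[] {
  const problems: string[] = [];
  const filled = (s?: string): boolean => (s ?? "").trim() !== "";

  switch (p.inputDataType) {
    case "binaryData":
      if (!filled(p.binaryPropertyName)) {
        problems.push("Input Binary Field is required for Binary Data input");
      }
      break;
    case "base64":
      if (!filled(p.base64Content)) {
        problems.push("Base64 PDF Content is required for Base64 String input");
      }
      break;
    case "url":
      if (!filled(p.pdfUrl)) {
        problems.push("PDF URL is required for URL input");
      }
      break;
  }
  return problems;
}
```

Fields that belong to the other input types are simply ignored here, which matches the "used only if" wording in the table.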
Output
The node outputs a JSON object containing the extracted metadata from the PDF. This typically includes standard PDF metadata fields such as:
- Title
- Author
- Subject
- Keywords
- Creator
- Producer
- CreationDate
- ModDate
The exact structure depends on the PDF's embedded metadata but will be presented as JSON.
If the node also produces binary data (for example, the metadata file), it is available as a downloadable JSON file named according to the "Output File Name" property.
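PDF files store dates such as CreationDate and ModDate in the form `D:YYYYMMDDHHmmSSOHH'mm'`, where everything after the year is optional. Whether this node returns dates in that raw form or already converted is not stated here, so the sketch below only applies if raw values come through. The helper name `pdfDateToIso` is hypothetical, and treating a missing timezone offset as UTC is an assumption.

```typescript
// Matches a PDF date string: optional "D:" prefix, 4-digit year, then optional
// month, day, hour, minute, second, and an optional offset (Z, +HH'mm', -HH'mm').
const PDF_DATE =
  /^(?:D:)?(\d{4})(\d{2})?(\d{2})?(\d{2})?(\d{2})?(\d{2})?(?:([Zz+-])(?:(\d{2})'?(\d{2})?'?)?)?$/;

// Hypothetical helper: converts a raw PDF date string to an ISO 8601 UTC string.
// Returns null when the input does not look like a PDF date.
function pdfDateToIso(value: string): string | null {
  const m = PDF_DATE.exec(value.trim());
  if (m === null) return null;

  const num = (s: string | undefined, fallback: number): number =>
    s === undefined ? fallback : parseInt(s, 10);

  // Omitted month and day default to 1; omitted time parts default to 0.
  const asIfUtc = Date.UTC(
    num(m[1], 0), num(m[2], 1) - 1, num(m[3], 1),
    num(m[4], 0), num(m[5], 0), num(m[6], 0),
  );

  // Assumption: a missing offset or "Z" is treated as UTC.
  let offsetMinutes = 0;
  if (m[7] === "+" || m[7] === "-") {
    const magnitude = num(m[8], 0) * 60 + num(m[9], 0);
    offsetMinutes = m[7] === "-" ? -magnitude : magnitude;
  }

  // Local time = UTC + offset, so UTC = local time - offset.
  return new Date(asIfUtc - offsetMinutes * 60000).toISOString();
}
```

For example, `pdfDateToIso("D:20240115103000+01'00'")` yields `2024-01-15T09:30:00.000Z`, and a string that does not match the pattern yields `null` so the caller can keep the original value.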
Dependencies
- Requires access to the PDF file as binary data, a Base64 string, or a reachable URL.
- Depends on an external PDF processing service or library integrated within the node; which one is not specified here.
- May require API authentication credentials configured in n8n to interact with the PDF processing backend.
Troubleshooting
Common Issues:
- Providing an incorrect binary field name when using binary data input will cause the node to fail to locate the PDF file.
- Invalid base64 strings or inaccessible URLs will result in errors during metadata extraction.
- Network issues or permission restrictions may prevent accessing the PDF URL.
Error Messages:
- Errors related to missing or invalid input data usually indicate misconfiguration of the input properties.
- Authentication or API errors suggest missing or incorrect API credentials.
- Timeout or network errors when fetching the PDF from a URL indicate connectivity problems.
Resolutions:
- Verify that the binary property name exactly matches the name used in the previous node.
- Ensure base64 content is correctly encoded and complete.
- Confirm the URL is reachable and publicly accessible or properly authenticated.
- Check API credentials and permissions in n8n settings.
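The Base64 check in particular can be done before the node runs, for example in a preceding code step. The following is a minimal sketch of such a pre-check, not part of the node itself; the function name `checkBase64Pdf` is hypothetical. It relies on the fact that a PDF normally begins with the header `%PDF-` followed by a version digit, which always encodes to a Base64 string starting with `JVBERi0`.

```typescript
// Hypothetical pre-check for a Base64 string that is supposed to hold a PDF.
// Returns a list of problems; an empty list means no obvious issue was found.
function checkBase64Pdf(input: string): string[] {
  const problems: string[] = [];

  // Ignore whitespace and line breaks, which often sneak in when copying.
  const cleaned = input.replace(/\s+/g, "");
  if (cleaned.length === 0) {
    problems.push("content is empty");
    return problems;
  }

  // Standard Base64 alphabet with up to two "=" padding characters at the end.
  if (!/^[A-Za-z0-9+/]*={0,2}$/.test(cleaned)) {
    problems.push("contains characters outside the standard Base64 alphabet");
  }

  // Padded Base64 always has a length that is a multiple of 4.
  if (cleaned.length % 4 !== 0) {
    problems.push("length is not a multiple of 4 (possibly truncated)");
  }

  // "%PDF-" plus a version digit encodes to a string beginning with "JVBERi0".
  if (cleaned.slice(0, 7) !== "JVBERi0") {
    problems.push("decoded content does not start with the %PDF- header");
  }

  return problems;
}
```

A passing check does not guarantee that the PDF is valid; it only rules out the most common copy-and-paste and truncation mistakes before the request is sent.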