Actions80
- Extract Text From Word
- Find And Replace Text
- Convert PDF To Editable PDF Using OCR
- Create Swiss QR Bill
- Split PDF By Barcode
- Split PDF By Swiss QR
- Split PDF By Text
- Split PDF Regular
- Create PDF/A
- Convert HTML To PDF
- Convert Markdown To PDF
- Upload File To PDF4me
- Add Attachment To PDF
- Add Barcode To PDF
- Add Form Fields To PDF
- Fill PDF Form
- Add HTML Header Footer
- Add Image Stamp To PDF
- Add Margin To PDF
- Add Page Number To PDF
- Add Text Stamp To PDF
- AI-Invoice Parser
- AI-Process HealthCard
- AI-Process Contract
- Generate Barcode
- Classify Document
- Parse Document
- Linearize PDF
- Flatten PDF
- Convert To PDF
- Json To Excel
- Convert PDF To Excel
- Convert PDF To Word
- Convert PDF To PowerPoint
- Convert VISIO
- Crop Image
- Delete Blank Pages From PDF
- Delete Unwanted Pages From PDF
- Extract Pages
- Merge Multiple PDFs
- Overlay PDFs
- Rotate Document
- Rotate Page
- Sign PDF
- URL to PDF
- Add Image Watermark To Image
- Add Text Watermark To Image
- Compress Image
- Convert Image Format
- Create Images From PDF
- Flip Image
- Get Image Metadata
- Image Extract Text
- Remove EXIF Tags From Image
- Replace Text With Image
- Replace Text With Image In Word
- Resize Image
- Rotate Image
- Rotate Image By EXIF Data
- Compress PDF
- Get PDF Metadata
- Repair PDF Document
- Get Document From Pdf4me
- Update Hyperlinks Annotation
- Protect Document
- Unlock PDF
- Disable Tracking Changes In Word
- Enable Tracking Changes In Word
- Generate Document Single
- Generate Documents Multiple
- Get Tracking Changes In Word
- Read Barcode From Image
- Read Barcode From PDF
- Read SwissQR Code
- Extract Form Data From PDF
- Extract Pages From PDF
- Extract Attachment From PDF
- Extract Text By Expression
- Extract Table From PDF
- Extract Resources
Overview
This node extracts form data from a PDF document. It is useful when you need to programmatically retrieve filled-in form values from PDF files, such as surveys, applications, contracts, or any PDF with interactive form fields. The node supports multiple input methods for the PDF file: binary data from a previous node, a base64 encoded string, or a URL pointing to the PDF.
Practical examples:
- Automatically extracting user-submitted data from PDF forms uploaded via a web form.
- Processing PDF invoices or contracts to extract key information without manual data entry.
- Integrating with document management systems to parse and index form data from stored PDFs.
Properties
| Name | Meaning |
|---|---|
| Input Data Type | Choose how to provide the PDF file: - Binary Data (from previous node) - Base64 String - URL |
| Input Binary Field | Name of the binary property containing the PDF file (default "data"). Used only if Input Data Type is Binary Data |
| Base64 PDF Content | Base64 encoded content of the PDF document. Used only if Input Data Type is Base64 String |
| PDF URL | URL to the PDF file to extract form data from. Used only if Input Data Type is URL |
| Document Name | Name of the document used during processing (default "document.pdf") |
| Advanced Options | Custom JSON profiles to adjust API call properties, e.g., setting output data format or other specific options |
Output
The node outputs an array of JSON objects representing the extracted form data from the PDF. Each object corresponds to one input item processed and contains the parsed form field names and their respective values.
If the PDF contains binary data or attachments, those are not part of this output; the focus is strictly on form data extraction.
Dependencies
- Requires access to an external PDF processing API service that supports form data extraction.
- Needs proper API authentication configured in n8n (e.g., an API key credential).
- Internet access may be required if providing a PDF URL.
Troubleshooting
- Common issues:
- Providing an incorrect binary property name will cause the node to fail to find the PDF file.
- Invalid base64 strings or inaccessible URLs will result in errors.
- If the PDF does not contain any form fields, the output will be empty or minimal.
- Error messages:
- Errors related to missing or invalid input data usually indicate misconfiguration of input properties.
- API errors might occur due to authentication failure or exceeding usage limits.
- Resolutions:
- Verify the binary property name matches the actual binary data field.
- Ensure base64 content is correctly encoded and complete.
- Confirm the URL is accessible and points directly to a valid PDF file.
- Check API credentials and permissions.
Links and References
- PDF4me API Documentation
- General info on PDF form fields: https://www.adobe.com/content/dam/acom/en/devnet/pdf/pdfs/PDF32000_2008.pdf (ISO 32000-1 standard)