Overview
The Image Extract Text node extracts text from an image. The image can be supplied as binary data, a base64-encoded string, or a URL pointing to the image file. Typical uses include digitizing printed documents, processing scanned forms, and pulling text out of images for later steps in an automation workflow.
Practical examples include:
- Extracting text from scanned invoices or receipts to automate accounting processes.
- Reading text from photographs of documents taken by mobile devices.
- Processing screenshots or images containing textual data for content analysis.
Properties
| Name | Meaning |
|---|---|
| Input Data Type | Choose how to provide the image file to extract text from. Options: Binary Data, Base64 String, URL |
| Input Binary Field | Name of the binary property that contains the image file (used when Input Data Type is Binary Data) |
| Base64 Image Content | Base64 encoded image content (used when Input Data Type is Base64 String) |
| Image URL | URL to the image file to extract text from (used when Input Data Type is URL) |
| Output File Name | Name of the output file that holds the extracted text in JSON format |
| Async | Enable asynchronous processing (true/false) |
| Binary Data Output Name | Custom name for the binary data in n8n output |
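The table implies that exactly one of the three input properties is used, depending on Input Data Type. A minimal Python sketch of that rule follows. The function name `select_input` and the use of the property labels as dictionary keys are assumptions made for illustration; they are not the node's internal parameter names.

```python
from typing import Mapping, Tuple

# Assumed mapping from each Input Data Type option to the property that
# must be filled in for it (labels copied from the properties table).
REQUIRED_FIELD = {
    "Binary Data": "Input Binary Field",
    "Base64 String": "Base64 Image Content",
    "URL": "Image URL",
}


def select_input(params: Mapping[str, str]) -> Tuple[str, str]:
    """Return (property name, value) for the chosen Input Data Type."""
    data_type = params.get("Input Data Type", "")
    if data_type not in REQUIRED_FIELD:
        raise ValueError(f"Unsupported Input Data Type: {data_type!r}")

    field = REQUIRED_FIELD[data_type]
    value = params.get(field, "").strip()
    if not value:
        # The matching property is empty or missing.
        raise ValueError(
            f"{field!r} is required when Input Data Type is {data_type!r}"
        )
    return field, value
```

Checking a configuration this way before running a workflow surfaces a missing field early, rather than as an extraction error later.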
Output
The node outputs the extracted text in JSON format. The JSON contains the recognized text content extracted from the input image. Additionally, the output can include binary data named according to the "Binary Data Output Name" property, which holds the extracted text file content.
If asynchronous processing is enabled, the extraction runs without blocking, which suits larger images or slower OCR services.
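This page does not describe how the node implements asynchronous processing. One common pattern for non-blocking jobs is to submit the work and then poll for the result, sketched below in Python. The `poll` callable, the interval, and the attempt limit are all hypothetical and stand in for whatever the underlying service provides.

```python
import time
from typing import Callable, Optional


def wait_for_result(
    poll: Callable[[], Optional[dict]],
    interval_s: float = 2.0,
    max_attempts: int = 30,
    sleep: Callable[[float], None] = time.sleep,
) -> dict:
    """Call poll() until it returns a result or the attempts run out.

    poll is a hypothetical callable that returns None while the job is
    still running and a dict with the extracted text once it is done.
    """
    for attempt in range(max_attempts):
        result = poll()
        if result is not None:
            return result
        # Wait between attempts, but not after the final one.
        if attempt < max_attempts - 1:
            sleep(interval_s)
    raise TimeoutError(f"No result after {max_attempts} attempts")
```

Passing `sleep` as a parameter keeps the helper easy to test without real delays.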
Dependencies
- Requires access to an OCR service or library capable of extracting text from images.
- The node expects the image as binary data from a previous node, a base64 string, or a publicly accessible URL.
- The source code snippet does not mention external API keys or credentials. Depending on the underlying OCR implementation, authentication may still be required, in which case it is configured within n8n.
Troubleshooting
Common issues:
- Providing an incorrect binary field name will result in failure to locate the image data.
- Invalid base64 strings or inaccessible URLs will cause errors during text extraction.
- Large images or slow network connections may lead to timeouts if asynchronous processing is disabled.
Error messages:
- Errors related to missing or invalid input data typically indicate misconfiguration of the input properties.
- Network errors when using URLs suggest connectivity issues or invalid URLs.
- OCR service errors may occur if the service is unavailable or the input image format is unsupported.
Resolutions:
- Verify the binary property name matches the actual binary data field.
- Ensure base64 strings are correctly encoded and complete.
- Confirm URLs are reachable and point directly to image files.
- Enable asynchronous processing for large or complex images to avoid timeouts.
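Two of the resolutions above, checking the base64 string and checking the URL, can be done before the workflow runs. The Python sketch below uses only the standard library. The helper names `is_valid_base64` and `is_http_url` are hypothetical, and both checks are syntactic only: they do not confirm that the content decodes to a supported image or that the URL is reachable.

```python
import base64
import binascii
from urllib.parse import urlparse


def is_valid_base64(content: str) -> bool:
    """Return True if content is well-formed, correctly padded base64."""
    # Drop whitespace and line breaks that often appear in pasted strings.
    compact = "".join(content.split())
    if not compact:
        return False
    try:
        # validate=True rejects characters outside the base64 alphabet.
        base64.b64decode(compact, validate=True)
    except (binascii.Error, ValueError):
        return False
    return True


def is_http_url(url: str) -> bool:
    """Return True if url has an http(s) scheme and a host part."""
    parsed = urlparse(url.strip())
    return parsed.scheme in ("http", "https") and bool(parsed.netloc)
```

A truncated string such as `aGVsbG8` (missing its `=` padding) fails the first check, which matches the "complete" requirement in the resolution list.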