h2oGPTe icon

h2oGPTe

h2oGPTe is an AI-powered search assistant for your internal teams to answer questions gleaned from large volumes of documents, websites and workplace content.

Actions198

Overview

This node operation converts files that were uploaded in "Agent_only" ingest mode into PDF format and then parses them for further processing. It is useful when documents initially ingested with limited processing ("Agent_only" mode) need to be converted into a standard, fully parsed format (PDF) for indexing, searching, or analysis within a document collection.

Typical use cases include:

  • Migrating documents from a restricted ingest mode to a fully processed state.
  • Preparing agent-uploaded files for downstream AI-powered document understanding workflows.
  • Automating the conversion and parsing of documents to enable semantic search or question answering on their content.

Properties

Name Meaning
Collection ID String ID of the collection to add the ingested documents into.
Document ID String ID of the document to be parsed.
Additional Options A set of optional parameters to customize the ingestion and parsing process:
- Audio Input Language Language of audio files; default is "auto" for automatic detection.
- Chunk By Page Boolean indicating whether each page should be treated as a separate chunk.
- Gen Doc Questions Boolean to auto-generate sample questions for each document using an LLM.
- Gen Doc Summaries Boolean to auto-generate document summaries using an LLM.
- Handwriting Check Boolean to check pages for handwriting and use specialized models if found.
- Keep Tables As One Chunk Boolean to keep table tokens identified by the parser in a single chunk.
- Ocr Model Specifies which OCR model to use for text extraction from images; default is "auto".
- Permissions List of usernames granted permissions to the document.
- Restricted Boolean indicating if the document should be restricted to certain users only.
- Tesseract Lang Language code used when OCR model is set to "tesseract".
- Timeout Timeout in seconds for the operation; 0 means no timeout.

Output

The node outputs JSON data representing the response from the ingestion API endpoint after converting and parsing the document. This typically includes metadata about the ingested document, status information, and any parsing results or errors.

If the operation involves binary data (e.g., file uploads), it would handle those accordingly, but this specific operation focuses on document conversion and parsing, so output is primarily JSON metadata.

Dependencies

  • Requires an API key credential for authentication with the external service.
  • The node sends HTTP POST requests to the /ingest/agent_only_to_standard endpoint of the configured API.
  • Proper configuration of the API base URL and credentials in n8n is necessary.
  • Optional dependencies include access to OCR models and LLM services for generating summaries and questions.

Troubleshooting

  • Timeouts: If the operation times out, increase the Timeout property value.
  • Permission Errors: Ensure the user has appropriate permissions to access and modify the specified collection and document.
  • Invalid Document ID or Collection ID: Verify that the IDs provided are correct and exist in the system.
  • OCR Failures: If text extraction fails, try changing the Ocr Model or specifying the correct Tesseract Lang.
  • Handwriting Detection Issues: Enabling Handwriting Check requires specialized models; ensure these are available and properly configured.
  • Parsing Errors: Check the document format compatibility and integrity; corrupted or unsupported files may fail to parse.

Links and References

Discussion