docx-extractor

Docx Converter Node for n8n

Package Information

Released: 7/31/2025
Downloads: 18 weekly / 54 monthly
Latest Version: 0.1.0
Author: annhdev

Documentation

Banner image

n8n-nodes-docx-extractor

This is a custom n8n node that converts DOCX files to HTML, Text or Markdown format using the mammoth and turndown libraries.
This node is designed to be used in n8n workflows to facilitate the conversion of DOCX documents into more web-friendly formats like HTML, Text or Markdown.

Features

  • Convert DOCX files to HTML, Text or Markdown format.
  • Support binary data input.
  • Easy integration into n8n workflows.

Installation

npm install n8n-nodes-docx-extractor

Usage

  1. Add the node to your n8n workflow.
  2. Connect the node to a previous node that outputs a DOCX file, ensuring that the input data is in binary format.
    • The input should be a binary file with the MIME type application/vnd.openxmlformats-officedocument.wordprocessingml.document.
  3. Configure the node by selecting the desired output format (HTML, Text, Markdown, or All).

Configuration

  • Output Format: Choose the desired output format for the conversion:
    • HTML: Converts the DOCX file to HTML format.
    • Text: Converts the DOCX file to plain text format.
    • Markdown: Converts the DOCX file to Markdown format.
    • All: Converts the DOCX file to all formats (HTML, Text, and Markdown).

Contributing

References

License

MIT

Discussion