docx-to-md

n8n node to convert Docx files to Markdown

Package Information

Released: 7/3/2025
Downloads: 65 weekly / 123 monthly
Latest Version: 0.1.4
Author: sfrangulov

Documentation

n8n-nodes-docx-to-md

This is an n8n community node. It lets you convert Microsoft Word documents (.docx) to Markdown format in your n8n workflows.

This node provides seamless conversion of Word documents to clean, GitHub-flavored Markdown with automatic formatting correction and linting.

n8n is a fair-code licensed workflow automation platform.

Installation
Operations
Compatibility
Usage
Resources

Installation

Follow the installation guide in the n8n community nodes documentation.

Operations

The Docx to Markdown node supports the following operation:

  • Convert: Converts a Microsoft Word document (.docx) from binary data to clean Markdown format

Features

  • Converts Word documents to GitHub-flavored Markdown
  • Automatically converts tables with proper headers
  • Applies markdown linting and formatting corrections
  • Preserves document structure and formatting
  • Supports headings, lists, tables, and basic text formatting

Compatibility

  • Minimum n8n version: 1.0.0
  • Tested with n8n versions: 1.0.0+
  • Compatible with all n8n deployment methods (self-hosted, cloud, desktop)

Usage

Basic Setup

  1. Add the "Docx to Markdown" node to your workflow
  2. Configure the Input Binary Field parameter with the name of the field containing your Word document binary data (default: "data")
  3. Configure the Destination Output Field parameter with the name where you want the converted Markdown text to be stored (default: "text")

Example Workflow

  1. Use an HTTP Request node or file input to get a .docx file
  2. Connect it to the Docx to Markdown node
  3. The converted Markdown will be available in the specified output field
  4. Use the Markdown output in subsequent nodes (e.g., save to file, send via email, etc.)

Tips

  • The node automatically handles table formatting by converting the first row to headers
  • All markdown output is linted and formatted for consistency
  • Binary data should be in proper .docx format for best results

Resources

License

MIT

Discussion