bozonx-page-scraper-microservice

n8n node for Page Scraper microservice - extract structured content, retrieve HTML, and process URLs in batches

Package Information

Released: 11/21/2025
Downloads: 202 weekly / 491 monthly
Latest Version: 1.2.0
Author: Ivan K

Documentation

n8n-nodes-bozonx-page-scraper-microservice

n8n community node for the Page Scraper microservice. Extract structured content from web pages, retrieve raw HTML, and process URLs in batches.

Installation

Via n8n UI (Recommended)

  1. Go to Settings > Community Nodes
  2. Select Install
  3. Enter n8n-nodes-bozonx-page-scraper-microservice
  4. Click Install

Via npm

npm install n8n-nodes-bozonx-page-scraper-microservice

Prerequisites

You need a running instance of the Page Scraper microservice.

Quick start with Docker:

docker run -d -p 8080:8080 bozonx/page-scraper-microservice:latest

Credentials

Configure Bozonx Microservices API credentials:

  • Gateway URL: Microservice base URL (e.g., http://localhost:8080)
  • API Token (optional): Bearer token if authentication is enabled

Operations

Scrape Page

Extract structured article content (title, author, date, body in Markdown).

Key parameters:

  • URL: Target page
  • Mode: extractor (fast) or playwright (JavaScript support)
  • Fingerprint Options: Anti-bot detection avoidance

Get HTML

Retrieve raw rendered HTML using Playwright browser automation.

Create Batch

Process multiple URLs asynchronously with configurable delays and webhook notifications.

Key features:

  • Schedule delays between requests (default: 1500-4000ms)
  • Webhook notification on completion
  • Per-item or common settings

Get Batch Status

Check batch job progress and retrieve results using the job ID.

Resources

Compatibility

  • n8n: 0.220.0+
  • Node.js: 18.x+

License

MIT

Discussion