bozonx-page-scraper-microservice

n8n node for Page Scraper microservice - extract structured content, retrieve HTML, and process URLs in batches

Package Information

Released: 11/21/2025

Downloads: 202 weekly / 491 monthly

Latest Version: 1.2.0

Author: Ivan K

Available Nodes

Page Scraper

No description available

Documentation

n8n-nodes-bozonx-page-scraper-microservice

n8n community node for the Page Scraper microservice. Extract structured content from web pages, retrieve raw HTML, and process URLs in batches.

Installation

Via n8n UI (Recommended)

Go to Settings > Community Nodes
Select Install
Enter n8n-nodes-bozonx-page-scraper-microservice
Click Install

Via npm

npm install n8n-nodes-bozonx-page-scraper-microservice

Prerequisites

You need a running instance of the Page Scraper microservice.

Quick start with Docker:

docker run -d -p 8080:8080 bozonx/page-scraper-microservice:latest

Credentials

Configure Bozonx Microservices API credentials:

Gateway URL: Microservice base URL (e.g., http://localhost:8080)
API Token (optional): Bearer token if authentication is enabled

Operations

Scrape Page

Extract structured article content (title, author, date, body in Markdown).

Key parameters:

URL: Target page
Mode: extractor (fast) or playwright (JavaScript support)
Fingerprint Options: Anti-bot detection avoidance

Get HTML

Retrieve raw rendered HTML using Playwright browser automation.

Create Batch

Process multiple URLs asynchronously with configurable delays and webhook notifications.

Key features:

Schedule delays between requests (default: 1500-4000ms)
Webhook notification on completion
Per-item or common settings

Get Batch Status

Check batch job progress and retrieve results using the job ID.

Resources

Microservice Repository - Deployment and API documentation
n8n Community Nodes - Installation guide

Compatibility

n8n: 0.220.0+
Node.js: 18.x+

License

MIT