scrape-html

Fetch fully-rendered HTML from any webpage including SPAs using PDF API Hub

Package Information

Downloads: 0 weekly / 33 monthly
Latest Version: 1.0.1
Author: Rishabh Dugar

Documentation

n8n-nodes-scrape-html

NPM Version
License

Fetch the fully-rendered HTML of any webpage using a headless browser — perfect for SPAs and JavaScript-heavy sites.

This is an n8n community node powered by PDF API Hub.


🚀 Install

  1. Go to Settings → Community Nodes in n8n
  2. Enter n8n-nodes-scrape-html
  3. Click Install

🔑 Setup

Sign up at pdfapihub.com → copy your API key → add to n8n credentials.


✨ Features

Parameter Description
URL Any public webpage
Wait Until Fully Loaded, DOM Ready, Network Quiet, or First Response
Wait for Element CSS selector to wait for before capturing (e.g. #main-content)
Extra Delay Additional milliseconds for lazy content
Viewport Desktop, Laptop, Mobile, Tablet, or Custom

Advanced Options

Option Description
Navigation Timeout Max milliseconds to wait for page load
User Agent Custom user-agent string for mobile/bot versions
Extra HTTP Headers Additional headers sent with every request

💡 Use Cases

  • Web scraping — get rendered HTML from React/Vue/Angular SPAs
  • Content monitoring — track changes on dynamic websites
  • SEO analysis — fetch rendered HTML for SEO auditing
  • Data extraction — scrape JS-rendered content that simple HTTP can't reach

License

MIT

Discussion