Eddie.surf

Web crawling and smart search with Eddie.surf

Overview

The node integrates with Eddie.surf, a web crawling and AI-powered smart search service. It supports multiple operations including crawling URLs to extract data, batch crawling large URL sets, performing intelligent searches across websites using AI, and checking the status of crawl or search jobs.

For the Smart Search operation specifically, the node allows users to submit a search query that leverages AI to find relevant content across websites. This is useful for scenarios such as market research, competitive analysis, content discovery, or extracting specific information from multiple sources without manually visiting each site.

Practical examples:

  • Searching for product pricing details across competitor websites.
  • Finding contact information or company profiles related to a particular industry.
  • Extracting summarized insights from a set of websites based on a custom query.

Properties

| Name | Meaning |
| --- | --- |
| Context | JSON object guiding AI processing and data extraction during the search. |
| Search Query | The text query string used to find relevant content via AI-powered search. |
| Advanced Options | Collection of optional settings, listed in the rows below. |
| - Callback Mode | Notification mode for job completion: "Once" or "Multi". |
| - Callback URL | Optional webhook URL to receive job completion notifications. |
| - Include Technical Data | Whether to collect technical data for each page (costs extra credits per page). |
| - Max Depth | Maximum link depth to follow when crawling (1-10). Not applicable for Smart Search. |
| - Max Pages | Maximum number of pages to crawl. Not applicable for Smart Search. |
| - Max Results | Maximum number of search results to return (1-5000). |
| - Mock Mode | Enables test mode, which does not consume credits. |
| - Rules | Comma-separated list of custom processing instructions (e.g., "Extract pricing", "Extract contact info"). |
| - Skip Duplicate Domains | Whether to skip results from domains that already appear in the results. |
| - Timeout Per Page | Timeout in seconds per page load (1-180). Not applicable for Smart Search. |
| - Website Only | Restricts the search to the specified websites. |
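
As a rough illustration of how these properties fit together, the sketch below assembles Smart Search settings into a plain dictionary. The helper names (`parse_rules`, `build_smart_search_options`), the key names (`query`, `context`, `max_results`, and so on), and the default of 10 results are all hypothetical and chosen for readability; the field names the Eddie.surf API actually expects are not documented here. The sketch splits the comma-separated Rules string into a list and leaves out Max Depth, Max Pages, and Timeout Per Page, which do not apply to Smart Search.

```python
import json
from typing import List, Optional


def parse_rules(rules: str) -> List[str]:
    """Split a comma-separated Rules string into trimmed, non-empty instructions."""
    return [part.strip() for part in rules.split(",") if part.strip()]


def build_smart_search_options(
    query: str,
    context: dict,
    *,
    max_results: int = 10,  # arbitrary default for this sketch
    rules: str = "",
    skip_duplicate_domains: bool = False,
    include_technical_data: bool = False,
    mock_mode: bool = False,
    callback_url: Optional[str] = None,
    callback_mode: str = "Once",
) -> dict:
    """Collect Smart Search settings under hypothetical key names."""
    options = {
        "query": query,
        "context": context,
        "max_results": max_results,
        "rules": parse_rules(rules),
        "skip_duplicate_domains": skip_duplicate_domains,
        "include_technical_data": include_technical_data,
        "mock_mode": mock_mode,
    }
    # Callback settings only matter when a webhook URL is supplied.
    if callback_url:
        options["callback_url"] = callback_url
        options["callback_mode"] = callback_mode
    return options


if __name__ == "__main__":
    example = build_smart_search_options(
        "pricing for standing desks",
        {"product": "name of the product", "price": "listed price"},
        rules="Extract pricing, Extract contact info",
        mock_mode=True,  # test mode, so no credits are consumed
    )
    print(json.dumps(example, indent=2))
```

Enabling Mock Mode while experimenting with Context and Rules keeps test runs from consuming credits.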

Output

The node outputs a JSON object containing the results of the smart search operation. The structure typically includes:

  • Search results matching the query, potentially enriched by AI processing.
  • Metadata about the search such as total results returned.
  • Any extracted data according to the provided context and rules.

If Include Technical Data is enabled, the output may also contain technical data, which provides additional metadata about the crawled pages.

The node does not output binary data.

Dependencies

  • Requires an API key credential for authenticating with the Eddie.surf service.
  • The node makes HTTP POST requests to Eddie.surf endpoints for smart search operations.
  • Optional webhook callback URLs can be configured for asynchronous job completion notifications.
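
To make the request shape concrete, here is a minimal sketch of preparing an authenticated JSON POST request with Python's standard library. It is not the node's actual implementation: the endpoint URL is a non-resolving placeholder, the `Authorization: Bearer` header scheme is an assumption, and `build_post_request` is a hypothetical helper. The real endpoint and authentication header must be taken from the Eddie.surf API documentation. The request is only constructed, not sent.

```python
import json
import urllib.request

# Placeholder only: the real endpoint must come from the Eddie.surf API documentation.
PLACEHOLDER_ENDPOINT = "https://example.invalid/smart-search"


def build_post_request(
    api_key: str,
    payload: dict,
    endpoint: str = PLACEHOLDER_ENDPOINT,
) -> urllib.request.Request:
    """Prepare (but do not send) a JSON POST request carrying an API key."""
    body = json.dumps(payload).encode("utf-8")
    return urllib.request.Request(
        endpoint,
        data=body,
        method="POST",
        headers={
            "Content-Type": "application/json",
            # Assumed header scheme; the service may expect a different one.
            "Authorization": f"Bearer {api_key}",
        },
    )


if __name__ == "__main__":
    request = build_post_request("YOUR_API_KEY", {"query": "pricing for standing desks"})
    print(request.get_method(), request.full_url)
```

Passing the prepared request to `urllib.request.urlopen` would send it, once the placeholder endpoint and header have been replaced with the documented values.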

Troubleshooting

  • Missing or empty Search Query: The node throws an error if the search query is missing or blank. Ensure the query parameter is provided.
  • Max Results out of range: Values must be between 1 and 5000; otherwise, an error is raised.
  • Invalid URLs (for other operations): URLs must start with http:// or https://.
  • API authentication errors: Verify that the API key credential is correctly configured and has necessary permissions.
  • Timeouts: If pages take too long to load during crawl operations, consider increasing the Timeout Per Page setting (1-180 seconds). This setting does not apply to Smart Search.
  • Duplicate domain filtering: Enabling Skip Duplicate Domains can return fewer results than expected when many results come from the same domain.
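
The input checks described above can be approximated before a workflow runs. The sketch below mirrors the documented rules (non-blank query, Max Results between 1 and 5000, URLs starting with http:// or https://) and illustrates why duplicate domain filtering shrinks a result list. All function names are hypothetical, and the node's own error messages and its exact notion of a "duplicate domain" may differ; this version compares full host names, so `www.example.com` and `example.com` count as different domains.

```python
from typing import List
from urllib.parse import urlparse


def validate_search_query(query: str) -> None:
    """Reject a missing or blank Search Query."""
    if not query or not query.strip():
        raise ValueError("Search Query must not be empty")


def validate_max_results(value: int) -> None:
    """Enforce the documented 1-5000 range for Max Results."""
    if not 1 <= value <= 5000:
        raise ValueError("Max Results must be between 1 and 5000")


def validate_url(url: str) -> None:
    """Require the http:// or https:// prefix used by the URL-based operations."""
    if not url.startswith(("http://", "https://")):
        raise ValueError("URL must start with http:// or https://")


def skip_duplicate_domains(urls: List[str]) -> List[str]:
    """Keep only the first URL seen for each host name (illustration only)."""
    seen = set()
    kept = []
    for url in urls:
        domain = urlparse(url).netloc.lower()
        if domain in seen:
            continue
        seen.add(domain)
        kept.append(url)
    return kept


if __name__ == "__main__":
    validate_search_query("pricing for standing desks")
    validate_max_results(50)
    validate_url("https://example.com/products")
    results = [
        "https://example.com/a",
        "https://example.com/b",
        "https://example.org/c",
    ]
    # Three results collapse to two because two share a host name.
    print(skip_duplicate_domains(results))
```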

Links and References

  • Eddie.surf Official Website (for service overview)
  • Documentation for Eddie.surf API (not linked here, but typically available from the service provider)
