Crawl4AI Web Scraper

Web scraping using local Crawl4AI instance. Use for fetching full page content with JavaScript rendering. Better than Tavily for complex pages. Unlimited usage.

安裝


              $clawhub install crawl-for-ai

Crawl4AI Web Scraper

Local Crawl4AI instance for full web page extraction with JavaScript rendering.

Endpoints

Proxy (port 11234) — Clean output, OpenWebUI-compatible

Returns: [{page_content, metadata}]
Use for: Simple content extraction

Direct (port 11235) — Full output with all data

Returns: {results: [{markdown, html, links, media, ...}]}
Use for: When you need links, media, or other metadata

Usage


# Via script
node {baseDir}/scripts/crawl4ai.js "url"
node {baseDir}/scripts/crawl4ai.js "url" --json

Script options:

--json — Full JSON response

Output: Clean markdown from the page.

Configuration

Required environment variable:

CRAWL4AI_URL — Your Crawl4AI instance URL (e.g., http://localhost:11235)

Optional:

CRAWL4AI_KEY — API key if your instance requires authentication

Features

JavaScript rendering — Handles dynamic content
Unlimited usage — Local instance, no API limits
Full content — HTML, markdown, links, media, tables
Better than Tavily for complex pages with JS

API

Uses your local Crawl4AI instance REST API. Auth header only sent if CRAWL4AI_KEY is set.

詳情

版本: v1.0.1
下載: 1,753
星標: 4
ClawHub: 在 ClawHub 查看

熱門 Skills

Elite Longterm Memory

Ultimate AI agent memory system for Cursor, Claude, ChatGPT & Copilot. WAL protocol + vector search + git-notes + cloud backup. Never lose context again. Vibe-coding ready.

Manage tasks and projects in Todoist. Use when user asks about tasks, to-dos, reminders, or productivity.

Query Polymarket prediction markets - check odds, trending markets, search events, track prices and momentum. Includes watchlist alerts, resolution calendar, momentum scanner, and paper trading (simulated, no real money).