Crawl4AI
Open-source alternative to Browserbase, Apify, Jina AI.
Open-source web crawler and scraper that produces clean, structured output optimized for LLMs, RAG pipelines, and AI agents. Supports async crawling, CSS/XPath/LLM extraction, and stealth browser control.
Open Source Alternative to:

Open-source web crawler and scraper that produces clean, structured output optimized for LLMs, RAG pipelines, and AI agents. Supports async crawling, CSS/XPath/LLM extraction, and stealth browser control.
Some key features of Crawl4AI:
- Open-source alternative to Browserbase, Apify, Jina AI
- Data Extraction & Web Scraping coverage
- GitHub stars, forks, license, last commit, and activity score synced from directory sources
- Codex course and interactive classroom previews attached to the repository detail page
Categories
Learning previews
Learn this project
Codex HTML Course
Crawl4AI Codex HTML Course
Static preview slot
Crawl4AI Codex HTML Course is being prepared for this directory project.
Interactive Classroom
Crawl4AI Interactive Classroom
Static preview slot
Crawl4AI Interactive Classroom is being prepared for this directory project.
Related projects
Firecrawl
AGPL-3.0
API for AI agents to search, scrape, crawl, and interact with the live web, returning clean Markdown, structured JSON, or screenshots from any page.
Lightpanda
AGPL-3.0
Purpose-built headless browser that delivers 10x faster performance and 10x lower memory usage compared to Chrome headless for web automation and AI workflows.
Maxun
AGPL-3.0
Train robots in 2 minutes to scrape web data automatically. No coding required. Handles pagination, CAPTCHAs, and layout changes with AI.
Documind
Unknown
Documind uses advanced AI and LLMs to extract structured data from PDFs, images, and other documents, streamlining document processing and automation.
Browser Use
MIT
Python library that lets AI agents browse the web by giving them real browser control, DOM access, and the ability to interact with any website.
Skyvern
AGPL-3.0
Transform manual browser tasks into automated workflows using AI. Handle complex forms, CAPTCHAs, 2FA, and data extraction across any website at scale.














