- Stars
- 21,214
- License
- Apache-2.0
- Last commit
- 1 hour ago
Best Open-source Web Scraping & Crawling tools
Explore curated open-source tools in the Web Scraping & Crawling category. Compare technologies, see alternatives, and find the right solution for your workflow.
10+ projects · Page 1 of 1
TypeScriptActive
- Stars
- 76,406
- License
- AGPL-3.0
- Last commit
- 1 hour ago
TypeScriptActive

Apache Nutch
Scalable, extensible Java web crawler for large‑scale data collection
- Stars
- 3,114
- License
- Apache-2.0
- Last commit
- 9 hours ago
JavaActive
- Stars
- 14,162
- License
- AGPL-3.0
- Last commit
- 19 hours ago
TypeScriptActive
- Stars
- 8,824
- License
- BSD-3-Clause
- Last commit
- 20 hours ago
PythonActive
- Stars
- 22,341
- License
- MIT
- Last commit
- 1 day ago
PythonActive
- Stars
- 59,514
- License
- BSD-3-Clause
- Last commit
- 1 day ago
PythonActive
- Stars
- 15,422
- License
- MIT
- Last commit
- 2 days ago
GoActive
- Stars
- 25,016
- License
- Apache-2.0
- Last commit
- 15 days ago
GoActive
- Stars
- 2,529
- License
- MIT
- Last commit
- 19 days ago
TypeScriptActive
- Stars
- 11,689
- License
- Apache-2.0
- Last commit
- 1 month ago
JavaActive
- Stars
- 6,164
- License
- MIT
- Last commit
- 1 month ago
TypeScriptActive

AutoScraper
Automatic, fast, lightweight web scraper that learns from examples
- Stars
- 7,074
- License
- MIT
- Last commit
- 7 months ago
PythonStable










