- Add estimate_pages.py script (~270 lines) - Fast estimation without downloading content (HEAD requests only) - Shows estimated total pages and recommended max_pages - Validates URL patterns work correctly - Estimates scraping time based on rate_limit - Update CLAUDE.md with estimator workflow and commands - Update README.md features section with estimation benefits - Usage: python3 estimate_pages.py configs/react.json - Time: 1-2 minutes vs 20-40 minutes for full scrape 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>
7.8 KiB
Executable File
7.8 KiB
Executable File