Commit Graph

2 Commits

Author SHA1 Message Date
Tomas Dvorak aa47f4309f refactor: optimize docker image and implement lightweight fetching
This commit improves the overall efficiency and reliability of the scraper by:

- Optimizing the Dockerfile by reducing layers, using `--no-install-recommends`, and consolidating Playwright installation.
- Adding resource limits (CPU/Memory) to the docker-compose configuration.
- Refactoring `main.go` to remove unused Cloudflare client structures and increase the cache TTL.
- Implementing a `lightweight_fetch` mechanism in `scrapling_fetch.py` using `urllib` to attempt fast requests before falling back to the heavier Scrapling/Playwright engine.
- Adding Cloudflare challenge detection to the lightweight fetcher.
2026-05-11 19:50:59 +02:00
Tomas Dvorak dc3b7e22ee fix upload 2026-03-13 15:42:09 +01:00
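
Commit aa47f4309f describes a `lightweight_fetch` step in `scrapling_fetch.py` that tries a plain `urllib` request first and falls back to the heavier Scrapling/Playwright engine, with Cloudflare challenge detection on the fast path. The actual code is not shown in this log, so the following is only a minimal sketch of how such a flow could look. The helper names `looks_like_challenge` and `fetch`, the marker strings, the `User-Agent` value, the timeout, and the `heavy` callback (standing in for the Scrapling/Playwright engine) are all assumptions made for illustration, not taken from the repository.

```python
import urllib.request
from typing import Callable, Mapping, Optional

# Hypothetical markers; the commit does not say which strings it checks for.
CHALLENGE_MARKERS = ("just a moment...", "cf-chl-", "challenge-platform")


def looks_like_challenge(status: int, headers: Mapping[str, str], body: str) -> bool:
    """Heuristic guess at whether a response is a Cloudflare challenge page."""
    lowered = {k.lower(): v.lower() for k, v in headers.items()}
    # Cloudflare sets this header on challenged responses.
    if lowered.get("cf-mitigated") == "challenge":
        return True
    if status in (403, 503) and "cloudflare" in lowered.get("server", ""):
        return True
    text = body.lower()
    return any(marker in text for marker in CHALLENGE_MARKERS)


def lightweight_fetch(url: str, timeout: float = 10.0) -> Optional[str]:
    """Try a plain urllib request; return None when the fast path should be abandoned."""
    request = urllib.request.Request(url, headers={"User-Agent": "Mozilla/5.0"})
    try:
        with urllib.request.urlopen(request, timeout=timeout) as response:
            status = response.status
            headers = dict(response.headers.items())
            body = response.read().decode("utf-8", errors="replace")
    except (OSError, ValueError):
        # URLError/HTTPError and socket timeouts are OSError subclasses;
        # ValueError covers malformed URLs.
        return None
    if looks_like_challenge(status, headers, body):
        return None
    return body


def fetch(
    url: str,
    light: Callable[[str], Optional[str]] = lightweight_fetch,
    heavy: Optional[Callable[[str], str]] = None,
) -> str:
    """Use the fast path when it succeeds, otherwise hand off to the heavy engine."""
    body = light(url)
    if body is not None:
        return body
    if heavy is None:
        raise RuntimeError("lightweight fetch failed and no fallback was provided")
    return heavy(url)
```

Passing the fetchers in as callables keeps the fallback decision separate from the engines themselves, so the challenge heuristic can be exercised without network access or a browser install.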