An application to generate site map of a given domain
docker run --rm nikhilvep/webcrawl
docker run --rm nikhilvep/webcrawl https://github.com
docker build -t web-crawler:0.1 .
docker run --rm web-crawler:0.1
docker run --rm web-crawler:0.1 https://github.com
url input: https://github.com
- maximum page limit: 250
- maximum links stored per page: 100
- number of concurrent workers (url fetch and parse): 10
time docker run --rm nikhilvep/webcrawl:0.1 https://github.com
0.06s user 0.22s system 1% cpu 22.937 total
- maximum page limit: 250
- maximum links stored per page: 100
- number of concurrent workers (url fetch and parse): 5
time docker run --rm nikhilvep/webcrawl:0.1 -w 5 https://github.com
0.07s user 0.18s system 0% cpu 47.206 total
- maximum page limit: 250
- maximum links stored per page: 100
- concurrency disabled
time docker run --rm nikhilvep/webcrawl:0.1 -con-off https://github.com
0.07s user 0.20s system 0% cpu 2:29.47 total