WARN PlaywrightCrawler: Reclaiming failed request back to the list or queue. Request blocked - received 429 status code. #137

Voyager3D · 2024-01-07T14:51:44Z

I'm no coder and I've not scraped websites before.
But I'm assuming this error code might mean the website is refusing my requests because I'm scraping it too much?

I was able to output a file from this website after it scanned 150 pages. That worked perfectly, but somewhere after 150 pages it doesn't seem to like it and I get this error:
WARN PlaywrightCrawler: Reclaiming failed request back to the list or queue. Request blocked - received 429 status code.

Not sure if I'm on the ball with that one or not, but any advice would be appreciated!

Cheers!

Cougart · 2024-01-16T18:06:33Z

Hi,
I'm having the same issue with several websites.
Is it possible to add a sleep option between two calls?
I don't see any other possibilities.
Thanks a lot!

SimonGodefroid · 2024-02-12T06:51:36Z

429 is the "Too Many Requests" status code, so you may have been throttled by the server.

Meaning: to prevent clients from making too many requests, the server blocks requests coming from a given IP, either temporarily or permanently, after a given number of incoming requests. I'm not saying this is 100% your case, but it's the most probable scenario here.
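
If throttling is indeed the cause, the usual remedy is to wait longer between retries. Below is a minimal sketch of how such a wait could be computed, assuming the server may send a `Retry-After` header with a number of seconds. The names `retryDelayMs` and `sleep` and the default values are made up for illustration; they are not part of Crawlee or of the tool discussed in this thread.

```typescript
// Hypothetical helper (not a Crawlee API): how long to wait before retrying
// a request that was answered with HTTP 429.
function retryDelayMs(
  attempt: number,                  // 0 for the first retry, 1 for the second, ...
  retryAfter: string | null = null, // raw Retry-After header value, if any
  baseMs: number = 1000,            // assumed starting delay
  maxMs: number = 60000,            // assumed upper bound for the computed backoff
): number {
  // When the server states a whole number of seconds, wait exactly that long.
  if (retryAfter !== null && /^\d+$/.test(retryAfter.trim())) {
    return Number(retryAfter.trim()) * 1000;
  }
  // Otherwise (no header, or an HTTP-date this sketch does not parse),
  // double the wait on each attempt, capped at maxMs.
  return Math.min(baseMs * 2 ** attempt, maxMs);
}

// Hypothetical helper: pause for the given number of milliseconds.
function sleep(ms: number): Promise<void> {
  return new Promise((resolve) => setTimeout(resolve, ms));
}
```

A retry loop would call something like `await sleep(retryDelayMs(attempt, retryAfterHeader))` before sending the request again. Separately, Crawlee's crawler options `maxConcurrency` and `maxRequestsPerMinute` can slow the whole crawl down; whether the tool in this thread exposes them in its config is not stated here.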
