Optimize court call scrape #52

antidipyramid · 2024-04-05T13:22:42Z

Since we started all fetching calendar values while scraping court calls, the scrape has slow down to the point where we're unable to scrape all available court calls in under 6 hours.

We could try a some things to make the scrapes more efficient:

Avoiding duplicate calendar requests-- on the results page, there are usually at least two court calls listed for a single case. Caching calendar values should reduce the number of case detail requests by at least half.
If (1) isn't enough, we could also limit the dates we're scraping every day. We could try only scraping the court calls for the current or next day.

antidipyramid · 2024-04-05T13:22:55Z

What do you think @fgregg?

antidipyramid mentioned this issue Apr 5, 2024

Optimize court call scrape #53

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Optimize court call scrape #52

Optimize court call scrape #52

antidipyramid commented Apr 5, 2024

antidipyramid commented Apr 5, 2024

Optimize court call scrape #52

Optimize court call scrape #52

Comments

antidipyramid commented Apr 5, 2024

antidipyramid commented Apr 5, 2024