Update ID scraper to use state's new URL #644

chriszs · 2024-03-27T17:42:35Z

Idaho moved its warn PDF from https://www.labor.idaho.gov/dnn/Portals/0/Publications/WARNNotice.pdf to https://www.labor.idaho.gov/wp-content/uploads/publications/WARNNotice.pdf. The scraper follows this transparently, so there's no breakage, but seems like a good policy to update the URL to reflect the current location.

The text was updated successfully, but these errors were encountered:

chriszs · 2024-03-27T17:52:55Z

One note here: the state's page linking to this file actually links to https://www.labor.idaho.gov/warnnotice/ which does a redirect to the PDF with a note that says, parenthetically, "link is updated as notices are received." That reads to me like the file is updated continuously, but it could also mean they change the link on a semi-regular basis. So, we have a couple options:

retrieve the file at the current URL of the PDF
retain the current behavior and rely on the redirect from the file's old URL
rely on the /warnnotice/ redirect
scrape the HTML page to know which URL to check

I think it's probably a crap shoot, but the simplest thing to do to improve the situation might be #1.

Closes biglocalnews#644

chriszs added a commit to chriszs/warn-scraper that referenced this issue Mar 27, 2024

Fix ID by updating to state's new URL

bc435e2

Closes biglocalnews#644

chriszs mentioned this issue Mar 27, 2024

Update ID scraper to use state's new URL #645

Closed

chriszs changed the title ~~"Fix" ID by updating to state's new URL~~ Update ID to state's new URL Jun 18, 2024

chriszs changed the title ~~Update ID to state's new URL~~ Update ID scraper to use state's new URL Jun 18, 2024

chriszs closed this as completed Nov 8, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Update ID scraper to use state's new URL #644

Update ID scraper to use state's new URL #644

chriszs commented Mar 27, 2024

chriszs commented Mar 27, 2024 •

edited

Loading

Update ID scraper to use state's new URL #644

Update ID scraper to use state's new URL #644

Comments

chriszs commented Mar 27, 2024

chriszs commented Mar 27, 2024 • edited Loading

chriszs commented Mar 27, 2024 •

edited

Loading