You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Idaho moved its warn PDF from https://www.labor.idaho.gov/dnn/Portals/0/Publications/WARNNotice.pdf to https://www.labor.idaho.gov/wp-content/uploads/publications/WARNNotice.pdf. The scraper follows this transparently, so there's no breakage, but seems like a good policy to update the URL to reflect the current location.
The text was updated successfully, but these errors were encountered:
One note here: the state's page linking to this file actually links to https://www.labor.idaho.gov/warnnotice/ which does a redirect to the PDF with a note that says, parenthetically, "link is updated as notices are received." That reads to me like the file is updated continuously, but it could also mean they change the link on a semi-regular basis. So, we have a couple options:
retrieve the file at the current URL of the PDF
retain the current behavior and rely on the redirect from the file's old URL
rely on the /warnnotice/ redirect
scrape the HTML page to know which URL to check
I think it's probably a crap shoot, but the simplest thing to do to improve the situation might be #1.
chriszs
added a commit
to chriszs/warn-scraper
that referenced
this issue
Mar 27, 2024
Idaho moved its warn PDF from
https://www.labor.idaho.gov/dnn/Portals/0/Publications/WARNNotice.pdf
tohttps://www.labor.idaho.gov/wp-content/uploads/publications/WARNNotice.pdf
. The scraper follows this transparently, so there's no breakage, but seems like a good policy to update the URL to reflect the current location.The text was updated successfully, but these errors were encountered: