You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Recursively explore folders within folders and download files within each of them
Scraping Plan
The Sacramento County Sheriff’s Department website should be pretty straightforward to scrape, since all the links to dropbox are provided in links on one web page.
Next steps will be to get the metadata in JSON from HTML, figure out how to download files from dropbox links and make sure we are comprehensively downloading all files in nested Dropbox folders. We hypothetically can force download files from dropbox links by modifying the urls (adding dl=1) so will need to see if that works out.
The text was updated successfully, but these errors were encountered:
agency slug (proposed): ca_sac_county_sheriff
module: *clean/ca/sacramento_sheriffs.py`
url: https://www.sacsheriff.com/pages/released_cases.php
Tasks
Scraping Plan
The Sacramento County Sheriff’s Department website should be pretty straightforward to scrape, since all the links to dropbox are provided in links on one web page.
Next steps will be to get the metadata in JSON from HTML, figure out how to download files from dropbox links and make sure we are comprehensively downloading all files in nested Dropbox folders. We hypothetically can force download files from dropbox links by modifying the urls (adding dl=1) so will need to see if that works out.
The text was updated successfully, but these errors were encountered: