Ca 50 created scrape and scrape-meta for laserfichie #52
Commits on Aug 22, 2022
- Commit d7a2ddb
- Commit 993bfc1
- Commit 2e3e71e
- Commit 1fe22b4
- Commit bade08d
Commits on Aug 26, 2022
- Commit d5708f3
- Commit 0df0ea0
Commits on Aug 27, 2022
- Commit 90a51af
- Commit 0bb8c27
- Commit bb607cb
Commits on Sep 13, 2022
- Commit 1133a6d
Commits on Sep 14, 2022
- Commit 715f735
Commits on Mar 21, 2024
- Commit 11d58ea
Commits on Mar 22, 2024
- Commit 2cdd0a4
Commits on Apr 10, 2024
- * Partial work porting over and updating warn-scraper conventions
  * Initial pass at porting usage docs biglocalnews#6
  * Add cli list command
  * Update README
  * Fix Makefile help text
  * Commit lockfile
  * Add stories page
  * Add throttle and misc cleanups to cli and runner
  * Add dependency to setup.py
  * Clobber obsolete SD scraper test module
  * Log agency slug in runner
  * Add customizable throttling
  * Port and update cache
  * Remove doc tests and disable Python and other downstream actions in CI (for now)
  * Partial work on ca_san_diego_pd
  * Fix/update usage docs
  Commit 117744a
- Commit 5bf9e35
- * Use pre-commit for make formt
  * Rework to use Site class per agency. biglocalnews#3 biglocalnews#4
  Commit de28b11
- * Update contributor docs biglocalnews#5
  * Update usage docs biglocalnews#6
  * update deps
  * Update main README biglocalnews#11
  * add stub page for maintainer docs biglocalnews#9
  Commit 309e677
- Commit 1587e1c
- Commit c9563ef
- Commit e45f0c8
- Commit 49f4f84
Commits on Apr 13, 2024
- Complete San Diego reference implementation biglocalnews#4 (biglocalnews#19)
  * Implement/update San Diego scrape_meta and scrape methods
  * Add cache.write_json and read_json methods
  * Add filter option to CLI scrape command
  * update reqs
  Commit e9b0c18
Commits on Apr 15, 2024
- Tests biglocalnews#20 (biglocalnews#21)
  * Add basic test coverage
  * Default to empty string for scrape-meta filter arg
  * ignore linter code
  * linter fixes
  * add tox/testing and linting info to maintainers docs
  * remove py37 support and bump to Beta dev status
  * Add tox config and reqs.txts
  * Update Pipfile.lock
  Commit 1a5c3cb
Commits on Apr 24, 2024
- Commit bea2c64
- Commit 53aa6b9
- Commit de74903
Commits on May 1, 2024
- Ca orange county sheriff (biglocalnews#30)
  * Add CA orange_county_sheriff.py
  * add name to contributors
  Co-authored-by: jrynning <[email protected]>
  Commit 3effb09
Commits on May 2, 2024
- * Update contributor docs. Closes biglocalnews#25
  * linter fixes
  Commit 2828124
Commits on May 20, 2024
- Ca sonoma county sheriff (biglocalnews#34)
  * Sonoma County Scraper (biglocalnews#32)
  * Add ca_sonoma_county_sheriff
  Co-authored-by: ochezems <[email protected]>
  Commit ca097a8
Commits on Jun 7, 2024
- Commit 8af4b14
- Commit 5c8f2ce
Commits on Jun 10, 2024
- Merge pull request biglocalnews#38 from biglocalnews/grich/docs-contrib
  docs: contrib tweaks; metadata type
  Commit 34b6ae7
Commits on Jul 17, 2024
- Commit 35695c9
- ops: fix runner test (biglocalnews#44)
  * ops: fix runner test
  * ops: avoid redundant gha runs on prs
  Co-authored-by: Gerald Rich <[email protected]>
  Commit 1623c2f
- Commit e1ab46d
Commits on Jul 24, 2024
- Ca 43 santa rosa scraper (biglocalnews#45)
  * added santa rosa
  Commit b7732f0
Commits on Jul 29, 2024
- Added The scraper for Humboldt with successful pre-commit run (biglocalnews#48)
  * Added The scraper for Humboldt with successful pre-commit run
  * Required Changes done
  * removed download page where identical
  Commit dc24b8e
Commits on Jul 30, 2024
- docs: metadata spec (biglocalnews#49)
  * docs: metadata spec
  * docs: remove refs to scrape
  Co-authored-by: Gerald Rich <[email protected]>
  Commit f689f6b
- Commit 0655900
- Commit 881c328
- Commit 5b17957
- Commit c3ef96d
- Commit f8d0fc9
Commits on Jul 31, 2024
- Commit f2f7b52
- Commit 09d65ca
- Commit 842cc76
Commits on Aug 1, 2024
- feat: sacramento pd scraper (biglocalnews#39)
* feat: sacramento pd scraper * fix: isort * scrape most child pages; todo: get sub-sub pages * more recursively grab child pages * inline comments * fix: fn names, py type * feat: collect zip & pdfs; todo: handle dupe assets * chore: ci * feat: download youtube videos & playlists; remove print stmts * style: naming * ops: clean-prefect import clean * ops: fix runner test (biglocalnews#44) * ops: fix runner test * ops: avoid redundant gha runs on prs --------- Co-authored-by: Gerald Rich <[email protected]> * ops: current reqs * naming * refactor: move around methods * refactor: add case_num * Tiny typo fixs * Ca 43 santa rosa scraper (biglocalnews#45) * added santa rosa * Added The scraper for Humboldt with successful pre-commit run (biglocalnews#48) * Added The scraper for Humboldt with successful pre-commit run * Required Changes done * removed download page where identical * docs: metadata spec (biglocalnews#49) * docs: metadata spec * docs: remove refs to scrape --------- Co-authored-by: Gerald Rich <[email protected]> * Update contributing.md * fix: metadata dict types * fix: import typing_extensions --------- Co-authored-by: Gerald Rich <[email protected]> Co-authored-by: Mike Stucka <[email protected]> Co-authored-by: naumansharifwork <[email protected]>
  Commit 289982d
Commits on Aug 2, 2024
- Commit 24f2259
Commits on Aug 3, 2024
- Commit f8de8e1
Commits on Aug 5, 2024
- fix: update setup.py (biglocalnews#60)
  Co-authored-by: Gerald Rich <[email protected]>
  Commit a0ff2df
Commits on Aug 6, 2024
- Force encoding to allow Windows machines to install (biglocalnews#59)
  * Force encoding to allow Windows machines to use this
  * Update setup.py
  Co-authored-by: Gerald Rich <[email protected]>
  Commit abc3d6a
- Commit 4e04d09
Commits on Aug 7, 2024
- Show error when state prefix is omitted biglocalnews#55 (biglocalnews#61)
  * Show error when state prefix is omitted biglocalnews#55
  * Better messaging, upgrade error level
  * Patch scrape and scrape-meta both
  Co-authored-by: Gerald Rich <[email protected]>
  Commit db266c5
- Rework cache to have better typing and UTF-8 compatibility for biglocalnews#66 (biglocalnews#68)
  * Add in missing UTF-8 compatibility
  * Rework typing
  * Exclude general list from JSON write; patch language for function
  Commit a2482bc
- refactor: SDPD case_id (biglocalnews#62)
  * refactor: SDPD case num
  * copy: case_num >> case_id
  * remove type ignore
  Co-authored-by: Gerald Rich <[email protected]>
  Commit c8cc092
Commits on Aug 8, 2024
- Commit f3b0e55
Commits on Aug 9, 2024
- Commit db05c9c
Commits on Aug 13, 2024
- Add cache.write_binary for biglocalnews#69 (biglocalnews#71)
  * Implement cache.write_binary biglocalnews#69
  Co-authored-by: Gerald Rich <[email protected]>
  Commit 1647cef
Commits on Aug 14, 2024
- Commit a8dd32f
- Commit 383be94
- Commit 5372c01
- Commit 1d7e9dd
Commits on Aug 19, 2024
- feat: Monterey County District Attorney (biglocalnews#74)
  * created scrape and scrape meta for Monterey County District Attorney
  * config folder location changed
  * removed scrape and changed case_num to case_id and in case we dont find case_id it will have title in case_id instead of none
  Co-authored-by: Gerald Rich <[email protected]>
  Commit 61324e3
- Commit 55a5b15
- Commit 2312546
- Commit b82b9c4
- Commit 37e99d4
- Commit 1478078
- Patch list_agencies bug biglocalnews#85 (biglocalnews#87)
  * Patch list_agencies bug biglocalnews#85
  * Readability cleanup
  Commit b5942ea
- added scrape-meta and scrape for riverside pd (biglocalnews#57)
  * added scrape-meta and scrape for riverside pd
  * changes for meta-data file formating
  * changed file name
  * tiny fix
  * removed scrape and changed case_num to case_id
  Commit ec8bd12
- refactor: deprecate scrape method (biglocalnews#82)
  * refactor: deprecate scrape method
  * fix: case_number >> case_id
  Co-authored-by: Gerald Rich <[email protected]>
  Commit 9de55ae
- * fix: tests bootstrap
  * ops: coverage
  * add coveragerc
  * fix: tox; test py39, 310, 311, 312
  Co-authored-by: Gerald Rich <[email protected]>
  Commit 8acc025
- Commit 08b7217
Commits on Aug 20, 2024
- Commit 51a4360
Commits on Aug 21, 2024
- Commit 38b610c
Commits on Aug 30, 2024
- Commit 1d1e8e9
- Los Angeles Sheriff's Department for biglocalnews#51 (biglocalnews#54)
* Proof of concept, missing Class * Export out usable case index * Incremental work consolidating things * First attempt at class * closer * Closer * Implement logging * Polish * Proof of concept, missing Class * Export out usable case index * Incremental work consolidating things * First attempt at class * Kill notebook version brought back by rebase * Move ugly details to config file * Rename ugly detail file * Linting * Fix linting * Proof of concept, missing Class * Export out usable case index * Incremental work consolidating things * First attempt at class * Kill notebook version brought back by rebase * Proof of concept, missing Class * Export out usable case index * Incremental work consolidating things * First attempt at class * Proof of concept, missing Class * Export out usable case index * Incremental work consolidating things * First attempt at class * Proof of concept, missing Class * Proof of concept, missing Class * Export out usable case index * Incremental work consolidating things * First attempt at class * Move ugly details to config file * Rename ugly detail file * Linting * Fix linting * Build against biglocalnews#69 flag biglocalnews#70 * ... 
* Apply suggestions * Clean up notes * Proof of concept, missing Class * Export out usable case index * Incremental work consolidating things * First attempt at class * closer * Closer * Implement logging * Polish * Proof of concept, missing Class * Export out usable case index * Incremental work consolidating things * First attempt at class * Kill notebook version brought back by rebase * Move ugly details to config file * Rename ugly detail file * Linting * Fix linting * Proof of concept, missing Class * Export out usable case index * Incremental work consolidating things * First attempt at class * Kill notebook version brought back by rebase * Proof of concept, missing Class * Export out usable case index * Incremental work consolidating things * First attempt at class * Proof of concept, missing Class * Export out usable case index * Incremental work consolidating things * First attempt at class * Proof of concept, missing Class * Proof of concept, missing Class * Export out usable case index * Incremental work consolidating things * First attempt at class * Move ugly details to config file * Rename ugly detail file * Linting * Fix linting * Build against biglocalnews#69 flag biglocalnews#70 * ... * Apply suggestions * Clean up notes * Copypaste around rebase problems --------- Co-authored-by: Gerald Rich <[email protected]>
  Commit f12b238
Commits on Sep 3, 2024
- added scrape meta for chula_vista_pd biglocalnews#94 (biglocalnews#95)
  * added scrape meta for chula_vista_pd biglocalnews#94
  * removed user-agent
  * changes done
  * Rework URL handling; clean up a little more text
  * Linted. Oops.
  Co-authored-by: Mike Stucka <[email protected]>
  Commit b139e7c
Commits on Sep 11, 2024
- Commit b6ae891
Commits on Sep 15, 2024
- Commit 42a0d85
- Commit 58dabc3
Commits on Sep 16, 2024
- fix: sdpd case_id pagination (biglocalnews#107)
  * fix: sdpd case_id pagination
  * simplify case_id
  Co-authored-by: Gerald Rich <[email protected]>
  Commit 1642a4b
Commits on Sep 27, 2024
- added scrape meta for fresno pd biglocalnews#114 (biglocalnews#115)
  * added scrape meta for fresno pd biglocalnews#114
  * file name changed fresno_county_sheriff
  * file name changed fresno_county_sheriff
  Commit c043916
Commits on Oct 1, 2024
- Commit 8774020
Commits on Oct 17, 2024
- Commit 19f5fd9
- added scrape-meta for oakland pd (biglocalnews#131)
  Co-authored-by: Gerald Rich <[email protected]>
  Commit db9cf2f
- Commit 36bb4af