-
Notifications
You must be signed in to change notification settings - Fork 2
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Users should be able to initiate a COPA Scraping job #56
Comments
Status indicators
|
Tasks done in e2e:
|
* [56] WIP: transform complete and tested, ready for augmentation step * [56] WIP: return None if there is no file to read * [56] WIP: use try except for trying to open file for reading, return None otherwise * [56] WIP: mock.call_count too high in build, attempt to unpatch functions between tests to ensure correct count * [56] WIP: Update transform test to ensure correct data is being passed to store_string calls * Update .travis.yml * Update .travis.yml * Update .travis.yml * [#56] completed transformation * [56] fixed flake8 issues * Update .travis.yml, fix parsing issue on travis-ci.com
AUGMENTATION: Table of data_allegationcategory Id category_name Table data_allegation (pre-augment) cr_id … current_category augment() Table data_allegation (post augment) cr_id … current_category |
…ranch version of endpoints
…object, copa scrape transformer add api error handling and update tests
…ts in relation to the added functions in copa_scrape_transformer
…not-copa data; and storing errors. Removed commented code. Added tests for the above.
… (1) copa scrape yields error; (2) not-copa scrape yields error: (3) both scrapes yield errors; and (4) no scrape contain errors
…sts fails while the other succeeds" This reverts commit e1b000d.
* [#56] WIP: Add test for adding augments copa recxord to db * Reformated test within test_augment.py * [Daisy] Debugging commi * [#56][Thalia/Everyone] Fix mypy commit error * [#56A] [Clari and Daisy] added react components header, and tab * [#56A][Clari, Thalia and Daisy] Added CSS style sheet for tabs and header. * [Daisy Octavia and Jovanka][#56A] Cleaned up CSS and finished applying proper front end design to header * [Octavia, Jovanka, and Daisy][#56A] Added components and css styling for button * [#56A][JK]WIP: add bg image and footer * [#56A][JK] WIP: bg image fix; styling * [#56A][Jole] WIP: add FOIA tab/placeholder; route; header included in status page and FOIA placeholder page for navigation * [#56][Jole] fix: failing test due to unwrapped link in Tab component
Overarching goal: A user should be able to trigger a process in the server that pulls data from the COPA website and imports new Allegations to the database.
Things to keep in mind:
Goals:
current category
column with a reference to the data_allegationcategory table for that particular categoryThe business need:
From Rajiv:
The primary purpose of this COPA Data Portal data capture step is to create incomplete/phantom complaint records in our database (for new complaints since our last successful FOIA response) so that we can have some matching data for the new documents that are being picked up by our crawlers/scrapers ( https://cpdp.co/crawlers and https:// cpdp.co/documents ).
The second purpose is to compare against the data that we have received via FOIA responses to whether we are missing any records (i.e., were any responsive complaint records omitted from our original dataset and if so which ones).
The third purpose is to compare different versions/snapshots of it over time and see what’s changing (is it just new records being added on to the end, or are older records being added, or removed, or altered).
From Basecamp:
The Civilian Office of Police Accountability (COPA) has just posted a new live data feed to the City's Open Data Portal that goes back 10 years. Here are a few early questions to investigate.
The text was updated successfully, but these errors were encountered: