Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

WIP refactor #109

Open
wants to merge 5 commits into
base: main
Choose a base branch
from
Open

WIP refactor #109

wants to merge 5 commits into from

Conversation

SPTKL
Copy link
Contributor

@SPTKL SPTKL commented Oct 13, 2021

Refactor

ACS - CDTA #105

✅ Crime - CDTA via spatial join

  • we only switched out the spatial boundary from cd to cdta, everything else remains mostly the same

Facdb - CDTA via spatial join

  • since the version of facdb with cdta2020 is not published yet, we are using spatial join to get cdta2020 for facdb

would need to reconfirm methodology

Floodplain - CDTA

  • the acs part of the data is coming from db-factfinder, the acs_community_profiles pipeline
  • the decennial part comes from the published decennial pl release data Add pipeline to incorporate PL data #107. we use block level population and join with lookup_geo to calculate floodplain population

would need to reconfirm methodology

✅ Park accss - CDTA

  • similar method as floodplain, we take block level decennial population and join with lookup_geo to calculate park access

pluto - CDTA via spatial join

  • the pluto attributes are using spatial join, since cdta is not a field in pluto. We are currently using lat/lon, which lead to some records not assigned to a cdta (because point in water & etc)

would need to reconfirm methodology
Question: are we using shoreline clipped or unclipped, and centroid or polygon to do spatial join?

Poverty - CD (not refactored)

  • currently the data is coming from a static csv stored in the data folder.
  • the data is on CD level
    • Need to get updated data. Came from Population or Planning Coordination. Can we get at CDTA level?

✅ Sanitation - CD (not refactored)

  • the santiation data from open data is cd level.

Static Data

  • cb_contact
  • cd_decennial_pop -> need from population
  • cd_poverty
  • cd_puma -> is puma still relevant?
  • cd_son (cd and neighborhood names)
  • cd_title
  • cd_to_block -> how is this relevant? this table is not used anywhere, consider remove
  • cd_tooltips -> someone would need to review this
  • crimes_2019.csv -> probably would need to be crime 2020

@SPTKL
Copy link
Contributor Author

SPTKL commented Oct 14, 2021

#111

@AmandaDoyle
Copy link
Member

Going to be a conversation with decision - use CD where we've had CD and CDTA where we've had PUMA?

@AmandaDoyle
Copy link
Member

ACS and Floodplain should be good with how it is.

@AmandaDoyle
Copy link
Member

Need to get official population counts from Population

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants