Product: FacDB
The City Planning Facilities Database aggregates more than 35,000 records from 52 different public data sources provided by City, State, and Federal agencies.
While each source agency classifies its facilities according to its own naming system, we have grouped all facilities and program sites into the following seven categories to help planners navigate the data more easily:
- Health and Human Services
- Education, Child Welfare, and Youth
- Parks, Gardens, and Historical Sites
- Libraries and Cultural Programs
- Public Safety, Emergency Services, and Administration of Justice
- Core Infrastructure and Transportation
- Administration of Government
Within each of these domains, each record is further categorized into a set of facility groups, subgroups, and types that are intended to make the data easy to navigate and more useful for specific planning purposes. Facility types and names appear as they do in source datasets, wherever possible. A full listing of the facility categories is provided in the data dictionary.
Dataset Name | Facilities Database (FacDB) |
Agency Name | Department of City Planning |
Update Frequency | Quarterly |
Dataset Description | Facilities and program sites that are owned, operated, funded, licensed or certified by a City, State, or Federal agency |
Dataset Keywords | Facilities, Education, Child Welfare, Parks, Gardens, Historical Sites, Libraries, Cultural Programs, Public Safety, Emergency Services, Administration of Justice, Health Services, Human Services, Infrastructure, Transportation, Government Administration |
Dataset Category | City Government |
Additional Information | The Department of City Planning aggregates information about 33,000+ facilities and program sites that are owned, operated, funded, licensed or certified by a City, State, or Federal agency in the City of New York into a central database called the City Planning Facilities Database (FacDB). These facilities generally help to shape quality of life in the city’s neighborhoods, and this dataset is the basis for a series of planning activities. This public data resource allows all New Yorkers to understand the breadth of government resources in their neighborhoods. |
Each record in FacDB represents a facility site.
FacDB is the most comprehensive spatial data resource available for facilities run by public and non-public entities in NYC, but it does not claim to capture every facility within the specified domains. Some facilities are deliberately excluded from the data that source agencies provide in order to protect the safety and privacy of their clients. Also, many records could not be geocoded.
In some known cases, the address provided in the source data is for a headquarters office rather than the facility site location. Unfortunately, these could not be systematically verified. For more detailed information on a specific facility, reach out to the respective oversight agency.
Fair Share Analysis, Neighborhood studies, Facilities planning
Analysis Limitations. As a result of the data limitations and inconsistencies listed below, users should be careful in their use of this database so as to avoid developing suspect analyses. For example, a comparison of the density or accessibility of facilities across neighborhoods should recognize that some of the facilities included are organizational headquarters rather than service sites and that this database is not authoritatively comprehensive. In addition, we rely on source data from other agencies to populate the database, and some of these sources may fall out of date. Users can find the date of each source dataset’s latest update in the source data dictionary.
Missing Records. Currently, FacDB is the most comprehensive spatial data resource available for facilities run by public and non-public entities in NYC, but it does not claim to capture every facility within the specified domains. Some facilities are deliberately excluded from the data that source agencies provide in order to protect the safety and privacy of their clients. Also, many records could not be geocoded. To learn more about how the data are processed, please review the Data Sources and Compilation Process.
Duplicates. Please be aware that this dataset may include duplicate records for the same facility because several source datasets have content that overlaps.
Administrative Addresses. In some known cases, the address provided in the source data is for a headquarters office rather than the facility site location. Unfortunately, these could not be systematically verified. For more detailed information on a specific facility, reach out to the respective oversight agency.
Public Accessibility of Sites. DCP is unable to verify the public accessibility of all sites. For example, some playgrounds or playing fields may only be accessible to participants in certain programs.
Since the facility records are aggregated from many datasets designed for different purposes, the data is transformed over several stages to reach its final state. The stages are described below, and all the scripts used are available on the NYC Planning GitHub page.
Data loading. Since the source datasets are maintained by various agencies and updated at different frequencies, they are first loaded into Amazon S3, which serves as a centralized data hub for the downstream data processing. The list of data sources can be found here.
Geoprocessing. When a record has address information, its geometry is the centroid of the DoITT building footprint that matches the BIN returned by Geosupport. If a BIN is not available, the latitude and longitude returned by Geosupport are used to create the geometry for the record. If Geosupport returns neither, but the source data has spatial information (i.e. coordinates), the geometry is created from the source data; where the source data consists of polygon geometries, the centroid of the polygon is used. In some cases the coordinates from the source data fall in the roadbed rather than inside a BBL boundary because of the geocoding technique used by the source. Lastly, if a geometry cannot be assigned from the BIN, the latitude/longitude from Geosupport, or the source data, the centroid of the BBL from the clipped MapPLUTO is used. Other geographic information, such as the community district, is taken from Geosupport if a value is returned; otherwise administrative districts are assigned via spatial joins where the record has a geometry.
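The fallback order described in the Geoprocessing step can be sketched as a simple priority chain. This is a minimal illustration under stated assumptions, not the pipeline's actual code: the `Record` fields and the `pick_geometry` helper are hypothetical names, and locations are plain (longitude, latitude) tuples rather than real geometries.

```python
from dataclasses import dataclass
from typing import Optional, Tuple

Point = Tuple[float, float]  # (longitude, latitude)


@dataclass
class Record:
    """Hypothetical container for the candidate locations of one facility."""
    bin_centroid: Optional[Point] = None      # centroid of the building footprint matched by BIN
    geosupport_point: Optional[Point] = None  # latitude/longitude returned by Geosupport
    source_point: Optional[Point] = None      # coordinates or polygon centroid from the source data
    bbl_centroid: Optional[Point] = None      # centroid of the tax lot from the clipped MapPLUTO


def pick_geometry(record: Record) -> Tuple[Optional[Point], Optional[str]]:
    """Return the first available location and a label naming where it came from."""
    candidates = [
        ("bin_centroid", record.bin_centroid),
        ("geosupport", record.geosupport_point),
        ("source", record.source_point),
        ("bbl_centroid", record.bbl_centroid),
    ]
    for label, point in candidates:
        if point is not None:
            return point, label
    return None, None  # no geometry could be assigned to this record
```

Returning the label alongside the point is a design choice of this sketch; it makes it easy to count how many records fell back to each method.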
Duplicate Record Removal. Several of the source datasets have content that overlaps. Duplicate records were identified by querying for all the records that fall within the same BIN or BBL and have the same Facility Subgroup or Type, same Facility Name, or same Oversight Agency. Where duplicate records were identified, all but the primary record were removed from the database.
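One possible reading of the duplicate rule above is sketched below: two records are duplicates when they share a BIN or BBL and also match on at least one of subgroup, type, name, or oversight agency. The helper names are hypothetical, the column names follow the FacDB field names (`bin`, `bbl`, `facsubgrp`, `factype`, `facname`, `overagency`), and treating the first record seen as the "primary" one is an assumption, since the text does not say how the primary record is chosen.

```python
from typing import Dict, List, Optional

Facility = Dict[str, Optional[str]]


def same_location(a: Facility, b: Facility) -> bool:
    """True when two records share a non-empty BIN or a non-empty BBL."""
    return any(a.get(k) and a.get(k) == b.get(k) for k in ("bin", "bbl"))


def same_attributes(a: Facility, b: Facility) -> bool:
    """True when subgroup or type, facility name, or oversight agency match."""
    return any(
        a.get(k) and a.get(k) == b.get(k)
        for k in ("facsubgrp", "factype", "facname", "overagency")
    )


def drop_duplicates(records: List[Facility]) -> List[Facility]:
    """Keep the first record of each duplicate set (assumed to be the primary)."""
    kept: List[Facility] = []
    for rec in records:
        # Compare against records already kept; this is quadratic and only a sketch.
        if not any(same_location(rec, k) and same_attributes(rec, k) for k in kept):
            kept.append(rec)
    return kept
```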
The facilities database is being provided by the Department of City Planning (DCP) for informational purposes only. DCP does not warrant the completeness, accuracy, content, or fitness for any particular purpose or use of the dataset, nor are any such warranties to be implied or inferred with respect to the dataset as furnished on the website.
DCP and the City are not liable for any deficiencies in the completeness, accuracy, content, or fitness for any particular purpose or use of the dataset, or applications utilizing the dataset, provided by any third party. The City Planning Facilities Database (FacDB) is only as good as the source data it aggregates, and the Department of City Planning cannot verify the accuracy of all records. Please read more about specific data and analysis limitations before using this data.
-
bpl_libraries
- Source: Scraped from BPL website
- Source url: https://www.bklynlibrary.org/locations/json
- To load: run scraper via the recipe repo
-
nypl_libraries
- Source: Scraped from NYPL website
- Source url: https://www.nypl.org/locations/list
- To load: run scraper via the recipe repo
-
uscourts_courts
- Source: Court locator for NY state
- Source url: http://www.uscourts.gov/court-locator/city/New%20York/state/NY
- To load: run scraper via the recipe repo
-
dca_operatingbusinesses
- Source: OpenData
- Source table name/url: https://data.cityofnewyork.us/Business/Legally-Operating-Businesses/w7w3-xahh
-
dcp_colp
- Source: Bytes
- Source table name/url: https://www1.nyc.gov/site/planning/data-maps/open-data.page#city_facilities
-
dcla_culturalinstitutions
- Source: OpenData
- Source table name/url: https://data.cityofnewyork.us/Recreation/DCLA-Cultural-Organizations/u35m-9t32
-
dfta_contracts
- Source: OpenData
- OpenData table name/url: https://data.cityofnewyork.us/Social-Services/DFTA-Contracts/6j6t-3ixh
-
doe_busroutesgarages
- Source: OpenData
- OpenData table name/url: https://data.cityofnewyork.us/Transportation/Routes/8yac-vygm
-
sca_enrollment_capacity
- Source: OpenData
- OpenData table name/url: https://data.cityofnewyork.us/Education/Enrollment-Capacity-And-Utilization-Reports-Target/8b9a-pywy
-
dohmh_daycare
- Source: OpenData
- OpenData table name/url: https://data.cityofnewyork.us/Health/DOHMH-Childcare-Center-Inspections/dsg6-ifza
-
dpr_parksproperties
- Source: OpenData
- OpenData table name/url: https://nycopendata.socrata.com/Recreation/Parks-Properties/enfh-gkve
- NOTE: DPR open data table URLs are not consistent. Be sure to double-check before running from the recipes app.
-
dsny_garages
- Source: OpenData
- OpenData table name/url: https://data.cityofnewyork.us/Environment/DSNY-Garages/xw3j-2yxf
-
dsny_specialwastedrop
- Source: OpenData
- OpenData table name/url: https://data.cityofnewyork.us/Environment/DSNY-Special-Waste-Drop-off-Sites/242c-ru4i
-
dsny_textiledrop
- Source: OpenData
- OpenData table name/url: https://data.cityofnewyork.us/Environment/Textile-Drop-Off-Locations-in-NYC/qnjm-wvu5
-
dsny_leafdrop
- Source: OpenData
- OpenData table name/url: https://data.cityofnewyork.us/Environment/Leaf-Drop-Off-Locations-in-NYC/8i9k-4gi5
-
dsny_fooddrop
- Source: OpenData
- OpenData table name/url: https://data.cityofnewyork.us/Environment/Food-Scrap-Drop-Off-Locations-in-NYC/if26-z6xq
-
dsny_electronicsdrop
- Source: OpenData
- OpenData table name/url: https://data.cityofnewyork.us/Environment/Electronics-Drop-Off-Locations-in-NYC/wshr-5vic
-
dycd_afterschoolprograms
- Source: OpenData
- OpenData table name/url: https://data.cityofnewyork.us/Education/DYCD-after-school-programs/mbd7-jfnc
-
fdny_firehouses
- Source: OpenData
- OpenData table name/url: https://data.cityofnewyork.us/Public-Safety/FDNY-Firehouse-Listing/hc8x-tcnd
-
nycha_communitycenters
- Source: OpenData
- OpenData table name/url: https://data.cityofnewyork.us/Social-Services/Directory-of-NYCHA-Community-Facilities/crns-fw6u
-
hhc_hospitals
- Source: OpenData
- OpenData table name/url: https://data.cityofnewyork.us/Health/Health-and-Hospitals-Corporation-HHC-Facilities/f7b6-v6v3
-
hra_jobcenters
- Source: OpenData
- OpenData table name/url: https://data.cityofnewyork.us/Business/Directory-Of-Job-Centers/9d9t-bmk7
-
hra_medicaid
- Source: OpenData
- OpenData table name/url: https://data.cityofnewyork.us/City-Government/Medicaid-Offices/ibs4-k445
-
hra_snapcenters
- Source: OpenData
- OpenData table name/url: https://data.cityofnewyork.us/Social-Services/Directory-of-SNAP-Centers/tc6u-8rnp
-
nycha_policeservice
- Source: OpenData
- OpenData table name/url: https://data.cityofnewyork.us/Housing-Development/NYCHA-PSA-Police-Service-Areas-/72wx-vdjr
-
nysdec_solidwaste
- Source: OpenData
- OpenData table name/url: https://data.ny.gov/Energy-Environment/Solid-Waste-Management-Facilities/2fni-raj8
-
nysdoh_healthfacilities
- Source: OpenData
- OpenData table name/url: https://health.data.ny.gov/Health/Health-Facility-General-Information/vn5v-hh5r
-
nysdoh_nursinghomes
- Source: OpenData
- OpenData table name/url: https://health.data.ny.gov/Health/Nursing-Home-Weekly-Bed-Census-Last-Submission/izta-vnpq
-
nysomh_mentalhealth
- Source: OpenData
- OpenData table name/url: https://data.ny.gov/Human-Services/Local-Mental-Health-Programs/6nvr-tbv8
-
nysopwdd_providers
- Source: OpenData
- OpenData table name/url: https://data.ny.gov/Human-Services/Directory-of-Developmental-Disabilities-Service-Pr/ieqx-cqyk
-
nysparks_historicplaces
- Source: OpenData
- OpenData table name/url: https://data.ny.gov/Recreation/National-Register-of-Historic-Places/iisn-hnyv
-
nysparks_parks
- Source: OpenData
- OpenData table name/url: https://data.ny.gov/Recreation/State-Park-Facility-Points/9uuk-x7vh
-
qpl_libraries
- Source: OpenData
- OpenData table name/url: https://data.cityofnewyork.us/Education/Queens-Library-Branches/kh3d-xhq7
-
sbs_workforce1
- Source: OpenData
- OpenData table name/url: https://data.cityofnewyork.us/dataset/Center-Service-Locations/6smc-7mk6
-
moeo_socialservicesitelocations
- Source: OpenData
- OpenData table name/url: https://data.cityofnewyork.us/City-Government/Verified-Locations-for-NYC-City-Funded-Social-Serv/2bvn-ky2h
-
usdot_airports
- Source: OpenData
- OpenData table name/url: https://hub.arcgis.com/datasets/usdot::airports
- Head to url >> api >> copy url from geojson
-
usdot_ports
- Source: OpenData
- OpenData table name/url: https://hub.arcgis.com/datasets/usdot::ports
- Head to url >> api >> copy url from geojson
-
nysdec_lands
- Source: DEC Website
- Source url: http://gis.ny.gov/gisdata/inventories/details.cfm?DSID=1114
- update through recipe app
-
fbop_corrections
- Source: Bureau of Prisons
- Source url: https://www.bop.gov/locations/list.jsp
- When searching by state, there should be 5 NY prisons, 3 of which are in NYC (Brooklyn/New York)
-
nycdoc_corrections
- Source: NYCDOC locations directory
- Source url: https://www1.nyc.gov/site/doc/about/facilities-locations.page
-
nycourts_courts
- Source: NY Courts website
- Source url: http://www.nycourts.gov/courts/nyc/criminal/generalinfo.shtml#BRONX_COUNTY
-
nysdoccs_corrections
- Source: DOCCS NY
- Source url: https://doccs.ny.gov/find-facility
- Hand check for 1 facility in Queens, 1 facility in Manhattan, and 0 in the other 3 boroughs. Only look at the correctional facility locations, not the offices.
-
dcp_pops
- Source: Download from POPs app, available on DCP Commons. Be sure to only take the public version.
- Be sure to do this source last, as the OpenData release of POPs needs to be in sync
-
doe_lcgms
- Source: DOE
- Source url: external link available through https://data.cityofnewyork.us/Education/LCGMS-DOE-School-Information-Report/3bkj-34v2/
- This dataset is updated for CEQR
-
dot_bridgehouses
- Source: Will receive via email or FTP
-
dot_ferryterminals
- Source: Will receive via email or FTP
-
dot_mannedfacilities
- Source: Will receive via email or FTP
-
dot_publicparking
- Source: Will receive via email or FTP
-
dot_pedplazas
- Source: Will receive via email or FTP
-
foodbankny_foodbanks
- Source: Foodbank NYC
- Source url: http://www.foodbanknyc.org/get-help/
- Go to the expanded view of the Google map. Click “Download KML” under the options (three dots). Instead of “Entire Map,” select “Food Bank For NYC Open Sites.” Select “Keep data up to date with network link KML (only usable online).” Go to https://mygeodata.cloud/converter/kmz-to-csv to convert the KMZ to CSV, then use the recipe app to load the CSV.
-
nysed_activeinstitutions
- Source: NYSED
- Source url: "All Institutions: Active Institutions with GIS coordinates and OITS Accuracy Code - Select by County" CSV from https://eservices.nysed.gov/sedreports/list?id=1
-
nysed_nonpublicenrollment
- Source: NYSED IRS site
- Source url: Nonpublic Enrollment by Grade from http://www.p12.nysed.gov/irs/statistics/nonpublic/
-
nysoasas_programs
- Source: OASAS website
- Source url: https://webapps.oasas.ny.gov/providerDirectory/index.cfm?search_type=2
- Download all treatment providers
- Modify download URL to contain today’s date:
https://webapps.oasas.ny.gov/providerDirectory/download/Treatment_Providers_OASAS_Directory_Search_13-Nov-20.csv
-
usnps_parks
- Source: USNPS website
- Source url: https://irma.nps.gov/DataStore/Reference/Profile/2225713
- NOTE: the final number in the URL (2225713) is not always stable. If the data is missing, search for it from the site's home page.
- dep_wwtc
- doe_universalprek
- Source:
- Source url: https://maps.nyc.gov/prek/data/pka/pka.csv
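For the nysoasas_programs source listed above, the instruction to put today's date into the download URL can be sketched as follows. The helper name `oasas_download_url` is hypothetical, and the date pattern (two-digit day, three-letter English month, two-digit year) is inferred only from the example `13-Nov-20`; whether single-digit days are zero-padded is an assumption, and the endpoint itself may no longer be available.

```python
from datetime import date
from typing import Optional

# URL pattern taken from the example in the source list; {stamp} is the date part.
BASE_URL = (
    "https://webapps.oasas.ny.gov/providerDirectory/download/"
    "Treatment_Providers_OASAS_Directory_Search_{stamp}.csv"
)

# Explicit month names avoid depending on the machine's locale.
_MONTHS = ["Jan", "Feb", "Mar", "Apr", "May", "Jun",
           "Jul", "Aug", "Sep", "Oct", "Nov", "Dec"]


def oasas_download_url(day: Optional[date] = None) -> str:
    """Build the download URL for a given day (defaults to today)."""
    day = day or date.today()
    stamp = f"{day.day:02d}-{_MONTHS[day.month - 1]}-{day.year % 100:02d}"
    return BASE_URL.format(stamp=stamp)
```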
- Ingest dataset using custom function
- Filter dataset if filters are straightforward, an example of this is filtering state data by county.
- Clean any fields as much as is necessary to use them as inputs in geocoding functions
- Use decorators to geocode as much as possible
- If BBL exists, use function BL
- If BIN exists, use function BN
- If house number, street name, borough and/or zipcode exist, pass into 1B directly
- If address, borough and/or zipcode exist, pass into parse address then into 1B directly
- If no address, bin, and bbl info exist, pass without geocoding
- Do source-specific manipulations in SQL to create the table _{dataset}
- Map source data fields directly to facdb fields
- facname
- factype
- datasource
- facsubgrp
- opname
- optype
- overagency
- capacity
- captype
- proptype
- Include any filtering that is not straightforward in python
- Map source data fields directly to facdb fields
- Combine records from _{dataset} tables together into single table.
- Standardize across datasets
- opname
- overagency
- Assign values by taking from cleaned Geosupport inputs or from lookup tables
- boro
- addressnum
- streetname
- address
- city
- zipcode
- bin
- bbl
- facgroup
- facdomain
- servarea
- opabbrev - lookup with opname (opname should be standardized)
- overabbrev - lookup with overagency (overagency should be standardized, need to decide if we should take Green Book standard)
- overlevel
- Assign geographic attributes and check that values are consistent across a record (need to decide logic about when we take from Geosupport versus source, and which Geosupport functions have priority)
- addressnum
- streetname
- address
- city
- zipcode
- bin
- bbl
- latitude
- longitude
- xcoord
- ycoord
- commboard
- nta
- council
- censtract
- geom
- Standardize across datasets
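The geocoding rules in the steps above (BL for a BBL, BN for a BIN, 1B for address parts, address parsing before 1B, otherwise no geocoding) can be sketched as a small dispatch function. This is an illustration only: `GeocodePlan`, `choose_geocoder`, and the input key names are hypothetical, the function does not call Geosupport itself, and checking BBL before BIN is an assumed order that the steps do not state.

```python
from dataclasses import dataclass, field
from typing import Dict, Optional


@dataclass
class GeocodePlan:
    """Which Geosupport function to call, with which inputs."""
    function: Optional[str]                       # "BL", "BN", "1B", or None
    inputs: Dict[str, str] = field(default_factory=dict)
    parse_address_first: bool = False             # True when a raw address must be parsed before 1B


def choose_geocoder(row: Dict[str, str]) -> GeocodePlan:
    """Pick a geocoding route for one source row based on the fields it has."""
    def has(*keys: str) -> bool:
        return all(row.get(k) for k in keys)

    if has("bbl"):
        return GeocodePlan("BL", {"bbl": row["bbl"]})
    if has("bin"):
        return GeocodePlan("BN", {"bin": row["bin"]})

    # A borough and/or ZIP code is needed to place a street address.
    area = {k: row[k] for k in ("borough", "zipcode") if row.get(k)}
    if area and has("house_number", "street_name"):
        parts = {"house_number": row["house_number"], "street_name": row["street_name"]}
        return GeocodePlan("1B", {**parts, **area})
    if area and has("address"):
        return GeocodePlan("1B", {"address": row["address"], **area}, parse_address_first=True)

    return GeocodePlan(None)  # no usable address, BIN, or BBL: pass through without geocoding
```

Separating the choice of function from the call itself keeps the rule easy to test without a Geosupport installation.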
-
Longform Name:
ID
- Description: Unique ID of the record
-
Longform Name:
Facility name
- Description: Name of the facility in proper case as received from the source data
-
Longform Name:
Type
- Description: Value representing the specific type of facility, which is the most granular category of facilities. This value is often taken directly from the source data
-
Longform Name:
Subgroup
- Description: Value identifying the subgroup the facility belongs to based on the facility type. Subgroup values are assigned by DCP
-
Longform Name:
Group
- Description: Value identifying the group the facility belongs to based on the subgroup
-
Longform Name:
Domain
- Description: Value identifying the domain the facility belongs to based on the group. Domain is the broadest categorical grouping
-
Longform Name:
Service area
- Description: Value identifying whether the extent of the area the facility serves is local or regional
-
Longform Name:
Operator name
- Description: Name of the operating entity
-
Longform Name:
Operator acronym
- Description: Abbreviation for the operating entity
-
Longform Name:
Operator type
- Description: Indicates whether the operating entity is public or non-public
-
Longform Name:
Oversight agency name
- Description: Name of the oversight agency
-
Longform Name:
Oversight agency acronym
- Description: Abbreviation for the oversight agency
-
Longform Name:
Oversight level
- Description: The level of government of the oversight agency: City, State, City-State, Federal, or Non-public Oversight
-
Longform Name:
Capacity
- Description: Number of units, measured in the capacity type, that the facility is intended to hold.
-
Longform Name:
Capacity type
- Description: Value representing the unit type of capacity, such as beds, visitors, seats, etc.
- removed from dataset
-
Longform Name:
Property type
- Description: x
-
Longform Name:
House number
- Description: Address number of where the facility is located according to GeoSupport
-
Longform Name:
Street name
- Description: Street name where the facility is located, according to GeoSupport
-
Longform Name:
Address
- Description: Concatenated value of AddressNumber and StreetName of where the facility is located
-
Longform Name:
City
- Description: City name where the facility is located according to GeoSupport
-
Longform Name:
Zipcode
- Description: Zip code of address from GeoSupport
-
Longform Name:
Borough
- Description: Full name of the borough the facility is within
-
Longform Name:
Borough Code
- Description: The one-digit code of the borough the facility is within
-
Longform Name:
BIN
- Description: BIN value of the building the facility is located in. If the facility spans multiple buildings only one BIN is reported
-
Longform Name:
BBL
- Description: BBL values for the tax lots the facility is located on. If the facility spans multiple lots only one BBL is reported
-
Longform Name:
Latitude
- Description: Latitude of the location as returned by Geosupport, or calculated using the coordinates in or geometry from the source data
-
Longform Name:
Longitude
- Description: Longitude of the location as returned by Geosupport, or calculated using the coordinates in or geometry from the source data
-
Longform Name:
X coord
- Description: X Coordinate of the location as returned by Geosupport, or calculated using the coordinates in or geometry from the source data
-
Longform Name:
Y coord
- Description: Y Coordinate of the location as returned by Geosupport, or calculated using the coordinates in or geometry from the source data
-
Longform Name:
Community district
- Description: Community District the facility is within according to Geosupport
-
Longform Name:
NTA code
- Description: Code of the NTA the facility is within according to Geosupport
-
Longform Name:
Council district
- Description: Council district the facility is within according to Geosupport
-
Longform Name:
School district
- Description: School district the facility is within according to Geosupport
-
Longform Name:
Police precinct
- Description: Police precinct the facility is within according to Geosupport
-
Longform Name:
Census tract
- Description: Census tract the facility is within according to Geosupport
-
Longform Name:
Source dataset
- Description: Name of the dataset the record came from
-
Longform Name:
Geometry
- Description: Spatial data component
+ status: NA
+ comments: discontinued, we are no longer using this data source
+ status: updated
+ status: updated
+ status: NA
+ comments: no new version of COLP released yet (using 2018 November version on Bytes)
+ status: updated
+ status: updated
+ comments: downloaded from the POPS app
+ status: NA
+ comments: this dataset doesn't need updates
+ status: updated
+ status: updated
+ status: updated
+ comments: this dataset is updated for CEQR
+ status: updated
+ status: updated
+ status: updated
+ status: NA
+ comments: might need refresh from FTP
+ status: NA
+ comments: might need refresh from FTP
+ status: NA
+ comments: might need refresh from FTP
+ status: NA
+ comments: might need refresh from FTP
+ status: NA
+ comments: might need refresh from FTP
+ status: updated
+ status: NA
+ comments: might need refresh from FTP
+ status: updated
+ status: NA
+ comments: doesn't need update, no new facilities added
+ status: updated
+ status: NA
+ comments: need to scrape data from google map and the downloaded KML does not have spatial info
+ status: updated
+ status: updated
+ comments: new data source https://data.cityofnewyork.us/City-Government/Community-Health-Centers/b2sp-asbg/data
+ status: NA
+ comments: receive by email
+ status: NA
+ comments: no update needed, hand checked no new facilities added
+ status: updated
+ status: updated
+ status: NA
+ comments: hand checked, no update needed
+ status: updated
+ comments: not sure there are new libraries added, but the scraper worked
+ status: updated
+ comments: for some reason gdal won't read the link, so I had to update manually. Not sure if new records were added
+ status: updated
+ status: updated
+ comments: 1 facility in Queens, 1 facility in Manhattan, 0 in the other 3 boroughs.
+ status: updated
+ status: updated
+ status: updated
+ comments: manually downloaded the selected table "All Institutions: Active Institutions with GIS coordinates and OITS Accuracy Code - Select by County" as CSV from [website](https://eservices.nysed.gov/sedreports/list?id=1) and loaded into S3
+ status: updated
+ comments: this data set was not previously included in the list for some reason.
+ status: updated
+ comments: original link no longer works, switched to https://edm-recipes.nyc3.digitaloceanspaces.com/2020-03-23/Treatment_Providers_OASAS_Directory_Search_23-Mar-20.csv
+ status: updated
+ status: updated
+ status: updated
+ status: updated
+ status: updated
+ status: updated
+ status: updated
+ comments: scraper ran smoothly, not sure if new facilities were added
+ status: updated
+ comments: the ArcGIS site says updated 2020/02/17, no url change
+ status: updated
+ comments: url changed to https://data-usdot.opendata.arcgis.com/datasets/major-ports-1, data is as of 2019/12/17
+ status: updated
+ comments: manually downloaded from url and loaded into s3