Define Dataset: State Incarceration (Crime and Justice) #38

emily878 · 2015-03-06T22:40:12Z

Define the essential substantive elements of the core State Incarceration dataset. What are the components that it must minimally include? Do we have a dataset that we could hold up as a model?

emily878 · 2015-03-13T20:08:27Z

Hey, @beccasjames - could you help us out with a list of minimal necessary data elements?

beccasjames · 2015-03-20T20:15:09Z

Core elements for a particular state incarceration dataset, inmate population data, would include the following elements:

be regularly updated and archived, daily or weekly preferred
include number of inmates in each facility
include at what percentage of capacity the facility is operating
include numerical and percent change in population from same time period of previous year

A model (or "token") dataset can be found at the California Department of Corrections and Rehabilitation (CDCR). They produce weekly and monthly population reports for both inmate and parole populations, including an extensive archive: http://www.cdcr.ca.gov/Reports_Research/Offender_Information_Services_Branch/Population_Reports.html

Further, an ideal inmate population dataset would:

be in a machine readable format (.csv)
include breakdown of how many inmates are in maximum, medium and minimum security units as well as how many are in solitary confinement

As of now, I have yet to identify a state that fulfills all of these requirements. If discovered, updates will be provided.

waldoj · 2015-03-20T20:16:53Z

Is it desirable, or even possible, to have identifiable, per-prisoner granularity?

emily878 · 2015-03-20T20:20:16Z

Becca and I talked about that and I personally don't think we want that as
our first cut at a dataset. It will increase the visibility of people's PII
in a way that I think will be problematic for the project.

On Fri, Mar 20, 2015 at 4:16 PM, Waldo Jaquith [email protected]
wrote:

Is it desirable, or even possible, to have identifiable, per-prisoner
granularity?

—
Reply to this email directly or view it on GitHub
#38 (comment)
.

Emily Shaw
National Policy Manager | Sunlight Foundation |
(o) 202-742-1520 x 282 | (c) 207-233-5684
@emilydshaw http://twitter.com/emilydshaw

beccasjames · 2015-03-20T20:26:24Z

Echoing Emily here, the PII shared with inmate-level micro-data is potentially problematic. A few states actually do produce extensive, machine-readable datasets with inmate-level micro-data. If you're interested in what those look like, see examples below:

Texas (caution: link will download large .xls file)
Nebraska

waldoj · 2015-03-20T20:38:26Z

Got it—thank you!

waldoj · 2015-03-20T20:44:11Z

That Nebraska data is the weirdest thing. It's an Excel spreadsheet with two worksheets—one with 60,000 records, one with a suspicion-inducing 65,535—that contain just one row, with one number in each row. I feel a bit like I just bought a hard drive at Best Buy, got it home, opened the box, and found only a brick inside.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Define Dataset: State Incarceration (Crime and Justice) #38

Define Dataset: State Incarceration (Crime and Justice) #38

emily878 commented Mar 6, 2015

emily878 commented Mar 13, 2015

beccasjames commented Mar 20, 2015

waldoj commented Mar 20, 2015

emily878 commented Mar 20, 2015

beccasjames commented Mar 20, 2015

waldoj commented Mar 20, 2015

waldoj commented Mar 20, 2015

Define Dataset: State Incarceration (Crime and Justice) #38

Define Dataset: State Incarceration (Crime and Justice) #38

Comments

emily878 commented Mar 6, 2015

emily878 commented Mar 13, 2015

beccasjames commented Mar 20, 2015

waldoj commented Mar 20, 2015

emily878 commented Mar 20, 2015

beccasjames commented Mar 20, 2015

waldoj commented Mar 20, 2015

waldoj commented Mar 20, 2015