- How to run
- TBD
Bail is the fee a defendant can pay to be released from custody pending trial. The purpose of bail is to ensure that the defendant appears in court for their trial. If the defendant fails to appear in court, the bail is forfeited. Otherwise, the defendant recoups the expense.
The bail system is controversial because it can lead to the pretrial detention of individuals who cannot afford to pay bail. In New York City, this often means a stint at Rikers Island, a notorious jail complex that has been the subject of numerous investigations and lawsuits.
CPL 510.10 outlines what judges can and cannot consider when setting bail.
The purpose of this project is to understand how closely judges in New York State adhere to the law, as outlined in CPL 510.10, when setting bail.
- Create a "New York Judge" profile by analyzing the bail decisions of judges in New York State. Use regression models such as Gradient Boosting and Random Forest to assign importance values to various features that might be relevant to the case.
- Create a similar profile for each county: Identify what correlations are stronger with a Kings County judge's decisions than a Richmond County judge's decisions when compared to the average New York State judge.
- Assign profiles to every judge in the state, comparing them to judges in their county and state.
- Are there factors that have an correlation on the amount of bail set?
- What factors have a high importance value that shouldn't, e.g., race, gender, median income of the county?
- What factors have a higher importance value to some judges than others?
Important
Correlation ≠ Causation. This model is good at noticing patterns, but responsible inferences shouldn't take us further than appreciating correlations between datapoints—i.e. just because a judge's bail amounts reflect a strikingly high race-importance, for example, we ought not to assume race plays into the bail decisions. This model is more interested in questions than answers, and is certainly not meant to tar-and-feather anyone in the justice system.
Note
This is not and will never be a tool to be used by counsel. It can't teach a lawyer what to emphasize on their client's behalf. This model will never be as effective as a reasonably perceptive defense attorney. This project is only to understand the extent that judges in New York State abide by the rules laid out in CPL 510.10.
Caution
This data is incomplete, both in its range and its scope. In a perfect world (or one where compute is cheaper and court records are more thorough), this model would take bail eligibility into account, and set ROR "release on recognizance" decisions to a bail amount of $0.00, which would give us deeper and more complete insight into a correlations between bail amounts and other factors. As it is currently running, the model only accounts for bail amounts > 0.
- Clone the repository.