Experimental Design
We estimate the impact of the PLA on violent crime and police practices using a large-scale, multi-city RCT. The study design incorporates two distinct approaches to recruitment and randomization among police agencies depending on their size. For large metropolitan police agencies, we randomize between high-violence police districts within agencies. The current and subsequent managers of the treatment group districts are invited to attend the PLA and outcomes are tracked at the district level. For midsize police agencies—those with populations of at least 40,000, but that are too small to support within-agency randomization—we randomize entire departments into the treatment and control groups. Current and subsequent managers at treatment departments are invited to attend the PLA, and we track outcomes over the entire jurisdiction of the randomized department.
The sample selection process begins by identifying agencies to invite to participate in the study. Guided by both a desire to have impact at scale and practical considerations related to the research design, agencies are identified based on three factors:
1. Part 1 violent crime incidents (homicides, rapes, aggravated assaults, and robberies) occurring within their jurisdiction according to the FBI’s Uniform Crime Reporting (UCR) Program and National Incident-Based Reporting System (NIBRS), or public crime data published by individual cities;
2. Number of geographic patrol districts or precincts; and
3. Availability of administrative data for measuring key outcomes.
Invited agencies large enough to support within-agency randomization are asked to identify districts meeting a minimum violence threshold, which are then paired based on crime and demographic characteristics. A random lottery assigns one district in each pair to treatment and control groups, assignments that will remain in place for the duration of the study.
Midsize agencies with too few violent districts to support within-agency randomization must meet a minimum citywide violence threshold to be considered for department-level randomization. Agencies that meet this threshold are paired with other eligible cities based on crime and demographic characteristics, and a random lottery assigns one agency from each pair to treatment and control groups.
After randomization occurs, a manager of the treatment unit (district or department) will be invited to attend an upcoming PLA cohort. To the extent possible, we will invite additional managers from the treatment unit to attend the PLA in later cohorts, particularly in cases where the original manager leaves the treatment unit due to promotion or reassignment.
We will measure the PLA’s effect on public safety using police administrative data. We are in the process of negotiating data use agreements with large police agencies to collect administrative data directly from departments that enable district-level tracking of outcomes. For midsize police agencies, we will rely primarily on publicly available administrative data reported to the NIBRS.
We will also collect novel survey and interview data to assess the impact of the PLA on leadership approaches and on the quality of management practices, directly measuring whether the intervention achieves its goal of improving police management. To measure these outcomes, we draw on primary data collected via interviews administered to treatment and control group members at baseline and again one-year post-PLA.
To estimate the effects of inviting the police manager(s) of a district (or department, in the case of midsize agencies) to attend the PLA, we will use ordinary least squares (OLS) regression to estimate a two-way fixed effects (TWFE) difference-in-differences specification with fixed effects for treatment units and time periods relative to randomization. The coefficient of interest is an indicator for whether a district is currently assigned to treatment. Due to random assignment of treatment, this coefficient provides an unbiased estimate of the intent-to-treat (ITT) effect of the PLA. This ITT estimate represents the treatment-control difference in the trend of the outcome in the post-treatment period relative to the pre-treatment period. The advantage of this approach is that the inclusion of district fixed effects explains a large amount of the variation in the outcome, allowing for much more precise estimation than would otherwise be possible with a given number of experimental districts. Because randomization will occur in pairs, the treatment assignment indicator is mechanically negatively correlated across districts within the same randomization block. As a result, we will estimate heteroskedasticity-robust standard errors clustering at the randomization block level.
We are also interested in estimating treatment-on-the-treated (TOT) effects of a district actually being overseen by a PLA-educated manager, rather than simply having its manager be invited to attend the PLA. To do this, we will use two-stage least squares (2SLS) to estimate an analogous specification, using the treatment assignment status of a district as an instrumental variable (IV) in the first stage to estimate the TOT effect in the second stage. The recent literature on TWFE estimators shows that they can produce biased estimates of average treatment effects in settings where treatment effects are heterogeneous and treatment adoption is staggered. Fortunately, in our context, even though treatment effects may be (and likely are) heterogeneous, treatment adoption, within pairs, is not staggered: each pair of districts is randomly assigned at the same time either to treatment or control, and comparisons are only made within this randomization block. There will therefore never be a situation in which an earlier-treated district acts as a control for a later-treated district.