Primary Outcomes (explanation)
Primary Outcome 1: Algorithm Treatment Effect
The algorithm treatment effect measures the causal impact of providing algorithmic decision support on operator classification accuracy and risk assessment patterns. This effect is identified through random assignment of operators to receive either algorithmic assistance or standard protocol. By random assignment, we ensure that potential outcomes under both conditions are independent of actual treatment status, conditional on stratification variables including operator experience level, shift assignment, and education. The average treatment effect can be estimated by comparing mean outcomes between randomly assigned treatment and control groups, yielding an unbiased estimate because randomization balances both observed and unobserved characteristics across groups. To examine whether the algorithm's impact varies by operator gender, we estimate conditional average treatment effects within each gender subgroup. Randomization stratified by (or balanced across) gender ensures that gender-specific treatment effects are identified within gender strata, allowing us to assess whether algorithmic assistance affects male and female operators differently.
Primary Outcome 2: Operator Gender Effect
The operator gender effect measures the causal relationship between operator gender and classification decisions, conditional on observable characteristics. Unlike algorithmic assistance, operator gender is not randomly assigned, so we cannot claim that potential outcomes are independent of gender without conditioning on observables. To identify gender effects, we invoke the conditional independence assumption: conditional on observed call characteristics and operator attributes, gender assignment is "as good as random" with respect to potential outcomes. This assumption is justified by several design features. First, calls are randomly assigned to operators within strata, ensuring that any gender differences in classification reflect differences in decision-making for identical calls rather than differential call exposure. Second, we control for extensive call-level characteristics extracted via NLP including weapons mentioned, escalation indicators, victim vulnerability, location, and timing. Third, we control for operator-level characteristics from administrative records including years of experience, education level, shift assignment, and prior VAW-specific training. Fourth, each operator classifies twenty calls with varying content, providing within-operator variation that differences out time-invariant operator characteristics. Under this conditional independence assumption, the gender effect can be estimated via regression adjustment controlling for call and operator observables.
Primary Outcome 3: Interaction Effect (Bias Mitigation)
The interaction effect tests whether algorithmic assistance differentially affects male versus female operators' classification patterns, representing a potential bias mitigation mechanism. This differential treatment effect is defined as the difference between the algorithm's impact on male operators versus its impact on female operators. Equivalently, it can be expressed as the change in the gender classification gap due to algorithmic assistance—comparing the gender gap in classification rates with the algorithm versus without it. Under random assignment of algorithmic assistance and the conditional independence assumption for gender, this interaction effect is identified by comparing treatment-control differences for male operators to treatment-control differences for female operators, conditional on call and operator characteristics. If the interaction effect equals zero, the algorithm equally affects male and female operators with no differential impact. If the interaction effect differs from zero, the algorithm has heterogeneous effects by gender. Of particular policy interest is the case where male operators underclassify risk in the control condition and the algorithm reduces this gender-based underclassification, indicating successful bias correction. This would manifest as a positive interaction effect that partially or fully closes the gender gap in risk classification.