Randomization Method
Method: Computerized stratified block randomization conducted by the Secretaría
de Hacienda de Medellín using R statistical software
Procedure:
The Tax Authority uses a random number generator to assign firms to treatment
groups. Randomization is conducted on the message deployment date using the
following steps:
1. Stratification: Firms are sorted into strata based on:
- Taxpayer type (2 categories: natural person vs. legal entity)
- Economic sector (4 categories: construction, restaurants, tourism/RNT,
financial services)
- Firm size (4 quartiles based on log of taxable base)
- Firm age (4 groups: pre-2000, 2000-2009, 2010-2019, 2020 onwards)
2. Block randomization within strata: Within each stratum, firms are randomly
assigned to one of four groups (1=Control, 2=Social Norm, 3=Deterrence,
4=Combined) with equal probability using computer-generated random numbers
3. Implementation: The Tax Authority executes the randomization algorithm and
generates the treatment assignment list. Each firm receives a unique treatment
code determining which message variant they receive.
4. Documentation: The randomization seed and complete assignment list are stored
securely by the Tax Authority. Researchers receive only anonymized data with
treatment codes (no firm identifiers).
Fixed Seeds (for reproducibility):
- Phase 1 (May 2026): seed = 12345
- Phase 2 (July-August 2026): seed = 123452
Timing: Randomization occurs on the same day as message dispatch to minimize
time between assignment and treatment delivery.
Unit of randomization: Individual firm (identified by NIT - Número de
Identificación Tributaria)
Level: Firm-level randomization. Each firm is independently assigned to one of
the four treatment groups within its stratum. There is no group-level or
cluster-level randomization. Each firm is randomized individually, even if
multiple firms share the same address, ownership, or accountant.
Stratification note: While randomization occurs at the firm level, it is
stratified by firm characteristics to ensure balance across groups. Stratification
does not constitute a separate level of randomization—all randomization occurs
at the individual firm level within strata.
Unit of analysis: Matches unit of randomization (firm-level). All outcome
measures (filing, payment) are recorded at the firm level from administrative
records.
Phase 2 exclusions: Firms assigned in Phase 1 are explicitly excluded from
Phase 2 randomization to prevent double-treatment. This is implemented by
filtering the Phase 1 firm list (by NIT) from the Phase 2 eligible population
before randomization.
Quality checks: After randomization, balance checks verify that observable
characteristics (firm size, sector, taxpayer type, age, prior compliance history,
debt amounts) are statistically similar across the four treatment groups.