International Migration and Identity Formation: The Perception of the Self and Others

Last registered on November 17, 2025

Pre-Trial

Trial Information

General Information

Title
International Migration and Identity Formation: The Perception of the Self and Others
RCT ID
AEARCTR-0016846
Initial registration date
November 13, 2025

Initial registration date is when the trial was registered.

It corresponds to when the registration was submitted to the Registry to be reviewed for publication.

First published
November 17, 2025, 2:23 PM EST

First published corresponds to when the trial was first made public on the Registry after being reviewed.

Locations

Region

Primary Investigator

Affiliation
University of Bristol

Other Primary Investigator(s)

PI Affiliation
Institute for International Economic Studies
PI Affiliation
Max Planck Institute for Research on Collective Goods
PI Affiliation
Max Planck Institute for Research on Collective Goods
PI Affiliation
Groningen University

Additional Trial Information

Status
Ongoing
Start date
2022-03-01
End date
2030-12-31
Secondary IDs
Prior work
This trial does not extend or rely on any prior RCTs.
Abstract
This project examines how international migration influences identity formation, particularly in shaping the perception of the self and others. We partner with an NGO that supports secondary school graduates from Uganda with limited financial means in pursuing a bachelor’s degree in Germany. Admission to the program offers students a transformative opportunity to acquire higher education and increase their earnings. At the same time, students are exposed to a vastly different environment, characterized by distinct economic systems, political institutions, cultural norms, and racial compositions. We leverage the randomized admission among shortlisted applicants and conduct a randomized controlled trial. By tracking the outcomes of applicants and their families and friends in Uganda, we aim to uncover the direct effects of international migration on the perception of the self and others, as well as the spillover effects on those who remain in the country of origin. Our primary outcomes include universalism, gender attitudes, and racial identity.
External Link(s)

Registration Citation

Citation
Barsbai, Toman et al. 2025. "International Migration and Identity Formation: The Perception of the Self and Others." AEA RCT Registry. November 17. https://doi.org/10.1257/rct.16846-1.0
Experimental Details

Interventions

Intervention(s)
We partner with the NGO Malengo. Malengo supports qualified secondary school graduates from Uganda with limited financial means in pursuing a bachelor’s degree in Germany. Since Malengo receives more qualified applications than it can support, it randomizes admission among shortlisted applicants. Malengo’s selection criteria include academic achievement, limited financial means, and motivation to succeed in the program. Once admitted, students receive assistance in applying to bachelor’s programs at German universities that are taught in English. Malengo also helps with visa, travel, and housing arrangements. During their first year, students receive financial support that covers their living expenses (German universities charge no tuition, only minor administrative fees). After their first year, students fund their living expenses through part-time work. Malengo’s mentoring program helps students settle in and find jobs in Germany.
Intervention Start Date
2022-10-01
Intervention End Date
2029-12-31

Primary Outcomes

Primary Outcomes (end points)
We aim to identify the direct effects on applicants and the spillover effects on non-applicants who remain in Uganda. Depending on the outcome, non-applicants include applicants’ parents, siblings, and friends. We pool different groups of non-applicants and consider them in a joint sample when estimating spillover effects. Here is an overview of the primary outcomes for applicants and non-applicants:

1) Universalism: Applicants and non-applicants (parents, siblings, and friends)
2) Gender attitudes: Applicants and non-applicants (parents, siblings, and friends)
3) Black identity: Applicants and non-applicants (siblings and friends)
Primary Outcomes (explanation)
Please refer to the document "Primary and Secondary Outcomes", in which we describe how we define the outcomes.

Secondary Outcomes

Secondary Outcomes (end points)
Here is an overview of the secondary outcomes for applicants and non-applicants:

1) Studying at university and residing abroad: Applicants
2) Social distance: Applicants and non-applicants (parents, siblings, and friends)
3) Social networks: Applicants and non-applicants (siblings and friends)
4) Local attachment to Uganda: Applicants and non-applicants (siblings and friends)
5) Self-description in the Who-Am-I task: Applicants and non-applicants (siblings and friends)
6) Personality traits: Applicants and non-applicants (siblings and friends)
7) Religiosity: Applicants and non-applicants (parents, siblings, and friends)
8) Zero-sum thinking: Applicants and non-applicants (parents, siblings, and friends)
9) Agency: Applicants and non-applicants (parents, siblings, and friends)
10) Tolerance: Applicants and non-applicants (parents, siblings, and friends)
11) Pro-democratic attitudes: Applicants and non-applicants (parents, siblings, and friends)
12) Income: Applicants and non-applicants (parents)
13) Subjective wellbeing and mental health: Applicants and non-applicants (parents, siblings, and friends)
14) Worries: Applicants and non-applicants (parents, siblings, and friends)
15) Discrimination, sexual harassment, and safety: Applicants
Secondary Outcomes (explanation)
Please refer to the document "Primary and Secondary Outcomes", in which we describe how we define the outcomes.

Experimental Design

Experimental Design
We adopt the experimental design that we pre-registered for a separate study based on the same RCT (AEARCTR-0012924). That pre-registration evaluates the short-term effects of the intervention on a different set of outcome domains: income, subjective wellbeing, cognitive skills, and aspirations. This pre-registration instead examines how international migration shapes perceptions of the self and others, with an emphasis on universalism, gender attitudes, and various aspects of identity.

Since Malengo receives more qualified applications than it can support, it randomizes admission among all shortlisted applicants. We can hence conduct a randomized controlled trial. We use stratified randomization to improve the precision of the estimates. We form strata based on the gender of the applicant, whether they come from the Greater Kampala region or not, and whether they attended the arts or science stream in secondary school. Within each stratum, we form octuplets based on applicants’ standardized test scores in the final secondary school exams. Within each octuplet, we assign up to half of the applicants to the treatment group and the remaining applicants to the control group. Our ability to oversample the control group depends on the number of qualified applicants to the Malengo program, as well as Malengo’s operational budget and recruitment schedule. The intervention is the same for all treated applicants. There is one treatment and one control group.
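The assignment procedure described above can be sketched as follows. This is an illustrative sketch, not Malengo's actual randomization code: the field names (`id`, `female`, `kampala`, `science`, `score`), the seed, and the rule of rounding down in blocks with an odd number of applicants are assumptions made for the example.

```python
import random
from collections import defaultdict


def assign_treatment(applicants, block_size=8, seed=0):
    """Return {applicant id: 1 (treatment) or 0 (control)}.

    Each applicant is a dict with the hypothetical keys
    id, female, kampala, science, and score.
    """
    rng = random.Random(seed)

    # Strata: gender x Greater Kampala x science stream.
    strata = defaultdict(list)
    for a in applicants:
        strata[(a["female"], a["kampala"], a["science"])].append(a)

    assignment = {}
    for key in sorted(strata):
        # Rank by test score so each block groups similar applicants.
        members = sorted(strata[key], key=lambda a: a["score"], reverse=True)
        for start in range(0, len(members), block_size):
            block = members[start:start + block_size]
            # "Up to half": assumed here to round down in odd-sized blocks.
            n_treated = len(block) // 2
            treated = set(rng.sample(range(len(block)), n_treated))
            for pos, a in enumerate(block):
                assignment[a["id"]] = 1 if pos in treated else 0
    return assignment


# Toy data: 64 applicants spread evenly over the 8 strata.
applicants = [
    {"id": i, "female": i % 2, "kampala": (i // 2) % 2,
     "science": (i // 4) % 2, "score": (i * 37) % 101}
    for i in range(64)
]
assignment = assign_treatment(applicants)
```

Assigning fewer than half of each block to treatment, which corresponds to oversampling the control group, would only require changing `n_treated`.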

We follow Malengo’s recruitment schedule and interview shortlisted applicants from the 2021-2026 cohorts of Malengo students. We also interview applicants’ parents (or alternative caregivers if they do not live with their parents), siblings, and friends in Uganda to identify spillover effects.

The analysis is based on a survey that tracks respondents over space and time. We conduct baseline interviews with all respondents. These interviews take place before Malengo informs applicants of the outcome of their application, which avoids anticipation effects. We plan to conduct follow-up interviews with applicants every year and with other types of respondents at least once within the first three years after applicants’ planned arrival in Germany (toward the end of this period). To keep the length of the interview manageable and maximize the duration of exposure to Germany, we will collect some outcomes only in the last year (i.e., three years after applicants’ planned arrival in Germany).

We will use the following equation to estimate the impact of the intervention:

Y_it = a + b Malengo_i + X’_i c + u_it

where Y_it is the outcome variable of interest for applicant i in year t after the applicant’s planned arrival in Germany. Malengo_i is the treatment dummy indicating whether the applicant has been admitted to the Malengo program. Based on experience with existing cohorts of Malengo students, we expect compliance to be high. X_i is a vector of baseline control variables. It includes the baseline value of the respective outcome variable wherever possible. It also includes randomization strata, Malengo cohort, survey wave, year of observation, and type of respondent fixed effects. Our focus will be on year 3 after the applicant’s planned arrival in Germany (t=3).

We will use the post-double-selection lasso estimation proposed by Belloni et al. (2014) to select additional control variables. We will consider the following baseline variables as inputs for the procedure (including parents’ values where appropriate): age, gender, tribe, educational attainment, enrollment status, marital status, household size, number of children 0-5, number of children 6-18, UACE/UCE scores, physical health index, self-efficacy index, remittances received at baseline, remittances sent at baseline, business ownership, value of real estate owned, house ownership, number of bedrooms, number of bathrooms, house quality index, frequency of praying, importance of family/friends/leisure time/politics/work/religion/tradition in life, number of close friends, role of luck vs. effort for economic outcomes, desired level of redistribution of income, economic preferences, Big-5 personality traits, curiosity index, social desirability index, worries index, lived abroad for at least three months, having been overseas, number of people known abroad, number of Malengo scholars known, Facebook/Twitter/Instagram/TikTok account ownership, district, rural/urban, and baseline values of all primary and secondary outcomes (including those specified in AEARCTR-0012924). We will use dummies to indicate missing baseline data and replace missing values with zero, including both variables in the set of potential control variables for the post-double-selection lasso estimation.
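The treatment of missing baseline data can be illustrated with a minimal sketch. The variable names and the `_missing` suffix are hypothetical, and the lasso selection step itself is not shown.

```python
def add_missing_indicators(rows, variables):
    """Zero-fill missing baseline values and add a `<var>_missing` dummy.

    `rows` is a list of dicts; a value of None marks missing data.
    Both the filled variable and its dummy would then enter the set of
    candidate controls.
    """
    out = []
    for row in rows:
        new_row = dict(row)
        for var in variables:
            missing = row.get(var) is None
            new_row[var] = 0.0 if missing else row[var]
            new_row[f"{var}_missing"] = 1 if missing else 0
        out.append(new_row)
    return out


# Toy baseline records (illustrative values only).
rows = [
    {"age": 19, "household_size": 6},
    {"age": None, "household_size": 4},
    {"age": 21, "household_size": None},
]
controls = add_missing_indicators(rows, ["age", "household_size"])
```

Including the dummy alongside the zero-filled variable keeps every observation in the estimation sample while letting the regression absorb any level difference for respondents with missing data.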

We will make the following adjustments to variables if needed. First, some variables might have minimal variation and thus reduce the power to detect an impact. We will therefore exclude all variables for which 95 percent or more of the observations in the relevant sample share the same value. Second, we will winsorize continuous variables that are heavily skewed (e.g., incomes) at the 99th percentile and apply the inverse hyperbolic sine transformation to mitigate the influence of outliers. Third, we will consider replacing missing outcome data (e.g., due to attrition) with observed data from a previous follow-up interview or a proxy interview with a knowledgeable family member or friend.
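The first two adjustments can be sketched as below. The sketch assumes that winsorizing applies to the upper tail only and uses a nearest-rank percentile; neither detail is specified in this registration, and the function names are illustrative.

```python
import math
from collections import Counter


def has_minimal_variation(values, threshold=0.95):
    """True if a single value accounts for at least `threshold` of observations."""
    most_common_count = Counter(values).most_common(1)[0][1]
    return most_common_count / len(values) >= threshold


def winsorize_top(values, pct=99):
    """Cap values at the pct-th percentile (nearest-rank definition, upper tail only)."""
    ordered = sorted(values)
    rank = max(1, math.ceil(pct * len(ordered) / 100))
    cap = ordered[rank - 1]
    return [min(v, cap) for v in values]


def ihs(values):
    """Inverse hyperbolic sine, asinh(x) = ln(x + sqrt(x^2 + 1)); defined at zero."""
    return [math.asinh(v) for v in values]


# Toy income data with one extreme value.
incomes = [float(i) for i in range(99)] + [1_000_000.0]
transformed = ihs(winsorize_top(incomes))
```

Unlike the logarithm, the inverse hyperbolic sine is defined for zero incomes, which is why it is often paired with winsorizing for skewed variables that contain zeros.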

We will use the same specification to analyze spillover effects and estimate it for the pooled sample of the different groups of non-applicants (as specified for the various primary and secondary outcomes above). We will also report results for estimating the treatment effects separately for the different types of non-applicants (but our focus remains on the pooled sample). We will use OLS to estimate the equation above and cluster standard errors at the applicant level. For outcomes with zeros and positive values, such as income, we will also consider using Poisson regressions to express the treatment effect in levels as a percentage (Chen and Roth, Logs with Zeros? Some Problems and Solutions, Quarterly Journal of Economics, 2024).
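As an illustration of OLS with standard errors clustered at the applicant level, the sketch below implements the standard cluster-robust sandwich estimator with NumPy on simulated data. The small-sample correction, the simulated effect sizes, and all variable names are assumptions made for the example; fixed effects and lasso-selected controls are omitted.

```python
import numpy as np


def ols_cluster(y, X, clusters):
    """OLS coefficients and cluster-robust standard errors."""
    y = np.asarray(y, dtype=float)
    X = np.asarray(X, dtype=float)
    clusters = np.asarray(clusters)
    n, k = X.shape

    XtX_inv = np.linalg.inv(X.T @ X)
    beta = XtX_inv @ X.T @ y
    resid = y - X @ beta

    # "Meat" of the sandwich: sum of outer products of cluster-level scores.
    groups = np.unique(clusters)
    meat = np.zeros((k, k))
    for g in groups:
        idx = clusters == g
        score = X[idx].T @ resid[idx]
        meat += np.outer(score, score)

    # Common finite-sample correction (an assumption of this sketch).
    G = len(groups)
    correction = G / (G - 1) * (n - 1) / (n - k)
    V = correction * XtX_inv @ meat @ XtX_inv
    return beta, np.sqrt(np.diag(V))


# Simulated spillover sample: three non-applicants per applicant who share
# an applicant-level shock, which is what motivates clustering.
rng = np.random.default_rng(0)
n_applicants, per_applicant = 200, 3
applicant = np.repeat(np.arange(n_applicants), per_applicant)
treat = np.repeat(rng.integers(0, 2, n_applicants), per_applicant)
baseline = rng.normal(size=applicant.size)
shock = np.repeat(rng.normal(size=n_applicants), per_applicant)
y = 0.5 * treat + 0.3 * baseline + shock + rng.normal(size=applicant.size)

X = np.column_stack([np.ones_like(y), treat, baseline])
beta, se = ols_cluster(y, X, applicant)
```

Here `beta[1]` plays the role of the coefficient b on Malengo_i in the equation above, and `se[1]` is its applicant-clustered standard error.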

We will test for effect heterogeneity along the following dimensions: (i) gender, (ii) ability (based on baseline grades), (iii) socio-economic status (based on per-capita consumption expenditures of parents’ households). We will do so by interacting the treatment dummy with a variable that captures the respective dimension of heterogeneity. We may also consider exploring effect heterogeneity using modern machine-learning methods.
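One way to write the interacted specification, using the notation of the main estimating equation, is given below. The symbols H_i, d, and e are introduced here for illustration only and are not part of the registered equation.

```latex
Y_{it} = a + b\,\mathrm{Malengo}_i + d\,(\mathrm{Malengo}_i \times H_i) + e\,H_i + X_i' c + u_{it}
```

In this form, H_i captures the respective dimension of heterogeneity (gender, ability, or socio-economic status). For a binary H_i, b is the treatment effect for the group with H_i = 0, b + d is the effect for the group with H_i = 1, and a test of d = 0 is a test for effect heterogeneity. Where H_i is already absorbed by the randomization strata fixed effects, as with gender, the separate term e H_i drops out.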

We will rely on outcome indices, as defined by Anderson (2008), to reduce the number of hypotheses. These indices are inverse-covariance-weighted averages of the z-scores of individual outcomes, where individual outcomes are recoded so that higher values correspond to “more favorable” outcomes. In addition, we will adjust for multiple testing across the primary outcomes within types of respondents, controlling for the false discovery rate. We will not adjust for multiple testing across secondary outcomes, individual outcomes within domains, types of respondents, or dimensions of heterogeneity, as we put less emphasis on these results.
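The index construction and the multiple-testing adjustment can be sketched as follows. Several choices here are assumptions rather than part of this registration: z-scores are formed with control-group means and standard deviations, the covariance matrix is estimated on the full sample, the final index is re-standardized on the control group, and the Benjamini-Hochberg step-up procedure stands in for the unspecified false discovery rate method.

```python
import numpy as np


def anderson_index(outcomes, control_mask):
    """Inverse-covariance-weighted index (rows: respondents, columns: outcomes).

    Columns are assumed to be signed so that higher values are more
    favorable, and at least two outcomes are required.
    """
    outcomes = np.asarray(outcomes, dtype=float)
    control_mask = np.asarray(control_mask, dtype=bool)

    # Standardize each outcome against the control group.
    mu = outcomes[control_mask].mean(axis=0)
    sd = outcomes[control_mask].std(axis=0, ddof=1)
    z = (outcomes - mu) / sd

    # Weights are the row sums of the inverse covariance matrix, so highly
    # correlated outcomes receive less weight.
    weights = np.linalg.inv(np.cov(z, rowvar=False)).sum(axis=1)
    index = z @ weights / weights.sum()

    # Re-standardize so the control group has mean 0 and SD 1.
    ctrl = index[control_mask]
    return (index - ctrl.mean()) / ctrl.std(ddof=1)


def bh_qvalues(pvalues):
    """Benjamini-Hochberg adjusted p-values (q-values)."""
    p = np.asarray(pvalues, dtype=float)
    m = p.size
    order = np.argsort(p)
    scaled = p[order] * m / np.arange(1, m + 1)
    # Enforce monotonicity, working down from the largest p-value.
    q_sorted = np.minimum.accumulate(scaled[::-1])[::-1]
    q = np.empty(m)
    q[order] = np.minimum(q_sorted, 1.0)
    return q


# Toy data: 100 respondents, 3 component outcomes, first 50 are controls.
rng = np.random.default_rng(1)
components = rng.normal(size=(100, 3))
control = np.arange(100) < 50
index = anderson_index(components, control)

# Hypothetical p-values for the three primary outcomes.
q = bh_qvalues([0.01, 0.04, 0.03])
```

With the hypothetical p-values above, the adjusted values are 0.03, 0.04, and 0.04, so all three would remain significant at a 5 percent false discovery rate in this toy case.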

We will consider replacing any methods mentioned above with superior methods if they become available by the time of analysis.

Note that we follow the guidance provided by Duflo et al. (2020) on pre-analysis plans and only use these fields in the AEA RCT Registry rather than a separate document.
Experimental Design Details
Not available
Randomization Method
Stratified randomization by computer
Randomization Unit
Individual
Was the treatment clustered?
No

Experiment Characteristics

Sample size: planned number of clusters
The treatment is not clustered.
Sample size: planned number of observations
Our target number of observations is about 850 applicants (and the same number of parents, siblings, and friends).
Sample size (or number of clusters) by treatment arms
At least half of the applicants in the sample (and the corresponding numbers of parents, siblings, and friends) will be in the control group. Our ability to oversample the control group depends on the number of qualified applicants to the Malengo program, as well as Malengo’s operational budget and recruitment schedule.
Minimum detectable effect size for main outcomes (accounting for sample design and clustering)
Supporting Documents and Materials

Documents

Document Name
Primary and Secondary Outcomes
Document Type
other
Document Description
File
Primary and Secondary Outcomes

MD5: 23c4250a2818a01be31aed7694b34d80

SHA1: e2e3082180379a506acead4ad5d07406f9e1d386

Uploaded At: November 13, 2025

IRB

Institutional Review Boards (IRBs)

IRB Name
Mildmay Uganda Research and Ethics Committee (MUREC)
IRB Approval Date
2022-01-31
IRB Approval Number
0210-2021