Primary Outcomes (end points)
1. User engagement metrics: positive interactions (likes, replies, reposts, quote posts, see more), sessions, and session depth. A session is defined as a request to the Graze Trending News feed. Session depth is defined as the number of items viewed within a request. Positive interactions and sessions are the primary engagement outcomes, while session depth is a secondary engagement outcome. The primary engagement outcomes are measured on a user-day panel and analyzed intent-to-treat. Included users are those who used the Graze Trending News feed at least once in the month before the experimental period, and the user-day panel begins on the calendar day of that user's first observed Graze Trending News feed impression during the experiment. For positive interactions, our primary measure is "likes", with a secondary measure using the full set of interaction types, weighted in the same way as the ranking rule in the experiment.
2. User-facing quality allocation. The main quality outcome is the average external quality of the visibility allocated by the ranking rule, using position-weighted exposure rather than raw impressions. This will primarily be defined as the average (unweighted) quality of posts in the Top 3 positions sent to users. For these Top 3 exposure measures, an opportunity is a Graze Trending News feed request while the post is eligible. The inference unit is each post entry cohort: we will compute the quality-allocation metric for each design x cohort, then compare designs across cohorts (properly accounting for how world averaging affects variance of the estimate). We average across worlds within each design.
3. Within-quality variance in post visibility. This measures how differently similarly high-quality posts are treated by the ranking rule. We will bin posts into quality quantiles and measure the variance in exposure among posts in each quantile. The primary exposure measure is the fraction of Graze Trending News feed requests while eligible in which a post appears in the Top 3, with a secondary measure on the number of impressions. Same as user-facing quality allocation, the inference unit is the post-entry cohort, and we will average across the 3 worlds within a design before comparing designs across cohorts.
4. Outlet concentration. The main concentration outcome is the Gini coefficient of outlet exposure shares among outlets with at least one eligible post in a given design x world x post-entry cohort. Outlet exposure is defined using the same Top 3 opportunity-based exposure measure as above, averaged across all posts from the outlet in that cohort. The inferential unit is the post-entry cohort: we compute the Gini coefficient separately for each design x cohort, and then compare designs across cohorts. We average across worlds within each design.
5. Cross-world unpredictability. This measures how much item-level market shares diverge across the three parallel worlds within a design. This is defined as the average pairwise absolute difference in item-level market shares across the three worlds within a design. Here, the inference unit is the post-entry cohort; for each design x cohort, we will compute the average pairwise divergence across the 3 worlds for each post, then average across posts within the cohort, and then compare designs across cohorts.