Intervention (Hidden)
During the treatment period, in the treatment group of our experiment (see Experimental Design section for details), our browser extension hides toxic content (posts, comments, replies) on Twitter, Facebook, and YouTube. There is no hiding in the control group.
The browser extension analyzes each post and comment that the user is exposed to on the three platforms. Based on the element’s text content, we compute a toxicity score between 0 and 1. To that end, we use the following process. First, we detect the language of the text. If the language is English, we rely on the ‘original’ model provided by Unitary’s detoxify library (https://www.unitary.ai/). If the language is "fr", "it", "ru", "pt", "es", or "tr", we use the ‘multilingual’ model by Unitary. In all other cases, we use Perspective API (https://perspectiveapi.com/), which additionally supports “ar”, “zh”, “cs”, “nl”, “de”, “hi”, “hi-Latn”, “id”, “ja”, “ko”, “pl”.
All content (in the feed and in comment sections below posts/videos) with the toxicity score exceeding the fixed threshold of 0.3 in the treatment group is hidden by the extension.