class: center, middle, inverse, title-slide .title[ #
The Effects of Partisan Geographical Segregation on Online Behavior on Twitter
] .author[ ### Megan Brown, Tiago Ventura, Jonathan Nagler, and Joshua A. Tucker
Center for Social Media and Politics, NYU
] .date[ ###
Siegel Fellows Workshop
05/06/2023
] --- layout: true <div class="my-footer"><span>Tiago Ventura (CSMaP)                                               Siegel Fellows Workshop</span></div> --- class:middle ## Motivation -- **Heightened levels of polarization are a striking feature of contemporary politics. Although the causes are a contested topic, its effects are largely documented, affecting, for example: **. -- - .midgrey[Citizens’ support for democratic norms (.red[Graham and Svolik, 2020])] -- - .midgrey[objective perceptions about the economy (.red[Enns et al., 2012])] -- - .midgrey[levels of hostility, anger, and adversarial animus towards outgroup .red[(Mason, 2018; Webster, 2020)]] -- - .midgrey[Health behavior and attitudes (.red[Baxter-King et al., 2022; Druckman et al., 2021])] -- --- class:middle ## Motivation **And even non-political outcomes:** -- - .midgrey[dating decisions and attractiveness (.red[Huber and Malhotra, 2017; Nicholson et al., 2016])] -- - .midgrey[interpersonal economic relations (.red[McConnell et al., 2018])] -- - .midgrey[labor market transactions (.red[Gift and Gift, 2015])] -- --- ### The Role of Social Media .center[ <img src="socialmedia.png" width="100%" /> ] --- class:middle ### .center[.blue[Social media increases sorting]] ### .center[<svg viewBox="0 0 512 512" style="height:1em;position:relative;display:inline-block;top:.1em;" xmlns="http://www.w3.org/2000/svg"> <path d="M256 8C119 8 8 119 8 256s111 248 248 248 248-111 248-248S393 8 256 8zm0 448c-110.5 0-200-89.5-200-200S145.5 56 256 56s200 89.5 200 200-89.5 200-200 200zm-32-316v116h-67c-10.7 0-16 12.9-8.5 20.5l99 99c4.7 4.7 12.3 4.7 17 0l99-99c7.6-7.6 2.2-20.5-8.5-20.5h-67V140c0-6.6-5.4-12-12-12h-40c-6.6 0-12 5.4-12 12z"></path></svg>] ### .center[.blue[Echo Chambers]] #### .center[<svg viewBox="0 0 512 512" style="height:1em;position:relative;display:inline-block;top:.1em;" xmlns="http://www.w3.org/2000/svg"> <path d="M256 8C119 8 8 119 8 256s111 248 248 248 248-111 248-248S393 8 256 8zm0 448c-110.5 0-200-89.5-200-200S145.5 56 256 56s200 89.5 200 200-89.5 200-200 200zm-32-316v116h-67c-10.7 0-16 12.9-8.5 20.5l99 99c4.7 4.7 12.3 4.7 17 0l99-99c7.6-7.6 2.2-20.5-8.5-20.5h-67V140c0-6.6-5.4-12-12-12h-40c-6.6 0-12 5.4-12 12z"></path></svg>] ### .center[.blue[Reduce cross-cutting exposure]] ### .center[<svg viewBox="0 0 512 512" style="height:1em;position:relative;display:inline-block;top:.1em;" xmlns="http://www.w3.org/2000/svg"> <path d="M256 8C119 8 8 119 8 256s111 248 248 248 248-111 248-248S393 8 256 8zm0 448c-110.5 0-200-89.5-200-200S145.5 56 256 56s200 89.5 200 200-89.5 200-200 200zm-32-316v116h-67c-10.7 0-16 12.9-8.5 20.5l99 99c4.7 4.7 12.3 4.7 17 0l99-99c7.6-7.6 2.2-20.5-8.5-20.5h-67V140c0-6.6-5.4-12-12-12h-40c-6.6 0-12 5.4-12 12z"></path></svg>] ### .center[.red[Polarization]] --- class: middle ### Are social media echo-chambers real? - Online Media Consumption is similar to offline consumption .red[(Gentzkow and Shapiro, 2011; Wojcieszak and Mutz, 2009; Bisbee and Larsson, 2017)] - Users' friendship networks are heterogeneous outside of politics .red[(Bakshy et al. 2012; Barbéra et al., 2 015)] - Users’ digital media diets are balanced, and strongly influenced by big reputable outlets .red[(Guess 2021; Cardenal et. al., 2019)] --- class: middle ## Puzzle -- <svg viewBox="0 0 512 512" style="height:1em;position:relative;display:inline-block;top:.1em;" xmlns="http://www.w3.org/2000/svg"> <path d="M504 256C504 119 393 8 256 8S8 119 8 256s111 248 248 248 248-111 248-248zm-448 0c0-110.5 89.5-200 200-200s200 89.5 200 200-89.5 200-200 200S56 366.5 56 256zm72 20v-40c0-6.6 5.4-12 12-12h116v-67c0-10.7 12.9-16 20.5-8.5l99 99c4.7 4.7 4.7 12.3 0 17l-99 99c-7.6 7.6-20.5 2.2-20.5-8.5v-67H140c-6.6 0-12-5.4-12-12z"></path></svg> **The existence of online echo-chambers have been largely overstate.** -- <svg viewBox="0 0 512 512" style="height:1em;position:relative;display:inline-block;top:.1em;" xmlns="http://www.w3.org/2000/svg"> <path d="M504 256C504 119 393 8 256 8S8 119 8 256s111 248 248 248 248-111 248-248zm-448 0c0-110.5 89.5-200 200-200s200 89.5 200 200-89.5 200-200 200S56 366.5 56 256zm72 20v-40c0-6.6 5.4-12 12-12h116v-67c0-10.7 12.9-16 20.5-8.5l99 99c4.7 4.7 4.7 12.3 0 17l-99 99c-7.6 7.6-20.5 2.2-20.5-8.5v-67H140c-6.6 0-12-5.4-12-12z"></path></svg> **Yet the consolidation of social media and the internet has been shown to indeed exacerbate political polarization** -- - .midgrey[Allcott et al., 2020: Deactivation Experiment] - .midgrey[Bail et al., 2018: Bot Experiment] - .midgrey[Settle, 2018: Increasing capacity to identify out partisans] - .midgrey[Lelkes et al., 2017: Effects of early adoption on internet] -- #.center[ <svg viewBox="0 0 384 512" style="height:1em;position:relative;display:inline-block;top:.1em;" xmlns="http://www.w3.org/2000/svg"> <path d="M202.021 0C122.202 0 70.503 32.703 29.914 91.026c-7.363 10.58-5.093 25.086 5.178 32.874l43.138 32.709c10.373 7.865 25.132 6.026 33.253-4.148 25.049-31.381 43.63-49.449 82.757-49.449 30.764 0 68.816 19.799 68.816 49.631 0 22.552-18.617 34.134-48.993 51.164-35.423 19.86-82.299 44.576-82.299 106.405V320c0 13.255 10.745 24 24 24h72.471c13.255 0 24-10.745 24-24v-5.773c0-42.86 125.268-44.645 125.268-160.627C377.504 66.256 286.902 0 202.021 0zM192 373.459c-38.196 0-69.271 31.075-69.271 69.271 0 38.195 31.075 69.27 69.271 69.27s69.271-31.075 69.271-69.271-31.075-69.27-69.271-69.27z"></path></svg>] -- --- class:middle ## Turning the echo chamber on its head -- **Offline Partisan Segregation**: - American voters are highly sorted offline with respect to partisanship .red[(Brown and Enos, 2021)] -- **Online Environments**: - Rather than isolating users in homogeneous online echo chambers, social media actually exposes users content that they would hardly be exposed outside of social media. - *it is not isolation from opposing views that drives polarization but precisely the fact that digital media bring us to interact outside of local bubbles (.red[Tornberg 2022, pp2])* --- class:middle ### Social media studies have focused so much on levels of online partisan sorting, but in the process, we lost sight of our local bubbles and how they interact with how users behave online. --- ## Research Questions <br> <br> <br> <br> <br> .content-box-gray[1) What is the relationship between offline partisan sorting and online partisan sorting?] --- ## Research Questions <br> <br> <br> <br> <br> .content-box-gray[2) How does offline political segregation influence online behavior?] --- class:middle ### Our approach We link a novel dataset of the offline networks and online networks of ~1m Twitter users. We measure or have plans to measure: -- - Levels of offline partisan segregation for all users -- - Levels of online partisan segregation on online networks from Twitter -- - Provide meaningful comparisons across different geographical units -- - Estimate the correlation between online and offline segregations -- - .red[For the future:] Estimate the effects of online segregation on online behavior: - outgroup animosity - toxicity - sharing of misinformaiton -- --- class:middle, inverse, center ## Materials and Methods --- ## Data Infrastructure .center[ <img src="datainfra.png" width="100%" /> ] --- ## Simple Matching Procedure -- **Step 1: Parse Voter File Data** - For every voter in each US City, we collect unique: - First name - Last name -- **Step 2: Find Candidates on Twitter** - For every month on Decahose data: - Find matches with the three parsed data - Keep all matches -- **Step 3: Discard Repeated Matches** - Multiple Jonh Does living in New York -- **Step 4: Keep unique matches** -- --- ## Offline Information: Voter Files .pull-left-narrow[ .center[ <img src="voterfile.png" width="100%" /> ] ] .pull-right-wide[ **Data Collection for every matched voters:** - Voter file demographics (gender, race, partisanship, religion) - Residential location (9 digits lat and long) - Closest 1.000 neighboors + their partisanship. ] --- ## Online Information: Twitter Data .pull-left-narrow[ <br> .center[ <img src="twitter.png" width="100%" /> ] ] .pull-right-wide[ **Data Collection for every matched voters:** - Collect their full network (people they follow and follow them) ~ 57M - Collect their most recent timelines (3200 tweets) + 900k * 3,2k - .red[TODO:] Parse the timelines. ] --- ## Measuring Ideology using Online Data .pull-left[ **From voter file:** - Precise measure of matched users **What about their friends:** - Ideology estimation method employed by .red[Barbera, 2015] - Homophily assumption: Following relationships between users and political elites to estimate ideology. ] .pull-right[ .center[ <img src="twideo.png" width="100%" /> ] ] --- ## Measuring Partisan Segregation .center[ <img src="segmeas.png" width="75%" /> .footnote[[Source: Brown and Enos, 2021](https://www.nature.com/articles/s41562-021-01066-z)] ] --- class:middle ## Offline Partisan Segregation .pull-left[ <br> <br> <br> .center[ `\(\text{Offline Exposure} = \frac{\sum_{k=1}^{1000}\frac{1}{d+1}\mathbb(p_k=q_i)}{\sum_{k=1}^{1000}\frac{1}{d +1}}\)` ] ] .pull-right[ Where: - `\(i\)` is a matched voters - `\(k\)` is a given neighbor - `\(d\)` is the distance in meters between the neighbor and the individual - `\(p_k\)` is the partisanship of the neighbor - `\(q_i\)` is the opposite party of the individual whose exposure is being measured. ] --- class:middle ## Online Partisan Segregation .pull-left[ <br><br><br> .center[ `\(\text{Online Exposure} =\frac{\sum_{k=1}^{n}\log(a+1)\mathbb(p_k=q_i)}{\sum_{k=1}^{n}\log(a+1)}\)` ] ] .pull-right[ **Where:** - `\(i\)` is a matched voter - `\(k\)` is a given neighbor - `\(a\)` is the number of interactions between the friend and a user `\(i\)` - `\(p_k\)` is the partisanship of the neighbor - `\(q_i\)` is the opposite party of the individual whose exposure is being measured. ] --- class:inverse, middle, center ## Results --- ## Demographics of Twitter Panel <br><br><br> .center[ <img src="tabdemo.png" width="100%" /> ] --- ## Online vs Offline Exposure .center[ <img src="figs/radians_distr_exposure.png" width="100%" /> ] --- ## Online vs Offline Exposure by Quantiles .center[ <img src="figs/offline_exposure_quantiles.png" width="100%" /> ] --- ## Comparing Offline and Online Exposure Across Subgroups .center[ <img src="tablesubgroups.png" width="100%" /> ] --- ## Correlation between online and offline exposure .center[ <img src="figs/radians_only_by_party_online_offline.png" width="80%" /> ] --- ## Modeling Online Echo Chambers .center[ <img src="tabmodels.png" width="65%" /> ] --- class:middle ## Next Steps -- - Estimating Partisanship for all matched voters -- - Processing interactions data -- - Estimate the effects of offline on online behavior: - .red[what would you be interested in seeing here?] -- - .red[Any other research ideas to pursue with this dataset?] -- --- class:inverse, middle, center # Thank you!