key: cord-0856776-mbgvwmf6 authors: Chong, Miyoung title: Network typology, information sources, and messages of the infodemic twitter network under COVID‐19 date: 2020-10-22 journal: Proc Assoc Inf Sci Technol DOI: 10.1002/pra2.363 sha: b60634bcdca07582ea92a89136155f79b08d9c42 doc_id: 856776 cord_uid: mbgvwmf6

During the COVID‐19 crisis, fake news, conspiracy theories, and backlash against specific groups emerged and were largely diffused via social media. This phenomenon has been described as an “infodemic,” and this study examined the characteristics of the infodemic on Twitter. The typological attributes of the infodemic Twitter network showed the features of “community clusters.” The frequently shared domains and URLs demonstrated coherent characteristics within the network. Top domains and URLs were trustworthy information sources, popular blogs, and public health research institutions. Interestingly, the most shared conversational content of the network concerned a COVID‐19-related incident that occurred at a church in Korea and was based on misinformation and false belief.

Prior research has argued that social media has been a channel for manipulating public opinion. In the context of the COVID-19 outbreak, Twitter networks have become a highly active venue by facilitating "one-to-many" and "many-to-many" communications. However, the technology editor of The Guardian claimed that Twitter has become a focal point for misinformation and disinformation regarding the COVID-19 crisis (Hern, 2020, March 4, para. 1). The World Health Organization officially declared the COVID-19 epidemic a pandemic on March 11, 2020. As confirmed cases and death tolls increased globally, fake news, conspiracy theories, and backlash against specific groups emerged and were mainly diffused via social media, a phenomenon that has been described as an infodemic (Hern & Sabbagh, 2020, March 10).
This study examined the characteristics of the infodemic Twitter network in terms of network typology, information sources, and messages. The following research questions were examined:
• What are the attributes of the infodemic Twitter network?
• What are the major information sources of the infodemic Twitter network?
• What are the most widely shared messages of the infodemic Twitter network?

The tweet dataset was retrieved on Tuesday, March 17, 2020 through the Twitter Application Programming Interface (API) using the import function of NodeXL (Smith, 2015). A total of 9,958 tweets (vertices) were collected, and a total of 14,211 relationships were analyzed for this study. All the collected tweets contain the term "infodemic." Each tweet creates an edge for each "replies-to" relationship, an edge for each "mentions" relationship, and a self-loop edge when the tweet is neither a "replies-to" nor a "mentions."

To answer the proposed research questions, this study examined network attributes, information sources, and messages in the infodemic Twitter network. This study conducted social network analysis (SNA) using NodeXL (Hansen, Shneiderman, & Smith, 2010). After removing 1,598 duplicated edges, a total of 12,613 unique relationships were investigated for the study. The Clauset-Newman-Moore algorithm was applied to group the vertices into clusters, and the infodemic Twitter network was visualized by applying the Harel-Koren Fast Multiscale layout algorithm to the data. The most frequently included URLs, domains, hashtags, words, and word pairs in the network were computed. The top influencers were assessed by applying betweenness centrality, a measure of how often a user is placed on the shortest path between other users and how the user connects groups by filling gaps in the network (Hansen et al., 2011).

The typological attributes of the infodemic Twitter network showed the features of "community clusters" (Smith, Rainie, Shneiderman, & Himelboim, 2014, p. 3).
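The edge deduplication and betweenness-centrality steps described in the methods can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not NodeXL's implementation: the function names `dedupe_edges` and `betweenness` are hypothetical, the sample edge list is invented, edges are treated as undirected and unweighted, and scores are left unnormalized. The centrality routine follows Brandes' standard algorithm.

```python
from collections import defaultdict, deque


def dedupe_edges(edges):
    """Drop repeated (source, target) pairs, keeping the first occurrence."""
    seen = set()
    unique = []
    for edge in edges:
        if edge not in seen:
            seen.add(edge)
            unique.append(edge)
    return unique


def betweenness(edges):
    """Unnormalized betweenness centrality (Brandes' algorithm).

    Assumption: the graph is undirected and unweighted; self-loops only
    register the vertex and never lie on a shortest path.
    """
    adj = defaultdict(set)
    for u, v in edges:
        adj[u]  # make sure the vertex exists even if it only has a self-loop
        if u != v:
            adj[u].add(v)
            adj[v].add(u)
    adj = dict(adj)

    score = {node: 0.0 for node in adj}
    for source in adj:
        # Breadth-first search that counts shortest paths from `source`.
        order = []
        preds = {node: [] for node in adj}
        sigma = dict.fromkeys(adj, 0)
        sigma[source] = 1
        dist = {source: 0}
        queue = deque([source])
        while queue:
            v = queue.popleft()
            order.append(v)
            for w in adj[v]:
                if w not in dist:
                    dist[w] = dist[v] + 1
                    queue.append(w)
                if dist[w] == dist[v] + 1:
                    sigma[w] += sigma[v]
                    preds[w].append(v)
        # Back-propagate dependencies from the farthest vertices inward.
        delta = dict.fromkeys(adj, 0.0)
        for w in reversed(order):
            for v in preds[w]:
                delta[v] += sigma[v] / sigma[w] * (1 + delta[w])
            if w != source:
                score[w] += delta[w]
    # Each unordered pair was counted once from each endpoint.
    return {node: value / 2 for node, value in score.items()}


# Invented toy edge list: one duplicated edge and one self-loop.
raw_edges = [("a", "hub"), ("a", "hub"), ("b", "hub"),
             ("hub", "c"), ("c", "d"), ("d", "d")]
unique_edges = dedupe_edges(raw_edges)
scores = betweenness(unique_edges)
top_user = max(scores, key=scores.get)
```

In this toy graph the vertex `hub` bridges the most pairs of other vertices, so it receives the highest score, which mirrors how a top influencer connects otherwise separate groups. The edge counts reported in the text are also internally consistent: 14,211 − 1,598 = 12,613.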
A few to several medium-sized groups and many smaller groups were identified, with no dominant centralized cluster (Figure 1). This could be evidence that the infodemic has become a global topic that draws varying degrees of interest across locations and populations. Table 1 lists the top influencers in the network. Top influencers included @who (the World Health Organization), @techreview (MIT Technology Review), and @carolecadwalla (a writer for The Guardian and The Observer). Interestingly, @kyunghyang, a Korean daily newspaper, appeared as one of the top influencers in the network.

Table 2 displays the top domains and the most widely shared URLs included in the tweets of the network. The frequently shared domains and URLs demonstrated characteristics consistent with the network described above. For example, top domains and URLs were trustworthy information sources, such as news media (cnn.com, nytimes.com, and scmp.com), reliable blog media (medium.com, technologyreview.com, and infodemic.blog), and research-related institutions (orfonline.org and elsevierhealth.com).

Table 3 presents the top words and top word pairs that were frequently shared in the network. The most prominent topic in Table 3 is a news story about an event that occurred in Korea. The minister of a church located in Kyeonggi province in South Korea sprayed saltwater inside church members' mouths to prevent novel coronavirus infection, acting on false belief and misinformation. However, because the same spray bottle was used for approximately 100 members, 46 of them were infected with the novel coronavirus. The top Japanese words and word pairs in Table 3 largely represented this incident, and the top words and word pairs in Korean also reflected this case. Interestingly, the most shared conversational content of the network concerned a COVID-19-related incident that occurred in a Korean church and was based on misinformation and false belief.
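The kind of counting behind the top domains (Table 2) and top word pairs (Table 3) can be illustrated with the Python standard library. This is a minimal sketch under assumptions, not the procedure NodeXL applies: the function names `top_domains` and `top_word_pairs` are hypothetical, the sample URL paths and tweet texts are invented, words are split on whitespace only, and a leading "www." is stripped so that variants of one host are counted together.

```python
from collections import Counter
from urllib.parse import urlparse


def top_domains(urls, n=10):
    """Count how often each host name appears among the shared URLs."""
    counts = Counter()
    for url in urls:
        host = urlparse(url).netloc.lower()
        if host.startswith("www."):
            host = host[4:]  # treat www.cnn.com and cnn.com as one domain
        if host:
            counts[host] += 1
    return counts.most_common(n)


def top_word_pairs(texts, n=10):
    """Count adjacent word pairs (bigrams) across all tweet texts."""
    counts = Counter()
    for text in texts:
        words = text.lower().split()
        counts.update(zip(words, words[1:]))
    return counts.most_common(n)


# Invented sample data; only the domain names come from the text above.
sample_urls = [
    "https://www.cnn.com/example-a",
    "https://cnn.com/example-b",
    "https://medium.com/example-c",
]
sample_texts = [
    "infodemic spreads fast",
    "the infodemic spreads",
]

domain_ranking = top_domains(sample_urls, n=1)
pair_ranking = top_word_pairs(sample_texts, n=1)
```

Whitespace splitting is a simplification that suits space-delimited languages; Japanese text, which appears among the top words in Table 3, would need a proper tokenizer before pairs could be counted this way.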
Furthermore, this study examined the sentiment of each network to discover users' emotional messages through the levels of positivity and negativity in the tweets of the infodemic network (Table 4). The characteristics of the information sources, such as the top domains and URLs, and of the implied messages, including the top words and sentiments, in the infodemic Twitter network supported the claim that Twitter is a channel for the infodemic. Examining the nature and types of information distributed and shared on Twitter regarding the COVID-19 pandemic is important because any disinformation or misinformation could exacerbate this public health crisis.

Social bots distort the 2016 US Presidential election online discussion
Analyzing social media networks with NodeXL: Insights from a connected world
Fake coronavirus tweets spread as other sites take harder stance
Catalyzing social media scholarship with open tools and data
Mapping twitter topic networks: From polarized crowds to community clusters

The author thanks Dr. Marc Smith and the Social Media Foundation for offering a valuable Twitter dataset via NodeXL for the study.