key: cord-1044695-5ml0rajm
authors: Caliskan, Cantay
title: How does “A Bit of Everything American” state feel about COVID-19? A quantitative Twitter analysis of the pandemic in Ohio
date: 2021-04-05
journal: J Comput Soc Sci
DOI: 10.1007/s42001-021-00111-1
sha: 6bcca4b47d8db6306f13644194888c8ff9d90cb0
doc_id: 1044695
cord_uid: 5ml0rajm

COVID-19 has proven itself to be one of the most important events of the last two centuries. This defining moment in our lives has created wide-ranging discussions in many segments of our societies, both politically and socially. Over time, the pandemic has been associated with many social and political topics, as well as sentiments and emotions. Twitter offers a platform to understand these effects. The primary objective of this study is to capture the awareness and sentiment about COVID-19-related issues and to find how they relate to the number of cases and deaths in a representative region of the United States. The study uses a unique dataset consisting of over 46 million tweets from over 91,000 users in 88 counties of the state of Ohio, a state-of-the-art deep learning model to measure and detect awareness and emotions. The data collected is analyzed using OLS regression and System-GMM dynamic panel. Findings indicate that the pandemic has drastically changed the perception of the Republican party in the society. Individual motivations are strongly influenced by ideological choices and this ultimately affects individual pandemic-related outcomes. The paper contributes to the literature by expanding the knowledge on COVID-19 (i), offering a representative result for the United States by focusing on an “average” state like Ohio (ii), and incorporating the sentiment and emotions into the calculation of awareness (iii).

The high level of devastation caused by this unique extreme event motivated many scholars to do research on various aspects related to COVID-19. A quick search online indicates that there are more than 5,000 papers written about pandemic. Among the prominent examples, an eye-opening study by Cinelli et al. [16] looks at the spread of misinformation (usually called infodemics in the literature) from a comparative perspective, using Twitter, Instagram, YouTube, Reddit and Gab. Similarly, Gallotti et al. [27] analyze the reliability of the Twitter messages before and after the pandemic arrives in a specific country. Other examples include the usage of GIS technologies to support the global fight against outbreaks and epidemics [11] .

Twitter, as a social media platform, is a valuable source to gauge the dynamics in a society. As Bourdieu once suggested, linguistic marketplace [12] is a platform where people provide information about themselves for social gain, and, a modern interpretation of this concept would be Twitter. The social gain can be individualistic or collectivistic. In addition to providing a fairly accurate picture of the self, Twitter also precisely reflects the social and economic hierarchies which exist in offline contexts [52] . This paper stands at the intersection of epidemics and social media research, and hence aims to contribute to the literature by focusing on the most recent global pandemic, sars-cov2.

Previous experience shows that extreme events and epidemics provide a unique opportunity to be scientifically productive. A valuable study by Tang et al. [63] provides a systematic review of the literature on social media and outbreaks of emerging infectious diseases by concentrating on H1N1 and Ebola. The authors find that studies suffer from a lack of theorization and need more methodological rigor. Among the 43 papers under focus, 16 papers use Twitter as the source of data. Other social media research examples from the literature on extreme events include 2011 London riots [20] , 2012 hurricane Sandy [34] , and the 2013 European floods [57] . Despite the fact that pandemics are quite rare compared to other extreme events, there are a few studies that use social media to study pandemics as well. One example by Fung et al. [25] examines the amplified fear of the imported Ebola virus in the USA through Twitter. Another example looks at realtime classification of Twitter streams during epidemics [39] .

Other related studies include Missier et al. [45] focus on Twitter and the Zika epidemic by training a classifier to discover the top users who are actively posting relevant content about the topic. Further examples look at the adoption and utilization of social media during extreme events [22, 35, 42, 48] . Like many other examples [29, 47] , this study believes that social media is a very useful source of information during extreme events. Most importantly, social media helps users develop situational awareness during an extreme event and gain a higher resilience [56] .

On the whole, a goal of this study is to bring several fields of research together by forming an interdisciplinary framework: There are very few studies that bring an analysis of communication structure, emotion classification and social network analysis in an extreme event together in a single study; one very rare example looks at three terrorism events from the 2010s in the Western world [62] .

Measuring awareness forms an important component of this paper. The definition of awareness may be subject to interpretation; nevertheless, for this paper, it is used as the "intensity of discussion on a topic". Language-based methods have long been used to detect awareness. In the literature, there are two macro-approaches to measure awareness: dictionary-based and lexicon-based approaches [6] . The first category starts with a small set of opinion words and expands the lexicon through bootstrapping while the second category generates the opinion lexicon through learning the dataset (ibid). This paper uses a lexicon-based approach: the words used to detect awareness have all been extracted from the data at hand. The reason behind this choice was that the expectation that COVID-19 may have drastically changed the language/hashtag landscape because of its devastating social effects and hence a generalizable lexicon would not be helpful (i), and in this way to identify associated with COVID-19 that are not as strongly represented in some other context (for example, the importance of exercising at home).

The second important methodological component that is more widely studied in the literature is the use of hashtags. Hashtags fulfill different roles on Twitter. One of the functions of hashtags is contextually marking conversations around a certain topic [13] and awareness can easily be associated with being part of a conversation. Hashtags are also frequently used for "social tagging" [43] , thus, they fulfill the function of motivating others and making them aware of a topic through self-awareness. Huang et al. [33] suggest that hashtags can be organizational (hence, used for organizing resources) or conversational and serve for transmitting a message. Similarly, an important feature of hashtags is that they are monolingual and do not translate into other languages, thus, they allow for the association of tweets written in different languages [19] .

Hashtags have been used frequently in Twitter-related studies due to their clear message and short one-word structure that do not leave much room for contextual interpretations and technical ambiguities for representation. Thus, they are convenient tools for operationalizing awareness. For that matter, in terms of applications, hashtags are created and used extensively in the context of social protests and thus studied from that perspective [5, 55, 64] , as a measurement of ideology [4] .

The methodological importance of co-occurring hashtags has been noticed by Twitter researchers. Co-occurring hashtags are those that appear among other hashtags in a single tweet and in a collection of tweets (such as among all tweets posted by the same Twitter user). A theoretical discussion on the topic has been proposed by Jan Pöschko [54] where he classifies hashtags into groups both using the linguistic properties of the hashtag and social/geographic variables incorporated into a tweet. Relatedness of hashtags is the core operation which concern hashtags for clustering, classification or recommendation [44, 51, 65] . The significance of hashtags for this study is that they provide an opportunity to detect a range of topics that appear alongside some "core hashtags" that can be used as an alternative tag for the pandemic. These core hashtags have been extracted from various non-academic sources online and are believed to be the most frequently used hashtags in the context of the disease, similarly, they have been qualitatively checked by the author in terms of how well they represent the situation. According to the classification of hashtags provided by Recuero et al. (2012, selected core hashtags are almost always referential; thus, they address the context of what was happening. In addition, Recuero et al. (ibid.) also mention expressive, conative, metalingual, poetic, and phatic hashtags. In most cases, these hashtags have been ignored in this study, since they do not provide any contextual evidence for the range of practical topics that people in Ohio associate with COVID-19.

Other researchers point at the importance of hashtag pairs. In their study on protests in Brazil, Recuero et al. [55] have paid particular attention to the grouping of hashtags and further classified the co-occurring hashtags (individual types of which were mentioned above) in pairs. They believe that co-occurring hashtags may be used to mobilize or motivate people, localize the context, make demands or reinforce an opinion, characterize information or provide general context. Among the most important groups of hashtags, conative-conative hashtag pairs are used to mobilize people through strengthening the meaning of the imperative mood. In this paper, this reinforcement has been translated as "awareness about the pandemic" in general. Similarly, another important hashtag pair provided by Recuero et al. (ibid.) is the reference-reference group. In their paper, this group is mostly associated with the localization of tweets-again interpreted as "domestic awareness" and "nationalistic awareness" in this study. Comparably, emotive tweets have been used to project demands and opinions (about political parties) and used to reflect partisanship or political dissent. Lastly, metalingual hashtags provide information about the situation or they are used to characterize it. Co-occurrences of metalingual hashtags with core hashtags have been used to determine the level of awareness about the social aspects of life related to COVID-19, economic repercussions, ways of entertainment during the pandemic, and sports in general. A similar classification has been brought forward by Shapp [61] where the author differentiates between "tag" hashtags used to designate certain events and "commentary" hashtags that are used to add additional meaning to the main semantic content of the tweet. The "core hashtags" and the "topic hashtags" distinction brought forward in this paper is very similar to the categorization proposed by Shapp.

The connection between awareness and emotions has previously been studied by other scholars. In fact, social media communication systems have been referred to as "social awareness streams" [49] . Thus, they are platforms to indicate awareness about issues important to the users. There is also evidence that these awareness networks are being used to share emotions [, 54] . Sharing of emotions strengthens ties, brings users closer to one another and allows new ties in the network to form [54] . Similarly, the "emotional broadcaster theory" (EBT) indicates that people are highly motivated to tell others what they think about major events, and the posts with the expression of emotion might contain information relevant to listeners [28, 31] . An example that studies the relationship between the sharing of emotions and the properties of social networks is by Kivran-Swaine and Naaman [41] , in which the authors investigate the intensity of social media use and the people's tendency to express emotions.

Psychologists further strengthened the connection between awareness and emotions by conceptualizing 'emotion awareness' [58] . According to Rieffe et al., emotion awareness is an attentional process in which a person monitors and differentiates between discrete emotions and identifies their elicitors. Emotion is projected simultaneously with or as a response to an awareness situation. Thus, an emotional projection consists of a set of components (interpretation of the situation, physiological changes, tendency for action, motor reaction, subjective experience) [60] . In this regard, this article argues that the emotions projected by Twitter users are tightly connected to different issues they are aware of. It is further believed that the intensity of awareness on a topic is strengthened cyclically when other people are aware of that topic, as well. Greater awareness results in a greater intensity in the projection of emotions, and possibly a clearer distinction between different forms of emotions.

Traditional media outlets such as magazines, newspapers, and TV channels have been losing their communicative strength and their power to set discourse over the last few years. The reason behind this phenomenon is the rise of digital media channels (including social media outlets which work as alternative news sources), and the need to communicate news in a much timelier manner. The opportunity offered by social media channels to disseminate news much more punctually than traditional media outlets led to a decrease in the popularity of the latter one.

Social media offers an opportunity to communicate news on many topics including health. Researchers indicate that the use of social media and the better communication of health-related news can lead to better health outcomes. Over time, there has been significant increase in the number of people using social media to provide or seek information on health, share personal experience regarding diseases, medical treatments and medications, and to communicate with experts of healthcare [15, 18, 21, 26, 38] . A systematic review offered by Moorhead et al. [46] identified six advantages of using social media for health communication: "(1) increased interaction with others, (2) more available, shared, and tailored information, (3) increased accessibility and widening access to health information, (4) peer/social/emotional support, (5) public health surveillance, and (6) potential to influence health policy."

Another benefit of using social media data can be observed in public health surveillance [46] . Traditionally, public health surveillance relied on the flow of data coming from healthcare providers and pharmacists [66] . Public health data can be used to identify populations suffering from a particular illness (especially important during a pandemic like , observe the infection patterns of a disease, and identify events related to medications, vaccinations, and other uses of drugs (ibid.).

Other researchers looked at the use of social media for health communication by different demographic groups: younger people use social media disproportionately more for disseminating health-related information, but racial/ethnic disparities cannot be found [36] . Kite et al. [40] analyzed the types of messages shared by public health organizations that the public finds the most engaging.

An initial examination of the data shows that there is considerably high countylevel variation both temporally and across regions in the number of cases and deaths related to COVID-19 in Ohio. The social aspects contributing to the spread or the prevention of the pandemics and the power of social media to measure social dynamics in a context where conventional ways of data collection-such as surveys-have become obsolete provide a unique opportunity for getting help from Twitter. In addition, the sharp interventions introduced by the governors of states to curb down the pandemic offer a unique opportunity to analyze the situation in a quasi-experimental setting. In this context, an interesting research question may be to examine how different levels of awareness and different levels of sentiments about COVID-19-related issues may influence the number of cases and deaths in a county.

The methodological setup of this paper has been structured according to the policies implemented by Governor DeWine. It is assumed and expected that there will be a contagion effect between some of these policies (a list of policies has been given in "Data"). Also, some of the policies will provide a higher incentive for selfisolation and create greater cognizance about the pandemic in general. To minimize these effects and to make the results easily interpretable, this paper offers a twofold analysis by looking at the days before the first case and the days after post-stay-athome order.

With this background in mind, the paper aims to fill several geographical, substantive and analytical gaps in the literature. Talking about potential substantive contributions, COVID-19 is still in an early stage, and especially in countries heavily affected by the pandemic, a social study organized in a region that is highly representative of the United States could be of value for the scholars and policymakers worldwide. In similar regard, this paper aims to extend the regional policy literature. Methodologically, the paper expands the NLP literature on awareness and text similarity by bringing in the sentiment and emotion component to the picture-an issue that has largely been neglected so far in the literature. Lastly, the study offers a regionally focused unique dataset compiled in the early stages of the pandemic and offers an opportunity for new research in the field. This paper answers four related research questions using levels of awareness and emotions extracted from Twitter: Q1. Is pre-first-case awareness about COVID-19 associated with the number of post-stay-at-home-order cases and deaths?

Q2. Is post-stay-at-home-order awareness about COVID-19 after the stay-at-home order associated with the number of cases and deaths? Q3. Is being in a certain pre-first-case mood related to the number of post-stay-athome-order cases and deaths? Q4. Is being in a certain post-stay-at-home-order mood related to the number of cases?

As one can see, there are actually two main questions being investigated with two different datasets. The first expectation is that Ohioans have started to form an awareness about COVID-19 before the pandemic arrived in the continental US, and the awareness they formed has an impact on the number of cases and deaths that have been experienced after the stay-at-home order. The second expectation is that-due to endogeneity-it is difficult to determine the relationship between social awareness/mood and the number of cases during the policy implementation phase and in the pre-policy period due to a low number of cases. The post-policy period offers a better opportunity to analyze those dynamics. The methodological aspects of these choices have been explained in more detail in the next section.

The data for the project has been obtained in several steps and from several sources. To collect a representative sample of users residing in Ohio, tweet activity of the users who have self-identified themselves to be living in geographical coordinates of the state has been live-streamed. The collection has been done using the rtweet module of R statistical language (rtweet.info) for around three days in the month of March 2020. This resulted in the identification of 177,351 users with addresses in Ohio. Among those users, 129,815 have been observed to have self-identified themselves at a county-level address. Their formal addresses have been identified using the Nominatim module that uses OpenStreetMap data as the source of information (openstreetmap.org). The BotOrNot algorithm [17] implemented through the Botometer module of Python language has analyzed those users and finally obtained the results for 105,618 of them. After this pre-processing step and the merging of datasets, 91,096 users have been found. The identification of users who used hashtags in their tweets led to a further elimination of some users, and, thus, the final number of users in the sample became 48,291.

After the identification of users, their tweet histories have been downloaded using the Tweepy module of the Python language (tweepy.org), which provided a total of 188,577,209 tweets. Since the pandemic has entered our lives on the very last day of the year 2019, all tweets dated 2019 or earlier have been discarded from the analysis. This provided 46,078,750 tweets. In addition, discarding the retweets, and tweets with no hashtags, 16,753,733 were recorded. The sample of tweets with hashtags that was obtained is biased towards urban centers to some degree; nevertheless, the proportion of tweets coming from different counties is similar to the distribution of the population within the state of Ohio. A comparison of populations and the sample sizes have been provided in the county-level maps (Fig. 1) .

The county-level data about the number of cases (positive for those who are infected or negative for those who recovered) and number of deaths due to COVID-19 have been obtained from the Ohio Department of Health (https:// coron avirus. ohio. gov/). The data collection starts on January 1, 2020, and ends at the end of April 2020.

The examples below show the variation in number of cases and deaths from different Ohio counties in the crisis. The ridgeline graph (Fig. 2) shows the distribution of cases per person in five different urban centers in Ohio ranked from cities in better condition to the worse. Dayton has the best performance since it has the highest number of cases per person that are zero. Contrastingly, Toledo seems to have the worst performance in terms of preventing the spread of the virus.

The variation of the number of cases across counties can be found in the graph (Fig. 3 ) that compares the growth rate of the pandemic to the number of cases per person in each county. The numbers indicate that Marion county (a mostly rural county north of Columbus, Ohio) and Pickaway County (a semi-rural county to the south of Columbus, Ohio) have the worst performance in terms of fighting against the pandemic.

As previously noted, the methodological setup of this paper is based on the COVID-19-related policy implementations of Governor of Ohio, Mike DeWine. Over the course of the pandemic (the start date of which can be accepted as January 1, 2020), DeWine implemented seven state-wide policies, the most important of which is the last one, the stay-at-home order (KFF.org). An overview of the policies can be found in Table 1 .

The methodological choices made for the paper can be grouped under three sections: measurement of awareness and extraction of emotions from the Twitter data (i), reshaping of the data for analytical purposes (ii), model specification and statistical assumptions (iii). The data-specific and methodological pipeline used in the paper can be seen in Fig. 4 .

Following the data collection stage, the initial step was the identification of "core hashtags" to detect COVID-19-related discussion on social media. These hashtags have been collected from various informal web sites online that provide The complete list of 94 core hashtags can be found in the Appendix. The tweets that include at least one core hashtag have been used to extract co-occurring hashtags, which ultimately resulted in the identification of COVID-19-related topics. To identify the "trending topics", co-occurring hashtags with a count of 100 or above have been identified (this gave a total of 3252 hashtags), and each hashtag has been manually assigned to a topic. For the detection of topics, a few other unsupervised clustering options have been considered, such as Latent Dirichlet Allocation [8] , Non-Negative Matrix Factorization [50] , Louvain-Modularity based clustering on co-occurring hashtag networks [9] , however, none of them gave results as accurate as the hand-coded topic identification. Some hashtags among those (such as #life, #today, #goals cannot be clearly associated with a "general topic" and therefore have been ignored. This provided 1951 hashtags On a related note, this paper uses a "supervised" bag-of-words approach to classify hashtag topics on a continuous scale. To measure the awareness levels, a normalized version of cosine similarity (that is typically used to measure text similarity-for an example see Bird et al. [7] ) and Jaccard similarity (as a validation check) has been used. Thus, hashtags extracted for topic and co-occurring hashtags extracted from tweets have been compared. The formulas for cosine and Jaccard similarity can be found below.

To better understand the type of data used (Fig. 5 ) you can see a network of cooccurring hashtags created based on Zipf's Law. 1 In the literature, there is criticism about using similarity scores to calculate awareness levels on their own. Budanitsky and Hirst [14] introduce the concept of semantic relatedness, indicating that two syntactically irrelevant words (such as hot and cold or car and wheel) may be used together to underline the same meaning. To solve this problem, scholars have identified "sophisticated" similarity measures. Examples include CosText [51] that looks at the cosine similarity of tweets containing the hashtags in question rather than looking at the similarity between groups of hashtags, a labeled LDA model where hashtags are used as labels for the tweets in comparison [44] , CosEntity [24] that constructs a bipartite graph between hashtags and entities in a tweet and Top-k Relatedness (ibid.) that looks at the relatedness of the entities previously mentioned. As previously mentioned and accepted by other works in the literature [24] , tweets are very short and noisy and generally contain very few hashtags. Although not believed to be systematic in nature, this creates several limitations for this paper: a considerable percentage of the data is lost to extract tweets that can be analyzed through hashtags (i), and sentiment analysis and emotion classification perform likely less well than in a controlled study (ii).

Based on the discussion in the paragraphs above, this paper uses a combination of more traditional types of comparison (such as cosine and Jaccard similarity) along with sentiment analysis and emotion classification because of a few reasons. First, using data from a limited geography and a limited timeframe helps to avoid contextual confusions resulting from temporal characteristics. Second, despite the fact that unsupervised topic detection and awareness calculation methods work fine, manually labeling around 2000 hashtags is believed to provide higher levels of precision regarding topic detection. Third, non-continuous, classification-based methods lead to a great data loss, since they want to associate each hashtag/tweet with a single label and therefore do not leave any room for overlapping classification or continuous measurements. Lastly, mathematically more involved similarity measures are harder to interpret and traditional methods offer a more intuitive explanation of the effect size.

As noted, the contribution of the 'awareness calculation' in this paper to the literature is that it controls for sentiments and emotions in a statement. Thus, the second component of the data collection process for explanatory variables has been performed using an "emotion classifier" trained on tweets. The main reasoning behind this choice was to add a sentiment dimension to the awareness calculation. In other words, since awareness about an issue is operationalized through psychological behavior, sentiments are thought to be a nuanced contribution to understanding which social metrics are particularly successful at fighting the pandemic.

The emotion model classifies five different emotions: sadness, happiness, anger, hate and neutrality. The fact that five emotions have been chosen relies on two criteria: availability of labeled emotion datasets (i), and the ongoing discussion on the definition of "basic emotions" that has been started by Paul Ekman in the early 1990s [23] . Ekman originally proposed seven basic emotions, however, this number declined over years, as the expressions for some emotions are more similar to each other than they are to others. Notable works that provide a commendable summary of this debate include Jack et al. [37] , Gu et al. [30] and others.

To classify the emotions, a series of deep learning algorithms have been used. An overview of the model can be seen in the pipeline diagram (Fig. 6) . To predict the emotions in the tweets, pre-trained GloVe vectors have been trained on 2 billion tweets (Pennington et al., 2014) . As the accuracy table indicates, the model performs with an average accuracy of 71% (Table 2 ). Modern algorithms for multinomial emotion classification from related papers have accuracy levels ranging from 61.63 to 85% [32] (Table 3) . Hence, the performance achieved is moderately high, and acceptable for the data at hand. Following awareness measurement and emotion classification, the data have been shaped to answer two sets of questions indicated in the hypotheses section. The linear regression model looks at the relationship between pre-first case awareness and emotions and post-stay-at-home-order number of cases and deaths. For the pre-first case part, January 1-March 9 period has been chosen. The post-stay-at-home order period covers the dates March 23-April 30. The sixteen days between two periods have been ignored since-as previously mentioned-due to the implementation of many policies in that time frame-there is policy contagion and simultaneity between the number of cases and awareness/emotion levels.

In the pre-first-case dataset, each county i is represented as a single, aggregated observation: a set of explanatory variables with average awareness and emotion ratio levels collected from pre-first-case observations and average cases and deaths after the stay-at-home policy has been implemented. The post-stay-at-home-order dataset that is used to analyze the relationship between post-stay-at-home-order awareness and emotion dynamics and the number of cases/deaths has been configured in a panel format. Thus, each observation is the average of values collected from one of the 88 counties in Ohio on a specific day (there are 119 days in total). For the first dataset, a pooled-OLS has been used for the empirical analysis, since it is believed that cross-state disease spread, as well as the policy contagion, is nonexistent in that period. For the second dataset, a dynamic panel estimator has been used to solve the cross-time contagion problem regarding cases and deaths, and two panel estimators have been considered: System-GMM [2] and Difference-GMM [1] . System-GMM has been favored over Difference-GMM, since the latter is believed to have poor finite sample properties (in other words, bias) in cases when the series are highly persistent [10] . In fact, the dataset at hand represents an example where regressors are quite strongly persistent, as the number of cases/deaths from a previous day point can be a very accurate sign of what the results of the pandemic will be on the next day. Finally, Akaike and Bayesian Information Criteria have been calculated to determine the optimal number of lagged variables. The results for AR(1 and AR(2 processes were quite close, and, ultimately AR(1 has been selected to obtain better interpretability of the results. In the end, four different empirical models were created. 2 The formulas have been provided below. (ii)PL Average Cases or Deaths i = 0 + 1 * Total county population i + 2∶20 * PF Average Awareness Score i + 3 * PF Average Positivity Score i + 4 * PF Average Negativity Score i + i (iii)PLDailyCasesorDeaths i,t = i + t + 1 * PLDailyCasesorDeaths i,t−1 + 2∶5 * PLAverageEmotionRatio i,t−1 + i,t (iv)PL Daily Cases or Deaths i,t = i + t + 1 * PL Daily Cases of Deaths i,t−1 + 2∶20 * Average Daily Awareness Score i,t + 21∶39 * PL Average Daily Awareness Score i,t−1 + 40 * PL Daily Positivity Score i,t + 41 * PL Daily Positivity Score i,t−1 + 42 * PL Daily Negativity Score i,t + 43 * PL Daily Negativity Score i,t−1 + i,t , relationship between PL awareness and emotions and PL daily number of cases and deaths. 3 Despite the careful selection of models and the effort to obtain highly representative data, a few drawbacks resulting from the chosen datasets and design need to be mentioned: some counties (very few in number) have not had any cases or deaths for some portion of the post-stay-at-home period-which decreases the variation in the dependent variable (nevertheless, this does not prevent the invertibility of the Hessian). In addition, as previously indicated, neither the emotion classification model nor (expectedly) the calculation of the awareness scores provides a perfect operationalization of the concepts at hand.

A first descriptive look at the data suggests that it is difficult to clearly measure the impact of different policies implemented by Governor DeWine. In fact, most of the policies have been implemented in a period when Ohio did not have many cases. As seen in Fig. 7 , different policies shown with vertical lines do not correspond to a lagging effect of decrease in cases and/or deaths. Thus, as highlighted in the methodological setup, it is theoretically more useful to look at factors that are associated with COVID-19 controlling for the policy effects. 4 The distribution of the explanatory variables, on the other hand, is shown in the graphs in Figs. 8 and 9 that demonstrate the variation in awareness and emotions over time. Figure 8 shows the average amount of normalized cosine similarity Figure 9 shows the ratio of emotions on a given day. With regard to the awareness, it is evident that people worried much more about the politics and much less about the economy before the state-wide policies have been implemented. Similarly, Ohioans chatted the most about the pandemic just before the number of cases and deaths reached a peak number. In addition, there was a considerable amount of discussion about the pandemic before the first case in Ohio. This is in accordance with the expectations: Before the economic impact of COVID-19 was felt by the society, there was a lot of political discourse about how to best deal with the pandemic. After the impact hit Ohio quite strongly, the number of cases surged, many businesses closed, and the structure of the participation in the workforce changed considerably; this may have led to a stronger discussion about the economic aspects of the situation. Looking at the distribution of emotions over time in Fig. 9 , the variation is not as great; however, one can notice that during the March 9-March 23 period when the state-wide policies were quickly introduced, the ratio of tweets classified as "sad" has slightly increased.

In summary, particularly for the case of awareness scores, the data at hand shows great promise for empirical analysis: there is variation across counties, and there is also variation across time. Similarly, as evident from the data, state-wide policies may be one factor behind the change in awareness scores and the variation in emotions. Lastly, it makes sense to look at awareness and emotions separately, since-if there is any-the effect of policies is comparably lower on emotions than it is on the set of awareness scores. The emotional content of the tweets may vary less than the awareness since most COVID-19-related tweets contain a greater amount of factual (or non-factual) information, but less emotional content. This is believed to be a result of a complex set of causes including culture and the demographic and socioeconomic backgrounds of the Twitter users in Ohio. Table 4 Pre-first case awareness (X) and post-stay-at-home cases and deaths (Y)

Empirical findings for the study have been provided in Tables 4, 5, 6, 7 and 8. To show the effect sizes for different regressors, a heatmap is used. The results obtained from the four models offer us a story that can best be explained by the existing ideological divides within the American society. Nevertheless, controlling for the number of cases and deaths, the most striking reason that makes the findings interesting is that there seems to be a considerable amount of shift in the level of awareness and emotions reflected by the group of people affected by COVID-19 when we compare the pre-first case and the post-stay-at-home datasets.

Looking at the pooled-OLS model that reports on the association between prefirst case awareness scores and emotions (X) and post-stay-at-home number of cases and deaths (Y), one can see that people who are opposed to the Republican symbols and ideology (Republicans-Hate) have experienced a lower number of cases. In addition, people who frequently talk about COVID-19 have experienced a smaller number of deaths, and people who discuss sports had a higher number of deaths on average. Among all significant variables, Republicans-Hate stands out with its great effect size. This result can best be explained with the general Table 5 Pre-first case emotions (X) and post-stay-at-home cases and deaths (Y) Table 6 Post-stay-at-home awareness (X) and post-stay-at-home cases (Y) Table 7 Post-stay-at-home awareness (X) and post-stay-at-home deaths (Y) consensus established in the previous months that more liberal segments of the population are more sensitive to the protection against the disease and the prevention of its spread (given that Ohio is a "swing state" that always has fierce electoral competition between Republicans and Democrats (Pew Research Center, June 25, 2020). The reason why "sadder" segments of the society have experienced lower numbers of cases is less clear; nevertheless, this result can still be tied to the stereotypical Democratic vs. Republican interpretation of the pandemic. It is quite likely that parts of the population that are more empathetic to people affected by COVID-19 have developed a grimmer outlook in the earlier phases of the pandemic.

The System-GMM models used for the post-stay-at-home dataset offer a different story. In this case, statistically, all awareness scores and emotion ratios (X) are significantly correlated with the number of cases and deaths (Y) controlling for the lagged independent and dependent variables, as well as the sentiment component of the tweets. Ranked by the effect size, awareness about health technology (i), domestic issues (ii), and opposition against the Republican symbols and ideology (iii) are the top three regressors significantly positively correlated with cases and number of deaths. Contrastingly, awareness about foreign aspects of the pandemic (i), support for the Republican Party and symbols (ii) and awareness about social and nationalistic aspects of COVID-19 (tied-iii) are significantly negatively correlated with the number of cases and deaths. Looking at the emotions, 'being happy' is associated with having a fewer number of cases and deaths.

A comparison between two groups of results shows contrasting aspects possibly hinting at the fact that COVID-19 may have changed social and political perceptions Table 8 Post-stay-at-home emotions (X) and post-stay-at-home cases and deaths (Y) in the population. As expected, people who tweet about possible ways of ending the pandemic are those who have experienced the pandemic in their close communities. Thus, they are associated with more cases and deaths. More interestingly, however, counties that overwhelmingly oppose Republicans are associated with a higher number of cases and deaths in the post-stay-at-home period. This is likely due to the comparably poor response of the United States to the pandemic (Foreign Policy, April 1, 2020) and reflects the shift in the approval rating of the government. However, even more interestingly, people who show overwhelming support for the government are associated with a lower number of cases and deaths. And, again, this is likely a result of the "rally around the flag effect" [3] as evidenced by the close to 10% increase in the approval rating of President Trump in the initial phases of the pandemic (Gallup.com). Thus, some people in the sample withdrew their support and they have been replaced with others. Also, the findings suggest that people with a more global awareness about the pandemic and who also care about their country in a nationalistic way are associated with lower number of cases and deaths. Also, expectedly, counties less heavily affected in the lockdown period feel happier on average.

This paper investigates the relationship between awareness and sentiment of the people in a region that is highly representative of the country of the United States and the effects of COVID-19 on its people. The most important finding is that COVID-19 as a process has changed the awareness and social perceptions of people on COVID-19-related issues as the pandemic has progressed. Specifically, segments of the society that are least hardly hit by COVID-19 were associated with opposition to Republican symbols in the initial phases of the pandemic; the same group is associated with a higher number of cases and deaths during the peak phase. My explanation for this shift is that the "rally under the flag" effect has been replaced with the perceived lackluster performance of the government when the effects of the pandemic became more serious. Additionally, another important finding is that a global perspective on the issue seems to be correlated with better COVID-19 outcomes.

The more important question that is more difficult to answer is: Can or should policymakers and/or innovators react to these findings? The answer is, probably, yes. As the paper is yet another suggestion that America is politically divided, based on the results, policymakers can benefit from focusing on two different strategies. First, policymakers should react timely to new developments and, therefore, not wait for a politically or populistically motivated response to grow. If factual information is brought forward punctually, the public will have more time to analyze and deliberate about the results and will likely more critically evaluate political and populistic statements by politicians. Second, results indicate that certain COVID-19-related topics, such as social and entertainment, are associated with higher cases and deaths. It is a human need to be in close proximity with others and socialize, and this need will grow even larger in an even worse health crisis requiring further isolation. Thus, the second goal should be to devise innovative policies to satisfy social needs.

These findings also contribute to our understanding of the current global health crisis and its likely consequences. First, people's relationship with their government seems to be a good indicator of how successfully they can deal with an extreme event. Second, the findings reinforce the idea that crisis situations reshuffle the perceptions in a society and can have political consequences for the government. To the extent that voters share this assessment, governments with poor COVID-19-related outcomes may weaken in the coming years especially in prolonged crisis situations; this is important to keep in mind for populist governments that have performed quite poorly in the pandemic (New York Times, June 2, 2020). This study has implications for policymakers, as well: party ideologies will likely be formed by even greater ideological divides and greater gap between each other in terms of technical aspects. Political differences will grow if outcomes continue to be difficult to measure objectively or they become clear only in the long term.

Some tests of specification for panel data: Monte Carlo evidence and an application to employment equations

Another look at the instrumental variable estimation of errorcomponents models

Patriotism or opinion leadership? The nature and origins of the "rally'round the flag" effect

Salience vs. commitment: Dynamics of political hashtags in Russian Twitter

Gatekeeping Twitter: Message diffusion in political hashtags

An overview of sentiment analysis in social media and its applications in disaster relief

Natural language processing with python: analyzing text with the natural language toolkit

Latent dirichlet allocation

Fast unfolding of communities in large networks

Initial conditions and moment restrictions in dynamic panel data models

Geographical tracking and mapping of coronavirus disease COVID-19/severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) epidemic and associated events around the world: how 21st century GIS technologies are supporting the global fight against outbreaks and epidemics

The economics of linguistic exchanges

Tweet, tweet, retweet: Conversational aspects of retweeting on twitter. In: 2010 43rd Hawaii International Conference on System Sciences

Evaluating wordnet-based measures of lexical semantic relatedness

Social media and clinical care: ethical, professional, and social implications

The COVID-19 Social Media Infodemic

Botornot: A system to evaluate social bots

How affiliation disclosure and control over user-generated comments affects consumer health knowledge and behavior: A randomized controlled experiment of pharmaceutical direct-to-consumer advertising on social media

Using hashtags to disambiguate aboutness in social media discourse: A case study of #OrlandoStrong. Electronic Theses and Dissertations

Social media and the police: Tweeting practices of British police forces during the

Practical guidance: The use of social media in oncology practice

Social media in disaster response:queensland police service-public engagement during the 2011 floods

Are there basic emotions?

On analyzing hashtags in Twitter

Ebola and the social media

The Use of Social Media in Public Health Surveillance

Assessing the risks of "infodemics" in response to COVID-19 epidemics

Studying online social networks

Social media as crisis platform: The future of community maps/crisis maps

Differentiation of primary emotions through neuromodulators: Review of literature

The emotional broadcaster theory of social sharing

Emotion detection in text using nested long short-term memory. 11480 (IJACSA)

Conversational tagging in Twitter

Online public communications by police & fire services during the 2012 Hurricane Sandy

Twitter adoption and use in mass convergence and emergency events

Use of social media in health communication: Findings from the Health Information National Trends Survey

Dynamic facial expressions of emotion transmit an evolving hierarchy of signals over time

Social media in public health

A tale of two epidemics: Contextual Word2Vec for classifying twitter streams during outbreaks

Please like Me: Facebook and Public Health Communication

Network properties and social sharing of emotions in social awareness streams

Exploring extreme events on social media: A comparison of user reposting/retweeting behaviors on Twitter and Weibo

Position paper, tagging, taxonomy, flickr, article, toread

Entity-centric topic-oriented opinion summarization in Twitter

Recruiting from the network: Discovering Twitter users who can help combat zika epidemics

A new dimension of health care: Systematic review of the uses, benefits, and limitations of social media for health communication

Enhancing disaster management through social media analytics to develop situation awareness what can be learned from twitter messages about hurricane sandy? In: PACIS

Participatory sensing data tweets for micro-urban real-time resiliency monitoring and risk management

Is it really about me? Message content in social awareness streams

Mining the posterior cingulate: Segregation between memory and pain components

Semantic expansion of tweet contents for enhanced event detection in Twitter

The linguistics of self-branding and micro-celebrity in Twitter: The role of hashtags

Zipf's word frequency law in natural language: A critical review and future directions

Hashtags functions in the protests across Brazil

Combining real and virtual volunteers through social media

XHELP: Design of a cross-platform social-media application to support volunteer moderators in disasters

Emotion awareness and internalising symptoms in children and adolescents: The EMOTION AWARENESS QUESTION-NAIRE REVISED

Social sharing of emotion: New evidence and new questions

Psychological models of emotion

Variation in the use of Twitter hashtags

Sense-making in social media during extreme events

Social media and outbreaks of emerging infectious diseases: a systematic review of literature

Evolution of online user behavior during a social upheaval

We know what@ you# tag: Does the dual role affect hashtag adoption?

Harnessing social media for health information management

Core hashtags #2019_ncov, #2019_nCoV, #2019ncov, #2019nCoVmissouri, #ARDS, #Asthma, #Congestionnasal, #convid19, #COPD, #corinavirus, #cornavirus, #corona, #coronachina, #coronaoutbreak, #coronavairus, #coronavid19, #coronavirius, #coronavirus, #coronavir¸s, #coronaviruses, #coronavirusitalianews, #coronavirusitaly, #coronavirusoutbreak, #coronaviruspandemic, #coronaviruss, #coronavir¸s¸, #coronavirusupdates, #cotonavirus, #cov19, #covd19, #COVID, #cov?d, #covi?d_19, #COVID19, #Covid19, #covid19, #cov?d19, #covi?d19, #Covid-19], #covid19italia, #covid19news, #covid19outbreak, #covid19pr, #covid2019, #Covidiots, #covidnews, #covid?19, #cvid19, #DeviatedSeptum, #disease, #dontpanic, #epidemic, #FlattenTheCurve, #Flu, #Grippe, #H1N1, #HcoV19, #illness, #Influenza, #IStayHomeFor, #Legionnaires, #LockdownNow, #ncov, #ncov19, #nCoV19, #ncov2019, #ncov2019, #nCoV2019, #Pandemic, #pandemic, #plagueinc, #pleuralEffusion, #Pneumonia, #precaution, #Preven-tingTheFlu, #prevention, #quarantine, #SafeHands, #SARSCoV2, #sarscov2, #SocialDistancing, #StayAtHomeChallenge, #StayHome, #staysafe, #Together-AtHome, #ViewFromMyWindow, #virus, #viruses, #worldhealth, #worldhealthorganization, #wuhan, #WuhanPneumonia, #wuhanvirus, #WuhanVirus Funding No funding has been received for the completion of this work.

On behalf of all authors, the corresponding author states that there is no conflict of interest.