key: cord-0572327-8hmt0z17 authors: Huang, Xiao; Li, Zhenlong; Jiang, Yuqin; Li, Xiaoming; Porter, Dwayne title: Twitter, human mobility, and COVID-19 date: 2020-06-24 journal: nan DOI: nan sha: e9eb784ce0fc97ccb170fdbee8e811267f9c09ad doc_id: 572327 cord_uid: 8hmt0z17 The outbreak of COVID-19 highlights the need for a more harmonized, less privacy-concerning, easily accessible approach to monitoring the human mobility that has been proved to be associated with the viral transmission. In this study, we analyzed 587 million tweets worldwide to see how global collaborative efforts in reducing human mobility are reflected from the user-generated information at the global, country, and the U.S. state scale. Considering the multifaceted nature of mobility, we propose two types of distance: the single-day distance and the cross-day distance. To quantify the responsiveness in certain geographical regions, we further propose a mobility-based responsive index (MRI) that captures the overall degree of mobility changes within a time window. The results suggest that mobility patterns obtained from Twitter data are amendable to quantitatively reflect the mobility dynamics. Globally, the proposed two distances had greatly deviated from their baselines after March 11, 2020, when WHO declared COVID-19 as a pandemic. The considerably less periodicity after the declaration suggests that the protection measures have obviously affected people's travel routines. The country scale comparisons reveal the discrepancies in responsiveness, evidenced by the contrasting mobility patterns in different epidemic phases. We find that the triggers of mobility changes correspond well with the national announcements of mitigation measures. In the U.S., the influence of the COVID-19 pandemic on mobility is distinct. However, the impacts varied substantially among states. The strong mobility recovering momentum is further fueled by the Black Lives Matter protests, potentially fostering the second wave of infections in the U.S. The outbreak of Coronavirus disease caused by the SARS-CoV-2 virus is a public health emergency that raises wide concerns worldwide, leading to serious health, economic, and social challenges. As of June 23, 2020, there had been a total of 8,993,659 infections and 469,587 deaths globally [1] , and these figures are progressively increasing every day. On March 11, 2020 , the World Health Organization (WHO) reassessed the situation and officially declared COVID-19 as a pandemic, urging countries and regions worldwide to join forces [2] . Since then, major behavioral, clinical, and intervention policies (both strict and loose) have been undertaken to reduce the spread and prevent the persistence of the virus in human populations. Initially discovered in Wuhan, China, at the end of 2019 [2] , an outbreak of COVID-19 was first declared in mainland China in January 2020, before it spread out to European countries, most notably Italy, France, and the U.K, where a significant increase of cases and deaths was noticed after the outbreak in China. In the United States, the first confirmed case occurred on January 19, 2020, in Snohomish County, Washington. Shortly after, the United States has become the new epicenter of the disease as it surpassed Italy in terms of confirmed cases on March 26, 2020 [3] . As of May 31, 2020, there had been a total of 2,302,288 confirmed cases (25.6% of global cases) and 120,333 deaths (25.6% of global deaths) in the U.S. alone, according to the U.S. Centers for Disease Control and Prevention (CDC). To contain the COVID-19 pandemic, one of the non-pharmacological epidemic control measures is to reduce the transmission rate of SARS-COV-2 in the population via social distancing or other similar (self) quarantine measures [4] , with the ultimate goal to reduce person-to-person interactions. Studies have found notable declines in transmission rates after the implementation of mobilityreducing policies in China, Korea, and many European countries [5] [6] [7] [8] . Despite the success of these efforts, not all countries/regions chose to handle the pandemic in a similar manner [9, 10] . The discrepancies in policies and measures at different geographical levels urge an approach to monitoring the mobility dynamics in response to the pandemic, as mobility patterns largely indicate how people respond to the pandemic and whether policies are implemented effectively [11] [12] [13] . Since the initial outbreak of COVID-19, numerous efforts have been made, by incorporating the emerging concept of "Web 2.0" [14] , "Big Data" [15] , and "Citizen as Sensors" [16] , to obtain timely information regarding whether people are actively reducing their exposure to COVID-19 by reducing distances traveled, and by how much. Companies like Google and Apple have released their aggregated and anonymized community mobility reports based on data collected from their mapping services (i.e., Google Maps and Apple Maps). Those reports are updated on a daily basis and can be easily downloaded. In addition, authorities started to collaborate with mobile network operators to estimate and visualize the effectiveness of control measures [11, 17] , in light of the previous success of mobile phone data in assisting the modeling of the spread of other epidemics [18] [19] [20] . Shortly after the outbreak in China, mobility data from Baidu, a famous Chinese online platform, have been put into use to evaluate the effectiveness of the lockdown measure in Wuhan [5] . Leading telecommunication firms also contribute by collaborating with local authorities to estimate the efficiency of travel restrictions as well as to identify the impact of other mobility-reducing related measures [21, 22] . In the U.S., Descarte Lab (www.descarteslabs.com/mobility) has released mobility statistics derived from mobile devices, aiming to facilitate the acquisition of rapid situational awareness at the State-and County-level. City-level studies have also been conducted. For example, locational data from Cuebiq (https://www.cuebiq.com/), gathered via over 180 mobile applications, were used to monitor how social distancing guidelines are implemented on a daily basis in the city of Boston [11] . However, privacy advocates have voiced concerns on whether sharing customer data is appropriate, even in a time of crisis [23, 24] . The rapid spread of the COVID-19 at the global level highlights the need for a more harmonized, less privacy-concerning, easily accessible approach to monitoring human mobility. The rise of social media platforms such as Twitter (twitter.com), Flickr (www.flickr.com), and Instagram (www.instagram.com), offers another possible solution to closely monitoring human mobility changes, thanks to the timely geospatial information from the enormous sensing network constituted by millions of users. The huge volume of usergenerated content from social media platforms greatly facilitates the real-time or near real-time monitoring of human mobility, providing timely data of how people respond to the COVID-19 pandemic geographically, especially within different epidemic phases. The advantages of social media with respect to the aforementioned sources of digital information are that they are extensive (covering large spatial areas), easily accessible, with less privacy concern, and at very low cost [25] [26] [27] [28] . Extracting useful information from social media is not new, as the valuable geospatial insights from social media have been explored in a wide range of fields, including hazard mitigation [29] [30] [31] , evacuation monitoring [27, 32, 33] , urban analytics [34] [35] [36] [37] , and public health [38, 39] , to list a few. Despite the existing applications, few attempts have been made to explore the great potential of the human mobility statistics derived from social media during serious epidemic events, like the COVID-19 pandemic. Questions like whether the mobility data from social media can quantitatively reflect the collaborative effort in fighting the COVID-19 pandemic, how sensitive the mobility is during different epidemic phases, and how it corresponds to the everchanging policies in different geographical regions, deserve answers. To answer the above questions, we focus on Twitter, a popular social media platform, and analyze more than 587 million tweets from all over the world, to see how the worldwide collaborative efforts in reducing the mobility are reflected from this user-generated information in three different scales: global scale, country scale, and U.S. state scale. We propose two types of distance, respectively termed as single-day distance and cross-day distance, to quantify different aspects of public mobility observed from Twitter. We further normalize these distances by setting up their corresponding baselines. To quantify the responsiveness in certain geographic regions within different epidemic phases, we propose a mobility-based responsive index (MRI) to capture the overall degree of mobility changes in response to the COVID-19 pandemic within a specific time window. Finally, we contextualize the mobility dynamics derived from Twitter with detailed measures from local authorities to shed light on their effectiveness. The theoretical, methodological, and contextual knowledge in this study is expected to inspire future applications of these easily accessible, less privacy-concerning, highly spatiotemporal data. More than 587 million geotagged tweets posted by over 10 million twitter users are collected using the official Twitter Streaming Application Programming Interface (API), comprising a five-month period from January 1, 2020 to May 31, 2020. These tweets are stored and queried in a tweet repository managed in an in-door Hadoop cluster with 13 servers using Apache Hive and Impala. A geotagged tweet is a Twitter post with embedded geolocation in the format of exact coordinates (latitude and longitude) from the device's GPS or place names (e.g., state, county, city). While the locational accuracy of a geotagged tweet varies, depending on the settings of the account and how a user chooses to share his/her location, we exclude the tweets that are geotagged with spatial resolution lower than the city level to increase the accuracy and credibility of the mobility pattern. Following Martin et al. [40] , we filter out the non-human tweets by looking at the tweet source from which application a tweet is posted. The computation of travel distance requires locational information from at least two positions. Thus, only users who post tweets on two consecutive days are included in the calculation of cross-day distance, a measure that quantifies displacement between two consecutive days (details in Section 3.1). For single-day distance, a measure that highlights the daily travel pattern, only users who post at least twice a day are included in the calculation (details in Section 3.1). Note that all distances computed in this study are Great Circle distances. To quantify daily human mobility from collected Twitter data, we propose two different types of distance, respectively referred to as single-day distance ( ) and cross-day distance ( ). The concepts of the two distances are presented in Fig 1. In general, represents the users' daily maximum travel distance of all locations relative to the initial location. Its calculation is confined within a single day so that users' daily travel patterns can be revealed. Different from , measures the mean center shift between two consecutive days. where denotes the GCD operator and where , ̅̅̅̅ denotes the mean center for user in day and denotes the mean center operator. for user in day , referred to as , , is further computed by applying the Great Circle distance operator: Intuitively, and represent different aspects of mobility with measuring maximum single-day travel distance and measuring cross-day displacement. The dynamics of and are expected to reflect on how the COVID-19 pandemic affects people's mobility patterns geographically, presumably indicating the regional degree of responsiveness. Inspired by the methodological design in mobility reports from Google (www.google.com/covid19/mobility) and Apple (www.apple.com/covid19/mobility), we set up baselines for and respectively. Unlike studies that utilize a single baseline value summarized from a fixed period of time, our mobility baselines are set for each corresponding day of a week, as a week has been widely recognized as an independent cycle in mobility [25, 41] . That is to say, we calculate a total of fourteen baseline values, seven for and seven for , corresponding to each day of a week. For a geographical region ℝ (globe, a country, or a state), let ℝ and ℝ represent the and of ℝ in day , respectively. We define that suggests that within region ℝ in day , reduced mobility is observed compared with the baseline mobility when measuring single-day distance (or cross-day distance). After the normalization in the previous section, a baseline of (i.e., = 1 ) that separates patterns of increased mobility and reduced mobility is formed. Intuitively, for a time series of values, the size of the area under the NMI baseline ( ) represents the degree of positive responses (i.e., reduce in mobility) for a given period of time, while the size of the area above the NMI baseline ( ) indicates otherwise (Fig 2) . Hypothetically, represents a perfect scenario where mobility instantly reduced to 0 from the beginning and remains 0 until the time series ends. Conceptually, the mobility-based responsive index we propose is the ratio between the net positive response to the perfect scenario, i.e., where ∑ and ∑ respectively denote the summation of areas under the curve and the summation of areas above the curve, given a specific time period. To remove noises, we smooth the time series using a one-dimensional Gaussian filter ( = 2) so that the general trend can be revealed. Further calculations regarding the size of the areas are all based on the smoothed time series. Given the different nature of the two proposed distances, we calculate their separately: where and denote the with and being measured, respectively. We further compute an integrated by weighting and using their total sample sizes: where and denote the total sample sizes used to calculate and , respectively. The rationale of deriving an integrated by fusing and is that, despite their different calculations, they reflect human mobility from diverse perspectives, and therefore their integration serves as an overall index that better summarizes the general degree of mobility-based responsiveness geographically. The derived has a range of (−∞, 1]. In general, the higher the value, the better responsiveness a region has, with = 1 suggests a hypothetically perfect responsiveness. A positive ( > 0 ) suggests positive responsiveness (reduce in mobility) for a region, while a negative one suggests otherwise. As most countries in the world started to aggressively respond to the COVID-19 pandemic after March 2020, we set our baselines in a temporal period from January 13 (to exclude abnormal mobility patterns due to the New Year holiday season) to February 29. Since the outbreak in China and the dramatic increase in cases in Europe, many countries have imposed and continue to impose travel bans and lockdowns [42] . As a result, both and have greatly deviated from their corresponding baselines, especially after March 11, when WHO declared COVID-19 as a pandemic (Fig 3a) . Because both our baselines are set for the individual day in a week, their projections exhibit a clear weekly pattern. In comparison, the time series of and , especially after the declaration of COVID-19 as a pandemic, show considerably less periodicity (Fig 3a) , suggesting that the protection measures (e.g., travel restrictions, social distancing policies, stay-at-home orders) have obviously affected people's weekly routines. The visual comparison of global cross-day trajectory in February (Fig 3b) and May (Fig 3c) has also demonstrated the collaborative effort in reducing mobility to fight the pandemic, evidenced by the remarkably lower density of global trajectory in May. The gap between baselines proves the different nature of and , well explaining our rationale of normalizing and separately, according to their corresponding baselines. We further observe that, throughout the entire time series, the daily value of is considerably lower than the daily value of . This phenomenon can be explained by the existence of a large amount of Twitter users who, despite their large single-day travel distance (high value), keep a similar daily posting routine, which leads to the no significant shift of mean centers between two consecutive days (low value). We observe similar mobility dynamics when and are respectively normalized to and according to their baselines (Fig 4) . Both and started to deviate from the baseline ( = 1) around ten days before the pandemic declaration from the WHO, suggesting that strong mobility-reducing measures had been taken before the declaration on March 11. This mobility pattern coincides with strong early travel restrictions implemented in Europe and Asia at the beginning of March [6, 43] . At the end of March, both and reached the bottom with the lowest = 0.70 and the lowest = 0.45, indicating that single-day distance and cross-day distance respectively reduced to 70% and 45% of the ones in the normal situation. Starting from the end of April, however, both and started to bounce back, and the increasing trend continued to the end of May, presumably resulting from the gradually lifted quarantine measures [44] . Compared with the hypothetically perfect scenario ( = 1) where mobility instantly halts and remains 0 throughout the time series, the overall for the three-month combined is 0.31, and the for the March, April, and May respectively are 0.24, 0.38, and 0.32, revealing the less responsiveness in May compared with April. For country scale study, we set the mobility baseline in a period from January 13 to February 15, as some countries (e.g., Italy and South Korea) already imposed strict or voluntary mobility-reducing policies as early as in late-February. To ensure that Twitter records are sufficient enough to generate a reasonable and stable time series, we mainly target the top 20 countries with most Twitter users, according to the Digital 2020 April Global Statshot Report [45] . The selection of those countries mostly agrees with the Twitter data we collected In general, the impact of COVID-19 pandemic on mobility derived from Twitter is obvious, as the mobility of the selected 20 countries, measured by single-day distance and cross-day distance, is mostly below the mobility baseline in March, April, and May (Fig 5) , suggesting that mobility-reducing measures have been suggested and adopted in those countries. However, the country-level discrepancies in the time series of and can be clearly observed. The mobility in Japan started to drop in late-February (Fig 5) , presumably in response to the announcement by Prime Minister Shinzo Abe on February 27 to close all Japanese elementary, junior high, and high school [46] . The further decline of mobility from early-April to late-April can be explained by the proclamation of the State of Emergency for Tokyo (April 7) and for the rest of the country (April 16) [47] . The mobility of Japan is expected to bounce back, as Japan ended the state of emergency in all of Japan On May 25 [48] . Given the limited temporal coverage of our data, however, its impact on mobility remains unknown. The mobility of the United States started to drop in mid-March when a series of statements were announced, including the declaration of COVID-19 as a pandemic by the WHO (March 11) and the declaration of National Emergency by the White House (March 13). The mobility remained consistently low in April, then gained an upward momentum in May, largely due to the gradually loosened measures [49] . A similar mobility pattern can also be observed in India, where mobility reduced following the WHO's declaration in mid-March and gradually rose in May. Mobility in Malaysia was slightly below the baseline in late-February and early-March. The sudden mobility drop appeared on March 18, which coincides with the date when the Movement Control Order (MCO) from the federal government took effect [50] . The rapid mobility reduction in Malaysia demonstrates that the MCO was effectively and efficiently executed. In Saudi Arabia, mobility started to reduce as early as March 2, when the first case was confirmed [51] . However, and gradually diverged as remained stably low in April and May, while became unstable and eventually recovered and even surpassed baseline mobility in mid-May. The divergence in trends of the two types of distances can be partially explained by the suspension of flights and mass land transport (trains, buses, and taxis) that took effect on March 21 [52] . The lack of public transit is responsible for the consistently low cross-day distance. Table 1 ). The high (0.42) of March in South Korea, a country that suffered from the initial spread of the epidemic in its early stage besides China, indicates that the early and strong mitigation measures were announced and implemented effectively. In light of the gradually easing situation [53] , the social distancing measures in South Korea started to be lifted, evidenced by the fact that its decreased respectively by 0.10 and 0.12 in April and May. In the U.S., the mobility-based responsiveness in March (0.19) is among the weakest in the 20 selected countries (Table 1 ). In April, the of the U.S. reached 0.39, a net gain of 0.20 compared to the in March. The strong responsiveness of mobility in April is largely due to the gradually issued state-level stayat-home orders since late-March that eventually affected at least 316 million people in at least 42 states [54] . With the lifting of orders in late-April and May, however, the U.S. showed reduced responsiveness, evidenced by its 0.1 loss in of May compared to April. As the U.S. has become the new COVID-19 epicenter, the reduced mobility responsiveness, along with the rocketing number of confirmed cases, deserves more attention. Besides the countries presented in Fig 5, Table in S1 Table. Given that the first State of Emergency related to COVID-19 in the U.S. was declared by Washington State (WA) on February 29, while the majority of the states started to react aggressively after mid-March, we set the U.S. mobility baseline in a period from January 13 to February 29. In general, the influence of the COVID-19 pandemic on mobility is distinct, as the drop of mobility in most of the states happened in mid-March (Fig 6) , potentially triggered by the events that include the pandemic declaration (March 11) and the National Emergency declaration (March 13). Although social distancing guidelines that aim to curb the spread have been suggested in the entire nation, the impacts varied substantially among states (Fig 6) . Heavily hit states, e.g., NY, NJ, IL, CA, MA, and PA, generally experienced sharp mobility reduction, and their mobility remained stably low since mid-March. States with low numbers of cases, e.g., DE, MT, ME, WV, SD, and WY, despite the fluctuations in their time series, exhibited relatively marginal mobility reduction compared with heavily hit states. As the first state to announce the State of Emergency at the end of February, the mobility in WA remained close to the baseline in early March. It was not until mid-March that the mobility of WA started to noticeably decrease, which potentially indicates that the early mitigation policies in WA were not implemented effectively. The time series of mobility in states that include KS, MN, MS, AL, WV, SC, and WY, presents a bowl-shaped pattern, suggesting the strong recovery of mobility with some even bouncing beyond the baseline, due to the gradually loosened measures. In response to the COVID-19 pandemic, eight states, including AR, IA, ND, NE, SD, UT, OK, and WY, decline to impose state-wide stay-at-home orders by favoring other restrictions [54] . Without the orders, however, the aforementioned states still present considerable mobility reduction amid the pandemic, indicating the effectiveness of the federal guidelines and other mitigation approaches from the local government. Given the insufficient samples in the calculation of the baseline mobility, the mobility pattern in VT is not presented in Fig 6. The state names associated with their abbreviations and the user counts for distance calculation in each state are presented in the In late-May, the risk of transmission in the U.S. was further complicated by the protests that demand justice after Mr. George Floyd died following an altercation with police. A noticeable mobility increase following the incident can be found in MN, where the incident happened (Fig 7) . Carried by the existed mobility recovering momentum in mid-May, MN saw a significant increasing trend in both and at the end of the time series. A distinct spike can be found on May 29, 2020, when the raw and raw all went beyond the baseline, with the (representing single-day maximum travel distance) reaching about 2.5 times than usual as a consequence of the increased activity during the protests. The divergent functionality of the smoothed and the raw is well illustrated as the former highlights the general trend while the latter is able to capture the spikes caused by disruptive events. At the time of writing, the protests have gradually spread across the U.S. and even overseas. The increase in mobility resulting from the protests deserves close monitoring, as standing in a crowd for long periods undoubtedly raises the risk of increased transmission and further worsens the situation. The monthly at the state level further highlights the responsiveness of each state in the three-month period. As expected, states with early spikes of cases and early strong mitigation policies tend to have a higher in March (Fig 8) . RI (0.49) leads the in March, followed by WA (0.48), MA (0.45), NY (0.45), and DC (0.42) ( Table 2 ). The high responsiveness at the early stage suggests that the mobility-reducing guidelines were implemented timely and efficiently in those states. In April, the responsiveness in almost all the states continued to strengthen (Fig 8) , given the rising of cases and gradually tightened measures, except ID and MT where remained low and rather consistent compared to March (Table 2) . From March to April, MD, AZ, and FL are the top three states with the most increase of , respectively by 0.18, 0.17, and 0.16 ( Table 2 ). The significant boost of mobility-based responsiveness reflects not only the severity of the situation but also the strong implementation of the mitigation measures. However, with the lifting of orders, 45 states (except MT, NH, and WA) have shown reduced responsiveness in May compared to April (Fig 8) . In light of the increasing number of cases in the U.S with no sign of slowing down (at the time of writing), the reduced mobility responsiveness can potentially foster a second wave of infections. Because of the insufficient samples in the baseline calculation, the for VT is not presented in Fig 8 and Table 2 . The rise of social media platforms in recent years offers a potential solution to closely monitoring human mobility dynamics, given their real-time high-volume user-generated content. Public health crises like the COVID-19 pandemic uniquely highlight several merits of social media data. First, social media data are a more harmonized source compared to cellphone records from certain providers that differ geographically. Twitter, for example, has 330 million monthly active users and 500 million daily posts worldwide [55] . Its popularity allows it to serve as a valuable venue where derived mobility dynamics can be cross-compared in different regions, especially for a global epidemic event like the COVID-19. Second, social media data offer both immediacy and spatially explicit geo-information that traditional approaches like surveys and censuses are often not capable of. The rapid spread of the SARS-CoV-2 virus and the everchanging mitigation policies greatly magnify the merit of timeliness in real-time crowdsourcing platforms, including social media. Third, social media data are relatively less privacy-concerning compared to passive data-collecting approaches that include phone calls, cellular records, and smart cards. The privacy issues in the above passive methods preclude the analysis in a more spatial explicit manner, as data collected via those methods are usually deidentified and aggregated before application. Finally, social media data are easily accessible and cost-efficient. Despite the required computational resources and storage that are essential to handle the large volume and velocity of Twitter data, its data acquisition (via Twitter API) is totally free of charge. In these respects, geo-located tweets can and should be considered as a valuable proxy for human mobility, especially during times of crisis (like the COVID-19 pandemic we are facing) that usually cause dramatic mobility changes. The results of this study should be interpreted in light of several important limitations. First, the representativeness of Twitter data may not reflect the characteristics of the population as a whole in terms of socioeconomic status, age, gender, or race. Furthermore, the representativeness may vary geographically. Despite the attempts to improve the understanding of the demographics of Twitter users via profile scrutiny and tweets mining [27, 56] , the intrinsic biases in Twitter samples should be considered when the results of this study are interpreted. The problem of representativeness, however, exists in all digital services. Mobility patterns derived from phone calls and cell phone applications (e.g., Google Maps and Apple Maps) also have to face the criticisms that people left behind by the "Digital Divide" [57] are significantly underrepresented. Second, the Twitter API allows unrestricted access to only about 1% of the total records [58] . From the tweets that streamed down via the Twitter API, we only use tweets that are geotagged with spatial resolution lower than the city level. Despite the "Big Data" nature of Twitter as a data source, the available records that can be used to derive human mobility patterns are still insufficient in some regions at a temporal resolution of daily. Mobility time series computed from insufficient samples tend to have more fluctuations, making the general pattern less recognizable and less reliable. In this respect, the mobility dynamics identified in the study only account for the reaction of Twitter in response to the COVID-19 pandemic and should not be generalized to infer the mobility of the total population without caution. Third, the less privacy-concerning nature of social media data also creates many challenges. Unlikely the passively collected data from mobile telephone records, smart cards, and wireless networks, social media data own intrinsic active nature, as users must grant permission to share their data and determine the locational accuracy of their posts, all depending on their personal settings. Thus, the two types of distances proposed in this study, single-day distance and cross-day distance, only reflect the travel behaviors that users are willing to share. This active nature protects privacy to some degree, at the same time however, dilutes the total amount of available trajectory data both spatially and temporally, potentially causing skewness in the extracted origin-destination information. Finally, our mobility baseline, where mobility patterns from other periods are compared against, is derived from a one-month period that starts from mid-January. We further compute baselines for each corresponding day of a week by recognizing a week as an independent mobility cycle, however, without considering the monthly discrepancies that mobility patterns may present. Studies have shown that mobility may vary regularly on a monthly basis [59, 60] , and the variations differ geographically due to the different cultural and societal settings. The uncertainty, resulting from the short baseline period that specifically covers late-January and February, needs to be acknowledged. Despite these limitations, we believe the strengths and valuable findings in this study outweigh the shortcomings. However, several lines of future studies are still in need. First, future work should investigate the representativeness of Twitter data by delving into the demographics of Twitter users. The mobility patterns documented in this study only reflect Twitter users' collective activities responding to the COVID-19 pandemic. Thus, the representativeness of the findings largely depends on the demographics of the local users in relation to the demographics of the local population. Another research direction is to examine the similarity and dissimilarity in mobility patterns derived from various sources (social media, phone calls, cellular records, smart cards, etc.). Although the choice of data source for studying human mobility is event dependent, a comparison study focusing on events with dramatic mobility changes leads to an overall picture and the strengths and pitfalls of each data source. The third line of research is to explore the potential in the integration of mobility indices from heterogeneous data sources. An integrated mobility index from multiple sources is expected to better reflect the multifaceted nature of human mobility, thus greatly facilitating a comprehensive mobility monitoring. Fourth, research on more extensive Twitter datasets is needed to investigate the possible improvement in mobility that can be captured. Although the hundreds of millions spatially explicit Twitter posts collected in this study are sufficient to quantitatively reflect the human mobility dynamics during the pandemic, an increasing amount of tweets are expected to generate more stable and reliable trends with fewer random fluctuations. Despite the fact that the licenses for other sample sizes, such as Gecahose (returning 10% of the public data) and Firehose (returning 100% of the public data), are costly, difficult to obtain, and requiring a demanding computational environment [61] , their potential in obtaining reliable mobility dynamics at much finer spatiotemporal resolutions deserves attention. Finally, as human population movement is among the critical dimension that drives the spatial spread of COVID-19, how to leverage such twitter derived mobility information for better predicting future infectious risk of a state, county, or community warrants investigation. As the whole world is now fighting the COVID-19 pandemic, the effectiveness of mobilityreducing measures (e.g., social/physical distancing) at varying scales needs rapid investigation. This article examines the reaction in social media, specifically Twitter, spatially and temporally in response to the COVID-19 pandemic as a more harmonized, less privacy-concerning, and cost-efficient approach to assessing human mobility dynamics promptly. Through analyzing around 587 million tweets worldwide, we present how our collaborative efforts in mobility reduction are reflected from this user-generated information in three different geographic scales: global scale, country scale, and U.S. state scale. To quantify various aspects of mobility from Twitter, we propose two types of distance, i.e., the single-day distance that highlights daily travel behavior and the cross-day distance that highlights the displacement between two consecutive days. To facilitate the comparison with normal situations, we further normalize these distances by separately setting up their baselines for each corresponding day of a week. We also propose a mobility-based responsive index ( ) to capture the overall degree of mobility-related responsiveness of particular geographic regions in response to the COVID-19 pandemic. The results suggest that mobility patterns obtained from Twitter data are amendable to quantitatively reflect the mobility dynamics in COVID-19 pandemic at various geographic scales. Globally, the proposed two distances measured from Twitter had greatly deviated from their baselines after March 11, 2020, when WHO declared COVID-19 as a pandemic. The considerably less periodicity after the declaration suggests that the protection measures have obviously affected people's weekly routines. The global reveals less responsiveness in May compared with April. At the country scale, the country-level discrepancies in responsiveness are obvious, evidenced by the contrasting mobility patterns in different epidemic phases. We further find that the triggers of mobility changes correspond well with the announcements of mitigation measures, which in return proves that Twitter-based mobility, to some degree, implies the effectiveness of those measures. At the U.S. state scale, the influence of the COVID-19 pandemic on mobility is distinct, as the drop of mobility in most of the states happened in mid-March following the National Emergency declaration on March 13. However, the impacts varied substantially among states. Heavily hit states generally experienced sharp mobility reduction while states with low numbers of cases exhibited relatively marginal mobility reduction. With orders gradually being lifted since late-April, 45 states (except MT, NH, and WA) have shown reduced responsiveness in May compared to April. The existing mobility recovering momentum is further fueled by the Black Lives Matter protests, potentially fostering the second wave of infections in the U.S. The methodological knowledge and contextual findings in this study seed future applications of the easily accessible, less privacy-concerning, highly spatiotemporal Twitter data in monitoring multi-scale mobility dynamics during disaster events. Funding: The research is supported by NSF (2028791) and University of South Carolina ASPIRE program (135400-20-54176). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript. COVID-19) Situation Reports -155 [Internet]. World Health Organization. World Health Organization COVID-19) -events as they happen [Internet]. World Health Organization. World Health Organization US COVID-19 cases surge past 82,000, highest total in world Mapping county-level mobility pattern changes in the United States in response to COVID-19. SIGSPATIAL Special The effect of human mobility and control measures on the COVID-19 epidemic in China Transmission potential and severity of COVID-19 in South Korea A nationwide survey of psychological distress among Chinese people in the COVID-19 epidemic: implications and policy recommendations. General psychiatry Effective containment explains subexponential growth in recent confirmed COVID-19 cases in China Coronavirus disease 2019 (COVID-19) in Italy Covid-19: UK starts social distancing after new model points to 260 000 potential deaths Mobile phone data and COVID-19: Missing an opportunity Staying at home: mobility effects of covid-19. Available at SSRN Aggregated mobility data could help fight COVID-19 What is Web 2.0: Design patterns and business models for the next generation of software. Communications & strategies Big data: the management revolution. Harvard business review Citizens as sensors: the world of volunteered geography European Commission tells carriers to hand over mobile data in coronavirus fight Using mobile phone data to predict the spatial spread of cholera. Scientific reports Impact of human mobility on the emergence of dengue epidemics in Pakistan Quantifying the impact of human mobility on malaria COVID-19 outbreak response: a first assessment of mobility changes in Italy following national lockdown. medRxiv Governments around the world are increasingly using location data to manage the coronavirus Would You Sacrifice Your Privacy to Get Out of Quarantine? Information technology-based tracing strategy in response to COVID-19 in South Korea-privacy controversies Understanding urban human activity and mobility patterns using large-scale location-based data from online social media How do social media, mobility, analytics and cloud computing impact nonprofit organizations? A pluralistic study of information and communication technologies in Indian context. Information Technology for Development Social network, activity space, sentiment, and evacuation: what can social media tell us Identifying disaster related social media for rapid response: a visual-textual fused CNN architecture Reconstructing flood inundation probability by enhancing near real-time imagery with real-time gauges and tweets Using geotagged tweets to track population movements to and from Puerto Rico after Hurricane Maria. Population and Environment Social and geographical disparities in Twitter use during Hurricane Harvey Leveraging Twitter to gauge evacuation compliance: Spatiotemporal analysis of Hurricane Matthew. PLoS one Exploring the role of social media and individual behaviors in flood evacuation processes: An agent-based modeling approach. Water Resources Research Predicting human mobility through the assimilation of social media traces into mobility models Delineating and modeling activity space using geotagged social media data. Cartography and Geographic Information Science Exploring the uncertainty of activity zone detection using digital footprints with multi-scaled DBSCAN Tweets on the road. PloS one How social media will change public health Social media in public health Bridging twitter and survey data for evacuation assessment of Hurricane Matthew and Hurricane Irma. Natural hazards review Evaluating mobility pattern space routing for DTNs The effect of travel restrictions on the spread of the 2019 novel coronavirus (COVID-19) outbreak. Science Spread and dynamics of the COVID-19 epidemic in Italy: Effects of emergency containment measures Beware of the second wave of COVID-19. The Lancet Global Statshot -DataReportal -Global Digital Insights DataReportal -Global Digital Insights Prediction of the epidemic peak of coronavirus disease in Japan Japan to declare nationwide state of emergency as virus spreads The Japan Times Japan ends its COVID-19 state of emergency Opening Up America Again The United States Government Movement control order not a lockdown, says former health minister: Malay Mail Malay Mail COVID-19) Situation Reports -43 [Internet]. World Health Organization. World Health Organization Saudi Arabia suspending domestic flights, mass land transport in fight against COVID-19 The global community needs to swiftly ramp up the response to contain COVID-19. The Lancet See Which States and Cities Have Told Residents to Stay at Home [Internet]. The New York Times. The New York Times 10 Twitter Statistics Every Marketer Should Know in 2020 Understanding the demographics of Twitter users The digital production gap: The digital divide and Web 2.0 collide Geo-located Twitter as proxy for global mobility patterns. Cartography and Geographic Information Science Understanding individual human mobility patterns. nature Understanding monthly variability in human activity spaces: A twelve-month study using mobile phone call detail records. Transportation Research Part C: Emerging Technologies Mining twitter data with resource constraints