key: cord-0063944-nnhihum4 authors: Goel, Rajeev K.; Nelson, Michael A. title: COVID-19 internet vaccination information and vaccine administration: evidence from the United States date: 2021-06-03 journal: J Econ Finan DOI: 10.1007/s12197-021-09551-x sha: 11eab9687894d34c5dccdaa66f97674fed3d72af doc_id: 63944 cord_uid: nnhihum4 This paper examines the impact of internet information on COVID-19 vaccination rates across U.S. states. Does greater information on the internet about COVID-19 vaccine access increase the number of vaccines administered? Does greater information about COVID-19 vaccine reliability facilitate vaccine administration? To gauge the COVID-19 related information on the world-wide-web we created two cross-state datasets using Google search. One search dealt with searching for information on vaccination availability and scheduling in each state, while the other search involved information on vaccine reliability and its side effects. The estimation results showed that greater availability of the relevant information on the internet increased vaccine administration rates, and this was true for both types of internet searches, resulting in an affirmative answer to the questions above. In contrast, the diffusion of internet access and the digital divide across states did not have a significant impact on vaccination rates. Whereas supply chain issues have garnered most of the attention in terms of the limitations of quickly vaccinating the public, our results show that internet information might be acting as a (largely unheralded) enabler. The fight against the COVID-19 pandemic has recently seen a glimmer of hope with the arrival and regulatory approval of a number of vaccines around the world. Many nations, especially wealthy nations, have started vaccinating their populations, using various guidelines to prioritize access to this scarce resource (Goel and Nelson 2021; Persad et al. 2020) . Even within nations, such as states across the United States, regulations and vaccination delivery to populations vary. A common reason cited has been a lack of supply chain and administration logistics, coupled with the unique characteristics of some vaccines (e.g., short shelf life, transportation at certain temperatures, etc.) that has challenged the pace of vaccine delivery (Baldwin and Weder di Mauro 2020; Farzanegan et al. 2020; Goel et al. 2020 ). Yet, there are two sides to effective delivery and the demandside bottlenecks of vaccine delivery have been ignored. There have only been anecdotal pieces of evidence of eligible recipients not showing up for their vaccine appointments, leading to waste or arbitrary (out of turn) vaccinations. The issue of lagging vaccine administration has profound implications for controlling the pandemic, saving lives, and return on public investments (since COVID-19 vaccine development was paid for by taxpayer dollars in the United States). 1 This paper tries to address the shortcomings on demand-side vaccination bottlenecks by examining the role of the internet. In general, the internet has been a game-changer in many markets, impacting the demand and supply not only of information but also related transaction costs (see Goel and Hsieh 2002) . The internet is being used as a key, often primary, means to provide information about the coronavirus, vaccine availability and safety information, and information about the scheduling and delivery (the last mile of the vaccination process). Two unique questions addressed in this research are: • Does greater information on the internet about COVID-19 vaccine access increase the number of vaccines administered? • Does greater information about COVID-19 vaccine reliability facilitate vaccine administration? To address these questions, we use internet search results (total number of search hits) on Google across U.S. states for a recent month (February 2021), with the following alternative keywords: (a) search A (vaccine access search): "state name COVID-19 vaccine coronavirus appointment schedule location provider", with "COVID-19 vaccine" being the exact phrase in the search (b) search B (vaccine reliability search): "state name COVID-19 coronavirus adverse health effects vaccine side illness", with "COVID-19" being the exact phrase in the search. These searches capture the relevant information available on the internet regarding the availability and reliability of coronavirus vaccines. Since Google's search algorithm lists the sites most accessed first, 2 there is some inherent quality distinction in the searches (also see appendices A and B). These internet searches yielded information by government agencies, public health groups, international organizations, industry groups (health and insurance providers), special interest groups, etc. Appendices A and B show the top 25 hits for searches A and B, respectively. Although the searches yielded hundreds of thousands of hits (see Table 1 ), the samples of top search hits in both cases show useful and pertinent information that would empower potential consumers and seekers of vaccines. Since internet search results can change rapidly, all the searchers for a set of keywords were done in a single sitting, i.e., 2-3 hour window with no breaks in the last week of February 2021 (Search B) and the first week of March 2021 (Search A). With some similarities in the names of some states, a couple of adjustments were made to avoid double-counting. These included: (i) in the case of the state of Virginia, taking out the word "West" to exclude sites with West Virginia; and (ii) in the case of the state of Washington, excluding the words "DC" and "District of Columbia" to avoid overlap with Washington, DC. In spite of these refinements and the case about the timing of searches, internet searching remains imperfect, especially for the general public and non-computer specialists. For example, there could be mirror sites where the same information is displayed on multiple sites (hosted by the same or different organizations), fake sites selling dubious products with text/title relevant to coronavirus to generate hits/traffic, etc. Nevertheless, the application of internet searches to gauge information/awareness used in this paper is novel in the context of coronavirus vaccines. This methodology, however, has been successfully used in other contexts (Goel et al. 2012; Goel and Nelson 2014) . One of the first applications in economics was by Goel et al. (2012) to measure cross-country corruption awareness. Thankfully, the language disparities experienced across different nations are less of an issue when searching across states in the United States (also see Section 3.3 for a consideration of the language aspect). 3 The estimation results showed that greater availability of the relevant information on the internet increased vaccine administration rates, and this was true for both types of internet searches. In contrast, the diffusion of internet access and the digital divide across states did not have a significant impact on vaccination rates. The structure of the rest of the paper includes the model and data in the next section, followed by results and conclusions. 2 Empirical model and data Based on the above discussion, we formulate our main hypothesis that we will test formally using an econometric model that follows: Hypothesis: Greater internet information on vaccine access and reliability will facilitate COVID-19 vaccine administration, ceteris paribus. As discussed above, greater information on the internet about the availability, reliability, and scheduling of coronavirus vaccines would lower transactions costs, mitigate perceived risks and lead to more vaccines being administered. The general form of the estimated equation is the following (with the unit of observation being a state in the United States): The dependent variable is the COVID-19 vaccines administered in a state in the United States as of February 27, 2021. As of that date, the average VACadmin was 23,310 doses per 100,000 population across the 50 states. These could be the first doses of the Pfizer or the Moderna vaccine. 4 States have different rules for prioritizing vaccinations and some of the doses administered might be to non-residents. For example, there are reports of vaccine tourism from Florida. Although the related numbers are changing every week (day), there remains considerable vaccine administration variation across states, even in per capita terms -the range being 18,606 to 35,478. The novel angle here is the consideration of the internet information available across two qualitatively different dimensions (INTERNETsearchA and INTER-NETsearchB) using Google (see Section 1 above). Greater information about vaccine availability and safety is likely to facilitate vaccine administration. This might involve more residents seeking appointments either through greater confidence in vaccines or lower transaction costs of acquiring information and of scheduling vaccinations. The signs and statistical significance of the estimated coefficients on INTERNETsearchA and INTERNETsearchB will test the validity of the hypothesis posed above. (1) The correlation between INTERNETsearchA and INTERNETsearchB in our sample was 0.30. Furthermore, the correlation between VACadmin and INTERNETsearchA was 0.38, and that between VACadmin and INTERNETsearchB was 0.69. Digital divide (DigDivide), households without computers (NOcomputer), and internet diffusion (INTERNETdiff) are included to account for the ability to access (and sometimes produce) information on the internet. Even within a prosperous nation like the United States, there is variation across states in the ability of residents to access the internet, although some of these differences are being reduced with the prevalence of smartphones. The vector Z includes a set of state-level social, economic, political, and geographic factors that are likely to impact the administration of COVID-19 vaccines. GDPpc captures state economic prosperity -wealthier states, ceteris paribus, would have better infrastructures to administer vaccines, whereas state size (StSIZE), measured via state population, could have the opposite effect. State size (StSIZE) accounts for scaling and logistical challenges with efficiently vaccinating the populations in larger, more populous states -https:// www. yesma gazine. org/ health-happi ness/ 2021/ 02/ 17/ covid-vacci ne-rollo ut-united-states/. RURAL and ELDERLY capture the social dimensions associated with the challenges of vaccine administration. States with a larger percentage of the rural population would likely face greater challenges providing information and administration of vaccines to the rural population. Most states have been following federal Centers for Disease Control and Prevention (CDC) guidelines and prioritizing senior citizens to get the vaccines, and given the timing of our administration data, states with a greater share of the elderly would show greater vaccine administrations, ceteris paribus. 5 Two additional social dimensions that we considered dealt with race (RACE) and English language communication skills (LANGUAGE) as they can also impact propensities to seek vaccinations. The political ideology of state governors might matter in a state's approach toward combating the pandemic. Accordingly, we include POLideology to identify states with a Democratic governor (see Potrafke (2018) for a review of the literature on the role of political ideology in economics research). 6 Finally, geographic aspects are considered by including a dummy variable, For-BORDER, to identify states bordering Canada or Mexico. 7 The logic is that, other things being equal, residents in these states might either have different attitudes towards vaccinations or such states might have a larger share of casual foreign visitors, even during the pandemic travel restrictions. Next, the data used to estimate Eq. (1) is discussed. Data on the number of COVID-19 vaccine doses administered are taken from the Centers for Disease Control and Prevention (CDC) COVID Tracker website. 8 State-level tracking began on December 14, 2020, about the time initial vaccine shipments began to roll out to the states and are updated daily based on data reported by the individual states. At the time of writing, the most recent data on state vaccination administration was for February 27, 2021, and that is what is used in this analysis. By that date, most states were still at the Phase 1b priority group for vaccine distribution, with those that followed the CDC Advisory Committee on Immunization Practices (ACIP) recommendations now vaccinating persons 75 years and older. 9 Thirteen states had even progressed to Priority group 1c. All states included some seniors in the eligibility group, although the minimum age varied from 65 years old to 75 depending upon the state. 10 As of the data considered in this analysis, the number of total doses administered per 100,000 population in a state ranged from a low of 18,606 in Texas to a high of 35,478 in Alaska, with a mean value of 23,320 doses across all the states. The two internet search variables used in this analysis (INTERNETsear-chA and INTERNETsearchB) were calculated by the authors using the process described in the introduction of this paper. In both cases, total search results for each state were normalized by state population (measured in thousands). Results showed a large variation across the 50 states for each measure. This variation is shown in Fig. 1 (vaccine availability) and Fig. 2 (vaccine safety) where the normalized search results for the ten highest states and the bottom ten states are depicted along with the sample mean for all 50 states. While there is some consistency in the names of the top (bottom) states depicted in the two figures, some important differences stand out (e.g., Virginia -the state at the bottom in Fig. 1 is not even among the bottom ten in Fig. 2) . Overall, the correlation between these two search measures is only 0.29 in our sample, indicating that there are some important differences between these two in terms of the information conveyed. Internet access (INTERNETdiff) is measured by the percentage of residents in a state that have access to the internet in 2018, the most recent year with available data. To gauge the significance of the digital divide across the states (DigDivide) we use the number of public-access (library) internet computers per 5,000 per state residents in 2017, the most recent year available. The correlation coefficient between these two measures of internet access in our data set is -0.46. Data for the control variables used in this analysis are drawn from reliable sources regularly used in literature. Further details on all on the other variables used in the analysis, descriptive statistics, and data sources can be found in Table 1 . The results section follows. 8 https:// covid. cdc. gov/ covid-data-track er/# vacci natio ns 9 https:// www. cdc. gov/ vacci nes/ hcp/ acip-recs/ vacc-speci fic/ covid-19. html 10 https:// www. kff. org/ other/ state-indic ator/ state-covid-19-vacci ne-prior ity-popul ations/ We estimate two sets of models representing equation (1), one set including the internet search information variables, and a second set including both internet search and internet diffusion/digital divide measures. Results are summarized in Tables 2 and 3, respectively. Models 2.1 through 2.3 in Table 2 report the results when the vaccine availability [INTERNETsearchA] is used as a right-hand-side regressor. The models vary only in terms of whether foreign border (ForBORDER) or Democratic governor (POLideology) is included as an additional control variable in the regression setup. In each of the three models, there is strong statistical support for the proposition that information on vaccine availability, as measured by our internet search variable, positively affects the number of doses of COVID-19 state vaccinations, other things being equal. In each model, the parameter estimate on the internet search variable is statistically significant at better than the 5 percent level of confidence. Greater vaccine availability and scheduling information on the internet lowers the transaction or information costs for the public and this improves vaccination rates. This happens by, for example, greater number of vaccine recipients showing up (and showing up promptly) for vaccine appointments, and fewer no-shows (that are especially important in this context as the vaccines have a short shelf life). It could also be the case that the relevant internet information provides useful signals to suppliers and improves supply chain performance, yielding greater vaccination rates. The parameter estimates on the INTERNETsearchA variable are quite consistent across the three models estimated. This finding supports our main hypothesis. In terms of the actual impact of the search variable on vaccine administration, the parameter estimate on INTERNETsearchA in Model 2.1 (8.07) implies that if internet searches for a state were to increase by one standard deviation based on our data set (144.6 -see Table 1 ) this would lead to an increase of approximately 1,200 doses administered. That figure represents approximately one-third of the sample standard deviation of state vaccines administered in our data set. Of all the control variables considered in these models, only state size (StSIZE), as measured by its population, is statistically significant at conventional levels. This result indicates that smaller states seem to be more adept at vaccinating their populations. Based on the parameter estimates on the state size variable in Models 2.1-2.3, the results suggest that a one standard deviation in state population -as defined in this analysis -will translate into approximately 2,000 fewer vaccinations per 100,000 state population. In other results, states with a greater proportion of the elderly or rural populations, and states bordering Canada and Mexico were no different from other states in terms of vaccination rates. This insignificance also holds for GDPpc and POLideology in Models 2.1-2.3. 11 Turning to vaccine reliability, Models 2.4 through 2.6 in Table 2 report the results when vaccine safety [INTERNETsearchB] is used as a right-hand-side regressor. Similar to the vaccine availability search variable, the parameter estimate on this variable is positive and statistically significant across all three models where it is used. Again, these results lend support to our main hypothesis. INTERNETsearchB is qualitatively different from INTERNETsearchA as it addresses inherent risk aversion. Greater information about the vaccine reliability enables more people to go out and seek vaccines, ceteris paribus. Moreover, the actual impact of this variable appears to be greater than it is for the vaccine availability measure. For example, the parameter estimate on Table 2 Vaccine administration: Internet information on COVID-19 vaccine access and reliability Variable definitions are provided in Table 1 . All models are estimated via ordinary least squares and include a constant term (not reported). The numbers in parentheses are robust (absolute value) t-statistics * denotes statistical significance at the 10% level; **denotes significance at the 5% level (or better) INTERNETsearchB in Model 2.4 (404.49) implies that if internet safety searches for a state were to increase by one standard deviation based on our data set (4.31 -see Table 1 ) this would lead to an increase of approximately 1,700 doses administered. This represents approximately one-half of the sample standard deviation of the VACadmin variable. Similar results were obtained using the parameter estimates on the search variable in Models 2.5 and 2.6. In terms of the control variables used in Models 2.4-2.6, relative to the first three models considered in Table 2 , there is somewhat less statistical support for the proposition that state size (population) is an important driver of the number of vaccines administered. However, there is more support for the proposition that state economic prosperity (GDPpc) enhances a state's record in this regard. Again, ELDERLY and RURAL failed to attain statistical significance. Finally, states with a Democratic governor (Model 2.6) appear to have a better record when it comes to the vaccine rollout, at least for the dates considered in this analysis. 12 Overall, all six models summarized in Table 2 perform well, when judged by R-square and consistently statistically significant F-statistic reported towards the bottom of the table. Next, we consider an extension by addressing dimensions of the public's ability to access the internet. Table 3 takes account of the diffusion of the internet (via the population with internet access (INTERNETdiff)), and the digital divide (proxied by the public access public library internet computers per 5,000 population (DigDivide)). These variables can be seen as capturing the demand side of internet information, although some of the folks with internet access would also be producers of internet information (by, for example, generating/maintaining/authoring websites). In our sample, more than 82 percent of residents in the US had internet access, and there were about 6 public access internet computers on average per public library. The correlation between INTERNETdiff and DigDivide was -0.47; consistent with the intuition that greater internet diffusion reduced the need for public access computers. When INTERNETdiff and DigDivide were alternately added to variants of baseline models from Table 2 , the results in Table 3 show that the signs on the two variables differ: the coefficient on INTERNETdiff is positive, while that on DigDivide is negative. However, in both cases, the relevant coefficients failed to attain statistical significance. The results with regard to the positive and statistically significant effects of the two internet search variables (INTERNETsearchA and INTERNETsearchB) reinforced earlier findings. Thus, it is not the access to the internet, per se, but the quantity and quality of the relevant information available on the internet that facilitates the administration of the COVID-19 vaccines. The other results support what was reported earlier -larger states face vaccine administration challenges, while more prosperous states had vaccine administration advantages, especially when INTERNETsearchB was considered (Models 3.2 and 3.5). Along another related dimension of the digital divide, we considered the percent of state households without a computer (NOcomputer). The corresponding results, in Models 3.3 and 3.6 in Table 3 , showed that NOcomputer lacked statistical significance. In the following section, we consider some additional dimensions that might impact vaccinations. Table 3 Vaccine administration: Internet information on COVID-19, internet diffusion and the digital divide Variable definitions are provided in Table 1 . All models are estimated via ordinary least squares and include a constant term (not reported). The numbers in parentheses are robust (absolute value) t-statistics * denotes statistical significance at the 10% level; **denotes significance at the 5% level (or better) The demand for vaccinations might be influenced by other social and insurance factors. Accordingly, we consider three dimensions: (a) RACE -the percent of population in a state that is white; (b) LANGUAGE -percent of the population that speaks a language other than English at home; and (c) NOinsurance -percent of state population without health insurance. The race could affect attitudes towards vaccinations, while language captures the ability of pro-and anti-vaccination campaigns to reach certain populations. Furthermore, while in general a lack of health insurance would adversely affect vaccinations, COVID-19 vaccines are being dispensed free of charge in the United States because vaccine development was heavily underwritten by taxpayer dollars. Adding these three factors, respectively, as regressors to baseline models from Table 2 , we report these results in Table 6 in the Appendix C. The results show that the coefficients on RACE and NOinsurance were statistically insignificant in all cases, while LANGUAGE is marginally significant at the 10% level in Model A.2, but insignificant in Model A.5. More importantly, the main internet control variables, INTERNETsearchA and INTERNETsearchB, remain positive and statistically significant in all cases. This lends further support to the main findings of the paper. The concluding section follows. The global fight against the coronavirus appears to be in the final stages in the US (notwithstanding the recent virus variants), with the availability of several vaccines. However, the speed of the vaccination rollout drives has been a source of frustration with the public and policymakers, with supply-chain issues (especially given the peculiar storage and transportation requirements of some vaccines) frequently cited as a key underlying constraint (also see, Goel et al. 2020 Kaur and Gupta 2020) . The role of potentially mitigating forces, such as the internet, has not been formally considered. The internet can lower information and transaction costs resulting in lower vaccine risk perceptions and greater/more-efficient scheduling of vaccine appointments. To address formally the role of the internet in the vaccination process, this paper uniquely examines the impact of internet information on COVID-19 vaccination rates across U.S. states. Despite the recent nature of the coronavirus, some formal research has emerged focusing on different aspects of the pandemic (see Alfano and Ercolano 2020; Asongu et al. 2020; Bayram and Shields 2021; Farzanegan et al. 2020; Goel and Nelson 2021; Price and van Holm 2021; Yum 2020 ; Baldwin and Weder di Mauro 2020) for a broader review). However, the role of the internet, in terms of the information related to COVID-19 vaccines available on websites, considered in this paper is new. To gauge the COVID-19 related information on the world-wide-web we created two cross-state datasets using Google search. One search dealt with searching for information on vaccination availability and scheduling in each state (INTER-NETsearchA) , while the other search involved information on vaccine reliability and its side effects (INTERNETsearchB). Some refinements were performed to sanitize the search results from unnecessary noise and avoid double-counting. The estimation results showed that greater availability of the relevant information on the internet increased vaccine administration rates, and this was true for both types of internet searches. This answers the two questions posed in the Introduction and is consistent with the notion of lower transaction costs via easier access to the related health information on the internet and greater ease of scheduling vaccination appointments. In terms of relative magnitudes, the elasticity of vaccine administrations with respect to internet vaccine reliability information (INTERNETsearchB) was almost three times as large as that with respect to internet vaccine scheduling and availability information (INTERNETsear-chA), specifically, Model 2.1: ε VACadmin,INTERNETsearchA = 0.03; and Model 2.4: ε VACadmin,INTERNETsearchB = 0.08. In other words, the risk-reduction role of internet searches yield greater vaccination returns compared to its scheduling role, although both are positive. The relatively modest size of the two elasticities makes sense when one thinks about the large magnitude of internet hits. Plus, effectively consumers and potential consumers would likely be looking at the top search hits most of the time and their responses are less likely to be driven by how search results are populated at the lower end of the search distribution (See appendices A and B). In contrast, the diffusion of internet access and the digital divide across states did not have a significant impact on vaccination rates. Thus, it is not the access to the internet per se but the information available on the internet that enables vaccinations. The findings about the efficacy of internet information can be seen as complementary to the literature that focuses on the effectiveness of digital tracking in controlling the spread of the virus (see Abdou 2021) . Whereas supply chain issues have garnered most of the attention in terms of the limitations of quickly vaccinating the public, our results show that internet information might be acting as a (largely unheralded) enabler. Greater efforts by policymakers, either directly or by subsidizing related websites would assist in vaccinations and ultimately in controlling the virus. The results are also instructive for industry groups like health and liability insurance companies since internet outreach seems effective in facilitating vaccinations and ultimately reducing insurance costs. (See Table 4 ) (See Table 5 ) Good governance and COVID-19: The digital bureaucracy to response the pandemic (Singapore as a model) The efficacy of lockdown against COVID-19: A cross-country panel analysis. Appl Health Econ Health Policy The geography of the effectiveness and consequences of Covid-19 measures: Global evidence Centre for Economic Policy Research, London Bayram AB, Shields, T (2021) Who trusts the WHO? Heuristics and Americans' trust in the World Health Organization during the COVID-19 pandemic Globalization and outbreak of COVID-19: An empirical analysis, CESifo working paper #8315 Internet growth and economic theory Identifying the corrupt cog in the wheel: Dimensions of supply chain logistics and cross-country corruption Whistleblower laws and exposed corruption in the United States Drivers of COVID-19 vaccinations: Vaccine delivery and delivery efficiency in the United States The internet as an indicator of corruption awareness Supply chain performance and economic growth: The impact of COVID-19 disruptions COVID-19 Vaccine: A comprehensive status report Fairly prioritizing groups for access to COVID-19 vaccines Government ideology and economic policy-making in the United States-A survey The effect of social distancing on the early spread of the novel coronavirus Social network analysis for coronavirus (COVID-19) in the United States Publisher's Note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations