key: cord-0075980-bwal4ile authors: Cheng, Chaoyuan; Holyoak, Marcel; Xu, Lei; Li, Jing; Liu, Wenjun; Stenseth, Nils Chr.; Zhang, Zhibin title: Host and geographic barriers shape the competition, coexistence, and extinction patterns of influenza A (H1N1) viruses date: 2022-03-21 journal: Ecol Evol DOI: 10.1002/ece3.8732 sha: dc93ef40e97329b8e8fdc3b3438ae644e5acf72f doc_id: 75980 cord_uid: bwal4ile The influenza virus mutates and spreads rapidly, making it suitable for studying evolutionary and ecological processes. The ecological factors and processes by which different lineages of influenza compete or coexist within hosts through time and across geographical space are poorly known. We hypothesized that competition would be stronger for influenza viruses infecting the same host compared to different hosts (the Host Barrier Hypothesis), and for those with a higher cross‐region transmission intensity (the Geographic Barrier Hypothesis). Using available sequences of the influenza A (H1N1) virus in GenBank, we identified six lineages, twelve clades, and several replacement events. We found that human‐hosted lineages had a higher cross‐region transmission intensity than swine‐hosted lineages. Co‐occurrence probabilities of lineages infecting the same host were lower than those infecting different hosts, and human‐hosted lineages had lower co‐occurrence probabilities and genetic diversity than swine‐hosted lineages. These results show that H1N1 lineages infecting the same host or with high cross‐region transmission rates experienced stronger competition and extinction pressures than those infecting different hosts or with low cross‐region transmission. Our study highlights how host and geographic barriers shape the competition, extinction, and coexistence patterns of H1N1 lineages and clades. Influenza A virus (IAV) is one of the most common pathogens worldwide and has caused massive damage to poultry production and human health for centuries (Morens & Taubenberger, 2011; Saunders-Hastings & Krewski, 2016) . The 1918 influenza pandemic (i.e., Spanish flu) and 2009 swine flu pandemic (pdm09) were caused by the H1N1 virus, accounting for more than 40 million deaths (Krammer et al., 2018; . The IAV genome has eight single-stranded RNA fragments, which cause IAV to mutate rapidly (Neumann et al., 2009; Webster et al., 1992) . H1N1 is an RNA virus that mutates quickly, with an overall mutation rate of about 1.8 × 10 −4 s/n/r (substitutions per nucleotide per strand copied; Pauly et al., 2017) . As a consequence of the high mutation rate, there are many subtypes of IAV (e.g., H1N1, H3N2), based on hemagglutinin (HA) and neuraminidase (NA). The H1N1 subtype IAVs have long been circulating in humans, swine, and birds. H1N1 can be categorized according to evolutionary relationships, with a strain representing virus populations with the same genome sequence, and strains with a common ancestor representing clades and lineages representing more distant common ancestry among clades. The eight segments that make up the influenza virus genome can be recombined with segments of other strains, clades, lineages, or subtypes through a process called reassortment or genetic shifts that rapidly produce new subtypes (Krammer et al., 2018) . In recent decades, most outbreaks of influenza pandemics have resulted from new subtypes produced by reassortment (Kawaoka et al., 1989; Smith et al., 2009 ). The H1N1 virus has the longest spread history of all existing subtypes and has caused pandemics many times (Krammer et al., 2018) . Thus, it is a suitable organism for studying the evolution and extinction patterns of organisms. The H1N1 viruses mainly spread during cold seasons (Cheng, Li, et al., 2021) , and it has been recognized that the seasonal H1N1 viruses in humans are replaced quickly by new pandemic strains (Pica et al., 2012) . The prevailing view is that influenza evolution is driven by the balance between host immune responses and virus mutations. Intensive selection imposed by the host immune system drives antigenic drift in the influenza virus and most strains die off in a short time, resulting in continuous replacement of old strains with new strains (Ferguson et al., 2003; Webster et al., 1992) . The interaction of antigens within the host immune network and the spread of multiple strains could trigger cross-immunization (Recker et al., 2007; Uekermann & Sneppen, 2016) . Consequently, the emergence of novel pandemic strains would cause a cross immune response in hosts, which could cause the extinction of circulating seasonal influenza viruses (Pica et al., 2012) . Hence, competition for "antigenic space" and for "breaking the existing herd immunity" of hosts by circulating seasonal strains of influenza virus should play a crucial role in shaping viral evolution and extinction patterns (Recker et al., 2007) . However, the ecological processes and factors mediating competition, coexistence, and extinction of the influenza virus have been infrequently investigated. Hosts may act as a barrier of interspecific transmission for influenza viruses. Host diversity represents the niche breadth of the virus, which is mainly constrained by the genetic differences among host species in antigen resources, cell receptors, or immune responses (Kuiken et al., 2006; Matrosovich et al., 2004; Nelli et al., 2010) . Currently, circulating swine H1N1 virus is mainly divided into three lineages: classical swine lineage, human seasonal lineage, and Eurasian avian lineage (Anderson et al., 2016) . However, some H1N1 lineages can be transmitted between hosts such as humans, swine, and birds (Krammer et al., 2018) . The 2009 influenza A (H1N1) pandemic was caused by an emerging strain that shifted hosts from swine to humans after genome reassortment among three strains (Vijaykrishna et al., 2010) . Regions with different ecological factors are likely important drivers for the spread and evolution of H1N1 lineages. Previous studies have shown that the occurrence and spread of different subtypes or lineages will have regional differences (Anderson et al., 2016; Bedford et al., 2015; Krammer et al., 2018; Lycett et al., 2019; Su et al., 2015) . Geographical barriers can reduce the competition among different H1N1 lineages; however, frequent human transportation can reduce the geographic barrier effect and increase the competition between lineages. Human mobility is more frequent and is often facilitated by long-distance transport such as on planes or trains (Brownstein et al., 2006; Cheng, Li, et al., 2021) . In contrast, the mobility of farm animals (pigs, chickens) is often restricted to short distances and local transportation modes (e.g., many countries require that poultry are slaughtered before they are transported among provinces). Previous studies have shown that human mobility contributes to the global dynamics of H3N2 influenza viruses (Lemey et al., 2014) . Our previous study indicated that human-hosted influenza viruses (H1N1, H3N2) showed much higher spreading velocity and longer distance transmission distance than avian-hosted influenza viruses (H7N9, H5N1), which might be transmitted by poultry or wild birds (Cheng, Li, et al., 2021) . The smaller cross-region mobility of farm animals may contribute to the observed lower replacement rate of old influenza strains by novel ones in swine than humans. Species coexistence could be largely explained by classic ecological niche theory (Armstrong & McGehee, 1980; Grinnell, 1917; Holt, 2009; Hutchinson, 1957; Johnson, 1910; Pearman et al., 2008) . We used available H1N1 HA gene sequences in GenBank (https://www.ncbi.nlm.nih.gov), to analyze the evolution and extinction patterns of H1N1 from different hosts (human, swine, and birds) and different regions, and estimated the co-occurrence probability Disease ecology; Evolutionary ecology; Phylogenetics of different lineages. According to niche theory, species that share the same resources or have more overlap in space and time will compete more severely than species that use different resources or are separated. Specialized utilization of resources or geographic barriers could reduce species competition and increase the likelihood of species coexistence (Hardin, 1960; Pacala & Roughgarden, 1985) . We use the term "competition" to refer to indirect competition through mechanisms including host immunity. Thus, we hypothesize the following: (1) H1N1 lineages infecting the same host will experience higher competition pressures than lineages with different hosts, which will result in higher lineage replacement rates and lower genetic diversity (i.e., the Host Barrier Hypothesis). (2) H1N1 lineages with a higher cross-region transmission intensity will experience a higher competition pressure than those with a lower cross-region transmission intensity; this will result in higher lineage replacement rates and lower genetic diversity with greater cross-region transmission rates (i.e., the Geographic Barrier Hypothesis). In short, within any given host species, competition will prevent all lineages circulating at the same time and in the same region. In this study, lineages are defined by host and evolutionary unit (Figures 1 and 2) . Lineages infecting the same host may belong to different evolutionary units (e.g., human lineage H1 and H2 in Figures 1 and 2) , while lineages infecting multiple hosts may belong to the same evolutionary unit (e.g., the human lineage H1 and swine lineage S2 in Figures 1 and 2 belong to the same lineage). Evolutionary units were determined by phylogenetic methods (see below methods, Figure 2 ). Based on these hypotheses, we make the following predictions: (1) H1N1 lineages infecting the same host (e.g., in human lineages such as human lineage H1 and H2, or swine lineage S1, S2, and S3 in Figures 1 and 2) will have a smaller co-occurrence probability than those infecting different hosts (e.g., between human lineage and swine lineage); (2) human-hosted H1N1 lineages will have a lower co-occurrence probability and genetic diversity than swine-hosted lineages. We downloaded the sequence data of H1N1 IAV from GenBank (https://www.ncbi.nlm.nih.gov/) using a custom Python script with the "Biopython" package (https://biopy thon.org/) and extracted locations and dates for each report. We assigned the sampling location with the latitude and longitude based on the administrative center coordinates using the "GeoPy" package (Esmukov, 2020) in Python 3.6.0 (https://www.python.org/). We excluded samples that lacked a specific day of sampling or strain codes, as well as those with sequence length <1600 bp in length (to ensure the sequences were complete HA sequences, making up 29.8% of the total sequences). To reduce the impacts of sampling effort bias on the research, we only used one sample from the same place in the same month (25.9% of total sequences were removed). Finally, 6097 samples of the H1N1 virus from 1279 locations were used. For subsequent analysis at the amino acid codon level, all sequences were aligned using an H1N1 sequence (A/swine/Hong Kong/61/1977) as a template sequence and to remove all gaps and redundant bases corresponding to the template sequence to be able to convert the sequence to codons (which enables the conversion of nucleotide sequences into amino acid sequences). Sequences were aligned using MEGA7 (Kumar et al., 2016) with default parameters. We classified the cleaned sequences into three host types: human, swine, and avian. We calculated the genetic distance (GD) between each cleaned sequence and the earliest sampled sequence (A/ swine/Hong Kong/61/1977) separately. Then we plotted GD versus sampling date separately for each host type. Based on the molecular clock theory (King & Jukes, 1967; Kimura, 1968) , the mutation rate of HA can be considered constant, and six lineages were readily identified using linear relationships between GD and sampling time (genetic distance and sampling time method, or GD-time method; see Figure 1a ). According to hosts they infect, influenza A (H1N1) viruses can be classified into three categories: human, swine, and avian influenza, and transmission of influenza viruses of the same category mostly occurs among members of the same host species (Matrosovich et al., 2004; Nelli et al., 2010; van Riel et al., 2007) . Thus, we use the following criteria to identify the lineage of the samples: (1) samples with the same host; (2) samples which form a close and continuous linear cluster on the two-dimensional space by genetic distance (GD) and sampling time. Samples from the lineage defined here should belong to a unique evolutionary unit which F I G U R E 2 Phylogeny of HA genes using the combined dataset of six H1N1 lineages. (a) was generated from all identified samples, and (b) was generated from randomly selected samples (10 samples per lineage). Different colors indicate different lineages. H1, H2: human-hosted lineages. S1, S2, S3: swine-hosted lineages. A1: avian-hosted lineage was further verified by using the phylogenetic tree (see below). The GD-time method has the advantage of revealing the replacement of lineages or interspecific transmission events in time (Figure 1 ), while using the phylogenetic tree has the advantage of revealing the distinct evolutionary units ( Figure 2 ). To identify the distinct evolutionary units of H1N1 and clades of each H1N1 lineage, we performed Bayesian phylogenetic analysis in BEAST 2.6.2 (Bouckaert et al., 2019) for each H1N1 lineage we identified above. Sampling dates were used as "tip dates" (i.e., occurrence time of the viruses), and sampling location was used as a discrete trait. All analyses were performed using a coalescent model with a strict molecular clock and constant population size. In each analysis, the length of the Markov chain was set to 1 × 10 8 , with sampling every 1000 steps. The significance of the BEAST analysis was assessed in Tracer (software included with BEAST), and the effective sample size (ESS) of major parameters was >200 after removing 10% of the chain as burn-in. Maximum clade credibility (MCC) trees annotated with the discrete trait (region) were generated in TreeAnnotator (software included with BEAST). All samples of the six identified H1N1 lineages were divided into nine major regions according to sampling locations: North America and South Asia (226 samples). Due to the large size of Asia and the existence of many geographical barriers, we divided Asia into five regions consisting of the four listed above and Central Asia according to the M49 standard (https://unsta ts.un.org/unsd/metho dolog y/ m49/), but there were no samples from Central Asia. We performed a Bayesian discrete phylogenetic analysis in BEAST 1.10.4 (Drummond et al., 2012) to reconstruct the ancestral state of each node in the phylogenetic tree for the discrete trait (region) using resampled data. The trait substitution model was set to be asymmetric. We applied Bayesian stochastic search variable selection (BSSVS) to estimate the significance of pairwise transitions between regions for each analysis. We used Bayesian Factor (BF) as a measure of statistical significance (Lemey et al., 2009) , computed in SpreaD3 (Bielejec et al., 2016) . Inferences from the BSSVS analysis may be influenced by the different sample sizes in each region (De Maio et al., 2015) . Thus, to reduce the bias, we used resampled data in the BSSVS analysis. In resampled data, 10 samples per year per region per lineage were retained (if there were less than 10 samples, all samples were retained). For each H1N1 lineage, we built a network by treating regions as the nodes. Two nodes were regarded as connected if BF-values between them were greater than 3, indicating strong support for the cross-regional transmission (Lemey et al., 2009) . We calculated the degree, degree centrality, closeness centrality, and betweenness centrality of each network with the NetworkX package (Hagberg et al., 2008) in python 3.6.0. The selection pressure on a virus is closely related to its mutation rate (Frost et al., 2018) . To compare the selection pressures on different H1N1 lineages, we calculated the ratio of nonsynonymous and synonymous (d N /d S ) mutations in the data of each lineage with the SLAC (Kosakovsky Pond & Frost, 2005) method in HyPhy 2.5.14 (Kosakovsky Pond et al., 2020) and Datamonkey (Weaver et al., 2018) . To measure the extinction pressure of an H1N1 lineage indirectly, we calculated the nucleotide diversity (pi) and Tajima's D of each lineage for each year using custom python scripts in the "DendroPy" package (Sukumaran & Holder, 2010) . Larger pi and Tajima's D values mean that extinction events are rarer. Since influenza occurrence has apparent seasonality, we used the period from July 1 to June 30 of the following year to represent a year (e.g., data from July 1, 2001 to June 30, 2002 represent the year 2002) to perform the following estimated co-occurrence probability calculations. To quantify the estimated co-occurrence probability of different lineages from different hosts in the same year and region, we defined species co-occurrence probability in a time-space dimension by following our previous studies (Yan et al., 2016) : Here, C i,j is the co-occurrence probability between lineage i and lineage j in a specific year and region. We defined samples of lineages i and j as co-occurring when they were collected from the same year and same region. k represents a specific year and a specific region (year-region), Since there is only one avian-hosted lineage, we only calculated co-occurrence probability for swine and human lineages. For human lineages (H1, H2), we calculated C H1,H2 , which represents the co-occurrence probability between the human-hosted lineages. For swine lineages, because the sample size of S2 is small, we only calculated C S1,S3 , which represents the co-occurrence probability between the swine lineages. We also calculated C S,H , C S,A and C H,A , which represents the co-occurrence probability of H1N1 lineages from different hosts. To test whether two lineages tended to be exclusive or inclusive with each other in term of co-occurrence probability in the year and location, we designed a computer simulation algorithm based on stochastic process (Tijms & Tijms, 1994) to calculate the distribution of C i,j under the null hypothesis without exclusion or inclusion in cooccurrence (i.e., random co-occurrence between them). The simulation algorithm is defined as follows: Here, Sim(i, j) is the co-occurrence probability of lineage i and j from randomly simulated lineages. t (i,j) and r (i,j) is the total occurrence time and location ranges of lineage i and j. rand t (i,j) , r (i,j) represents randomly generating an attribute in the time period t (i,j) and location range r (i,j) . For each lineage, we generated the same number of simulated attributes as the sample size of the lineage and then calculated the simulated co-occurrence probability according to Equation 1. We calculated 1,000,000 simulated co-occurrence probabilities between lineages. Calculations of co-occurrence probability and simulation algorithms were implemented using custom python scripts (see Data Availability Statement section). The evolutionary patterns resulting from employing the genetic distance and sampling time (GD-time) approach are shown in Figure S1 ). The S3 and H2 lineages are continuous and overlapped in the genetic distance (GD)-time dimension. S3 appeared earlier than H2 (Figure 1a) , which is consistent with interspecific transmission from the swine (i.e., S3 lineage) to human (i.e., H2 lineage) lineage, causing the H1N1 pandemic in northern America in 2009 (Krammer et al., 2018; Neumann et al., 2009) . Similarly, H1 and S2 overlapped in the GD-time dimension but H1 appeared earlier (Figure 1a ), which indicates another interspecific transmission of H1N1 from humans (i.e., H1 lineage) to swine (i.e., S2 lineage). (2) Sim (i, j) = rand t (i,j) , r (i,j) . Relationship between H1N1 lineages we defined and the traditional lineages defined in other studies (also see Figure S1 ). "Appearance year" represents the circulating time of each lineage in this study This transmission has not been reported before. Notably, the H1 lineage disappeared immediately after the appearance of the H2 lineage, but S1, S2, and S3 have considerable overlap in time and space (Figure 1 ). Lineages from different hosts (human, birds, swine) also had considerable overlap in both time and space dimensions. These patterns are consistent with the Host Barrier Hypothesis. Phylogenetic analysis also supported the above observations about lineage identification and host associations. The A1 and S1 lineages are relatively independent, with little genetic similarity to other lineages (Figure 2a,b) . The H2 and S3 lineages have many similar strains, and the S2 lineage has many strains similar to the H1 lineage (Figure 2a,b) . These results support the observation that interspecific transmission events of H1N1 lineages occurred between swine and humans. The S1 lineage first appeared in Europe (S1.1) and later spread to Northeast Asia (S1.2) (Figures 1c and 3c) . These two clades of S1 lineage stayed in two different regions and disappeared around 2016 (Figure 3c ). The S2 lineage samples are relatively small (n = 22), first appeared and mainly occurred in North America (Figure 3d ). The S3 showed more symmetric evolution with more old clades (i.e., bushlike evolution). H1 showed prominent fast or explosive evolution (i.e., H1.1, H1.2) within a short period. Table S1 ). Although the sample size of Africa in H2 is relatively small, the resampling results suggest that Africa may play a role in the cross-regional spread of H2. For both the human-and swine-hosted lineages, the observed co-occurrence probability was far less than simulated random co-occurrence probabilities (Figure 5a-c) . This indicates that competitive exclusion reduced the co-occurrence among the lineages. However, the observed co-occurrence probability between avian-and human-or swine-hosted lineages was higher than randomly simulated co-occurrences (Figure 5d ,e). The estimated co-occurrence probabilities (0.011-0.096) within the same host (i.e., for either human-or swine-host, Figure 5a ,b) were much lower than those (0.45-0.66) for different hosts (Figure 5c-e) . Although the HA genes of all lineages were subject to adverse selection, the A1 lineage suffered a stronger adverse selection (d N /d S = 0.088; Table 3 ) compared to other lineages (d N /d S > 0.15). The nucleotide diversity (pi) of the animal-hosted lineages (A1, S1, S2, and S3) was much higher than that of the human-hosted lineages (H1, H2; Table 3 ). H1, H2, and S3 lineages were under similar selection pressure, but the genetic diversity of H1 and H2 was significantly smaller than that of S3 (Table 3) . Both pi (Figure 6a,b) and Tajima's D (Figure 6c ,d) of humanhosted H1N1 lineages are much smaller (indicating more frequent extinction) than those of swine-or avian-hosted lineages (for most cases, p < .05, .01 or .001). The avian lineage A1 had the highest diversity, and its Tajima's D was positive (Figure 6 ), indicating few extinct events, and low selection pressure. Although previous studies have suggested that competition between lineages is an important driver in shaping the ecology and evolution of the influenza viruses (Ferguson et al., 2003; Pica et al., 2012; Recker et al., 2007; Webster et al., 1992) , the ecological factors and processes mediating competition, coexistence and extinction of influenza lineages are poorly known. In this study, we found that within any given host species, competition prevented all lineages circulating at the same time in a given region. H1N1 lineages F I G U R E 4 Cross-region transmission intensity of H1 (a), H2 (b), S1 (c), S2 (d), S3 (e), and A1 (f) lineages based on resampled data. Arrows indicate significant cross-regional spread of the virus among the two connected regions. Arrow color and thickness indicate transmission intensity: blue for strong support (Bayesian Factor (BF) > 10), orange for very strong support (BF > 30), and red for decisive support (BF > 100) Very strong support Decisive support infecting the same host species had a lower co-occurrence probability than those infecting different host species. Furthermore, humanhosted H1N1 lineages with a high cross-region transmission intensity showed a lower co-occurrence probability than swine-hosted lineages with lower cross-region transmission intensity, which supports our hypotheses and predictions. The estimated co-occurrence probability is widely used to represent interspecific competition strength (Yan et al., 2016) . Smaller cooccurrence probability often indicates more intense competition or frequent extinctions. Frequent extinction events in local populations will reduce genetic diversity (McCauley, 1991) . The high replacement rate of influenza strains is attributed to the low genetic diversity of virus strains in humans (Ferguson et al., 2003) . These results suggest that genetic diversity could be used as a proxy for the extinction rate. Su et al., 2015) . For example, the normal circulating influenza virus strains disappeared after novel viruses emerged in 1957 and 1968 (Palese & Shaw, 2006; Palese & Wang, 2011) . After the emergence of the 2009 H1N1 pandemic, the previously circulating swine H1N1 was replaced by the pandemic H1N1 during 2010-2011 (Pica et al., 2012) . We found the estimated co-occurrence probability of human-hosted lineages was lowest (0.011) (Figure 5 ), indicating that competition among human-hosted lineages is likely very high. Indeed, we found one lineage replacement event in human-hosted lineages (i.e., H2 replaced H1) by the GD-time method. Similar patterns were observed for the replacement events at the clade level: H1.1 was replaced by H1.2 and H1.3, and H2.1 was replaced by H2.2 (Figure 3a) , which is consistent with previous observations (Palese & Shaw, 2006; Palese & Wang, 2011) . However, the cooccurrence probability of swine-hosted lineages (S1, S2, S3) was relatively higher (0.096) (Figure 5 ), and we did not see such replacement through time in swine-hosted lineages, indicating lower competition in swine-hosted H1N1 lineages than in human-hosted H1N1 lineages. Notably, the competition defined here refers to indirect competition similar to the "apparent competition" in ecology, that is, competition between two species via a third species. The presence of one H1N1 lineage could enhance the immunity The selection pressure (d N /d S ) and nucleotide diversity (pi) of HA genes in each lineage of the host population (i.e., cross-immunity) and thus exerts negative selection pressure on other nondominant lineages (Pica et al., 2012) . Besides, isolation measures such as travel restrictions and social distance can be effective in controlling the spread of viruses (Cheng, Wan, et al., 2021) . Thus, human interventions such as vaccination and isolation may also have similar effects in mediating indirect competition between H1N1 lineages. and new lineages, driven by immune adaptation, can determine the outcome of lineage replacement (Ferguson et al., 2003) . When a lineage of influenza virus irrupts in a host, the host's immune system is activated and exercised, making the host more resistant to the virus (Krammer et al., 2018) . The newly outbreaking lineage has the advantages of larger infection volume and higher infection efficiency, which in the face of increased host immune response would cause the old lineages to face increased competition and pressure of extinction. Thus, as the level of herd immunity increases, the emerging lineages will gain a competitive advantage and thus replace the older lineages (Ferguson et al., 2003; Pica et al., 2012; Recker et al., 2007; Webster et al., 1992) . Competition strength is largely determined by the niche similarity between species (Hardin, 1960; Holt, 2009; Pearman et al., 2008; Pianka, 1974) . Influenza viruses compete for limited antigen resources of hosts (Pica et al., 2012; Recker et al., 2007) . Therefore, lineages sharing the same host would compete with each other more strongly by infecting the same host, which explains why the estimated co-occurrence probability (0.011-0.096) of human-or swine-hosted H1N1 lineages was very low, as compared to those Previous studies indicate that different influenza virus strains have distinct hosts (e.g., humans, pigs, birds) (Kuiken et al., 2006) , suggesting there are obvious barriers between different hosts limiting interspecific transmission. The receptor binding, HA acid stability (e.g., the pH inside the host cell will affect the activation of HA), and polymerase activity are related to the interspecies transmission of IAVs (Herfst et al., 2014; Russier et al., 2017) . Human-adapted IAVs tend to bind to α-2,6 receptors, while avian-adapted IAVs tend to bind α-2,3 receptors, and both receptors are abundant in pigs (Matrosovich et al., 2004; Nelli et al., 2010; van Riel et al., 2007) . When the influenza virus enters the host cell, it needs to activate the HA protein at a specific pH to cause membrane fusion and release the viral genome into the cytoplasm of infected cells (Bullough et al., 1994) . Avian influenza needs to activate HA protein at a higher pH, whereas human influenza needs to activate HA protein at a lower pH, and pigs support a broad range of HA activation pH (Russier et al., 2016 (Russier et al., , 2017 However, interspecific transmissions are also seen due to mutation or re-assortment of the influenza virus (Karakus et al., 2019; Kawaoka et al., 1989; Krammer et al., 2018; Landolt & Olsen, 2007) . For example, the 2009 H1N1 pandemic was caused by an emerging strain from swine to humans due to the genome reassortment of three different strains (Vijaykrishna et al., 2010) , which is supported by our observation on the interspecific transmission event from swinehosted lineage S3 to human-hosted lineage H2 in 2009 ( Figure 1 ). We also found another interspecific transmission event from humanhosted H1 lineage to swine-hosted S2 lineage ( Figure 1) . Notably, the two interspecific transmission events were continuous and smooth in genetic distance and time dimensions, suggesting the interspecific transmission rate was produced at a normal mutation rate. Geographic barriers are known to play a crucial role in the origin and evolution of a new species or subspecies by reducing gene flow (Howard, 2003; Zhao et al., 2021; Zhu et al., 2021) . However, with the accelerated international movement of people and goods, species can spread easily across continents, which leads to serious biological invasion problems, such as the extinction of native species (Blackburn & Ewen, 2017; Suarez et al., 2001) as well as global transmissions of influenza virus (Cheng, Li, et al., 2021; Lemey et al., 2014) . Previous studies have shown that geographic isolation prolonged the cocirculation of geographically segregated H1N1 lineages (Bedford et al., 2015) . It is still unclear how crossregion transmission of influenza viruses affects their competition and extinction patterns. Humans are much more capable of mobility than terrestrial animals, and can easily move across regions or continents through modern transportation (Brownstein et al., 2006) . This may explain why human-hosted H1N1 lineages (i.e., H1, H2) had a higher cross-region transmission intensity than the swinehosted lineages observed in this study (e.g., S1, S2) ( Figure 4) North America), and Northeast Asia had a significantly higher contribution than other regions in global spread of S3 ( Figure 4 , Table S1 ). Notably, our analysis was based on lineages as defined based on hosts and evolutionary units (e.g., H1, H2, etc.) and not subtypes (e.g., H1N1, H3N2, etc.). Therefore, such competition patterns may differ between our lineages and those subtypes. Genetic diversity is mainly determined by the effective population size and gene flow (Wang, 2002; Xu et al., 2013) . It is recognized that gene flow could increase the genetic diversity of a population (Xu et al., 2013) , but reduce heterogeneity among populations (Slatkin, 1987) . In our study, we found human-hosted H1N1 lineages had a higher cross-region transmission intensity, but a lower genetic diversity, likely due to the extinction of clades under higher competition pressure of human-hosted lineages. Lineages with high rates of clade extinction should have low genetic diversity. In our study, the nucleotide diversity and Tajima's D of swine-or avian-hosted H1N1 lineages was significantly higher than that of human-hosted lineages, indicating that human-hosted H1N1 lineages suffered more extinction pressure than the swine-or avian-hosted lineages. According to our study, the cross-region transmission intensity of H1N1 lineages is very similar to the biological invasion process, which can cause massive extinction of local native species (Clavero & Garciaberthou, 2005) . Our observations are consistent with the previous results that human H1N1 lineages have very low genetic diversity, high extinction rates, and that new strains regularly replaced old strains (Ferguson et al., 2003; Pica et al., 2012) . The lineages we identified are generally consistent with previously recognized lineages (Anderson et al., 2016 (Shope, 1931) , while the Eurasian avian lineage (or Eurasian avian-like) resulted from the spillover from avian flu in Europe with subsequent spread to Asia (Vincent et al., 2014) , and the human seasonal lineage originated in Europe in the 1990s (Brown et al., 1995) . In general, the relationship between H1N1 lineages and clades from different hosts and regions has not been fully examined. As shown by examining the H1N1 lineage evolution within the genetic distance and time dimension (Figure 1 ), H1 had been circulating in humans for several years before 2009 (Figures 1 and 4a ), but when H2 appeared with the outbreak of pdm09, H1 quickly disappeared (Figures 1 and 4a,b) . This supports the previous finding that pdm09 replaced the previous seasonal H1N1 (Krammer et al., 2018) , likely due to the strong competition between H1 and H2. As compared to other swine lineages (S2, S3), the genetic distance between S1 and avian lineage A1 is closer, but the stop codon is different between them; thus, we speculated that the S1 lineage may have come from the A1 lineage a long time ago. Since the sequences of H1 and S2 have the same codon deletion at the same position, and this deletion does not exist in other lineages, S2 could derive from H1, suggesting another interspecific transmission event ( Figure 1 , Table 2 ). H1 disappeared in 2009 due to the emergence and replacement of pdm09 (Krammer et al., 2018) , but the S2 lineage was not replaced, likely because lineages in swine suffered less competition and swine became the reservoir of this lineage. Birds, especially aquatic birds, are considered to be the natural reservoir of influenza viruses (Yoon et al., 2014) . However, our analysis shows that H1N1 has only one lineage (A1) in birds, and A1 is genetically distant from other lineages and has no genetic crossover with the other lineages (Figures 1 and 2) , probably due to the large host barrier between avian and mammals. Our conclusion regarding the global dynamics has several limitations. First, competition was defined based on co-occurrence probability rather than direct evidence of causal mechanisms. The mechanism of competition of influenza viruses for antigenic resources needs to be examined and tested. Besides, human intervention such as vaccination, travel restriction, or isolation may attribute to the strength of competition or replacement of lineages. Second, sampling bias may affect our results. Although resampling could help to overcome the problem, many regions are lacking samples which would cause biased estimation of cross-region transmission or even evolutionary trees. Third, there are more than one hundred subtypes of influenza viruses, and their transmission patterns may be different from H1N1. It is necessary to test the hypotheses using various subtypes. In summary, we found lineages infecting the same host or hosts with a higher cross-regional transmission intensity suffered a higher competition and extinction pressure, which highlights the roles of host and geographic barriers in shaping the competition, coexistence, and extinction patterns of H1N1 lineages. Our results suggest that it is necessary to reduce close contact among different hosts to reduce interspecific transmissions and to reduce cross-border transport of live livestock and poultry to reduce cross-region transmissions in the world. This work is supported by a project from the Institute of Zoology, The authors declare no conflict of interest. The data that support the findings of this study are openly available A phylogeny-based global nomenclature system and automated annotation tool for H1 hemagglutinin genes from swine influenza A viruses Competitive exclusion Global circulation patterns of seasonal influenza viruses vary with antigenic drift Sprea D3: Interactive visualization of spatiotemporal history and trait evolutionary processes Parasites as drivers and passengers of human-mediated biological invasions BEAST 2.5: An advanced software platform for Bayesian evolutionary analysis Disease outbreaks in pigs in Great Britain due to an influenza A virus of H1N2 subtype Empirical evidence for the effect of airline travel on inter-regional influenza spread in the United States Structure of influenza haemagglutinin at the pH of membrane fusion Modeling analysis revealed the distinct global transmission patterns of influenza A viruses and their influencing factors Modeling analysis reveals the transmission trend of COVID-19 and control efficiency of human intervention Invasive species are a leading cause of animal extinctions New routes to phylogeography: A Bayesian structured coalescent approximation Bayesian phylogenetics with BEAUti and the BEAST 1.7 Ecological and immunological determinants of influenza evolution Neutral theory and rapidly evolving viral pathogens The niche-relationships of the California thrasher Exploring network structure, dynamics, and function using networkx The competitive exclusion principle Avian influenza virus transmission to mammals. Assessment & Evaluation in Higher Education Bringing the Hutchinsonian niche into the 21st century: Ecological and evolutionary perspectives Speciation: Allopatric. In eLS Concluding remarks Determinate evolution in the color-pattern of the lady-beetles MHC class II proteins mediate cross-species entry of bat influenza viruses Avian-to-human transmission of the PB1 gene of influenza A viruses in the 1957 and 1968 pandemics Evolutionary rate at the molecular level Non-Darwinian evolution Not so different after all: A comparison of methods for detecting amino acid sites under selection HyPhy 2.5-A customizable platform for evolutionary hypothesis testing using phylogenies Host species barriers to influenza virus infections MEGA7: Molecular Evolutionary Genetics Analysis Version 7.0 for bigger datasets Up to new tricks -A review of cross-species transmission of influenza A viruses Unifying viral genetics and human transportation data to predict the global transmission dynamics of human influenza H3N2 Bayesian phylogeography finds its roots A brief history of bird flu Human and avian influenza viruses target different cell types in cultures of human airway epithelium Genetic consequences of local population extinction and recolonization Pandemic influenza: Certain uncertainties Comparative distribution of human and avian type sialic acid influenza receptors in the pig Emergence and pandemic potential of swine-origin H1N1 influenza virus Population experiments with the Anolis lizards of St. Maarten and St Fields virology. Ritstjórar: Knipe DM, Howley PM What can we learn from reconstructing the extinct 1918 pandemic influenza virus? Why do influenza virus subtypes die out? A hypothesis A novel twelve class fluctuation test reveals higher than expected mutation rates for influenza A viruses. eLife Niche dynamics in space and time Niche overlap and diffuse competition Hemagglutinin stalk antibodies elicited by the 2009 pandemic influenza virus as a mechanism for the extinction of seasonal H1N1 viruses The generation of influenza outbreaks by a network of host immune responses against a limited set of antigenic types H1N1 influenza viruses varying widely in hemagglutinin stability transmit efficiently from swine to swine and to ferrets Molecular requirements for a pandemic influenza virus: An acid-stable hemagglutinin protein Reviewing the history of pandemic influenza: Understanding patterns of emergence and transmission Swine influenza Gene flow and the geographic structure of natural populations Origins and evolutionary genomics of the 2009 swine-origin H1N1 influenza A epidemic Phylodynamics of H1N1/2009 influenza reveals the transition from host adaptation to immune-driven selection Patterns of spread in biological invasions dominated by long-distance jump dispersal: Insights from Argentine ants DendroPy: A Python library for phylogenetic computing Stochastic models: An algorithmic approach A cross-immunization model for the extinction of old influenza strains Human and avian influenza viruses target different cells in the lower respiratory tract of humans and other mammals Reassortment of pandemic H1N1/2009 influenza A virus in swine Review of influenza A virus in swine worldwide: A call for increased surveillance and research The genetic diversity of central and peripheral populations of ratlike hamster (Cricetulus triton) Datamonkey 2.0: A modern web application for characterizing selective and other evolutionary processes Evolution and ecology of influenza A viruses Variation of genetic diversity in a rapidly expanding population of the greater long-tailed hamster (Tscherskia triton) as revealed by microsatellites Species cooccurrence and phylogenetic structure of terrestrial vertebrates at regional scales Evolution and ecology of influenza A viruses. Assessment & Evaluation in Higher Education Early divergence and differential population histories of the Indo-Pacific humpback dolphin in the Pacific and Indian Oceans Lineage diversification and niche evolution in the Reeves' Butterfly Lizard Leiolepis reevesii (Agamidae)