key: cord-0791825-iaht6nuh authors: James, Nick; Menzies, Max; Bondell, Howard title: In search of peak human athletic potential: A mathematical investigation date: 2021-09-28 journal: Chaos DOI: 10.1063/5.0073141 sha: 2f14cbf9bad3d0e70890e7201f87a7ba21bb304e doc_id: 791825 cord_uid: iaht6nuh This paper applies existing and new approaches to study trends in the performance of elite athletes over time. We study both track and field scores of men and women athletes on a yearly basis from 2001 to 2019, revealing several trends and findings. First, we perform a detailed regression study to reveal the existence of an"Olympic effect", where average performance improves during Olympic years. Next, we study the rate of change in athlete performance and fail to reject the notion that athlete scores are leveling off, at least among the top 100 annual scores. Third, we examine the relationship in performance trends among men and women's categories of the same event, revealing striking similarity, together with some anomalous events. Finally, we analyze the geographic composition of the world's top athletes, attempting to understand how the diversity by country and continent varies over time across events. We challenge a widely held conception of athletics, that certain events are more geographically dominated than others. Our methods and findings could be applied more generally to identify evolutionary dynamics in group performance and highlight spatio-temporal trends in group composition. This paper applies existing and new approaches to study trends in the performance of elite athletes over time. We study both track and field scores of men and women athletes on a yearly basis from 2001 to 2019, revealing several trends and findings. First, we perform a detailed regression study to reveal the existence of an "Olympic effect", where average performance improves during Olympic years. Next, we study the rate of change in athlete performance and fail to reject the notion that athlete scores are leveling off, at least among the top 100 annual scores. Third, we examine the relationship in performance trends among men and women's categories of the same event, revealing striking similarity, together with some anomalous events. Finally, we analyze the geographic composition of the world's top athletes, attempting to understand how the diversity by country and continent varies over time across events. We challenge a widely held conception of athletics, that certain events are more geographically dominated than others. Our methods and findings could be applied more generally to identify evolutionary dynamics in group performance and highlight spatio-temporal trends in group composition. Throughout time, humans have aspired to improve in a wide range of societal aspects. Over the last several hundred years, we have seen significant advances in technology, health, the economy, education, and other fields. Further progress in such areas of human life is expected to continue in the coming years. In areas such as sports and athletics, there may be an ultimate limit to human potential. The recent completion of The Tokyo 2020 Olympic and Paralympic Games piqued our interest in studying evolutionary patterns in human athletic performance over the last two decades. Throughout this paper, we seek to better identify patterns and structural similarities in men and women's track and field events. We seek to identify anomalous trajectories, explore the existence and potential impact of an "Olympic effect" on collective athlete performance, measure the similarity of performance trends between men and women, and study the variance in geographic representation of events' top athletes. We hope that this work will further encourage the application of techniques from the nonlinear dynamics community to better understand time-varying behaviors in athletics, and sports more generally. Every year, there are many competitions at which the world's top athletes participate. Athletic sports comprise two main categories: track, in which athletes are scored by their times (and a lower measured time is a better result), and field, in which athletes are scored by a distance thrown or jumped (and a higher distance is a better result). Typically, competitions consist of both track and field events. In all events, elite athletes a) Electronic mail: max.menzies@alumni.harvard.edu seek the marginal improvements necessary to best their opponents, where seconds, split seconds, and centimeters may make the difference. Among all competitions, the Olympic Games, held every four years, are the most well-known and prestigious, attracting competitors worldwide. The modern Olympic Games were first held in 1896 in Athens, Greece, and winning an Olympic gold medal is considered one of sport's greatest achievements. Given the intense competition around score improvements, athletes, commentators, analysts, and fans alike all share an interest in closely tracking the best results over time. There are numerous reasons for understanding the trends in these results, including fan interest, the identification of avenues for improvement, certain countries performing better than others, allocation of dedicated resources to burgeoning young athletes, and identifying athletes with promising growth trajectories who could become suitable brand ambassadors for various companies. For this purpose, there has been a substantial body of statistical research on sports conducted both on a private and academic basis. 1-5 In particular, statisticians have applied extreme value theory to investigate records, namely singularly outstanding performances, in athletics. [6] [7] [8] [9] However, there have been limited applications of nonlinear dynamics and physicallyinspired mathematics to understand patterns and trends across various sports. [10] [11] [12] [13] Our paper builds on a long literature of multivariate time series analysis, methods that are more frequently applied to other domains, including epidemiology, [14] [15] [16] [17] [18] [19] [20] [21] finance, [22] [23] [24] [25] [26] [27] [28] [29] and other fields. [30] [31] [32] Methods of time series analysis are broad, including parametric models, 33,34 distance analysis and correlation, [35] [36] [37] [38] network models, 39-41 clustering 42 and many others. [43] [44] [45] [46] In this paper, we draw upon numerous existing techniques, including linear regression, 47 trajectory analysis 48, 49 , inconsistency analysis, 50,51 agglomerative hierarchical clustering 52 and geographic dispersion. 53 This paper aims to study trends and relationships between the performance (and composition) of male and female athletes. First, Section III performs several regressions to investigate the existence and extent of an "Olympic effect," in which athletes' arXiv:2109.13517v2 [physics.soc-ph] 1 Feb 2022 scores are usually (but not always) superior during Olympic years. Section IV examines the linearity of score increments more closely to investigate whether there is evidence of a recent leveling off in average scores. Subsequently, Section V takes a closer analysis of the yearly changes in scores and shows broad alignment between male and female performance in corresponding events. Next, we analyze the trajectories of performances over time and determine inconsistencies between the performance of men and women in Section VI. Finally, Section VII analyzes the composition of top athletes on an eventby-event basis. We track the extent of the geographic spread of the best athletes across the world over time. In particular, we show surprising uniformity in this variance, challenging the notion that some events are more dominated by smaller collections of countries than others. Our data consists of the top 100 scores (times and distances, respectively) recorded each year in the men and women's competitions of 16 track and field events. Our data source is World Athletics (https://www.worldathletics.org). Due to the cancellation of many sporting events during COVID-19 and 2021's being incomplete, we exclude 2020 and 2021 from our analysis window. Thus, our data spans t = 2001, ..., 2019. Track events are scored in times, with shorter times scoring better, while field events are scored in distances, with larger distances scoring better. Thus, we consistently analyze track and field events separately, and briefly consider the difference in behavior between the top m = 10 and 100 scores in each event. Our events are the following: high jump, long jump, pole vault, triple jump, discus, hammer throw, javelin, shot put, 10 000 meters (10K), 5000 meters (5K), 3000 meters (3K), 1500 meters (1500m), 800m, 400m, 200m and 100m. All these running events are of a flat elevation. Throughout the manuscript, let x i (t), y i (t) be the average of the top m scores in the women and men's competition of the event indexed by i. In this section, we test the existence of an Olympic effecta phenomenon where athletic performance is superior during Olympic years. To study this effect, we fit several linear regression models for both men and women's events. In model 1, we assume that athletic performance is a linear function over time subject to additive white noise. For each men's event i, model 1 is formulated as follows: (1) Here, y i (t) is the mean of the top m scores in year t, β 0 is the model's intercept and β 1 is the trend of event performance over time, both of which we estimate in the modeling procedure. We implement this for both m = 10 and 100, averaging the top 10 and 100 scores per year, respectively. As is typical in an ordinary least squares (OLS) framework, ε(t) is assumed to be independently and identically distributed from a Gaussian distribution centered at zero. In Appendix A, we include some necessary validation of technical hypotheses that underpin the reliability of the linear regression model. In model 2, we assume that athletic performance is a linear function over time and explicitly model an Olympic effect with an indicator function during Olympic years. Model 2 is formulated as Here, 1 O is an indicator function taking a value of 1 only during Olympic years, so α 0 measures the extent of this effect. Finally, model 3 explores the possibility of 4-year periodic behavior in scores not primarily driven by Olympic years. Specifically, we fit the response variable y i (t) of mean scores against not only year t, but also a categorical variable according to whether t is 0,1,2 or 3 (modulo 4). We encode this with three dummy variables as follows: Here, 1 j (t) is an indicator function taking value 1 if t ≡ j (mod 4). Then α 1 , α 2 and α 3 are the linear coefficients for years that are 1,2 and 3 modulo 4, respectively. As is standard with dummy encoding of categorical variables, four separate categories are encoded with three numeric indicator variables. It is arbitrary which category is not included in this encoding, so we have chosen to distinguish the coefficient of Olympic years by not including an α 0 term here. All three models are written analogously for women's events, using x i (t) as our response variable. We fit these three models to all track and field events for both men and women and explore the insights therein. Our general approach is to find a simple model that generalizes well across most events. Accordingly, we validate our models' estimates using adjusted R 2 , where additional parameters yield a complexity penalty. Table I shows the adjusted R 2 for all three models, where one can see that model 2 is generally the best candidate model. Specifically, model 2 has the best adjusted R 2 score in 58% of events. In the final four columns of the table, we present α 0 and β 1 for model 2, in the cases where m = 100 and m = 10 respectively. β 1 represents the coefficient of the linear term, and α 0 represents the coefficient of the Olympic indicator function. Table I demonstrates that most events display an improving linear trend in performance, with a positive Olympic effect (greater distances for field events and lower times for track events). In the case where m = 100, all field events exhibit a positive α 0 term, with the majority being statistically significant, an Olympic effect where performance improves during Olympic years. Findings on the track are similar, where all but two events (the men and women's 3000 meters) display a negative α 0 term, indicating improved track performance during Olympic years. Interestingly, this Olympic effect appears to be less ubiquitous in the case where m = 10. Five field events (men's long jump, men's pole vault, men's triple jump, men's javelin, and I : First, we record the adjusted R 2 values for three linear models applied to the top m = 100 scores and observe that model 2 is generally the best fit. Next, we record regression coefficients for model 2, both for m = 100 and m = 10. We highlight β 1 and α 0 , the year and Olympic indicator coefficients, respectively. We indicate associated p-values of p < 0.01, p < 0.05, p < 0.1 with * * * , * * and * , respectively. men's shot put) display a negative Olympic effect, while those that are positive are mostly not statistically significant. That is, among the top 10 athletes, we do not observe evidence that Olympic years are associated with increased average athletic performance. Similarly, with track, Olympic years provide less consistently positive boosts in performance. Four track events (men's 800m, women's 1500m, men's 1500m, and women's 3000m) have a positive (but not significant) α 0 term, again indicating less evidence for improved performance during Olympic years. For most track and field events, especially with m = 100, model 2 does a good job of capturing the true essence of the data: a linear trend accompanied by a (generally positive) effect on average performance during Olympic years. Figure 1 shows model 2's estimates for four events: two track events and two field events when m = 100. Figures 1a and 1b , which display the men's 100m and women's 200m, respectively, show typical results for track events. Each features an evident linear trend of reduced times and Olympic effect of improvement during Olympic years. Turning to the field events, Figure 1c , displaying the women's pole vault, shows a typical result for the field. A positive linear term is evident, highlighting consistent improvement in average pole vault performance between 2001 and 2019, as well as spikes of improved performance during Olympic years. Finally, Figure 1d highlights an anomalous trend in the men's hammer throw. Surprisingly, average performance is observed to decline with time, although a beneficial Olympic effect persists. All events exhibit a linear trend with a pronounced Olympic effect. All except the hammer throw exhibit better performance with time (lower times for track and higher distances for field). The hammer throw is an anomaly, exhibiting worse performance with time. In this brief section, we are interested in whether we can observe whether athletic scores are approaching a peak, slowing down in incremental improvements. We now examine this more closely by analyzing the first differences in scores .., 2018, similarly for ∆y i . In this section, we fix m = 100, always considering the average of the top 100 scores each year. Unlike other literature that studies the evolution of world records or the best result each year, this paper seeks to determine trends in overall athlete performance. Accordingly, we do not run our analysis when m = 1, as the underlying trends in performance would be subject to outliers, and may become too volatile. This becomes even more severe when we look at first differences (effectively a first derivative), which is always more irregular than the original data. If we were to study trends in the men's 100 metres where m = 1, one would identify peak performance in 2009, and since then, a decline in performance. However, this peak corresponds to Usain Bolt's exceptional time of 9.58 seconds. 54 To put this result in perspective, the three best times in the men's 100 metres in the 2019 season were 9.76 seconds, 9.86 seconds and 9.87 seconds 54 -notably behind Bolt's world record time. One can see how the study of too few samples within each year may lead to erroneous insights as to whether collective performance of elite athletes in various events is improving or declining. Echoing Section III, we formulate two models for these differences. Model 1 is formulated as follows: analogously to (1). It aims to track the differences in improvement relative to time. Model 2 is formulated analogously to (2) , including a difference term that incorporates the predicted increase during and decrease after an Olympic year. Table II displays the determined β 1 coefficients for each model, and the α 0 term from model 2. Importantly, not a single associated p-value associated with β 1 in either model is less than 0.1, offering no evidence from either model in any event that the rate of incremental improvements is changing over time. On the other hand, even in the first difference model, most values of α 0 are positive and significant for field events, and negative and significant for track events. That is, from both Section III and IV, we observe linear fits in improving performance, with clear evidence of an Olympic effect, but no evidence of diminishing returns or nearing the natural peak of human performance, at least in the mean of the top 100 scorers over this period. In this section, we further analyze the first difference in scores ∆x i , ∆y i . Individual examinations of the data suggest a substantial similarity in the year-on-year changes between men and women's performances within the same event. That is, when the mean score in the men's exhibits an increase or decrease, the same is usually seen in the womens' scores for the same event. To investigate this quantitatively, we compute the normalized inner product η i =< ∆x i , ∆y i > n for each event i: This computes the alignment between the first differences for each event between the yearly changes in men and women's scores. As we use a normalized inner product, it is appropriate to compare different time series with different scales (including different growth rates). More broadly, we compute a 2N × 2N matrix M defined by This matrix computes normalized inner products between all first difference sequences for both men and women's events. Indices i = 1, ..., N correspond to female first difference time series ∆x i , while i = N + 1, ..., 2N correspond to male first difference time series ∆y i . This matrix is constructed to compute inner products between all events of each gender. The subscript n refers to the fact that the inner product is normalized, while N = 16 is the total number of events. All these values First, we display the alignment values η i =< ∆x i , ∆y i > for all events in Figure 2 . Across all events, the median value is 0.54, which is relatively high given that all values lie in [−1, 1] and under no association would be centered around zero. The median alignment of field events is slightly lower than that of track events, from 0.45 to 0.57 respectively. The javelin has the greatest alignment between men and women's first differences with an alignment of 0.8, while the 3000m has the lowest alignment of -0. 19 . To further explore alignment pairings between all candidate events, we perform hierarchical clustering on M in Figure 3 . Specifically, we implement agglomerative hierarchical clustering with the average-linkage criterion. 52 Figures 3b and 3a display the dendrograms of track and field events, respectively. Both dendrograms display a similar theme, with frequent clustering of the same event for women and men. First, Figure 3b has six instances where the equivalent event for men and women is determined to be most similar (100m, 800m, 1500m, 3K, 5K, and 10K). Furthermore, the dendrogram displays a sub-cluster of great affinity in the bottom left quadrant, composed of all short and middle-distance events of both genders. Figure 3a shows the alignment dendrogram between field events. The dendrogram consists of one dominant cluster of primarily throwing events and a smaller second cluster of jumping events. The first major cluster includes both genders' hammer throw, discus, javelin, and shot put events. This major cluster consists of two subclusters, which exhibit more pronounced affinity within themselves. The second cluster consists of the men and women's high jump and men's long jump. Like Figure 3b , the first difference behaviors are broadly grouped by event; this effect is more pronounced in track than field events. For example, the men and women's 100 and 200m are closely clustered together, which may reflect substantial intersection in the athletes performing in these events. II: Regression coefficients β 1 and α 0 for the year and Olympic indicator, respectively, for two models that track the first differences in scores over time. We indicate associated p-values of p < 0.01, p < 0.05, p < 0.1 with * * * , * * and * , respectively. Broad similarity is observed between each model's β 1 coefficients, with not a single β 1 determined to be significant. Thus, our two models produce no evidence of concavity in the trend of the top 100 scores. This finding is by no means inevitable: one could imagine a scenario where the most similar behavior in these first differences is found between events within the same gender, that is, where athletic improvements are most closely associated among athletes competing of the same gender. This is shown not to be the case. Instead, the results in Figure 3 suggest that improvements are passed on rather uniformly between men and women. This could be due to breakthroughs in technology, technique, strength and conditioning, for each event, being passed on to men and women simultaneously. Having observed broad similarity between (first differences in) scores between men and women in the same event in Section V, we now wish to approach the relationship between male and female scores differently and identify any events where there is relatively less consistency in scores. This section aims to study trends over time in scores, moving away from the first differences of the last section. As before, let x i (t), y i (t), i = 1, ..., N,t = 1, ..., T be the multivariate time series consisting of the mean of the top m = 100 women's and men's scores, respectively. While men and women are analyzed in conjunction, we never simultaneously compare track events to field events quantitatively, only descriptively, so our (7), measuring the extent of alignment in first differences between events. Numerous men and women's categories of the same event cluster together, both for (a) field and (b) track events. In addition, clustering is also observed between similar events, potentially due to intersection in the athletes performing in them. notation suppresses the dependence on these choices. Hence, rather than having N = 16 total events, our notation denotes N = 8 events separately for each of the track and field events. To determine the consistency of evolutionary performance in men and women's athletics events, we normalize each event's time series for an appropriate comparison. To determine each men's normalized trajectory we first compute y i = ∑ T t=1 y i (t), and then defineỹ i (t) = y i (t) y i . We similarly define women's normalized trajectoriesx i (t). Then, we may compute distances , for men and women respectively. We remark that there are alternative ways to normalize such trajectories, such as subtracting the means and dividing by the ranges. As we use the L 1 metric to measure distances between the normalized trajectories, it makes consistent sense to use the L 1 norm in the normalization. Next, we apply the linear transformation A i j = 1 − D i j max{D} to both the men and women's distance matrices, producing affinity matrices A with all entries in [0, 1]. To determine the consistency in each event's relative trajectory among the separate collections of men and women's athletic events, we compute the consistency matrix C = |A x − A y |. Specifically, C i j = |A x i j − A y i j |. Elements in the matrix C close to 0 represent distances between events that are relatively similar among men and women. To determine the most inconsistent events, we compute a j = ∑ N i=1 C i j and rank a j , for j = 1, .., N. Larger values indicate greater inconsistency between the male and female categories of the same events regarding their relationship with other events. We note that these distance, affinity and consistency matrices are computed separately for track and field events, so we suppress the dependence on this choice. Figure 4 documents the inconsistency values a j of male and female trajectories in each event. The table highlights greater consistency in event trajectories among the track compared to the field events, with median values of 0.97 and 1.46, respectively. Field events display greater variability in their inconsistency values, with a standard deviation of 0.61, compared with 0.21 exhibited by track events. Indeed, both the most and least consistent events are field events, namely the javelin (inconsistency value of 0.61) and the pole vault (value of 2.58). Figures 5a and 5b show the normalized trajectories of athletic scores over time for both men and women in the javelin and pole vault, respectively. It is evident from Figure 5 that the javelin exhibits highly similar trajectories for men and women -both feature a positive linear trend and rather uniform improvement with time. By contrast, the pole vault displays highly varying trends between genders. The women's pole vault shows steady improvement in performance, while the men's event exhibits a predominantly flat trajectory over time. Finally, we investigate trends in the composition of the top 100 scoring athletes in each event over time. Our data set records which country each athlete represents in the competitions, and we aim to produce and analyze a time-varying measure of the geographic diversity of athletes. and compute the total geographic dispersion over time, defined by measured in meters. Our analysis highlights several trends in the variance of geographic representation among men and women's events. All values of G i , quantifying the total geographic variance for each event, are recorded in Table IV . First, a highly similar geographic dispersion is identified between track and field events. The track events produce a mean (and median) value of 9.2 x 10 6 , while the field events produce a mean (and median) value of 8.8 x 10 6 . The most geographically concentrated track event is the women's 200m, exhibiting a total value of 8.3 x 10 6 . The least concentrated track event is the women's 5K, which produces a cumulative value of 9.8 x 10 6 . Among field events, men's hammer throw consists of the most geographic concentration among the top 100 athletes, while the men's long jump is the most geographically dispersed, with cumulative values of 7.2 x 10 6 and 9.9 x 10 6 , respectively. Second, a highly similar geographic dispersion is also observed between men and women's events. The mean values among men and women are 9.0 x 10 6 and 8.9 x 10 6 , respectively (with median values both 9.0 x 10 6 ). Among men's events, the most geographic variance in athletes is in the men's long jump, while the greatest concentration is in the men's hammer throw. Among female events, the greatest geographic concentration is in the 5K and 10K, with total values of 9.8 x 10 6 , while the least geographic dispersion among athletes is in the 200m and high jump, which both have cumulative values of 8.3 x 10 6 . Figure 6 displays the time-varying geographic variance for the aforementioned events with the greatest and least total values. Three of these displayed events exhibit a moderate increase in geographic variance over time, but the women's 200m exhibits a rather pronounced decrease. As a case study, we investigate the changing composition of this event. In Table III, To investigate this further, we plot the number of US athletes among the top 100 as a function of time in Figure 7 . Figure 1b depicts the counts for US athletes in the men and women's 200m. To include a field event as well, Figure 7b depicts the analogous counts for the shot put. In each sport, we notice a considerable increase in the proportion of women athletes from the US. In this work, we have performed several quantitative analyses to identify patterns in the performance of athletes, the relationship between men and women's trends within the same event, and the geographic composition of athletes over time. This work takes a different approach to other quantitative analyses focused on modelling sport. Rather than modelling yearly performance data among a very small sample of results [5] [6] [7] [8] [9] (usually just the top scorer), this study is interested in broader patterns in performance. Furthermore, many of the techniques used throughout this paper may not provide the same level of insight when small annual samples of results lead to extremely volatile trends. Accordingly, throughout most of our analysis we study the top m = 100 scorers per event per year. First, in Section III, we implemented a regression analysis that revealed linear trends in improving average scores over time. By contrasting the fits of several models, we observed a positive Olympic effect in almost every event, where average scores are superior during Olympic years. While not necessarily surprising, this finding was by no means inevitable, as highly significant athletic events, such as the World Athletic Championships, occur more regularly. Nonetheless, it seems that the Olympic Games produce perhaps the highest levels of training and dedication to achieving the best scores possible. There may be several nuanced reasons for this effect. On the one hand, the increased media attention during the Olympics may increase performance; on the other hand, anxiety and stress may lead to decreased performances. 55 Next, we examined the linearity of these fits in more detail in Section IV. Perhaps surprisingly, and contrary to some recent reporting, 56, 57 we observed no evidence that average performance is "leveling off" close to the present day. This finding may differ with a careful analysis of world records, namely the single top athletic performance by year, but no evidence was observed when analyzing the top 100 athletes. We remark that there may be considerable limitations to analyzing just the single top performance. For example, no athlete has so far surpassed Usain Bolt's world-record performance in the men's 100m or 200m events, set in 2009. An incorporation of the top 100 athletes provides a better view of the holistic state of the elite performance in any candidate event. On the other hand, it is conceivable that incorporating more data, either further into the past or into the future could exhibit some leveling off. We must also note that these two sections carry notable limitations and one should be cautious interpreting these results. For example, non-significant coefficients could be an artefact of the low number of data points (just 19) per event. More broadly, a significant coefficient (such as β 1 for the year-on-year improvement) is always more interpretable than a non-significant coefficient, which is not conclusive evidence of anything. The fact our methods do not broadly produce evidence of performance levelling off is related both to the choice of method (9), for each event. Limited variation is observed between different events. and the question of study. Specifically, our methods (such as regression) work well in the context of studying average performances (mostly over the top m = 100 athletes). They would not work as well focused specifically on the top scorer (m = 1). For this latter topic, many other papers have used quite different statistical techniques. [5] [6] [7] [8] [9] Continuing our study of the first differences, we observed substantial alignment between the men and women's categories of each event in Section V. Even beyond the Olympic effect, men and women's average scores moved in the same direction on a yearly basis to a considerable extent. Clustering based on our normalized inner products frequently paired men and women's categories of the same events together. This suggests that sports may be characterized by improvements in technique and training that benefit the men and women's competitions simultaneously. Interestingly, short and middle distance running exhibited pronounced similarity which may reflect the high degree of intersection in athlete registration among these events. For instance, it is not unusual for the same athlete to represent their country in the 100m and 200m or 200m and 400m. Usain Bolt of Jamaica and Michael Johnson of the US are two examples of such an overlap. In Section VI, we took a different approach, this time seeking anomalies in the relationship between men and women's categories of the same event. For this purpose, we investigated consistency in the normalized trajectories of scores with time. We were able to identify the pole vault and javelin as the events with the greatest and least consistency in the trajectories of men and women's events. We failed to identify higher levels of structure among our trajectory anomalies. Finally, Section VII turned our attention to the athletes themselves, specifically their countries of origin and the changing composition of events over time. Here, we obtained a surprising result: a high degree of similarity in geographic variance across all studied events, track and field, men and women. This may counter a popular conception that certain events are more dominated by smaller selections of countries, such as sprint events by athletes from the US and Jamaica. 58 Instead, with some exceptions, almost every event exhibited proportionate variance of athletes from across the world. Some trends in changing levels of geographic variance over time were observed, particularly for some women's events. Future work can study the proportion of athletes from the US and investigate whether this proportion is increasing, especially in women's events. Overall, this work introduces methods to study the evolution of elite athletes' performance over the past two decades. We identify structural similarities and dissimilarities among various track and field events and the performance of men and women. In addition, we highlight surprising homogeneity in time-varying geographic composition among various events' athletic representation. Although these methods were developed for this application specifically, we believe that our techniques could be applied beyond athletics and more broadly beyond competitive sport. Researchers interested in identifying structure in any phenomenon evolving over time and comparing the evolutionary performance of subgroups could apply these techniques effectively. The data that support the findings of this study are openly available at Ref. 54. A statistical analysis of batting in cricket Random walk picture of basketball scoring Testing the uncertainty of outcome hypothesis in european professional football Bayesian prediction of winning times for elite swimming events Records in athletics through extremevalue theory Statistics for exceptional athletics records Trends and random fluctuations in athletics Extremal analysis of short series with outliers: sea-levels and athletics records Anomalous diffusion and long-range correlations in the score evolution of the game of cricket Scoring dynamics across professional team sports: tempo, balance and predictability Safe leads and lead changes in competitive team sports Game story space of professional sports: Australian rules football Comparing the dynamics of COVID-19 infection and mortality in the United States, India, and Brazil COVID-19 in the United States: Trajectories and second surge behavior Mathematical models to characterize early epidemic growth: A review COVID-19 second wave mortality in Europe and the United States Strong correlations between power-law growth of COVID-19 in four continents and the inefficiency of soft quarantine strategies Power-law distribution in the number of confirmed COVID-19 cases Trends in COVID-19 prevalence and mortality: A year in review Estimating a continuously varying offset between multivariate time series with application to COVID-19 in the United States Complexity in economic and social systems: Cryptocurrency market at around COVID-19 Association between COVID-19 cases and international equity indices Complexity in economic and social systems Efficiency of communities and financial markets during the 2020 pandemic Competition of noise and collectivity in global cryptocurrency trading: Route to a self-contained market A new measure between sets of probability distributions with applications to erratic financial behavior Multiscale characteristics of the emerging global cryptocurrency market Structural clustering of volatility regimes for dynamic trading strategies Polynomial growth in branching processes with diverging reproductive number Distance correlation detecting Lyapunov instabilities, noise-induced escape times and mixing Growing networks with communities: A distributive link model The mathematics of infectious diseases Forecasting COVID-19 Measuring the distance between time series Measuring and testing dependence by correlation of distances Decay of the distance autocorrelation and Lyapunov exponents Novel semi-metrics for multivariate change point analysis and anomaly detection A social network model of COVID-19 Four-tier response system and spatial propagation of COVID-19 in China by a network model A data-driven network model for the emerging COVID-19 epidemics in Wuhan, Toronto and Italy Rare and extreme events: the case of COVID-19 pandemic Could masks curtail the postlockdown resurgence of COVID-19 in the US? Interpreting SARS-CoV-2 seroprevalence, deaths, and fatality rate -making a case for standardized reporting to improve communication Piecewise-constant optimal control strategies for controlling the outbreak of COVID-19 in the Irish population European and US lockdowns and second waves during the COVID-19 pandemic The Elements of Statistical Learning Trajectory modelling techniques useful to epidemiological research: A comparative narrative review of approaches Group-based trajectory modeling: An overview Changes to the extreme and erratic behaviour of cryptocurrencies during COVID-19 Collective correlations, dynamics, and behavioural inconsistencies of the cryptocurrency market over time Fastcluster: Fast hierarchical, agglomerative clustering routines forRandPython Understanding spatial propagation using metric geometry with application to the spread of COVID-19 in the United States Does the media impact athletic performance Athlete atypicity on the edge of human achievement: Performances stagnate after the last peak As Olympic athletics begins, the data suggests we are at the peak of human ability Why are Jamaicans so good at sprinting Model selection and psychological theory: A discussion of the differences between the Akaike information criterion (AIC) and the Bayesian information criterion (BIC) The authors would like to thank Kirsten Doert Eccles from the Melbourne Centre for Data Science, who first proposed the idea on writing a paper studying human performance and the Olympics. The authors would also like to thank Nathaniel Bloomfield for his help in accessing the data. In Figure 8 , we demonstrate the homoscedasticity of residuals for a collection of events, complementing the linear regressions performed in Section III. The plots show in most cases that residuals are satisfactorily distributed with mean zero and rather constant variance.In Tables V, VI and VII, respectively, we document Akaike information criteria (AIC) and Bayesian information criteria (BIC) scores 59 for each of Models 1,2 and 3 from Section III. In Figure 9 , we complement our study of geographic variation in Section VII with a more straightforward measure of athletes' diversity of nationalities each year. We display computed Gini coefficients for track and field events, both according to the top 100 and 10 scorers each year. Compared to Section VII, we see slightly more variability between different events. These different results can be explained by the different method of calculation, which does not take into account geographic dispersion.