key: cord-0906261-141wpv04 authors: Tomchin, Dmitry A.; Fradkov, Alexander L. title: Prediction of the COVID-19 spread in Russia based on SIR and SEIR models of epidemics date: 2020-12-31 journal: IFAC-PapersOnLine DOI: 10.1016/j.ifacol.2021.04.209 sha: e8b75c0170c3c7dfbc2935d4dd515f2c4d5c6574 doc_id: 906261 cord_uid: 141wpv04 An attempt is made to use the simplest epidemic models: SIR and SEIR to predict the spread of COVID-19 in Russia. Simplicity and a small number of parameters are very significant advantages of SIR and SEIR models in conditions of a lack of numerical initial data and structural incompleteness of models. The forecast of distribution of COVID-19 in Russia is carried out according to public data sets from March 10 to April 20, 2020. Comparison of forecast results by SIR and SEIR models are given. In both cases, the peak number of infected persons while maintaining the current level of quarantine measures is forecasted at the end of May 2020. In 2020, mankind faced the most serious challenges due to the pandemic of the new coronavirus COVID-19. Health systems in a number of developed countries were not able to help to a large number of seriously ill patients. A pandemic brings numerous fatal cases. The interest of specialists in various fields, as well as of the general public, raises the question: how long and how seriously will humanity be affected by a coronavirus infection. In a complex, difficult to predict situation, the role of research using mathematical models of epidemics may be important. The main models in this area have been known for over a century and still attract the attention of specialists. In this paper, an attempt is made to use the simplest epidemic models: SIR and SEIR to predict the spread of COVID-19 in Russia. Despite their simplicity, they are often used in epidemiology. They were used for the analysis of the current epidemic of the coronavirus COVID-19 Yang et al. (2020) ; Efimov and Ushirobira (2020) ; Zhong et al. (2020) . For example, on April 22, 2020, out of 598 publications in ArXiv.org, the title or abstract of which contains the words COVID-19 or SARS-CoV-2, 32 have the term SIR, and 12 have the term SEIR. 1 In some works the advantages of a simpler SIR model over a more complex model SEIR are mentioned Roda et al. (2020) . Simplicity and a small number of parameters are the advantages of both SIR and SEIR models, which are very significant under conditions of a lack of numerical initial data and structural incompleteness of the models. SIR and SEIR models were used to predict the distribution of COVID-19 in China, France, Italy, Germany, Portugal and several other countries Yang et al. (2020) ; Efimov and 1 Note, that the publication of six hundred articles on the problem in just two months indicates the serious interest of researchers in this topic. By Nov. 2020 dozens of thousands of the papers on COVID-19 matters can be found in the literature Ushirobira (2020); Zhong et al. (2020) ; Teles (2020) ; IHME (2020); Morato et al (2020) Some results on forecasting the distribution of COVID-19 in Russia as a whole are available in Efimov and Ushirobira (2020) . In Tamm (2020) a modified SEIR model is applied to forecasting spread of COVID-19 in Moscow. In this work, we attempt to forecast the spread of COVID-19 in Russia over a more representative dataset and compare the results of the forecast for SIR and SEIR models based on official statistics on incidence. To set the parameters of the models, which are difficult to determine from the official data, a scenario approach is used: the dynamics of the epidemic is analyzed at several possible parameter values. Some of the SEIR model parameters are selected using the Efimov and Ushirobira (2020) results. A preliminary version of this work is presented in Medarxiv Tomchin and Fradkov (2020) . In the classical SIR model (Kermack-McKendrick model) Kermack and McKendrick (1927) three groups of individuals are considered: susceptible (S), infected (I) and recovered/removed (R). Transmission of infection occurs from infected individuals to susceptible ones. It is believed that the recovered individuals acquire immunity and cannot be infected for the second time. The model is described by differential equations where S(t), I(t), R(t) is the numbers of susceptible, infected and recovered, correspondingly. Positive numbers β, γ are interpreted as parameters defining the rate of in- In 2020, mankind faced the most serious challenges due to the pandemic of the new coronavirus COVID-19. Health systems in a number of developed countries were not able to help to a large number of seriously ill patients. A pandemic brings numerous fatal cases. The interest of specialists in various fields, as well as of the general public, raises the question: how long and how seriously will humanity be affected by a coronavirus infection. In a complex, difficult to predict situation, the role of research using mathematical models of epidemics may be important. The main models in this area have been known for over a century and still attract the attention of specialists. In this paper, an attempt is made to use the simplest epidemic models: SIR and SEIR to predict the spread of COVID-19 in Russia. Despite their simplicity, they are often used in epidemiology. They were used for the analysis of the current epidemic of the coronavirus COVID-19 Yang et al. (2020) ; Efimov and Ushirobira (2020) ; Zhong et al. (2020) . For example, on April 22, 2020, out of 598 publications in ArXiv.org, the title or abstract of which contains the words COVID-19 or SARS-CoV-2, 32 have the term SIR, and 12 have the term SEIR. 1 In some works the advantages of a simpler SIR model over a more complex model SEIR are mentioned Roda et al. (2020) . Simplicity and a small number of parameters are the advantages of both SIR and SEIR models, which are very significant under conditions of a lack of numerical initial data and structural incompleteness of the models. SIR and SEIR models were used to predict the distribution of COVID-19 in China, France, Italy, Germany, Portugal and several other countries Yang et al. (2020) ; Efimov and 1 Note, that the publication of six hundred articles on the problem in just two months indicates the serious interest of researchers in this topic. By Nov. 2020 dozens of thousands of the papers on COVID-19 matters can be found in the literature Ushirobira (2020); Zhong et al. (2020) ; Teles (2020); IHME (2020); Morato et al (2020) Some results on forecasting the distribution of COVID-19 in Russia as a whole are available in Efimov and Ushirobira (2020) . In Tamm (2020) a modified SEIR model is applied to forecasting spread of COVID-19 in Moscow. In this work, we attempt to forecast the spread of COVID-19 in Russia over a more representative dataset and compare the results of the forecast for SIR and SEIR models based on official statistics on incidence. To set the parameters of the models, which are difficult to determine from the official data, a scenario approach is used: the dynamics of the epidemic is analyzed at several possible parameter values. Some of the SEIR model parameters are selected using the Efimov and Ushirobira (2020) results. A preliminary version of this work is presented in Medarxiv Tomchin and Fradkov (2020) . In the classical SIR model (Kermack-McKendrick model) Kermack and McKendrick (1927) three groups of individuals are considered: susceptible (S), infected (I) and recovered/removed (R). Transmission of infection occurs from infected individuals to susceptible ones. It is believed that the recovered individuals acquire immunity and cannot be infected for the second time. The model is described by differential equations where S(t), I(t), R(t) is the numbers of susceptible, infected and recovered, correspondingly. Positive numbers β, γ are interpreted as parameters defining the rate of in-Prediction of the COVID-19 spread in Russia based on SIR and SEIR models of epidemics Dmitry A. Tomchin * Alexander L. Fradkov * * Institute for Problems of Mechanical Engineering, Russian Academy of Sciences, Bolshoy Ave 61, Vasilievsky Ostrov, St. Petersburg, 199178, Russia. Abstract: An attempt is made to use the simplest epidemic models: SIR and SEIR to predict the spread of COVID-19 in Russia. Simplicity and a small number of parameters are very significant advantages of SIR and SEIR models in conditions of a lack of numerical initial data and structural incompleteness of models. The forecast of distribution of COVID-19 in Russia is carried out according to public data sets from March 10 to April 20, 2020. Comparison of forecast results by SIR and SEIR models are given. In both cases, the peak number of infected persons while maintaining the current level of quarantine measures is forecasted at the end of May 2020. In 2020, mankind faced the most serious challenges due to the pandemic of the new coronavirus COVID-19. Health systems in a number of developed countries were not able to help to a large number of seriously ill patients. A pandemic brings numerous fatal cases. The interest of specialists in various fields, as well as of the general public, raises the question: how long and how seriously will humanity be affected by a coronavirus infection. In a complex, difficult to predict situation, the role of research using mathematical models of epidemics may be important. The main models in this area have been known for over a century and still attract the attention of specialists. In this paper, an attempt is made to use the simplest epidemic models: SIR and SEIR to predict the spread of COVID-19 in Russia. Despite their simplicity, they are often used in epidemiology. They were used for the analysis of the current epidemic of the coronavirus COVID-19 Yang et al. (2020) ; Efimov and Ushirobira (2020); Zhong et al. (2020) . For example, on April 22, 2020, out of 598 publications in ArXiv.org, the title or abstract of which contains the words COVID-19 or SARS-CoV-2, 32 have the term SIR, and 12 have the term SEIR. 1 In some works the advantages of a simpler SIR model over a more complex model SEIR are mentioned Roda et al. (2020) . Simplicity and a small number of parameters are the advantages of both SIR and SEIR models, which are very significant under conditions of a lack of numerical initial data and structural incompleteness of the models. SIR and SEIR models were used to predict the distribution of COVID-19 in China, France, Italy, Germany, Portugal and several other countries Yang et al. (2020) ; Efimov and 1 Note, that the publication of six hundred articles on the problem in just two months indicates the serious interest of researchers in this topic. By Nov. 2020 dozens of thousands of the papers on COVID-19 matters can be found in the literature Ushirobira (2020); Zhong et al. (2020) ; Teles (2020); IHME (2020); Morato et al (2020) Some results on forecasting the distribution of COVID-19 in Russia as a whole are available in Efimov and Ushirobira (2020) . In Tamm (2020) a modified SEIR model is applied to forecasting spread of COVID-19 in Moscow. In this work, we attempt to forecast the spread of COVID-19 in Russia over a more representative dataset and compare the results of the forecast for SIR and SEIR models based on official statistics on incidence. To set the parameters of the models, which are difficult to determine from the official data, a scenario approach is used: the dynamics of the epidemic is analyzed at several possible parameter values. Some of the SEIR model parameters are selected using the Efimov and Ushirobira (2020) results. A preliminary version of this work is presented in Medarxiv Tomchin and Fradkov (2020) . In the classical SIR model (Kermack-McKendrick model) Kermack and McKendrick (1927) three groups of individuals are considered: susceptible (S), infected (I) and recovered/removed (R). Transmission of infection occurs from infected individuals to susceptible ones. It is believed that the recovered individuals acquire immunity and cannot be infected for the second time. The model is described by differential equations where S(t), I(t), R(t) is the numbers of susceptible, infected and recovered, correspondingly. Positive numbers β, γ are interpreted as parameters defining the rate of in-Prediction of the COVID-19 spread in Russia based on SIR and SEIR models of epidemics Dmitry A. Tomchin * Alexander L. Fradkov * * Institute for Problems of Mechanical Engineering, Russian Academy of Sciences, Bolshoy Ave 61, Vasilievsky Ostrov, St. Petersburg, 199178, Russia. Abstract: An attempt is made to use the simplest epidemic models: SIR and SEIR to predict the spread of COVID-19 in Russia. Simplicity and a small number of parameters are very significant advantages of SIR and SEIR models in conditions of a lack of numerical initial data and structural incompleteness of models. The forecast of distribution of COVID-19 in Russia is carried out according to public data sets from March 10 to April 20, 2020. Comparison of forecast results by SIR and SEIR models are given. In both cases, the peak number of infected persons while maintaining the current level of quarantine measures is forecasted at the end of May 2020. In 2020, mankind faced the most serious challenges due to the pandemic of the new coronavirus COVID-19. Health systems in a number of developed countries were not able to help to a large number of seriously ill patients. A pandemic brings numerous fatal cases. The interest of specialists in various fields, as well as of the general public, raises the question: how long and how seriously will humanity be affected by a coronavirus infection. In a complex, difficult to predict situation, the role of research using mathematical models of epidemics may be important. The main models in this area have been known for over a century and still attract the attention of specialists. In this paper, an attempt is made to use the simplest epidemic models: SIR and SEIR to predict the spread of COVID-19 in Russia. Despite their simplicity, they are often used in epidemiology. They were used for the analysis of the current epidemic of the coronavirus COVID-19 Yang et al. (2020) ; Efimov and Ushirobira (2020); Zhong et al. (2020) . For example, on April 22, 2020, out of 598 publications in ArXiv.org, the title or abstract of which contains the words COVID-19 or SARS-CoV-2, 32 have the term SIR, and 12 have the term SEIR. 1 In some works the advantages of a simpler SIR model over a more complex model SEIR are mentioned Roda et al. (2020) . Simplicity and a small number of parameters are the advantages of both SIR and SEIR models, which are very significant under conditions of a lack of numerical initial data and structural incompleteness of the models. SIR and SEIR models were used to predict the distribution of COVID-19 in China, France, Italy, Germany, Portugal and several other countries Yang et al. (2020) ; Efimov and Ushirobira (2020); Zhong et al. (2020) ; Teles (2020); IHME (2020); Morato et al (2020) Some results on forecasting the distribution of COVID-19 in Russia as a whole are available in Efimov and Ushirobira (2020) . In Tamm (2020) a modified SEIR model is applied to forecasting spread of COVID-19 in Moscow. In this work, we attempt to forecast the spread of COVID-19 in Russia over a more representative dataset and compare the results of the forecast for SIR and SEIR models based on official statistics on incidence. To set the parameters of the models, which are difficult to determine from the official data, a scenario approach is used: the dynamics of the epidemic is analyzed at several possible parameter values. Some of the SEIR model parameters are selected using the Efimov and Ushirobira (2020) results. A preliminary version of this work is presented in Medarxiv Tomchin and Fradkov (2020). In the classical SIR model (Kermack-McKendrick model) Kermack and McKendrick (1927) three groups of individuals are considered: susceptible (S), infected (I) and recovered/removed (R). Transmission of infection occurs from infected individuals to susceptible ones. It is believed that the recovered individuals acquire immunity and cannot be infected for the second time. The model is described by differential equations where S(t), I(t), R(t) is the numbers of susceptible, infected and recovered, correspondingly. Positive numbers β, γ are interpreted as parameters defining the rate of in- Dmitry A. Tomchin * Alexander L. Fradkov * * Institute for Problems of Mechanical Engineering, Russian Academy of Sciences, Bolshoy Ave 61, Vasilievsky Ostrov, St. Petersburg, 199178, Russia. Abstract: An attempt is made to use the simplest epidemic models: SIR and SEIR to predict the spread of COVID-19 in Russia. Simplicity and a small number of parameters are very significant advantages of SIR and SEIR models in conditions of a lack of numerical initial data and structural incompleteness of models. The forecast of distribution of COVID-19 in Russia is carried out according to public data sets from March 10 to April 20, 2020. Comparison of forecast results by SIR and SEIR models are given. In both cases, the peak number of infected persons while maintaining the current level of quarantine measures is forecasted at the end of May 2020. In 2020, mankind faced the most serious challenges due to the pandemic of the new coronavirus COVID-19. Health systems in a number of developed countries were not able to help to a large number of seriously ill patients. A pandemic brings numerous fatal cases. The interest of specialists in various fields, as well as of the general public, raises the question: how long and how seriously will humanity be affected by a coronavirus infection. In a complex, difficult to predict situation, the role of research using mathematical models of epidemics may be important. The main models in this area have been known for over a century and still attract the attention of specialists. In this paper, an attempt is made to use the simplest epidemic models: SIR and SEIR to predict the spread of COVID-19 in Russia. Despite their simplicity, they are often used in epidemiology. They were used for the analysis of the current epidemic of the coronavirus COVID-19 Yang et al. (2020) ; Efimov and Ushirobira (2020); Zhong et al. (2020) . For example, on April 22, 2020, out of 598 publications in ArXiv.org, the title or abstract of which contains the words COVID-19 or SARS-CoV-2, 32 have the term SIR, and 12 have the term SEIR. 1 In some works the advantages of a simpler SIR model over a more complex model SEIR are mentioned Roda et al. (2020) . Simplicity and a small number of parameters are the advantages of both SIR and SEIR models, which are very significant under conditions of a lack of numerical initial data and structural incompleteness of the models. SIR and SEIR models were used to predict the distribution of COVID-19 in China, France, Italy, Germany, Portugal and several other countries Yang et al. (2020) ; Efimov and Ushirobira (2020); Zhong et al. (2020) ; Teles (2020); IHME (2020); Morato et al (2020) Some results on forecasting the distribution of COVID-19 in Russia as a whole are available in Efimov and Ushirobira (2020) . In Tamm (2020) a modified SEIR model is applied to forecasting spread of COVID-19 in Moscow. In this work, we attempt to forecast the spread of COVID-19 in Russia over a more representative dataset and compare the results of the forecast for SIR and SEIR models based on official statistics on incidence. To set the parameters of the models, which are difficult to determine from the official data, a scenario approach is used: the dynamics of the epidemic is analyzed at several possible parameter values. Some of the SEIR model parameters are selected using the Efimov and Ushirobira (2020) results. A preliminary version of this work is presented in Medarxiv Tomchin and Fradkov (2020). In the classical SIR model (Kermack-McKendrick model) Kermack and McKendrick (1927) three groups of individuals are considered: susceptible (S), infected (I) and recovered/removed (R). Transmission of infection occurs from infected individuals to susceptible ones. It is believed that the recovered individuals acquire immunity and cannot be infected for the second time. The model is described by differential equations where S(t), I(t), R(t) is the numbers of susceptible, infected and recovered, correspondingly. Positive numbers β, γ are interpreted as parameters defining the rate of in- Abstract: An attempt is made to use the simplest epidemic models: SIR and SEIR to predict the spread of COVID-19 in Russia. Simplicity and a small number of parameters are very significant advantages of SIR and SEIR models in conditions of a lack of numerical initial data and structural incompleteness of models. The forecast of distribution of COVID-19 in Russia is carried out according to public data sets from March 10 to April 20, 2020. Comparison of forecast results by SIR and SEIR models are given. In both cases, the peak number of infected persons while maintaining the current level of quarantine measures is forecasted at the end of May 2020. In 2020, mankind faced the most serious challenges due to the pandemic of the new coronavirus COVID-19. Health systems in a number of developed countries were not able to help to a large number of seriously ill patients. A pandemic brings numerous fatal cases. The interest of specialists in various fields, as well as of the general public, raises the question: how long and how seriously will humanity be affected by a coronavirus infection. In a complex, difficult to predict situation, the role of research using mathematical models of epidemics may be important. The main models in this area have been known for over a century and still attract the attention of specialists. In this paper, an attempt is made to use the simplest epidemic models: SIR and SEIR to predict the spread of COVID-19 in Russia. Despite their simplicity, they are often used in epidemiology. (2020); IHME (2020); Morato et al (2020) Some results on forecasting the distribution of COVID-19 in Russia as a whole are available in Efimov and Ushirobira (2020) . In Tamm (2020) a modified SEIR model is applied to forecasting spread of COVID-19 in Moscow. In this work, we attempt to forecast the spread of COVID-19 in Russia over a more representative dataset and compare the results of the forecast for SIR and SEIR models based on official statistics on incidence. To set the parameters of the models, which are difficult to determine from the official data, a scenario approach is used: the dynamics of the epidemic is analyzed at several possible parameter values. Some of the SEIR model parameters are selected using the Efimov and Ushirobira (2020) results. A preliminary version of this work is presented in Medarxiv Tomchin and Fradkov (2020). In the classical SIR model (Kermack-McKendrick model) Kermack and McKendrick (1927) three groups of individuals are considered: susceptible (S), infected (I) and recovered/removed (R). Transmission of infection occurs from infected individuals to susceptible ones. It is believed that the recovered individuals acquire immunity and cannot be infected for the second time. The model is described by differential equations where Abstract: An attempt is made to use the simplest epidemic models: SIR and SEIR to predict the spread of COVID-19 in Russia. Simplicity and a small number of parameters are very significant advantages of SIR and SEIR models in conditions of a lack of numerical initial data and structural incompleteness of models. The forecast of distribution of COVID-19 in Russia is carried out according to public data sets from March 10 to April 20, 2020. Comparison of forecast results by SIR and SEIR models are given. In both cases, the peak number of infected persons while maintaining the current level of quarantine measures is forecasted at the end of May 2020. In 2020, mankind faced the most serious challenges due to the pandemic of the new coronavirus COVID-19. Health systems in a number of developed countries were not able to help to a large number of seriously ill patients. A pandemic brings numerous fatal cases. The interest of specialists in various fields, as well as of the general public, raises the question: how long and how seriously will humanity be affected by a coronavirus infection. In a complex, difficult to predict situation, the role of research using mathematical models of epidemics may be important. The main models in this area have been known for over a century and still attract the attention of specialists. In this paper, an attempt is made to use the simplest epidemic models: SIR and SEIR to predict the spread of COVID-19 in Russia. Despite their simplicity, they are often used in epidemiology. In this work, we attempt to forecast the spread of COVID-19 in Russia over a more representative dataset and compare the results of the forecast for SIR and SEIR models based on official statistics on incidence. To set the parameters of the models, which are difficult to determine from the official data, a scenario approach is used: the dynamics of the epidemic is analyzed at several possible parameter values. Some of the SEIR model parameters are selected using the Efimov and Ushirobira (2020) results. A preliminary version of this work is presented in Medarxiv Tomchin and Fradkov (2020). In the classical SIR model (Kermack-McKendrick model) Kermack and McKendrick (1927) three groups of individuals are considered: susceptible (S), infected (I) and recovered/removed (R). Transmission of infection occurs from infected individuals to susceptible ones. It is believed that the recovered individuals acquire immunity and cannot be infected for the second time. The model is described by differential equations Abstract: An attempt is made to use the simplest epidemic models: SIR and SEIR to predict the spread of COVID-19 in Russia. Simplicity and a small number of parameters are very significant advantages of SIR and SEIR models in conditions of a lack of numerical initial data and structural incompleteness of models. The forecast of distribution of COVID-19 in Russia is carried out according to public data sets from March 10 to April 20, 2020. Comparison of forecast results by SIR and SEIR models are given. In both cases, the peak number of infected persons while maintaining the current level of quarantine measures is forecasted at the end of May 2020. In 2020, mankind faced the most serious challenges due to the pandemic of the new coronavirus COVID-19. Health systems in a number of developed countries were not able to help to a large number of seriously ill patients. A pandemic brings numerous fatal cases. The interest of specialists in various fields, as well as of the general public, raises the question: how long and how seriously will humanity be affected by a coronavirus infection. In a complex, difficult to predict situation, the role of research using mathematical models of epidemics may be important. The main models in this area have been known for over a century and still attract the attention of specialists. In this paper, an attempt is made to use the simplest epidemic models: SIR and SEIR to predict the spread of COVID-19 in Russia. Despite their simplicity, they are often used in epidemiology. In this work, we attempt to forecast the spread of COVID-19 in Russia over a more representative dataset and compare the results of the forecast for SIR and SEIR models based on official statistics on incidence. To set the parameters of the models, which are difficult to determine from the official data, a scenario approach is used: the dynamics of the epidemic is analyzed at several possible parameter values. Some of the SEIR model parameters are selected using the Efimov and Ushirobira (2020) results. A preliminary version of this work is presented in Medarxiv Tomchin and Fradkov (2020). In the classical SIR model (Kermack-McKendrick model) Kermack and McKendrick (1927) three groups of individuals are considered: susceptible (S), infected (I) and recovered/removed (R). Transmission of infection occurs from infected individuals to susceptible ones. It is believed that the recovered individuals acquire immunity and cannot be infected for the second time. The model is described by differential equations where S(t), I(t), R(t) is the numbers of susceptible, infected and recovered, correspondingly. Positive numbers β, γ are interpreted as parameters defining the rate of in- Abstract: An attempt is made to use the simplest epidemic models: SIR and SEIR to predict the spread of COVID-19 in Russia. Simplicity and a small number of parameters are very significant advantages of SIR and SEIR models in conditions of a lack of numerical initial data and structural incompleteness of models. The forecast of distribution of COVID-19 in Russia is carried out according to public data sets from March 10 to April 20, 2020. Comparison of forecast results by SIR and SEIR models are given. In both cases, the peak number of infected persons while maintaining the current level of quarantine measures is forecasted at the end of May 2020. In 2020, mankind faced the most serious challenges due to the pandemic of the new coronavirus COVID-19. Health systems in a number of developed countries were not able to help to a large number of seriously ill patients. A pandemic brings numerous fatal cases. The interest of specialists in various fields, as well as of the general public, raises the question: how long and how seriously will humanity be affected by a coronavirus infection. In a complex, difficult to predict situation, the role of research using mathematical models of epidemics may be important. The main models in this area have been known for over a century and still attract the attention of specialists. In this paper, an attempt is made to use the simplest epidemic models: SIR and SEIR to predict the spread of COVID-19 in Russia. Despite their simplicity, they are often used in epidemiology. They were used for the analysis of the current epidemic of the coronavirus COVID-19 Yang et al. (2020) ; Efimov and Ushirobira (2020); Zhong et al. (2020) . For example, on April 22, 2020, out of 598 publications in ArXiv.org, the title or abstract of which contains the words COVID-19 or SARS-CoV-2, 32 have the term SIR, and 12 have the term SEIR. 1 In some works the advantages of a simpler SIR model over a more complex model SEIR are mentioned Roda et al. (2020) . Simplicity and a small number of parameters are the advantages of both SIR and SEIR models, which are very significant under conditions of a lack of numerical initial data and structural incompleteness of the models. SIR and SEIR models were used to predict the distribution of COVID-19 in China, France, Italy, Germany, Portugal and several other countries Yang et al. (2020) ; Efimov and Ushirobira (2020); Zhong et al. (2020) ; Teles (2020); IHME (2020); Morato et al (2020) Some results on forecasting the distribution of COVID-19 in Russia as a whole are available in Efimov and Ushirobira (2020) . In Tamm (2020) a modified SEIR model is applied to forecasting spread of COVID-19 in Moscow. In this work, we attempt to forecast the spread of COVID-19 in Russia over a more representative dataset and compare the results of the forecast for SIR and SEIR models based on official statistics on incidence. To set the parameters of the models, which are difficult to determine from the official data, a scenario approach is used: the dynamics of the epidemic is analyzed at several possible parameter values. Some of the SEIR model parameters are selected using the Efimov and Ushirobira (2020) results. A preliminary version of this work is presented in Medarxiv Tomchin and Fradkov (2020) . In the classical SIR model (Kermack-McKendrick model) Kermack and McKendrick (1927) three groups of individuals are considered: susceptible (S), infected (I) and recovered/removed (R). Transmission of infection occurs from infected individuals to susceptible ones. It is believed that the recovered individuals acquire immunity and cannot be infected for the second time. The model is described by differential equations where S(t), I(t), R(t) is the numbers of susceptible, infected and recovered, correspondingly. Positive numbers β, γ are interpreted as parameters defining the rate of in- Dmitry A. Tomchin * Alexander L. Fradkov * * Institute for Problems of Mechanical Engineering, Russian Academy of Sciences, Bolshoy Ave 61, Vasilievsky Ostrov, St. Petersburg, 199178, Russia. Abstract: An attempt is made to use the simplest epidemic models: SIR and SEIR to predict the spread of COVID-19 in Russia. Simplicity and a small number of parameters are very significant advantages of SIR and SEIR models in conditions of a lack of numerical initial data and structural incompleteness of models. The forecast of distribution of COVID-19 in Russia is carried out according to public data sets from March 10 to April 20, 2020. Comparison of forecast results by SIR and SEIR models are given. In both cases, the peak number of infected persons while maintaining the current level of quarantine measures is forecasted at the end of May 2020. In 2020, mankind faced the most serious challenges due to the pandemic of the new coronavirus COVID-19. Health systems in a number of developed countries were not able to help to a large number of seriously ill patients. A pandemic brings numerous fatal cases. The interest of specialists in various fields, as well as of the general public, raises the question: how long and how seriously will humanity be affected by a coronavirus infection. In a complex, difficult to predict situation, the role of research using mathematical models of epidemics may be important. The main models in this area have been known for over a century and still attract the attention of specialists. In this paper, an attempt is made to use the simplest epidemic models: SIR and SEIR to predict the spread of COVID-19 in Russia. Despite their simplicity, they are often used in epidemiology. They were used for the analysis of the current epidemic of the coronavirus COVID-19 Yang et al. (2020) ; Efimov and Ushirobira (2020); Zhong et al. (2020) . For example, on April 22, 2020, out of 598 publications in ArXiv.org, the title or abstract of which contains the words COVID-19 or SARS-CoV-2, 32 have the term SIR, and 12 have the term SEIR. 1 In some works the advantages of a simpler SIR model over a more complex model SEIR are mentioned Roda et al. (2020) . Simplicity and a small number of parameters are the advantages of both SIR and SEIR models, which are very significant under conditions of a lack of numerical initial data and structural incompleteness of the models. SIR and SEIR models were used to predict the distribution of COVID-19 in China, France, Italy, Germany, Portugal and several other countries Yang et al. (2020) ; Efimov and Ushirobira (2020); Zhong et al. (2020) ; Teles (2020); IHME (2020); Morato et al (2020) Some results on forecasting the distribution of COVID-19 in Russia as a whole are available in Efimov and Ushirobira (2020) . In Tamm (2020) a modified SEIR model is applied to forecasting spread of COVID-19 in Moscow. In this work, we attempt to forecast the spread of COVID-19 in Russia over a more representative dataset and compare the results of the forecast for SIR and SEIR models based on official statistics on incidence. To set the parameters of the models, which are difficult to determine from the official data, a scenario approach is used: the dynamics of the epidemic is analyzed at several possible parameter values. Some of the SEIR model parameters are selected using the Efimov and Ushirobira (2020) results. A preliminary version of this work is presented in Medarxiv Tomchin and Fradkov (2020). In the classical SIR model (Kermack-McKendrick model) Kermack and McKendrick (1927) three groups of individuals are considered: susceptible (S), infected (I) and recovered/removed (R). Transmission of infection occurs from infected individuals to susceptible ones. It is believed that the recovered individuals acquire immunity and cannot be infected for the second time. The model is described by differential equations where S(t), I(t), R(t) is the numbers of susceptible, infected and recovered, correspondingly. Positive numbers β, γ are interpreted as parameters defining the rate of in- To apply the model to a specific situation, it is necessary to calibrate it i.e. to determine the values of the parameters and the initial conditions from the experimental data. We will use the official data of the World Health Organization (WHO), presented in a convenient form on the website Worldometers Worldometer (2020). Since WHO data is updated once a day, it is convenient to switch from a differential model (1) to a discrete one: In our study the Worldometer (2020) data for calibration from March 10, 2020 to April 20, 2020 were used for calibration. The initial conditions are equal to the values of the variables on March 10, 2020. To estimate the parameters, we use the least squares method. We apply it in two steps: first evaluate γ using the formulâ γ = Model calibration results are shown in Fig.1 . At first glance, the error seems to be large, but it must be kept in mind that the model should first of all take into account the total number of infected people, rather than the rate of its growth. However, one can try to refine the estimates. From the graphs it can be seen that the accuracy of the approximation decreases after March 27, since the real rate of increase in incidence decreases. Obviously, this is due to the introduction of partial self-isolation in the Russian Federation during the third decade of March. For a more accurate prediction, taking into account self-isolation, the time interval of observations was divided into two parts: 1) before March 28 and 2) from March 28 to April 20 and the model parameters on each set were evaluated separately. If we evaluate the parameters of the model separately from the data from March 11 to 20, we getβ = 0.26, while according to the data from March 21 to April 15 we getβ = 0.175, i.e. the same value as in the first case. The estimates of the parameter γ also coincide in both cases:γ = 0.0164. The results of model calibration using a truncated data set are presented in Fig.2 . The results of the forecast of the development of the epidemic 120 days in advance are presented in Fig. 3 . In Fig. 3 , a forecast for the period after April 20 is based on data from March 10 till April 20. The graph shows that the peak incidence (reaching a plateau) is predicted to be around the 70th day, i.e. June 30, 2020. For comparison, in Fig. 3 , b, for the forecast for the period after March 28, the values of the model parameters calculated from the data of March 10-28, i.e. before the introduction of self-isolation are used. The graph shows that if the self-isolation mode were not introduced, the peak would have come earlier: on the 60th day, i.e. 26 of May and its height would be significantly higher. The results can be used to assess the intensity of quarantine measures required for a given slowdown in the development of outbreaks of epidemics and pandemics. Thus, even the simplest SIR model shows the efficiency of introducing a self-isolation regime from the point of view of stretching the incidence growth phase over time and allows us to give useful estimates of the incidence growth time. As for the number of infected people, its prediction at first glance looks disappointing: in order to begin to decrease, this number should approach the total population of the country. However, it should be kept in mind that in the population there are a large number of asymptomatic infected people who easily tolerate the disease often do not even knowing about it. The ability to quantify asymptomatic patients will be shown in the next section using the SEIR model. The SEIR model (Susceptible -Exposed -Infected -Recovered) takes into account the incubation period of the disease Wang et al. (2016) . This model was used to predict the spread of COVID-19 in China Yang et al. (2020) and in European countries Efimov and Ushirobira (2020) . We will use the modified SEIR model with discrete time Efimov and Ushirobira (2020) , described by the equations Here S(t), I(t), R(t) have the same meaning as in the SIR model (2), and E(t) is the number of the infected individuals in the stage of incubation (latent) period. Total population N = S(t) + E(t) + I(t) + R(t), is assumed to be constant as before. The SEIR model (3) has two parameters the values of which can be estimated based on real statistical data: γ, β. The parameter γ > 0 represents the intensity of mortality and recovery, the parameter β > 0 corresponds to the rate of infection of the virus by susceptible people during contact with infected or latent ones. The parameter σ > 0 determines the incubation rate at which symptoms appear in individuals in the latent period. The value of σ can be chosen inversely proportional to the average incubation period of COVID-19: σ = 1/7. The number p C > 0 corresponds to the number of contacts per person per day for infected I(t) (it is assumed that if infected people with symptoms are quarantined (lockdown) then the number of contacts decreases); r t > p C is the number of contacts per person per day for the latent period E(t). Parameters r t , p C can be changed by applying measures that regulate social distance (quarantine, lockdown). They also depend on population density and social traditions. An important peculiarity of the COVID-19 pandemic, as well as a number of previous epidemics, is the significant difference between the actual number of infected individuals and the number of the documented ones. This is due to the presence of a significant number of asymptomatic infections, due to the inability to carry out full testing, as well as due to the inaccuracy and insufficient sensitivity of the tests. Statistical data for Europe and the United States suggest that the proportion of undocumented cases can vary from 40% to 90% Li et al. (2020) . Denote by α the ratio of the total number of infected to the number of documented infected. Given the significant uncertainty of α and the difficulty of determining α, we will carry out calculations for α = 5 2 and for α = 10. First consider the case α = 5. As a result of evaluating the parameters of the SEIR model by LS method according to data from March 10 to April 20, we get: β = 0.027, γ = 0.017. Following Efimov and Ushirobira (2020) and based on the proximity of social traditions in France and in Russia, we will adopt the following values of the remaining parameters as basic ones: σ = 1/7; p C = 2, r = 10. The above values correspond to the current quarantine mode (model-1). The initial conditions S(t), I(t), E(t), R(t) for t = 0 are also set similarly to Efimov and Ushirobira (2020) , using the relations E(t) = (I(t + 1) − (1 − γ)I(t)/σ, S(0) = N, (t) . The data on the number of infectedĪ(t), the number of recovered H(t) and the number of dead D(t) are taken from official sources Worldometer (2020) . Calibration results are presented in Fig. 4 . They show reasonable accuracy of approximation of real data by the model. The forecast results for 120 days according to the model are presented in Fig. 5 . In Fig. 5a ) (left) the results of the forecast for the period from April 20 to August 18, 2020 according to data from 2020/03/10 to 2020/04/20 are shown. In the graphs, the blue curves correspond to the function S(t), the yellow curves correspond to the function E(t), the red curves correspond to the function I(t), and the violet curves to the function R(t). It can be seen that the peak of the epidemic is predicted on the 40th day that is May 30, 2020. For comparison, in Fig. 5, b) the results of the forecast for the model built according to the data from March 10 to April 15 are shown. In such a model less data are used and it would seem that it should have less predictive power. However, the figure shows that even such an "outdated" model predicts the peak of the number of infected individuals with approximately same accuracy: on the 45th day or also on May 30th. In Fig. 6 a similar forecast for the enhanced quarantine mode (model-2: p C = 1.5, r = 7.5) is presented. In Fig. 7 the forecast for an even more strict quarantine mode (model-3: p C = 1, r = 5) is presented. Thus, as can be seen from Figs. 5-7, in the standard quarantine mode, the peak incidence occurs on the 40th day -May 30, 2020, in strict mode -on the 60th day -June 10, and in the very strict mode on the 100th day -July 20, 2020. It is also seen that a more strict regime leads to a decrease in the peak incidence rate: in the standard regime, the number of cases can reach 50 million, in the strict quarantine regime the magnitude of the peak drops to 20 million, and in a very strict regime it becomes less than 10 million. Note that the number of documented cases is still α times less. We proceed to the case α = 10. The model is calibrated similarly to the case α = 5. Calibration results are presented in Fig.8 . They exhibit good accuracy of approximation of real data by the model everywhere, except for the initial section. The forecast results for 120 days for the period from April 20 till August 18, 2020 according to the model are presented in Fig. 9 . The graphs show that Fig. 6 . Forecast based on SEIR model-2 (strict quarantine) for α = 5 according to data from 2020/03/10 till 2020/04/20. Fig. 7 . Forecast based on SEIR model-3 (very strict quarantine) at α = 5 according to data from 2020/03/10 till 2020/04/20. the predicted dates for reaching the maximum number of infected people will fall on May 23-24. In Fig. 10 a similar forecast for strict quarantine mode (model-2, p C = 1.5, r = 7.5) is presented. Fig. 11 demonstrates forecast for a " very strict " quarantine mode (model-3: p C = 1, r = 5). In the paper the mathematical modelling of the spread of the COVID-19 Coronavirus Disease Based on Statistics from incidence in Russia from March 10 to April 20, 2020 is discussed. Two classical simple models of epidemiology are considered: SIR and SEIR. The parameters of the models are estimated based on published statistical data. Options for predicting the development of morbidity in the Russian Federation for 120 days are built when changing the parameter of social distance corresponding to different modes of isolation of the population (quarantine) and the parameter α, which determines the proportion of undocumented cases of infection. Our calculations show that the SIR and SEIR models can produce reasonable results under conditions of limited amount of data and lack of reliability of statistics. They allow one to obtain not only qualitative, but also quantitative forecasts of the spread of the virus. In particular, it follows from the calculations that the peak of the number of infected in Russia should be revived no earlier than the end of May. The real WHO data, see Fig.12 Worldometer (2020) confirm the above conclusions. The forecast results can be used to evaluate the efficiency of lockdown regimes, Fig. 10 . Forecast for SEIR model-2 (strict quarantine) for α = 10 according to the data of 2020/03/10 -2020/04/20. Fig. 11 . Forecast for a " very strict " quarantine mode (model-3: p C = 1, r = 5) for α = 10 according to the data of 2020/03/10 -2020/04/20. to select the optimal strategy, for example, by periodically strengthening and weakening quarantine measures Eubank et al. (2020) , etc. It is worth to note that in April-May, 2020, the media repeatedly expressed, with reference to officials, various statements about time of the onset of the "peak" or "plateau" of the spread of the COVID-19 virus in Russia. In mid-April they spoke of a peak in late April or early May 2020, and in late April and early May they spoke of a peak in mid-May. See, e.g. "MOSCOW, April 7. / TASS /. The situation with the spread of coronavirus in Russia will reach a plateau after about 10-14 days, the head of the Federal Biomedical Agency said Veronika Skvortsova on Tuesday in an interview with Russia-24" CORONA24News (2020). However, the forecast results from the SEIR model based on the data from March 10 to April 15, show that such an "outdated" model predicts that the peak of the number Fig. 12 . Number of infected in Russia from Feb.15, 2020 till June 25, 2020 according to Worldometer (2020) of infected may happen on the 45th day, i.e. at the end of May. In the end of June, 2020 one can see that our forecasts of the peak dates are closer to reality than the official ones. It is worth noticing that the calculations were carried out under a number of additional assumptions: it was believed that the entire population of the country was susceptible to the virus, the virus does not have seasonality, etc. Under such conditions, the final scenario is that a significant part of the population will eventually become ill, only the rate of spread of the disease will change. In reality, the spread of infection can be limited by some factors that are currently unknown and were not taken into account here. Further studies should be carried out for models that take into account spatial heterogeneities (network or compartment models) and incomplete susceptibility of the population to infection. In addition it is interesting to compare our forecasting results with those obtained from modified SIR/SEIR models, e.g. with the results based on the fSIR (feedback SIR) model Franco (2020) or other models, e.g. autoregressive integrated moving average (ARIMA) or error-trend-seasonal (ETS) Perone (2020); Wang Y et al. (2020) . On an interval prediction of covid-19 development based on a SEIR epidemic model impact of non-pharmaceutical interventions (NPIs) to reduce COVID-19 mortality and healthcare demand A feedback SIR (fSIR) model highlights advantages and limitations of infection-based social distancing. arXiv The Institute for Health Metrics and Evaluation A contribution to the mathematical theory of epidemics Substantial undocumented infection facilitates the rapid dissemination of novel coronavirus (SARS-CoV-2) An optimal predictive control strategy for COVID-19 (SARS-CoV-2) social distancing policies in Brazil ARIMA forecasting of COVID-19 incidence in Italy, Russia, and the USA. arXiv Why is it difficult to accurately predict the COVID-19 epidemic? Infectious Disease Modelling COVID-19 in Moscow: prognoses and scenarios. FARMAKOEKONOMIKA. Modern Pharmacoeconomic and Pharmacoepidemiology A time-dependent SEIR model to analyse the evolution of the SARS-CoV-2 epidemic outbreak in Portugal. arXiv.org Partial Prediction of the Virus COVID-19 Spread in Russia Based on SIR and SEIR Models Estimating the Prevalence and Mortality of Coronavirus Disease 2019 (COVID-19) in the USA, the UK, Russia, and India Infection And Drug Resistance V Statistical physics of vaccination Modified SEIR and AI prediction of the epidemics trend of COVID-19 in China under public health interventions Early prediction of the 2019 novel coronavirus outbreak in the mainland China based on simple mathematical model