key: cord-0798086-0uwv7e2p authors: Mallick, Piklu; Bhowmick, Sourav; Panja, Surajit title: Prediction of COVID-19 Infected Population for Indian States through a State Interaction Network-based SEIR Epidemic Model ⁎ ⁎ This work is supported by Science and Engineering Research Board (SERB), Department of Science and Technology, Govt. of India with project reference no. MSC/2020/000039. date: 2022-12-31 journal: IFAC-PapersOnLine DOI: 10.1016/j.ifacol.2022.04.113 sha: 9ff7f09a1f1d682d34011ae98588405b35ddb95b doc_id: 798086 cord_uid: 0uwv7e2p Objective of this present study is to predict the COVID-19 trajectories in terms of infected population of Indian states. In this work, a state interaction network of sixteen Indian states with highest number of infected caseload is considered, based on networked Susceptible-Exposed-Infected-Recovered (SEIR) epidemic model. An intervention term has been introduced in order to capture the effect of lockdown with different stringencies at different periods of time. The model has been fitted using least absolute shrinkage and selection operator (LASSO). Machine learning methods have been used to train the parameters of the model, cross-validate the data, and predict the parameters. The predictions of infected population for each of the sixteen states have been shown using data considered from January 1, 2021 till writing this manuscript on June 25, 2021. Finally, the effectiveness of the model is manifested by the calculated mean error and confidence interval. The devastation of the pandemic COVID-19 has left the mankind exasperated to find for any remotely possible solution and intensify the combating efforts against the deadly transmission of the disease. Originated in China (ECDC, 2020) , (WCHC, 2019) and gradually engulfing the entire world, this deadly disease has been dangerously affecting all of our daily lives and various other day-to-day activities for the last one year or so now, infecting hundred millions of people and claiming more than millions of lives globally till June 25, 2021 June 25, (worldometer, 2021 . With regard to the impact of the disease in national scenario of India, millions of people have been infected with the virus while lakhs have died of the disease till June 25, 2021 (India, 2021) . Though, India managed to flatten the curve of COVID-19 during the first wave, and minimised the per day caseload; but the inevitable second wave hit hard and devastated India. The impact of second wave is approximately four times worse than the previous one. Quite naturally then, understanding the evolution and spreading dynamics of the disease and its future course in Indian states is of utmost importance at this point of time for major health and security concerns in the society as well as risk assessment and focussed medical support in regions. The transmission dynamics of every viral disease can ⋆ This work is supported by Science and Engineering Research Board (SERB), Department of Science and Technology, Govt. of India with project reference no. MSC/2020/000039. be dictated by existing dynamical epidemiological model (Draief and Massoulie, 2009) , (Bailey et al., 1975) . In this regard, Bernoulli developed the first known mathematical epidemiological model in eighteenth century that captures the spreading of smallpox (Bernoulli, 1760) . Incidentally, two of the classical compartmental epidemiological transmission models are (i) Susceptible (S)-Infected (I)-Susceptible (S), i.e., SIS model (Kermack and McKendrick, 1932) and (ii) Susceptible (S)-Infected (I)-Recovered (R) model, i.e., SIR model (Kermack and McKendrick, 1927) , both developed by Kermack and Mckendrick. In the former model, basically all the individuals start as susceptible and becomes infected when comes in contact with an infectious individual, and finally on getting recovered again becomes susceptible to the disease. On the other hand, in the later model, the infected individuals when recover from the disease stay recovered for some duration due to either enhanced immunity after the disease or due to the effect of the vaccine. These models are associated with two very important parameter namely (i) transmission rate (β) which represents the probability of infection and (ii) curing rate (δ) that represents the probability of recovery from the disease. However, in the case of COVID-19, it has been observed that COVID-19 infected patients can be both symptomatic and asymptomatic. The symptomatic patients are those which have apparent symptoms of the disease and require immediate medical attention. On the other hand, asymptomatic patients are those which are infected by other symptomatic or asymptomatic patients and carries the disease with them during a period known The devastation of the pandemic COVID-19 has left the mankind exasperated to find for any remotely possible solution and intensify the combating efforts against the deadly transmission of the disease. Originated in China (ECDC, 2020) , (WCHC, 2019) and gradually engulfing the entire world, this deadly disease has been dangerously affecting all of our daily lives and various other day-to-day activities for the last one year or so now, infecting hundred millions of people and claiming more than millions of lives globally till June 25, 2021 June 25, (worldometer, 2021 . With regard to the impact of the disease in national scenario of India, millions of people have been infected with the virus while lakhs have died of the disease till June 25, 2021 (India, 2021) . Though, India managed to flatten the curve of COVID-19 during the first wave, and minimised the per day caseload; but the inevitable second wave hit hard and devastated India. The impact of second wave is approximately four times worse than the previous one. Quite naturally then, understanding the evolution and spreading dynamics of the disease and its future course in Indian states is of utmost importance at this point of time for major health and security concerns in the society as well as risk assessment and focussed medical support in regions. The transmission dynamics of every viral disease can ⋆ This work is supported by Science and Engineering Research Board (SERB), Department of Science and Technology, Govt. of India with project reference no. MSC/2020/000039. be dictated by existing dynamical epidemiological model (Draief and Massoulie, 2009) , (Bailey et al., 1975) . In this regard, Bernoulli developed the first known mathematical epidemiological model in eighteenth century that captures the spreading of smallpox (Bernoulli, 1760) . Incidentally, two of the classical compartmental epidemiological transmission models are (i) Susceptible (S)-Infected (I)-Susceptible (S), i.e., SIS model (Kermack and McKendrick, 1932) and (ii) Susceptible (S)-Infected (I)-Recovered (R) model, i.e., SIR model (Kermack and McKendrick, 1927) , both developed by Kermack and Mckendrick. In the former model, basically all the individuals start as susceptible and becomes infected when comes in contact with an infectious individual, and finally on getting recovered again becomes susceptible to the disease. On the other hand, in the later model, the infected individuals when recover from the disease stay recovered for some duration due to either enhanced immunity after the disease or due to the effect of the vaccine. These models are associated with two very important parameter namely (i) transmission rate (β) which represents the probability of infection and (ii) curing rate (δ) that represents the probability of recovery from the disease. However, in the case of COVID-19, it has been observed that COVID-19 infected patients can be both symptomatic and asymptomatic. The symptomatic patients are those which have apparent symptoms of the disease and require immediate medical attention. On the other hand, asymptomatic patients are those which are infected by other symptomatic or asymptomatic patients and carries the disease with them during a period known The devastation of the pandemic COVID-19 has left the mankind exasperated to find for any remotely possible solution and intensify the combating efforts against the deadly transmission of the disease. Originated in China (ECDC, 2020), (WCHC, 2019) and gradually engulfing the entire world, this deadly disease has been dangerously affecting all of our daily lives and various other day-to-day activities for the last one year or so now, infecting hundred millions of people and claiming more than millions of lives globally till June 25, 2021 June 25, (worldometer, 2021 . With regard to the impact of the disease in national scenario of India, millions of people have been infected with the virus while lakhs have died of the disease till June 25, 2021 (India, 2021) . Though, India managed to flatten the curve of COVID-19 during the first wave, and minimised the per day caseload; but the inevitable second wave hit hard and devastated India. The impact of second wave is approximately four times worse than the previous one. Quite naturally then, understanding the evolution and spreading dynamics of the disease and its future course in Indian states is of utmost importance at this point of time for major health and security concerns in the society as well as risk assessment and focussed medical support in regions. The transmission dynamics of every viral disease can ⋆ This work is supported by Science and Engineering Research Board (SERB), Department of Science and Technology, Govt. of India with project reference no. MSC/2020/000039. be dictated by existing dynamical epidemiological model (Draief and Massoulie, 2009) , (Bailey et al., 1975) . In this regard, Bernoulli developed the first known mathematical epidemiological model in eighteenth century that captures the spreading of smallpox (Bernoulli, 1760) . Incidentally, two of the classical compartmental epidemiological transmission models are (i) Susceptible (S)-Infected (I)-Susceptible (S), i.e., SIS model (Kermack and McKendrick, 1932) and (ii) Susceptible (S)-Infected (I)-Recovered (R) model, i.e., SIR model (Kermack and McKendrick, 1927) , both developed by Kermack and Mckendrick. In the former model, basically all the individuals start as susceptible and becomes infected when comes in contact with an infectious individual, and finally on getting recovered again becomes susceptible to the disease. On the other hand, in the later model, the infected individuals when recover from the disease stay recovered for some duration due to either enhanced immunity after the disease or due to the effect of the vaccine. These models are associated with two very important parameter namely (i) transmission rate (β) which represents the probability of infection and (ii) curing rate (δ) that represents the probability of recovery from the disease. However, in the case of COVID-19, it has been observed that COVID-19 infected patients can be both symptomatic and asymptomatic. The symptomatic patients are those which have apparent symptoms of the disease and require immediate medical attention. On the other hand, asymptomatic patients are those which are infected by other symptomatic or asymptomatic patients and carries the disease with them during a period known The devastation of the pandemic COVID-19 has left the mankind exasperated to find for any remotely possible solution and intensify the combating efforts against the deadly transmission of the disease. Originated in China (ECDC, 2020), (WCHC, 2019) and gradually engulfing the entire world, this deadly disease has been dangerously affecting all of our daily lives and various other day-to-day activities for the last one year or so now, infecting hundred millions of people and claiming more than millions of lives globally till June 25, 2021 June 25, (worldometer, 2021 . With regard to the impact of the disease in national scenario of India, millions of people have been infected with the virus while lakhs have died of the disease till June 25, 2021 (India, 2021) . Though, India managed to flatten the curve of COVID-19 during the first wave, and minimised the per day caseload; but the inevitable second wave hit hard and devastated India. The impact of second wave is approximately four times worse than the previous one. Quite naturally then, understanding the evolution and spreading dynamics of the disease and its future course in Indian states is of utmost importance at this point of time for major health and security concerns in the society as well as risk assessment and focussed medical support in regions. The transmission dynamics of every viral disease can ⋆ This work is supported by Science and Engineering Research Board (SERB), Department of Science and Technology, Govt. of India with project reference no. MSC/2020/000039. be dictated by existing dynamical epidemiological model (Draief and Massoulie, 2009) , (Bailey et al., 1975) . In this regard, Bernoulli developed the first known mathematical epidemiological model in eighteenth century that captures the spreading of smallpox (Bernoulli, 1760) . Incidentally, two of the classical compartmental epidemiological transmission models are (i) Susceptible (S)-Infected (I)-Susceptible (S), i.e., SIS model (Kermack and McKendrick, 1932) and (ii) Susceptible (S)-Infected (I)-Recovered (R) model, i.e., SIR model (Kermack and McKendrick, 1927) , both developed by Kermack and Mckendrick. In the former model, basically all the individuals start as susceptible and becomes infected when comes in contact with an infectious individual, and finally on getting recovered again becomes susceptible to the disease. On the other hand, in the later model, the infected individuals when recover from the disease stay recovered for some duration due to either enhanced immunity after the disease or due to the effect of the vaccine. These models are associated with two very important parameter namely (i) transmission rate (β) which represents the probability of infection and (ii) curing rate (δ) that represents the probability of recovery from the disease. However, in the case of COVID-19, it has been observed that COVID-19 infected patients can be both symptomatic and asymptomatic. The symptomatic patients are those which have apparent symptoms of the disease and require immediate medical attention. On the other hand, asymptomatic patients are those which are infected by other symptomatic or asymptomatic patients and carries the disease with them during a period known Karnataka-561203, India (e-mail: sourav.bhowmick1@gmail.com) Abstract: Objective of this present study is to predict the COVID-19 trajectories in terms of infected population of Indian states. In this work, a state interaction network of sixteen Indian states with highest number of infected caseload is considered, based on networked Susceptible-Exposed-Infected-Recovered (SEIR) epidemic model. An intervention term has been introduced in order to capture the effect of lockdown with different stringencies at different periods of time. The model has been fitted using least absolute shrinkage and selection operator (LASSO). Machine learning methods have been used to train the parameters of the model, cross-validate the data, and predict the parameters. The predictions of infected population for each of the sixteen states have been shown using data considered from January 1, 2021 till writing this manuscript on June 25, 2021. Finally, the effectiveness of the model is manifested by the calculated mean error and confidence interval. Keywords: Networked SEIR model, COVID-19, Epidemic model, Prediction, Intervention stingency. The devastation of the pandemic COVID-19 has left the mankind exasperated to find for any remotely possible solution and intensify the combating efforts against the deadly transmission of the disease. Originated in China (ECDC, 2020), (WCHC, 2019) and gradually engulfing the entire world, this deadly disease has been dangerously affecting all of our daily lives and various other day-to-day activities for the last one year or so now, infecting hundred millions of people and claiming more than millions of lives globally till June 25, 2021 (worldometer, 2021). With regard to the impact of the disease in national scenario of India, millions of people have been infected with the virus while lakhs have died of the disease till June 25, 2021 (India, 2021) . Though, India managed to flatten the curve of COVID-19 during the first wave, and minimised the per day caseload; but the inevitable second wave hit hard and devastated India. The impact of second wave is approximately four times worse than the previous one. Quite naturally then, understanding the evolution and spreading dynamics of the disease and its future course in Indian states is of utmost importance at this point of time for major health and security concerns in the society as well as risk assessment and focussed medical support in regions. The transmission dynamics of every viral disease can ⋆ This work is supported by Science and Engineering Research Board (SERB), Department of Science and Technology, Govt. of India with project reference no. MSC/2020/000039. be dictated by existing dynamical epidemiological model (Draief and Massoulie, 2009) , (Bailey et al., 1975) . In this regard, Bernoulli developed the first known mathematical epidemiological model in eighteenth century that captures the spreading of smallpox (Bernoulli, 1760) . Incidentally, two of the classical compartmental epidemiological transmission models are (i) Susceptible (S)-Infected (I)-Susceptible (S), i.e., SIS model (Kermack and McKendrick, 1932) and (ii) Susceptible (S)-Infected (I)-Recovered (R) model, i.e., SIR model (Kermack and McKendrick, 1927) , both developed by Kermack and Mckendrick. In the former model, basically all the individuals start as susceptible and becomes infected when comes in contact with an infectious individual, and finally on getting recovered again becomes susceptible to the disease. On the other hand, in the later model, the infected individuals when recover from the disease stay recovered for some duration due to either enhanced immunity after the disease or due to the effect of the vaccine. These models are associated with two very important parameter namely (i) transmission rate (β) which represents the probability of infection and (ii) curing rate (δ) that represents the probability of recovery from the disease. However, in the case of COVID-19, it has been observed that COVID-19 infected patients can be both symptomatic and asymptomatic. The symptomatic patients are those which have apparent symptoms of the disease and require immediate medical attention. On the other hand, asymptomatic patients are those which are infected by other symptomatic or asymptomatic patients and carries the disease with them during a period known The devastation of the pandemic COVID-19 has left the mankind exasperated to find for any remotely possible solution and intensify the combating efforts against the deadly transmission of the disease. Originated in China (ECDC, 2020), (WCHC, 2019) and gradually engulfing the entire world, this deadly disease has been dangerously affecting all of our daily lives and various other day-to-day activities for the last one year or so now, infecting hundred millions of people and claiming more than millions of lives globally till June 25, 2021 June 25, (worldometer, 2021 . With regard to the impact of the disease in national scenario of India, millions of people have been infected with the virus while lakhs have died of the disease till June 25, 2021 (India, 2021) . Though, India managed to flatten the curve of COVID-19 during the first wave, and minimised the per day caseload; but the inevitable second wave hit hard and devastated India. The impact of second wave is approximately four times worse than the previous one. Quite naturally then, understanding the evolution and spreading dynamics of the disease and its future course in Indian states is of utmost importance at this point of time for major health and security concerns in the society as well as risk assessment and focussed medical support in regions. The transmission dynamics of every viral disease can ⋆ This work is supported by Science and Engineering Research Board (SERB), Department of Science and Technology, Govt. of India with project reference no. MSC/2020/000039. be dictated by existing dynamical epidemiological model (Draief and Massoulie, 2009) , (Bailey et al., 1975) . In this regard, Bernoulli developed the first known mathematical epidemiological model in eighteenth century that captures the spreading of smallpox (Bernoulli, 1760) . Incidentally, two of the classical compartmental epidemiological transmission models are (i) Susceptible (S)-Infected (I)-Susceptible (S), i.e., SIS model (Kermack and McKendrick, 1932) and (ii) Susceptible (S)-Infected (I)-Recovered (R) model, i.e., SIR model (Kermack and McKendrick, 1927) , both developed by Kermack and Mckendrick. In the former model, basically all the individuals start as susceptible and becomes infected when comes in contact with an infectious individual, and finally on getting recovered again becomes susceptible to the disease. On the other hand, in the later model, the infected individuals when recover from the disease stay recovered for some duration due to either enhanced immunity after the disease or due to the effect of the vaccine. These models are associated with two very important parameter namely (i) transmission rate (β) which represents the probability of infection and (ii) curing rate (δ) that represents the probability of recovery from the disease. However, in the case of COVID-19, it has been observed that COVID-19 infected patients can be both symptomatic and asymptomatic. The symptomatic patients are those which have apparent symptoms of the disease and require immediate medical attention. On the other hand, asymptomatic patients are those which are infected by other symptomatic or asymptomatic patients and carries the disease with them during a period known Karnataka-561203, India (e-mail: sourav.bhowmick1@gmail.com) Abstract: Objective of this present study is to predict the COVID-19 trajectories in terms of infected population of Indian states. In this work, a state interaction network of sixteen Indian states with highest number of infected caseload is considered, based on networked Susceptible-Exposed-Infected-Recovered (SEIR) epidemic model. An intervention term has been introduced in order to capture the effect of lockdown with different stringencies at different periods of time. The model has been fitted using least absolute shrinkage and selection operator (LASSO). Machine learning methods have been used to train the parameters of the model, cross-validate the data, and predict the parameters. The predictions of infected population for each of the sixteen states have been shown using data considered from January 1, 2021 till writing this manuscript on June 25, 2021. Finally, the effectiveness of the model is manifested by the calculated mean error and confidence interval. Keywords: Networked SEIR model, COVID-19, Epidemic model, Prediction, Intervention stingency. The devastation of the pandemic COVID-19 has left the mankind exasperated to find for any remotely possible solution and intensify the combating efforts against the deadly transmission of the disease. Originated in China (ECDC, 2020), (WCHC, 2019) and gradually engulfing the entire world, this deadly disease has been dangerously affecting all of our daily lives and various other day-to-day activities for the last one year or so now, infecting hundred millions of people and claiming more than millions of lives globally till June 25, 2021 (worldometer, 2021). With regard to the impact of the disease in national scenario of India, millions of people have been infected with the virus while lakhs have died of the disease till June 25, 2021 (India, 2021) . Though, India managed to flatten the curve of COVID-19 during the first wave, and minimised the per day caseload; but the inevitable second wave hit hard and devastated India. The impact of second wave is approximately four times worse than the previous one. Quite naturally then, understanding the evolution and spreading dynamics of the disease and its future course in Indian states is of utmost importance at this point of time for major health and security concerns in the society as well as risk assessment and focussed medical support in regions. The transmission dynamics of every viral disease can ⋆ This work is supported by Science and Engineering Research Board (SERB), Department of Science and Technology, Govt. of India with project reference no. MSC/2020/000039. be dictated by existing dynamical epidemiological model (Draief and Massoulie, 2009) , (Bailey et al., 1975) . In this regard, Bernoulli developed the first known mathematical epidemiological model in eighteenth century that captures the spreading of smallpox (Bernoulli, 1760) . Incidentally, two of the classical compartmental epidemiological transmission models are (i) Susceptible (S)-Infected (I)-Susceptible (S), i.e., SIS model (Kermack and McKendrick, 1932) and (ii) Susceptible (S)-Infected (I)-Recovered (R) model, i.e., SIR model (Kermack and McKendrick, 1927) , both developed by Kermack and Mckendrick. In the former model, basically all the individuals start as susceptible and becomes infected when comes in contact with an infectious individual, and finally on getting recovered again becomes susceptible to the disease. On the other hand, in the later model, the infected individuals when recover from the disease stay recovered for some duration due to either enhanced immunity after the disease or due to the effect of the vaccine. These models are associated with two very important parameter namely (i) transmission rate (β) which represents the probability of infection and (ii) curing rate (δ) that represents the probability of recovery from the disease. However, in the case of COVID-19, it has been observed that COVID-19 infected patients can be both symptomatic and asymptomatic. The symptomatic patients are those which have apparent symptoms of the disease and require immediate medical attention. On the other hand, asymptomatic patients are those which are infected by other symptomatic or asymptomatic patients and carries the disease with them during a period known The devastation of the pandemic COVID-19 has left the mankind exasperated to find for any remotely possible solution and intensify the combating efforts against the deadly transmission of the disease. Originated in China (ECDC, 2020), (WCHC, 2019) and gradually engulfing the entire world, this deadly disease has been dangerously affecting all of our daily lives and various other day-to-day activities for the last one year or so now, infecting hundred millions of people and claiming more than millions of lives globally till June 25, 2021 (worldometer, 2021). With regard to the impact of the disease in national scenario of India, millions of people have been infected with the virus while lakhs have died of the disease till June 25, 2021 (India, 2021) . Though, India managed to flatten the curve of COVID-19 during the first wave, and minimised the per day caseload; but the inevitable second wave hit hard and devastated India. The impact of second wave is approximately four times worse than the previous one. Quite naturally then, understanding the evolution and spreading dynamics of the disease and its future course in Indian states is of utmost importance at this point of time for major health and security concerns in the society as well as risk assessment and focussed medical support in regions. The transmission dynamics of every viral disease can ⋆ This work is supported by Science and Engineering Research Board (SERB), Department of Science and Technology, Govt. of India with project reference no. MSC/2020/000039. be dictated by existing dynamical epidemiological model (Draief and Massoulie, 2009) , (Bailey et al., 1975) . In this regard, Bernoulli developed the first known mathematical epidemiological model in eighteenth century that captures the spreading of smallpox (Bernoulli, 1760) . Incidentally, two of the classical compartmental epidemiological transmission models are (i) Susceptible (S)-Infected (I)-Susceptible (S), i.e., SIS model (Kermack and McKendrick, 1932) and (ii) Susceptible (S)-Infected (I)-Recovered (R) model, i.e., SIR model (Kermack and McKendrick, 1927) , both developed by Kermack and Mckendrick. In the former model, basically all the individuals start as susceptible and becomes infected when comes in contact with an infectious individual, and finally on getting recovered again becomes susceptible to the disease. On the other hand, in the later model, the infected individuals when recover from the disease stay recovered for some duration due to either enhanced immunity after the disease or due to the effect of the vaccine. These models are associated with two very important parameter namely (i) transmission rate (β) which represents the probability of infection and (ii) curing rate (δ) that represents the probability of recovery from the disease. However, in the case of COVID-19, it has been observed that COVID-19 infected patients can be both symptomatic and asymptomatic. The symptomatic patients are those which have apparent symptoms of the disease and require immediate medical attention. On the other hand, asymptomatic patients are those which are infected by other symptomatic or asymptomatic patients and carries the disease with them during a period known as latency period or incubation period which can be as long as fourteen days. In other words, a person remains infected with the disease and carries the disease with her/him unaware of it and unnoticed in the society during the entire latency period before the symptoms begin to show up and the person requires medical attention. This state is called Exposed (E) state, and the rate at which the exposed individuals begin to show symptoms of being infected is called incubation rate (σ). Specifically speaking, exposed state is more dangerous than the infected state in the sense that during the entire incubation period, the exposed person can also expose other individuals who have come in contact with her/him, and eventually getting infected with the disease (Agrawal et al., 2021a) , (Agrawal et al., 2021b) . Hence, in this work the prediction of the future trajectories of COVID-19 in Indian states has been done based on the networked SEIR epidemiological discrete-time dynamical model (Vrabac et al., 2022) . Various epidemiological models are provided in details in Hethcote (2000) and for recent and relevant review, Pastor-Satorras et al. (2015) and Nowzari et al. (2016) are excellent repositories. A rigorous study of stability analysis of various model has been provided in Paré et al. (2017) . Network-based epidemic models are provided in Mesbahi and Egerstedt (2010) . Moreover, investigation along with detailed stability analysis of a multi-competitive mutating virus epidemic model over a time-varying network has been carried out in Paré et al. (2021) . Network-based epidemic model relaxes many simplistic assumptions of well-mixed population model, e.g., homogeneous transmission rate and curing rate which reflect equal chances of getting infected and getting cured. Incidentally, the propagation of the disease which is mainly attributed to social interactions can take place among people from the same geographical locations or across states, and in that sense the nodes of the networks can be considered to be the states and their connectivity or interactions represent the edges of the network. Towards that end, the quantitative knowledge of the interactions between states in terms of population flow or the interactions within the state in terms of population density can be thought to be good indicators of the propagation of this disease across states. However, the knowledge regarding the population flow which should take into account not only the airline transportation data, but also various other modes of communication such as railways or roadways can not be expected to be very precise and reliable (Mandal et al., 2020) . As an alternative to availing this data, the prediction analysis of the disease can be recast as a network inference problem for a particular network-based epidemic model where useful parameters of the model and the contact pattern can be estimated. Based on given epidemiological data, Welch et al. (2011) dwells upon the statistical inference methods to estimate interaction network. In the similar research area, for Chinese province Hubei (where the disease originated), predicted the future course of COVID-19 in the early days of the disease based on a networked SIR model where an interaction network of Hubei's cities is considered, recently in Pizzuti et al. (2020) a transmission modifier term has been introduced, that incorporates the intervention policies to predict the future course of COVID-19 in Italy. Moreover, underlying contact network reconstruction and prediction of epidemic course for generalized models has been discussed in . Other than this, control algorithm for disease retardation in networked model has been formulated in Zino et al. (2020) ; Wang et al. (2020a) . In this study, the focus is on predicting the epidemic size of each of the sixteen most-infected Indian states through a state interaction networked-based SEIR epidemic model. The epidemic dynamical model underpins the transmission dynamics of COVID-19 with its tenets of disease spreading mechanism rooted in the estimated contact network among states. Moreover, other parameters of the models such as curing rate and incubation rate are also inferred from this study to predict the future course of infected caseload. Predicting the caseloads of each states for next fourteen days will help Government and policy makers to plan for the necessary arrangements of medical facility like requirements of numbers of beds, ICU, oxygen, medicines, and health care workers. Further, this prediction will help Government to plan for the necessary interventions by optimal trade off. The epidemic prediction results of the states are based on the data considered from January 1, 2021 to June 25, 2021 available in (India, 2021) . In the remainder of the paper, Section 2 describes the models of the work, Section 3 is about the different methods used, Section 4 is all about the simulated results and discussion, and the concluding discussion and future work are in Section 5. The following is the state connectivity network-based SEIR epidemic transmission model with difference of time index from k to k + 1 being equal to one day: 16} are the index of the sixteen states as provided in Table 1 , σ i > 0 and δ i > 0 are the incubation rate and curing rate for state i, respectively. β ij ≥ 0 denotes the rate at which susceptible individuals are getting infected when coming in close contact with infected individuals between states i and j (i ̸ = j) while self-transmission rate due to infected individuals within the same state i are represented by β ii > 0, respectively. α i [k] is the intervention term to simulate the different intervention strategy implemented by different state Governments. The entire process of networked-based dynamical transmission of the disease between the compartments is shown in Fig. 1 , where the transmission is shown between two states i and j. It is witnessed that unlike the first wave of COVID-19 there was no nationwide lockdown in the second wave. Instead, based on experts advice different states impose lockdown at different time and that too with different stringency. Table 1 To integrate this effect of different lockdown strategy by different state Government at different instant of time, an intervention term α i [k] ∈ [0, 1] has been introduced (Pizzuti et al., 2020) , (Wang et al., 2020b) , where i represent i th state and k represent the timeline or more specifically the day of imposition of intervention. After detailed research (MHA-GoI, 2021) , it is found that most of the states impose night curfew within April, 2021; then they went for weekend lockdown, followed by partial lockdown; and finally they opted for strict or complete lockdown. However, after May 1 while some of the states start to open completely others went for partial relaxation of lockdown. The considered phases are as follows: Depending on the stringency of lockdown human mobility decreases. Moreover, a decrease in human mobility will definitely decrease the rate at which the disease is transmitting, as COVID-19 spread through human interaction. Thus, the considered intervention term α i [k] have dependencies on strictness of lockdown as well as the time of imposition of lockdown. Table 1 shows the sixteen most-infected Indian states with their indices in descending order of infected population and total population of respective states. In this study, data has been considered from January 1, 2021 to June 25, 2021. From the available data for this entire duration and for each state, the fraction of reported infected cases per day has been calculated by dividing (I daily ) with the total population of the state (N state ), i.e., Input data, I = I daily Nstate . The block diagram representation in Fig. 3 encapsulates our methods for the prediction of future course of COVID-19 progress in the most-infected sixteen Indian states based on the considered data from January 1, 2021 till June 25, 2021. Input data I is passed through data processing block and gives a seven-day moving average data as an output, then the processed data is used to calculate exposed fraction (E i ), recovered fraction (R i ), and thus susceptible fraction (S i ) using networked SEIR model (1), for each state and for entire duration of considered data. It is to be noted that to calculate the fraction of individuals in each compartment, S i + E i + I i + R i = 1 is used. Subsequently, data consisting information of S i , E i , and I i are used to train and validate the data using a machine learning algorithm, and thus to predict the future course of epidemic size in Indian states in terms of the total number of infected population. (1), the following can be written: where F i ∈ R (n−1)×N and Then, from (1), (2),(3), and (4), the following can be written: However, with almost every modelling, there is associated modelling error namely σ i (i ∈ ℑ N ) in our case such that whereV i andF i are values obtained from modelling. Consequently, following , modelling error σ i is reduced by using least absolute shrinkage and selection operator, commonly known as LASSO Foucart and Rauhut (2017) to estimate the network transmission parameters such that where λ i > 0 is the regularization parameter obtained by cross-validation method. The solution of the LASSO problem (7) . Hold-out cross validation method has been used with holdout ratio being 0.2. Fig. 4 to Fig. 7 shows the cumulative infected population of all the states as tabulated in Table 1 with consecutive four states in each figure, based on the considered data from January 1, 2021 to June 25, 2021. The abbreviation of the states are provided in Table 1 . For accuracy purpose, three days are skipped from the data points, i.e., data is considered till June 23, 2021 and then subsequently the prediction is shown for next fourteen days (two weeks). Blue lines show the actual plot of data available whereas, red lines show the prediction part for respective states. Thus, from the predicted results it is evident that for almost all the states the cumulative number of caseload is saturating except for the states Kerala and Odisha. Furthermore, Fig. 6 shows that after few days Odisha surpass Gujrat with respect to the cumulative infected caseload. Moreover, it is found that the average error percentage for the neglected three days is very small. The predicted result has been compared with the real COVID-19 data. Thus, from the comparison, per day error percentage for each of the sixteen states has been calculated. Therefore, from the calculated per day error the mean error percentage of each of the states has been calculated. Table 2 shows the mean error percentage of each of the sixteen states. It is clear from the table that, for most of the states the mean error is close to or less than 1%, whereas the maximum mean error is found to be 3.84% for the state Kerala and minimum mean error is found to be 0.04% for the state Gujarat. Again, the 95% confidence interval(CI) (Papoulis and Pillai, 2002) as well as 99% confidence interval (CI) has also been calculated to find the effectiveness of this method, as shown in Table 2 . Fig. 8 shows the plot of mean error percentage of each states. Moreover, it is found that the error of prediction for this method very less. Thus, it can be inferred that this model works well to predict the cumulative infection caseload for each of the states. In this study, a state interaction network-based SEIR epidemic model is investigated to predict the spread of COVID-19 in sixteen most-infected Indian states in terms of total cumulative number of infected people. Exposed state has been considered to incorporate the huge number of asymptomatic cases in India. Moreover, incorporation of new exposed compartment along with the intervention term α i [k] helps to train the model parameters, and thus to predict the cumulative caseload quite accurately for all the considered states. Furthermore, the disease network transmission rates across/within the state which are otherwise difficult to quantify, are inferred from this network-based epidemic model approach. Other important modelling parameters such as curing rate and incubation rate are also inferred from this approach for each state. The mean error percentage of the prediction has been found to be very less, this shows the effectiveness of this method. As future work, there remains many challenges to understand the transmission dynamics of COVID-19 when considering the virus is mutating. Also, to find what are the factors that is leading to consecutive waves. Finally, to find optimal control strategies to control the disease propagation. Modelling the spread of SARS-CoV-2 pandemic -Impact of lockdowns & interventions SUTRA: An Approach to Modelling Pandemics with Asymptomatic Patients, and Applications to COVID-19 The mathematical theory of infectious diseases and its applications. Charles Griffin & Company Ltd, 5a Crendon Street Essai d'une nouvelle analyse de la mortalité causée par la petite vérole, et des avantages de l'inoculation pour la prévenir. Histoire de l'Acad Map India outline, states (white Epidemics and rumours in complex networks Rapid Risk Assessment: Outbreak of acute respiratory syndrome associated with a novel coronavirus A mathematical introduction to compressive sensing The Mathematics of Infectious Diseases A contribution to the mathematical theory of epidemics. Proceedings of the Contributions to the mathematical theory of epidemics. II.-the problem of endemicity Prudent public health intervention strategies to control the coronavirus disease 2019 transmission in India: A mathematical model-based approach Graph theoretic methods in multiagent networks Circulars for COVID-19-Ministry of Home Affairs-GoI Analysis and Control of Epidemics: A Survey of Spreading Processes on Complex Networks Probability, Random Variables, and Stochastic Processes. Tata McGraw-Hill Education Modeling, estimation, and analysis of epidemics over networks: An overview Multi-competitive viruses over time-varying networks with mutations and human awareness Epidemic processes in complex networks Network-based prediction of COVID-19 epidemic spreading in Italy Network-based prediction of the 2019-ncov epidemic outbreak in the chinese province hubei Network reconstruction and prediction of epidemic outbreaks for general group-based compartmental epidemic models Capturing the Effects of Transportation on the Spread of COVID-19 With a Multi-Networked SEIR Model Controlling a Networked SIS Model via a Single Input over Undirected Graphs An epidemiological forecast model and software assessing interventions on the COVID-19 epidemic in China Wuhan Municipal Health and Health Commission's briefing on the current pneumonia epidemic situation in our city Statistical inference to advance network models in epidemiology On assessing control actions for epidemic models on temporal networks