key: cord-0055239-5fyi7rxj authors: Rao, Congjun; Gao, Yun title: Influencing factors analysis and development trend prediction of population aging in Wuhan based on TTCCA and MLRA-ARIMA date: 2021-01-15 journal: Soft comput DOI: 10.1007/s00500-020-05553-9 sha: dc17488d00a0df0826ea562f26b4150ccd00aea5 doc_id: 55239 cord_uid: 5fyi7rxj With the rapid development of the economy, the problem of population aging has become increasingly prominent. To analyse the key factors affecting population aging effectively and predict the development trend of population aging timely are of great significance for formulating relevant policies scientifically and reasonably, which can mitigate the effects of population aging on society. This paper analyses the current situation of population aging in Wuhan of China and discusses the main factors affecting the population aging quantitatively, and then establishes a combination prediction model to forecast the population aging trend. Firstly, considering the attribute values of the primary influence factors are multi-source heterogeneous data (the real numbers, interval numbers and fuzzy linguistic variables coexist), a two-tuple correlation coefficient analysis method is proposed to rank the importance of the influencing factors and to select the main influencing factors. Secondly, a combination prediction model named Multiple Linear Regression Analysis-Autoregressive Integrated Moving Average is established to predict the number and the proportion of aging population in Wuhan. By using the statistical data of Wuhan in the past 20 years, this combination prediction model is used for empirical analysis, and a prediction result of the number and the proportion of aging people in Wuhan in the future is obtained. Based on these quantitative analysis results, we propose some countermeasures and suggestions on how to alleviate the population aging of Wuhan from aspects of economic development, pension security system design and policy formulation, which provide theoretical basis and method reference for relevant population management departments to make scientific decisions. In recent years, the proportion of aging people in the global population has been increasing year by year, and the population age structure has been aging. In China, there were two baby peaks, from 1949 to 1957, and from 1962 to 1971 . At the first peak, the birth rate was between 32 and 38 per thousand. At the second peak, the birth rate was between 30 and 44 per thousand. The two baby peaks have had a long-term impact on the age structure of China's population. With the changing situation at home and abroad, China became one of the countries with a large number of aging populations in 1999. The aging age structure of the population will change the social economy, consumption and labor force, and have a negative impact on the steady economic development in the new normal state. Population aging has a complex and long-term impact on national development and social operation (Wiener and Tilly 2002; Modigliani 2005; Lyons et al. 2018; Schön and Stähle 2020) . The population aging refers to the aging process of population age structure caused by the increasing proportion of population aged 60 (or 65)-years-old or older in the total population. Generally speaking, when the number of people aged 60 or above in a certain region or a country is more than 10% of its total population (or the proportion of people aged 65 or above is more than 7%), the country or region has entered an aging society. Population aging will bring a series of problems to individuals, families and society. For example, the increasing aging population will bring social and economic aspects of challenges, such as & Congjun Rao cjrao@163.com demographic dividend gradually disappear, the working population will shrink, the burden coefficient of young adults will increase, medical costs will rise, elderly dependency ratio will rise, the elderly consumer demand will be increased, the average savings rate will decline and social investment will reduce. The population aging will lead to higher health care expenditures of families, which increases the demand for financial resources of families with older people and increases the economic burden on more families and family members, and the burden on families and society to provide for the aged will increase. With the aging of the population becoming more serious, the health problems of the elderly population will change, such as increased rates of cancer, bone fractures, cardiovascular diseases, depression and dementia, as well as increased disease complications, which will significantly reduce the quality of health and the level of life expectancy of the elderly population. Thus, population aging is not only a serious social problem, but also a highly valued economic problem. As the capital of Hubei Province, the central city of central China and the new first-tier city, Wuhan has a fasteconomic development, a favorable geographical location, a long history and abundant educational resources. Since 1993, the degree and scale of population aging in Wuhan was increasing and expand. Wuhan introduced a talent introduction policy named ''Millions of college students stayed in Wuhan'' in 2017. The implementation of this policy will have an impact on the net migration rate and density of population, cause the change of population age structure, and further affect the aging degree of population in Wuhan. As an important political, economic and cultural development center, if the problem of population aging is not properly solved, it will inevitably affect the development of Wuhan's economy in the future. According to the Statistical Yearbook of Wuhan (Wuhan Bureau of Statistics 2020), in 1993, the population number of Wuhan aged 60 and older and its proportion were 710,000 and 10.3%, respectively. Since the beginning of the twenty-first century, both the number and the proportion of aging population in Wuhan have shown a trend of continuous increase. In 2010, the number of aging populations in Wuhan exceeded 1 million and rose to 1.2743 million, with the proportion of aging population reaching 16%. In 2016, the number of people aged 60 or older in Wuhan rose to 1.72752 million, with the proportion of aging population rising to 20.7% over 20%. It can be seen from the above data that the number and the proportion of aging population in Wuhan are increasing year by year, and the aging degree is becoming more and more serious. In Wuhan, the aging population shows a trend of large number and rapid increase, but the prediction of the aging population of Wuhan and the analysis of the influencing factors are not satisfactory, which will affect the government's scientific decision-making and the formulation and implementation of relevant policies to a certain extent. In addition, the level of economic development in Wuhan is not in line with the speed of population aging. At the same time, the continuous aggravation of aging will inevitably lead to the gradual shortage of labor force, which will also test the old-age security system in Wuhan. Thus, we need to make a reasonable prediction and comprehensive analysis on the aging degree and development trend of population in Wuhan, so as to provide theoretical basis and method reference for relevant population management departments to make scientific decisions. Based on the above research background, the purpose of this paper is to select the main influencing factors of the population aging in Wuhan, predict the aging population and its proportion in the future, and provide decisionmaking support for the government to formulate appropriate policies in an effective and timely manner, so as to eliminate the adverse impact of the population aging on social and economic development. On the problem of population aging, many scholars have made many valuable discussions from different perspectives. They mainly focus on the development status and trend of population aging, the prediction of future aging population, the analysis of influencing factors of population aging and countermeasures, and so on (Wang 2004; Zhang 2009; Lee and Shin 2019; Li et al. 2021 ). The research on the problem of population aging should not only be limited to the problem of population prediction, but also should trace back to the source and find the main influencing factors of population aging fundamentally, and then take this as the basis to predict and analyse the population aging, so as to formulate effective and feasible countermeasures to eliminate the impact of population aging on social and economic development. Therefore, it is very important to study the influencing factors of population aging. In analyzing the main influencing factors of population aging, the most commonly used methods include correlation coefficient method, principal component analysis method, gray relational degree analysis method, MLRA model, and so on. Wei et al. (2018) discussed the relationship between population policy and economic growth by considering the necessity of solving the problem of population aging. He (2011) , respectively, used the gray relational model and the principal component analysis model to select the main influencing factors of population aging. Then, Chen and Hao (2014) selected the main influencing factors of population aging by constructing MLRA model. Li (2017) firstly used the Thiel index method to decompose the differences among different regions, and then, from the perspective of panel data, the estimation method of Feasible Generalized Least Squared (FGLS) was used to test the influencing factors of population aging in combination with panel data (Hu et al. 2012; Xu et al. 2017; Wang and Gan 2017; Xiao et al. 2020b) . The above analysis on the influencing factors of population aging is done under the condition that the attribute values of the influencing factors are all in the form of real numbers. However, in actual decision-making, the values of indexes that have a great impact on population aging cannot be obtained in the form of accurate real numbers. Instead, only a value range can be estimated according to the actual situation, or a rank value can be expressed by fuzzy linguistic variables (Rao and Zhao 2009) . Therefore, in the actual analysis of the main factors of population aging in Wuhan, there will be the multi-source and heterogeneous data for the attribute values of primary influencing factors, i.e., the real numbers, the interval numbers and the fuzzy linguistic variables coexist. But most of the traditional analysis methods are no longer applicable to this kind of multi-source heterogeneous data with real numbers, interval numbers and fuzzy linguistic variables. In this paper, on the basis of the attribute value types of primary influencing factors, when the attribute values are the multi-source heterogeneous data with the real numbers, interval numbers and fuzzy linguistic variables, we apply the two-tuple model (Herrera and Martínez 2000; Herrera et al. 2005; Zhang 2013; Rao et al. 2016a, b) , and transform all original multi-source heterogeneous data into two-tuple data, and then propose a new method named TTCCA to select the main factors influencing the population aging in Wuhan. The two-tuple model was first proposed by Herrera and Martínez (2000) . In this model, a linguistic fuzzy variable is regarded as a continuous variable within its definitional domain and a dual combination formed by a linguistic fuzzy variable, and a real number is used to express the linguistic assessment information (Rao et al. 2017a ). Subsequently, some extended models were proposed, i.e., 2-tuple hybrid ordered weighted averaging (THOWA) operator (Rao et al. 2015) , interval 2-tuple linguistic VlseKriterijumska Optimizcija I Kaompromisno Resenje in Serbian (ITL-VIKOR) method (You et al. 2015) , gray linguistic 2-tuple weighted averaging (GLTWA) operator (Rao et al. 2016b ), 2-tuple linguistic Data Envelopment Analysis (DEA) (Geng et al. 2017) , linguistic 2-tuple gray correlation degree model (Rao et al. 2017a) , 2-tuple linguistic generalized aggregation (I2LGA) operator (Liu and Chen 2018), Hesitant 2-tuple linguistic Bonferroni operators , 2-tuple fuzzy linguistic approach (Muhuri and Gupta 2020) , two-tuple mixed correlation degree (Rao et al. 2020a) , multi-source heterogeneous multi-attribute decision-making (MSHMADM) method based on the linguistic 2-tuple (Xiao et al. 2020c, d) . These studies show that the data process in the two-tuple model and its extended models can effectively avoid the information loss and information distortion in the process of information gathering comparing with some existing congeneric methods. In this paper, just based on the advantages of two-tuple model in information processing, we will propose a new method named TTCCA by combining the two-tuple model with the traditional correlation coefficient analysis method to solve the selection problem of the main factors influencing the population aging in Wuhan. The research on development trend prediction of population aging mainly used some quantitative methods (Hou 2012; Wang and Liu 2012; Zhao et al. 2015; Chen 2016; Li 2016) such as gray prediction models, parameter prediction methods of regression analysis, matrix prediction method, population age shift model, time series model, neural network model, nonparametric regression method, MLRA model, support vector regression, Bayesian Hierarchical space-time model, and so on. Gillen and Spore (1994) made the first in-depth study of population aging by assuming that the global fertility rate remains at the replacement level, thus predicted the number of global population. In 2003, Lutz et al. (2003) proposed a population age shifting model to predict the population by setting the total fertility rate, and compared the prediction results of different schemes. Jiang (2012) used the population age shifting model and the population development equation model to predict various population age indexes in China. Gray Model (1,1) (GM(1,1)), as an important prediction model, is also widely used in population prediction. Liang (2017) also used an extended gray prediction model based on GM (1, 1) to predict the aging population. The Leslie matrix prediction method is also widely used in the prediction of population aging trend. Smit et al. (2006) made a medium and long-term prediction of the elderly population by Leslie matrix prediction method, and then analyzed the future aging problem from different perspectives such as the total population, structure and aging coefficient. Liu (2016) used the Leslie matrix prediction method to predict the size and structure of population in China. Based on this, Meng (2012) proposed a combination model of Leslie matrix and time series analysis to predict the aging population. For the study of population aging, time series models are also commonly used. For example, Sun and Wu (2015) predicted the number of future aging population and its proportion through the ARIMA, and proposed corresponding countermeasures and suggestions for solving various social problems caused by population aging. Gong et al. (2007) and proposed parametric autoregression models for analysis and prediction, and compared the prediction results with first-order autoregressive model (AR (1)) to improve the prediction accuracy of aging population. The parametric regression analysis models are also often used in the study of population aging, commonly used are exponential model and Logistic regression model. The exponential model is based on linearization of the model, the ordinary least squares (OLSs) are used to estimate the parameters of the linear regression model, and then the historical data of population aging are used for fitting and predictive analysis. For example, Chen and Yu (2006) , Zhu and Pang (2009) used the Logistic regression model to predict the population and obtained a good prediction effect. The method of MLRA is used to establish a model and predict the population future through the OLS according to the population data of a certain region or the whole country. To a certain extent, the traditional prediction methods have limitations in the prediction accuracy and scope of application. Many scholars have applied the artificial neural network method to the problem of population aging. For example, Chen et al. (2012) proposed a radial neural network prediction model to solve the population prediction problem in Hunan Province and obtained a good prediction effect. In addition, Lv and Xuan (2012) predicted the aging coefficient of Beijing's population by constructing a Vector Autoregressive model (VAR). In the above analysis of the aging population trend prediction, many studies extrapolated the time trend based on simple statistical data and simple target variables, without combining the comprehensive impact of population fertility, death, migration, medical care and other factors. Such prediction results will have an impact on the reasonable prediction of population aging. In view of this problem, this paper will establish a combination prediction model to predict the development trend of population aging in Wuhan. The rest of this paper is organized as follows. Section 3 proposes a new TTCCA method to select the main influence factors of population aging under the information environment of multi-source heterogeneous data (the real numbers, interval numbers and fuzzy linguistic variables coexist). Section 4 establishes a combination prediction model based on MLRA and ARIMA to predict the population aging trend and the proportion of aging population, and makes an empirical analysis according to the population statistical data of Wuhan in the past 20 years. Based on these quantitative analysis results, Sect. 5 provides some countermeasures and suggestions on how to alleviate the population aging of Wuhan. Section 6 concludes the paper. The flow chart regarding the scheme of the proposed methods in our paper is shown in the following Fig. 1 . In this section, we analyse the present situation of population aging in Wuhan first, and then present a new TTCCA method based on traditional correlation coefficient analysis method to select the main influence factors of population aging. First of all, the population aging situation in Wuhan is statistically analyzed. By consulting the Statistical Yearbook of Wuhan (Wuhan Bureau of Statistics 2020), the data information of population aged 60 and above in Wuhan from 1993 to 2016 can be obtained (see Table 1 ). When formulating policies, the government should not only consider the number of population aged 60 and above, but also consider whether the proportion of aging population is reasonable. Based on the data of population aged 60 and above in Wuhan, the proportion of population aged 60 and above in each year can be further obtained, as shown in Table 1 and Fig. 2 . As can be seen from Table 1 and Fig. 2 , since Wuhan entered the aging city in 1993, the aging population in Table 1 The number and the proportion of population aged 60 and above in Wuhan from 1993 to 2016 Years The number of population aged 60 and above (ten thousand) The proportion of population aged 60 and above (%) Years The number of population aged 60 and above (ten thousand) From 710,000 in 1993 to 860,000 in 2000 , to 1,274,300 in 2010 , and to 1,727,500 in 2016 . From 1993 , the number and the proportion of population aged 60 and above in Wuhan increased continuously. Compared with 1993, the proportion of people aged 60 and above increased by 10.4% points in 2016. It is foreseeable that this proportion will continue to rise in the future, and the problem of population aging will gradually become more prominent. The number and the proportion of population aged 60 and above are rising in Wuhan, and the population development is facing the problem of population structure change. Therefore, it is necessary to further analyse the population age structure, and the results are shown in Fig. 3 . It can be seen from Fig. 3 that the population structure of Wuhan has changed significantly. In 2016, the proportion of population aged 0 to 14 in the total population was 12.07%, and that of population aged 15 to 59 was 67.23%. Compared with the sixth census in 2010, the proportion of population aged 0 to 14 in the total population increased by 2.07% points, while the proportion of population aged 60 and above rose by 5.5% points. Compared with the fifth census in 2000, the proportion of population aged 0 to 14 in the total population decreased by 5.23% points, and the proportion of population aged 60 and above in the total population increased by 9.2% points. The proportion of population aged 15 to 59 in the total population showed a parabolic shape, which reached the highest in 2005 and is now in the declining stage, that is, the demographic dividend of Wuhan is decreasing, and it can be predicted that the proportion will continue to decline in the future stage. In general, the population age structure of Wuhan has changed significantly, and the population aging is becoming more and more serious. In addition, the birth rate, death rate and natural growth rate are often the main influencing factors of population aging, so it is necessary to make a statistical analysis of its changing trend. From the Statistical Yearbook of Wuhan (Wuhan Bureau of Statistics 2020), we can get the related data of birth rate, mortality and natural growth rate from 1997 to 2016. Before 2009, the birth rate, mortality and natural growth rate fluctuate around 8%, 6% and 2.5%, respectively. But since 2010, the birth rate and natural growth rate showed an obvious upward trend. The natural growth rate fell sharply in 2004, when the death rate exceeded the birth rate because of special factors. In practical decision-making, due to the complexity of the environment, the fuzziness of human thought and the limit factors of data acquisition conditions, we cannot get accurate values expressed in real numbers for some indexes with large influence on population aging, and only estimate a value range (e.g., the average annual consumption level of residents), or use fuzzy language variables to represent a level value (e.g., physical health level, employment stability, etc.). Therefore, in the actual analysis of the main factors influencing the population aging, there will be a case that the attribute values of the preselected influencing factors are the multi-source heterogeneous data, i.e., the real numbers, interval numbers and fuzzy linguistic variables coexist. For such kind of index data, the classical statistical methods (e.g., correlation coefficient analysis, principal component analysis, etc.) cannot be directly used to extract the main influencing factors. For the decision-making problem with multi-source heterogeneous data information, it is necessary to transform the multi-source heterogeneous data for the need of decision. Some traditional processing methods transform the fuzzy linguistic variables into real numbers, or transform all real numbers and fuzzy linguistic variables into triangular fuzzy numbers (or interval numbers, or trapezoidal fuzzy numbers), and then use the fuzzy TOPSIS method, the VIKOR method and so on to rank alternatives. The deficiency of these methods is liable to occurs the information loss or information distortion in the data conversion process. However, the two-tuple model proposed by Herrera and Martínez in 2000 can overcome this deficiency. The two-tuple regards a linguistic phrase as a continuous variable in its domain, and it is in the form of a binary form consisting of a predetermined linguistic phrase and a real value to express the linguistic assessment information integration after received all the information, which can effectively avoid information loss and distortion in the process of information gathering and operation in linguistic assessment, and the calculation precision and reliability of information processing are superior to other similar methods (Herrera and Martínez 2000; Herrera et al. 2005; Zhang 2013; Rao et al. 2017a Rao et al. , 2020b . Based on the information processing advantages of two-tuple model, this section proposes a new method named TTCCA method based on the traditional correlation coefficient analysis to select the main factors influencing the population aging in Wuhan. (1) The definition and operations of two-tuple. A two-tuple is expressed by a binary form (s k , a k ), where s k and a k are defined as follows. (1) s k is the k-th element in a predefined linguistic evaluation set S, where S ¼ fs 0 ; s 1 ; . . .; s t g consists of t þ 1 linguistic fuzzy variables. If k [ l, then s k [ s l . (2) a k is a numerical value that represents the deviation between the evaluation result and s k , such that a k 2 ½À0:5 ; 0:5Þ. Definition 2 (Herrera and Martínez 2000; Rao et al. 2017a ) The operations of two-tuple (s k , a k ) are defined as follows. (1) Comparison operation. For any two two-tuples (s k , a k ) and ðs l ; a l Þ, if k ! l, then ðs k ; a k Þ ! ðs l ; a l Þ: (2) Max operator and Min operator. When ðs k ; a k Þ ! ðs l ; a l Þ, we have maxf ðs k ; a k Þ; ðs l ; a l Þg ¼ ðs k ; a k Þ; minfðs k ; a k Þ; ðs l ; a l Þg ¼ ðs l ; a l Þ: (3) Distance operator. For any two two-tuples A:(s k , a k ) and B: ðs l ; a l Þ, the distance between A and B is defined as DðA; BÞ ¼ a k þ k À l À a l j j l À 1 : (2) Transformation of multi-source heterogeneous data In the actual decision-making, considering that the multi-source heterogeneous data have different dimensions and orders of magnitude, it is necessary to process the original index value information to eliminate the adverse effect of dimensions and orders of magnitude on the decision result. The basic data processing method is to standardize the original index values and then convert the standardized data into two-tuples. Next, we give the method of standardizing the original index values, and then provide the methods of how the linguistic fuzzy variables, real numbers and interval numbers are converted into twotuples. (1) Standardization of original decision matrix Suppose that there are m preselected influencing factors affecting population aging in Wuhan. The attribute values of these m preselected influencing factors are multi-source heterogeneous data (the real numbers, interval numbers and fuzzy linguistic variables coexist). We give the time series statistical index values of each index for consecutive p years. The matrix formed by all attribute values of p years is denoted as C ¼ ðc ij Þ mÂp . Next, we standardize the data in the matrix C. Let the matrix C be For the attribute values in the form of real numbers, we use the following Eq. (1) to standardize For the attribute values in the form of interval numbers, the attribute value is denoted as ½c L ij ; c U ij , i ¼ 1; 2; . . .; p, j ¼ 1; 2; . . .; m, we use the following Eq. (2) to standardize By using the Eq. (1), the attribute values in the form of real numbers can be standardized as z ij 2 ½0 ; 1, and by using the Eq. (2), the attribute values in the form of interval numbers can be standardized as ½z L ij ; z U ij , where z L ij 2 ½0; 1 and z U ij 2 ½0; 1. The standardized matrix is denoted as Next, we provide the methods of how the linguistic fuzzy variables, real numbers and interval numbers are converted into two-tuples. (i) For a linguistic fuzzy variable s k 2 S, we can use the method given by the following Definition 3 to convert it into a two-tuple. Definition 3 (Herrera and Martínez 2000) Let s k 2 S be a linguistic fuzzy variable, the corresponding two-tuple can be converted by the following function h: h : S ! S  ½À0:5; 0:5Þ; (ii) For a real number d 2 ð0 ; 1Þ, we can use the method given by the following Definition 4 to convert it into two-tuple. Definition 4 (Herrera and Martínez 2000) Let S ¼ fs 0 ; s 1 ; . . .; s t g be a linguistic evaluation set, and d 2 ð0 ; 1Þ, then d 2 ð0 ; 1Þ can be converted into an equivalent two-tuple by the following function D: D : ½0; 1 ! S  ½À0:5; 0:5; such that where ''round'' is a rounding operation. From Definition 4, we can use an inverse function D À1 of function D defined in the following Definition 5 to return a two-tuple ðs k ; a k Þ into the corresponding real number d 2 ð0 ; 1Þ. Definition 5 (Herrera and Martínez 2000) Let S ¼ fs 0 ; s 1 ; . . .; s t g be a linguistic evaluation set, and ðs k ; a k Þ be a two-tuple, then there is a function D À1 , which can convert the two-tuple ðs k ; a k Þ into the corresponding real number d 2 ð0 ; 1Þ. (iii) For an interval number c ij ¼ ½c L ij ; c U ij , we can use the following Definition 6 to convert it into a two-tuple. Definition 6 (Herrera and Martínez 2000; Rao et al. 2017a ) Let S ¼ fs 0 ; s 1 ; . . .; s t g be a linguistic evaluation set, and I ¼ ½a ; b be an interval number, the intersection of I ¼ ½a ; b and the linguistic fuzzy variable s k is denoted as where l I ðxÞ and l s k ðxÞ are the membership functions of I and s k , respectively, i.e., Then the corresponding equivalent real number d of (s k , a k ) is defined as and we can use the method given by Definition 4 to convert d into the corresponding two-tuple (s k , a k ). If we use triangular fuzzy number to express linguistic fuzzy variable s k , that is, In this paper, we set t ¼ 4, then we have Step 1: Standardize the data in the original decision matrix C ¼ ðc ij Þ mÂp . Use Eqs. (1) and (2) to standardize the real numbers and interval numbers, respectively, and the standardized decision matrix is denoted as Z ¼ ðz ij Þ mÂp . Step 2: Use the methods given by Definition 3 to Definition 6 to convert all data in Z ¼ ðz ij Þ mÂp into two-tuples, and the two-tuple matrix is denoted as Step 3: Determine the reference sequence and the compared sequence. Take a decision element of p consecutive years as a reference sequence, which is denoted as ; . . .; h 0 ðpÞÞ ¼ ððs 0;1 ; a 0;1 Þ; ðs 0;2 ; a 0;2 Þ; . . .; ðs 0;p ; a 0;p ÞÞ: In the two-tuple matrix H ¼ ðh ij Þ mÂp , the elements in each column constitute a time series, which is called a compared sequence and is denoted as Step 4: Calculate the two-tuple correlation coefficient where 0;j ; a 0;j Þ; and D À1 is the converted operation given by Definition 5. Step 5: Rank the influence degree of all m preselected influencing factors. According to ranking result of twotuple correlation coefficient r 0i calculated in Step 4, the larger the value after taking the absolute value of r 0i , the greater the influence of the influencing factor i on population aging. Similar to the traditional correlation coefficient method, when 0:7 r 0i j j\1, the factor i is called highly correlated with population aging. When 0:4 r 0i j j:7, the factor i is called moderately correlated with population aging. When 0:2 r 0i j j:4, the factor i is called low correlation with population aging. When r 0i j j:2, the correlation is called to be very low or close to zero. In fact, from the TTCCA method given by Eq. (12), we can see that if the attribute values of all m preselected influencing factors are not multi-source heterogeneous data, but all of them are real numbers, then the Eq. (12) just become the traditional calculation formula of correlation coefficient analysis method. In other words, the TTCCA method proposed in this section is an extension and generalization of the traditional correlation coefficient analysis method. (1) Data in the form of real numbers: the main factors of influencing population aging in Wuhan from 2010 to 2015 include per capita Gross Domestic Product (GDP) (Yuan), birth rate (%), death rate (%), natural growth rate (%), two-child rate (%) and population density (people/per square kilometer), where ''Yuan'' is the unit of Chinese currency. The index values of these influencing factors are real numbers, which can be obtained from Statistical Yearbook of Wuhan (Wuhan Bureau of Statistics 2020). (2) Data in the form of interval numbers: due to the gap between the rich and the poor, the annual consumption expenditure of residents in Wuhan cannot reasonably reflect the consumption expenditure of residents in the whole city. If the influence factor of annual consumption expenditure is only used to analyse, the result of modeling will inevitably produce a big deviation. In order to make the analysis more accurate, the household consumption expenditure is expressed by the interval number within the scope of the annual consumption expenditure. Here, the average annual consumption level of Wuhan residents from 2010 to 2015 is converted into interval numbers (see Table 2 ) for analysis. When making judgment or prediction on the problem to be studied, the judgment method based on the knowledge and experience of experts is called Delphi method (Ma et al. 2011 ) (also known as expert investigation method). In essence, the Delphi method is an anonymous feedback consultation method. The general process is to sort out, summarize and make statistics after obtaining the opinions of experts on the issues to be decided, and then give anonymous feedback to all experts and solicit opinions again, focus opinions and give feedback again, until the consensus is reached. The advantage of this method is that everyone's views will be collected, which can avoid some possible disadvantages of group decision-making, so that people with special status have no chance to control the will of the group, and can give full play to the role of all experts. Thus, in this paper, the Delphi method is used to determine the index values for the qualitative indexes, which cannot be determined in the form of exact real numbers. (3) Data in the form of fuzzy linguistic variables: Considering that the physical condition and the employment stability degree are two qualitative indexes, so the Delphi method can be used to determine their index values. The decision-making process of Delphi method is as follows: Step 1: Specify the requirements and decision objectives, and inform the selected experts in writing. The selected experts must master the knowledge of the specific field. The number of experts is generally 8-20. Step 2: Each expert makes his own decision based on his own experience and knowledge, explains the reasons and basis of the decision in writing, and gives a written reply to the person that is in charge of the decision-making. Step 3: The person in charge of decision-making summarized and sorted out the experts' decision-making opinions, and, respectively, explained the basis and reasons of different decision-making values, and then returned to the experts, and asked the experts to modify the original decision. Step 4: When the experts make the second decisionmaking, they should analyse the reasons and basis for the second decision according to various decision-making opinions. Repeat above steps, all experts revise their decision result again and again until they are basically in agreement. Now we use above Delphi method to determine the evaluation values of several qualitative indexes. Among the influencing factors of population aging in Wuhan, the index values of physical condition and employment stability degree cannot be expressed in real numbers, and generally evaluated in the form of fuzzy linguistic variables such as ''good'', ''average'', ''poor'' and so on (Rao et al. 2015 (Rao et al. , 2017b Xiao et al. 2020a; Peng et al. 2020; Tian et al. 2019 Tian et al. , 2020 . Here the Delphi method can be used to determine the index values of these two factors. Because the physical condition is related to the variables such as medical care expenditure, life insurance expenditure, residents' minimum living allowance, and so on. And these variables can be quantified by the exact real numbers. In practical decision-making, each invited expert will give the comprehensive assessment value s k according to the values of medical care expenditure (a ij ), life insurance expenditure (b ij ) and residents' minimum living allowance (c ij ). Thus, the expert's evaluation decision sheet (see Table 3 ) can be obtained. The evaluation value is generally divided into 5 grades, that is, very good, good, average, poor, very poor. Summarize and sort out the evaluation results of all experts, and send them to all the experts again. Let all experts revise their original decision according to the overall evaluation results, and provide their revised decision-making opinions and their basis and reasons. After repeated consultation, induction and modification, the process is over until all the experts are basically in agreement. The final decision result is denoted as s k , k ¼ 0; 1; . . .; 4, where the corresponding fuzzy linguistic variables are ''very poor'', ''poor'', ''average'', ''good'' and ''very good''. The employment stability degree is related to the number of new jobs, the unemployment rate and the quantity of employment. And these three variables can be quantified by the exact real numbers. Similar to Table 3 , we give the expert's evaluation decision sheet (see Table 4 ) on employment stability degree. Each invited expert will give the comprehensive assessment value s h based on their own knowledge and experience according to the values of the number of new jobs (x ij ), the unemployment rate (y ij ) and the quantity of employment (z ij ). The decision-making process of applying Delphi method is similar to the decision-making process of physical condition given above. The final decision result is denoted as s k , k ¼ 0; 1; . . .; 4, where the corresponding fuzzy linguistic variables are ''very low'', ''low'', ''average'', ''high'' and ''very high''. Table 5 . The specific decision-making process based on TTCCA method is as follows. (1) Convert the multi-source heterogeneous data matrix into two-tuple matrix. By using the transformation methods given by Definition 3 to Definition 6, we can covert all multi-source heterogeneous data in Table 5 into two-tuples, and the two-tuple matrix are given in Table 6 . (2) Determine the reference sequence and the compared sequence. Use the aging population (A 0 ) in six consecutive years as a reference sequence, the values of per capita GD (A 1 ), birth rate (A 2 ), death rate (A 3 ), natural growth rate (A 4 ), two-child rate (A 5 ), population density (A 6 ), average annual consumption level of residents (A 7 ), physical condition (A 8 ) and employment stability degree (A 9 ) in six consecutive years as 9 compared sequences. (3) Calculate the two-tuple correlation coefficient between compared sequence h i ðjÞ(i ¼ 1; 2; . . .; 9) and reference sequence h 0 ðjÞ. By using the Eq. (12) in Sect. 3.2.2, we obtain the twotuple correlation coefficient as follows. r 01 ¼ 0:980; r 02 ¼ 0:983; r 03 ¼ À0:718; r 04 ¼ 0:901; r 05 ¼ 0:937; r 06 ¼ 0:002; r 07 ¼ À0:961; r 08 ¼ À0:180; r 09 ¼ À0:109: (4) Ranking of schemes. Step (3), the ranking result of the correlation coefficient obtained after taking the absolute value is as follows. Thus, the ranking result of the influence degree for all m preselected influencing factors is as follows. birth rate 1 per capita GD 1 average annual consumption level of residents 1 two-child rate 1 natural growth rate 1 death rate 1 physical condition 1 employment stability degree 1 population density According to above ranking result of the 9 influencing factors, the birth rate has the largest influence on the population aging in Wuhan, followed by the per capita GDP of Wuhan, and then followed by average annual consumption level of residents and two-child rate, and the population density has the least influence on the population aging in Wuhan. Obviously, 0:9\ r 0i j j\1, i ¼ 1; 2; 4; 5; 7, is satisfied, which means the first five factors, i.e., birth rate, per capita GD, average annual consumption level of residents, two-child rate and natural growth rate, are highly relevant to population aging in Wuhan. Thus, these five factors are selected as the main factors of influencing population aging in Wuhan. In this section, the MLRA model and ARIMA are integrated, and combined with the weighted combination idea, a combination prediction model is established to predict the Influencing factors analysis and development trend prediction of population aging in Wuhan… future number and the proportion of aging population in Wuhan. Specifically, firstly, the five main factors influencing the population aging selected in Sect. 3 are taken as independent variables, and the number of aging population and the proportion of aging population are taken as dependent variables, respectively, to establish the multivariate prediction models for forecasting the number of aging population and the proportion of aging population, respectively. Then, and the number of aging population and the proportion of aging population are taken as dependent variables, respectively, two univariate prediction models based on ARIMA are established to predict the number of aging population and the proportion of aging population, respectively. Finally, based on the predicted results of the above two models for the aging population and the proportion of aging population, combined with the weighted combination idea, a combination prediction model named MLRA-ARIMA is established to predict the number and the proportion of aging population in Wuhan in the next 10 years. MLRA model is to establish multiple linear regression equations of dependent variables to multiple independent variables based on the actual observed values of dependent variables and multiple independent variables, and verify the effectiveness of the model by testing and analyzing the significance of the comprehensive linear influence of each variable on dependent variables. ARIMA is a model established by transforming non-stationary time series into stationary time series, and then make a regression of dependent variables to its lag value and the present value and lag value of random error terms. The former is a multivariate prediction model, while the latter is a univariate prediction model. This section will combine these two prediction models and establish a combination prediction model to predict the future number and the proportion of aging population in Wuhan. In Sect. 3, by quantitative analysis, it is concluded that birth rate, per capita GDP, average annual consumption level of residents, two-child rate and natural growth rate are the main factors of affecting population aging in Wuhan. And the two main variables reflecting the trend of population aging in Wuhan are the number of aging population and the proportion of aging population. In order to effectively analyse the trend of the number of aging population (or the proportion of aging population) with the change of these five main influence factors, we take these five main influence factors as the independent variables, and the number of aging population (or the proportion of aging population) as the dependent variable to establish a prediction model to predict the number of aging population (or the proportion of aging population) based on MLRA (Mori and Suzuki 2018; Rao and Yan 2020; Wang et al. 2020b) . The modeling steps are as follows: Step 1: Model establishment Take the number of aging population (or the proportion of aging population) as the dependent variable, which is denoted as y (or y 0 ), and the per capita GDP, birth rate, twochild rate, average annual consumption level of residents and natural growth rate as five independent variables, which are denoted as x 1 ; x 2 ; . . .; x 5 , respectively. Then the MLRA model is express as where b 0 is a constant term, b 1 ; b 2 ; . . .; b 5 are the regression coefficients of y to x 1 ; x 2 ; . . .; x 5 , e is an unobservable random variable, which is called an error term with the mean value is zero, and the variance is greater than zero, Use the OLS (Zhang et al. 2016) to estimate the parameter b in the model (13), then the sum of the squared errors of the given data is QðbÞ ¼ ðy À xbÞ T ðy À xbÞ: To find the value of b that minimizes QðbÞ, we can get the least squares estimation of b, which is denoted asb, whereb ¼ ðX T XÞ À1 X T Y. Substituteb into Eq. (13), we can obtain the estimated value of y, i.e., y ¼b 0 þb 1 x 1 þb 2 x 2 þ Á Á Á þb 5 x 5 ; and the fitting value of the given data iŝ Y ¼ Xb: Step 2: Hypothesis testing of regression model After the MLRA model is established, the hypothesis of linear relationship between dependent variables and multiple independent variables needs to be tested for significance, that is, the established MLRA model needs to be tested for significance. The determination coefficient R 2 refers to the proportion of the dependent variable y that can be explained by the free variable, and its mathematical model is where SST ¼ P 5 i¼1 ðy i À yÞ 2 is squares sum of dispersion, and SSE ¼ P 5 i¼1 ðy i Àŷ i Þ 2 i is squares sum of residuals. The value range of R 2 is from 0 to 1, and the closer the value of R 2 approaches 1, the better the fitting effect of the established regression equation on the actual observed value is. And the closer the value of R 2 approaches 0, the worse the fitting effect of the established regression equation on the actual observed value is. (2) Residual analysis The Durbin Watson (DW) test (Grose and King 1991) can be used to verify the quality of a model. The expression for the DW statistic is where the value range of DW statistic is from 0 to 4, t is the time, and residuals are collected in chronological order. If a linear random process x t can be expressed as then the ARMA(p,q) model in the form of a delay operator can be expressed by where /ðBÞ ¼ 1 À / 1 B À / 2 B 2 À Á Á Á À / p B p , is called a p-order autoregressive coefficient polynomial, and hðBÞ ¼ 1 À h 1 B À h 2 B 2 À Á Á Á À h q B q is called a q-order moving average coefficient polynomial. The judgment criteria of the ARMA are shown in Table 7 . When the sequence is not a stationary sequence or has an obvious linear trend, the difference method is needed to eliminate the tendency. The first-order difference is rx t ¼ x t À x tÀ1 , the p-order difference is r p x t ¼ r pÀ1 x t À r pÀ1 x tÀ1 , and the k-step difference is r k ¼ x t À x tÀk . Thus, the model of ARIMA can be expressed by If it is expressed by delay operator, we have The above method is to establish a mathematical model based on curve fitting and parameter estimation for one or several groups of time series data, and to predict the future data by testing the fitting effect of the model. The combined prediction model can integrate different types of individual prediction models, which can reduce the systematic error of model prediction and improve the prediction effect (Zhu et al. 2019; Qu et al. 2019; Mao et al. 2020; Wang et al. 2020a) . The modeling principle of linear combination prediction model is as follows: where y 0t is the combination prediction value of the period t, y 1t ; y 2t ; . . .; y nt is the prediction values of different n single prediction models at time t, and W 1 ; W 2 ; . . .; W n is the corresponding combination weights. The key of linear combination prediction model is to determine a reasonable weight W i , which is determined by the minimum variance principle of combination prediction error. When n ¼ 2, where r 2 i ði ¼ 1; 2Þ is the residual variance of the i-th prediction model. When n [ 2, where Q i is the sum of the square residuals of the i-th single prediction model. (2) The combination prediction model Combine the MLRA model established in Sect. 4.1.1 with the ARIMA established in Sect. 4.1.2, we establish a combination prediction model, and the basic steps are as follows: Step 1: Take the five main factors influencing the population aging selected in Sect. 3 as the independent variables, and the number of aging population and the proportion of aging population as the dependent variables to establish multivariate linear regression models, respectively. Then, use the ARIMA to predict the index values of five main influence factors for the future 10 years, and substitute the predicted values into the MLRA model (13), then the predictive value of the number and the proportion of aging population in Wuhan in the next 10 years can be obtained. Step 2: Based on the historical data of the number and the proportion of aging population in Wuhan from 1993 to 2016, establish the ARIMA time series model to predict the number and the proportion of aging population in Wuhan in the next 10 years. Step 3: Combine with the prediction results of the two models in Step 1 and Step 2, the combination prediction values of the number and the proportion of aging population in Wuhan in the next 10 years are given. The combination prediction formula is: where y 1 and y 2 , respectively, represent the predicted value of the number of aging population (or the proportion of aging population) in Wuhan in the same year calculated by the MLRA model and the ARIMA, and W 1 and W 2 , respectively, represent the weight of the predicted value of the two models. Step 4: Model test For the combination prediction model, it is necessary to test the feasibility that whether the model can be used to predict future. The common test method is the mean error test. The calculation formula is as follows. where jD i j is the absolute error of the i-th real value and the fitting value. The calculation method of jD i j isjD i j ¼ŷ i Ày i j j y i ,where y i is the actual real value andŷ i is the fitting value of the combination model. From model (13), take the proportion of aging population as the dependent variable, and the five main factors (per capita GDP, birth rate, two-child rate, average annual consumption level of residents and natural growth rate) influencing the population aging selected in Sect. 3 as the independent variables, and then we establish the MLRA model. The Statistical Product and Service Solutions (SPSS) software is used for parameter estimation of the MLRA model. According to the preliminary regression results, it can be found that although the determination coefficient is 0.997, the P value of several variables is greater than 0.05, which indicates that the regression model is not significant. Therefore, the multicollinearity test of independent variables is carried out by using the method of variance inflation factor, and the relative results are shown in Table 8 . From Table 8 , we can see that the VIF values of variables x 1 and x 4 are much greater than 10, so we can conclude that there is multicollinearity among the independent variables. In addition, we use the conditional number method of eigenvalues to verify the results by using the R software, and the conditional number is 638.8108, which is much greater than 100, indicating that there is a strong multicollinearity. Therefore, the stepwise regression method and the optimal subset regression method are used to select relevant indexes, respectively. The stepwise regression method shows that when the model has only the independent variables x 1 , x 2 , x 4 and x 5 , the AIC of the model is the smallest and the added value of residual sum of squares is the least. Moreover, the optimal subset regression method shows that the maximum adjusted determination coefficient and the minimum Cp value are 0.996 and 3.848, respectively, after removing the variable x 3 . Therefore, after deleting the variable x 3 , the MLRA model is obtained as follows. y ¼ 9:508 þ 0:788x 1 À 0:139x 2 þ 0:003x 4 þ 0:334x 5 : Firstly, the significance of the model (19) is judged. The model fitting results are shown in Table 9 . As can be seen from Table 9 , the VIF values of the independent variables of the model are all less than 10, which means there is no multicollinearity among the independent variables. Moreover, the P values are all less than 0.05, that is, all independent variables are significant, and the multicollinearity of the original model is eliminated. Secondly, the fit goodness test is carried out for the MLRA model (19). We can conclude from the fit goodness test by Eq. (14) and DW test by Eq. (15) that the standard error of the model is small, and the adjusted value of R 2 is 0.936, which means the established model has passed the F test with a significance level of 0.01. Therefore, it can be considered that the fitting degree of the model is good, and the established MLRA model (19) is reasonable. The test value of DW test is 1.160, which indicates that the residual and independent variables are independent of each other. Similarly, take the number of aging populations as the dependent variable, and the five main factors influencing the population aging selected in Sect. 3 as the independent variables to establish another MLRA model given by Eq. (13). The stepwise regression method is used for the selection of relevant indexes, and the model is as follows. y ¼ 88:980 þ 9:846x 1 À 3:644x 2 þ 0:023x 4 þ 2:219x 5 : For the above MLRA model (20), the multicollinearity test, the fit goodness test and the DW test are all passed. In addition, another statistical software, i.e., R Software, is used for regression modeling, and the obtained regression results were exactly the same as models (19) and (20). By predicting the indexes of the main factors influencing the population aging in Wuhan in the next 10 years, together with the MLRA model (19), the prediction results are shown in Table 10 . Influencing factors analysis and development trend prediction of population aging in Wuhan… Similar to the analysis method above, we can obtain the prediction results of the number of aging populations in Wuhan, which is listed in Table 11 . The data of population aged 60 and above in Wuhan from 1993 to 2016 are taken as the original sequence, and the ARIMA is established to analyse and predict the number of aging population in Wuhan. The calculated results are shown in Table 12 . The comparison graph drawn by the original real values and the predicted values are given in Fig. 4. From Fig. 4 , we can observe that the curve corresponding to the predicted values matches the curve corresponding to the original values very well. (1) Stability test The original sequence is taken the logarithm operation, and the autocorrelation test is conducted according to the sequence diagram of the logarithmic sequence. It is found that the autocorrelation coefficient decayed to 0 very slowly with the increase of the number of delayed phases, and the reverse increase trend occurred after the number of delayed phases was 10, so the sequence can be determined as a non-stationary sequence. Further, the unit root test is performed on the original sequence, and it is found that at the significance level of 0.05, the unit root test quantity was 3.2163, which is greater than the critical value of the unit root test at the significance level of 1% to 10%. The null hypothesis is accepted, indicating that the sequence is nonstationary. The above test results show that the original sequence is non-stationary, and the order of the non-stationary sequence can be determined. (2) Determine the order of difference The first-order difference of the original sequence is found to be non-stationary. The results show that the value of the second difference fluctuates around 0. Intuitively speaking, the second difference has no obvious trend and no periodicity. Therefore, the sequence of second-order difference is considered to be a stationary sequence. Further, the autocorrelation test was performed for the secondorder difference sequence, and the results are shown in Fig. 5 . It can be seen from Fig. 5 that the autocorrelation coefficient fluctuates around 0 and is basically close to 0, so the sequence after the second difference can be judged as a stationary sequence. Then, the unit root test is conducted to further confirm its stationarity. The test results are shown in Table 13 . It can be concluded from Table 13 that the unit root test value of the second-order difference is less than the critical value at the significance level of 1%, and the null hypothesis is rejected at the significance level of 0.05. Therefore, the second-order difference sequence of the logarithmic sequence is a stationary sequence, i.e., d ¼ 2. To sum up, the sequence can be considered as a secondorder differential stationary non-white noise sequence, so the ARIMA can be established. (3) Parameter estimation of the model By calculating, it is found that the models of ARIMA(1,2,3) and ARIMA(1,2,1) both meet the requirements. Therefore, a more suitable model should be selected from these two models. The fitting analysis of ARIMA(1,2,3) and ARIMA(1,2,1) is made. The fitting analysis results of these two models are shown in Table 14 and Table 15 , respectively. By comparing the results in Table 14 and Table 15 , it can be seen that the indexes of Akaike's information criteria and Schwarz criterion for ARIMA (1, 2, 3) are all smaller than that of ARIMA(1,2,1), so it can be considered that the ARIMA (1,2,3) is better, and the fitting result of ARIMA(1,2,3) is (4) Applicability test of the model Before the above model (21) is used to predict the future, the adaptability of the model needs to be tested. The residual test results are shown in Fig. 6 . The autocorrelation coefficient, partial autocorrelation coefficient, Q-Stat value and P value all show that the residual sequence does not exist autocorrelation, the residual sequence is stable, and the model fitting is successful, indicating that the model (21) has passed the test. So we can use the ARIMA (1, 2, 3) given by Eq. (21) to predict the number of aging population in Wuhan from 2017 to 2026. The predicted result is listed in Table 16 . In addition, the population in Wuhan from 1993 to 2016 is collected from the Statistical Yearbook of Wuhan, and the proportion of aging population in Wuhan from 1993 to 2016 is calculated. The ARIMA is used to predict the proportion of aging population in Wuhan in the future. After taking the logarithm of the original sequence, the second-order difference is made, and the second-order difference sequence is used for modeling. The established model is ARIMA(3,2,3), and the fitting effect of the model is good, and the accuracy is high. The results of the proportion of aging population in Wuhan in the next 10 years are shown in Table 16 . In Sects. 4.2.1 and 4.2.2, the number and the proportion of aging population in Wuhan are modeled and predicted by using the models of MLRA and ARIMA, and the average relative error are relatively small (that is, the prediction accuracy is relatively high). For example, when the MLRA model is used to predict the proportion of aging population in Wuhan, the fitting precision is 93.6. % When the ARIMA is used to predict the number and the proportion of Influencing factors analysis and development trend prediction of population aging in Wuhan… aging population in Wuhan, the relative error is within 10%. Therefore, it is concluded that both MLRA and ARIMA are applicable to the short-term prediction of aging population in Wuhan. In order to obtain a better prediction result, reduce the systematic error caused by the model, and improve the prediction effect, a combined prediction model can be established to predict the aging trend of population in Wuhan. Due to the high accuracy of the MLRA and ARIMA in this paper, the combination prediction model Significance level of 5% -3.030 Significance level of 10% -2.655 (17) proposed in Sect. 4.1.3, i.e., y ¼ W 1 y 1 þ W 2 y 2 , is used to predict the number and the proportion of aging population in Wuhan in the next 10 years. Here we think the predicted values obtained by these two models are of the same importance, namely, we set W 1 ¼ W 2 ¼ 0:5. Before using the combination prediction model for prediction, the error analysis on the three established prediction models (MLRA, ARIMA, combination prediction model). The model error analysis on the number and the proportion of aging population in Wuhan is shown in Table 17 . It can be seen from Table 17 that in the error analysis, the average relative error of the combination prediction model is the smallest, so the combination prediction model can be used to predict the number and the proportion of aging population in Wuhan in the next 10 years. Based on the prediction results of the MLRA in Sect. 4.1.1 and the prediction results of the ARIMA in Sect. 4.1.2, the combination prediction model given by Eq. (17) is used to predict the number and the proportion of aging population in Wuhan in the next 10 years. The prediction results are shown in Table 18 . We can draw the graph of the prediction results of the combination prediction model listed in Table 18 , which is shown in Fig. 7 . As can be seen from Fig. 7 , the proportion of aging population in Wuhan grew slowly from 2017 to 2020, but the proportion grew faster and faster after 2020, and the proportion of aging population in Wuhan will still increase year by year in the next 10 years. The aging population has an obvious rising trend, and the rising speed is relatively fast. By 2026, the proportion of aging population in Wuhan will reach 39%. Section 3 finds out the main factors influencing the population aging in Wuhan via empirical analysis. Section 4 analyses and forecasts the number and the proportion of aging population in Wuhan by establishing a combination prediction model, and concludes that the number and the proportion of aging population in Wuhan will increase year Autocorrelation Partial Correlation AC PAC Q-Stat Prob Fig. 6 The residual test diagram of the ARIMA(1,2,3) Table 16 The predicted results The number of aging population (ten thousand) Years The proportion of aging population (%) The number of aging population (ten thousand) by year in the next 10 years, and the trend of increase is obvious. Based on these quantitative analysis results, this section will provide some countermeasures and suggestions on how to alleviate the problem of population aging in Wuhan from the aspects of economic development, pension policy, pension security system, and so on. (1) Enhance the recognition degree of population aging in Wuhan Fig. 7 The comparison diagram of the combination prediction model Table 18 The prediction results of the combination prediction model Years The number of aging population (ten thousand) The proportion of aging population (%) Years The number of aging population (ten thousand) The proportion of aging population (%) From 1993 to now, Wuhan is one of the many aging cities, and the aging degree of the population is continued unabated. Thus, population aging has become an important issue that cannot be ignored in the social and economic development of Wuhan. As a new first-tier city, Wuhan's future development strategy and planning should take the population aging into consideration. As can be seen from Table 1 , the proportion of aging population in Wuhan exceeded 10% in 1993 and increased year by year in the next 23 years. It can also be seen from the historical data that the number of aging people aged 60 and above increased from 710,000 in 1993 to 1,727,500 in 2016. The increase of the number of aging people more than doubled with an increasing trend every year. As can be seen from the prediction results of the proportion and the number of aging populations in Table 18 , the proportion of aging population in Wuhan is getting higher and higher in the next 10 years. In 2026, the proportion of aging population in Wuhan will reach 39%. These research results fully illustrate that the population aging problem poses a severe challenge to the economic and social development and management of Wuhan. The government needs to pay more attention to the population aging and take the aging problem as an important problem to maintain the development and stability of Wuhan. In a wider range of areas, comprehensive consideration, coordinated operations, take population aging as a strategic problem to plan and deploy early, formulate effective and feasible countermeasures to eliminate the adverse effects of population aging on social and economic development. (2) Maintain the steady and sustained economic growth of Wuhan In Sect. 3, when analyzing the main influencing factors of population aging in Wuhan, it can be found that the per capita GDP of Wuhan is one of the most important influencing factors of population aging by calculating the twotuple correlation coefficient. Therefore, economic development has an important impact on the population aging of Wuhan. From 1997 to 2016, Wuhan's per capita GDP increased year by year, and the increase rate became larger after 2007. Therefore, maintaining the steady and sustained growth of Wuhan's economy is crucial to solving the problem of population aging. First, economic development can ensure the financial support of implementing the aging policy. Second, from the data in Table 5 in Sect. 3, the residents' consumption level of Wuhan is growing every year, from 14,490 Yuan in 2010 to 24,943 Yuan in 2015, the residents' consumption level increases more than 10,000 Yuan in 5 years, which means the economic growth also contributed to the Wuhan residents' average annual consumption level. Sustained economic growth is a necessary condition to ensure the continuous increase of Wuhan residents' consumption level. From Sect. 3.2, among the main factors influencing population aging in Wuhan, the residents' medical care expenditure is one of the important indexes of measuring the Wuhan residents' physical condition. From the Statistical Yearbook of Wuhan (Wuhan Bureau of Statistics 2020), we can obtain the data of Wuhan residents' medical care expenditure (see Table 19 ). As can be seen from Table 19 , the medical care expenditure of Wuhan residents is increasing year by year, which also indicates that sufficient capital investment is essential to establish a medical security system in line with the actual situation of Wuhan. As a new first-tier city, Wuhan's political, economic and cultural status is selfevident. Only economic development can promote the progress of the city, and only economic development can provide better economic guarantee for solving the problem of population aging in Wuhan. (3) Improve Wuhan's family planning policy From Sect. 3, we concluded that the birth rate and the two-child rate have a great impact on the population aging in Wuhan according to the quantitative analysis of the MLRA model given by Eqs. (19) and (20). From the Data in Statistical Yearbook of Wuhan, before 2010, the birth rate in Wuhan was controlled at around 8%. After 2010, the birth rate began to increase continuously. By 2015, the birth rate in Wuhan reached 12.7%. At the same time, before 2010, the two-child rate in Wuhan was 13%. After 2010, the two-child rate increased rapidly, reaching over 20% in 2014 and 27.13% in 2015. It can be seen that Wuhan is actively responding to the national call and fully implementing the two-child policy. But the birth rate of Wuhan city in 2016 fell to 11.48%, which shows that although the two-child rate is rising, the other fertility rate, such as the one-child rate and three-child rate, have fallen. Therefore, our government should continue to attach great importance to population and family planning, strengthen research on population development strategies, scientifically grasp the trend of population development and determine the concept of giving priority to investment in the all-round development of human beings. Our government must improve the family planning policy further, actively support the national two-child policy and keep the birth rate rising reasonably, perfect the family planning management and service system, make give priority to the family planning to share the fruits of reform and development, and strive to make the population development in harmony with the economic and social development. (4) Vigorously develop the aging industry in Wuhan As can be seen from Table 18 , the population aging degree of Wuhan is becoming more and more serious. The Influencing factors analysis and development trend prediction of population aging in Wuhan… proportion of aging population has increased from 10.3% in 1993 to 20.7% in 2016, doubling the proportion of aging population. From our prediction result, in the next 10 years, the proportion of aging population in Wuhan is still increasing year by year. Compared with 1996, the proportion of aging population in Wuhan will double to 25.907% in 2020, and reach 39.266% in 2026. The growing proportion of aging people will place additional burdens on individuals, families and society. Therefore, to develop the aging industry and to encourage the elderly people to reemployment can ensure that the elderly have economic income, and also can ease the pressure of the family and society. The variables mentioned in Sect. 3, such as medical care expenditure, life insurance expenditure and the quantity of employment, reflect the incomplete development of the aging market. Therefore, the government should attach importance to the development opportunities and formulate relevant plans to expand the aging market, so as to increase economic income and improve the aging market. At the same time, the priority areas of the aging industry in Wuhan should be determined. From Sect. 3, we know that the medical care expenditure, life insurance expenditure and the quantity of employment of Wuhan are the main factors influencing the population aging. Therefore, the development of the aging industry in Wuhan should give priority to the medical care market, insurance market and job creation. (5) Appropriately increase the capital investment for the elderly in Wuhan In Sect. 3, when analyzing the main influencing factors, it is found that the growth rate of residential GDP does not correspond to the growth rate of residents' medical care expenditure and residents' minimum living allowance. According to the data in Statistical Yearbook of Wuhan (Wuhan Bureau of Statistics 2020), the growth rates of the three from 2011 to 2015 are calculated, and the growth rates are shown in Table 20 . From Table 20 , we can see that the growth rate of per capita GDP is significantly different from that of medical care expenditure and residents' minimum living allowance, and both the growth rates of residents' minimum living allowance and medical care expenditure are lower than the growth rate of per capita GDP (except 2011). It can be seen that while the economy is growing, it is also necessary to appropriately increase the capital investment for the population aging, and appropriately increase the growth rate of the residents' minimum living allowance, so that the residents' minimum living allowance can maintain basic life and medical care. (6) Strengthen the implementation of Wuhan's talent introduction policy Wuhan implemented the policy of ''one million college students studying in China'' in 2017. This implementation of the talent introduction policy is of great significance to alleviate the problem of population aging. The processes and trends of demographic age structure change (as can be seen from Fig. 3 ) are influenced by population movements and migration in addition to births and deaths. The population density of Wuhan has been increasing year by year since 1997 (with a sharp increase in 2012), which indicates that the population in per unit land area of Wuhan has been increasing year by year. The net migration of population in Wuhan before 2010 is relatively large, the net migration rate of population before 2010 is positive, and the net migration rate before 2005 presents an increasing trend, indicating that the net migration of population is of great significance to the change of population age structure in Wuhan. The net migration rate of Wuhan is negative after 2010, which indicates that the population migration rate of Wuhan is greater than the population outflow rate. The implementation of talent introduction policy can increase the proportion of population aged 15 to 59 in the total population, leading to the change of population age and the change of the proportion of aging population. Therefore, the government should strengthen the implementation of talent introduction policy. This paper analyses the main factors of influencing population aging in Wuhan and discusses the development trend prediction of population aging in Wuhan. First, under the data information environment that the attribute values of the preselected influence factors are multi-source heterogeneous data (the real numbers, interval numbers and fuzzy linguistic variables coexist), a TTCCA method is proposed to rank the importance of the influencing factors and to select the main influencing factors. Then, a combination prediction model is established to predict the population aging trend in Wuhan, and the predicted values of the aging population in Wuhan in the next 10 years are given. Finally, based on the results of quantitative analysis, some countermeasures and suggestions on how to alleviate the problem of population aging in Wuhan are provided. The contribution of this paper is as follows: (i) In the analysis of the main factors that influence the population aging problem in Wuhan, when the attribute values of the preselected influence factors are the multi-source heterogeneous data, we propose a new TTCCA method based on the traditional correlation coefficient analysis, which provides a new effective approach to solve the optimization selection problem of main influencing factors under the information environment of multi-source heterogeneous data. (ii) Combining MLRA with ARIMA, together with the weighted combination idea, a combination prediction model named MLRA-ARIMA is established to predict the number and the proportion of the aging population in Wuhan. The combination prediction model overcomes the shortcomings of the single model in prediction accuracy and prediction scope, thus effectively enhancing the reliability of the prediction results. In the future research work, we will fully consider the impact of COVID-19, and establish some new models based on our proposed models in this paper for the prediction of the aging population and the analysis of the main factors influencing the population aging in Wuhan. Influencing factors analysis and development trend prediction of population aging in Wuhan… Correlation coefficient of intuitionistic hesitant fuzzy sets based on informational energy and their applications to clustering analysis China's demographic estimation using the generalized stable population model Research on regional difference decomposition and influence factors aging in China Three models for predicting population growth-theoretical foundation, application methods, and revised expressions Radial basis function neural network model applied in the forecast of population aging taking Hunan Province as an example The prediction of China's population aging trend and analysis of the structure based on nonparametric auto-regression model An extended 2-tuple linguistic DEA for solving MAGDM problems considering the influence relationships among attributes Functional and residential status transition among nursing home residents Nonparametric autoregression prediction model on population growth rate The locally unbiased two-sided Durbin-Watson test Principal component analysis on the indirected influencing factors of population aging in Shanxi Province A 2-tuple fuzzy linguistic representation model for computing with words Managing non-homogeneous information in group decision making Prediction and analysis of population in Hubei Province based on Leslie model Population Aging, population growth and economic growth: evidence from China's provincial panel data Population prediction based on age shift algorithm Nonlinear effects of population aging on economic growth Analysis and prediction of Chinese aging based on Bayesian hierarchy spatio-temporal model Analysis on regional differences and influencing factors of population aging in China Pricing strategies and profit coordination under a double echelon green supply chain Multiattribute group decision making based on intuitionistic 2-tuple linguistic information Europe's population at a turning point The prediction of aging coefficient in Beijing A cross-country analysis of population aging and financial security Constructing road safety performance indicators using Fuzzy Delphi Method and Grey Delphi Method Grey-Lotka-Volterra model for the competition and cooperation between third-party online payment systems and online banking in China Population prediction based on leslie matrix and time series analysis Generalized ridge estimator and model selection criteria in multivariate linear regression A novel solution approach for multiobjective linguistic optimization problems based on the 2-tuple fuzzy linguistic representation model An integrated multi-criteria decision-making framework for sustainable supplier selection under picture fuzzy environment Drought prediction based on SPI and SPEI with varying timescales using LSTM recurrent neural network Optimal strategy for a green supply chain considering shipping policy and default risk Study on the interactive influence between economic growth and environmental pollution Multi-attribute decision making model based on optimal membership and relative entropy Location selection of city logistics centers under sustainability An extended uniform-price auction mechanism of homogeneous divisible goods: supply optimisation and non-strategic bidding A hybrid multi-attribute group decision making method based on grey linguistic 2-tuple Decision mechanism for supplier selection under sustainability Compound mechanism design of supplier selection based on multi-attribute auction and risk management of supply chain Comprehensive evaluation of non-waste cities based on two-tuple mixed correlation degree Design of comprehensive evaluation index system for P2P credit risk of ''three rural'' borrowers 2-stage modified random forest model for credit risk assessment of P2P network lending to ''Three Rurals'' borrowers An intelligent hybridization of ARIMA with machine learning models for time series forecasting When old meets young? Germany's population ageing and the current account Translating bioassay results to field population responses using a Leslie-matrix model for the marine amphipod Corophium volutator An empirical study on the impact of population aging on household consumption in China Weighted picture fuzzy aggregation operators and their applications to multi-criteria decision-making problems Tourism environmental impact assessment based on improved AHP and picture fuzzy PROMETHEE II methods International comparison study on the aged trend and reasons in China China's aging population and regional economic growth Preliminary research on aging population and flexible retirement policy of Shanghai Hesitant 2-tuple linguistic Bonferroni operators and their utilization in group decision making Channel integration choices and pricing strategies for competing dual-channel retailers Solving fuzzy regression equation and its approximation for random fuzzy variable and their application Study on the relationship between population policy and economic growth in the context of aging Population ageing in the United States of America: implications for public programmes Wuhan Statistical Yearbook of Wuhan A novel grey Riccati-Bernoulli model and its application for the clean energy consumption prediction Parameter optimization for nonlinear grey Bernoulli model on biomass energy consumption prediction Evaluation model of industrial operation quality under multi-source heterogeneous data information A novel car-following inertia gray model and its application in forecasting short-term traffic flow The probe for the weighted dual probabilistic linguistic correlation coefficient to invest an artificial intelligence project Spatial variation of population aging and associated factors in Jiangsu Province Group multi-criteria supplier selection using an extended VIKOR method with interval 2-tuple linguistic information Some interval-valued 2-tuple linguistic aggregation operators and application in multiattribute group decision making On the dominance of Mallows model averaging estimator over ordinary least squares estimator System reliability prediction by support vector regression with analytic selection and genetic algorithm parameters selection Application of autoregressive and logistic discrete model in Chinese population forecast Carbon price forecasting with variational mode decomposition and optimal combined model Conflict of interest The authors declare that they have no conflict of interest.Ethical approval This article does not contain any studies with human participants performed by any of the authors.Informed consent Informed consent was obtained from all individual participants included in the study.