key: cord-0971448-ghb3l3cx authors: Lu, Yahan; Yang, Lixing; Yang, Kai; Gao, Ziyou; Zhou, Housheng; Meng, Fanting; Qi, Jianguo title: A distributionally robust optimization method for passenger flow control strategy and train scheduling on an urban rail transit line date: 2021-12-24 journal: Engineering (Beijing) DOI: 10.1016/j.eng.2021.09.016 sha: 16f78ab361300204e754375ead7a9d2318373370 doc_id: 971448 cord_uid: ghb3l3cx Regular coronavirus disease 2019 (COVID-19) epidemic prevention and control have raised new requirements that necessitate operation-strategy innovation in urban rail transit. To alleviate increasingly serious congestion and further reduce the risk of cross-infection, a novel two-stage distributionally robust optimization (DRO) model is explicitly constructed, in which the probability distribution of stochastic scenarios is only partially known in advance. In the proposed model, the mean-conditional value-at-risk (mean-CVaR) criterion is employed to obtain a tradeoff between the expected number of waiting passengers and the risk of congestion on an urban rail transit line. The relationship between the proposed distributionally robust model and the traditional two-stage stochastic programming (SP) model is also depicted. Furthermore, to overcome the obstacle of model solvability resulting from imprecise probability distributions, a discrepancy-based ambiguity set is used to transform the robust counterpart into its computationally tractable form. A hybrid algorithm that combines a local search algorithm with a mixed-integer linear programming (MILP) solver is developed to improve the computational efficiency of large-scale instances. Finally, a series of numerical examples with real-world operation data are executed to validate the proposed approaches. The outbreak of coronavirus disease 2019 (COVID-19) has been greatly affecting people's daily travel patterns, resulting in significant uncertainty regarding passenger demand in urban public transit. As the main artery of public transit, the urban rail transit system transports a large number of passengers shuttling between different lines and stations every day. In some megacities, the passenger demand in peak hours far exceeds the transport capacity at busy stations (especially at some transfer stations), and a great many passengers must wait on the platforms and are even stranded, resulting in a serious crowdedness issue [1] . According to the World Health Organization, people who leave their home during an epidemic period should refrain from crowding to avoid cross-infection. In other words, passenger congestion in the confined environment of stations necessarily increases the chance of cross-infection and decreases travel comfort. Thus, practical and effective measuressuch as limiting the vehicle loading capacity, keeping a safe distance between passengers, and so forth-are needed to ensure safe operation of rail transit with the gradual restoration of social order and the increase in traffic volume brought by the resumption of work and schools. At present, two types of measures have been applied in China's urban rail transit systems, which do indeed mitigate the risk of congestion at stations with large passenger demand: ① The first measure involves passenger flow control. To cope with large passenger demand, the best choice is to impose passenger flow control; that is, to limit the speed of passenger flow entering the platform and make passengers queue in an orderly manner in station halls, so as to avoid passengers gathering on the platforms and thus reduce the risk of cross-infection while improving operation safety. For example, so far, up to 96 stations in the Beijing rail transit system have imposed routine passenger control strategies. ② Another practical method is demand-oriented train scheduling; this is a traditional approach used to deal with heavily congested passenger flow in peak hours, which has been studied deeply in the past years [2] [3] [4] . More precisely, based on detailed information on dynamic passenger demand, this method aims to shorten the train headway to enhance the capacity of the urban rail transit line/system and optimize train schedules so as to minimize the passenger waiting time at stations. However, the phenomenon of huge passenger flow during peak hours has become normative in some megacities, such that even the maximum departure frequency is insufficient to significantly ease the heavy congestion [5] . In addition, as far as most transfer stations are concerned, the number of transfer passengers in practical operations has far exceeded the volume of outside arrival passengers. Xizhimen Station in Beijing can be taken as an illustration. As shown in Fig. 1 , the transfer passenger demand at this station is always greater than the outside arrival passenger demand, which is particularly obvious during peak periods. Although some studies [5] [6] [7] [8] [9] have paid a great deal of attention to this problem, most still neglect to control transfer passengers simultaneously with outside arrival passengers. To relieve crowding and enhance efficiency as much as possible on an overall line, there is an urgent need to study passenger flow control strategies combined with train scheduling under the influence of transfer passengers-a need that is not addressed in the existing literature. In reality, some important information on the urban rail transit system, such as passenger demand, is usually uncertain. To capture this uncertainty, studies [10, 11] have used stochastic scenarios with deterministic probability distributions to characterize the randomness of passenger demand, and the scenario-based time-dependent passenger demand can be recorded by the automatic fare collection (AFC) system. However, it is difficult to obtain the accurate probability distribution of the involved scenarios in advance, especially in real-world applications [12] . Based on partial known probability information, the modeling methodology corresponds to the distributionally robust optimization (DRO) approach, which has recently been applied to the field of transportation planning and management. Enlightened by this method, the present study aims to find a robust passenger flow control strategy that works in all passenger demand scenarios with partially known probability. For clarity, Fig. 2 shows the passenger control process, in which the time-dependent passenger demand of two scenarios is represented by blue and red curves, respectively, and a robust passenger flow control strategy (represented by the purple line) is generated to control the passenger entering speed and thereby reduce platform congestion. This decision-making process is associated with the following potential risks incurred by the two-fold uncertainty: ① Dynamic outside arrival passengers and transfer passengers are both highly uncertain, and are described by stochastic scenario sets; and ② in practice, operators might only know the support of the probability distribution information of the scenarios, while precise distribution information is out of reach. Under these uncertain conditions, risk management has not been sufficiently studied in the existing literature related to the passenger flow control problem, to the best of our knowledge, so it will also be explicitly addressed in our study. Imbalance of supply and demand is the main cause of congestion on an urban rail transit line. Based on this fact, the existing literature has focused on regulating capacity and demand from the supply and demand sides, respectively, in order to alleviate congestion. On the supply side, the literature mainly optimizes the train schedule to meet the dynamic passenger demand with uneven temporal and spatial distribution, so as to avoid a deficiency or insufficiency of transport capacity; on the demand side, a passenger flow strategy, differential pricing, and other measures are mainly used to guide and limit the passenger flow. In addition, the existing studies that focus on dealing with uncertainty in the field of transportation tend to adopt different methodologies, including stochastic programming (SP), robust optimization (RO), and DRO. The following discussion provides a detailed overview of the state of art in the literature. Due to the increasing gap between the limited transport capacity and the continuous growth of passenger demand, many urban rail transit lines in China are suffering from overcrowding. The question of how to deal with this problem is thus the focus of operation management departments and many researchers. As a substantial increase in transport capacity is almost impossible in the near future, implementing passenger flow control at congested stations or lines is the best choice to reduce crowdedness. Many researchers have devoted time and effort to this problem, and the related studies can be summarized into the following two categories: the first category is based on the mathematical programming approach and the other one is based on the computer simulation method. • Passenger flow control based on the mathematical programming approach. In recent years, an increasing number of researchers have employed the mathematical programming approach to study the passenger flow control problem, which includes three stratification levels: the station level, line level, and network level. Considering the fixed train schedules and the time-dependent passenger demand, existing studies generally aim to minimize the delay time or the waiting time of passengers. For example, Xu et al. [6] proposed a multi-objective optimization model for an urban rail transit station under uncertain demand in order to calculate the number of passengers that should be controlled through the genetic algorithm. In addition, Xu et al. [8] addressed the problem of passenger flow control in an urban rail transit network during peak hours. Their problem was formulated as a mixed-integer programming model to simultaneously adjust the number of inbound and transfer passengers, which could be solved by the genetic algorithm. Shi et al. [9] developed an integer linear programming model to generate an optimal passenger flow control strategy for an urban rail transit system. Based on analyses of the alighting and boarding processes, Wang et al. [14] formulated an integer programming model to achieve the optimal state for the entire urban rail transit line, with a focus on minimizing the average passenger delay. Yuan et al. [15] established a mixed-integer linear programming (MILP) model in order to minimize the total passenger waiting time in an urban rail transit network and solved it by CPLEX software. By introducing a timetable-oriented space-time network representation, Meng et al. [16] proposed an integer linear programming model to produce high-quality passenger flow control strategies for an urban rail transit line. • Passenger flow control based on the computer simulation method. In view of the complex dynamic characteristics of the operation state in an urban rail transit system, computer simulation modeling can reveal the occurrence and evolution of passenger flow congestion, and thus give full play to the advantages of simulation modeling in this problem. Numerous published studies use the simulation method to investigate the passenger flow control at stations. Hoogendoorn and Daamen [17] established microscopic pedestrian dynamics models to assess the design safety in large passenger flow scenarios in Lisbon metro stations. To simulate passenger flow at the Xuanwumen Station (a transfer station) in the Beijing rail transit system during peak hours, Zhang et al. [18] used Vissim software, which achieved good performance. Seriani and Fernndez [19] studied the effects of pedestrian traffic management on passengers' boarding and alighting time, and used experimental means to obtain the standards for pedestrian traffic management at the platform and doors of metro vehicles. Fei and Liu [20] used Anylogic software to formulate a simulation model of passenger flow organization and accurately identified the congestion points according to the simulation results. Jiang et al. [21] proposed a coordination model of passenger flow control and train jumping based on the Q-learning approach, which could be efficiently solved by simulation methods. Over the past several decades, a large and growing body of literature has proposed different mathematical models and solution algorithms for train scheduling problem in urban rail transit. For example, to minimize passenger waiting time, Niu and Zhou [4] developed a nonlinear programming model in which the train schedule for the whole line could be optimized by the genetic algorithm. They then took oversaturated situations into consideration and formulated a mixed-integer nonlinear programming model [22] . Li et al. [23] studied the fairness issue in the train scheduling process. To optimize train schedules by considering min-max fairness and α fairness, they established a mixed-integer programming model that could be solved by a simulated annealing-based adaptive large neighborhood search algorithm. Yin et al. [24] proposed a mixed-integer programming model to fix the coordinated train scheduling problem with unlimited train boarding capacity. Yang et al. [25] developed a 0-1 integer linear programming model for the last train timetable in an urban rail transit network, which could be effectively solved by a heuristic algorithm based on Lagrangean relaxation. Liu et al. [26] formulated a mathematical optimization model to determine a robust train schedule for maximizing robustness and minimizing total energy consumption, so as to avoid delay propagation as much as possible. Huang et al. [27] considered the rescheduling problem in urban rail transit systems. Two mixed-integer nonlinear programming models with different recovery strategies were formulated to reschedule trains during track disruption, which could be linearized by a hybrid approach combining big-M and time-indexed formulations. Tian and Niu [28] addressed the demand-oriented train scheduling problem under overtaking operations. Their problem was formulated into an integer linear programming model, and a corresponding novel surrogate-dual-variable column generation algorithm was proposed to generate approximate optimal solutions. Mo et al. [29] proposed a mixed-integer nonlinear model to generate an optimized train timetable with suitable train speed profiles, as well as an efficient rolling stock plan, aiming to minimize both passenger waiting and travel times. Qi et al. [30] formulated an integer linear programming model for the integrated stop planning and timetabling problem, with the aim of minimizing the total travel time of trains. Overall, the above studies address the passenger flow control problem and the train scheduling problem separately, thus failing to achieve an optimal system effect or reduce the pressure on urban rail transit operation to the greatest extent. It should be noted that integrated optimization of the above two aspects is much more challenging than dealing with them separately because of the complex coupling relationship between trains and passenger flow. To date, only three papers can be found in the literature that use a mathematical programming approach to carry out an integrated study of these two aspects: Xu [31] formulated a quadratic programming model to generate a multi-station collaborative passenger flow control strategy and a corresponding train schedule; in this model, the decision variables include the number of inbound passengers and the dwell time of trains at certain key stations. Shi et al. [5] established an integer linear programming model to obtain an optimal passenger flow control strategy for the whole line and a corresponding train schedule to improve the service quality, in which the passenger flow control strategy was implemented on each timestamp. Gong et al. [32] implemented the passenger flow control based on the train status. However, none of these studies consider the influence of transfer passengers. It is well known that transportation activities usually faces a lot of dynamics and uncertainties due to system complexity [1, 11, 33, 34] . With the development of optimization theory and the improvement of computing technologies, an increasing amount of attention has been paid to how to deal with uncertain parameters. Thus far, three major kinds of approachesnamely, SP, RO, and DRO-have been developed to handle uncertainty. More specifically, the SP method assumes that the probability distribution of uncertain parameters is known in advance. For example, Gong et al. [35] considered the stochastic environment in an urban rail transit system and developed an integer nonlinear programming model to simultaneously optimize the number of train services, the headway settings, and the speed profile selection decisions. Errico et al. [36] constructed a two-stage SP model for the vehicle routing problem with hard time windows, which could be solved by exact branch-cut-and-price algorithms. It is widely recognized that the SP method is based on a given probability distribution. However, the real value of probability distribution is difficult to obtain in practice and usually needs to be estimated through historical data. Due to the existence of estimated errors, major deviations may occur in practical applications, which is a great weakness of this method. The RO approach considers that uncertain parameters are all based on uncertainty sets without any distribution assumption. The aim of the RO approach is to overcome the uncertainty and find the best possible solutions that satisfy all constraints. For example, to handle the uncertainty of passenger demand, Yang et al. [10] proposed a MILP model with the max-min reliability criterion, aiming to increase the number of last-train successful transfer passengers and reduce the total running time. Qi et al. [37] studied the train timetabling and stop-planning problem with uncertain passenger demand. An integer linear programming model was formulated and solved by CPLEX software. Based on the technique of Light Robustness, Cacchiani et al. [38] proposed three MILP models to derive robust train-stop plans and timetables, in order to reduce passengers' inconvenience. Here, we note that a serious drawback of the RO approach is that it is too rigid and conservative; however, it does not require the distribution information to be known in advance, unlike the SP method. The rapid development of the DRO approach has overcome the aforementioned shortcomings of the SP and RO methods. The DRO approach combines the statistical learning technique with optimization theory; it can obtain a good enough solution by assuming that the parameters obey some possible distributions. The theories underpinning the DRO approach have been well studied over the last decades [39, 40] . For example, Delage and Ye [39] produced one of the first studies on this approach, in which the DRO models were formulated under the first-order and second-order moment inequalities, respectively. Esfahani and Kuhn [40] proved that the dual form of a Wasserstein-metric DRO model could be decomposed into finitely convex optimization problems with tractable computational complexity. Recently, this approach has also attracted much attention from certain engineering fields [41] [42] [43] [44] [45] [46] [47] . For example, focusing on the planning and scheduling problem under uncertain demand, Shang and You [41] formulated a two-stage DRO model to produce less conservative solutions. For finite and infinite horizon optimal control problems, Van et al. [42] proposed distributionally robust constrained formulations to provide frameworks that could generate robust control policies. Considering the integration of urban and rural public transport systems, Shang et al. [43] developed a DRO model with ambiguous chance constraints regarding the travel time requirement, with the aim of minimizing the total construction cost. For a disaster relief management problem under demand uncertainty, Wang et al. [44] proposed a DRO model to simultaneously optimize the integrated facility location, inventory pre-positioning, and delivery decisions. Yang et al. [45] presented a distributionally robust chance-constrained program to model the last-train coordination planning problem in an urban rail transit system, in which uncertain passenger demand was captured by an ambiguous set. Wang et al. [46] focused on the hub location problem with multiple commodities under uncertain demand and cost. This problem was modeled as a DRO model, which was further reformulated as a moderate-sized mixed-integer linear program for ease of calculation. Zhang et al. [47] established a DRO model with the Wasserstein distance-based ambiguity set for the vehicle routing problem with time windows, aiming to minimize the risk of late customer arrivals. Thus far, situations with uncertain passenger demand have received less attention in the area of passenger flow control. As far as we know, there is only one relevant reference in this field that addresses uncertain demand: Meng et al. [11] proposed an integer linear programming model on an urban rail transit line by applying the SP method; in this model, uncertain passenger demand was described using a stochastic scenario set with completely pre-given distribution information. Moreover, there is no relevant theoretical achievement employing the DRO approach in the field of passenger flow control combined with train scheduling. In summary, the passenger flow control problem has been a hot research topic in recent years, and some mathematical models and simulation methods to address this problem have been well studied. Nevertheless, most of these approaches still separately consider or partially integrate the three aspects of passenger flow control, train scheduling, and SP method. Moreover, no existing study simultaneously deals with the two-fold uncertainty with respect to the passenger demand and scenario probability, even though it is a significant and challenging issue for real-world applications, as it is difficult to predict passenger demand accurately. With this concern, this paper is particularly interested in developing a DRO method for the passenger flow control strategy and train scheduling on an urban rail transit line, as well as a powerful algorithm with fast computing speed. To fill the abovementioned gaps, this paper aims to make the following contributions to the study of passenger flow control and train scheduling problem on an urban rail transit line: (1) A DRO model, which focuses not only on both outside arrival passengers and transfer passengers, but also on train schedules, is rigorously formulated. Compared with the existing literature, this model has two significant improvements: ① The proposed model considers the inherent uncertainty of passenger demand. Since passenger demand in the real world is usually uncertain and dynamic, a series of stochastic scenarios are developed to depict this uncertain parameter in this paper. ② The precise probability distribution information of each stochastic scenario does not need to be known completely in advance-that is, only the partial distribution information is needed, which is the main reason why this approach is employed in this paper. Since very few existing studies consider the inherent uncertainty of passenger demand and the characteristics of scenario probability cannot be predicted in advance, this is a novel idea. (2) Our distributionally robust formulation is a semidefinite programming model, which is a notoriously difficult problem to solve. To convert this model into a computationally tractable form, a discrepancy-based ambiguity set is employed. Furthermore, the equivalent MILP model can be obtained through the strict theoretical proofs. Based on an analysis of the complexity of our formulation, we propose an efficient hybrid algorithm by combining a local search algorithm (LS) with a suitable MILP solver to generate high-quality solutions in an acceptable computation time, especially for large-scale instances. (3) Finally, to demonstrate the effectiveness and practicability of our proposed approaches, a series of numerical experiments are implemented based on operating data from Line 15 in the Beijing rail transit system. We compare the experimental results of our DRO model with those of the traditional SP model and the classic RO model. These experiments demonstrate the performance and advantages of the proposed optimization model and solution approaches. The remainder of this paper is organized as follows. In Section 2, we give a detailed description of the passenger flow control and train scheduling problem on an urban rail transit line. In Section 3, we formulate the problem into a two-stage SP model and a DRO method, respectively. Then, Section 4 proposes a hybrid algorithm based on the LS and a suitable MILP solver. In Section 5, we implement a case study, i.e., a large-scale case based on the operating data of Line 15 in the Beijing rail transit system, to show the superiority of our proposed approaches. Finally, Section 6 gives some conclusions and future research. Consider an oversaturated urban rail transit line, which consists of a series of common and transfer stations (S or and S tr are used to denote the sets of common and transfer stations, respectively), and a set of physical segments between stations (the kth segment is defined as the connection from stations k to k + 1). Denote the set of stations on this line as S, then . In operations, the transfer between different lines is a common passenger behavior at each It is widely recognized that passenger demand in an urban rail transit system is time-dependent and varies at different time periods, during which outside arrival passengers continuously come, while transfer passengers are intermittently centralized due to the arrival of trains from adjacent lines at certain intervals. In addition, passenger demand is often uncertain in practical operations due to the influence of various factors. Thus, to describe the dynamic and stochastic passenger demand more realistically, the time horizon is discretized into a series of timestamps with discretized time granularity t unit in this study, denoted by , and a stochastic scenario set-that is, -is employed to characterize the {1,2, , } uncertainty. Note that, since it is difficult to know the probability of each scenario precisely in advance, in order to be more realistic, we characterize this parameter by imprecise discrete probability distributions. Similar to the existing literature [11] , the dynamic and stochastic passenger demand is depicted by and , which represent the number come,out ( , ) of outside arrival passengers and the number of transfer passengers from adjacent lines at station k and timestamp t in scenario ω, respectively. It should be noted that the aforementioned passenger demand can be obtained from historical AFC data through travel demand assignment approaches [48] . For an urban rail transit line with uncertain passenger demand, congestion is directly related to the following four factors: outside arrival passengers, transfer passengers from adjacent lines, the schedules of involved trains, and the train capacity. Due to the limited improvement of train capacity in real-world operations, the most effective method to alleviate congestion is to consider the effects of the other three factors from the system perspective, and to carry out collaborative optimization of the train schedules and passenger flow control strategies. In the process of passenger flow control, outside arrival passengers are first required to queue in station halls, and they are then released to the platform to board train under the robust i passenger flow control strategy at station in scenario (denoted by ). Nevertheless, the transfer passengers k  inc,out , ) are allowed to go to the platform freely and wait for the next train arrivals, as shown come,tr ( , ) Fig. 4 . Once the number of outside arrival and transfer passengers exceed the capacity of the trains, outside passengers who arrive later than this time must be detained in the station hall; they then gradually enter the platform according to the robust passenger flow control strategy. In this paper, the control strategy is only imposed on outside arrival passengers, while transfer passengers are released to the platform and can take trains freely. In addition, all the passengers on the platform are required to take the train that arrives earliest, without waiting for the next train, in order to guarantee operation safety. In summary, the focus of this paper is to develop a multi-scenario mathematical formulation for the robust passenger flow control problem combined with train scheduling on an urban rail transit line, with the goal of minimizing the sum of operating time, the expected number of waiting passengers, and the conditional value-at-risk (CVaR) for making decisions. Our formulation also explicitly captures the time-dependent and stochastic passenger demand-that is, the outside arrival passengers and the transfer passengers-and the train capacity, aiming to achieve the optimization of system performance. The proposed mathematical formulation is based on the following three assumptions. The outside arrival and transfer passenger demand is known in advance by processing the historical AFC data. A similar assumption has been widely adopted in previous studies [1, 8, 11] . Assumption 2: The section running time and station dwell time of trains are pre-given by rail managers according to the actual operating data. That is, these parameters are not affected by the numbers of boarding or alighting passengers. Assumption 3: We only consider the direction from station 1 to station as the operation direction, while the other S direction can be operated according to the same optimization method. In addition, the inbound and transfer passengers, who are released to platforms under the robust passenger flow control strategy, can all be taken away by the first arrival train. This assumption is common in passenger flow control and train timetabling studies [4, 32] . In this section, we first introduce a two-stage SP model for passenger flow control and train scheduling problem with completely known probability information in Section 3.1. We then formulate a novel DRO model with uncertain probability in Section 3.2. To characterize the above problem in a stochastic version, the modeling process will be explicitly discussed in the following sections. In Section 3.1.1, we first list all the related notations and decision variables in the formulation. The events that describe the dynamic processes of passenger loading and train operations are introduced in Section 3.1.2. In Section 3.1.3, we present a two-stage SP model based on the mean-CVaR criterion. In this paper, we use boldface letters to represent vectors and V is employed to represent the set of random variables. Uncertainty is modeled via a set of stochastic scenarios, the probability distribution of which is denoted as p[·]. We use to represent the expectation of x with probability distribution p[x]. Based on these definitions, we next develop a two-[ ] In these decision variables, and determine the arrival and departure times of train i at station k, respectively; is also a scheduling variable, which corresponds to a train timetable (i.e., the headway between adjacent trains, the , ( ) : ratio of outside arrival passengers who arrive at station v heading to station k in scenario ω, For the sake of clarity, this section introduces a non-increasing binary variable d i,k (t) to denote the departure characteristics of train i at station k and timestamp t. Specifically, d i,k (t) = 0 indicates that train i has departed from station k at timestamp t, and d i,k (t) = 1 indicates otherwise. As shown in Fig. 5 , train i departs from station k at timestamp 2. It is clear that a series of "1" for d i+1,k (t) -d i,k (t) indicates the departure headway between trains i and i + 1 at station k, during which passengers can board train i + 1 according to the robust passenger flow control strategy. By using this binary variable, it is possible to formulate the problem of interest as an optimization model with linear forms, as described in the following discussion. A two-stage SP model based on the mean-CVaR criterion is formulated in this section, in which the distribution of the uncertain passenger demand is assumed to be completely known in advance and is captured by multiple scenarios with deterministic probability distribution. In the first stage, we propose a formulation that is only relevant to train scheduling, shown in the equations below: [ ] , , is the random optimal value of the second-stage model, denotes the first-stage decision vector of train ( , ( )) represents the random input data of passenger flows, is the expected number of waiting passengers, λ is ( ) ξ   the tradeoff coefficient that reflects the decision-makers' risk preference, ζ 1 and ζ 2 are two cost coefficients, and CVaR denotes the conditional value-at-risk criterion, which is a popular risk measure to model the decision-maker's risk preference. Compared with the value-at-risk (VaR) criterion, CVaR highlights the average level of excess losses that occur beyond the VaR cutoff point in the distribution. Therefore, CVaR is superior to VaR in reducing uncertainty risk, since more risk-averse decisions can be obtained by adopting the CVaR criterion. This could be particularly helpful in preventing serious situations in which a large number of passengers congregate at an urban rail transit station. The objective function in Eq. (1) aims to minimize the sum of the operating time (i.e., T 0 ), the expected number of waiting passengers, and the CVaR for making decisions. In particular, if λ = 0, the obtained solution will be the result of not taking the CVaR for making decisions into consideration. It is clear that this objective function will minimize the operating time if ζ 2 is set to zero. In this case, this problem turns out to be a supply-side-oriented optimization. However, when considering multi-scenario coupling, only minimizing the operating time will inevitably increase the risk of service unfulfillment. Thus, the CVaR criterion is employed to handle the decision risk caused by uncertainty, and the number of waiting passengers is the most common decision criterion in the passenger flow control problem. Given the section running time and the station dwell time , Eqs. (2) and (3) In the second stage, the corresponding robust passenger flow control strategy is optimized, which is formulated as follows: waita,out , ( , ( )) min ( ) s.t. come,out come,tr ina , wait,tr , on , on inc al , 1 , , In this stage, we need to formulate the relationship between the train schedule and the robust passenger flow control strategy, by establishing their coupling constraints. The objective function Eq. (10) aims to minimize the total number of waiting passengers for different scenarios. As we consider the direction from station 1 to station as the operation direction, and | | S thus station is the terminal station with no passenger demand, the number of waiting people at this station is zero. By | | S imposing Eq. (11), it is ensured that all passenger demand should be satisfied over the considered time horizon T. The number of passengers waiting for train i in the station hall can be calculated by Eq. (12) . Since all outside arrival passengers need to queue in the station hall and wait for boarding permission, the number of waiting passengers is the difference between the cumulative number of outside arrival passengers and the cumulative number of passengers being served. Eq. (13) tracks the number of transfer passengers boarding train i at station k. As the transfer passengers can directly reach the platform, these passengers must have a high priority to get on trains. By using Eqs. (12) and (13), passenger demand and train schedules can be tightly connected in a series of unified formulations. Eqs. (14) and (15) require the number of waiting passengers to be non-negative because the number of passengers who are allowed to get on any train i cannot be greater than the number of outside arrival and transfer passengers, respectively. Next, we analyze the dynamic passenger loading process, which consists of two types of activities: boarding and alighting. Since multiple stochastic scenarios are considered in this study, a robust passenger flow control strategy, , which is inc , ( ) If the discrete probability distribution of different scenarios can be specified, we can formulate the expected number of waiting passengers as a computationally tractable form, which can be expressed as follows: [ ( , ( ))] ( , ( )) where , is the probability of scenario ω such that , and 1 2 ( , ,..., ) Similarly, the third part of the objective function, , 1 2 ( , ( )) ( ( , ( )), ( , ( )),..., ( , ( )) in which denotes decision-makers' risk preferences, and the value of represents the maximum loss of the function   at a given value of risk preference . ( , ( )) For clarity, an auxiliary variable t(ω) is introduced here, and Eq. (25) can be reformulated as follows: 1 min 1 s.t. ( , ( )) 0 where , e is the unit vector. According to the above analyses, the two-stage SP model can be formally expressed as follows: In practice, it is rather difficult to precisely obtain the probability distribution of stochastic scenarios. In such a circumstance, the probability distribution is assumed to be only partially known; in this study, it belongs to an ambiguity set, which will be analyzed in the following sections. Based on the notations in the previous sections, we formally propose the following DRO model in Section 3.2.1; next, in Section 3.2.2, we specifically design a discrepancy-based ambiguity set for our developed model. In Section 3.2.3, we further reformulate the DRO model as a MILP model. Finally, we analyze the complexity in terms of the number of decision variables and key constraints in Section 3.2.4. Given the uncertain probability of each stochastic scenario, a DRO model based on the worst-case mean-CVaR criterion is formulated in this section, as shown below: in which we use to denote the worst-case evaluation over all possible distributions, and represents an sup{} P   p P ambiguity set characterized by the available partial distributional information. Below, we discuss the equivalent formulations for the DRO model. First, the objective function in model (Eq. (28)) can be transformed into According to the strong dual property of , we can change the order of the operators , leading to the ( , ( )) Then, the proposed two-stage DRO model (Eq. (28)) (i.e., the DRO model) can be equivalently formulated as follows: Here, the computation processes of and are dependent on the property of the ambiguity (33) Note that the DRO model (Eq. (28)) is a semidefinite programming, which is computationally intractable for general ambiguity sets. Hence, to solve it directly using a commercial solver (e.g., CPLEX) within an acceptable computation time, we can transform the DRO model (Eq. (28)) into its computationally tractable form. In the next section, a special ambiguity set is designed to characterize the distribution uncertainty, and the corresponding computationally tractable forms of the DRO model (Eq. (28)) are deduced. In this study, uncertainty is modeled via a series of stochastic scenarios, and the corresponding probability distributions are characterized by a discrepancy-based ambiguity set [49] , that is, P, which is defined as follows: To be specific, p 0 is a vector representing the nominal distribution of the discrete probability, ς denotes the fluctuation vector, and Ψ is a real value between 0 and 1. Note that the condition e T ς = 0 is formulated to ensure that the vector p meets the requirements e T (p 0 + ς) = 1. That is, the discrepancy-based ambiguity set describes the deviation from a reference (often empirical) distribution. The reasons why we use this ambiguity set are threefold: ① This set is simple and practical. The former part, p 0 , can be obtained from the data of different dates in practical operations, which is easy to realize in engineering. ② This set is convenient for transformation into a computationally tractable form by applying the related theoretical analysis. ③ This set often possesses asymptotic properties [50] . Under the designed ambiguity set, the deterministic equivalent formulation can be obtained by invoking the theoretical results of Ref. [49] , which are shown as follows. where are auxiliary variables or vectors, and Ψ = eΨ. , , , , , Under the discrepancy-based ambiguity set P, Eq. (32) can be transformed into the following equivalent form: where . With the strong duality theory of linear programming, the dual form of constraint (32) can be formulated as the following MILP model: Likewise, the dual programming of Eq. (33) is developed as the following linear programming model: Combining models (Eq. (37) and Eq. (38)), under the discrepancy-based ambiguity set , the DRO model (Eq. (28)) is equivalent to the following MILP model: ( , ( )) ( Eq Eqs. , ( )) 0 (2) 23) , ( 0 s. By regrouping the terms, model (Eq. (39)) can be represented as model (Eq. (35)). The proof of Theorem 1 is thus complete. Remark 2. In the case of Ψ = 0, the DRO model (Eq. (35)) is equivalent to the two-stage SP model (Eq. (27)). The Table 1 . To illustrate this problem more clearly, an instance with 40 in-service trains, 20 stations, 180 timestamps, and three scenarios is provided in order to identify the total number of decision variables and constraints. In this case, there are over 150 000 decision variables and 319 980 constraints, which indicates that the proposed model (Eq. (35)) is a fairly complex problem. Based on the analyses above, if the scale of the problem is small, the proposed model (Eq. (35)) can be solved by means of MILP solvers (e.g., CPLEX) in a short time, due to the linearity of the model. However, it is difficult to solve real-world large-scale instances. Thus, we need to design heuristic algorithms to search for approximate optimal solutions within an acceptable computation time, in order to meet the needs of real-world applications. Therefore, an effective hybrid algorithm combining a LS with a MILP solver will be developed in this section to find high-quality solutions. The framework and the detailed techniques of our hybrid algorithm are introduced in the following discussion. An algorithm framework is developed to solve the proposed DRO model, following the procedure shown in Fig. 6 . In brief, the original problem (Eq. (35)) is decomposed into two subproblems. The first subproblem (denoted by MP) determines the decision variables related to the train schedule for the focus line, while the second subproblem (denoted by SP) optimizes the control variables for uncertain passenger demand over all the involved scenarios. A LS is designed to search for feasible train schedules. Then, each feasible schedule is taken as the input for subproblem SP to produce the corresponding robust passenger flow control strategy, which is used to evaluate the generated schedule in subproblem MP. We observe that subproblem SP is a MILP model that can theoretically be solved to optimality. However, similar to the problem in Ref. [11] , the scenarios in subproblem SP are tightly coupled across different scenarios, making it difficult for the problem to be solved directly by MILP solvers. Therefore, we decompose it here into a series of single-scenario-based subproblems SSP in order to speed up the computation, in which the train schedule is the input for subproblem SP and SSP. As each subproblem SSP is a MILP problem and does not involve the coupling relationship among different scenarios, it can be efficiently solved using a suitable MILP solver. and then add the following constraint to subproblem SP. inc,out inc,out,SSP , , ( ) , , , The solution space of the decision variable , which is the key variable that couples the various scenarios together, inc,out , ( ) i k P  is greatly reduced by Eq. (41) ; thus, Eq. (41) acts as a so-called "valid inequality" to the problem of interest. The proposed valid inequality is then taken as the input for subproblem SP. The LS is an iterative searching procedure that starts from an initial solution and attempts to find a better solution. It should be noted that, due to the existence of time-related binary variable d i,k (t), the number of decision variables in subproblem MP is huge. For simplicity, this paper assumes that the running time over each section and the dwell time at each station are identical for different in-service trains. To guarantee the headway in operations, it is therefore sufficient to ensure that the departure headway can be kept at the start terminal station. In addition, since the arrival time for the first train is known in advance, the decision variable mentioned above can be converted into the headway between two consecutive trains arriving at the start terminal station. Thus, this paper takes a -dimensional headway vector to represent a solution for the train ( 1|) I  scheduling problem MP. To be specific, we adopt a problem-based solution representation, which consists of only one type of decision variable-that is, the headway vector -in which h i,1 denotes the headway between , which significantly reduces the complexity of subproblem MP. 1 I  To solve subproblem MP, an initial solution should first be generated. Next, the approximate optimal solution is searched in a greedy way. During the searching process, only the best solution in the neighborhood space can be accepted as a seed solution. Finally, the algorithm is terminated when the current best solution cannot be updated within a given maximum number of iterations, or when the number of search iterations reaches a predetermined threshold. Then, the best solution that has been encountered is outputted as a near-optimal solution. For the sake of description, we introduce some important notations in the algorithm: n denotes the index of iteration, n max denotes the threshold of the number of iterations, M denotes the maximum number of iterations for which the encountered best solution is not updated, X n denotes the best solution at iteration n, X 0 denotes the initial solution, m denotes the number of candidate solutions at each iteration, N(X n-1 ) denotes the neighborhood at iteration n, X* denotes the current best solution, f* denotes the current best objective value, and f n denotes the best objective value at iteration n. In this study, the initial solution is generated randomly as the seed solution for the first iteration. That is, we first generate a total of headway variables randomly within a given range , and thus an initial solution X 0 can be produced,  whose feasibility can be tested by Algorithm 1, shown below. If this solution is feasible, it will be set as the initial solution; otherwise, we need to regenerate the headway vector until a feasible solution is found. In the searching process, the initial solution will be treated as the input data for subproblem SP, and the corresponding objective value is used as the evaluation of this train schedule. n is greater than a pre-given threshold n max , the search process should be terminated. ② If the objective value of the currently encountered best solution is not updated within M iterations, the LS is terminated, and the current best solution can be treated as the near-optimal solution. We use y to denote the number of iterations that the current best solution is not updated. In this section, a series of numerical experiments based on real-world operation data from the Beijing rail transit system are conducted to demonstrate the effectiveness and performance of our proposed models and algorithm. More specifically, the stochastic and dynamic passenger demand is obtained by processing historical AFC data, and the running time of each section is deduced from the practical timetable. To solve this large-scale problem within an acceptable computation time, we use the proposed algorithm framework to generate near-optimal solutions for the models. The algorithm is coded by MATLAB 2014a with CPLEX 12.6 on a Windows 10 personal computer with Intel Core i5-10500 central processing unit (CPU) and 16G random-access memory (RAM). As shown in Fig. 7 , this set of experiments consider Line 15 of the Beijing rail transit system (i.e., the blue line), which has a total length of 41.4 km and 20 stations. This line connects a residential zone in a suburban town with the downtown working area, and numerous commuters need to ride this line on weekdays, resulting in over-saturation in the morning peak hours in the downtown direction. We only consider the direction from station Fengbo (FB) in the suburbs to station Qinghuadongluxikou (QH) downtown as the experiment environment, for which the station dwell time and the section running time are taken from real operation data, as shown in Table S1 in Appendix A. We select the morning peak hours from 7:00 to 10:00 as the considered time horizon. To characterize the dynamics of the passenger demand, the discretized time granularity is set as t unit = 60 s in this series of experiments, and a total of 180 timestamps are involved (i.e., ), {1, 2,...,180} T  so as to balance the real-world operation conditions and the computation efficiency. To depict the uncertainty of passenger demand, three typical stochastic scenarios with unknown probabilities are taken into account, for which the nominal probability distribution is predetermined to be . To determine the ambiguity set, the value of the 0 (0.20, 0.30, 0.50) T  p adjustable parameter Ψ is set as 0.02, 0.06, and 0.10, respectively; the risk-aversion parameter α is chosen from the set ; and the tradeoff coefficient λ is chosen from the set . For illustrative clarity, the dynamic {0.95, 0.50, 0.05} {0.10, 0.50, 0.90} outside arrival passenger demand in scenario 1 is displayed in Fig. 8 . In addition, it should be noted that there are four transfer stations on this focus line: station Wangjing (denoted as WJ) connects Lines 15 and 14, station Wangjing West (WJX) connects Lines 15 and 13, station Datunlu East (DTLD) connects Lines 15 and 5, and station Olympic Green (ALPK) connects Lines 15 and 8. These four stations attract many transfer passengers in addition to the outside arrival demand. In the data preparation stage, the passenger ratios for the outside arrival and transfer passengers at different destinations are given in Tables S2 and S3 in Appendix A, respectively. In the algorithm, the threshold of the number of iterations n max is set as 100. The other relevant parameters are listed in Table S4 in Appendix A, where C is set as the capacity using the 120% full-loading rate of each in-service train. To test the proposed DRO model (Eq. (35)), we consider the computation results by using different combinations of parameters α, Ψ, and λ. It should be noted that the cost coefficients ζ 1 and ζ 2 in the objective function are used to make tradeoffs between operations and services. Here, to make the comparison of optimization results more obvious and to consider the difference in the order of magnitude for two objectives, we set ζ 1 as 100 000 and ζ 2 as 1 in this set of experiments, and the detailed sensitivity analyses related to these two coefficients are shown in Section 5.5. The computation results are displayed in Table 2 . For clarity, Fig. 9 presents the optimal objectives obtained by the DRO model with different combinations of parameters. Typically, with the increase of parameter Ψ, the optimal objective of the model (Eq. (35)) gradually worsens. Furthermore, the tradeoff coefficient λ and the optimal objectives have the same monotonicity, and the optimal objective increases when the risk-aversion parameter α increases. These phenomena are consistent with the theoretical results. More specifically, the parameter α represents the level of risk aversion; that is, α = 1 represents extreme risk aversion. Therefore, a larger parameter α leads to a more conservative decision plan, which corresponds to a larger objective value. In the ambiguity set, Ψ represents the disturbance range of the uncertain probability. As the parameter Ψ increases, the distribution of probability information to the decision-maker becomes increasingly coarse, resulting in much worse outcomes for the decision. For clarity, Fig. 10 displays an optimized train schedule and the number of in-vehicle passengers in each train over each section, where the red lines represent the full-load sections. Clearly, most departure headway during the 7:00-7:42 time period is relatively small, and the full-load situations are all associated with the trains that depart from the start terminal station during this time period, which demonstrates that the train schedule and passenger demand are closely coupled. Next, we are particularly interested in investigating how the objective value converges to the optimal value in the searching process of our proposed algorithm. Since the convergence tendency of the objective values is similar for different instances, we only present the results of the instance with α = 0.95, λ = 0.10, and Ψ = 0.02 for illustrative convenience here, as shown in Fig. 11 . It follows from this figure that the best objective value can be iteratively reduced by the proposed LS + MILP solver algorithm, where the solution quality is gradually improved with a fast convergent speed. More specifically, the best objective value decreases drastically in the first several iterations, but is not updated any longer in the last few iterations. Thus, we conclude that the LS + MILP solver algorithm exhibits a good performance in solving the developed DRO model (Eq. (35)) in terms of solution quality, while requiring a reasonable computation time. Furthermore, to clarify the impact of the tradeoff coefficient, λ, in the following context, we will discuss the variations of the expected number of waiting passengers and CVaR when parameter λ changes. We summarize this section with the following observation. Observation 1. (The necessity of considering different combinations of α, λ, and Ψ.) It follows from the computation results that the combination of parameters α, λ, and Ψ has significant effects on the DRO model. If the decision-makers prefer a lower risk decision, it is better to set a bigger α and λ, and to know the probability distribution as precisely as possible (in which case, Ψ should be small). That is, the level of risk aversion and the disturbance range of uncertain probability are significant factors in the process of optimizing the train schedules and passenger flow control strategies. Next, we further consider the variation of the expected number of waiting passengers and CVaR with respect to different values of parameter λ. For comparison, we set parameter α as 0.50 and set Ψ as 0.02 and 0.06, respectively. Fig. 12 depicts the variation trend of the expected number of waiting passengers and CVaR with different values of λ. Clearly, for two instances in this figure, the CVaR yields an approximately linear increasing tendency in the optimal objective if we increase the coefficient λ, while the expected value shows the opposite trend when α and Ψ are fixed. In addition, when λ is set as 0.5, the risk preference of the decision-makers is neutral. Thus, the expected value and CVaR are very close in this case, as shown in Fig. 12 . These findings are summarized as the following observation. In this section, numerical experiments are conducted by solving the two-stage SP model (Eq. (27) ). Since the probability distribution is known exactly in this model, the disturbance range of the uncertain probabilities-that is, Ψ-is equal to 0. Therefore, we display the computation results with different combinations of α and λ in Fig. 13 . As shown, the optimal objective shows an increasing tendency with an increase in the risk-aversion parameter α if the tradeoff coefficient λ is fixed. In addition, when the level of risk aversion is fixed, the optimal objective follows the same trend with respect to the tradeoff coefficient λ. It should be noted that, since parameter Ψ is set as 0 in the SP model (Eq. (27)), the objective values should theoretically be less than those of the DRO model (Eq. (35)), with the same parameter settings. Below, we further analyze their differences between the SP model and the DRO model, by conducting a series of comparative experiments. For clarity, the following formula is used to quantify the differences (denoted as Price) in the optimal objective values between the SP model (termed SP*) and the DRO model (denoted as DRO*): Table 3 provides a detailed performance comparison for various combinations of α, λ, and Ψ. From the computation results in Table 3 , it can be observed that the DRO model always yields a slightly larger objective value than the SP model, since the values of Price are all positive. This demonstrates that, in an urban rail transit system, the more precise the information of probability distribution we know (i.e., the smaller Ψ is), the better the passenger flow control strategy and train schedule are. In this set of experiments, it can be seen that the maximum Price calculated is not greater than 0.051640%, which corresponds to a very small extra cost in the process of handling distribution uncertainty. Hence, in terms of solution stability, our proposed DRO model is superior to the SP model. Observation 3. (The superiority of the DRO model in comparison with the SP model.) To summarize, we conclude that the DRO model (Eq. (35)) utilizes limited distribution information to hedge against the uncertainty more robustly than the SP model (Eq. (27)), while requiring a relatively small Price. In this section, we compare the performance of the classic RO model in Eq. S(2) and the DRO model (Eq. (35)). Since the risk measure, CVaR, is not involved in the classic robust method, we set λ = 0 in the following experiments. For concise descriptions, the classic RO model (Eq. S(2)) is referred to as the RO model. Fig. 14 provides a comparison of the results when adopting different Ψ and α in the models. As shown in Fig. 14 , regardless of what Ψ is, the approximate optimal objective of the DRO model is smaller than that of the RO model. The reason for the poor optimization result of the RO model might be that the RO model is designed to optimize the worst-case scenario rather than the expected value, which leads to overconservative solutions. It should be noted that Ψ represents the disturbance range of the uncertain probability. We thus conclude that the uncertainty can be handled well by our proposed DRO model, regardless of whether it is strong or weak. In addition, we set different values of α to carry out experiments, and the results show that the objective values remain the same when parameter α varies. This finding is consistent with the following analyses: since α is set as zero in these experiments, the coefficient λ/ (1 -α) in the objective function is 0 for any value of α, so the corresponding part vanishes. ② When a decision-maker is extremely risk averse, he/she can employ the classic RO method to optimize the worst-case scenario, so as to obtain a conservative solution; otherwise, we recommend the DRO approach with an appropriate combination of parameters, which can be selected flexibly according to the decision-makers' preference and actual requirements. In this section, we perform sensitivity analyses with respect to the variation of two parameters, ζ 1 and ζ 2 . These two parameters are used in the objective function of the DRO model (Eq. (35)) to balance the operating time (denoted as Obj. 1) related to the interests of the operators and the target of the second stage relating to passengers (denoted as Obj. 2). To be specific, if the value of ζ 2 /ζ 1 goes to ∞, then Obj. 2 takes precedence over Obj. 1, which implies that more weight is given to passenger interests; otherwise, the proposed model mainly focuses on the optimization of the train schedule for operators. In particular, we aim to discuss how different values of ζ 2 /ζ 1 , leading to different train schedules, can affect the dynamic passenger control process. Table 4 reports the results obtained by varyingζ 1 , ζ 2 , and ζ 2 /ζ 1 . We list the objective values in the fifth column, and list the values of Obj. 1 and Obj. 2 in the sixth and seventh columns, respectively. As shown, when the value of ζ 2 /ζ 1 is between 0.01 and ∞, the value of Obj. 2 remains constant until ζ 2 /ζ 1 drops to 0.00001. The experiment with ζ 1 = 1 and ζ 2 = 0 does not consider the interests of the passengers, but only optimizes the train schedules. We note that the value of Obj. 2 increases drastically in this case. This computation result indicates the importance of collaboratively optimizing the robust passenger flow control strategies and the train schedule in order to ease congestion. In addition, in order to analyze the additional costs caused by the ambiguous distribution, we present the Price of each instance in Fig. 15 . Clearly, although the value of Price varies considerably with respect to different ζ 1 and ζ 2 , it is still acceptable, since it does not exceed 4.00% at most. Moreover, for instances ζ 1 = 1 and ζ 2 = 0, the Price turns out to be 0, since the objective function does not take Obj. 2 into account. Table 4 Sensitivity analyses of ζ 1 and ζ 2 with α = 0.50, λ = 0.10, and Ψ = 0.02. Observation 5. (Sensitivity to parameters ζ 1 and ζ 2 ): Considering both Obj. 1 and Obj. 2, especially regarding the results of the last experiment, it can be seen that only optimizing the train schedule is insufficient to achieve the system optimum on the considered oversaturated line. To improve the service quality as much as possible, the decision-makers of the urban rail transit system must pay more attention to the collaborative optimization of robust passenger flow control and train scheduling approaches, and moreover, they must carefully determine the value of ζ 2 /ζ 1 based on the practical operation needs. To reduce over-saturation of the urban rail transit line, this paper investigated the integrated problem of collaboratively optimizing passenger flow control strategies and train schedules, for a situation in which the passenger demand is uncertain and is characterized by a series of stochastic scenarios with partially known probability distribution. A multi-scenario twostage DRO model was formulated to generate near-optimal strategies for the passenger flow control and train scheduling under distribution ambiguity. The mean-CVaR criterion was adopted in the objective function, and the goal was to minimize the sum of the operating time, the expected number of waiting passengers, and the CVaR of congestion, the coefficients of which could be adjusted flexibly according to the practical situation in order to achieve the most suitable balance. Since the proposed model is usually computationally intractable, a discrepancy-based ambiguity set was constructed by using partial information on the uncertain probability distribution of the stochastic scenarios. Computationally tractable forms of the proposed model were also deduced. To obtain high-quality solutions for large-scale problems within an acceptable computing time, we developed a hybrid algorithm by combining a LS with a suitable MILP solver. To verify the performance of the proposed approaches, a series of numerical experiments were conducted using actual operation data from a Beijing rail transit line. The computation results show that our proposed DRO model is superior to the SP model in terms of solution stability. In addition, our model is superior in terms of solution quality and solution stability, compared with the classic RO model. These findings validate the proposed DRO method and the LS + MILP solver algorithm. Future research can be dedicated to the following aspects: ① In the process of characterizing the uncertain probability distribution, a discrepancy-based ambiguity set is used in this study. Thus, a promising extension would be to construct multitype ambiguity sets for capturing the probability distribution, such as a Gaussian ambiguity set or a polyhedral ambiguity set. ② The study of integrated and interconnected multimodal transport systems remains a hot topic for the future development of urban mobile ecosystems. Hence, the collaborative optimization of the passenger flow control in a multimodal transportation system may be an interesting research direction for comprehensively easing congestion. ③ In addition, it should be noted that this paper only considers the operation environment of an urban rail transit line. An interesting direction for future research would be to investigate efficient algorithms for more complex lines or networks in Beijing and in other representative cities. Timetable coordination in a rail transit network with time-dependent passenger demand Exact formulations and algorithm for the train timetabling problem with dynamic demand Locating optimal timetables and vehicle schedules in a transit line Optimizing urban rail timetable under time-dependent demand and oversaturated conditions Service-oriented train timetabling with collaborative passenger flow control on an oversaturated metro line: an integer linear optimization approach Capacity-oriented passenger flow control under uncertain demand: algorithm development and real-world case study Metro passenger flow control with station-to-station cooperation based on stop-skipping and boarding limiting Passenger flow control with multi-station coordination in subway networks: algorithm development and realworld case study Cooperative passenger flow control in an oversaturated metro network with operational risk thresholds MILP formulations and a TS algorithm for reliable last train timetabling with uncertain transfer flows Collaborative passenger flow control for oversaturated metro lines: a stochastic optimization method Robust optimization Optimization of conditional value-at-risk Modeling and optimization of collaborative passenger control in urban rail stations under mass passenger flow Passenger flow control strategies for urban rail transit networks Collaborative passenger flow control on an oversaturated metro line: a path choice approach Design assessment of Lisbon transfer stations using microscopic pedestrian simulation Application of VISSIM in pedestrian simulation of MTR stations Pedestrian traffic management of boarding and alighting in metro stations Simulation analysis of metro congestion points and optimization method Q-learning approach to coordinated optimization of passenger inflow control with train skip-stopping on an urban rail transit line Train scheduling for minimizing passenger waiting time with time-dependent demand and skip-stop patterns: nonlinear integer programming models with linear constraints Trade-off between efficiency and fairness in timetabling on a single urban rail transit line under timedependent demand condition Mixed-integer linear programming models for coordinated train timetabling with dynamic demand Collaborative optimization of last-train timetable with passenger accessibility: a space-time network design based approach A robust and energy-efficient train timetable for the subway system Coupling time-indexed and big-M formulations for real-time train scheduling during metro service disruptions Optimization of demand-oriented train timetables under overtaking operations: a surrogate-dual-variable column generation for eliminating indivibility An exact method for the integrated optimization of subway lines operation strategies with asymmetric passenger demand and operating costs An Integer Linear Programming model for integrated train stop planning and timetabling with timedependent passenger demand Calculation of Station service capacity and adaptability in urban rail transit [dissertation Equity-oriented train timetabling with collaborative passenger flow control: a spatial rebalance of service on an oversaturated urban rail transit line Simulation modeling and optimization for ambulance allocation considering spatiotemporal stochastic demand Evacuation based on spatio-temporal resilience with variable traffic demand Train timetabling with dynamic and random passenger demand: a stochastic optimization method A priori optimization with recourse for the vehicle routing problem with hard time windows and stochastic service times Robust train timetabling and stop planning with uncertain passenger demand Robust optimization models for integrated train stop planning and timetabling with passenger demand uncertainty Distributionally robust optimization under moment uncertainty with application to data-driven problems Data-driven distributionally robust optimization using the Wasserstein metric: performance guarantees and tractable reformulations Distributionally robust optimization for planning and scheduling under uncertainty Distributionally robust control of constrained stochastic systems Distributionally robust cluster-based hierarchical hub location problem for integration of urban and rural public transport system Tractable approximations for the distributionally robust conditional vertex p-center problem: application to the location of high-speed railway emergency rescue stations Distributionally robust last-train coordination planning problem with dwell time adjustment strategy Distributionally robust hub location Robust data-driven vehicle routing with time windows Urban travel demand: a behavioral analysis Distributionally robust design for bicycle-sharing closed-loop supply chain network under risk-averse criterion Data-driven distributionally robust polynomial optimization This work was supported the National Natural Science Foundation of China (71621001, 71825004, and 72001019), the Fundamental Research Funds for Central Universities (2020JBM031 and 2021YJS203), and the Research Foundation of State Key Laboratory of Rail Traffic Control and Safety (RCS2020ZT001). Yahan Lu, Lixing Yang, Kai Yang, Ziyou Gao, Housheng Zhou, Fanting Meng, and Jianguo Qi declare that they have no conflict of interest or financial conflicts to disclose. Input: a solution that needs to be checked Output: the feasibility of the solutionStep 1: according to Eqs. (2)- (5) and the input solution, calculate the arrival and departure times for all in-service trains at each station, then go to Step 2;Step 2: if , go to Step 3; otherwise, the solution is infeasible, stop;Step 3: the solution is feasible, stop. In order to generate the neighborhood for each iteration, a total number m of candidate solutions must be generated. For example, the set of candidate solutions at iteration n can be denoted by , where X n,c N X X X X X   represents one of the candidate solutions and c denotes the index of candidate solutions. More specifically, we can generate the candidate solutions by adding or subtracting a tiny integer vector on X n-1 , according to the following calculation:where the value of is randomly generated.It is worth mentioning that we need to test the feasibility of each newly generated neighbor solution by means of Algorithm 1. If the solution is feasible, we then insert it into the neighborhood of the current seed solution; otherwise, a new solution must be regenerated until its feasibility is guaranteed. The procedure of the LS + MILP solver algorithm is reported below as Algorithm 2, which is designed based on the aforementioned specific techniques. Two criteria are employed to terminate the searching process: ① If the current iteration Algorithm 2. The procedure of the LS + MILP solver algorithm.Step 1: initialize the iteration index and generate a feasible initial solution . Calculate the objective value of the initial solution; let