key: cord-0028370-k6ojmzty
authors: Zhou, Wei; Zheng, Jian; Xiao, Yingjie
title: An online identification approach for ship domain model based on AIS data
date: 2022-03-10
journal: PLoS One
DOI: 10.1371/journal.pone.0265266
sha: 67532be31191b021bbd9760993bf1c6d0a9d6156
doc_id: 28370
cord_uid: k6ojmzty

As an important basis of navigation safety decisions, ship domains have always been a pilot concern. In the past, model parameters were usually obtained from statistics of massive historical cumulative data, but the results were mostly historical analysis and static data, which obviously could not meet the needs of pilots who wish to master the ship domain in real time. To obtain and update the ship domain parameter online in time and meet the real-time needs of maritime applications, this paper obtains CRI as the weight coefficient-based PSO-LSSVM method and proposes to use short-term AIS data accumulation through the risk-weighted least squares method online rolling identification method, which can filter nonhazardous targets and improve the identification accuracy and real-time performance of nonlinear models in the ship domain. The experimental examples show that the method can generate the ship domain dynamically in real time. At the same time, the method can be used to study the dynamic evolution characteristics of the ship domain over the course of navigation, which provides a reference for navigation safety decisions and the analysis of ship navigation behavior.

The ship domain is an important concept of water transportation, which was defined in 1975 by Goodwin [1] . To ensure their own safety, ships need to maintain a certain safe distance from surrounding ships during navigation, and the ship domain model is the way to describe this space scope [2] . Research on ship domain models has experienced decades of development. Building a practical ship domain model can better describe ship behavior. For example, Reference [3] proposes the novel concept of the probabilistic ship domain, which depicts the ship domain boundary as a vague value. An adaptive ship safety domain is proposed with spatial risk functions in Reference [4] . The reasonableness and superiority of establishing a ship domain model considering the factors affecting both one's own ship and other ships are analyzed in Reference [5] . Reference [6] presents a new ship domain model that is both realistic and practical, including various factors considered by seafarers based on the awareness values formed. In Reference [7] , a free-form ship domain was developed empirically for navigation in confined waters, and the size of the ship domains was assumed to be dynamically enlarged with increased ship speeds. Through the use of a machine learning algorithm, Reference [8] develops intelligent ship domain models, which better represent the usual navigation practice than traditional approaches. From the perspective of the development process of ship models, their research methods and shapes have gone through a process from experience, statistics, analytical expression, data mining and intelligent technology, from simple geometric shapes to complex shapes, and from static to dynamic shapes [9] . The utilization of AIS information has received increasing attention, and many scholars have begun to use massive AIS data to analyze and obtain ship domain models [10, 11] . Reference [12] observes ships sailing during a four-year period by AIS, estimates how closely ships pass each other and fixed objects, and then establishes an empirical minimum ship domain. Considering the navigation characteristics of ships with limited maneuvering capability and the influence of ships on the ship effect, an algorithm to determine the boundary of the ship domain model is proposed, and experiments are carried out using AIS trajectory data in Reference [13] . According to Reference [14] , the surrounding waters of the target ship are divided into grids, and then the grid densities of ships are calculated to determine the shape and size of the ship domain. By analyzing a large number of ship-encounter samples obtained from AIS data, the available maneuvering margin (AMM) can be used in Reference [15] to explain the first evasive maneuvere, and finally, the size of the ship domain can be empirically estimated. From Reference [16] , through AIS data, the relationship between ship-avoidance behavior and the nearest encounter point is analyzed; the ship-avoidance behavior is quantified, and then the ship domain boundary is obtained. In Reference [17] , the method was proposed by using a large volume of AIS data to obtain the ship domains in restricted waters.

The above research results provide ideas and references for obtaining ship domain models using AIS data, but most of them are statistical analyses of diachronic data, lacking real-time dynamic research. With the need for real-time information navigation applications, it is necessary to propose a more dynamic and real-time ship domain model acquisition method. Different from traditional statistical methods, the method in this paper makes AIS data serve navigation safety better, enables ships to quickly perceive the dynamics of surrounding ships, identifies the actual parameters of the ship domain, and generates the boundary of the ship domain so that pilots can grasp the dynamics of their own ship domain in time and provide a basis for their navigation safety decisions more quickly. The main work of this paper is as follows:

The idea of generating ship domains online is proposed to provide a reference for the dynamic application of AIS data in navigation safety decision making. Short-term accumulated AIS data are adopted. When the minimum requirements for identification data are met, the collision risk index, which is obtained by the PSO-LSSVM method, is used as the control method of identification error filtering, combined with the weighted least square method, to quickly generate the ship domain and carry out dynamic rolling updates. In this paper, starting from a period of the navigation process, the change in the ship domain brought by the change in ship speed and navigation water area in the dynamic navigation process is observed to discover the evolution law of the ship domain, grasp its dynamic change in time, and provide a reference for an in-depth and detailed study of its change.

The rest of this paper is organized as follows: Section 2 introduces the ship domain model to be identified, considers the data required for model identification using AIS, and describes the corresponding ship-encounter parameter calculation formula. Then, Section 3 proposes a schematic diagram of online identification, including single online identification and realtime online rolling identification. In Section 4, the PSO-LSSVM method for collision risk index estimation is introduced, and the identification method of collision risk weighting is established. Then, in Section 5, an experimental example combined with the application of this method is described, and the results are analysed. In Section 6, based on the experimental results, some dynamic evolution laws of the ship domain in navigation are summarized. Finally, Section 7 summarizes the main findings and applications of the methods described in this paper, as well as suggestions for future work.

In terms of ship domain shapes, ship domain models usually include circular, fan-shaped, elliptical, quasi-elliptical, polygonal, etc. Davis [18] proposed the circular ship domain, and Goodwin proposed a similar fan-shaped ship domain [19] . The Fujii model [20] , Goldwell model [21] , Kijima model [22] , and quaternion ship domain model [23] are elliptic or quasielliptic ship domain models, while the polygonal ship domain is proposed in References [24, 25] . To meet the need for fast online identification, the mainstream elliptic model is used as the basis for ship domain model identification. The elliptic equation is as follows:

where a, b and (x 0 , y 0 ) are the main parameters to be identified. According to the equationsolving requirements, at least four or more target ship data are required to identify model parameters. To meet the identification accuracy requirement, the position of the target ship should be distributed in four quadrants of the coordinate system around the ship, not concentrated in one or a few quadrants. Considering that the identification model is nonlinear, to improve the accuracy and efficiency of identification, this paper adopts the collision risk weighted least square method to carry out online identification.

AIS data contain dynamic information required by ship-encounter situations, which is the basis for identifying ship domain models. Only by calculating the ship encounter parameters can the superposition of the target ships around the ship be dynamically grasped, the situation distribution of the ship encounter be obtained, the required data for solving the domain model equation be obtained, and then the model parameters can be identified. The parameters are calculated as follows [26] : Suppose that the corresponding coordinates of the longitude and latitude of the ship are (x O , y O ); the speed is v O ; the course is C O , and the corresponding coordinates of the longitude and latitude of the target ship are (x T , y T ); its speed is v T , and its course is C T . D ¼ ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi

V r ¼ ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffiffi

Considering the requirement of position superposition processing of the target ship, the position distribution of the target ship around the ship is marked by distance using Eq (2) and relative bearing using Eq (3). In Eq (3), the value of the arctangent function in the calculation formula of the relative bearing should be determined according to the coordinate quadrant where the target ship is relative to the center of the ship, and the angle value is [0,360˚]. K is the ratio of ship speed using Eq (4). V r is the relative speed using Eq (5) . C r is relative course using Eq (6) . DCPA and TCPA can also be obtained using Eqs (7) and (8) . In addition, the main parameters of this paper are shown in Table 1 .

As the parameters are identified online and need to be updated dynamically, only a small amount of data can be relied on to obtain the final results. This is significantly different from the results obtained from massive data mining. Therefore, it is necessary to solve the problem of reliability of the method and meet the requirement of the minimum dataset required for identification. Based on the above considerations, this paper proposes the following identification schematic. The core idea is to achieve the required dataset identification through short-term data accumulation. The dataset is updated by rolling to meet the requirement of real-time results. Differential data processing can be realized through collision risk to improve the identification progress. At the same time, the average collision risk and the number of identification data points were used as references for the reliability of the conclusions. Combined with the average collision risk and dynamic identification results, the pilot can better grasp the real-time dynamics of the ship domain. Compared with the traditional model, the pilot can also understand the safety margin of the compressible space in the ship domain. The results are a research method of the dynamic evolution of the ship domain and a reference for navigation safety decision-making. The schematic diagram includes single identification and dynamic rolling identification. 

1. Through the VTS center of the maritime administration of the jurisdiction, AIS data can be obtained. Then, according to the AIS data, the meeting situation between the target ship in the surrounding waters and the ship can be obtained through dynamic and real-time calculations. Considering the general range of the ship domain, the boundary is 3 nautical miles for open waters and 1.5 nautical miles for narrow waters.

2. DCPA, TCPA, relative distance D, relative bearing Q, velocity ratio K and other parameters are calculated dynamically. The ship collision risk CRI is obtained by the PSO-LSSVM method, and the risk of the target ship is graded.

3. With a unit time interval, ships that meet the requirements of the encounter range are identified as target ships and alternatives that can participate in the identification data of the ship domain model. 4 . When the number of identified target ships reaches the required number of ships for model identification, online parameter identification can be carried out quickly; the current ship domain model parameters can be dynamically updated, and the ship domain boundary curve can be drawn.

To avoid the identification error caused by the small amount of data, ship collision risk is adopted as the method of filtering nonhazardous targets. As the error weight coefficient of identification, CRI can control the influence of nonhazardous targets on the results so that the dangerous targets are reflected in the identification results, and the meaning of the ship domain as the scope of ship navigation safety is better reflected.

According to the above process, when the ship domain elliptic equation can be identified and the error accuracy requirements are met, the ship domain identification results can be dynamically output. The identification schematic diagram is shown in Fig 1 . 

CRI, namely, the collision risk index, is a parameter used to measure the degree of ship collision risk. Usually, the value range can be [0, 1]. The higher the value, the more dangerous it is. The DCPA, TCPA, relative distance D, relative bearing Q and ship speed ratio K can be used to measure the relevant indicators of ship collision risk. The indicator value and weight of CRI are related to the judgment of expert experience, and this paper refers to the fuzzy calculation result of referring to the survey results of experts in References [27, 28] as a training data sample. Then, CRI is obtained through PSO-LSSVM. This method has good small-sample generalization learning ability and can well reflect the expert experience in the numerical results of CRI. 4.1.1. PSO. Particle swarm optimization (PSO) is a fast iterative algorithm to find the optimal particle in the search area and the search for the optimal particle as the solution of the optimization problem [29] . In the iterative process, the individual extremum p best and global extremum g best are constantly updated, and the particles are constantly updated accordingly. The i particle in the n dimension is represented as

, and the global extremum of the particle swarm is p best = (p i1 , p i2 ,� � �,p in ). The velocity and position of particles are constantly updated with the extreme value, which are calculated as Eq (9) , and finally find the optimal solution.

where v k i is the velocity of particle i in iteration k; x k i is the corresponding position; ω is the momentum coefficient; c 1 , c 2 is the learning factor, c 1 , c 2 2(0,2); and r 1 , r 2 is a random number between (0,1). 

The least squares support vector machine (LSSVM) model is an improvement on the standard SVM. LSSVM uses the equality constraint and linear solution to replace the inequality constraint and nonlinear solution in SVM, which has higher accuracy and efficiency. To train sample set {(x i , y i )|i = 1,2,� � �,n}, x i 2R m , y i 2R, x i is input vector, y i is the output value, and n is the number of training samples. High-dimensional nonlinear mapping is φ:

where H is a high-dimensional characteristic space, in which the sample set is as follows:

where w is the weight vector and b is the value of the offset.

According to the principle of minimizing structural risk, the objective function of the LSSVM optimization problem is established as follows [30] :

The constraint condition is as follows:

where w is the sample weight vector; ξ i is the sample relaxation variable; and γ is the penalty factor for error. The Lagrange solution equation of the objective function is as follows:

where α i is the Lagrange multiplier. The optimal parameters α and b can be obtained by the following KKT conditions. [31]

When w and ξ i of the characteristic space are eliminated, the optimization problem is transformed as follows:

where I is the n order identity matrix, Θ = [1,2,� � �,n], α = [α 1 , α 2 ,� � �,α n ], y = [y 1 , y 2 ,� � �,y n ], and K is the kernel function matrix.

According to Eqs (15) and (16), the regression function of LSSVM is [32] :

If the CRI is acquired by the LSSVM, its model diagram is shown in Fig 3. The kernel function of the LSSVM method in this paper is the Gaussian radial basis function (RBF), and the equation is as follows [33] :

where σ is the kernel function parameter. The smaller the value is, the easier it is to underfit, and the larger the value is, the easier it is to overfit. Because of its simple structure and strong generalization ability, the RBF function can satisfy the rapid optimization of parameters.

The accuracy of the LSSVM method depends on the values of the kernel parameter σ and the penalty factor γ. The PSO algorithm can quickly find the optimal σ and γ and reduce the tedious parameter adjustment operation [34] . The flow of CRI acquisition by PSO-LSSVM is shown in Fig 4, and steps are as follows:

Step 1. The parameters σ and γ in the LSSVM are initialized, and the velocity and position of each particle are determined in PSO.

Step 2. The LSSVM method is used to train each particle, and the training results find the optimal position, whose value is equal to the maximum value of the fitness function. The RMSE, namely, the root mean square error, is taken as the fitness function, and the equation is as follows [35] :

RMSE ¼ ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi

where y i andŷ i are the real value and corresponding predicted value of the model respectively, and the training and testing datasets need to be normalized. Step 3. The particle position and velocity are changed according to the position and velocity update algorithm.

Step 4. The objective function value of the particle is calculated: the objective function value calculated by each particle is compared with its optimal value. If it is better than the historical optimal objective function, the historical optimal value is replaced by the objective function of the current particle; otherwise, the original value is still used, and the population optimal objective function is evaluated in the same way.

Step 5. If the maximum number of iterations is reached or the error is less than the set value, the iteration is terminated; otherwise, return to Step 2.

Step 6. LSSVM is established and tested according to the optimal parameters. If the CRI error meets the requirements, the program is terminated, and the final PSO-LSSVM model is obtained; otherwise, return to Step 1.

Considering that the elliptical ship domain is adopted as the identification object in this paper, as shown in Fig 5, the coordinate of AIS data point P' is (x i , y i ); the coordinate of point P on the boundary of the identified ellipse domain model is (x, y), and the distances from to the 

d ¼ ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi

The geometric distance between AIS data points P' and P on the boundary of the identified elliptic ship domain model is as follows:

Dd ¼ j ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi

ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi

When line OP' is perpendicular to the X axis, x i = x 0 and x = x 0 are substituted into Eq (1), and we can obtain: Substituting it into Eq (22), the following can be obtained:

Dd ¼ j ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi

When x i 6 ¼x 0 , the linear equation of OP' is:

Substituting it into Eqs (1) and (22), the following can be obtained:

Dd ¼ ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi

ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi

where:

According to the above method, the geometric distance between the identified elliptic domain and the identified AIS data can be calculated [36] , which can be regarded as the identification error. The least square method is usually used to solve the problem. The general least square method treats each piece of data as equally important, when the importance of each piece of data may be different. This paper considers that the importance of data of different targets is different. Therefore, when it is necessary to consider the difference in the importance of data, the more reasonable meethod is to use the weighted method. According to the weighted least squares (WLS) identification principle, the identification problem of the ship domain model comes down to the minimum sum of the weighted squares of geometric distances (identification errors) between obtained boundary data points and identified AIS data points. The corresponding minimization objective function is as follows:

Eqs (24) and (26) are substituted into Eq (28) to obtain the final weighted objective function as follows: min X n i¼0 ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi

ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi

ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi

where CRI i is the ship collision risk corresponding to AIS data points, which is used as the weight of identification error and obtained by the PSO-LSSVM method. By substituting AIS data into the ship domain model equation and iterating parameters, the objective function in Eq (29) is minimized, and the corresponding ship domain model parameters can be identified.

To verify the feasibility of the research method, the dynamic changes in the ship domain during ship navigation are analysed, and the differences in the evolution characteristics of the ship domain in different waters are compared. Therefore, navigation process experiments are carried out in ship routing and other waters. The Yangtze Estuary area, which requires a long voyage in and out of the port, was chosen to allow a sufficiently long time to observe changes. Busy Yangtze River estuary waters can obtain more data to meet the update frequency requirements. In the selection of ship type, the mainstream ship size and ordinary cargo ship type are selected. To better reflect the general situation, dangerous ships with special characteristics should be avoided.

In experiments, AIS data are accessed, decoded, and preprocessed; abnormal data are deleted; data within the research scope are screened, and the dynamic overlay is calculated to generate the encounter situation. Before using the PSO-LSSVM method, the data need to be normalized. Then, identification is carried out according to the method presented in this paper. Through the test, it is found that selecting approximately 1 hour of data accumulation can meet the requirements of data volume and data distribution, and approximately 10 minutes of dynamic update can observe significant changes, which basically meets the requirements of dynamics. Two ordinary cargo ships of the same size are selected for the experiment. Periods close to the same day are selected to eliminate weather and other disturbances as much as possible and enhance comparability. For example, a general cargo ship (with a length of 190 m and a width of 32 m) exits the port through the South Channel of the Yangtze River Estuary and enters the port through the north deep-water channel. The AIS track is shown in Fig 6. The AIS data approximately 1 hour before the start of identification were used to train and test the PSO-LSSVM. After the training met the requirements, it was used to obtain the CRI of the target ship during the online identification. To compare the characteristics of the methods, the WLS method in this paper is used to identify the ship domain parameters online, and it is compared with the least square (LS) method. Sixty minutes of AIS data accumulation are adopted, updated, and iterated every ten minutes, rolling real-time online identification of the ship domain.

The previous AIS data of the test area are used to obtain the CRI training set by fuzzy mathematics, and then these data needs to be normalized. After PSO-LSSVM training and testing, the CRI model based on PSO-LSSVM is established to obtain the CRI parameters of the surrounding ships required for ship domain identification. The PSO parameters are set as follows: the learning factors c 1 and c 2 are 1.5 and 1.7 respectively, the momentum coefficient ω is 1, the maximum number of evolution is 200, and the size of the particle population is 20. The parameters that need to be optimized include the penalty factor γ and the kernel parameter σ. After PSO optimization, the optimized parameter γ is 100 and σ 2 is 0.1062.Some training samples are shown in Table 2 2. From Fig 9, there is a significant difference between the results identified by the WLS and LS methods in this paper. The ship domain identified by WLS changed from nearly round to more elongated ellipses. The result of LS identification is a flatter ellipse. The WLS method is less affected by the low-risk target in the transverse position. In contrast, the LS method is obviously affected, and the domain shape is pulled flat, which is inconsistent with the general understanding that the longitudinal length of the ship domain is greater than or equal to the transverse length. In comparison, the WLS method is more suitable for online identification, with an obvious filtering effect of low-risk targets and more accurate identification results.

3. In Fig 9, � CRI is the average collision risk of the identification targets, which can centrally reflect the reliability of the identification results. The results were more reliable with a In open waters, the length of the domain is eight times that of the own ship, and the corresponding long axis of the domain is 0.41 nautical miles. All of these are obviously smaller than the results in this paper. The Fujii model can be considered as a ship domain with CRI of 1, and then the long-axis results of the two models will sometimes be relatively close. For example, in Fig 9(T1) , the long axis of the ship domain is approximately 1.5 nautical miles, and the corresponding � CRI is 0.28. When CRI is 1, the long axis of the domain is adjusted to 0.42 nautical miles after conversion, which is close to the Fujii model in open water. However, the ship domain model identified in this paper is still larger than the Fujii model, because the pilot will actually maintain a larger safety area when conditions permit, to ensure navigation safety. Fig 10, the ship's speed changes roughly in a pattern of deceleration first and then acceleration. Corresponding to Fig 11, the long axis of the ship domain identified by the WLS method also roughly shows a process of shortening first and then lengthening, while the short axis has no obvious change. This shows that the long axis of the ship domain will become longer with increasing ship speed and will become shorter with decreasing ship speed. The experiment shows that the driver hoped to maintain a larger safety distance with the longitudinal position when the ship speed increased, but the safety distance in the lateral position did not change significantly.

6. In Fig 12, the ship domain overlying diagram shows that the ship domain identified by the WLS method in the dynamic process is roughly within the range of the ship domain identified using the total data accumulation, and its distribution fluctuates. The centers of the ship domain are roughly distributed in the center of the ship, slightly to the right of the rear; in particular, with the increase in ship speed, there is a slightly backward trend.

According to the AIS data of the ship's inbound navigation in the North Channel, real-time online rolling identification of ship domain parameters is performed; the dynamic change curves of the ship domain are drawn, and the results are analyzed as follows:(T6) The results of the inbound process are analyzed as follows:

1. Fig 13(T1) to 13(T6) show the online rolling identification results of the ship domain for one hour when entering the port from the North Channel and reflect the dynamic evolution process of the ship domain boundary. The North Channel is a ship's routing water area, and the boundary process of the ship domain changes obviously during navigation. The ship navigation efficiency is higher; the ship encounter situation is relatively simpler, and the boundary fluctuation in the ship domain is relatively small and more stable. Fig 16, the ship domain overlying diagram shows that the ship domain identified by the WLS method in the dynamic process is roughly within the range of the ship domain identified using the total data accumulation; the distribution is stable, and the fluctuation is small. The centers of the ship domain are basically distributed in the center of the ship.

Based on the online identification experiment and result analysis of navigation in the South Channel and North Channel of the Yangtze Estuary, it can be summarized as follows: 1. In a water area where the ship's routing is not implemented, the long axis of the ship domain changes obviously with ship speed, while the short axis is basically unaffected. This shows that with increasing ship speed, the pilot tends to maintain a greater safety distance, and therefore, the boundary of the ship domain is greater. In comparison, in ship-routing waters, the ship domain is more affected by the regulations of navigation, and the domain boundary is more stable. In other words, the fluctuation characteristics of the ship domain are obviously influenced by the traffic environment of sailing waters.

2. Through the overlying diagram of the ship domain, it is found that the ship domain has been changing dynamically in the actual navigation process, with the change in the ship's motion parameters, the surrounding environment of the navigation waters, and the encounter situation of the own ship and other ships, showing certain correlation, volatility, and randomness.

3. The PSO-LSSVM method is used to obtain CRI, and the collision risk weighting method proposed in this paper can effectively filter nonhazardous targets and improve the impact of dangerous targets on the results, which is consistent with the concept of the ship domain, and its identified ship domain is more consistent with the actual situation of navigation.

4. Through dynamic online rolling identification in the experiment, ship domain parameters can be quickly calculated; the dynamic evolution process of the ship domain in the navigation process can be better presented, and its dynamic fluctuation characteristics and change rules can be found. The online identification method in this paper introduces risk weighting, which will make the ship domain obtained contain some subjectivity, because the collision risk concept has some subjectivity. In addition, the experiment was carried out in the specific water area, which has a certain representativeness. The corresponding results reflect the evolution characteristics of the ship domain during ship navigation. However, the ship domain is affected by many factors, such as navigation rules, the natural environment and traffic conditions, etc. Different waters will differ in these respects. Therefore, the characteristics summarized in experiments can be used as a reference for other similar waters, but the characteristics of the ship domain in specific waters need to be analyzed through specific experiments.

Based on real-time AIS data and through the short data accumulated, an online identification method can dynamically generate the ship model by calculating ship-encounter parameters and acquiring the risk of collision as identification data sources to ensure identification accuracy and real-time requirements. By selecting representative waters, the characteristics of the ship domain in ship-routing waters and unimplemented ship-routing waters are analyzed experimentally. It is found that the ship domain changes dynamically in the sailing process, and to some extent with the waters environment and ship motion parameters, which can provide a reference for the study of similar waters.

The method in this paper enables AIS data to be applied to navigation safety decisions in a more real-time manner, explores the dynamic evolution law of the ship domain, and promotes the better application of relevant theories in the ship domain to intelligent navigation. In this paper, dynamic, real-time and procedural research methods are proposed to lay a foundation for research on dynamic characteristics in the ship domain. In the future, it will be possible to study ship domains in complex scenes, such as analyzing the change characteristics of ship domains with ship navigation environments by considering weather conditions, visibility, wind direction, sea state, water depth, and tide, etc.

Supporting information S1 File. PSO-LSSVM training and testing data. The training and testing data in the file can be used to train the PSO-LSSVM to obtain CRI and plot the results as shown in Figs 7 and 8. (XLSX) S2 File. Online identification outbound or inbound process data. The own ship's trajectory can be plotted using the position data, as shown in Fig 6. The outbound data in Excel can be used for rolling online recognition, and the results can be obtained as shown in Figs 9 to 12. Meanwhile, the inbound data in Excel can be used for rolling online recognition, and the results can be obtained as shown in Figs 13 to 16. (XLSX)

A statistics study of ship domain

Review of research on ship domain

Probabilistic ship domain with applications to ship collision risk assessment. Ocean Eng

AIS-Based Multiple Vessel Collision and Grounding Risk Identification based on Adaptive Safety Domain

Dynamic Fuzzy Ship Domain Considering the Factors of Own Ship and Other Ships

Seafarers' awareness-based domain modelling in restricted areas

An Empirically-Calibrated Ship Domain as a Safety Criterion for Navigation in Confined Waters

Developing contextually aware ship domains using machine learning

Review of ship safety domains: Models and applications

A Revisit of the Definition of the Ship Domain based on AIS Analysis

Empirical Ship Domain based on AIS Data

Ship domain model for ships with restricted manoeuvrability in busy waters

A Spatiotemporal Statistical Method of Ship Domain in the Inland Waters Driven by Trajectory Data

An empirical ship domain based on evasive maneuver and perceived collision risk

Ship domain research based on AIS data

Computation method of ship domains in restricted waters based on AIS data

A Computer Simulation of Marine Traffic Using Domains and Arenas

A unified analytical framework for ship domains

Marine Traffic Behaviour in Restricted Waters

Automatic collision avoidance system using the concept of blocking area

An Intelligent Spatial Collision Risk Based on the Quaternion Ship Domain

Modelling of a ship trajectory in collision situations at sea by evolutionary algorithm

On-line trajectory planning in collision situation at sea by evolutionary computationexperiments

Regional Collision Risk Prediction System at a Collision Area Considering Spatial Pattern

The research of the ship collision risk model based on fuzzy comprehensive evaluation

An approach of vessel collision risk assessment based on the D-S evidence theory

Integrated support vector regression and an improved particle swarm optimization-based model for solar radiation prediction

Short-Term Traffic Flow Forecasting Method Based on LSSVM Model Optimized by GA-PSO Hybrid Algorithm

Toward Group Applications of Zinc-Silver Battery: A Classification Strategy Based on PSO-LSSVM

A study on ship collision conflict prediction in the Taiwan Strait using the EMDbased LSSVM method

An improved CS-LSSVM algorithm-based fault pattern recognition of ship power equipments

Ship Accident Prediction Based on Improved Quantum-Behaved PSO-LSSVM. Mathematical Problems in Engineering

Reconstruct the Support Vectors to Improve LSSVM Sparseness for Mill Load Prediction

Nonlinear least squares method for elliptic fitting

We acknowledge Dr. Qiang Zhang and Dr. Yun Li (Shanghai Maritime University, Shanghai, China.) for very helpful discussion of the study.

Conceptualization: Wei Zhou.