A Port Ship Flow Prediction Model Based on the Automatic Identification System and Gated Recurrent Units

Xu Xiaofeng; Bai Xiang'en; Xiao Yingjie; He Jia; Xu Yuan; Ren Hongxiang

doi:10.1007/s11804-021-00228-9

A Port Ship Flow Prediction Model Based on the Automatic Identification System and Gated Recurrent Units

https://doi.org/10.1007/s11804-021-00228-9

1
College of Merchant Shipping, Shanghai Maritime University, Shanghai 201306, China
2
Shanghai Waterway Engineering Design and Consulting Co., Ltd., Shanghai 200120, China
3
Naval Architecture and Ocean Engineering College, Dalian Maritime University, Dalian 116026, China

Corresponding author:
Xiaofeng Xu, xfxu@shmtu.edu.cn

Received: 16 April 2021

Accepted: 07 July 2021

Abstract

Abstract
Water transportation today has become increasingly busy because of economic globalization. In order to solve the problem of inaccurate port traffic flow prediction, this paper proposes an algorithm based on gated recurrent units (GRUs) and Markov residual correction to pass a fixed cross-section. To analyze the traffic flow of ships, the statistical method of ship traffic flow based on the automatic identification system (AIS) is introduced. And a model is put forward for predicting the ship flow. According to the basic principle of cyclic neural networks, the law of ship traffic flow in the channel is explored in the time series. Experiments have been performed using a large number of AIS data in the waters near Xiazhimen in Zhoushan, Ningbo, and the results show that the accuracy of the GRU-Markov algorithm is higher than that of other algorithms, proving the practicability and effectiveness of this method in ship flow prediction.
- Ship flow prediction ·
- GRU neural network ·
- Markov residual correction ·
- AIS data
Highlights
• An algorithm is proposed based on gated recurrent units (GRUs) and Markov residual correction to pass a fixed cross-section.

• The statistical method of ship traffic flow based on the automatic identification system (AIS) is introduced and a model is presented for predicting the ship traffic flow.

• According to the basic principle of cyclic neural networks, the law of ship traffic flow in the channel is explored in the time series.

HTML

1 Introduction

In recent years, with the rapid development of the economy, the development scale of Chinese ports has been continuously expanding, and the traffic flow of ships in navigable waters has been increasing rapidly. At the same time, the traffic jam of the waterway also occurs frequently, and the navigation safety situation becomes increasingly severe. In order to prevent ship collision caused by maritime traffic congestion in busy waterways, more and more researches have been conducted, including navigable capacity estimation, maritime traffic analysis, ship behavior analysis and traffic complexity analysis (Liu et al., 2020; Ricci et al., 2020; Zhou et al., 2020). The prediction of vessel traffic volume provides a basic basis for the planning and design of waterways and navigation management of vessels. It enables the port department to scientifically and rationally plan the water area layout according to the changing rules of ship traffic flow, so as to improve the navigation efficiency of the waterway, which is of great significance for the improvement of navigation safety. Therefore, it is particularly important to design an algorithm that can accurately and effectively predict the flow of ships in the port to solve the problem of navigation safety.

At present, the main methods for forecasting ship traffic flow include regression analysis model, gray model (Guo et al. 2019), support vector machine, neural network, and combination model. The automatic identification system (AIS) data recording the ship's movement trajectory allows us to evaluate the operating efficiency of the ship when entering and leaving the port (Hongxiang et al. 2011). A time efficiency evaluation framework is proposed to evaluate the amount of time each ship spends in different internal areas. Watai et al. (2018) predicted the energy consumption of port ships and discusses strategies to reduce the energy consumption of port ships by using the proposed prediction model considering green ports. Pengfei et al. (2017) constructed the BP neural nets-Markov prediction model and modified the predicted value of the BP neural network by using the Markov prediction model to reduce the range of relative residual value and improve the accuracy of ship traffic flow prediction. Guo-Feng (2013) and Chan et al. (2018) introduced particle swarm optimization algorithm to optimize the model on the basis of the BP neural nets-Markov prediction model, which further improved the accuracy of the model prediction. But a major shortcoming of this research is that the BP neural network is not ideal for processing time-series data. Han et al. (2020) proposed an algorithm based on the cultural firefly algorithm to optimize the generalized regression neural network, which takes the weights between each layer in the network as the code to optimize and forecast the traffic flow of ships. The final experimental results show that the optimized algorithm has better generalization performance and higher result accuracy. The initial detection path of each UAV is obtained based on the minimum loop method. According to the time-invariant assumption commonly used in traditional traffic flow prediction, an intelligent particle swarm optimization algorithm is proposed by Chan et al. (2013). By combining the mechanism of PSO, neural network, and fuzzy inference system, a short-term traffic flow predictor is developed to adapt to the time-varying traffic flow. At present, BP neural network is widely used in traffic flow analysis (Qinghui et al. 2019). But BP neural network is based on the static feedforward network, which essentially transforms the dynamic complex time problem into the static spatial problem. But in fact, the ship's flow data is dynamic, and there is a certain relationship in time sequence (Xuantong et al. 2019). Therefore, the time factor and the process before reaching the temperature state will be ignored when the dynamic problem is converted to the static problem, which will directly affect the accuracy of the prediction results (Zhenguo and Shukui 2019; Ziwen 2020).

In this paper, on the basis of previous studies, the GRU-Markov prediction algorithm is designed. Due to the close relationship between port ship flow and time, compared with other algorithms, such as BP neural network, the method proposed in this paper to use GRU to predict ship flow has great advantages. Firstly, the GRU algorithm is used to preliminarily predict the port ship flow, and then the Markov algorithm is used to modify the neural network prediction results to further improve the accuracy of the algorithm prediction. The approximate algorithm flow is shown in Figure 1.

Figure 1 Algorithm flowchart

Download: Full-Size Img

2 Data Analysis and Statistics Based on AIS

The original AIS data is discrete and has many redundant values. In fact, the ship's navigation behavior is mainly reflected in the change of the ship's navigation position with time. The dynamic information in the corresponding AIS information is longitude, latitude, speed, course, ROT, and update time. In order to distinguish the navigation behavior of different ships, maritime mobile service identity (MMSI) is added to the ship navigation behavior characterization data. Parts of the ship timing data screened out according to MMSI are shown in Table 1.

Table 1 Ship time-series information

MMSI	Longitude	Latitude	Time
412, 437, 230	122.273240	29.724140	2017-05-01 05:13:25
412, 437, 230	122.272705	29.723510	2017-05-01 05:13:44
412, 437, 230	122.272670	29.723480	2017-05-01 05:13:47
412, 437, 230	122.272190	29.722910	2017-05-01 05:14:04
412, 437, 230	122.272160	29.722880	2017-05-01 05:14:24
412, 437, 230	122.271576	29.722132	2017-05-01 05:14:45

In this study, the number of ships passing through a certain section of the channel in a period of time is recorded to measure the traffic flow of ships. As shown in Figure 2, L1L2 is the observation section and ab is the line connecting the two AIS acquisition points of the ship. If the position relationship of the two lines is an intersection, it can be judged that the ship has passed the section of flow statistics.

Figure 2 Schematic diagram of navigation ship passing through observation section

Download: Full-Size Img

The position relationship of two-line segments includes coincidence, intersection, and separation. The ship can be included in the statistical range when the connecting line of two consecutive acquisition points intersects the line of flow statistics section. In this paper, the following algorithm is used to determine whether the two-line segments intersect.

1. Fast repulsion experiment: First, judge whether the projections of two-line segments coincide in X and Y coordinates. That is to judge whether the endpoint with a larger X in one-line segment is smaller than the endpoint with a smaller X in the other line segment. If so, it means that there must be no intersection point between the two-line segments. Similarly, judge y.

2. Straddle experiment: If two-line segments intersect, it means that they cross each other. If ab crosses L₁L₂, then a and b are located on both sides of the line where L₁ and L₂ are located. r_xy is used to represent the vector between x and y, then:

$$ \left({r}_{{L}_{1}a}\times {r}_{{L}_{1}{L}_{2}}\right)\bullet \left({r}_{{L}_{1}{L}_{2}}\times {r}_{{L}_{1}b}\right) \gt 0 $$

(1)

Similarly, the basis for judging L₁L₂ straddling ab is

$$ \left({r}_{{aL}_{1}}\times {r}_{ab}\right)\bullet \left({r}_{ab}\times {r}_{{aL}_{2}}\right) \gt 0 $$

(2)

The relationship between the straddling and fast rejection experiments is shown in Figure 3.

Figure 3 Determination of intersection relation of line segments

Download: Full-Size Img

3 Foundation of Gated Recurrent Unit

3.1 Structure of Circular Neural Network

Neural network can be regarded as a black box that can fit any function. The traditional BP neural network fitting effect has been better, but it can only deal with each output separately. The former output is completely unrelated to the latter output, and the processing performance of sequence information is not good enough. Therefore, the structure of the recurrent neural network is proposed.

As shown in Figure 4, a simple circular neural network (RNN) consists of three parts: input layer x, hidden layer s, and output layer o. At any time t, the value of the network hidden layer $ {S}_{t} $ depends not only on the input $ {x}_{t} $ of the network at that time, but also on the output $ {S}_{t-1} $ of the hidden layer at the previous time, which shows that the RNN takes the timing information into account in the network structure, and the equation is as follows:

$$ {S}_{t}=f({\boldsymbol{U}}\cdot {x}_{t}+{\boldsymbol{W}}\cdot {S}_{t-1}) $$

(3)

Figure 4 RNN principle

Download: Full-Size Img

where f is the activation function of the hidden layer, and U and W represent the connection weight matrix from the input layer to hidden layer and between the hidden layer, respectively.

3.2 Gated Recurrent Unit

Because the common recurrent neural network has gradient disappearance and gradient explosion and cannot deal with long-term dependence and other problems, this paper adopts a special RNN; that is, the RNN unit in the network is replaced by gated recurrent unit (GRU), so as to solve the gradient disappearance and gradient explosion problems in standard RNN.

The input and output structures of GRU are the same as those of ordinary RNN, and their internal structure is slightly different from that of RNN. There are two gates in GRU, reset gate and update gate. Intuitively, the reset gate determines how to combine the new input information with the previous memory, while the update gate defines the amount of previous memory saved to the current time step. If the reset gate is set to 1 and the update gate is set to 0, a standard RNN model is obtained. The internal structure of GRU is shown in Figure 5.

Figure 5 GRU structure

Download: Full-Size Img

At time t, update gate $ {z}_{t} $ is calculated by the following equation:

$$ {z}_{t}=\sigma ({{\boldsymbol{W}}}_{z}{x}_{t}+{{\boldsymbol{U}}}_{z}{h}_{t-1}) $$

(4)

where $ {x}_{t} $ is the input at the time t, $ \sigma $ is the sigmoid activation function, $ {h}_{t-1} $ is the information of the last moment, and W and U are weight matrices. The update gate adds the two parts of information and inputs it into the sigmoid function, so it compresses the activation result between 0 and 1.

The reset gate mainly determines how much past information needs to be forgotten. The calculation is as follows:

$$ {r}_{t}=\sigma ({{\boldsymbol{W}}}_{r}{x}_{t}+{{\boldsymbol{U}}}_{r}{h}_{t-1}) $$

(5)

Its parameters are similar to the update gate, but the parameters and uses of a linear transformation are different.

In the use of the reset door, the new memory content will use the reset door to store the relevant information in the past, the calculation expression is as follows:

$$ {h}_{t}^{^{\prime}}=\mathrm{tanh}({\boldsymbol{W}}{x}_{t}+{r}_{t}\odot {\boldsymbol{U}}{h}_{t-1}) $$

(6)

where $ \odot $ is the multiplication of the corresponding elements in the matrix on both sides.

Finally, calculating $ {h}_{t} $ determines the information to be passed to the next unit. This process is expressed as follows:

$$ {h}_{t}={z}_{t}\odot {h}_{t-1}+(1-{z}_{t})\odot {h}_{t}^{^{\prime}} $$

(7)

where $ {h}_{t} $ is the hidden state at the current time and the final output of GRU.

4 Modification of Initial Prediction Value Based on Markov Method

The navigation behavior of ships is an uncertain process, so the neural network structure trained by limited samples is usually not completely reliable. Because the state transition matrix has the ability to track the random fluctuation of variables, it can be used to correct the prediction results of neural networks, so as to improve the prediction accuracy of the prediction model. Markov chain can infer the possible state of a variable in the future based on the analysis of its current state and future change direction. However, the Markov method is not suitable for medium and long-term prediction. Therefore, this section applies it to the short-term prediction of ship flow to improve the prediction accuracy of the model.

According to the prediction value of the neural network, the relative error δ is calculated:

$$ =\frac{{P}_{ex}-{P}_{n}}{{P}_{n}}\times 100\% $$

(8)

where $ {P}_{ex} $ is the neural network prediction of ship flow, and $ {P}_{n} $ is the actual flow of the ship.

When state $ {E}_{i} $ is transferred to state $ {E}_{j} $ after k times, the probability is as follows:

$$ {P}_{ij}^{(k)}=\frac{{A}_{ij}^{(k)}}{{A}_{i}} $$

(9)

where $ {A}_{ij}^{(k)} $ is the number of transitions of sample state from $ {E}_{i} $ to state $ {E}_{j} $, and $ {A}_{i} $ is the total number of state occurrences.

The core of the Markov method is to determine the state probability transition matrix. In this paper, the state transition probability matrix $ {{\boldsymbol{P}}}^{(k)} $ is calculated according to the state of relative error:

$$ {{\boldsymbol{P}}}^{(k)}=\left[\begin{array}{c}\begin{array}{c}{P}_{11}^{\left(k\right)} {P}_{12}^{\left(k\right)} \cdots {P}_{1n}^{\left(k\right)}\\ {P}_{21}^{\left(k\right)} {P}_{22}^{\left(k\right)} \cdots {P}_{2n}^{\left(k\right)}\\ \vdots \vdots \vdots \vdots \end{array}\\ {P}_{n1}^{(k)} {P}_{n2}^{(k)} \cdots {P}_{nn}^{(k)}\end{array}\right] $$

(10)

where n is the number of states after classification.

The standard of state division is to take the upper and lower thresholds of the relative error δ of the predicted value of sample data as the range of state division.

Then, the initial state vector X (0) is determined according to the error data predicted by the ship flow prediction model, and the state transition result of step k is calculated by the state transition equation:

$$ {\boldsymbol{X}}\left(k\right)={\boldsymbol{X}}(0){{\boldsymbol{P}}}^{(k)} $$

(11)

Finally, according to the relative error of Markov prediction, the corrected ship flow P is calculated:

$$ P={P}_{ex}\left(1-{}^{^{\prime}}\right), {}^{^{\prime}}=\frac{{}_{h}+{}_{l}}{2} $$

(12)

where $ {}_{h} $, $ {}_{l} $ are the upper and lower thresholds of the error state interval.

5 Examples and Analysis

5.1 Data Analysis

The data studied in this paper comes from the AIS data of the Xiashimen water area in Zhoushan, Ningbo, in 2017. In the data, the water area of the ship sailing is 122.2° E, 29.8° N-122.35° E, 29.7° N. Different ships are distinguished by their MMSI codes. The navigation of the original ship in the water area is shown in Figure 6.

Figure 6 Navigation of ships in waters

Download: Full-Size Img

The original data is preprocessed to roughly judge whether the ship's trajectory passes through the Xiazhimen water area. After the ships passing through the Xiazhimen gate are selected, the fast exclusion and straddle experiments are carried out on the tracks of each ship to determine the times and time points of the flow statistical Sects. (122.31° E29.75° N, 122.32° E 29.775° N). Part of the flow data after processing is shown in Table 2.

Table 2 Ship traffic flow in Xiazhimen

Time	Ship flow (ship/h)
Time	May 3rd	May 4th	May 5th	May 6th	May 7th
0:00-1:00	10	4	7	8	5
1:00-2:00	9	6	9	3	7
2:00-3:00	10	2	8	9	9
3:00-4:00	7	3	7	5	9
4:00-5:00	7	9	5	3	4
5:00-6:00	8	11	9	11	8
6:00-7:00	12	9	12	4	9
7:00-8:00	3	3	8	6	16
8:00-9:00	10	10	8	12	13
9:00-10:00	11	11	22	22	12
10:00-11:00	15	18	14	12	8
11:00-12:00	12	11	9	8	8
12:00-13:00	15	10	11	13	13
13:00-14:00	13	9	11	13	10
14:00-15:00	11	12	10	14	8
15:00-16:00	12	11	14	12	9
16:00-17:00	10	10	11	11	4
17:00-18:00	9	4	9	11	11
18:00-19:00	13	2	12	12	8
19:00-20:00	12	1	13	9	14
20:00-21:00	12	4	13	7	10
21:00-22:00	11	5	7	5	7
22:00-23:00	4	8	6	3	5
23:00-24:00	2	9	3	8	6

5.2 Construction of Neural Network Structure

In order to deeply mine the rules of the historical discharge data of the water area, this model takes the cross-section flow of five continuous time nodes as the input to build a time series ship flow prediction model (Hongxiang et al. 2011; He et al., 2019; Krizhevsky et al. 2017). The neural network model contains many hyperparameters, and the parameters that need to be adjusted are roughly divided into two categories. One is optimization parameters, such as learning rate and training algebra. One is the parameters of the model, such as the number of hidden layers and the number of neurons. The adjustment of hyperparameters has a great influence on the fitting effect of the model. For example, the learning rate affects the fitting speed of the model and the number of neurons and hidden layers affect the fitting effect of the model (Yanhong et al. 2016; Guo-Feng 2013; Ming-Wei et al. 2021). The model consists of one input node, five hidden layers (GRU layer, dropout layer, and dense layer) and one output node. The number of hidden layers and the number of neurons are determined based on experience. The input data length of the input node is n and the dimension is 1. After passing through five hidden layers, it is output at the output node, and the output is the ship flow of the next time node. The structures of the five hidden layers are as follows: the first GRU layer with 100 neurons. The second layer is the dropout layer with a probability of 0.2. The third layer is the GRU layer with 100 neurons. The fourth layer is the dropout layer with a probability of 0.2. The fifth layer is the dense layer with 1 neuron, which is used to connect the neurons in the upper layer to form one-dimensional output. The structure of the neural network is shown in Figure 7.

Figure 7 Neural network structure

Download: Full-Size Img

Dropout layer was proposed in 2012 and applied to neural networks, mainly to prevent model overfitting. The direct function of dropout layer is to reduce the number of intermediate features, thus reducing redundancy and increasing the orthogonality of each feature in each layer. According to relevant experience, the probability of dropout is set to 0.2.

The learning of neural networks is a process of continuous error correction, that is, the final data calculated each time is compared with the actual data and the error is obtained, and then the weight is continuously adjusted until the error is reduced to an acceptable level. The loss function selected in this paper is the mean square error (MSE) function, which is one of the commonly used machine learning loss functions. It mainly measures the quality of the fitted data. The smaller the value, the closer it is to the actual value. The equation of mean square error function is as follows:

$$ \mathrm{MSE}=\frac{1}{n}\sum\limits_{i=1}^{n}{(\stackrel{\sim }{{Y}_{i}}-{Y}_{i})}^{2} $$

(13)

In the model optimization, an adaptive moment estimation (Adam) optimization algorithm is selected. Based on the original gradient descent algorithm, it tracks the exponential decay average of the past gradient to continuously optimize the model.

5.3 Input Data Processing

Because different indicators often have different dimensions and units, which will greatly affect the results of data analysis, therefore, in order to eliminate the dimensional impact between indicators, it is necessary to standardize the data. The standardization method selected in this paper is min-max normalization, which is a linear transformation of the original data, mapping all the data to 0-1. The equation is as follows:

$$ {x}^{*}=\frac{x-{\text{min}}}{{\text{max}}-{\text{min}}} $$

(14)

where max is the maximum value of sample data and min is the minimum value of sample data.

6 Experimental Results

In this paper, the ship traffic flow data of 5 h before the flow section is used to predict the ship traffic flow of the sixth hour. For example, the ship traffic flow data from 0:00 on May 1, 2017, to 5:00 on May 1, 2017, is used as the input to predict the ship flow from 5:00 to 6:00 on May 1, 2017. The prediction results and errors of May 5 are shown in Figure 8 and Table 3. The table shows the error comparison between the predicted data and the actual value of the day. In Figure 8, the abscissa is each time point in a day, and the ordinate is the number of ships.

Figure 8 Graph of GRU prediction results

Download: Full-Size Img

Table 3 Result and error of GRU prediction

Time	Ship flow (ship/h)		Error (%)
Time	Actual value	Estimate value	Error (%)
11:00-12:00	12	10.8806	9.32
12:00-13:00	9	9.7012	7.79
13:00-14:00	8	8.8862	11.07
14:00-15:00	12	9.8579	17.85
15:00-16:00	8	9.1067	13.83

As can be seen from Figure 8, the GRU model can well fit the characteristics of less flow in the early morning of a day and the peak flow from noon to afternoon. However, due to the randomness of ship flow, when there is a small peak in a certain period of time (as shown in Figure 8, the flow of 18:00-19:00 is significantly higher than other periods), the effect of model fitting is not very accurate. Therefore, this paper uses the Markov method to correct the initial prediction value to make the predicted value closer to the real value.

6.1 Markov Residual Correction

In order to verify the correction effect of the Markov method on the initial prediction data of neural network, taking the prediction result of ship flow in May 2017 as an example, the relative error δ is obtained through calculation, and some data are shown in Table 4. The actual value and predicted value are arranged in time, and the state of relative error δ is determined by Table 4. In this paper, the predicted relative error is divided into four states according to the state division standard: [− 30%, − 15%], E₂ (− 15%, 0], E₃ (0, 15%], E₄ (15%, 30%], and based on this, the Markov state table is established, as shown in Table 5.

Table 4 Prediction relative error of test sample

Time	Actual value (ship)	Estimate value (ship)	Relative error (%)	State
5:00-6:00	6	6.4343	7.23	E₃
6:00-7:00	6	7.5381	25.63	E₄
7:00-8:00	8	7.4153	-7.31	E₂
8:00-9:00	6	6.8540	14.23	E₃
9:00-10:00	11	9.7619	-11.25	E₂
10:00-11:00	7	8.1619	16.59	E₄
$\vdots$	$\vdots$	$\vdots$	$\vdots$	$\vdots$

Table 5 Markov state transition table

State	E₁	E₂	E₃	E₄	Total
E₁	6	11	6	9	32
E₂	11	5	11	14	41
E₃	6	11	6	13	36
E₄	9	14	13	22	58
Total	32	41	36	58	167

From Table 5, we can determine the Markov state transition matrix $ {{\boldsymbol{P}}}^{(1)} $:

$$ {{\boldsymbol{P}}}^{(1)}=\left[\begin{array}{cc}\begin{array}{cc}\frac{3}{16}& \frac{11}{32}\\ \frac{11}{41}& \frac{5}{41}\end{array}& \begin{array}{cc}\frac{3}{16}& \frac{9}{32}\\ \frac{11}{41}& \frac{14}{41}\end{array}\\ \begin{array}{cc}\frac{1}{6}& \frac{11}{36}\\ \frac{9}{58}& \frac{7}{29}\end{array}& \begin{array}{cc}\frac{1}{6}& \frac{13}{36}\\ \frac{13}{58}& \frac{11}{29}\end{array}\end{array}\right] $$

(15)

Markov modified predicted value was shown in Figure 9. Taking the ship flow data of sequence 1 of the selected data as the initial state vector, that is, X (0) = (0, 0, 1, 0), then the state vector X (1) = X (0)$ {{\boldsymbol{P}}}^{(1)} $= (9/58, 7/29, 13/58, 11/29). Therefore, the probability that the prediction relative error δ of sequence 2 is in E₄ is the largest. According to the prediction results of the neural network, we can know that the predicted ship flow at sequence 2-time point is $ {p}_{1} $=7.5381. According to modified Eq. (12), the modified ship flow $ {p}_{\text{Markov}} $=5.8420 of the Markov chain can be calculated.

Figure 9 Markov modified predicted value

Download: Full-Size Img

Given that the actual ship flow at sequence 2-time point is $ {p}_{2} $=6, the relative error δ = 25.63% before Markov correction is w, and the relative error after Markov correction is δ = − 2.66%. According to the same method, the ship flow correction value of each subsequent sequence point can be calculated.

6.2 Comparative Experiment of Prediction Model

In order to verify the advantages of the proposed GRU-Markov prediction model in prediction accuracy, the prediction results of the proposed algorithm are compared with those of other algorithms. The comparison algorithms are GRU algorithm without Markov optimization, SVM, and BP neural network. The results of the comparison are shown in Table 6.

Table 6 Comparison of prediction results

Time	Ship flow(ship/h)
Time	Actual value	SVM	BP	GRU	GRU-Markov
5:00-6:00	6	6.9856	6.7382	6.4303	6.0758
6:00-7:00	6	7.9547	7.6872	7.5381	5.8420
7:00-8:00	8	6.8971	7.0358	7.4153	8.2024
8:00-9:00	6	7.0368	6.9872	6.8540	5.9478
9:00-10:00	11	9.3751	9.9350	9.7619	10.7548

The results show that, considering the relationship between ship flow and time series, the prediction results of the GRU neural network are generally closer to the real value than support vector machine and BP neural network, and the error is further reduced after Markov correction. Therefore, the performance of GRU-Markov is better than other algorithms.

7 Conclusion

Based on the research of ship flow prediction method, combined with the characteristics of AIS data information, the AIS data are sorted and counted by using fast exclusion and straddle experiment. The discrete AIS data are counted into traffic data arranged in time series. Considering the influence of time series type data before and after, this paper proposes to apply the GRU algorithm to ship flow prediction. A neural network prediction model based on GRU-Markov is constructed. The ship flow data of five consecutive time points are used as input to predict the ship flow data of the next time point. GRU is used to make basic predictions, and then the Markov method is applied to modify the initial prediction value. The proposed algorithm is verified by a large number of AIS data near Xiazhimen, Zhoushan, Ningbo, and compared with the traditional prediction algorithm SVM and BP neural network, it proves that the algorithm in this paper has great advantages in prediction accuracy and algorithm time-consuming. However, due to the characteristics of a large amount of historical data required for the training of the neural network itself, the algorithm in this paper also has certain limitations. For example, it may not be applicable under the conditions of insufficient historical data, etc. Through the above work, it is proved that the GRU-Markov model is practical and effective in the prediction of ship flow, and also provides a new theoretical basis for the prediction of ship track in the future.

Figure 1 Algorithm flowchart

Download: Full-Size Img

Figure 2 Schematic diagram of navigation ship passing through observation section

Download: Full-Size Img

Figure 3 Determination of intersection relation of line segments

Download: Full-Size Img

Figure 4 RNN principle

Download: Full-Size Img

Figure 5 GRU structure

Download: Full-Size Img

Figure 6 Navigation of ships in waters

Download: Full-Size Img

Figure 7 Neural network structure

Download: Full-Size Img

Figure 8 Graph of GRU prediction results

Download: Full-Size Img

Figure 9 Markov modified predicted value

Download: Full-Size Img

Table 1 Ship time-series information

MMSI	Longitude	Latitude	Time
412, 437, 230	122.273240	29.724140	2017-05-01 05:13:25
412, 437, 230	122.272705	29.723510	2017-05-01 05:13:44
412, 437, 230	122.272670	29.723480	2017-05-01 05:13:47
412, 437, 230	122.272190	29.722910	2017-05-01 05:14:04
412, 437, 230	122.272160	29.722880	2017-05-01 05:14:24
412, 437, 230	122.271576	29.722132	2017-05-01 05:14:45

Table 2 Ship traffic flow in Xiazhimen

Time	Ship flow (ship/h)
Time	May 3rd	May 4th	May 5th	May 6th	May 7th
0:00-1:00	10	4	7	8	5
1:00-2:00	9	6	9	3	7
2:00-3:00	10	2	8	9	9
3:00-4:00	7	3	7	5	9
4:00-5:00	7	9	5	3	4
5:00-6:00	8	11	9	11	8
6:00-7:00	12	9	12	4	9
7:00-8:00	3	3	8	6	16
8:00-9:00	10	10	8	12	13
9:00-10:00	11	11	22	22	12
10:00-11:00	15	18	14	12	8
11:00-12:00	12	11	9	8	8
12:00-13:00	15	10	11	13	13
13:00-14:00	13	9	11	13	10
14:00-15:00	11	12	10	14	8
15:00-16:00	12	11	14	12	9
16:00-17:00	10	10	11	11	4
17:00-18:00	9	4	9	11	11
18:00-19:00	13	2	12	12	8
19:00-20:00	12	1	13	9	14
20:00-21:00	12	4	13	7	10
21:00-22:00	11	5	7	5	7
22:00-23:00	4	8	6	3	5
23:00-24:00	2	9	3	8	6

Table 3 Result and error of GRU prediction

Time	Ship flow (ship/h)		Error (%)
Time	Actual value	Estimate value	Error (%)
11:00-12:00	12	10.8806	9.32
12:00-13:00	9	9.7012	7.79
13:00-14:00	8	8.8862	11.07
14:00-15:00	12	9.8579	17.85
15:00-16:00	8	9.1067	13.83

Table 4 Prediction relative error of test sample

Time	Actual value (ship)	Estimate value (ship)	Relative error (%)	State
5:00-6:00	6	6.4343	7.23	E₃
6:00-7:00	6	7.5381	25.63	E₄
7:00-8:00	8	7.4153	-7.31	E₂
8:00-9:00	6	6.8540	14.23	E₃
9:00-10:00	11	9.7619	-11.25	E₂
10:00-11:00	7	8.1619	16.59	E₄
$\vdots$	$\vdots$	$\vdots$	$\vdots$	$\vdots$

Table 5 Markov state transition table

State	E₁	E₂	E₃	E₄	Total
E₁	6	11	6	9	32
E₂	11	5	11	14	41
E₃	6	11	6	13	36
E₄	9	14	13	22	58
Total	32	41	36	58	167

Table 6 Comparison of prediction results

Time	Ship flow(ship/h)
Time	Actual value	SVM	BP	GRU	GRU-Markov
5:00-6:00	6	6.9856	6.7382	6.4303	6.0758
6:00-7:00	6	7.9547	7.6872	7.5381	5.8420
7:00-8:00	8	6.8971	7.0358	7.4153	8.2024
8:00-9:00	6	7.0368	6.9872	6.8540	5.9478
9:00-10:00	11	9.3751	9.9350	9.7619	10.7548

References(22)

Chan KY, Dillon TS, Chang E (2013) An intelligent particle swarm optimization for short-term traffic flow forecasting using on-road sensor systems. IEEE Trans Industr Electron 60(10): 4714-4725. https://doi.org/10.1109/tie.2012.2213556

Guo-Feng F (2013) Support vector regression model based on empirical mode decomposition and auto regression for electric load forecasting. Energies 6(4): 1887-1901. https://doi.org/10.3390/en6041887

Han X, Zheping S, Jiacai P (2020) Prediction of ship traffic flow based on cultural firefly algorithm and generalized regression neural network. J Shanghai Jiaotong Univ 54(4): 421-429. https://doi.org/10.16183/j.cnki.jsjtu.2020.04.011

He W, Zhong C, Sotelo MA (2019) Short-term vessel traffic flow forecasting by using an improved Kalman model. Clust Comput 22(10): 7907-7916. https://doi.org/10.1007/s10586-017-1491-2

Hongxiang F, Yingjie X, Fancun K (2011) Prediction model of ship traffic flow based on support vector machine. China Navigation 34(4): 62-66

Krizhevsky A, Sutskever I, Hinton GE (2017) ImageNet classification with deep convolutional neural networks. Commun ACM 60(6): 84-90. https://doi.org/10.1145/3065386

Liu C, Xian J, Zhou X (2020) AIS data-driven approach to estimate navigable capacity of busy waterways focusing on ships entering and leaving port. Ocean Eng 218(15): 108215. https://doi.org/10.1016/J.OCEANENG.2020.108215

Longwen Z, Daofang C, Zongliang Z (2020) Prediction of port vessel traffic flow based on SARIMA-BP model. Navigation China 43(1): 50-55

Ming-Wei Li, Yutain W, Jing G (2021) Hong Weichiang (2021) Chaos cloud quantum bat hybrid optimization algorithm. Nonlinear Dyn 103: 1167-1193. https://doi.org/10.1007/S11071-020-06111-6

Mingxiang F, Shih-Lung S, Guojun P, Zhixiang F (2020) Time efficiency assessment of ship movements in maritime ports: A case study of two ports based on AIS data. J Transp Geogr 86(6): 102741. https://doi.org/10.1016/j.jtrangeo.2020.102741

Pengfei L, Yuan Z, Yang L (2017) BP neural network Markov prediction model for ship traffic volume. J Shanghai Mar Univ 38(2): 17-21. https://doi.org/10.13340/j.jsmu.2017.02.004

Qingbo Fan, Fucai Jiang, Quandang M (2018) BP neural network Markov traffic flow prediction model based on PSO. J Shanghai Mar Univ 39(2): 22-27. https://doi.org/10.13340/j.jsmu.2018.02.005

Qinghui Z, Guangru Li, Xiao Y (2019) Prediction of ship traffic flow based on Elman neural network based on cyclic structure optimization. High Tech Commun 29(3): 295-301

Quandang Ma, Fucai J, Qingbo F (2019) Application of PSO unbiased grey Markov model in ship traffic flow prediction. China Navigation 42(1): 97-103

Ricci A, Janssen WD, van Wijhe HJ (2020) CFD simulation of wind forces on ships in ports: Case study for the Rotterdam Cruise Terminal. J Wind Eng Ind Aerodyn 205:104315. https://doi.org/10.1016/j.jweia.2020.104315

Watai RA, Ruggeri F, Tannuri EA (2018) An analysis methodology for the passing ship problem considering real-time simulations and moored ship dynamics: Application to the Port of Santos, in Brazil. Appl Ocean Res 80:148-165. https://doi.org/10.1016/j.apor.2018.08.012

Xuantong W, Jing L, Tong Z (2019) A machine-learning model for zonal ship flow prediction using AIS data: a case study in the South Atlantic States Region. J Mar Sci Eng 7(12): 463. https://doi.org/10.3390/jmse7120463

Yanhong C, Weichiang H, Wen S, Ningning H (2016) Electric load forecasting based on a least squares support vector machine with fuzzy time series and global harmony search algorithm. Energies 9(2): 70. https://doi.org/10.3390/en9020070

Zhaoxia G, Weiwei L, Youkai W (2019) A multi-step approach framework for freight forecasting of river-sea direct transport without direct historical data. Sustainability 11(15): 4252. https://doi.org/10.3390/su11154252

Zhenguo D, Shukui Z (2019) Prediction model for ship traffic flow considering periodic fluctuation factors. Proceedings of 2019 3rd International Conference on Computer Engineering, Information Science and Internet Technology. Computer Science and Electronic Technology International Society, 5. https://doi.org/10.26914/c.cnkihy.2019.037179

Zhou Yang, Daamen Winnie, Velinga Tiedo (2020) Impacts of wind and current on ship behavior in ports and waterways: A quantitative analysis based on AIS data. Ocean Eng 213:107774. https://doi.org/10.1016/j.oceaneng.2020.107774

Ziwen Y (2020) Prediction of ship flow in multi branch channel based on automatic identification system data. Waterway Eng 2020(9): 152-157. https://doi.org/10.16233/j.cnki.issn1002-4972.20200820.025

click to enlarge

Figures(9) / Tables(6)

Article Contents

A Port Ship Flow Prediction Model Based on the Automatic Identification System and Gated Recurrent Units

https://doi.org/10.1007/s11804-021-00228-9

Corresponding author: Xiaofeng Xu, xfxu@shmtu.edu.cn

1 Introduction

2 Data Analysis and Statistics Based on AIS

3 Foundation of Gated Recurrent Unit

3.1 Structure of Circular Neural Network

3.2 Gated Recurrent Unit

4 Modification of Initial Prediction Value Based on Markov Method

5 Examples and Analysis

5.1 Data Analysis

5.2 Construction of Neural Network Structure

5.3 Input Data Processing

6 Experimental Results

6.1 Markov Residual Correction

6.2 Comparative Experiment of Prediction Model

7 Conclusion

Publishing history

目录

Corresponding author:
Xiaofeng Xu, xfxu@shmtu.edu.cn