Full article: Extended linear regression model for vessel trajectory prediction with a-priori AIS information

Formulae display: $MathJax Logo$ ?Mathematical formulae have been encoded as MathML and are displayed in this HTML version using MathJax in order to improve their display. Uncheck the box to turn MathJax off. This feature requires Javascript. Click on a formula to zoom.

ABSTRACT

As maritime activities increase globally, there is a greater dependency on technology in monitoring, control, and surveillance of vessel activity. One of the most prominent systems for monitoring vessel activity is the Automatic Identification System (AIS). An increase in both vessels fitted with AIS transponders and satellite and terrestrial AIS receivers has resulted in a significant increase in AIS messages received globally. This resultant rich spatial and temporal data source related to vessel activity provides analysts with the ability to perform enhanced vessel movement analytics, of which a pertinent example is the improvement of vessel location predictions. In this paper, we propose a novel strategy for predicting future locations of vessels making use of historic AIS data. The proposed method uses a Linear Regression Model (LRM) and utilizes historic AIS movement data in the form of a-priori generated spatial maps of the course over ground (LRMAC). The LRMAC is an accurate low complexity first-order method that is easy to implement operationally and shows promising results in areas where there is a consistency in the directionality of historic vessel movement. In areas where the historic directionality of vessel movement is diverse, such as areas close to harbors and ports, the LRMAC defaults to the LRM. The proposed LRMAC method is compared to the Single-Point Neighbor Search (SPNS), which is also a first-order method and has a similar level of computational complexity, and for the use case of predicting tanker and cargo vessel trajectories up to 8 hours into the future, the LRMAC showed improved results both in terms of prediction accuracy and execution time.

KEYWORDS:

1. Introduction

The world’s oceans are of critical importance to humanity as they are key to fisheries, shipping, and the environment. From an economic perspective, it is estimated that 90% of all global goods and energy transportation are done by sea, with millions of people being dependent on maritime-related activities for their livelihood. As maritime activities increase globally, there is a greater dependency on technology in monitoring, control, and surveillance of vessel activities. One of the most prominent systems for monitoring vessel activity is the Automatic Identification System (AIS). AIS operates in the VHF band and transmits messages from vessels to other vessels, terrestrial shore stations, and satellites. Due to the global increase in vessels fitted with AIS transmitters and the proliferation of satellite and terrestrial receiving stations, there has been a significant increase in AIS messages received globally. This increased data volume makes it possible to track the real-time movement of vessels and opens the door for improving vessel location predictions via historic vessel movement patterns. Many algorithms have been developed in recent years to aid in improved vessel coordinate prediction. Methods range from simplistic models as done by Burger, Kleynhans, and Lups Grobler (Citation2020) to machine learning (ML) models as done by Xin (Citation2020).

Pallotta et al. (Citation2014) presented a vessel prediction method based on Ornstein-Uhlenbeck stochastic processes, where the parameters of these processes are estimated from historical patterns in historic AIS data. The data is clustered into three types: vessels, waypoints, and routes. Route extraction is done using Traffic Route Extraction and Anomaly Detection from AIS Data (THREAD). The implementation details of THREAD can be found in Pallotta, Vespe, and Bryan (Citation2013a, Citation2013b). The three types of clustered data aid in vessel prediction and empirical calculations. The maximum prediction time window of a vessel depends on the mean duration of the historically observed route.

Lee, Han, and Whang (Citation2007) presented a trajectory clustering method, where similar trajectories are clustered together. The method works by employing a partition-and-group framework for clustering trajectories. It works by first grouping trajectories into a set of line segments, and similar segments are then grouped into clusters. The method consists of two phases, partitioning and grouping. In the first phase, the minimum description length principle is used (Grünwald, Jae Myung, and Pitt Citation2005). In the second phase, a density-based line-segment clustering algorithm is used (Ester et al. Citation1996)based on a Density-Based Algorithm for Discovering Clusters in Large Spatial Databases with Noise (DBSCAN). The model’s sensitivity on its parameters is improved in Jiashun (Citation2012).

Rong, Teixeira, and Guedes Soares (Citation2019) presented a probabilistic trajectory prediction model that describes the uncertainty in future spatial locations along a vessel’s trajectory by using continuous probability distributions. A non-parametric Bayesian model based on a Gaussian Process (GP) is presented. The GP is used to describe the uncertainty of the vessel’s motion, while the longitudinal uncertainty is derived from the dubiety of the vessel’s acceleration. The parameters of the model are derived from historic AIS data. Cholesky decomposition is used to reduce the computational complexity of the algorithm. Prior probabilities are used to predict vessel coordinates in real-time.

Jaskolski (Citation2017) implemented a Discrete Kalman Filter (DKF) (Kalman Citation1960) to predict future locations of vessels. The DKF, in the context of vessel coordinate prediction, constantly adjusts itself for an improved prediction as new observations are observed. It is assumed that a vessel fitted with an AIS sensor will not constantly send updates. The DKF consists of two sets of equations: prediction and measurement update equations used in prediction and parameter updates, respectively. Burger, Kleynhans, and Lups Grobler (Citation2020), showed that there is no significant improvement in the prediction capability that can be achieved by using a DKF instead of a Linear Regression Model (LRM) to predict linear trajectories.

Xiao et al. (Citation2020) conducted an extensive review of maritime knowledge mining and traffic forecasting technologies. The LRM is compared to several non-linear approaches. Three broad categories of non-linear algorithms are considered: machine learning approaches, knowledge-based approaches, and control theory assisted methods. The predictions range from long to short-term predictions. It is also shown that more complex methods are more accurate than the LRM but have a higher computational cost than the LRM.

Forti et al. (Citation2020) made use of a deep learning (DL) neural network (NN) approach to predict trajectories of vessels. A sequence-to-sequence model that utilizes a Long Short-Term Memory (LSTM) encoded-decoder recurrent neural network (RNN) is proposed. Historic AIS data is used to train the LSTM model. The method aims to learn the predictive distribution of maritime traffic patterns using historic AIS data. Learning the predictive patterns enables the model to predict more accurately. It was shown that the model could predict more accurately than the Ornstein-Uhlenbeck process, given a time window of 20 observations.

Murray and Prasad Perera (Citation2020) presented a novel dual linear autoencoder approach to predict a vessel’s trajectory. The method predicts a future trajectory using historic AIS data. The method implements unsupervised learning for trajectory clustering and classification.

Xin (Citation2020) presented a context-based trajectory prediction algorithm utilizing LSTM networks. Real-valued target trajectories are converted into discrete path sets. Distinctive patterns are clustered hierarchically using historical AIS data. Two models are compared, an RNN consisting of one LSTM and another RNN consisting of $k$ LSTMs. In the RNN with $k$ LSTMS, one LSTM is created for each distinct path. Yaun et al. (Citation2019) used an LSTM model to reconstruct vessel trajectories. This complex method requires pre-processing and data clustering.

Dimitrios, Xidias, and Lekkas (Citation2016) aim to accurately predict future geo-coordinates of a vessel by using artificial NNs (ANNs). Their model learns in real-time whilst predicting. It has a prediction time horizon of up to 15 minutes. Different types of model pre-processing and construction are implemented. Historic AIS data was used to train the ANN. Alizadeh, Asghar Alesheikh, and Sharif (Citation2021) proposed three novel prediction methods based on historic AIS data. The first method proposed is a Point-based Similarity Search Prediction (PSSP), which was inspired by Wijaya and Nakamura (Citation2013). The historical points are measured in terms of their spatial location, SOG, and COG. The second method proposed is called Trajectory-based Similarity Search Prediction (TSSP), where each recorded AIS trip is regarded as a trajectory. The PSSP is a point-based method, whereas the TSSP is a trajectory-based method. Finally, a trajectory-based similarity search prediction is proposed using an RNN LSTM (TSSPL). Alizadeh, Asghar Alesheikh, and Sharif (Citation2021) point out that vessel movement is affected by external movements such as wind, waves, and sea currents. The PSSP and the TSSP are not able to account for these external factors. Another RNN LSTM model was, therefore, built to take this into account (i.e. TSSPL). The TSSPL has an additional input, a measure of similarity between trajectories (similar to what was done in Tang, Yin, and Shen Citation2019).

The Safety of Life at Sea (SOLAS) regulation V/19 states that passenger vessels and vessels larger than 300 Gross Tonnage must have AIS transmitters fitted. All cargo and tanker vessels should adhere to the SOLAS regulations. The regulations were adopted and are now regulated by the International Maritime Organization (IMO).Footnote¹ There are, however, some drawbacks when working with AIS data. Vessels may deactivate their AIS transponders, and some vessels may not be fitted with transponders (Chen et al. Citation2019).

Other vessel tracking information systems include Long-Range Identification and Tracking (LRIT), Coastal radar, Satellite-borne Synthetic Aperture Radar (SAR), and optical satellites. Moreover, methods such as video tracking at ports exist where maritime video surveillance is present, as done in Chen et al. (Citation2019) and Chen et al. (Citation2020).

Finally, the method presented by Hexeberg, Flåten, and Brekke (Citation2017) uses historic AIS data to predict future locations of vessels. The method is called Single-Point Neighbor Search (SPNS). The method does a close neighbor (CN) search by extracting historic observations within a certain radius of the current vessel’s spatial location. Vessels in the CN set that do not adhere to prespecified SOG and COG range values are removed. Using the CN set, the median COG and SOG values are calculated. Using the median COG and SOG, the predicted longitude and latitude are calculated. The method predicts at constant distance intervals, where the SOG is used to calculate the time passed between two observations. The method can be confidently predicted with a time horizon of up to 15 minutes.

The aforementioned methods are complex in nature, being programmatically challenging to implement from first principles. The setup, initialization, and parameter estimation of these methods require fine-tuning to obtain optimal and accurate results. Burger, Kleynhans, and Lups Grobler (Citation2020) showed that the complexity trade-off of using a DKF over an LRM resulted in no significant performance improvement in prediction accuracy, where the DKF was much more complex than the LRM. The DKF parameters are sensitive, and getting the initial set of parameters to be a reasonable estimate is crucial for the model’s performance. Xiao et al. (Citation2020) showed that the increase in performance gains comes with a significant increase in computational cost.

Therefore, this paper proposes a novel vessel location prediction method that extends the Linear Regression Model (LRM) proposed by Burger, Kleynhans, and Lups Grobler (Citation2020). Moreover, this method uses historic AIS vessel information to improve prediction accuracy. It is easy to implement, and its initial parameters require little to no fine-tuning. The proposed algorithm also enables the LRM to predict non-linear trajectories. The a-priori information used includes a Spatial Map (SM) of historic cell counts, COG values, and COG standard deviations (SDs). The SMs the method uses are easy to generate and update, which is essential as data and movement patterns of vessels change over time. We refer to this method as the LRM with added COG (LRMAC) throughout the paper.

Due to the fact that the SPNS is also a simplistic first-order method (proposed by Hexeberg, Flåten, and Brekke Citation2017), the LRMAC will be compared to it. When comparing the models, an in-depth comparison of the model accuracies and time complexities will be made. Our tests will be implemented on cargo and tanker vessel data only. This paper is structured as follows: first, we discuss the dataset and the required processing. Second, we summarize the a-priori information the LRMAC requires. Third, we present the LRMAC and the comparison method, the SPNS. Fourth, we discuss the experimental design. Finally, we end the paper with our results and a conclusion.

2. Data

In this section, we will introduce the dataset that was used to test the performance of the proposed and benchmark algorithms. We will also be discussing the pre-processing steps that are necessary to ensure the proposed algorithm will run as intended.

An open-source dataset by Ray et al. (Citation2019) was used. It consists of AIS messages recorded in the Celtic Sea, the North Atlantic Ocean, the English Channel, and the Bay of Biscay (France). In , the spatial range of the data is depicted visually (also see the details in ). The specific data characteristics are given in , and the observational period was over 6 months starting on 1 October 2015.

Figure 1. Data Spatial range – courtesy of Google Earth 2021.

Table 1. Spatial range details for Google Earth extract.

Download CSV Display Table

Table 2. Dataset attributes.

Download CSV Display Table

In , the dataset attributes and characteristics are introduced. The mathematical symbols that we use in this paper are presented in .

Table 3. Variables and Symbol Descriptions.

Display Table

2.1. Pre-processing steps

Observations were removed from the dataset that did not adhere to the following criteria:

Ship type within [70,89], where cargo vessel’s type is in [70,79] and tankers in [80,89].
SOG > 0.5kn, removing stationary observations, including stationary vessels experiencing drift due to currents and other natural phenomena due to being anchored.
SOG < 60kn, observations with high speeds are likely outliers as cargo and tanker vessels move at relatively low speeds. If more than 60 kn is observed for cargo or tanker vessels, it is most likely due to technical errors in the recorded AIS data.

The remainder of the data were grouped according to vessel MMSI and sorted in ascending order according to the timestamp recorded by each observation. All trajectories with less than 20 observations or those that span less than 5 min in total were removed. The data spans over a period of 6 months, as denoted in , which implies that there will be more than one trajectory for a given vessel at different time periods and spatial locations. Pre-processing was done to remove any non-sensical data that may lead to incorrect SMs.

2.2. Vessel Interpolation for Spatial Maps (SMs)

After pre-processing, an augmented dataset was constructed by interpolating between observations for each unique vessel MMSI, creating less sparse trajectory observations per vessel. This augmented dataset is then used to construct physically realistic and usable spatial maps (SMs) of the dataset in question. The original dataset is too sparse (as is) for a meaningful and sensical SM to be created from it. The sparsity of the original data is due to several reasons. Firstly, vessels closer to receivers can share their position more often than vessels further away. Furthermore, vessels close to the shoreline travel at lower speeds than vessels further away. Naturally, more AIS messages will be recorded for slower vessels than faster vessels over the same distance traveled.

Observational interpolation was only performed in the following cases:

The time difference between the two observations is no longer than six hours,
The distance between the two observations is within 15 km, and
The distance between observations is no smaller than the size of one grid cell. The grid cell size used was $0.88 km \times 0.88 km$ . This specific constraint prevents the over-representation of a grid cell, only adding one observation to a cell if the interpolated trajectory passed through it.

We made use of linear interpolation models by using scikit-learn (Pedregosa et al. Citation2011). Longitude, latitude, and SOG were interpolated if the cases above were met. Gaussian filtering was also used to have smoother versions of the SMs.

The COG of the interpolated and recorded observations was calculated via

(1)

ψ_{t} = \arctan (\frac{ϕ_{t - 1} - ϕ_{t}}{λ_{t - 1} - λ_{t}})

(1)

We did not make use of the COG values present in the dataset. These values yielded inaccurate results, more accurate results were obtained by calculating the actual COG based on the longitude and latitude. Having a new augmented dataset with a better representation of the historical locations and trajectories, we can create representative SMs, essential to the algorithm we propose.

3. Spatial Maps (SMs)

Any two-dimensional grid that spans the earth’s surface will be referred to as a spatial map. Each cell in the two-dimensional grid is associated with a specific range of longitudinal and latitudinal coordinates. In this paper, our SMs will be set up from the observations recorded in the dataset, which spans a latitude and longitude of $ϕ \in [- 10^{\circ}, 0^{\circ}]$ and $λ \in [45^{\circ}, 55^{\circ}]$ , respectively.

In this paper, we use a square SM, meaning the SM’s width and length have the same number of cells. The dimensions of the SMs depend on the range of the $ϕ$ and $λ$ . The dimensions of the SMs we constructed were $1250 \times 1250$ cells as shown in . The size of the SMs is $10^{\circ} \times 10^{\circ}$ square degrees. Each cell’s resolution, therefore, is equal to $0.008 \times 0.008$ square degrees. An extract of the upper left-hand corner of an artificially created SM is depicted in . This figure denotes the number of recorded observations in a cell, within a longitude and latitude range based on the historic AIS data.

Figure 2. Spatial distribution map matrix extract example.

Haversine’s formula for distance on a sphere is given by EquationEquation (2)(2) $\begin{aligned} d = 2 r \cdot \arcsin \\ \sqrt{si n^{2} (\frac{ϕ_{2} - ϕ_{1}}{2}) + \cos (ϕ_{1}) \cos (ϕ_{2}) \sin^{2} (\frac{λ_{2} - λ_{1}}{2})} \end{aligned}$ (2) . This formula is used to calculate the distance between two coordinates on a sphere.

(2)

\begin{aligned} d = 2 r \cdot \arcsin \\ \sqrt{si n^{2} (\frac{ϕ_{2} - ϕ_{1}}{2}) + \cos (ϕ_{1}) \cos (ϕ_{2}) \sin^{2} (\frac{λ_{2} - λ_{1}}{2})} \end{aligned}

(2)

where,

$r$ represents the mean radius of the earth ( $\approx 6371 km$ ).
$ϕ_{1}$ and $ϕ_{2}$ represent the latitudinal coordinates of two observations, the latitudes of points one and two, respectively.
$λ_{1}$ and $λ_{2}$ represent the longitudinal coordinates of two observations, the longitudes of points one and two, respectively.

It now follows from the Haversines formula that the area associated with each cell is roughly $0.89 \times 0.89$ km², where $0.89$ km $\approx 0.48$ nautical miles.

3.1. Vessel counts SDM ( $K$ )

The first type of SM that we use represents the number of observations recorded within each grid cell $ξ$ . Each $ξ$ in $K$ records the number of observations that were historically within the longitude and latitude range of that specific cell. Cell counts will be higher in spatial areas where vessels are slower moving compared to faster-moving areas. shows the logarithmically scaled SDM of the vessel counts for each cell. The cell counts of the current vessel location and its neighbors are used in the prediction equations of the LRMAC, scaling the contribution of historic COG information.

Figure 3. Log scale vessel counts SDM.

3.2. The COG $(Ψ)$ and COGSD $(Σ)$ SM

The second SM that we will be using will be the course over ground SM (represented by $Ψ$ ). Each cell of the $Ψ$ represent the mean COG value that was recorded in that cell. COG is measured in degrees and will always be positive.

The third SM we used is the so-called COG standard deviation SM (represented by $Σ$ ). The value of each of its cells was computed as follows:

(3)

Σ_{i_{ϕ}, i_{λ}} = \sqrt{\frac{1}{n} \sum_{j = 1}^{n} {(ψ_{i_{ϕ}, {i_{λ}}_{j}} - Ψ_{i_{ϕ}, i_{λ}})}^{2}}

(3)

where

$n$ , represents the number of observations observed in a cell as determined by $K$ .
$i_{ϕ}, i_{λ}$ , represents the index values associated with $ϕ$ and $λ$ respectively on the SM grid.
$Ψ_{i_{ϕ}, i_{λ}}$ , represents the mean COG at a specific index value on the SM grid.
${ψ_{(i_{ϕ}, i_{λ})}}_{j}$ , represents the $j^{th}$ COG value in the cell with index values $i_{ϕ}, i_{λ}$
$Σ_{i_{ϕ}, i_{λ}}$ , refers to the COG standard deviation at index value $i_{ϕ}, i_{λ}$ .

Loosely speaking, the entries in $Σ$ can be interpreted as how “confident” we ought to be in the corresponding entry in $Ψ$ . Higher cell values in $Σ$ mean that historically there were many vessels traveling in different directions as the SD is higher.

In , we can see a visual representation of $Ψ$ . Looking at the figure, we can see that cargo and tanker vessels move in a specific direction in certain geographic locations. Two distinct highways are clearly visible close to the center of (Grobler and Kleynhans Citation2019). A highway is a route that many vessels traverse. The aqua green highway is used by cargo and tankers to travel upward (North), while the blue highway is used to travel downward (South).

Figure 4. Course over ground SM.

In , we see a visual representation of $Σ$ , as calculated by EquationEquation (3)(3) $Σ_{i_{ϕ}, i_{λ}} = \sqrt{\frac{1}{n} \sum_{j = 1}^{n} {(ψ_{i_{ϕ}, {i_{λ}}_{j}} - Ψ_{i_{ϕ}, i_{λ}})}^{2}}$ (3) . The standard deviation of the areas that contain more traffic in different directions is larger than those containing less traffic (this is especially true for the areas surrounding harbors). Yellowish colors represent higher SD values. The highways mentioned, however, are not associated with high SD values. This implies that these highways are highly directional as shown in .

Figure 5. Course over ground standard deviation SM.

4. The proposed algorithm

In this section, we present the proposed LRMAC algorithm. As mentioned before, this algorithm uses historic (a-priori) AIS data to predict the trajectories of vessels. The algorithm can predict non-linear vessel trajectories. To be more specific, the LRMAC uses three SMs discussed in the previous section $K, Ψ,$ and $Σ$ . We start this section by discussing the different unit conversions that the algorithm requires.

4.1. Speed Over Ground (SOG)

We converted SOG from knots to meters per second via EquationEquation (4)(4) $V_{t}^{'} = 0.51 \dot{4} \cdot V_{t}^{^{''}}$ (4) .

(4)

V_{t}^{'} = 0.51 \dot{4} \cdot V_{t}^{^{''}}

(4)

We then converted the SOG into degrees/s (°/s) using

(5)

V_{t} = \frac{V_{t}^{'}}{l}

(5)

with

(6)

l = \frac{2 π}{360} \times 6378000 = 111137 m

(6)

The constant $l$ , can be interpreted as the average number of meters that one degree of latitude and longitude span on Earth.

4.2. The Linear Regression Model (LRM)

The method presented in this paper (LRMAC) is an extension of the LRM presented by Burger, Kleynhans, and Lups Grobler (Citation2020). We will summarize it below for the benefit of the reader. The LRM can only be predicted linearly with regular time intervals. The prediction interval that we used was 1 s, i.e. $Δ k_{t} = 1 s$ . The LRM-based approach works by estimating $V_{t}$ using a rolling window linear model.

One of the parameters that must be set is the prediction duration, i.e. the total number of hours one would like to predict into the future. The other parameter will be the step size of the predictions, measured in seconds between any two observations. The prediction interval can then be computed.

The LRM consists of two sets of equations: the predictor and measurement update equations. Let the vector $x_{t}$ be defined as:

(7)

x_{t} = {[λ_{t}, ϕ_{t}]}^{T}

(7)

The symbols $ϕ$ and $λ$ are defined in . Subscript $t$ refers to the observations at timestep $t .$ If there is no observation at timestep $t, x_{t}$ is assumed to be equal to the all-zero vector.

The prediction equations for the LRM are

(8)

{\hat{x}}_{t}^{-} = {\hat{x}}_{t - 1} + {\hat{V}}_{ω, t} \cdot Λ_{t}

(8)

(9)

{\hat{V}}_{ω, t} = Δ {\hat{V}}_{ω, t - 1} \cdot k_{t} + {\hat{V}}_{ω, c_{t - 1}}

(9)

(10)

Λ_{t} = {[cos (ψ_{t}) sin (ψ_{t})]}^{T}

(10)

The variables in the equations above are defined below,

${\hat{x}}_{t}^{} -$ denotes the vessel’s predicted position vector using all observations up until timestep $t - 1.$

${\hat{x}}_{t - 1}$ denotes the vessel’s updated estimated position vector using all observations up until timestep $t - 1$ .
${\hat{V}}_{ω, t}$ denotes the vessels predicted SOG using all observations up until timestep $t - 1$ .

$\nabla {\hat{V}}_{ω, t - 1}$ denotes the vessel’s updated estimated gradient of our LRM using all observations up until timestep $t - 1$ .
${\hat{V}}_{ω, c_{t - 1}}$ denotes the updated estimated $y$ -intercept of our LRM using all observations up until timestep $t - 1$ .
$ψ_{t}$ , denotes the COG of a vessel at timestep $t$ , the COG remains the same until we observe a new COG value, i.e. getting an update from the vessel.
$ω$ , denotes the window size of our LRM.
$k_{t}$ , denotes the elapsed time in seconds at timestep $t$ .

The measurement update equations are

(11)

{\hat{x}}_{t} = {\hat{x}}_{t}^{-} + (x_{t} - {\hat{x}}_{t}^{-}) = x_{t}

(11)

(12)

\nabla {\hat{V}}_{ω, t} = \frac{\sum_{i = {\b 1}_{n_{t} > ω} (n_{t} - ω)}^{n_{t}} (V_{i} - {\overset{ˉ}{V}}^{ω}) (k_{i} - {\overset{ˉ}{k}}^{ω})}{\sum_{i = 1_{n_{t} > ω} (n_{t} - ω)}^{n_{t}} (k_{i} - {\overset{ˉ}{k}}^{ω})}

(12)

(13)

{\hat{V}}_{ω, c_{t}} = {\overset{ˉ}{V}}^{ω} - \nabla {\hat{V}}_{ω, t} \cdot {\overset{ˉ}{k}}^{ω}

(13)

Moreover

(14)

1_{n_{t} > ω} = \{\begin{matrix} 1, if n_{t} > ω \\ 0, otherwise \end{matrix}

(14)

(15)

{\overset{ˉ}{V}}^{ω} = \frac{1}{n_{ω}} \sum_{i = 1_{n_{t} > ω} (n_{t} - ω)}^{n_{t}} V_{i}

(15)

(16)

{\overset{ˉ}{k}}^{ω} = \frac{1}{n_{ω}} \sum_{i = 1_{n_{t} > ω} (n_{t} - ω)}^{n_{t}} k_{i}

(16)

(17)

n_{ω} = \{\begin{matrix} n_{t}, if n_{t} < ω \\ ω, otherwise \end{matrix}

(17)

The variables in the equations above are defined below:

$V_{i}$ denotes the $i^{th}$ true SOG observation that was recorded for a particular vessel. We assumed $V_{0} = 4 m / s$ .
$k_{i}$ denotes the total time that has elapsed after having the $i^{th}$ true observation.
$n_{t}$ denotes the total number of true observations recorded after $Δ k_{t} \cdot t$ seconds, where in this paper $Δ k_{t} = 1 s$ . $Δ k_{t}$ refers to the prediction time interval between two subsequent observations.
Note that subscript $t$ is used as a time step index, while subscript $i$ is used as an observational index.

The measurement update equation values are only updated once a new observation is recorded. If there are no new observations recorded, the algorithm will evaluate the prediction equations. During the period of no observations, it is assumed that the COG remains unchanged. Moreover, the predicted position vector at $t - 1$ is used to obtain a new estimate of the position vector at $t$ .

4.3. LRM with a-priori COG information (LRMAC)

In this section, we will present our novel prediction method that is an extension of the LRM. The proposed method uses spatial maps as they can easily be loaded into memory, and their sizes are relatively small. The proposed method will allow for non-linear trajectory predictions of vessels. Note that the symbols used in this section should not be confused with those used in the previous section for the LRM.

The LRMAC algorithm no longer assumes a constant COG. The value of the COG is dynamically updated using a-priori information, using the three SM matrices $K$ , $Ψ$ and $Σ$ . As the COG value is dynamically updated, the LRMAC can predict non-linear trajectories (i.e. the SOG in the longitudinal and latitudinal directions will change). It is assumed that the SOG will remain constant over the prediction period, where the constant SOG used during prediction is derived from the last $ω$ observed SOG values of the vessel under consideration.

The predictor equations are modified for the LRMAC compared to the LRM. Whilst in prediction mode, the COG value is updated via a-priori information. The COG value computed in step $t$ is only applied during step $t + 1$ .

Let $\hat{x} \bar{_{t}}$ be the predicted position vector as in EquationEquation (8)(8) ${\hat{x}}_{t}^{-} = {\hat{x}}_{t - 1} + {\hat{V}}_{ω, t} \cdot Λ_{t}$ (8) . Let,

(18)

n_{\hat{ϕ}, \hat{λ}} = [n_{\hat{ϕ}}, n_{\hat{λ}}]

(18)

denote the SM index positions associated with ( $ϕ$ , $λ$ ). In other words, with $[n_{\hat{ϕ}}, n_{\hat{λ}}]$ we can extract the elements in $K$ , $Ψ$ and $Σ$ associated with ( $ϕ$ , $λ$ ) by making use of matrix subscripting. Let us now construct the following index matrix H:

(19)

H = [\begin{matrix} n_{\hat{ϕ} - η \cdot κ, \hat{λ} - η \cdot κ} \dots n_{\hat{ϕ} - η \cdot κ, \hat{λ}} \dots n_{\hat{ϕ} - η \cdot κ, \hat{λ} + η \cdot κ} \\ ⋮ ⋱ ⋮ . . . ⋮ \\ n_{\hat{ϕ} \cdot κ, \hat{λ} - η \cdot κ} \dots n_{\hat{ϕ}, \hat{λ}} \dots n_{\hat{ϕ}, \hat{λ} + η \cdot κ} \\ ⋮ . . . ⋮ ⋱ ⋮ \\ n_{\hat{ϕ} + η \cdot κ, \hat{λ} - η \cdot κ \dots n_{\hat{ϕ} - η \cdot κ, \hat{λ}} \dots n_{\hat{ϕ} + η \cdot κ, \hat{λ} + η \cdot κ}} \end{matrix}]

(19)

where

$η$ in EquationEquation (19)(19) $H = [\begin{matrix} n_{\hat{ϕ} - η \cdot κ, \hat{λ} - η \cdot κ} \dots n_{\hat{ϕ} - η \cdot κ, \hat{λ}} \dots n_{\hat{ϕ} - η \cdot κ, \hat{λ} + η \cdot κ} \\ ⋮ ⋱ ⋮ . . . ⋮ \\ n_{\hat{ϕ} \cdot κ, \hat{λ} - η \cdot κ} \dots n_{\hat{ϕ}, \hat{λ}} \dots n_{\hat{ϕ}, \hat{λ} + η \cdot κ} \\ ⋮ . . . ⋮ ⋱ ⋮ \\ n_{\hat{ϕ} + η \cdot κ, \hat{λ} - η \cdot κ \dots n_{\hat{ϕ} - η \cdot κ, \hat{λ}} \dots n_{\hat{ϕ} + η \cdot κ, \hat{λ} + η \cdot κ}} \end{matrix}]$ (19) denotes the neighborhood parameter, and
$κ$ represents the width and length of an SM cell (see and ).

The index matrix $H$ is used to select a specific sub-grid/matrix from $K$ and $\b Σ$ .

Let the a-priori cell counts, of the area surrounding ${\hat{x}}_{t}^{-}$ , therefore, be denoted by

K_{H}, where K_{H} \subset K

The size of the aforementioned area is determined by $η$ . Moreover, let the a-priori average COG value associated with ${\hat{x}}_{t}^{-}$ , be denoted by

Ψ_{n_{\hat{ϕ}, \hat{λ}}}, where Ψ_{n_{\hat{ϕ}, \hat{λ}}} \in Ψ

Finally, let the COG SD associated with ${\hat{x}}_{t}^{-}$ , be defined as,

Σ_{n_{\hat{ϕ}, \hat{λ}}}, where Σ_{n_{\hat{ϕ}, \hat{λ}}} \in Σ

4.3.1. Updating the COG using a-priori information

We first need to calculate the confidence we have in the a-priori COG value $Ψ_{n_{\hat{ϕ}, \hat{λ}}}$ . This confidence measure $ρ$ allows ranges between $[0, 1]$ , and it is used to scale the contribution of the a-priori COG information. It is determined by the number of observations in the neighborhood, as shown by EquationEquation (20)(20) $ρ = 1_{Ψ} \cdot \frac{K_{n_{\hat{ϕ}, \hat{λ}}}}{\max (K_{H})}$ (20) . If the a-priori grid count for our current cell is high compared to the other cell counts in the neighborhood, we can more confidently say that the predicted location is in an area where historically many vessels have traveled before. If the observed cell count for the current position is lower than in the surrounding cells, we will give less confidence in the a-priori COG update and rather assign more confidence in the LRM with the scaling factor. We can achieve this by calculating the confidence factor $ρ$ :

(20)

ρ = 1_{Ψ} \cdot \frac{K_{n_{\hat{ϕ}, \hat{λ}}}}{\max (K_{H})}

(20)

where

$ρ$ denotes the confidence (scaling factor) that we have in our prediction as determined by the current position of the vessel.
$K_{n_{\hat{ϕ}, \hat{λ}}}$ denotes the cell count value associated with $(\hat{ϕ}, \hat{λ})$ . The index $n_{\hat{ϕ}, \hat{λ}}$ is used to extract it from $K$ .
$1_{Ψ}$ denotes the indicator function that sets $ρ$ to zero. The indicator function enforces the restrictions that we impose on whether the COG should be updated or not.

$ρ$ determines the total weight the a-priori COG information should have in the LRMAC. If $ρ$ is close to one, we can be confident in the a-priori value $Ψ_{n_{\hat{ϕ}, \hat{λ}}}$ , and if it is close to zero, we are less confident, and it should have less of a contribution in our predictions, as seen in EquationEquation (22)(22) ${\hat{ψ}}_{t + 1} = (1 - ρ) ψ_{t} + ρ Ψ_{n_{\hat{ϕ}, \hat{λ}}}$ (22) . Two factors influence $ρ$ . First, if a few vessels have traversed the cell associated with $x_{t}$ relative to its neighboring cells, $\frac{K_{n_{\hat{ϕ}, \hat{λ}}}}{\max (K_{H})}$ would be relatively close to zero, indicating that the a-priori COG information should have less impact in updating $Ψ_{t + 1}$ . The same is true for the opposite: values closer to one would have a more significant contribution to updating $Ψ_{t + 1}$ . The second factor is determined by the value of the indicator function in EquationEquation (21)(21) $1_{Ψ} = {\begin{matrix} 0, Ψ_{n_{\hat{ϕ}, \hat{λ}}} \notin ℜ \\ 0, Σ_{n_{\hat{ϕ}, \hat{λ}}} \notin ℜ \\ 0, Σ_{n_{\hat{ϕ}, \hat{λ}}} > 10^{°} \\ 0, \max (K_{H}) = 0 1, otherwise \end{matrix}$ (21) .

(21)

1_{Ψ} = {\begin{matrix} 0, Ψ_{n_{\hat{ϕ}, \hat{λ}}} \notin ℜ \\ 0, Σ_{n_{\hat{ϕ}, \hat{λ}}} \notin ℜ \\ 0, Σ_{n_{\hat{ϕ}, \hat{λ}}} > 10^{°} \\ 0, \max (K_{H}) = 0 1, otherwise \end{matrix}

(21)

This equation will evaluate to zero if,

$Ψ$ or $Σ$ contains no information at index $n_{\hat{ϕ}, \hat{λ}}$ . This implies that there is no a-priori information available for us to make use of.
The square root of the COG SD at $n_{\hat{ϕ}, \hat{λ}}$ is larger than 10° i.e. standard deviation. This implies that many vessels have traversed through the cell associated with ${\hat{x}}_{t}$ , all going in different directions. This implies that the a-priori COG value is unreliable.

The COG value can now be updated as follows:

(22)

{\hat{ψ}}_{t + 1} = (1 - ρ) ψ_{t} + ρ Ψ_{n_{\hat{ϕ}, \hat{λ}}}

(22)

where

$(1 - ρ)$ indicates the role that the previous observed COG should have in the COG update.
$ψ_{t}$ denotes the previously observed or predicted COG.
$ρ$ denotes the scaling factor that the historic a-priori COG information should have.
$Ψ_{n_{\hat{ϕ}, \hat{λ}}}$ , denotes the a-priori COG value at index $n_{\hat{ϕ}, \hat{λ}}$ .

If our confidence in the a-priori COG value is high, we weigh it accordingly. If it is low, we rather put more trust in the COG value at time step $t$ (weighted by $ρ$ ). Note the estimated COG will replace the COG value that is used in EquationEquation (10)(10) $Λ_{t} = {[cos (ψ_{t}) sin (ψ_{t})]}^{T}$ (10) .

4.3.2. A flow diagram representation of the LRMAC

In , a flowchart of the LRMAC methodology is depicted. The flowchart depicts the LRM and how a-priori course information is added to extend the LRM into the LRMAC. Initializations are in green, functions are in blue, the predicted location is in gray, and parameter extracts are denoted in orange. It is assumed that all pre-processing has already been applied to the dataset.

Figure 6. LRMAC flowchart.

The first step is to construct the SMs from the dataset containing all historic AIS data and initialize all parameters. The last $ω$ recorded observations are used as additional input parameters in the LRM, specifically for the measurement update equations. The LRM consists of two sets of equations, measurement update and predictor equations. The measurement equations are used to update the predictor parameters and the predictor equations to predict the next set of coordinates.

The algorithm starts at the indicated red dot in the flowchart. The LRM is used to estimate the longitude and latitude at the next time step. All predictions are made at regular spaced time intervals $Δ k_{t}$ . The LRM assumes a constant SOG and COG, based on the last $ω$ observations.

Given the predicted longitude and latitude, we extend the LRM into the LRMAC by dropping the constant COG assumption. The COG will now be updated based on a-priori COG information at the location predicted by the LRM. The extension of the LRM allows the predictions to follow historic movement trends in the SMs, allowing for non-linear trajectory prediction.

Given the predicted location ${\hat{x}}_{t}$ , the corresponding index $n_{\hat{ϕ}, \hat{λ}}$ in the SMs can be calculated, and the neighboring indexes matrix $H$ . Using $n_{\hat{ϕ}, \hat{λ}}$ and $H$ , the a-priori values: $K_{H}$ , $Ψ_{n_{\hat{ϕ}, \hat{λ}}}$ , $Σ_{n_{\hat{ϕ}, \hat{λ}}}$ , and $K_{n_{\hat{ϕ}, \hat{λ}}}$ is extracted and used in EquationEquations (20)(20) $ρ = 1_{Ψ} \cdot \frac{K_{n_{\hat{ϕ}, \hat{λ}}}}{\max (K_{H})}$ (20) and (Equation21(21) $1_{Ψ} = {\begin{matrix} 0, Ψ_{n_{\hat{ϕ}, \hat{λ}}} \notin ℜ \\ 0, Σ_{n_{\hat{ϕ}, \hat{λ}}} \notin ℜ \\ 0, Σ_{n_{\hat{ϕ}, \hat{λ}}} > 10^{°} \\ 0, \max (K_{H}) = 0 1, otherwise \end{matrix}$ (21) ) to calculate the updated COG value ${\hat{ψ}}_{t + 1}$ . The updated COG is used in the next iteration by updating the measurement equations of the LRM, which in turn updates the predictor equations. This allows for a COG that updates dynamically based on historic AIS data.

All other parameters, including the COG, will be updated once new observations are received from the vessel, updating all the measurement equations.

4.4. Single-Point Neighbor Search (SPNS)

We will now be summarizing the method presented by Hexeberg, Flåten, and Brekke (Citation2017) called Single-Point Neighbor Search (SPNS). We will be comparing the LRMAC to the SPNS, as it was one of the most similar methods we found. We will compare the prediction accuracy and the time complexity of each. Note, do not confuse the variables defined in this section with the variables used in the rest of the paper.

Let

(23)

X = {[X_{1} X_{2} \dots X_{M}]}^{T}

(23)

be defined as a matrix with all the historic AIS data observations, where $M$ indicates the number of AIS messages recorded.

Let

(24)

X_{i} = [MMS I_{i} t_{i} p_{i}^{T} χ_{i} v_{i}]

(24)

be defined as a vector, where $i \in \{1, 2, \dots, M\}$ . The elements in $X_{i}$ refer to the MMSI, timestamp, position vector, COG, and SOG of timestep i. The position vector $p_{i}^{T}$ denotes the spatial location of a vessel, where $p_{i} = {[λ_{i}, ϕ_{i}]}^{T}$ denotes the longitude and latitude at message $i$ .

A predicted trajectory consists out of $K^{s}$ predicted positions, which are instants of time. At every iteration $k$ , a prediction is made, where $k \in 1, \dots, K^{s}$ . The predicted state is divided into an a-priori state ${\hat{X}}_{i}^{k -}$ and a-posteriori state ${\hat{X}}_{i}^{k +}$ . The states are denoted as:

(25)

{\hat{X}}_{i}^{k -} = [MMS I_{i} {\hat{t}}^{k} {\hat{p}}^{k} {\hat{χ}}^{k -} {\hat{v}}^{k -}]

(25)

and

(26)

{\hat{X}}_{i}^{k +} = [MMS I_{i} {\hat{t}}^{k} {\hat{p}}^{k} {\hat{χ}}^{k +} {\hat{v}}^{k +}]

(26)

The only difference between the two equations above, is the COG and SOG. The predicted ${\hat{χ}}^{k -}$ and ${\hat{v}}^{k -}$ at iteration $k$ in the a-priori state, represent the predicted COG and SOG between the previous position ${\hat{p}}^{k - 1}$ and the current position ${\hat{p}}^{k}$ . The a-posteriori predicted COG and SOG at iteration $k$ is the difference between the current position ${\hat{p}}^{k}$ and the next position ${\hat{p}}^{k + 1}$ .

The SPNS makes use of a close neighbor search, where a set radius is defined as a parameter and observations within the radius is queried, given that the observations adhere to a set of predefined constraints.

Let the close neighbors (CNs) at prediction step $k$ be defined as:

(27)

C^{k} = {X_{i}_{|} d ({\hat{p}}^{k}, p_{i}) \leq r_{c}, χ_{i} \in S, X_{i} \in X}

(27)

where

$d ({\hat{p}}^{k}, p_{i})$ is defined as the Haversine distance between the longitude and latitude in ${\hat{p}}^{k}$ and $p_{i}$ . The Haversine distance is defined in EquationEquation (2)(2) $\begin{aligned} d = 2 r \cdot \arcsin \\ \sqrt{si n^{2} (\frac{ϕ_{2} - ϕ_{1}}{2}) + \cos (ϕ_{1}) \cos (ϕ_{2}) \sin^{2} (\frac{λ_{2} - λ_{1}}{2})} \end{aligned}$ (2) .
$r_{c}$ , is defined as the search radius in meters, to search for all the CNs within $r_{c}$ of the current position. The value of $r_{c}$ is a predefined parameter.
$S$ is defined as the interval of course angles. Only observations within the course angle interval will be included in the CN set.

Let the course angles $S$ be defined by:

(28)

S = [{\hat{χ}}^{k -} - Δχ, {\hat{χ}}^{k -} + Δχ]

(28)

where $Δχ > 0$ , is the maximum course angle deviation, which is a predefined parameter.

The above pre-processing steps of the distance and COG deviation will filter out all the observations that we do not need for the remaining steps. All the CNs are extracted from historic AIS data. Let every state that belongs to the set of CNs at the prediction step $k$ be denoted as $X_{i} \in C^{k}$ , where $X_{c}^{k} = [MMS I_{c}^{k} t_{c}^{k} χ_{c}^{k} v_{c}^{k}]$ and $c \in \{1, \dots, C\}$ . $C$ is the number of CNs at $k$ .

The predicted trajectory at a state $X_{i}$ is defined as,

(29)

{\hat{T}}_{i} = \{[{\hat{p}}^{1} {\hat{t}}^{1}], \dots, [{\hat{p}}^{K} {\hat{t}}^{K^{s}}]\}

(29)

Let the true trajectory, $T_{i}$ , given state $X_{i}$ , be defined as

(30)

T_{i} = \{[p^{1} t^{1}], \dots, [p^{L} t^{L}]\}

(30)

$L$ denotes the number of AIS states recorded. $K^{s}$ and $L$ are not necessarily equal as several prediction steps can be made between two subsequent AIS messages. The first elements in both ${\hat{T}}_{i}$ and $T_{i}$ are equal, as it is the starting point given by state $X_{i}$ .

4.4.1 SPNS prediction

A new parameter is introduced, $Δ l$ , which denotes the step length from the current observation to the next predicted observation in meters. $Δ l$ decides how far the next position should be propagated. Let the predicted position be denoted by

(31)

{\hat{p}}^{k + 1} = p^{k} + Δ l \cdot {[sin ({\hat{χ}}^{k +}) \cdot f ({\hat{ϕ}}^{k}) cos ({\hat{χ}}^{k +}) \cdot g ({\hat{ϕ}}^{k})]}^{T}

(31)

$f ({\hat{ϕ}}^{k})$ and $g ({\hat{ϕ}}^{k})$ are functions of the current latitude ${\hat{ϕ}}^{k}$ , which transforms from meters to degrees longitude and latitude, respectively. The step length, $Δ l$ , reflects the curvature of the sea lanes ahead.

4.4.1.1 Course prediction

The COG value ${\hat{χ}}^{k +}$ is used when calculating the predicted position ${\hat{p}}^{k + 1}$ . ${\hat{χ}}^{k +}$ is the a-priori course calculated from the extracted CNs at position $p^{k}$ . Note that the course is periodic in $[0^{\circ}, 360^{\circ}]$ ; special care must be taken when calculating the CN set’s mean COG. The mean COG is calculated as follows:

(32)

{\overset{ˉ}{χ}}_{c} = \{\begin{matrix} ta n^{- 1} (\frac{\overset{ˉ}{s}}{\overset{ˉ}{c}}) if \overset{ˉ}{s} > 0, \overset{ˉ}{c} > 0 \\ ta n^{- 1} (\frac{\overset{ˉ}{s}}{\overset{ˉ}{c}}) + 180^{\circ} if \overset{ˉ}{c} < 0 \\ {tan}^{- 1} (\frac{\overset{ˉ}{s}}{\overset{ˉ}{c}}) + 360^{\circ} if \overset{ˉ}{s} ⟨0, \overset{ˉ}{c}⟩ 0 \end{matrix}

(32)

where

(33)

\overset{ˉ}{s} = \frac{1}{C} \sum_{c = 1}^{C} sin (χ_{c})

(33)

(34)

\overset{ˉ}{c} = \frac{1}{C} \sum_{c = 1}^{C} cos (χ_{c})

(34)

A constant velocity model is used whenever $C^{k} \notin R$ as done by Hexeberg, Flåten, and Brekke (Citation2017). The median course ${\tilde{χ}}_{c}$ can be calculated by calculating $\tilde{s}$ and $\tilde{c} .$ Hexeberg, Flåten, and Brekke (Citation2017) recommended to use the median course for non-linear trajectories.

4.4.1.2 Speed prediction

The median ${\tilde{v}}_{c}$ of the CNs is used to calculate the predicted speed. Note that the speed prediction is only used in the time update equation shown in Algorithm 1. The predicted speed ${\tilde{v}}_{c}$ is used to calculate the time passed between the current observation and the predicted observation, where the time passed is denoted by $\frac{Δ l}{{\hat{v}}^{k +}}$ . The time update equation is defined as:

(35)

{\hat{t}}^{k + 1} = {\hat{t}}^{k} + \frac{Δ l}{{\hat{v}}^{k +}}

(35)

where

${\hat{t}}^{k}$ denotes the current time,
$Δ l$ denotes the distance between the current and predicted observation, and
${\hat{v}}^{k +} = {\tilde{v}}_{c}$ , given set $C^{k}$ at $k$ .

The calculation of the predicted time allows the SPNS to have regular spaced distance intervals of length $Δ l$ . The algorithm for the SPNS is presented below.

Table

Display Table

In , the set of hyperparameters that are used for curved trajectories is shown. These decision parameters are identical to the ones presented by Hexeberg, Flåten, and Brekke (Citation2017) and were used to compare the SPNS to the LRMAC.

Table 4. Curved trajectory decision parameters.

Display Table

5. Experimental design

This section will discuss how we set up our experiments to get the optimal set of parameters for the proposed LRMAC model. We will also be discussing how we will compare the LRMAC to the SPNS. We will be comparing the LRM and LRMAC to show the increase in prediction accuracy of the LRMAC over the LRM and then compare the LRMAC to the SPNS model comparing prediction accuracy and time complexity. We will be discussing the vessels used, how we extracted their trajectories, the subsampling approach that we made use of, and how we determined the performance of the two approaches.

To compare the LRMAC and SPNS, 40 unique vessel trajectories were extracted at random from the dataset published by Ray et al. (Citation2019) after having been pre-processed. The pre-processing removed stationary observations and vessels with too few observations, as mentioned earlier in the paper. The extracted vessels are a mix of cargo and tanker vessels. The extracted trajectories of the vessels were all from the high-density areas depicted in . We restricted our use case to this example to show the performance of the LRM, LRMAC, and SPNS on curved trajectories. The LRMAC algorithm can be applied to various use cases and will perform well on any SM containing clear regions exhibiting path directionality.

5.1 SPNS vessel query setup

Since the SPNS algorithm requires querying historic observations within a specified radius, all the data was loaded into a PostgreSQL database.Footnote² An extension was added to PostgreSQL called PostGIS,Footnote³ which allows for improved spatial queries with a datatype called geometry. The PostGIS plugin uses a unique kind of indexing. Querying observations in PostgreSQL with PostGIS allow that vessels within a given radius from the search point can be extracted into $C^{k}$ .

5.2 Trajectory subsampling method

In order to have a large test set of trajectories to identify the ideal parameters for the LRMAC and compare it to the SPNS, we created a method that would subsample trajectories. We are effectively creating multiple trajectories from one observed trajectory. A given trajectory $T_{i}$ will be subsampled into different time subsets given a prediction length $h$ . Each time a subset’s starting observation will differ by 1 hour compared to the subset that precedes or follows it. We refer to this hour difference as a stride length $s$ , where $s$ defines the starting point of the stride hour from $T_{i}$ . Stride values are measured in hours. This method allows us to extract multiple sub trajectories $T_{s, h}$ from $T_{i}$ . Let the number of sub trajectories that can be created with a prediction length of $h$ from $T_{i}$ be denoted by

(36)

# T_{h} = {⌊\max (t_{T_{i}})⌋}_{hour} - h + 1

(36)

and let the total number of sub trajectories from $T_{i}$ with different starting positions $s$ be denoted by,

(37)

# T_{s, h} = \sum_{h = 1}^{{⌊\max (t_{T_{i}})⌋}_{hour}} # T_{h}

(37)

where

${⌊\max (t_{T_{i}})⌋}_{hour}$ denotes the closest floored hour to the maximum observed time in $T_{i}$ .
$h$ refers to the prediction length, where $h \in {1, 2, \dots, {⌊ m a x (t_{T_{i}}) ⌋}_{hour}$ , and

$s$ denotes the stride starting position measured in hours,

s \in \{0, 1, \dots, {⌊\max (t_{T_{i}})⌋}_{hour} - h\}

In , an example is shown of the trajectory subsampling and the number of subsets we can create given an observed trajectory between 6 and 7 hours. Using EquationEquation (37)(37) $# T_{s, h} = \sum_{h = 1}^{{⌊\max (t_{T_{i}})⌋}_{hour}} # T_{h}$ (37) , the total number of sub trajectories that can be created from this trajectory is $21$ .

Figure 7. Subsample visualization.

This method of sub-trajectory sampling allows us to have multiple trajectories to compare the prediction and time performance between the LRMAC and the SPNS model. The prediction performance is determined by the Haversine distance between the expected spatial location at time $t$ compared to the predicted location at time $\hat{t}$ . The time performance is computed by the time the methods take from execution until the predicted trajectory of length $h$ has been calculated.

First, we will run the LRMAC method on all sub-trajectories with different sets of parameters $η$ and $ω$ to determine the optimal set of parameters that results in the smallest median Haversine distance on average. The LRMAC with its optimal parameters and SM densities will be compared to the LRM, and SPNS with the parameters defined by Hexeberg, Flåten, and Brekke (Citation2017). The SM densities were not optimized for these tests, the cell sizes remained constant, and the number of cells fixed at $1250 \times 1250$ . We will also be testing the speed performance between the two algorithms, measuring the execution times.

5.3 Parameter estimation of LRMAC

To obtain the optimal set of hyperparameters for the LRMAC, the LRMAC was tested extensively on different combinations of $η$ and $ω$ . The combination of $η$ and $ω$ that gave the lowest overall Haversine distance error will be chosen as the ideal set of hyperparameters. The considered parameter values were: $η = [1, 2, 3, 4]$ and $ω = [1, 3, 5, 7, 9]$ , resulting in 20 combinations.

The parameters that yielded the best results were: $ω = 3$ and $η = 1$ . The LRMAC was tested on all sub trajectories. The combination of $ω$ and $η$ with the smallest average median Haversine distance over all sub-trajectories and timeframes was chosen as the ideal set of hyperparameters. compares the average median Haversine distances, and we can see that there is no significant performance increase after the combination $ω = 3$ and $η = 1$ . The chosen hyperparameters have a lower complexity as the $ω$ and $η$ are small, compared to the other parameters with similar performance. Smaller values of $ω$ means that our function is more flexible to incorporate new information, as there are fewer elements to consider. Large values of $ω$ will mean that our algorithm is less flexible, using more observed observations from the current trajectory to make a prediction. Larger values of $η$ means more historic SM cells to include when making a prediction, as the neighborhood size is larger.

Figure 8. Average median haversine error for each hyperparameter combination.

5.4 SPNS prediction adjustment

Since the SPNS predicts in constant distance intervals $Δ l$ instead of constant time intervals as the LRMAC does, we had to adjust the final prediction of the SPNS to allow for an exact comparison at $h$ (prediction horizon). To get the predicted spatial location of the SPNS after $h$ has passed, we let the SPNS predict until ${\hat{t}}^{k + 1} > h$ . We calculate the time that has passed between the last two predicted observations ${\hat{X}}_{i}^{(k) -}$ and ${\hat{X}}_{i}^{(k + 1) -}$ , where time is indicated by ${\hat{t}}^{k}$ and ${\hat{t}}^{k + 1}$ , and where, ${\hat{t}}^{k} < h < {\hat{t}}^{k + 1} .$ , let he total time passed be denoted by $Δ t_{| \hat{y} |}$ . The Haversine distance between ${\hat{X}}_{i}^{(k) -}$ and ${\hat{X}}_{i}^{(k + 1) -}$ is calculated $Δ l_{|{\hat{X}}_{i}^{(k) -} - {\hat{X}}_{i}^{(k + 1) -}|}$ . The SPNS will be rerun from ${\hat{X}}_{i}^{(k) -}$ with the same set of parameters and only changing the prediction step length $Δ l = Δ l_{|{\hat{X}}_{i}^{(k) -} - {\hat{X}}_{i}^{(k + 1) -}|}$ . This will give us the predicted spatial location at $h$ , as ${\hat{t}}^{k + 1} = h$ determined by EquationEquation (35)(35) ${\hat{t}}^{k + 1} = {\hat{t}}^{k} + \frac{Δ l}{{\hat{v}}^{k +}}$ (35) .

5.5 Prediction performance comparison algorithm

The algorithm below shows how the errors were calculated for both the LRMAC and SPNS on each sub trajectory. We used the median error results for each predicted timeframe $h$ to compare the performance between the two algorithms.

Table

Display Table

5.6 System specifications

All the tests will be carried out on a system with the specifications listed in .

Table 5. Testing system specifications.

Download CSV Display Table

6. Results

In this section, we will compare the LRMAC with the SPNS.

The LRMAC is an improved version of the LRM, where the LRMAC allows for the prediction of non-linear trajectories. In below, the median Haversine error is shown for both the LRM (red) and LRMAC (blue), together with the standard deviation (SD) from the median. The LRMAC has a reduced SD and median error compared to the LRM. In terms of short-term prediction accuracy, we see that the LRM and LRMAC are not significantly different, but for longer prediction windows (>120 min), the proposed method shows an improvement. To further support this observation, when looking at , we can see that the LRMAC reproduced the trajectory, where the LRM went off course. The LRMAC assumes a constant speed calculated which was estimated from the first $ω$ observed observations. The LRMAC predicted where the vessel would be after 8 hours, should it have traveled at a constant speed. The prediction error comparison does not accurately represent the performance of the LRMAC. When looking at the predicted trajectory, we can see that the predicted LRMAC trajectory is more similar to the actual trajectory when compared to the LRM.

Figure 9. LRM vs LRMAC.

Figure 10. LRM vs LRMAC on SDM (MMSI 419689000).

The error is measured as the Haversine distance between the actual observation at time $t$ compared to the predicted observation at time $t$ for each method. The LRMAC has a lower SD for longer range predictions than the LRM. This is expected as our test set contains vessels that are in the highways depicted in . Shorter prediction periods $h$ will mean the subsets will be near-linear as cargo and tanker vessels have a slow rate of turn. We do not expect a vessel trajectory to stay linear for more extended prediction periods, as there are obstacles like landmasses and other vessels. The LRM will keep predicting linearly, whereas the LRMAC will adapt to predict non-linear trajectories using historical COG information. We can see from $h = 240$ onwards that the median difference between the LRM and LRMAC increases. We expect that the LRMAC will be able to predict with any SM that has path directionality.

In , the LRM (lime) and LRMAC (aqua) predictions are shown of MMSI 419689000, together with the actual trajectory (magenta) that the vessel had. Looking at the error lines of the LRMAC (yellow) and the LRM (red), we see that their distances to the expected location differ by $3.47 km$ . The error distances are not significantly different; however, the LRMAC was still able to predict the correct path, given the constant speed assumption. The LRMAC only used $ω = 3$ historic observations to predict the 8-hour trajectory. The background of depicts the vessel counts SM $K$ , shown in . Given a use case where the historic data have paths that are longer and highly non-linear (not just a piecewise linear historic path), we believe that the LRMAC would do significantly better than the LRM as it will follow the trend of the historic data.

In , the median prediction error of the LRMAC (blue) is compared to the SPNS (green). Similar to the LRM and LRMAC comparison, the error was calculated over all the sub trajectories with a prediction length of $h$ . The median error for both methods was extracted at each $h$ together with the standard deviation (SD). Note that the SD is plotted around the median, instead of the mean absolute deviation, and this was done to reflect the algorithmic stability.

Figure 11. SPNS vs LRMAC prediction results.

, which compares the LRMAC and the SPNS, is a better comparison as both the LRMAC and SPNS follow the historic route. Looking at , we see no significant prediction performance difference between the LRMAC and the SPNS for short prediction periods. However, when considering longer prediction periods (>120 min), the median error and SD of the LRMAC are lower than that of the SPNS. The SPNS’s SD starts to increase dramatically, compared to the LRMAC’s SD, which increases at a lower rate. When looking at the prediction interval $[5, 15]$ min, the recommended time prediction horizon for the SPNS, the SPNS has a smaller SD than the LRMAC.

One assumption that the LRMAC makes is that a vessel keeps a constant speed, and the prediction is based on this constant speed assumption. The SOG of a vessel is derived from the last $ω$ observed observations. The SPNS calculates the SOG based on the historic observations, but the speed is not used in the coordinate prediction step but instead in the time predictions.

In below, we see the time complexity of both methods for each prediction length $h$ . The time was measured over all the sub-trajectories, and the median time is shown. The SPNS computation time is significantly longer to predict than the LRMAC. The speed of the SPNS is limited by the time it takes to query the CNs from a database. Larger database table sizes will lead to more significant query times as there are more observations to search through. The LRMAC uses SMs whose sizes stay fixed, given more observations for the same longitude and latitude range. The SM sizes will only increase if the longitude and latitude range of the observed data increase.

Table 6. LRMAC vs SPNS median prediction times.

Display Table

Looking at and , we see that even though the two methods are not significantly different in their prediction accuracies for short prediction periods, the LRMAC had a shorter execution time. In , we see that the SPNS has a smaller SD than the LRMAC for the $[5, 15]$ min timeframe.

We believe that in higher density areas closer to harbors, the SPNS will have more accurate prediction results as the set of CNs will only contain vessels moving in the same direction, where the LRMAC would default to the LRM as the a-priori COG SD $Σ$ , will be high. In areas closer to harbors, vessels tend to move slower, and prediction timeframes are usually smaller with increased AIS coverage. We think that the SPNS can be used together with the LRMAC. If the LRMAC encounters a cell with a high SD, the SPNS can be deployed until a cell with a lower SD is encountered. A hybrid approach between the SPNS and LRMAC may improve prediction times and improve prediction accuracy in areas with a significant amount of traffic in different directions.

It should be noted that the LRMAC is limited by the spatial resolution of the SMs it employs. In the case of sparse historic data, the SMs generated from this data would be inaccurate, resulting in a decrease in performance of the LRMAC. Also, when the prediction time interval between two consecutive predictions is too large, the LRMAC will skip past important a-priori information and have inaccurate prediction results. Furthermore, the LRMAC was only tested on cargo and tanker vessels in this study, as the movement of other vessel types, such as fishing vessels are of an erratic nature and the SMs that will reflect this behavior, with high COG standard deviation.

The LRMAC can be used to predict trajectories of vessels or to impute historic trajectories. The LRMAC would be more accurate as observations are recorded, and the observations in the window size $ω$ is updated. Currently, the predictions of the LRMAC and SPNS were done on the assumption that only the first $ω$ observations’ information will be used, simulating AIS transponders that are switched off for extended periods of time.

7. Conclusions

In this paper, we presented an extension of the LRM method proposed by Burger, Kleynhans, and Lups Grobler (Citation2020), which uses historic AIS vessel information (particularly the historic COG information), which is robust and easy to implement. The LRMAC implemented in this paper had a smaller incurred prediction error and associated standard deviation than the LRM implemented by Burger, Kleynhans, and Lups Grobler (Citation2020). The LRMAC can predict a trajectory similar to the actual trajectory where the LRM goes off course. The LRMAC makes use of SMs to improve its prediction capability. The LRMAC was compared to Single-Point Neighbor Search (SPNS), which has a similar level of computational complexity and, for the use case of predicting tanker and cargo vessel trajectories up to 8 hours into the future, showed improved results both in terms of the prediction accuracy and execution time. The LRMAC can be used to predict trajectories of vessels or impute vessel trajectories. The LRMAC will have reduced prediction accuracy in high-density areas where historically vessels traveled in different directions. We think a hybrid approach between the LRMAC and the SPNS may lead to improved prediction accuracy and execution time. Future work includes exploring the possibility of a hybrid approach between the LRMAC and SPNS.

Acknowledgement

We would like to thank MUNUS International for all the financial support given *(munus.ai).

Disclosure statement

No potential conflict of interest was reported by the author(s).

Data availability statement

The data that support the findings of this study are available at https://doi.org/10.5281/zenodo.1167595 (Ray et al. Citation2019).

Additional information

Notes on contributors

Christiaan Neil Burger

Christiaan Neil Burger an MSc Computer Science student at Stellenbosch University, South Africa, in 2021. The contents of this paper form part of his Masters’ thesis. He is in the field of Data Science with a focus on data mining, prediction, and machine learning.

Waldo Kleynhans

Waldo Kleynhans received a Ph.D. (Electronic Engineering) from the University of Pretoria, South Africa as well as an MBA from Heriot-Watt University, Scotland in 2012 and 2011 respectively. He is a statistical signal processing and machine learning expert focusing on various application areas including maritime domain awareness and telecommunications. He holds adjunct faculty positions with the University of Pretoria, South Africa as well as San Diego State University, USA.

Trienko Lups Grobler

Trienko Lups Grobler received his PhD in Engineering at the University of Pretoria in 2013. He was a postdoctoral fellow at Rhodes University from 2013 to 2016. He joined the Computer Science Division of Stellenbosch University in 2017. His research interests include remote sensing and interferometry.

Notes

1. For more on the International Maritime Organization (IMO), see: www.imo.org . For additional information on AIS transponders see: https://www.imo.org/en/OurWork/Safety/Pages/AIS.aspx.

2. For more on PostgreSQL see: www.postgresql.org

3. For more on PostGIS see: www.postgis.net

References

Alizadeh, D., A. Asghar Alesheikh, and M. Sharif. 2021. “Vessel Trajectory Prediction Using Historical Automatic Identification System Data.” Journal of Navigation 74 (1): 156–174. doi:10.1017/S0373463320000442.
Web of Science ®Google Scholar
Burger, C.N., W. Kleynhans, and T. Lups Grobler. 2020. “Discrete Kalman Filter and Linear Regression Comparison for Vessel Coordinate Prediction.” 2020 21st IEEE International Conference on Mobile Data Management (MDM). Versailles, France: IEEE. 269–274. doi: 10.1109/MDM48529.2020.00062.
Google Scholar
Chen, X., Q. Lei, Y. Yang, Q. Luo, O. Postolache, J. Tang, and W. Huafeng. 2020. “Video-Based Detection Infrastructure Enhancement for Automated Ship Recognition and Behavior Analysis.” Journal of Advanced Transportation 2020: 1–12. doi:10.1155/2020/7194342.
Web of Science ®Google Scholar
Chen, X., S. Wang, C. Shi, W. Huafeng, J. Zhao, and F. Junjie. 2019. “Robust Ship Tracking via Multi-view Learning and Sparse Representation.” Journal of Navigation 72 (1): 176–192. doi:10.1017/S0373463318000504.
Web of Science ®Google Scholar
Dimitrios, Z., E.K. Xidias, and D. Lekkas. 2016. “Real-time Vessel Behavior Prediction.” Evolving Systems 7 (1): 29–40. doi:10.1007/s12530-015-9133-5.
Google Scholar
Ester, M., H.-P. Kriegel, J. Sander, and X. Xiaowei. 1996. “A Density-based Algorithm for Discovering Clusters in Large Spatial Databases with Noise.” In Proceedings of the 2nd International Conference on Knowledge Discovery and Data Mining, 226–231.
Google Scholar
Forti, N., L. M. Millefiori, P. Braca, and P. Willett. 2020. “Prediction of Vessel Trajectories from AIS Data via Sequence-to-Sequence Recurrent Neural Networks.” In ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 8936–8940. doi: 10.1109/ICASSP40776.2020.9054421.
Google Scholar
Grobler, T.L., and W. Kleynhans. 2019. “Extracting High-Volume Traffic Routes from AIS Spatial Distribution Maps.” In IGARSS 2019-2019 IEEE International Geoscience and Remote Sensing Symposium, 10031–10034. doi: 10.1109/IGARSS.2019.8900105.
Google Scholar
Grünwald, P.D., I. Jae Myung, and M.A. Pitt. 2005. Advances in Minimum Description Length: Theory and Applications. Cambridge, MA: MIT Press.
Google Scholar
Hexeberg, S., A. L. Flåten, and E. F. Brekke. 2017. “AIS-based Vessel Trajectory Prediction.” In 20th International Conference on Information Fusion (Fusion), 1–8. doi: 10.23919/ICIF.2017.8009762.
Google Scholar
Jaskolski, K. 2017. “Automatic Identification System (AIS) Dynamic Data Estimation Based on Discrete Kalman Filter (KF) Algorithm.” Maritime Technical Journal 211 (4): 71–87. doi: 10.5604/01.3001.0010.6747.
Google Scholar
Jiashun, C. 2012. “A New Trajectory Clustering Algorithm Based on TRACLUS.” In Proceedings of 2012 2nd International Conference on Computer Science and Network Technology, 783–787. doi: 10.1109/ICCSNT.2012.6526048.
Google Scholar
Kalman, R.E. 1960. “A New Approach to Linear Filtering and Prediction Problems.” Journal of Basic Engineering 82 (1): 35–45. doi:10.1115/1.3662552.
Google Scholar
Lee, J.-G., J. Han, and K.-Y. Whang. 2007. “Trajectory Clustering: A Partition-and-GroupFramework.” Proceedings of the 2007 ACM SIGMOD International Conference on Management of Data. Beijing, China: Association for Computing Machinery. 593–604. doi: 10.1145/1247480.1247546.
Google Scholar
Murray, B., and L. Prasad Perera. 2020. “A Dual Linear Autoencoder Approach for Vessel Trajectory Prediction Using Historical AIS Data.” Ocean Engineering 209: 107478. doi:10.1016/j.oceaneng.2020.107478.
Web of Science ®Google Scholar
Pallotta, G., S. Horn, P. Braca, and K. Bryan. 2014. “Context-enhanced Vessel Prediction Based on Ornstein-Uhlenbeck Processes Using Historical AIS Traffic Patterns: Real-World Experimental Results.” In 17th International Conference on Information Fusion (FUSION),1–7. IEEE.
Google Scholar
Pallotta, G., M. Vespe, and K. Bryan. 2013a. “Traffic Route Extraction and Anomaly Detection from AIS Data.” International COST MOVE Workshop on Moving Objects at Sea. Brest, France.
Google Scholar
Pallotta, G., M. Vespe, and K. Bryan. 2013b. “Vessel Pattern Knowledge Discovery from AIS Data: A Framework for Anomaly Detection and Route Prediction.” Entropy 15 (6): 2218–2248. doi:10.3390/e15062218.
Web of Science ®Google Scholar
Pedregosa, F., G. Varoquaux, A. Gramfort, V. Michel, B. Thirion, G. Olivier, M. Blondel, et al. 2011. “Scikit-learn: Machine Learning in Python.” Journal of Machine Learning Research 12:2825–2830.
Web of Science ®Google Scholar
Ray, C., R. Dréo, E. Camossi, A.-L. Jousselme, and C. Iphar. 2019. “Heterogeneous Integrated Dataset for Maritime Intelligence, Surveillance, and Reconnaissance.” Data in Brief 25: 104141. doi:10.1016/j.dib.2019.104141. Accessed 2020.
PubMed Web of Science ®Google Scholar
Rong, H., A.P. Teixeira, and C. Guedes Soares. 2019. “Ship Trajectory Uncertainty Prediction Based on a Gaussian Process Model.” Ocean Engineering 182: 499–511. doi:10.1016/j.oceaneng.2019.04.024.
Web of Science ®Google Scholar
Tang, H., Y. Yin, and H. Shen. 2019. “A Model for Vessel Trajectory Prediction Based on Long Short term Memory Neural Network.” Journal of Marine Engineering & Technology 21 (3): 136–145. doi:10.1080/20464177.2019.1665258.
Google Scholar
Wijaya, W.M., and Y. Nakamura. 2013. “Predicting Ship Behavior Navigating through Heavily Trafficked Fairways by Analyzing AIS Data on Apache HBase.” First International Symposium on Computing and Networking. Matsuyama: IEEE. 220–226. doi: 10.1109/CANDAR.2013.39.
Google Scholar
Xiao, Z., F. Xiuju, L. Zhang, and R. Siow Mong Goh. 2020. “Traffic Pattern Mining and Forecasting Technologies in Maritime Traffic Service Networks: A Comprehensive Survey.” IEEE Transactions on Intelligent Transportation Systems 21 (5): 1796–1825. doi:10.1109/TITS.2019.2908191.
Web of Science ®Google Scholar
Xin, X. 2020. “Context-Based Trajectory Prediction with LSTM Networks.” CIIS 2020: 2020 The 3rd International Conference on Computational Intelligence and Intelligent Systems. New York, NY, USA: Association for Computing Machinery. 100–104. doi: 10.1145/3440840.3440842
Google Scholar
Yaun, Z., J. Liu, Y. Liu, and L. Zongzhi 2019. “A Novel Approach for Vessel Trajectory Reconstruction Using AIS Data.” In 29th International Ocean and Polar Engineering Conference, 4554–4559.
Google Scholar

Extended linear regression model for vessel trajectory prediction with a-priori AIS information

ABSTRACT

1. Introduction

2. Data

Table 1. Spatial range details for Google Earth extract.

Table 2. Dataset attributes.

Table 3. Variables and Symbol Descriptions.

2.1. Pre-processing steps

2.2. Vessel Interpolation for Spatial Maps (SMs)

3. Spatial Maps (SMs)

3.1. Vessel counts SDM (K)

3.2. The COG Ψ and COGSD Σ SM

4. The proposed algorithm

4.1. Speed Over Ground (SOG)

4.2. The Linear Regression Model (LRM)

4.3. LRM with a-priori COG information (LRMAC)

4.3.1. Updating the COG using a-priori information

4.3.2. A flow diagram representation of the LRMAC

4.4. Single-Point Neighbor Search (SPNS)

4.4.1 SPNS prediction

4.4.1.1 Course prediction

4.4.1.2 Speed prediction

Table 4. Curved trajectory decision parameters.

5. Experimental design

5.1 SPNS vessel query setup

5.2 Trajectory subsampling method

5.3 Parameter estimation of LRMAC

5.4 SPNS prediction adjustment

5.5 Prediction performance comparison algorithm

5.6 System specifications

Table 5. Testing system specifications.

6. Results

Table 6. LRMAC vs SPNS median prediction times.

7. Conclusions

Acknowledgement

Disclosure statement

Data availability statement

Additional information

Notes on contributors

Christiaan Neil Burger

Waldo Kleynhans

Trienko Lups Grobler

Notes

References

Related research

To cite this article:

Download citation

Your download is now in progress and you may close this window

Login or register to access this feature

Information for

Open access

Opportunities

Help and information

Keep up to date

3.1. Vessel counts SDM ( $K$ )

3.2. The COG $(Ψ)$ and COGSD $(Σ)$ SM