LGMar 26, 2025

A Methodology to extract Geo-Referenced Standard Routes from AIS Data

arXiv:2503.22734v11 citationsh-index: 26
Originality Synthesis-oriented
AI Analysis

This enables maritime surveillance authorities to analyze vessel navigational behavior for insights into patterns, potentially aiding in detecting deviations due to weather or illicit activities, though it is incremental as it builds on existing AIS data analysis techniques.

The study tackled the problem of extracting standard maritime routes from raw AIS data by proposing an unsupervised methodology using finite state machines and density-based clustering, achieving effectiveness with less than 5% outliers on a 1.15 TB dataset covering Arctic and Europe-Middle East-North Africa areas.

Maritime AIS (Automatic Identification Systems) data serve as a valuable resource for studying vessel behavior. This study proposes a methodology to analyze route between maritime points of interest and extract geo-referenced standard routes, as maritime patterns of life, from raw AIS data. The underlying assumption is that ships adhere to consistent patterns when travelling in certain maritime areas due to geographical, environmental, or economic factors. Deviations from these patterns may be attributed to weather conditions, seasonality, or illicit activities. This enables maritime surveillance authorities to analyze the navigational behavior between ports, providing insights on vessel route patterns, possibly categorized by vessel characteristics (type, flag, or size). Our methodological process begins by segmenting AIS data into distinct routes using a finite state machine (FSM), which describes routes as seg-ments connecting pairs of points of interest. The extracted segments are ag-gregated based on their departure and destination ports and then modelled using iterative density-based clustering to connect these ports. The cluster-ing parameters are assigned manually to sample and then extended to the en-tire dataset using linear regression. Overall, the approach proposed in this paper is unsupervised and does not require any ground truth to be trained. The approach has been tested on data on the on a six-year AIS dataset cover-ing the Arctic region and the Europe, Middle East, North Africa areas. The total size of our dataset is 1.15 Tbytes. The approach has proved effective in extracting standard routes, with less than 5% outliers, mostly due to routes with either their departure or their destination port not included in the test areas.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes