ABSTRACT
When applying long short-term memory (LSTM) neural network model to traffic prediction, there are limitations in exploiting spatial-temporal traffic state features. The interpretability of models has not received enough attention. This study suggests an LSTM traffic flow prediction model that can anticipate traffic volume 24 h in advance. The model makes use of the traffic flow state information obtained from the fuzzy C-means clustering method by clustering the multi-day historical traffic flow data. Markov chain is used to capture the label feature of traffic flow using transition probability matrix information. To show the efficacy of the suggested technique, experiments were conducted using real traffic volume data from a city in China. The simulation results demonstrate that the proposed model can attain greater prediction accuracy, and the network training time may be significantly reduced.
CO EDITOR-IN-CHIEF:
ASSOCIATE EDITOR:
Nomenclature
= | the bias vector (e.g. is the input gate bias vector) | |
= | the number of clusters | |
= | LSTM’s cell state | |
CNN | = | convolutional neural network |
= | the square Euclidean distance | |
DNN | = | deep neural network |
= | the transition frequency at the time interval | |
= | the basic modules contain input gate, forget gate, output gate, input modulation gate, and memory cell state | |
FCM | = | Fuzzy C-means |
GCN | = | graph convolutional network |
= | LSTM’s hidden state | |
= | the hidden vector sequence | |
= | the maximum step size | |
LSTM | = | long short -term memory |
= | the fuzzifier | |
MAPE | = | the mean absolute percentage error |
MSE | = | the mean square error |
= | the total number of traffic volume states | |
= | a one-step transition probability matrix | |
= | a multi-step transition probability matrix | |
= | the transition probability | |
= | the transition probability at the time | |
= | the marginal probability | |
RNN | = | recurrent neural network |
= | hyperbolic tangent function | |
= | the membership degree of data object in cluster | |
= | the initial membership matrix | |
= | the daily of traffic volume | |
= | the cluster center | |
= | the real traffic volume at time period | |
= | the predicted traffic volume at time period | |
= | the states of the other days traffic volume at | |
= | the cell update gate weig ht matrix | |
= | the forget gate weight matrix | |
= | the input gate weight matrix | |
= | the output gate weight matrix | |
= | input variable | |
= | a sliding window | |
= | an output sequence | |
= | a given significance level | |
= | sigmoid function | |
= | chi-square statistics |
Disclosure statement
No potential conflict of interest was reported by the author(s).