Header menu link for other important links
X
Traffic Incident Duration Prediction using BERT Representation of Text
Published in Institute of Electrical and Electronics Engineers Inc.
2021
Volume: 2021-September
   
Abstract
Owing to the diverse nature of traffic incidents, accepting and storing relevant data in the form of natural language is more convenient than in constrained value fields. Textual information in such cases can be rich enough for traffic incident analysis and modelling even in the absence of certain fixed set of parameters. However limited studies considered the complexity in processing such information to predict traffic incident duration. In this paper, we propose to represent the textual data from incident reports using BERT word embeddings. These text representations are then inputted into various regressors such as LSTM, XGBoost, RF and SVR to predict traffic incident duration. To demonstrate the significance of this approach, the method is compared with the state-of-the-art approach using LDA representation. Dataset used for the experiment is the Caltrans Performance Measurement System (PeMS). Result analysis indicates that the BERT- LSTM hybrid model is effective in capturing the contextual meaning of textual incident reports to predict the traffic incident duration and outperforms LDA topic modelling with MAE around 11.16 minutes. © 2021 IEEE.
About the journal
JournalData powered by TypesetIEEE Vehicular Technology Conference
PublisherData powered by TypesetInstitute of Electrical and Electronics Engineers Inc.
ISSN15502252