This paper proposes a data fusion approach to automatically detect activity patterns in a GPS dataset based on travel diaries and correct misclassification errors. The Activity Patterns Detection consists of a Supervised Learning framework, thanks to which the activity purposes in the travel diaries are learned and then predicted in the GPS dataset. Furthermore, we deploy Unsupervised Learning to identify similar spatial and temporal activities in the GPS dataset and, based on travel diaries, to correct the misclassification errors. This work shows that, based on a few observations in the travel diaries and a set of features such as the resting time before the activity takes place, the number of occurrences of the same trip and the percentage of the trip made during daytime and the speed, it is possible to detect activities with an overall accuracy of 90%. Since the GPS dataset does not have information on the activity performed by the user, in reality, the aggregated results are validated based on the Kolmogorov-Smirnov test. The experiment shows that, with a confidence level of 99%, the majority of spatial and temporal feature distributions of activities in the travel diaries dataset are similar to those in the GPS dataset. Thanks to this approach, planners and transport operators can automatically obtain spatial and temporal patterns of frequent activities in urban areas.

Hadjidimitriou, N., Cantelmo, G., Antoniou, C., Machine learning for activity pattern detection, <<JOURNAL OF INTELLIGENT TRANSPORTATION SYSTEMS>>, 2023; (27): 834-848. [doi:10.1080/15472450.2022.2084336] [https://hdl.handle.net/10807/339628]

Machine learning for activity pattern detection

Hadjidimitriou, Natalia
Primo
;
2023

Abstract

This paper proposes a data fusion approach to automatically detect activity patterns in a GPS dataset based on travel diaries and correct misclassification errors. The Activity Patterns Detection consists of a Supervised Learning framework, thanks to which the activity purposes in the travel diaries are learned and then predicted in the GPS dataset. Furthermore, we deploy Unsupervised Learning to identify similar spatial and temporal activities in the GPS dataset and, based on travel diaries, to correct the misclassification errors. This work shows that, based on a few observations in the travel diaries and a set of features such as the resting time before the activity takes place, the number of occurrences of the same trip and the percentage of the trip made during daytime and the speed, it is possible to detect activities with an overall accuracy of 90%. Since the GPS dataset does not have information on the activity performed by the user, in reality, the aggregated results are validated based on the Kolmogorov-Smirnov test. The experiment shows that, with a confidence level of 99%, the majority of spatial and temporal feature distributions of activities in the travel diaries dataset are similar to those in the GPS dataset. Thanks to this approach, planners and transport operators can automatically obtain spatial and temporal patterns of frequent activities in urban areas.
2023
Inglese
Hadjidimitriou, N., Cantelmo, G., Antoniou, C., Machine learning for activity pattern detection, <<JOURNAL OF INTELLIGENT TRANSPORTATION SYSTEMS>>, 2023; (27): 834-848. [doi:10.1080/15472450.2022.2084336] [https://hdl.handle.net/10807/339628]
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10807/339628
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact