Prediction of Hangzhou Subway Station Passenger Flow based on Data Mining

Accurate passenger flow prediction is essential for subway station operators and passengers because it can reduce congestion in subway stations, ensure passenger safety, and shorten passengers’ waiting time. The primary objective of this research is to analyse the smart card data of Hangzhou su...

Bibliographic Details
Main Author: Zhang, Pengfei
Format: Dissertation (University of Nottingham only)
Language: English
Published: 2020
Online Access: https://eprints.nottingham.ac.uk/62857/
_version_ 1848799980031049728
author Zhang, Pengfei
building Nottingham Research Data Repository
collection Online Access
description Accurate passenger flow prediction is essential for subway station operators and passengers because it can reduce congestion in subway stations, ensure passenger safety, and shorten passengers’ waiting time. The primary objective of this research is to analyse the smart card data of Hangzhou subway stations and to develop two models that predict station passenger flow at a 15-minute granularity: a linear regression model and a neural network model. Testing and evaluation of the two models indicate that the neural network model has higher predictive accuracy than the linear regression model. During modelling, several analyses are carried out to improve prediction performance. Firstly, this research explores the regularity of station passenger flow at different time granularities, using the Pearson correlation coefficient as the index of regularity. The result indicates that passenger flow is more regular at time granularities of 15 minutes and above; therefore, this research predicts passenger flow at a 15-minute granularity. Secondly, this study analyses different factors that affect passenger flow in subway stations. Thirdly, this research uses the K-means clustering algorithm to cluster the 80 subway stations into four station types with different passenger flow patterns. The station type is treated as an independent variable and used as an input to both models. In addition, this research discusses the limitations of the two models and proposes some improvements to them.
first_indexed 2025-11-14T20:44:17Z
format Dissertation (University of Nottingham only)
id nottingham-62857
institution University of Nottingham Malaysia Campus
institution_category Local University
language English
last_indexed 2025-11-14T20:44:17Z
publishDate 2020
recordtype eprints
repository_type Digital Repository
spelling nottingham-62857 2023-04-18T13:55:44Z https://eprints.nottingham.ac.uk/62857/ Prediction of Hangzhou Subway Station Passenger Flow based on Data Mining Zhang, Pengfei 2020-12-01 Dissertation (University of Nottingham only) NonPeerReviewed application/pdf en https://eprints.nottingham.ac.uk/62857/1/Prediction%20of%20Hangzhou%20Subway%20Station%20Passenger%20Flow%20based%20on%20Data%20Mining.pdf Zhang, Pengfei (2020) Prediction of Hangzhou Subway Station Passenger Flow based on Data Mining. [Dissertation (University of Nottingham only)]
title Prediction of Hangzhou Subway Station Passenger Flow based on Data Mining
url https://eprints.nottingham.ac.uk/62857/
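
The description mentions using the Pearson correlation coefficient as an index of passenger flow regularity at different time granularities. The following Python sketch illustrates one way such a comparison could be set up; it is not taken from the dissertation. The per-minute entry counts are synthetic, and the function names (aggregate, pearson, synthetic_day), the peak shapes, and the noise level are all assumptions made for illustration only.

```python
import math
import random


def aggregate(counts, bin_minutes):
    """Sum per-minute counts into consecutive bins of bin_minutes minutes."""
    return [sum(counts[i:i + bin_minutes])
            for i in range(0, len(counts), bin_minutes)]


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / math.sqrt(var_x * var_y)


def synthetic_day(rng, minutes=18 * 60):
    """Hypothetical per-minute entries: two commuter peaks plus random noise.

    This is made-up data, not the Hangzhou smart card data.
    """
    day = []
    for t in range(minutes):
        morning = 40 * math.exp(-((t - 120) / 45) ** 2)
        evening = 35 * math.exp(-((t - 720) / 60) ** 2)
        day.append(max(0, round(5 + morning + evening + rng.gauss(0, 6))))
    return day


rng = random.Random(0)  # fixed seed so the sketch is reproducible
day_a, day_b = synthetic_day(rng), synthetic_day(rng)

# Regularity index: correlation between two days at each bin width (minutes).
regularity = {
    g: pearson(aggregate(day_a, g), aggregate(day_b, g))
    for g in (1, 5, 15, 30)
}
for g, r in regularity.items():
    print(f"{g:>2}-minute bins: r = {r:.3f}")
```

In this toy setup, wider bins average out minute-level noise, so the day-to-day correlation tends to rise with the bin width. Whether and where it levels off on real data depends on the station and the dataset.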