File Download
Supplementary
-
Citations:
- Appears in Collections:
postgraduate thesis: Effective and efficient traffic data mining on sparse road networks
Title | Effective and efficient traffic data mining on sparse road networks |
---|---|
Authors | |
Advisors | Advisor(s):Cheng, CKR |
Issue Date | 2022 |
Publisher | The University of Hong Kong (Pokfulam, Hong Kong) |
Citation | Han, X. [韓笑琳]. (2022). Effective and efficient traffic data mining on sparse road networks. (Thesis). University of Hong Kong, Pokfulam, Hong Kong SAR. |
Abstract | A gigantic amount of traffic data, such as GPS and sensor information collected from highways, are often used to drive many road network applications, e.g., navigation, incident detection, and Point-of-Interest (POI) recommendation. Hence, topics about these ``big transportation data'' have attracted a lot of interest from both research and industry communities. A common problem about these applications is data sparsity. Specifically, each road does not have an equal amount of data. Roads that are rarely used may be associated with very few data. This can affect the effectiveness of road network applications that use them. In this thesis, we investigate how data sparsity can be taken into account, for improving the quality of the following three problems: incident detection, outlier detection, and weight completion.
First, we focus on a clustering-based approach for Incident Detection (ID) on sparse traffic data. It enables the finding of traffic incidents quickly from traffic data, thereby allowing emergency actions (e.g., rescuing injured people) to be carried out in a timely fashion. Existing ID solutions often rely on dense traffic data. We ask the question: Can ID be performed on sparse traffic data? As these data may not be enough to describe the state of the roads involved, they can undermine the effectiveness of existing ID solutions. To tackle this challenge, we borrow an important insight from the transportation literature, and use clustering to derive incident patterns. We devise a solution to detect anomalies in sparse traffic data by incident patterns. Experiments show that it is more effective on sparse traffic data, and efficient on large trajectory datasets.
Next, we study how to apply neural networks for the time-dependent trajectory outlier detection problem based on sparse traffic condition. It aims to extract abnormal movements of vehicles on the roads. This important problem, which facilities understanding of traffic behavior and detection of taxi fraud, is challenging due to the varying traffic conditions at different times and locations. To tackle this problem, we propose the deep-probabilistic-based outlier detection algorithm. This method, which employs deep-learning methods to obtain time-dependent outliners from a huge volume of trajectories, can handle complex traffic conditions and detect outliners accurately. Experiments show that our method is more accurate, and can handle millions of trajectories.
Finally, we utilize neural networks for the stochastic weight completion on sparse road networks. Road network applications make extensive use of network edge weights (e.g., traveling times). Some of these weights can be missing, especially in a road network where traffic data may not be available for every road. We study the stochastic weight completion problem, which computes the weight distributions of missing road edges. This is difficult, due to the intricate temporal and spatial correlations among neighboring edges. To tackle these challenges, we propose to incorporate the contextual properties about the road network (e.g., speed limits, number of lanes, road types) to provide finer granularity of road correlations. Experiments show that it is more effective and efficient than state-of-the-art solutions. |
Degree | Doctor of Philosophy |
Subject | Traffic flow - Mathematical models Data mining |
Dept/Program | Computer Science |
Persistent Identifier | http://hdl.handle.net/10722/311682 |
DC Field | Value | Language |
---|---|---|
dc.contributor.advisor | Cheng, CKR | - |
dc.contributor.author | Han, Xiaolin | - |
dc.contributor.author | 韓笑琳 | - |
dc.date.accessioned | 2022-03-30T05:42:23Z | - |
dc.date.available | 2022-03-30T05:42:23Z | - |
dc.date.issued | 2022 | - |
dc.identifier.citation | Han, X. [韓笑琳]. (2022). Effective and efficient traffic data mining on sparse road networks. (Thesis). University of Hong Kong, Pokfulam, Hong Kong SAR. | - |
dc.identifier.uri | http://hdl.handle.net/10722/311682 | - |
dc.description.abstract | A gigantic amount of traffic data, such as GPS and sensor information collected from highways, are often used to drive many road network applications, e.g., navigation, incident detection, and Point-of-Interest (POI) recommendation. Hence, topics about these ``big transportation data'' have attracted a lot of interest from both research and industry communities. A common problem about these applications is data sparsity. Specifically, each road does not have an equal amount of data. Roads that are rarely used may be associated with very few data. This can affect the effectiveness of road network applications that use them. In this thesis, we investigate how data sparsity can be taken into account, for improving the quality of the following three problems: incident detection, outlier detection, and weight completion. First, we focus on a clustering-based approach for Incident Detection (ID) on sparse traffic data. It enables the finding of traffic incidents quickly from traffic data, thereby allowing emergency actions (e.g., rescuing injured people) to be carried out in a timely fashion. Existing ID solutions often rely on dense traffic data. We ask the question: Can ID be performed on sparse traffic data? As these data may not be enough to describe the state of the roads involved, they can undermine the effectiveness of existing ID solutions. To tackle this challenge, we borrow an important insight from the transportation literature, and use clustering to derive incident patterns. We devise a solution to detect anomalies in sparse traffic data by incident patterns. Experiments show that it is more effective on sparse traffic data, and efficient on large trajectory datasets. Next, we study how to apply neural networks for the time-dependent trajectory outlier detection problem based on sparse traffic condition. It aims to extract abnormal movements of vehicles on the roads. This important problem, which facilities understanding of traffic behavior and detection of taxi fraud, is challenging due to the varying traffic conditions at different times and locations. To tackle this problem, we propose the deep-probabilistic-based outlier detection algorithm. This method, which employs deep-learning methods to obtain time-dependent outliners from a huge volume of trajectories, can handle complex traffic conditions and detect outliners accurately. Experiments show that our method is more accurate, and can handle millions of trajectories. Finally, we utilize neural networks for the stochastic weight completion on sparse road networks. Road network applications make extensive use of network edge weights (e.g., traveling times). Some of these weights can be missing, especially in a road network where traffic data may not be available for every road. We study the stochastic weight completion problem, which computes the weight distributions of missing road edges. This is difficult, due to the intricate temporal and spatial correlations among neighboring edges. To tackle these challenges, we propose to incorporate the contextual properties about the road network (e.g., speed limits, number of lanes, road types) to provide finer granularity of road correlations. Experiments show that it is more effective and efficient than state-of-the-art solutions. | - |
dc.language | eng | - |
dc.publisher | The University of Hong Kong (Pokfulam, Hong Kong) | - |
dc.relation.ispartof | HKU Theses Online (HKUTO) | - |
dc.rights | The author retains all proprietary rights, (such as patent rights) and the right to use in future works. | - |
dc.rights | This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License. | - |
dc.subject.lcsh | Traffic flow - Mathematical models | - |
dc.subject.lcsh | Data mining | - |
dc.title | Effective and efficient traffic data mining on sparse road networks | - |
dc.type | PG_Thesis | - |
dc.description.thesisname | Doctor of Philosophy | - |
dc.description.thesislevel | Doctoral | - |
dc.description.thesisdiscipline | Computer Science | - |
dc.description.nature | published_or_final_version | - |
dc.date.hkucongregation | 2022 | - |
dc.identifier.mmsid | 991044494007203414 | - |