THESIS
2018
xv, 96 pages : illustrations ; 30 cm
Abstract
Spatiotemporal systems are common in the real world. Forecasting the multi-step future
of these spatiotemporal systems based on past observations, or, Spatiotemporal Sequence
Forecasting (STSF), is a significant and challenging problem. Due to the complex spatial
and temporal relationships within the data and the potential long forecast horizon, it is
challenging to design appropriate Deep Learning (DL) architectures for STSF. In this thesis,
we explore DL architectures for STSF. We first define the STSF problem and classify it
into three subcategories: Trajectory Forecasting of Moving Point Cloud (TF-MPC), STSF
on Regular Grid (STSF-RG), and STSF on Irregular Grid (STSF-IG). We then propose
architectures for STSF-RG and STSF-IG problems.
For the STSF-RG problems, we proposed t...[
Read more ]
Spatiotemporal systems are common in the real world. Forecasting the multi-step future
of these spatiotemporal systems based on past observations, or, Spatiotemporal Sequence
Forecasting (STSF), is a significant and challenging problem. Due to the complex spatial
and temporal relationships within the data and the potential long forecast horizon, it is
challenging to design appropriate Deep Learning (DL) architectures for STSF. In this thesis,
we explore DL architectures for STSF. We first define the STSF problem and classify it
into three subcategories: Trajectory Forecasting of Moving Point Cloud (TF-MPC), STSF
on Regular Grid (STSF-RG), and STSF on Irregular Grid (STSF-IG). We then propose
architectures for STSF-RG and STSF-IG problems.
For the STSF-RG problems, we proposed the Convolutional Long-Short Term Memory
(ConvLSTM) and the Trajectory Gated Recurrent Unit (TrajGRU). ConvLSTM uses convolution
in both the input-state and state-state transitions of LSTM and is better at capturing the
spatiotemporal correlations than the Fully-connected LSTM (FC-LSTM). TrajGRU improves
upon ConvLSTM by actively learning the recurrent connection structure, which achieves
better prediction performance with less parameters. To better investigate the effectiveness
of our proposed architectures and other DL models for STSF-RG, we chose to tackle the
precipitation nowcasting problem, which is a representative STSF-RG problem with a huge
real-world impact. By incorporating ConvLSTM into an Encoder-Forecaster (EF) structure,
we proposed the first machine learning based solution for precipitation nowcasting that
outperforms the operational algorithm. To facilitate future studies for this problem and gauge
the state-of the-art methods, we proposed the first large-scale benchmark for precipitation
nowcasting: HKO-7. HKO-7 has new evaluation metrics and has both the offline setting and
the online settings in the evaluation protocol. We evaluated seven models in the offline and
online settings. Experiment results show that 1) all deep learning models outperform the
optical
flow based models, 2) TrajGRU attains the best overall performance among deep
learning models, and 3) models consistently perform better in the online setting.
For the STSF-IG problems, we converted the sparsely distributed observations into data
on a spatiotemporal graph and utilized graph convolution operators, or graph aggregators,
to build the model. We proposed a new graph aggregator called Gated Attention Network
(GaAN). GaAN not only uses multiple attention heads to aggregate information from the
neighborhoods but also uses another set of gates to control each attention head's importance.
With experiments on two large-scale inductive node classification datasets, we showed that
GaAN outperforms the baseline graph aggregators. Also, we proposed a unified framework
called Graph GRU (GGRU), which transforms any valid graph aggregators to RNNs that are
designed for STSF-IG. We compared GGRU with other state-of-the-art methods in traffic
speed forecasting and found it achieves the best overall performance.
Post a Comment