Spatial temporal machine learning

Paper written by Senzhang Wang, Jiannong Cao, Fellow, IEEE, Philip S. Yu, Fellow, IEEE

It has broad applications in various domains including

ST data are in continuous space
Has both spatial and temporal properties
Not independently generated
Traditional methods rely heavily on feature engineering. Hard to process natural ST data in their raw form.
Compared with traditional methods, the advantages of deep leaning models for STDM are as follows:
- Automatic feature representation learning
  - Do not require hand-crafted features
  - The spatial proximity and the long-term temporal correlations are complex and hard to be captured.
  - With CNN and RNN, these can be automatically learned from the raw data directly.
- Powerful function approximation ability

Data types:

Data representations: (different deep learning models require different data representations as input)

CNN: convolutional neural networks
GraphCNN: generalize CNN to graph structured data.
RNN and LSTM:

RNN is designed to recognize the sequential characteristics and use patterns to predict the next likely scenario. They are widely used in the applications of speech recognition and natural language processing.
Seq2Seq:
Autoencoder and stacked AE:

Raw ST data will first be represented as a particular data format to fit the deep learning model.
RNN and LSTM models are good at handling sequence data with short-term or long-term temporal correlation, while CNN models are effective to capture the spatial correlation in the image like matrices.

Usually one type of ST data instance corresponds to one typical data representations.
Trajectory and time series data can naturally be represented as sequence data.
Spatial map can be represented as a 2D matrix.
ST raster can be represented as a 2D matrix or 3D tensor
Trajectory data can also be represented as a matrix to apply CNN. For example, a city is first partitioned into grid cell regions. Then the ST field can be modeled as a matrix with each cell region representing an entry. If a trajectory paths over the cell region, the corresponding entry value is set to 1; otherwise, set to 0.
Spatial map is sometimes represented as a graph