Merlin: Multi-View Representation Learning for Robust Multivariate Time Series Forecasting with Unfixed Missing Rates

Code for our SIGKDD'25 paper "Merlin: Multi-View Representation Learning for Robust Multivariate Time Series Forecasting with Unfixed Missing Rates"

Requirements

This code is built on Python 3.11. The required packages can be installed using the following command:

# Install Python
conda create -n Merlin python=3.11
conda activate Merlin
# Install dependencies
pip install -r requirements.txt

Note: Our code has been tested in VS Code. If you run it in PyCharm, there might be path-related issues due to the differences in the path logic between VS Code and PyCharm.

Data processing

The four datasets (METR-LA, PEMS04, China AQI, and Global Wind) adopted in our paper can be found at the following two links:

Our repository provides compressed files for part of the data. Please extract them before running the data preprocessing code.

Baidu Netdisk: https://pan.baidu.com/s/1YPm7Qhj-FV3CBT36n-GG2A?pwd=2ytm (Extraction code: 2ytm)
Google Drive: https://drive.google.com/drive/folders/1m80WUbHmbTaCP3WXGUJWdGMd0WfQ8Rt8?usp=drive_link

We present two data masking ways:

The masked points in the low missing rate dataset are a subset of those in the high missing rate dataset.
Both the low and high missing rate datasets are masked randomly.

This project places the first masking approach in the datasets/ directory. If you wish to use the second masking approach, please move the datasets/PEMS04/preprocess_PEMS04_radom.py file from the data folder to the datasets/ directory before running. The processing way for other datasets is the same.

Please use the following way to process the raw data:

# METR-LA dataset
python datasets/preprocess_METR_subset.py
# PEMS04 dataset
python datasets/preprocess_PEMS04_subset.py
# China AQI dataset
python datasets/preprocess_AQI_subset.py
# Global Wind dataset
python datasets/preprocess_WIND_subset.py

The hyperparameters of our model

STID

input_len: 12 (This paper follows the experimental settings of most existing spatiotemporal forecasting models, fixing both the length of historical observations and future forecasting to 12.)
output_len: 12
num_nodes: number of time series. METR-LA：207, PEMS04: 307, China AQI: 1300, Global wind: 2908
input_size: 3 (It represents the dimensions after concatenating the original time series with the two temporal embeddings, set to 3.)
if_T_i_D: True
if_D_i_W: True
embed_dim: 64 (For the basic introduction to these spatiotemporal embeddings, please refer to the original STID paper: https://github.com/GestaltCogTeam/STID.)
node_dim: 64
temp_dim_tid: 64
temp_dim_diw: 64
time_of_day_size: METR-LA and PEMS04: 288, China AQI: 24, Global wind: 1
day_of_week_size: 7
cl_hidden = 4 (Since contrastive learning requires dimensionality reduction of the representations, we provide two methods. The representation has the dimension [B, H, N, 1]. The first method reduces it to [B, H2, N], while the second method reduces it to [B, H2, 1].)
cl_student = 16

Merlin

Weight_pre: 2 (The weight of the L1 loss.)
weight_KD: 2 (The weight of knowledge distillation.)
weight_CL: 1 (The weight of contrastive loss.)
Temperature: 1
num_miss_rate: 4 (We use four missing rates in this paper)
batch size: 16
epoch: 101
max_norm = 5 (Gradient clipping.)
lr_rate = 0.0002
weight_decay = 0.0001
milestone = [1,10,25,50,75,90,100] (Adjust the learning rate when the epoch reaches the numbers in the list.)
gamme = 0.5 (Learning rate adjustment ratio.)

Train the teacher model and the student model

After completing data preprocessing, please train the teacher model using the following way:

python train_teacher.py

This repository provides pretrained teacher model weight files for four datasets. For details, please refer to directories such as model_results/PEMS04. Due to the large size of the global wind files, we have not uploaded them to the repository yet.

After training the teacher model, please train the student model using the following way:

python train_student_merlin.py

After running the above file, the model's performance metrics under the four missing rates will be printed directly.

Train one stage model

We provid TSmixer as an example. If you want to train a separate forecasting model for a specific missing rate, you can run and debug the following code：

python train_onestage.py

Train two stage model

We provid TimesNet as the imputation and iTransformer as forecasting. If you want to train a two stage model (imputation + forecasting) for a specific missing rate, you can run and debug the following code：

python train_twostage.py

citation

If the code is helpful to you, please cite the following paper:

@misc{yu2025merlinmultiviewrepresentationlearning,
      title={Merlin: Multi-View Representation Learning for Robust Multivariate Time Series Forecasting with Unfixed Missing Rates}, 
      author={Chengqing Yu and Fei Wang and Chuanguang Yang and Zezhi Shao and Tao Sun and Tangwen Qian and Wei Wei and Zhulin An and Yongjun Xu},
      year={2025},
      eprint={2506.12459},
      archivePrefix={arXiv},
      primaryClass={cs.LG},
      url={https://arxiv.org/abs/2506.12459}, 
}

Folder Structure:

The folder structure is organized as follows:

Merlin/
├── datasets/                  # Contains processed data and masking scripts
│   ├── METR-LA/
│   │   ├── METR-LA.h5
│   │   └── METR-LA.pkl
│   ├── PEMS04/
│   │   ├── adj_PEMS04_distance.pkl
│   │   ├── adj_PEMS04.pkl
│   │   ├── PEMS04.csv
│   │   └── PEMS04.npz
│   ├── China-AQI/
│   │   └── China-AQI.csv
│   ├── Global-wind/
│   │   └── Global-Wind.csv
│   ├── preprocessing_METR_subset.py
│   ├── preprocessing_PEMS_subset.py
│   ├── preprocessing_AQI_subset.py
│   ├── preprocessing_WIND_subset.py
│   └── data_solve.py
│
├── model_results/            # Stores trained model weights
│   ├── PEMS04/
│   ├── METR-LA/
│   ├── China-AQI/
│   └── Global-Wind/
│
├── models/                  # Model architecture definitions
│   ├── forecasting/
│   │   ├── STID/
│   │   ├── TSmixer/
│   │   └── iTransformer/
│   └── imputation/
│       └── TimesNet/
│                        
├── Metircs/                # Loss (Merlin) and Metrics   
│   ├── Loss.py
│   └── mask_metrics.py
│
├── python train_teacher.py
├── python train_student_Merlin.py
├── python train_onestage.py
├── python train_twostage.py
├── requirement.txt           # Dependencies
└── README.md                 # Project overview and instructions

Acknowledge

We appreciate the following github repos for their valuable codebase:

STID: https://github.com/GestaltCogTeam/BasicTS
iTransformer and TimesNet: https://github.com/thuml/Time-Series-Library
TSmixer: https://github.com/google-research/google-research/tree/master/tsmixer

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Merlin: Multi-View Representation Learning for Robust Multivariate Time Series Forecasting with Unfixed Missing Rates

Requirements

Data processing

The hyperparameters of our model

STID

Merlin

Train the teacher model and the student model

Train one stage model

Train two stage model

citation

Folder Structure:

Acknowledge

About

Uh oh!

Releases 1

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 42 Commits
datasets		datasets
figure		figure
metrics		metrics
model_results		model_results
models		models
LICENSE		LICENSE
README.md		README.md
requirement.txt		requirement.txt
train_onestage.py		train_onestage.py
train_student_merlin.py		train_student_merlin.py
train_teacher.py		train_teacher.py
train_twostage.py		train_twostage.py

License

ChengqingYu/Merlin

Folders and files

Latest commit

History

Repository files navigation

Merlin: Multi-View Representation Learning for Robust Multivariate Time Series Forecasting with Unfixed Missing Rates

Requirements

Data processing

The hyperparameters of our model

STID

Merlin

Train the teacher model and the student model

Train one stage model

Train two stage model

citation

Folder Structure:

Acknowledge

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 1

Packages 0

Languages

Packages