Network traffic matrix prediction with incomplete data via masked matrix modeling

Introduction

This repo is the implementation of "Network traffic matrix prediction with incomplete data via masked matrix modeling" (Information Sciences, Under review).

Dataset

Two publicly available datasets are utilized to validate the proposed prediction method, namely the Abilene and GÉANT datasets. They provide the statistical traffic volume data of the real network traffic trace from the American Research and Education Network (Abilene) and the Europe Research and Education Network (GÉANT) .

Topology	Nodes	Flows	Links	Interval	Horizon	Records
Abilene	12	144	15	5 min	6 months	48046
GÉANT	23	529	38	15 min	4 months	10772

Framework

Masked matrix modeling-based matrix completion
- Masked matrix modeling (Mask generation, Pre-filling, and Reconstruction)
- 3D-UNet module
Traffic matrix prediction
- LSTM2D module

Results

Baselines: Zero-filling/Mean-filling/KNN/MC-NMF/LMaFit/IALM-MC/SRCNN/GCRINT - LSTM2D/LSTNet/MTGNN

Environment

python=3.7.9

torch==1.7.0

tsai==0.3.0

numpy==1.19.2

...

*more details can be found at pytorch-gpu.yml.

Getting Started

Mask Generation

utils/data_help.py

dataset = 'abilene'
gen_normal_missing_matrix(dataset=dataset,mean_ratio=0.1,std=0.05,counts=3)

Config

config.py
config = Config(
    device=torch.device('cuda' if torch.cuda.is_available() else 'cpu'),
    gpu=0,
    cpu=os.cpu_count(),
    model = 'UNet3D_LSTM2D',
    bilinear = True,
    kernel_size = 5,
    in_chan = 1,
    dataset = 'abilene', # abilene or geant
    epochs=200,
    batch_size=32,
    learning_rate=0.0001,
    seq_len=26,  # previous timestamps to use
    pre_len=1,  # number of timestamps to predict
    dim_model = 64,
    rounds = 3,
    heads = 1,
    dim_ff = 512,
    train_rate = 0.6,
    test_rate=0.2,
    rnn_layers =3,
    encoder_layers =1,
    dropout = 0.2,
    missing_ratio = 0.4,
    std = 0.05,
    early_stop = 15,
    flow = 144, # for abilene
    # flow = 529, # for geant
    lw=0.5,
    test_during_training=True
)

Training

python prediction_with_md_train.py

...

Name		Name	Last commit message	Last commit date
Latest commit History 17 Commits
dataset		dataset
dict		dict
logs		logs
model		model
recored_logs		recored_logs
tensorboardslogs		tensorboardslogs
topo		topo
utils		utils
README.md		README.md
args.py		args.py
configs.py		configs.py
main.py		main.py
miss_train.py		miss_train.py
npy_to_mat.py		npy_to_mat.py
prediction_test.py		prediction_test.py
prediction_train.py		prediction_train.py
prediction_with_imputer.py		prediction_with_imputer.py
prediction_with_knn.py		prediction_with_knn.py
prediction_with_md_train copy.py		prediction_with_md_train copy.py
prediction_with_md_train.py		prediction_with_md_train.py
prediction_with_miss.py		prediction_with_miss.py
pytorch-gpu.yaml		pytorch-gpu.yaml
restruction_train.py		restruction_train.py
unite_train.py		unite_train.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Network traffic matrix prediction with incomplete data via masked matrix modeling

Introduction

Dataset

Framework

Results

Environment

Getting Started

Mask Generation

Config

Training

About

Uh oh!

Folders and files

Latest commit

History

Repository files navigation

Network traffic matrix prediction with incomplete data via masked matrix modeling

Introduction

Dataset

Framework

Results

Environment

Getting Started

Mask Generation

Config

Training

About

Resources

Uh oh!

Stars

Watchers

Forks