This repository accompanies our paper at IEEE Information Theory Workshop (ITW) 2024: Sampling to Achieve the Goal: An Age-Aware Remote Markov Decision Process.
In this work, we investigate the problem of goal-oriented decision-making under random communication delays, a key challenge in remote control and networked systems. While the Age of Information (AoI) has been widely used to optimize freshness, existing works mainly treat AoI as a metric to minimize, without rigorously exploring its causal role in downstream decision-making. Furthermore, few solutions exist that connect AoI with remote MDP planning, nor are there principled algorithms with theoretical guarantees.
To bridge these gaps, we formulate the age-aware remote control problem as an MDP with age-dependent observations and propose a novel framework: the Age-Aware Remote MDP. Our key insight is to treat AoI not as a target metric, but as a structured, dynamic side-information that influences remote decision-making policy.
We address a key question:
🧠What is the value of information, when freshness fades—and only its power to shape remote decisions remains?
The Bisec-MRVI folder contains the code for Algorithm 1 in our paper.
Files | Description |
---|---|
MRVI_Bisec_Main.m | The main file. You can run this file directly to reproduce our result. |
MRVI.m | The inner layer MRVI algorithm. This function outputs the optimal value of the MDP. |
Qfunction_MRVI.m | This function outputs the modified Q-function in the Value Update process. |
ValueUpdate_MRVI.m | The function accomplishes the value update process in MRVI. |
The FPBI folder contains the code for Algorithm 2 in our paper.
Files | Description |
---|---|
FPBI_Main.m | The main file. You can run this file directly to reproduce our result. |
AoI_threshold.m | This algorithm reproduces the AoI-optimal policy proposed in Theorem 3 of Sampling for Data Freshness Optimization: Non-linear Age Functions. |
hstar.m | This function implements the fixed point iteration on h. |
hstar_AoI.m | This function implements the fixed point iteration for h under AoI-optimal sampling. |
hstar_Zerowait.m | This function implements the fixed point iteration for h under zero-wait sampling. |
Qfunction.m | This function outputs the Q-function in the Value Update process. |
ValueUpdate.m | This function implements the fixed point iteration on W. |
ValueUpdate_AoI.m | This function implements the fixed point iteration on W under AoI-optimal sampling. |
ValueUpdate_Zerowait.m | This function implements the fixed point iteration on W under zero-wait sampling. |
Transitionfunction.m | This function describes the transformed transition probability. |
If you encounter any issues with reproduction, feel free to reach out to me at hitliaimin@163.com.
You may cite this paper if it helps your research:
@inproceedings{li2024sampling,
author = {Aimin Li and Shaohua Wu and Gary C. F. Lee and Xiaomeng Chen and Sumei Sun},
title = {Sampling to Achieve the Goal: An Age-aware Remote Markov Decision Process},
booktitle = {Proc. {IEEE} {ITW}},
pages = {121-126},
year = {2024},
}