In this assignment you have to simulate a simple version of an inventory control problem. The objective is to familiarize with the computation of the dynamic programming solution and compare it with a CCP method.
Deadline: Please send your problem set before January 10th, 2025.
see here
There are two types of penalty term in the utility function, each with their economic interpretation.
- In this period t, I somehow want to eat chips
$c>0$ , but I have run out of chips$i=0$ . - I incur some penalty because I can not eat my fav snack right away. I need to go out and buy some.
- Then I go to the supermarket, see the price
$(p_s)$ . I know the state$(i=0,c>0,p, \epsilon)$ when making the decision$x$ . - If I decide to buy
$x=1$ , then my next period inventory would be$i'=i+x-c=0.75$ - If I decide not to buy
$x=0$ , then my next period$i'=\max{0,i+x-c}=0$
- In this period t, I somehow want to eat chips
$c>0$ , but I have run out of chips$i=0$ . - Then I go to the supermarket, see the price
$(p_s)$ . I know the state$(i=0,c>0,p, \epsilon)$ when making the decision$x$ . - If I decide to buy
$x=1$ , then I can happily eat the chips and there's no penalty. My next period$i'=i+x-c=0.75$ - If I decide not to buy
$x=0$ , I incur the loss of "starving". Then my next period$i'=\max{0,i+x-c}=0$
see here
see here