During any day, I may own either 0 or 1 share of a stock. The price of the stock is governed by the Markov chain shown in Table. Tomorrow’s Price Today’s Price $0 $1 $2 $3 $0 .5 .3 .1 .1 $1 .1 .5 .2...

1 answer below »

During any day, I may own either 0 or 1 share of a stock. The price of the stock is governed by the Markov chain shown in Table.



















































Tomorrow’s Price





Today’s Price



$0



$1



$2



$3



$0



.5



.3



.1



.1



$1



.1



.5



.2



.2



$2



.2



.1



.5



.2



$3



.1



.1



.3



.5



At the beginning of a day in which I own a share of stock, I may either sell it at today’s price or keep it. At the beginning of a day in which I don’t own a share of stock, I may either buy a share of stock at today’s price or not buy a share. My goal is to maximize my expected discounted profit over an infinite horizon (use β = .95).


a Use the policy iteration method to determine an optimal stationary policy.


b Use linear programming to determine an optimal stationary policy.


c Perform two iterations of value iteration.


d Find a policy that maximizes average daily profit.



Answered Same DayDec 24, 2021

Answer To: During any day, I may own either 0 or 1 share of a stock. The price of the stock is governed by the...

Robert answered on Dec 24 2021
135 Votes
During any day, I may own either 0 or 1 share of a stock. The price of the stock is governed by the Markov
chain shown in Table.
Tomorrow’s Price
Today’s Price $0 $1 $2 $3
$0 .5 .3 .1 .1
$1 .1 .5 .2 .2
$2 .2 .1 .5 .2
$3 .1 .1 .3 .5
At the beginning of a day in which I own a...
SOLUTION.PDF

Answer To This Question Is Available To Download

Related Questions & Answers

More Questions »

Submit New Assignment

Copy and Paste Your Assignment Here