Find policy that maximize infinite horizon discounted reward

Assignment Help Engineering Mathematics
Reference no: EM131433538

Homework -

Q1. Consider a discounted cost problem with the following parameters:

-State space: {1, 2},

-Action spaces: U(1) = {1, 2}, U(2) = 1.

-Rewards: r(1; 1) = 5, r(1, 2) = 10, r(2, 1) =  1.

-Transition probabilities: P(u = 1) =1929_Figure.png; P(u = 2) = 1235_Figure1.png (* is undefined)

-Discount factor β = 0:9

Find a policy that maximizes infinite horizon discounted reward.

Q2. A target is randomly moving among I locations according to a Markov chain with transition matrix P. An agent wants to follow this target. At each time, the agent sees the target location and its own location. It then decides to move to a new location.

(i) If the target is at location i and the agent at location j at the beginning of time instant t, the agent incurs a cost of c(i, j).

(ii) If the agent's location at the beginning of time instant t is j and it decides to move to k, it incurs a moving cost of d(j, k).

Formulate the agent's problem as a discounted cost MDP. Use Matlab (or another software) to find the optimal policy with the following values:

I = 4;β = 0.95

1837_Figure2.png

Q3. A person has an umbrella that she takes from home to office and vice versa. There is a probability p of rain at the time she leaves home or office independently of earlier weather. If the umbrella is in the place where she is and it rains, she takes the umbrella to go to the other place (and this involves no cost). If there is no umbrella, and it rains, there is a cost W for getting wet. If the umbrella is in the place where she is but it does not rain, she may take the umbrella to the other place (and this involves an inconvenience cost V) or she may leave the umbrella behind (which involves no cost). Costs are discounted at a factor β, 0 < β < 1.
(a) Formulate this as an infinite horizon discounted cost problem. Identify the state and decision spaces. (Note that the decision spaces can be different for different states.)

(b) Write the fixed point equation for the value function and characterize the optimal strategy.

Q4. Show that the minimum cost is the solution of linear program:

Maximize J*

Subject to

J* + w(i) ≤ c(i, u) + j=1I Pij(u)w(j), 1 ≤ i ≤ I, u ∈ U.

Reference no: EM131433538

Questions Cloud

What services are usually included in assisted living : What is assisted living, and how does it differ from nursing facility care?Who provides assisted living?What services are usually included in assisted living?How is assisted living financed?What regulations apply to assisted living?What are some of t..
What makes an organization project portfolio successful : What makes an organization's project portfolio successful? How would you rate your organization's portfolio and what criteria do you base this on?
Company governance and stakeholder interests : In the last few years we have seen many examples of the breakdown between the company governance and stakeholder interests. Do you think these corporate scandals might have been played out differently if the corporations involved had built their busi..
What is senior housing : What is senior housing, and how does it differ from other types of long-term care?Who provides the various types of senior housing?What services are usually included in the various types of senior housing?a. Age-restricted communities ,b. Independent..
Find policy that maximize infinite horizon discounted reward : EE 556 Homework. Consider a discounted cost problem with the following parameters: Find a policy that maximizes infinite horizon discounted reward
Product profitability to focusing on customer profitability : How can we account for the upheaval in orientation from focusing on product profitability to focusing on customer profitability? If it's such a good idea, why didn't companies operate from the perspective of building customer value 50 years ago?
Stakeholders resist the implementation of change : Discuss the reasons why stakeholders resist the implementation of change. What are some of the signs indicating that there is a resistance to change within an organization?
How can a compensation plan be modified : Describe some ethical dilemmas sales professionals may encounter. How can a compensation plan be modified in order to minimize these dilemmas?
Engage in the communication process : Discuss the challenges that present themselves when people from different cultures engage in the communication process in at least 150 words.

Reviews

Write a Review

Engineering Mathematics Questions & Answers

  Prime number theorem

Dirichlet series

  Proof of bolzano-weierstrass to prove the intermediate value

Every convergent sequence contains either an increasing, or a decreasing subsequence.

  Antisymmetric relations

How many relations on A are both symmetric and antisymmetric?

  Distributed random variables

Daily Airlines fies from Amsterdam to London every day. The price of a ticket for this extremely popular flight route is $75. The aircraft has a passenger capacity of 150.

  Prepare a system of equations

How much money will Dave and Jane raise for charity

  Managing ashland multicomm services

This question is asking you to compare the likelihood of your getting 4 or more subscribers in a sample of 50 when the probability of a subscription has risen from 0.02 to 0.06.]  Talk about the comparison of probabilities in your explanation.

  Skew-symmetric matrices

Skew-symmetric matrices

  Type of taxes and rates in spokane wa

Describe the different type of taxes and their rates in Spokane WA.

  Stratified random sample

Suppose that in the four player game, the person who rolls the smallest number pays $5.00 to the person who rolls the largest number. Calculate each player's expected gain after one round.

  Find the probability density function

Find the probability density function.

  Develop a new linear programming for an aggregate production

Linear programming applied to Aggregate Production Planning of Flat Screen Monitor

  Discrete-time model for an economy

Discrete-time model for an economy

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd