BUST10133 Decision Analytics Assignment

Assignment Help Game Theory
Reference no: EM132731584

BUST10133 Decision Analytics - University of Edinburgh

SECTION A

1. a) i) Describe the properties of an absorbing state in a Markov chain.
ii) Explain what the absorption probability means for an absorbing state in a Markov chain.

Consider the following Markov chain model of consumers' purchasing behaviour of groceries from five supermarkets A, B, C, D and E. The state represents the supermarket. The set of possible states is {A, B, C, D, E}. The transition matrix models the change of consumers' preferences in supermarket during any month.

b) Consider customers preferring supermarket B. Calculate the probability that these customers switch to supermarket D or E three months later.

c) Calculate the probability that customers ever switch to supermarket D from supermarket B before reaching supermarket E.

d) Calculate the expected time until the customers switch from supermarket B to D before reaching supermarket E.

e) For some reasons, supermarket E is closed. Customers preferring supermarket E have a 50% chance of switching to supermarket C and a 50% chance of switching to supermarket D.

Modify the Markov chain model and find the long-run probability distribution of the state of the process at the beginning of a month.

2. a) Explain why the maximum expected total reward is unsuitable as a decision- making criterion in an infinite horizon Markov decision process, and how to overcome this.

Consider the following Markov decision process model of the sales of a product. The state of the process is the classification of sales performance at the beginning of a month--- excellent (e), good (g) or poor (p). Hence the set of possible states is S = {e, g, p}. The decision is the promotion for this month --- either nothing (n), direct marketing (d), or advertising (a). Hence, the set of possible decisions in state i is Ki =
{n, d, a} for all i in S. The following table shows rki, the expected profit (in £'000s) during a month when the process is in state i and action k is chosen, and pki,j the probability that the process makes a transition to state j when the process is in state i and action k is chosen.

i

k

rki

pki,e

pki,g

pki,p

e

n

100

0.5

0.5

0

e

d

80

0.7

0.3

0

e

a

50

1

0

0

g

n

60

0

0.6

0.4

g

d

40

0.5

0.5

0

g

a

10

0.8

0.2

0

p

n

10

0

0

1

p

d

-10

0

0.4

0.6

p

a

-30

0.3

0.7

0

The objective is to maximise the infinite horizon expected discounted reward with a discount factor of 0.8 per month.

b) Use policy iteration to determine whether the promotion decision that does nothing when the sales performance is excellent, does direct marketing when the sales performance is good, and does advertising when the sales performance is poor is optimal.

c) i) By how much can rap change without affecting the conclusion from (b)?

ii) Suppose that pde,e = q and pde,g = 1-q. Within what range of q does the conclusion from (b) still hold?

d) Suppose that it takes one month to organise a special promotion, meaning that the decision made at the beginning of a month is the special promotion to use in the next month, rather than this month. Explain how you would modify the Markov decision

process model above for this situation. Explain how the state, action, immediate reward, and transition probability will change.

SECTION B

3. The Edinburgh Cashmere Company sells cashmere scarfs. The company has found that when the winter is very cold they sell 10000 scarfs; when the winter is average they sell 7000 scarfs, and when the winter is mild they sell 5000 scarfs. Each scarf sells for £50 and costs £35 to make. Any unsold scarfs during the season are sold to a discount chain at half price (£25). The company has learnt a forecast for the coming winter, saying that there was a 30% chance that the winter would be very cold, a 40% chance that it would be average, and a 30% chance that it would be mild.

a) Under the Hurwicz decision criterion with the coefficient of optimism ?? = 0.6, how many scarfs should the company make to maximise profits?

Following the above information, the Edinburgh Cashmere Company can choose to do sales promotion A or promotion B when the winter is average or mild. By adopting Promotion A, the company can sell additional 2000 scarfs. And by adopting Promotion B, the company can sell additional 3000 scarfs. Promotion A costs £15000, and Promotion B costs £30000. From past experiences, Promotion A has a 35% success rate, and Promotion B has a 50% success rate.

b) Draw a decision tree that models this problem and use it to determine the policy that the company should adopt to maximise expected monetary value.

c) What is the expected value of perfect information on the winter weather?

d) Suppose the probability of promotion B being successful is unknown. Determine the range of this probability within which the policy determined in (b) remains optimal.

4. a) Give three reasons why the decision tree approach is not an efficient way to solve a sequential sampling or bandit problem.

A manufacturer produces mugs in batches of 1,000 which can either be accepted or rejected. If the batch is accepted, the manufacturer can make a profit of £5,000. If the batch is rejected, the manufacturer makes a loss of £3,000. The manufacturer estimates that the probability that the batch is accepted is 80%.

Before releasing this batch of mugs to customers, the manufacturer can either perform up to two independent tests on the sample mugs from this batch or rework this batch.

Reworking a batch involves examining every mug and replacing all faulty mugs with conforming ones so that the batch will definitely be accepted. The estimated cost of reworking a batch is £1000.

Performing a test costs £200, but the test does not give perfect information. Experience has shown that the probability the test returns a positive result given that the batch is good is 0.9, and the probability the test returns a negative result given that a batch is bad is 0.8.

b) Model this situation as a sequential sampling problem and find the functional equations for the problem.

c) Use the model from (b) to determine a testing policy the manufactory should adopt in order to maximise its expected profit.

d) Calculate the expected value of perfect information (EVPI) for the problem.

e) Calculate the risk profile for the policy determined in (c).

Attachment:- Decision Analytics.rar

Reference no: EM132731584

Questions Cloud

Calculate the expected rate of return on the stock : The betas for these four stocks are 0.4, 2, 1.35, and -0.2, respectively. What is the portfolio beta?
Net present value of project-redmond incorporated : Redmond Incorporated, with a tax rate of 26% and a weighted average cost of capital of 10.5%, is looking at a project that requires an outlay of $150,000 for eq
Define flow as it relates to hip hop : In your own words, define 'flow' as it relates to hip hop. Is there a specific rapper who's flow you particularly like? What about their flow is appealing.
Net present value of cash flows-darrington inc : Darrington Inc. is evaluating an equipment purchase which requires an expenditure of $473 today followed by an inflow of $150 in year one
BUST10133 Decision Analytics Assignment : BUST10133 Decision Analytics Assignment Help and Solution, University of Edinburgh - Assessment Writing Service - Describe the properties of an absorbing state
Research the six fundamental human rights : In 500 words or more (no less), research the 6 fundamental human rights: The Right to Life, the Right to the Freedom from Harm, the Right to Free Thought.
State the percentage rate to two places : State the percentage rate to two places, and do not use the % symbol.
What overall argument is the author making : What overall argument is the author making? What specific examples does the author focus on in the reading? How is this argument being made?
What is the fair and present value of lease : You have a structured settlement but you need cash now... so you call J.G. Wentworth and they make you a low-ball offer that you reject.

Reviews

Write a Review

Game Theory Questions & Answers

  Use the best-response approach to find all nash equilibria

Player 1 has the following set of strategies {A1;A2;A3;A4}; player 2’s set of strategies are {B1;B2;B3;B4}. Use the best-response approach to find all Nash equilibria.

  A supplier and a buyer, who are both risk neutral

A supplier and a buyer, who are both risk neutral, play the following game,  The buyer’s payoff is q^'-s^', and the supplier’s payoff is s^'-C(q^'), where C() is a strictly convex cost function with C(0)=C’(0)=0. These payoffs are commonly known.

  Pertaining to the matrix game theory problem

Pertaining to the matrix need simple and short answers, Find  (a) the strategies of the firm (b) where will the firm end up in the matrix equilibrium (c) whether the firm face the prisoner’s dilemma.

  Nash equilibria

Consider the two-period repeated game in which this stage game is played twice and the repeated-game payo s are simply the sum of the payo s in each of the two periods.

  Find the nash equilibrium

Two players, Ben and Diana, can choose strategy X or Y. If both Ben and Diana choose strategy X, every earns a payoff of $1000.

  Construct the payoff matrix for the game

The market for olive oil in new York City is controlled by 2-families, Sopranos and Contraltos. Both families will ruthlessly eliminate any other family that attempts to enter New York City olive oil market.

  Question about nash equilibrium

Following is a payoff matrix for Intel and AMD. In each cell, 1st number refers to AMD's profit, while second is Intel's.

  Finding the nash equilibrium

Determine the solution to the given advertising decision game between Coke and Pepsi, assuming the companies act independently.

  Nash equilibria to determine the best strategy

Little Kona is a small coffee corporation that is planning entering a market dominated through Big Brew. Each corporation's profit depends on whether Little Kona enters and whether Big Brew sets a high price or a low price.

  Creating a payoff table

Suppose you and your classmate are assigned a project on which you will earn one combined grade. You each wish to receive a good grade, but you also want to avoid hard work.

  Determine the nash equilibrium for trade policy

Consider trade relations in the United State and Mexico. Suppose that leaders of two countries believe the payoffs to alternative trade policies are as follows:

  Find the nash equilibrium outcomes

Use the given payoff matrix for a simultaneous move one shot game to answer the accompanying questions.

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd