Involving Elementary Reinforcement Learning Assignment

Reference no: EM132479438

Assignment - Involving Elementary Reinforcement Learning

Assignment Objectives - These are the tasks you have to do:

Learn the best floor at which the car should wait, using the Tsetlin, Krinsky, Krylov and L_RI schemes.

In each case, use suitable values for the "memory" and the learning parameter.

In each case, plot the ensemble average of the waiting time for an ensemble of 100 experiments.
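The three fixed-structure schemes named in the objectives can be sketched in a single class. This is a minimal sketch based on the usual textbook descriptions of the Tsetlin, Krinsky and Krylov automata, not a reference implementation for this assignment. The class name, the `kind` and `depth` parameters, and the choice to move to the next action in cyclic order on a boundary penalty are all assumptions made for illustration.

```python
import random


class FixedStructureAutomaton:
    """Hypothetical helper: k actions, each with states 1 (deepest) .. depth (boundary)."""

    def __init__(self, k, depth, kind="tsetlin", rng=None):
        if kind not in ("tsetlin", "krinsky", "krylov"):
            raise ValueError(f"unknown scheme: {kind}")
        self.k = k
        self.depth = depth                      # the "memory" per action
        self.kind = kind
        self.rng = rng or random.Random()
        self.action = self.rng.randrange(k)     # 0-based index of the chosen floor
        self.state = depth                      # start at the boundary state

    def _reward(self):
        if self.kind == "krinsky":
            self.state = 1                      # Krinsky: jump to the deepest state
        else:
            self.state = max(1, self.state - 1) # move one step deeper

    def _penalty(self):
        if self.state < self.depth:
            self.state += 1                     # drift toward the boundary
        else:
            # At the boundary: switch to the next action (assumed cyclic order),
            # entering it at its own boundary state.
            self.action = (self.action + 1) % self.k

    def update(self, penalized):
        """Apply one reward (penalized=False) or penalty (penalized=True)."""
        if penalized and self.kind == "krylov" and self.rng.random() < 0.5:
            penalized = False                   # Krylov: half the penalties act as rewards
        if penalized:
            self._penalty()
        else:
            self._reward()
```

With this layout the three schemes differ only in how a reward or penalty moves the state, so the same experiment loop can drive all of them by changing `kind`.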

Introduction - The goal of this assignment is to have you implement some simple Learning Automata (LA) and Reinforcement Learning (RL) strategies. The application domain is quite straightforward, but it is typical of the domains where LA and RL can be used.

Problem Statement - The assignment consists of using LA to solve a simplified version of the Elevator Problem. A building has an elevator that stops at k floors, as in Figure 1. People can request the elevator from floor i and get off on floor j, where i, j ∈ {1, ..., k} and i ≠ j. At any time instant, the probability that the elevator is requested from a floor, and the probability that the person gets off on another floor, are given by the matrices E and L respectively (not shown), whose components are the probabilities of a person entering and leaving at each floor. After each trip, the elevator can move to rest at any one of the k floors so as to minimize the waiting time for the next passenger.


Figure 1: A simple model of an Elevator system, where the car can be made to wait at one of k possible floors.

The distributions E and L are unknown to the LA/RL scheme. However, the unknown waiting time can be quantified as follows. Every index i ∈ {1, 2, . . . , k} is mapped to a unique number G(i) ∈ {1, 2, . . . , k} by a one-to-one and onto mapping. In this assignment, you can assume that k = 6. The waiting time f_i, which is unknown to the LA/RL system, then obeys the equation:

f_i = 0.8 · G(i) + 0.4 · CEIL[G(i)/2] + h,   i ∈ {1, 2, ..., 6},

where the waiting times are affected by a noise term, h, that has a Gaussian distribution with a mean of 0 and a variance σ², which is an input parameter, and where CEIL[] is the "Ceiling" function of its argument. In other words, at any time t, h is a random number drawn from a Gaussian distribution that represents the random delays at the respective floors. In this assignment, you can use a Gaussian random number generator from a standard library available in the language that you are using.
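The waiting-time model above can be simulated in a few lines. The following is a minimal sketch using Python's standard `random` module; the function names are hypothetical, and the two terms in G(i) are read as being added together, which is an assumption about the equation. Note that `random.gauss` takes the standard deviation σ, not the variance σ².

```python
import math
import random

K = 6  # number of floors, as stated in the assignment


def make_mapping(rng):
    """Return a random one-to-one and onto mapping G from floors 1..K to 1..K."""
    values = list(range(1, K + 1))
    rng.shuffle(values)
    return dict(zip(range(1, K + 1), values))


def mean_waiting_time(g):
    """Noise-free part of f_i; the '+' between the two terms is an assumption."""
    return 0.8 * g + 0.4 * math.ceil(g / 2)


def waiting_time(floor, mapping, sigma, rng):
    """Sample f_i for a floor; sigma is the standard deviation of the noise h."""
    noise = rng.gauss(0.0, sigma)
    return mean_waiting_time(mapping[floor]) + noise
```

A typical use would be `rng = random.Random(0)`, `G = make_mapping(rng)`, and then `waiting_time(3, G, 0.5, rng)` each time the car waits on floor 3. The automaton never sees `G`; it only observes the sampled times.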

Questions - During the demo you should be prepared to discuss the following questions:

1. Explain the way you chose the parameters for each scheme.

2. Can you rank the schemes in terms of their speed/accuracy?
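One way to prepare for both questions is to compute the ensemble average of the waiting time for several parameter values and compare the curves. The sketch below does this for a linear reward-inaction (L_RI) automaton. Several things in it are assumptions rather than part of the assignment text: the function names, the `+` between the two terms of the waiting-time equation, the bound `t_max`, and above all the rule that turns a continuous waiting time into a reward (here, reward with probability 1 - t / t_max). Here `lam` is the factor by which the non-chosen probabilities shrink on a reward, so values closer to 1 mean slower learning.

```python
import math
import random

K = 6  # number of floors


def mean_time(g):
    # The '+' between the two terms is an assumption about the source equation.
    return 0.8 * g + 0.4 * math.ceil(g / 2)


def run_lri(lam, sigma, steps, rng, t_max=8.0):
    """One experiment; returns the waiting time observed at every step."""
    mapping = list(range(1, K + 1))
    rng.shuffle(mapping)                  # mapping[a] plays the role of G(a + 1)
    p = [1.0 / K] * K                     # action probability vector
    times = []
    for _ in range(steps):
        a = rng.choices(range(K), weights=p)[0]
        t = mean_time(mapping[a]) + rng.gauss(0.0, sigma)
        times.append(t)
        # Assumed feedback rule: shorter waits are rewarded more often.
        reward_prob = min(1.0, max(0.0, 1.0 - t / t_max))
        if rng.random() < reward_prob:
            # L_RI reward step: scale every probability by lam, then give
            # the freed mass (1 - lam) to the chosen action.
            p = [lam * q for q in p]
            p[a] += 1.0 - lam
        # On a penalty the probabilities are left unchanged (inaction).
    return times


def ensemble_average(lam, sigma, steps=2000, experiments=100, seed=0):
    """Average the waiting time at each step over independent experiments."""
    rng = random.Random(seed)
    totals = [0.0] * steps
    for _ in range(experiments):
        for n, t in enumerate(run_lri(lam, sigma, steps, rng)):
            totals[n] += t
    return [s / experiments for s in totals]
```

The list returned by `ensemble_average` can be plotted against the step index with any plotting library. Repeating the call for a few values of `lam` gives one curve per setting, which is one possible basis for discussing speed against accuracy.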
