How does the optimal policy path look like

Assignment Help Computer Engineering

Reference no: EM133399561

Question 1
Submit your final project. You are expected to submit a document containing the code and the obtained results, and a 2~5 minute video showing how the project runs.

Final Project Document
Include all the project and dataset description that you have submitted in the previous assignment. If you have modified anything, make sure to include the latest descriptions. Add to that the project code. Include both, the code as text, and screenshots of the code. Also, include screenshots that show the obtained results when you run the code.

Project Video
Include a short video (around 2-5 minutes) explaining your project and show how it works. Run the code, enter any necessary input, show the obtained results. Explain what the project is about, what the code does, and what the obtained results mean. You can use your ASU zoom account to record a video, capturing your screen and showing the code running, store it in the cloud, and include the link to the video.

Question 2
Reinforcement Learning
Reinforcement learning is more suitable for simulated environments, and environments in which making mistakes has very low to zero cost. For the applications listed below, list whether reinforcement learning is suitable or not, explaining why. Feel free to include details that better define the environment or the conditions under which reinforcement learning would or would not be suitable.
1. Medical diagnosis of cancer patients
2. Recommending the next action to an auto-pilot
3. Exploring a area with limited space to explore
4. Identifying plants based on their physical features
5. Recommending buying/selling decisions for stocks
6. Controlling a robot arm that assembles toys
7. Training a self-driving car in a simulated environment
8. Training a self-driving car in a real environment (driving in real streets)

Question 3
Discussion: Ethical Issues Related to AI
.AI is being used in many applications related to various life aspects. This led to rising concerns about un-ethical uses of AI.
One controversial example for AI usage is deep fakes. Many have shared concerns about how such technology, powered by AI, can be used in illegitimate ways. You can read more about deep fakes here.
In this discussion, write about one use of AI that you think could be unethical. You can talk about deep fakes, or any other AI technology of your choice.
Grading Criteria:
1. List an AI Application (5 points)
2. Explain what this application is about (10 points)
3. List one usage for this application that may be considered unethical, explaining why that could be the case. (15 points)

Question 4
Training a Room Explorer using RL

Run the script after modifying the main code. Answer the questions below.
1. How does the optimal policy path look like? Draw the path on a 4 X 4 grid.
2. Does it contain as much right or left turns as the policies generated by the old reward function?
3. What is the number of times the exploration task was carried out until the optimal policy was found? (If you are lucky, the code will re run a couple of hundred times. If not, it can keep running for a couple of thousand times before it finds the optimal exploration policy. It should not take more than a couple of minutes maximum.)

Question 5
Using a FSM to program a Self-Driving Car Agent

Keep in mind that there are multiple ways by which a self-driving vehicle can be designed. The choice for a specific method depends on the task environment.
A FSM method would be suitable for simpler environments, with a relatively small number of states, and a small number of input percepts and actions. That's because FMS design relies on listing all percepts at different states, and deciding which action should be taken accordingly. Below is a FSM developed by a research team for high-level decision making in a self-driving car:

Components of a FSM
The finite state machine shows a set of states, representing different actions, and how the transition occurs from one action to another based on the input percept. To design a FSM, you need to first identify:
1. Input Percept: All input percepts, and the values they may take.
2. States: All possible actions/states.
3. Transition Model: How the transition should happen from one state to the another based on the perceived input.

Question 6
Word Suggestion with N-Grams
In this lab, we will generate N-grams for a given text corpus, and use it to make suggestions for the next word to type.

Attachment:- Reinforcement Learning.rar

Reference no: EM133399561

Questions Cloud

What life was like before the industrial revolution : Describe what life was like before the Industrial Revolution. What are some things that impact our daily life that resulted from the Industrial Revolution?

What does it mean to say that a database displays : What does it mean to say that a database displays both entity integrity and referential integrity? And why entity integrity and referential integrity important

Describe one successful way sound music is used to engage : Describe one successful way sound/music is used to engage the audience -- use the text to help describe why you think so. Describe one unsuccessful way

Describe how groups and cultures are presented : Describe how groups and/or cultures are presented in all your advertisements. Your analysis is limited to the people in your ads who form groups

How does the optimal policy path look like : How does the optimal policy path look like? Draw the path on a 4 X 4 grid. 2. Does it contain as much right or left turns as the policies generated by the old

Defines the product or service they are serving : Company defines the customers. Company defines the product or service they are serving. Company defines Define the company's values. As per the above

Discuss the aws simple monthly calculator : What are some of the general use cases that are covered? What are the requirements to use this calculator?

Do you want to info technology course to make it worthwhile : What do you want to learn from Computer and Info Technology course to make it worthwhile? Is a basic understanding of information systems vital to you?

How does filmmaker sembène use sexuality in the film xala : How does the filmmaker Sembène use sexuality in the film Xala to represent corruption? Think, for instance, of the relationship between sexual impotence

User Account

All Pages