What is required to define a well-posed learning problem

Assignment Help Computer Engineering
Reference no: EM13807902

Question 1:

(a) What is required to define a well-posed learning problem?

(b) Here are two potential real-world application tasks for machine learning:

1. a winery wishes to uncover relationships between records of the quantitative analyses of its wines from the lab and some key subjective descriptions applied to its wine (e.g. dry, fruity, light, etc.)

2. you want to predict students' marks in the final exam of COMP9417 given their marks from the other assessable components in the course - you may assume that the corresponding data from previous years is available Pick one of the tasks and state how you would define it as a well-posed machine learning problem in terms of the above requirements.

(c) Suggest a learning algorithm for the problem you chose (give the name, and in a sentence explain why it would be a good choice).

Question 2:

(a) Write an algorithm called "Find-G" to find a maximally-general consistent hypothesis. You can assume the data will be noise-free and that the target concept is in the hypothesis space.

(b) Outline the steps in a proof that Find-G will never fail to cover a positive example in the training set.

Question 3: Decision Tree Learning

(a) Describe the main steps in the basic decision tree learning algorithm.

The table below contains a sample S of ten examples. Each example is described using two Boolean attributes A and B. Each is labelled (classified) by the target Boolean function.

Id A B Class

1 1 0 +

2 0 1 -

3 1 1 -

4 1 0 +

5 1 1 -

6 1 1 -

7 0 0 +

8 1 1 +

9 0 0 +

10 0 0 -

(b) What is the entropy of thse examples with respect to the given classification?

This table gives approximate values of entropy for frequencies of positive examples in a two-class sample.

Frequency of class ‘+' in sample Entropy of sample

0.0 0.00

0.1 0.47

0.2 0.72

0.3 0.88

0.4 0.97

0.5 1.00

0.6 0.97

0.7 0.88

0.8 0.72

0.9 0.47

1.0 0.00

What is the information gain of attribute A on sample S above?

(d) What is the information gain of attribute B on sample S above?

(e) Which would be chosen as the "best" attribute by a decision tree learner using the information gain splitting criterion? Why?

(f) Describe a method for overfitting-avoidance in decision tree learning.

Question 4: Learning for Numeric Prediction

(a) Let the weights of a two-input perceptron be: w0 = 0.2, w1 = 0.5 and w2 = 0.5. Assuming that x0 = 1, what is the output of the perceptron when:

[i] [1 mark] x1 = -1 and x2 = -1 ?

[ii] [1 mark] x1 = -1 and x2 = 1 ?

Letting w0 = -0.2 and keeping x0 = 1, w1 = 0.5 and w2 = 0.5, what is the perceptron output when:

[iii] [1 mark] x1 = 1 and x2 = -1 ?

[iv] [1 mark] x1 = 1 and x2 = 1 ?

(b) [6 marks] Here is a regression tree with leaf nodes denoted A, B and C:

X <= 5 : A

X > 5 :

| X <= 9: B

| X > 9: C

This is the training set from which the regression tree was learned:

5 Please see overX Class

1 8

3 11

4 8

6 3

7 6

8 2

9 5

11 12

12 15

14 15

Write down the output (class) values and number of instances that appear in each of the leaf

nodes A, B and C of the tree.

Question 5: Neural and Tree Learning on Continuous Attributes

(a) In general, feedforward neural networks (multi-layer perceptrons) trained by error back-propagation are:

(i) fast to train, and fast to run on unseen examples

(ii) slow to train, and fast to run on unseen examples

(iii) fast to train, and slow to run on unseen examples

(iv) slow to train, and slow to run on unseen examples

In one sentence, explain your choice of answer.

Suppose you have a decision tree (DT) and a multi-layer perceptron (MLP) that have been trained on data sampled from a two-class target function, with all attributes numeric. You can think of both models as graphs whose edges are labelled with numbers: weights in the MLP and threshold constants for attribute tests in the DT.

(b) Compare and contrast the roles of these numbers in the two models.

(c) Compare and contrast the methods of learning these numbers in the two models.

Reference no: EM13807902

Questions Cloud

Prepare a profit payoff table from transrail point : Prepare a profit payoff table from Transrail's point of view. For this payoff table find Transrail's optimal decision using (1) the pessimistic approach, (2) the optimistic approach, and (3) minimax regret approach.
World health survey of the world health organization. : What are your impressions of the survey items and the level of understanding needed to administer the survey? Pretend you are the surveyor.
Examine the concept of a global company : Write a six pages paper summarizing your findings and drawing from academic sources as well as company information. In your paper, examine the concept of a global company and the question-is a global company a synthesis of its component cultures
Confidence interval interpretation issues : .What is the population that the confidence interval in the article is making an inference about? Be specific.
What is required to define a well-posed learning problem : What is required to define a well-posed learning problem? Here are two potential real-world application tasks for machine learning: a winery wishes to uncover relationships between records of the quantitative analyses of its wines from the lab and s..
Concept of a global company : Write a six pages paper summarizing your findings and drawing from academic sources as well as company information. In your paper, examine the concept of a global company and the question-is a global company a synthesis of its component cultures, ..
Describe the current state of selected company : Create an MS PowerPoint Presentation in which you describe the current state of your selected company's supply chain and an identified process.
Appropriate for a hospital emergency room : Discuss whether you think a 30 percent capacity cushion is appropriate for a hospital emergency room. What factors would you consider when setting this cushion?
Find the maximum rotation period t of the cylinder : Find the maximum rotation period T of the cylinder which would prevent a 61 kg person from down - find the centripetal force experienced by the ball.

Reviews

Write a Review

Computer Engineering Questions & Answers

  A big company has many different departments housed in a

a large company has many different departments housed in a large office complex in the city. each department has its

  Write server program act a memory manager

The server program accepts memory requests for each of several clients. A page table is built for each client. The frames assigned to the client are sent hack to the client.

  Estimating average access time of system

The hit ratio for read accesses only is 0.9. A write-through procedure is used. Explain the average access time of system taking only memory read cycles?

  Program in visual basic to calculate area of a triangle

Write down a program in the visual basic which determines the area of a triangle. If three sides a, b and c entered do not make a triangle transmit the message out to the screen that the data entered was invalid.

  Values of the cf and of

assume that EAX contains a given contents before the instruction is executed, and give the contents of EAX as well as the values of the CF, OF, SF, and ZF flags after the instruction is executed:

  Find the class declaration

utalize the member function max() as an external function.

  A program that would calculate and print the parking charges

A parking garage charges a atleast fee of $2.00 to park for up to 3 hours. The garage charges an additional $0.50 per hour for each hour or part thereof in excess of 3 hours.

  Write a base class to hold basic computer information

Write a class that inherits the computer class for Windows computers.

  Write a program that plays a simple trivia game

Write a program that plays a simple trivia game. The game should have five questions. Each question has a corresponding answer and point value between 1 and 3 based on the difficult of the question.

  Design c++ program to accept an n*m matrix

Write down a C++ program to accept an n*m matrix containing only 0 and 1 values, and then find it's biggest zero submatrix i.e. the biggest submatrix in which all the values are 0.

  How to plan a payroll class

How to plan a PayRoll class.

  Questionthis question tests your facts and understanding of

questionthis question tests your facts and understanding of normal forms.given following relation and functional

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd