Compute the Euclidean distance

Reference no: EM132225850

Homework

Q1. For each part below, indicate whether we would generally expect the performance of a flexible statistical learning method to be better or worse than that of an inflexible method. Justify your answer.

(i) The sample size n is extremely large, and the number of predictors p is small.

(ii) The number of predictors p is extremely large, and the number of observations n is small.

(iii) The relationship between the predictors and response is highly non-linear.

(iv) The variance of the error terms, i.e. σ² = Var(ε), is extremely high.

Q2. We have data from a questionnaire survey (asking people's opinions) and from objective testing with two attributes (acid durability and strength) to classify whether a special paper tissue is good or not. The following table gives the training sample:

X1 = Acid Durability (seconds)    X2 = Strength (kg/square meter)    Y
7                                 7                                  Bad
7                                 4                                  Bad
3                                 4                                  Good
1                                 4                                  Good
5                                 3                                  Good
7                                 5                                  Good

Now the factory produces a new paper tissue that passes the laboratory test with X1 = 3 and X2 = 7. Suppose we wish to use this data set to make a prediction for Y when X1 = 3 and X2 = 7 using K-nearest neighbors.

(i) Compute the Euclidean distance between each observation and the test point X1 = 3 and X2 = 7.

(ii) What is our prediction with K = 1? Why?

(iii) What is our prediction with K = 3? Why?

(iv) If the Bayes decision boundary in this problem is highly non-linear, then would we expect the best value for K to be large or small? Why?
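
Parts (i) to (iii) can be checked with a short script. The following is a minimal Python sketch, not a prescribed solution: the helper names `euclidean` and `knn_predict` are hypothetical, and it assumes a plain majority vote among the K closest training points with no special handling of tied votes. The distance between a training point (a1, a2) and the test point (3, 7) is sqrt((a1 - 3)^2 + (a2 - 7)^2).

```python
import math
from collections import Counter

# Training sample copied from the table in Q2: (X1, X2, Y)
training = [
    (7, 7, "Bad"),
    (7, 4, "Bad"),
    (3, 4, "Good"),
    (1, 4, "Good"),
    (5, 3, "Good"),
    (7, 5, "Good"),
]
test_point = (3, 7)


def euclidean(p, q):
    """Euclidean distance between two points of equal dimension."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(p, q)))


def knn_predict(k):
    """Majority vote among the k training points closest to test_point.

    Assumption: tied votes are not handled specially; Counter returns
    the label that reached the top count first.
    """
    ranked = sorted(training, key=lambda row: euclidean(row[:2], test_point))
    votes = Counter(label for _, _, label in ranked[:k])
    return votes.most_common(1)[0][0]


# Part (i): distance from every observation to the test point
for x1, x2, label in training:
    print(f"({x1}, {x2}) {label}: {euclidean((x1, x2), test_point):.3f}")

# Parts (ii) and (iii): predictions for K = 1 and K = 3
print("K = 1:", knn_predict(1))
print("K = 3:", knn_predict(3))
```

With this data, the nearest observation is (3, 4) at distance 3, so K = 1 predicts Good. The three nearest observations are (3, 4), (1, 4), and (7, 7), at distances 3, about 3.606, and 4, so K = 3 also predicts Good by a 2-to-1 vote.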




All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd