Compute the Euclidean distance

Reference no: EM132225850

Homework

Q1. For each part below indicate whether we would generally expect the performance of a flexible statistical learning method to be better or worse than an inflexible method. Justify your answer.

(i) The sample size n is extremely large, and the number of predictors p is small.

(ii) The number of predictors p is extremely large, and the number of observations n is small.

(iii) The relationship between the predictors and response is highly non-linear.

(iv) The variance of the error terms, i.e. σ² = Var(ε), is extremely high.

Q2. We have data from a questionnaire survey (asking people's opinions) and from objective testing on two attributes (acid durability and strength), used to classify whether a special paper tissue is good or not. The following table gives the training sample:

X1 = Acid Durability (seconds)    X2 = Strength (kg/square meter)    Y
7                                 7                                  Bad
7                                 4                                  Bad
3                                 4                                  Good
1                                 4                                  Good
5                                 3                                  Good
7                                 5                                  Good

Now the factory produces a new paper tissue that passes the laboratory test with X1 = 3 and X2 = 7. Suppose we wish to use this data set to predict Y for this test point using K-nearest neighbors.

(i) Compute the Euclidean distance between each observation and the test point X1 = 3 and X2 = 7.

(ii) What is our prediction with K = 1? Why?

(iii) What is our prediction with K = 3? Why?

(iv) If the Bayes decision boundary in this problem is highly non-linear, then would we expect the best value for K to be large or small? Why?
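
The distance computation and majority vote in parts (i) to (iii) can be sketched in Python as follows. This is a minimal illustration, not a prescribed solution: the names `training`, `test_point`, `euclidean`, and `knn_predict` are assumptions introduced here, and ties in distance or in votes are resolved by input order, which the assignment does not specify.

```python
import math
from collections import Counter

# Training sample from the table: (X1, X2, Y)
training = [
    (7, 7, "Bad"),
    (7, 4, "Bad"),
    (3, 4, "Good"),
    (1, 4, "Good"),
    (5, 3, "Good"),
    (7, 5, "Good"),
]

# Test point given in the question: X1 = 3, X2 = 7
test_point = (3, 7)


def euclidean(a, b):
    """Euclidean distance between two points of equal dimension."""
    return math.sqrt(sum((ai - bi) ** 2 for ai, bi in zip(a, b)))


def knn_predict(train, point, k):
    """Majority vote among the k training rows closest to `point`.

    Assumption: ties are broken by the original row order, because
    sorted() is stable and Counter.most_common() keeps first-seen order.
    """
    ranked = sorted(train, key=lambda row: euclidean(row[:2], point))
    votes = Counter(label for _, _, label in ranked[:k])
    return votes.most_common(1)[0][0]


if __name__ == "__main__":
    # Part (i): distance from every observation to the test point
    for x1, x2, label in training:
        d = euclidean((x1, x2), test_point)
        print(f"({x1}, {x2}) {label}: distance = {d:.3f}")

    # Parts (ii) and (iii): predictions for K = 1 and K = 3
    for k in (1, 3):
        print(f"K = {k}: prediction = {knn_predict(training, test_point, k)}")
```

Running the script prints the six distances and the two predictions, which can be compared against a hand calculation using d = sqrt((X1 - 3)² + (X2 - 7)²).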

