How can we evaluate the k-means model on data

Assignment Help Other Engineering
Reference no: EM131760326

Problem 1: Explain why and when one would want to use k-means clustering, furthermore, give an explanation of how the algorithm works given a set of data points

Problem 2: Explain one method of picking the `k' in k-means clustering

Problem 3: What is the problem with picking such `k' centroids randomly? Can you devise a better method to pick k that could resolve this problem?

Problem 4: How can we evaluate the k-means model on data ?

Problem 5: Explain the similarities and differences between K-means and Linear Regression. When would you use linear regression instead of k-means?

Problem 6: Consider the following problem context:
We want to model the relationship between the number of students that complain about fees to the department head, with the time spent by the head to deal with such student complaints. We know that there are 4 classes in the department, `information science 101' (11 students), `programming 101' (5 students) and `statistics 202' (3 students) and 'distributed systems 402 (12 students)'. There are no common students between these classes.

- Suggest what the the outcome and input variables could be and whether the latter should be understood as categorical or numerical

- Write a mathematical expression for the regression line for this problem (see online help about how to write mathematical statements in latex)

- What would be the input variables for the above problem context?

- State how the answer from (b) and (c) would then be used to complete the model.

- Discuss briefly your strategy for validating the above model

Problem 7: Explain the difference between observed outcome, line fitting error, estimated/predicted values, and the residuals.

Problem 8: After designing a linear regression model for two variables, you discover the following residual distribution ref fig1. What does this mean?

Give an example of a plot that would correspond to this residual graph.

Classification and Validation

Problem 9: Explain the use case for logistic regression, and state at least one similarity and one difference between logistic regression and linear regression

Problem 10: Explain the function of the ROC curve for logistic regression
Your answer should mention null, alternative hypothesis, true positives and false positives and classifier thresholds.

Verified Expert

The assignment required to study and research about Roc curve, k-means clustering and answer the problems given in the assignment. The assignment further required to give examples along with the explanation and their relations.

Reference no: EM131760326

Questions Cloud

Compare the consistency of sales : You are employed as a statistician for a company that makes household products, which are sold by part-time salespeople who work during their spare time.
Condition of creative and new : The condition of creative and new, are always arrived at with a thought process first. It must be seen in the mind's eye before it can be expressed.
Determine the total allocated overhead cost : Determine the total allocated overhead cost for January, March, and August. (Do not round intermediate calculations. Round your answers to the nearest.
Describe competency models : Describe competency models, case-based decision making, and systems thinking.
How can we evaluate the k-means model on data : Explain why and when one would want to use k-means clustering, furthermore, give an explanation of how the algorithm works given a set of data points
Website troubleshooting-search engine optimization : You own a consultant firm that offers the following services: Website troubleshooting, Search Engine Optimization (SEO),
Discuss the concept of reasonable assurance : Discuss the concept of reasonable assurance and the degree of confidence that financial statement users should have in the financial statements
Predetermined overhead rate based on direct labor hours : Maureen Corporation estimated its overhead costs would be $22,800 per month except for January when it pays the $182,880 annual insurance premium.
Discussion of business ethics into the public domain : The turmoil in the world’s financial system and near collapse of the banking system has surfaced a discussion of business ethics into the public domain.

Reviews

inf1760326

4/25/2018 5:00:24 AM

Thanks so much, you did a fantastic job writing this custom assignment for me, I can't tell you how much I appreciate you making the changes. This is proof to me that this web site is truly one-of-a kind and really professional. I will definitely be using your services again

len1760326

12/11/2017 5:19:37 AM

please find attachment .... pdf for picture ... and document for questions But please I need best answer for every questions ... please take care about picture residual.pdf I put it with attachments please every question put ans under ... don't give me general please .... use pdf for answers and change your word of tutor ... but please take care some question Problem 8 for residual.pdf Used answers in assignments 2 CPIS 604 to change by with your own word please

Write a Review

Other Engineering Questions & Answers

  Characterization technology for nanomaterials

Calculate the reciprocal lattice of the body-centred cubic and Show that the reciprocal of the face-centred cubic (fcc) structure is itself a bcc structure.

  Calculate the gasoline savings

How much gasoline do vehicles with the following fuel efficiencies consume in one year? Calculate the gasoline savings, in gallons per year, created by the following two options. Show all your work, and draw boxes around your answers.

  Design and modelling of adsorption chromatography

Design and modelling of adsorption chromatography based on isotherm data

  Application of mechatronics engineering

Write an essay on Application of Mechatronics Engineering

  Growth chracteristics of the organism

To examine the relationship between fermenter design and operating conditions, oxygen transfer capability and microbial growth.

  Block diagram, system performance and responses

Questions based on Block Diagram, System Performance and Responses.

  Explain the difference in a technical performance measure

good understanding of Mil-Std-499 and Mil-Std-499A

  Electrode impedances

How did this procedure affect the signal observed from the electrode and the electrode impedances?

  Write a report on environmental companies

Write a report on environmental companies

  Scanning electron microscopy

Prepare a schematic diagram below of the major parts of the SEM

  Design a pumping and piping system

creating the pumping and piping system to supply cool water to the condenser

  A repulsive potential energy should be a positive one

Using the data provided on the webvista site in the file marked vdw.txt, try to develop a mathematical equation for the vdW potential we discussed in class, U(x), that best fits the data

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd