Identify clusters and describe their centroids and business

Assignment Help Computer Engineering
Reference no: EM133360606

Question: Use of the cluster analysis in predictive analytics can be described as follows:

Using existing data we can identify clusters (groups) in data. Each cluster may be described in data terms (using cluster centroids etc.), and each cluster can be explant in terms of its business meaning.

Then when a new data arrives it can be tested to identify which cluster is the closest, which will suggest that the new data belongs to this cluster.

Directions:

  1. Pick any dataset relevant to your major that you would like to analyze. Avoid use the same or similar datasets to one you use for your final project.
  2. Randomly divide it into two chunks 80% and 20% of records.
  3. Select input variables (2 min) that you will use for cluster analysis. Provide reasoning for the selection.
  4. Use SPSS or other tool to apply appropriate cluster analysis method to clusters the larger part of the dataset.
  5. Identify clusters and describe their centroids and business meaning.
  6. If classes are poorly identified by the analysis or their business meaning is hard to describe. Change your variable selection and go to the step 3.
  7. For at least 5 records from the remaining smaller part of the dataset identify the closest cluster centroid. That will be a prediction which cluster those records belong too. Note that they have not been used in cluster identification, therefore this prediction will qualify as an example of predictive analytics.
  8. Submit a Word report describing each step and a result of this process, include relevant scripts and outputs produced by the tool you use.

Reference no: EM133360606

Questions Cloud

Buyer decision-making process : Explain how you went through the buyer decision-making process for your recent purchase of a high-involvement product
What source of differentiation has bmw achieved : What source of differentiation has BMW achieved? Multiple Choice personnel convenience image price.
What is your opinion of denyer viewpoint concerning database : Read an article by Charles Denyer Best Practices for Database Security. What is your opinion of Denyer's viewpoint concerning Database Security Best Practices?
Laurie is avid consumer of conservative news : Laurie is an avid consumer of conservative news and never watches a program that conflicts with her beliefs.
Identify clusters and describe their centroids and business : Using existing data we can identify clusters (groups) in data. Each cluster may be described in data terms (using cluster centroids etc.), and each cluster
Why should buy the firewalls from two different vendors : Your boss suggested installing two firewalls from one vendor. However, you believe that the company should buy the firewalls from two different vendors.
Canadian wealth management landscape : What are the major trends in the Canadian Wealth Management Landscape from the suppliers of capital as well as the users of capital? -
What journalists have reported about your company : Researching a company involves reading the news to see what journalists have reported about your company.
Successful global marketing product strategy : Explain what is a successful global marketing product strategy? Compare the different types of innovation.

Reviews

Write a Review

Computer Engineering Questions & Answers

  Mathematics in computing

Binary search tree, and postorder and preorder traversal Determine the shortest path in Graph

  Ict governance

ICT is defined as the term of Information and communication technologies, it is diverse set of technical tools and resources used by the government agencies to communicate and produce, circulate, store, and manage all information.

  Implementation of memory management

Assignment covers the following eight topics and explore the implementation of memory management, processes and threads.

  Realize business and organizational data storage

Realize business and organizational data storage and fast access times are much more important than they have ever been. Compare and contrast magnetic tapes, magnetic disks, optical discs

  What is the protocol overhead

What are the advantages of using a compiled language over an interpreted one? Under what circumstances would you select to use an interpreted language?

  Implementation of memory management

Paper describes about memory management. How memory is used in executing programs and its critical support for applications.

  Define open and closed loop control systems

Define open and closed loop cotrol systems.Explain difference between time varying and time invariant control system wth suitable example.

  Prepare a proposal to deploy windows server

Prepare a proposal to deploy Windows Server onto an existing network based on the provided scenario.

  Security policy document project

Analyze security requirements and develop a security policy

  Write a procedure that produces independent stack objects

Write a procedure (make-stack) that produces independent stack objects, using a message-passing style, e.g.

  Define a suitable functional unit

Define a suitable functional unit for a comparative study between two different types of paint.

  Calculate yield to maturity and bond prices

Calculate yield to maturity (YTM) and bond prices

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd