What benefit does this two-step clustering approach have

Assignment Help Basic Statistics
Reference no: EM131581516

Assignment: Case Problem- Know Thy Customer

Know Thy Customer (KTC) is a financial consulting company that provides personalized financial advice to its clients. As a basis for developing this tailored advising, KTC would like to segment its customers into several representative groups based on key characteristics.

Peyton Avery, the director of KTC's fledging analytics division, plans to establish the set of representative customer profiles based on 600 customer records in the fileKnowThyCustomer. Each customer record contains data on age, gender, annual income, marital status, number of children, whether the customer has a car loan, and whether the customer has a home mortgage. KTC's market research staff has determined that these seven characteristics should form the basis of the customer clustering.

Peyton has invited a summer intern, Danny Riles, into her office so they can discuss how to proceed. As they review the data on the computer screen, Peyton's brow furrows as she realizes that sis task may not be trivial. The data contains both categorical variables (Female, Married, Car, Mortgage), and interval variables (Age, Income, and Children).

Managerial Report

Playing the role of Peyton, you must write a report documenting the construction of the representative customer profiles. Because Peyton would like to use this report as a training reference for interns such as Danny, your report should experiment with several approaches and explain the strengths and weaknesses of each. In particular, your report should include the following analyses:

1. Using k-means clustering on all seven variables, experiment with different values of k. Recommend a value of k and describe these k clusters according to their "average" characteristics. Why might k-means clustering not be a good method to use for these seven variables?

2. Using hierarchical clustering all seven variables, experiment with using complete linkage and group average linkage as the clustering method. Recommend a set of customer profiles (clusters). Describe these clusters according to their "average" characteristics. Why might hierarchical clustering not be a good method to use for these seven variables?

3. Apply a two-step clustering method:

a. Apply hierarchical clustering on the binary variables Female, Married, Car, and Mortgage to recommend a set of clusters. Using Matching Coefficients as the similarity measure and group average linage as the clustering method.

b. Based on the clusters from part (a), split the original 600 observations into mseparate data sets, where m is the number of clusters recommended from part (a). For each of these m data set, apply 2-means clustering using Age, Income, and Children as variables. This will generate a total of 2m clusters. Describe these 2m clusters according to their "average" characteristics.

What benefit does this two-step clustering approach have over the approaches in parts (1) and (2)? What weakness does it have?

Reference no: EM131581516

Questions Cloud

What rules would you propose instituting : Suppose that you and three roommates are living in an apartment or dorm suite with a common area for living, dining, and cooking.
Reduce the community fishing activities : Suppose that a small fishing community in a developing country has been operating successfully for centuries without any regulations.
Mechanics of a speculative attack and double play process : Discuss the attack on the Hong Kong dollar. Discuss the mechanics of a speculative attack and the “double play” process
What do the points on the light curve represent : Summarize briefly, in your own words, what Planet Hunters is doing: Describe what the x and y axes on the plot represent
What benefit does this two-step clustering approach have : What benefit does this two-step clustering approach have over the approaches in parts (1) and (2)? What weakness does it have?
Post a brief comparison of the health status : Post a brief comparison of the health status of the two EU countries you selected with that of the U.S. .
Public lands should be sold to private interests : Some people have suggested that certain public lands would be managed more efficiently if they were auctioned off to the highest bidders.
Explain parole boards do even more for crime victims : In what ways could probation officers, corrections officials, and parole boards do even more for crime victims
What is evidence-based criminal justice : The assignment is to write a paper answering the question, What is evidence-based criminal justice. As you begin your research, remember that data drive

Reviews

Write a Review

Basic Statistics Questions & Answers

  Statistics-probability assignment

MATH1550H: Assignment:  Question:  A word is selected at random from the following poem of Persian poet and mathematician Omar Khayyam (1048-1131), translated by English poet Edward Fitzgerald (1808-1883). Find the expected value of the length of th..

  What is the least number

MATH1550H: Assignment:  Question:     what is the least number of applicants that should be interviewed so as to have at least 50% chance of finding one such secretary?

  Determine the value of k

MATH1550H: Assignment:  Question:     Experience shows that X, the number of customers entering a post office during any period of time t, is a random variable the probability mass function of which is of the form

  What is the probability

MATH1550H: Assignment:Questions: (Genetics) What is the probability that at most two of the offspring are aa?

  Binomial distributions

MATH1550H: Assignment:  Questions:  Let’s assume the department of Mathematics of Trent University has 11 faculty members. For i = 0; 1; 2; 3; find pi, the probability that i of them were born on Canada Day using the binomial distributions.

  Caselet on mcdonald’s vs. burger king - waiting time

Caselet on McDonald’s vs. Burger King - Waiting time

  Generate descriptive statistics

Generate descriptive statistics. Create a stem-and-leaf plot of the data and box plot of the data.

  Sampling variability and standard error

Problems on Sampling Variability and Standard Error and Confidence Intervals

  Estimate the population mean

Estimate the population mean

  Conduct a marketing experiment

Conduct a marketing experiment in which students are to taste one of two different brands of soft drink

  Find out the probability

Find out the probability

  Linear programming models

LINEAR PROGRAMMING MODELS

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd