Write the explicit form of the first pc

Assignment Help Basic Statistics
Reference no: EM132853514

Problem Statement: The 'Hair Salon.csv' dataset contains various variables used for the context of Market Segmentation. This particular case study is based on various parameters of a salon chain of hair products. You are expected to do Principal Component Analysis for this case study according to the instructions given in the following rubric.

Note: This particular dataset contains the target variable satisfaction as well. Please do drop this variable before doing Principal Component Analysis.

1) Perform Exploratory Data Analysis [both univariate and multivariate analysis to be performed]. The inferences drawn from this should be properly documented.

2) Scale the variables and write the inference for using the type of scaling function for this case study.

3) Comment on the comparison between covariance and the correlation matrix after scaling.

4) Check the dataset for outliers before and after scaling. Draw your inferences from this exercise.

5) Build the covariance matrix, eigenvalues and eigenvector.

6) Write the explicit form of the first PC (in terms of Eigen Vectors)

7) Discuss the cumulative values of the eigenvalues. How does it help you to decide on the optimum number of principal components? What do the eigenvectors indicate? Perform PCA and export the data of the Principal Component scores into a data frame.

8) Mention the business implication of using the Principal Component Analysis for this case study.

This is my drive location where the data set can be accessed. There are 3 files - Hair Salon.csv , Data Dictionary & the PCA question file.

https://drive.google.com/drive/u/1/folders/16yzWowBSDUC8ZZEYpXgXRIIe08b7zpE5

Reference no: EM132853514

Questions Cloud

Compute the probability that a randomly selected student : Compute the probability that a randomly selected student speaks French, given that the student is male
Determine the cost of sales for the month of July : Question - Consider the following: Expected sales for July and August are $8,000 and $8,300 respectively. Determine the cost of sales for the month of July
Calculate the annual amount of sinking fund : A sinking fund is to be set up for the purchase of a machine in 5 years time. The present cost of the machine is RM6000. Calculate the annual amount
Compare and contrast how regional economics were connected : Compare and contrast how regional economics were connected to political systems in the New England, Middle, and Southern colonies.
Write the explicit form of the first pc : Problem Statement: The 'Hair Salon.csv' dataset contains various variables used for the context of Market Segmentation. This particular case study is based
What are the advantages of the matching principle : What are the advantages and disadvantages of the matching principle and the revenue recognition principles from an investor's perspective
What is the z-score that corresponds to a raw : You are given the following information for a sample (mean = 40; standard deviation = 5). What is the z-score that corresponds to a raw score of 60?
Aristotle virtues and aquinas theological virtues : Discuss the moral difference between Aristotle's Virtues and Aquinas' theological virtues. Give your own example to illustrate the difference
Assignment - Calculating Gross Earnings : Assignment- Calculating Gross Earnings - Patrick Nolan is a bank teller. He receives a weekly salary of $630 for a 35-hour week. What is his regular hourly rate

Reviews

Write a Review

Basic Statistics Questions & Answers

  What is the irr for the project in problem

What is the IRR for the project in problem #6?  If the Minimum Attractive Rate of Return (MARR) is 20%, would you recommend investing in this project?  Discuss any reasons that you might not invest in a project even if it exceeds the MARR.

  Find the mean and standard deviation of the strengths what

specifications for an aircraft bolt require that the ultimate tensile strength be at least 18 kn. it is known that 10

  Determining pattern represents mean-median and mode

Pattern that most accurately represents mean, median and mode is?

  What is the net requirement for caramel turtles

An MRP planner has prepared the following table showing product structure, lead times (orders are lot-for-lot), and quantities on hand:

  For number 5 above what is the mean and standard deviation

1. the probability of being left handed is 10. you will sample 10 people what is the probability that you sample people

  Suppose that the usual one way air fare to a certain city

on one busy holiday weekend a national airline has many requests for a standby flights at half of the usual one way

  Report the slope and explain what it means

Highway and City The figure shows the relationship between the number of kilometers per liter on the highway and that in the city for some cars.

  Find the probability distribution for y

a. Find the probability distribution for y, the number of dollars won (use the rule for equally likely events).

  Determining the health coverage-frequencies

Health coverage, frequencies. The Behavioral Risk Factor Surveillance System (BRFSS) is an annual telephone survey designed to identify risk factors

  Probability that the fisher chosen from clearwater

Suppose that one fisher from each park is chosen at random. What is the probability that the fisher chosen from Clearwater had a license and the fisher chosen from Mountain View did not have a license?

  Smoking habits of a group of college students

If a student is chosen at random, find the probability of getting someone who is a non-smoker. Round your answer to three decimal places.

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd