Gaussian mixture model

Reference no: EM132795586

Multivariate Statistical Analysis Assignment

Question 1. Consider centering our multivariate data, i.e. $x_j^* = x_j - \bar{x}$, for $j = 1, 2, \ldots, n$. The first principal component is obtained by maximising the sample variance of $y_1 = u_1^T x_j$. Show that this is equivalent to minimising the residual sum of squares, where the $j$th residual is defined as

$$\left\| x_j^* - \frac{u_1^T x_j^*}{u_1^T u_1}\, u_1 \right\|$$
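
The equivalence in Question 1 can be checked numerically before it is proved. The sketch below is only an illustration under assumptions: the data are simulated (hypothetical, not from the assignment), and the helper names `projected_ss` and `residual_ss` are made up for this example. It relies on Pythagoras: for any direction u, the squared length of a centred observation splits into its squared projection onto u plus its squared residual, so maximising one sum minimises the other.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical data: 200 observations of 3 correlated variables.
A = np.array([[2.0, 0.5, 0.0],
              [0.0, 1.0, 0.3],
              [0.0, 0.0, 0.5]])
X = rng.normal(size=(200, 3)) @ A
Xc = X - X.mean(axis=0)  # centred observations x*_j


def projected_ss(u):
    """Sum over j of (u^T x*_j)^2 / (u^T u)."""
    return np.sum((Xc @ u) ** 2) / (u @ u)


def residual_ss(u):
    """Sum over j of ||x*_j - (u^T x*_j / u^T u) u||^2."""
    proj = np.outer(Xc @ u / (u @ u), u)
    return np.sum((Xc - proj) ** 2)


total_ss = np.sum(Xc ** 2)

# For any direction u, projected and residual sums of squares add to the total.
for _ in range(5):
    u = rng.normal(size=3)
    assert np.isclose(projected_ss(u) + residual_ss(u), total_ss)

# The leading eigenvector of the sample covariance matrix maximises the
# projected sum of squares, hence it minimises the residual sum of squares.
eigvals, eigvecs = np.linalg.eigh(np.cov(Xc, rowvar=False))  # ascending order
u1 = eigvecs[:, -1]
```

Because the total sum of squares does not depend on u, the direction that maximises the sample variance of the scores is the same direction that minimises the residual sum of squares.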

Question 2. Varimax is a commonly used orthogonal rotation technique. When applied to factor analysis, does it affect the values of the communalities? Please justify your answer.
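
A quick numerical check can guide the justification. The sketch below assumes a hypothetical 8 x 3 loading matrix and a random orthogonal matrix standing in for a varimax rotation (varimax picks a particular orthogonal matrix, but the argument only needs orthogonality). It compares the row sums of squared loadings before and after rotation.

```python
import numpy as np

rng = np.random.default_rng(1)

# Hypothetical loadings for 8 variables on 3 factors (not the assignment's values).
L = 0.5 * rng.uniform(-1.0, 1.0, size=(8, 3))

# Random orthogonal matrix T (T @ T.T = I), standing in for a varimax rotation.
T, _ = np.linalg.qr(rng.normal(size=(3, 3)))
L_rot = L @ T

# Communalities are the row sums of squared loadings.
h2 = np.sum(L ** 2, axis=1)
h2_rot = np.sum(L_rot ** 2, axis=1)
```

Since the rotated loadings satisfy (LT)(LT)^T = L L^T, the diagonal of L L^T, which holds the communalities, is the same in both cases.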

Question 3. A study was conducted to investigate the factors and predictors leading to the bankruptcy of firms. As part of the study, a factor analysis was conducted and the model with m = 3 factors was chosen. The sample correlation matrix of the eight variables under consideration was given below.

[Figure 1: sample correlation matrix of the eight variables]

The factor loadings were estimated using the principal component method. The three estimated factor loading vectors obtained were

[Figure 2: estimated factor loadings]

and their proportions of total variance explained were 0.287, 0.341, and 0.280, respectively.
(a) Find the specific variances and communalities.

(b) Calculate the residual matrix, given by $R - \hat{L}\hat{L}^T - \Psi$.

(c) Is having m = 3 factors appropriate? Justify your answer.

(d) In the context of this study, a factor loading less than 40% is considered small. The variables can be categorized into 3 groups:
{[x1, x2, x3], [x4, x5, x6], [x7, x8]}

Comment on any findings you observe. For example, do any of the groups contribute significantly to one or more factors?
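
The calculations in parts (a) and (b) follow a fixed recipe once the loadings are known. The sketch below is a minimal illustration, assuming a hypothetical correlation matrix built from simulated data (the assignment's matrix is in the figure and is not reproduced here). The function name `pc_factor_solution` is made up for this example. It uses the principal component method: the loadings are the leading eigenvectors scaled by the square roots of their eigenvalues, the communalities are the row sums of squared loadings, and the specific variances are one minus the communalities.

```python
import numpy as np


def pc_factor_solution(R, m):
    """Principal component factor solution with m factors for a correlation matrix R."""
    eigvals, eigvecs = np.linalg.eigh(R)          # ascending order
    order = np.argsort(eigvals)[::-1]             # re-order to descending
    eigvals, eigvecs = eigvals[order], eigvecs[:, order]
    L = eigvecs[:, :m] * np.sqrt(eigvals[:m])     # estimated loadings (p x m)
    communalities = np.sum(L ** 2, axis=1)        # h_i^2
    specific = 1.0 - communalities                # psi_i = 1 - h_i^2
    residual = R - L @ L.T - np.diag(specific)    # R - L L^T - Psi
    prop_var = eigvals[:m] / R.shape[0]           # proportion of total variance
    return L, communalities, specific, residual, prop_var


# Hypothetical data: 8 variables driven by 3 latent factors plus noise.
rng = np.random.default_rng(5)
F = rng.normal(size=(300, 3))
W = rng.uniform(-0.9, 0.9, size=(8, 3))
data = F @ W.T + rng.normal(size=(300, 8))
R = np.corrcoef(data, rowvar=False)

L, h2, psi, resid, prop = pc_factor_solution(R, m=3)
```

The residual matrix has a zero diagonal by construction, so the size of its off-diagonal entries and the cumulative proportion `prop.sum()` are the quantities to inspect when judging whether m = 3 is adequate.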

Question 4. A survey was conducted on n = 70 randomly chosen people to study the association between owning certain assets and happiness indices. The variables were total price of cars owned (x1), total price of TVs owned (x2), price of most valuable asset (apart from cars and houses) (x3), happiness score (x4), and satisfaction score (x5).

The sample correlation matrix obtained was given by

[Figure 3: sample correlation matrix of x1, ..., x5]


(a) Find the sample canonical correlations.
(b) Perform a statistical test to determine whether the two groups of variables (x1, x2, x3 and x4, x5) are uncorrelated.
(c) Using standardized variables, construct the canonical variates corresponding to the "significant" canonical correlation(s) and interpret them.
(d) Do the asset variables provide much information about the happiness variables (i.e. happiness score and satisfaction score)?
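
Parts (a) and (b) can be sketched in code as follows. This is an illustration under assumptions: the correlation matrix is computed from simulated data (hypothetical, since the assignment's matrix is in the figure), and the function names are made up for this example. The squared canonical correlations are taken as the eigenvalues of R11^{-1} R12 R22^{-1} R21, and the test statistic is Bartlett's large-sample statistic, which is compared with a chi-square distribution on pq degrees of freedom.

```python
import numpy as np


def canonical_correlations(R, p):
    """Sample canonical correlations between the first p and the remaining variables."""
    R11, R12 = R[:p, :p], R[:p, p:]
    R21, R22 = R[p:, :p], R[p:, p:]
    # Eigenvalues of R11^{-1} R12 R22^{-1} R21 are the squared canonical correlations.
    M = np.linalg.solve(R11, R12) @ np.linalg.solve(R22, R21)
    eigvals = np.clip(np.linalg.eigvals(M).real, 0.0, 1.0)
    k = min(p, R.shape[0] - p)
    return np.sqrt(np.sort(eigvals)[::-1][:k])


def bartlett_statistic(rho, n, p, q):
    """Bartlett's statistic for H0: the two sets are uncorrelated; returns (stat, df)."""
    stat = -(n - 1 - 0.5 * (p + q + 1)) * np.sum(np.log(1.0 - rho ** 2))
    return stat, p * q


# Hypothetical data: n = 70, with x4 made to depend on x1.
rng = np.random.default_rng(2)
Z = rng.normal(size=(70, 5))
Z[:, 3] += 0.8 * Z[:, 0]
R = np.corrcoef(Z, rowvar=False)

rho = canonical_correlations(R, p=3)
stat, df = bartlett_statistic(rho, n=70, p=3, q=2)
```

The null hypothesis is rejected at level alpha when `stat` exceeds the upper alpha quantile of the chi-square distribution with `df` degrees of freedom.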

Question 5. Consider two groups in a city:
π1 : ride-on-mower owners
π2 : owners without ride-on mowers
In order to identify the best sales prospects for an intensive sales campaign, a ride-on mower manufacturer is interested in categorising families as prospective owners or non-owners on the basis of income (x1) and land size (x2). A random sample of n1 = 12 current owners and n2 = 12 current non-owners was surveyed. The data are given in the following table.

[Figure 4: income and land size data for the sampled owners and non-owners]

(a) Develop a linear classification function for this data.
(b) Using the function developed in part (a), construct a 'confusion matrix' by classifying the given observations in the data.
(c) Find the apparent error rate.
(d) State any assumptions you make to justify the use of the method in parts (a) and (b).
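
Parts (a) to (c) can be sketched as below. This is a minimal illustration, assuming equal prior probabilities, equal misclassification costs, and a common covariance matrix, and using simulated data in place of the table (the numbers are hypothetical). The function names are made up for this example. The rule allocates an observation x to group 1 when a^T x is at least the midpoint m, where a = S_pooled^{-1}(xbar1 - xbar2) and m = a^T(xbar1 + xbar2)/2.

```python
import numpy as np


def fit_linear_rule(X1, X2):
    """Fisher's linear classification function using the pooled covariance matrix."""
    n1, n2 = len(X1), len(X2)
    m1, m2 = X1.mean(axis=0), X2.mean(axis=0)
    S_pooled = ((n1 - 1) * np.cov(X1, rowvar=False)
                + (n2 - 1) * np.cov(X2, rowvar=False)) / (n1 + n2 - 2)
    a = np.linalg.solve(S_pooled, m1 - m2)   # discriminant coefficients
    cutoff = 0.5 * a @ (m1 + m2)             # midpoint between projected group means
    return a, cutoff


def classify(X, a, cutoff):
    """Return 1 for group 1 (owners) and 2 for group 2 (non-owners)."""
    return np.where(X @ a >= cutoff, 1, 2)


def confusion_matrix(X1, X2, a, cutoff):
    """Rows: actual group; columns: predicted group."""
    pred1, pred2 = classify(X1, a, cutoff), classify(X2, a, cutoff)
    return np.array([[np.sum(pred1 == 1), np.sum(pred1 == 2)],
                     [np.sum(pred2 == 1), np.sum(pred2 == 2)]])


# Hypothetical (income, land size) samples with a shared covariance matrix.
rng = np.random.default_rng(3)
cov = [[200.0, 5.0], [5.0, 4.0]]
X1 = rng.multivariate_normal([110.0, 20.0], cov, size=12)  # owners
X2 = rng.multivariate_normal([90.0, 17.0], cov, size=12)   # non-owners

a, cutoff = fit_linear_rule(X1, X2)
cm = confusion_matrix(X1, X2, a, cutoff)
aper = (cm[0, 1] + cm[1, 0]) / cm.sum()  # apparent error rate
```

The apparent error rate is computed on the same observations used to build the rule, so it tends to understate the true error rate.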

Bonus question
The following question is optional and may be attempted for bonus marks.

Question 6. Derive an EM algorithm for calculating the maximum likelihood estimate of the parameters of the isotropic Gaussian mixture model, where the $i$th component of the mixture model has distribution of the form $N_p(\mu_i, \sigma_i^2 I)$ with $\mu_i$ and $\sigma_i^2$ being unknown.
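
A derivation for Question 6 can be checked against a small implementation. The sketch below shows one possible form of the updates, not a definitive solution. It assumes K components with unknown mixing weights, random initialisation from the data, and a small variance floor as a numerical safeguard. The function names and the simulated data are hypothetical. The E-step computes responsibilities r_jk proportional to weight_k times the component density. The M-step sets N_k to the sum of r_jk over j, weight_k = N_k / n, mu_k to the responsibility-weighted mean, and sigma2_k to the weighted sum of squared distances divided by p N_k.

```python
import numpy as np


def log_isotropic_normal(X, mu, sigma2):
    """log N_p(x_j | mu_k, sigma2_k I) for every observation j and component k."""
    p = X.shape[1]
    sq_dist = np.sum((X[:, None, :] - mu[None, :, :]) ** 2, axis=2)  # shape (n, K)
    return -0.5 * (p * np.log(2.0 * np.pi * sigma2) + sq_dist / sigma2)


def em_isotropic_gmm(X, K, n_iter=100, seed=0):
    n, p = X.shape
    rng = np.random.default_rng(seed)
    mu = X[rng.choice(n, size=K, replace=False)]  # initial means: random data points
    sigma2 = np.full(K, X.var())                  # initial common variance
    weights = np.full(K, 1.0 / K)
    loglik = []
    for _ in range(n_iter):
        # E-step: responsibilities, computed on the log scale for stability.
        log_joint = np.log(weights) + log_isotropic_normal(X, mu, sigma2)
        log_norm = np.logaddexp.reduce(log_joint, axis=1)
        loglik.append(log_norm.sum())             # log-likelihood at current parameters
        resp = np.exp(log_joint - log_norm[:, None])
        # M-step: closed-form updates for weights, means, and variances.
        Nk = resp.sum(axis=0)
        weights = Nk / n
        mu = (resp.T @ X) / Nk[:, None]
        sq_dist = np.sum((X[:, None, :] - mu[None, :, :]) ** 2, axis=2)
        sigma2 = np.maximum(np.sum(resp * sq_dist, axis=0) / (p * Nk), 1e-8)
    return weights, mu, sigma2, np.array(loglik)


# Hypothetical data: two well-separated isotropic clusters in two dimensions.
rng = np.random.default_rng(4)
X = np.vstack([rng.normal(0.0, 1.0, size=(150, 2)),
               rng.normal(6.0, 0.5, size=(150, 2))])
weights, mu, sigma2, loglik = em_isotropic_gmm(X, K=2)
```

A useful sanity check on any derived update is that the recorded log-likelihood never decreases from one iteration to the next.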
