Transformation of data, Applied Statistics

Assignment Help:

PCA is a linear transformation that transforms the data to a new coordinate system such that the greatest variance by any projection of the data comes to lie on the first coordinate (called the first principal component), the second greatest variance on the second coordinate, and so on. The PCA can be used for dimensionality reduction in a dataset while retaining those characteristics of the dataset that contribute most to its variance, by keeping lower-order principal components and ignoring higher-order ones. Such low-order components often contain the "most important" aspects of the data. But this is not necessarily the case, depending on the application. Let p and tn denote respectively the original and reduced number of variables. The original variables are denoted X. In the simplest case our measure of accuracy of reconstruction is the sum ofp squared multiple correlations between X-variables and the predictions of X made froin the factors. In the more general case we can weight each squared multiple correlation by the variance of the corresponding X-variable.

Since we can set those variances ourselves by multiplying scores on each variable,by any constant we choose, this amounts to the ability to assign any weights we choose to the different variables.


Related Discussions:- Transformation of data

Median for grouped data, Grouped Data  In order to find the median, the...

Grouped Data  In order to find the median, the median class is to be first located and then interpolation is to be used by assuming that items are evenly spaced over the entire

Steps in anova, Steps in ANOVA The three steps which constitute the ana...

Steps in ANOVA The three steps which constitute the analysis of variance are as follows: To determine an estimate of the population variance from the variance that exi

Example of discrete random variable, Example of discrete random variable: ...

Example of discrete random variable: 1. What is a discrete random variable? Give three examples from the field of business. 2. Of 1000 items produced in a day at XYZ Manufa

Classification of universe, Classification of Universe The universe may...

Classification of Universe The universe may be classified either on the basis of number of units and on the basis   of existence of units as is clear from the following chart :

Mean and median, The amounts of money won by the top ten finishers in a fam...

The amounts of money won by the top ten finishers in a famous car race are listed below. $1,172,246    $163,659    $440,584    $350,634     $290,596 $186,731    $145,809     $143,2

Atmospheric circulation and precipitation, (a) Elevation (m)...

(a) Elevation (m) 0 400 800 1200 1600 2000 2400 2800 3200 4000 480

Regression, why we use dummy variable

why we use dummy variable

Write Your Message!

Captcha
Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd