Transformation of data, Applied Statistics

Assignment Help:

PCA is a linear transformation that transforms the data to a new coordinate system such that the greatest variance by any projection of the data comes to lie on the first coordinate (called the first principal component), the second greatest variance on the second coordinate, and so on. The PCA can be used for dimensionality reduction in a dataset while retaining those characteristics of the dataset that contribute most to its variance, by keeping lower-order principal components and ignoring higher-order ones. Such low-order components often contain the "most important" aspects of the data. But this is not necessarily the case, depending on the application. Let p and tn denote respectively the original and reduced number of variables. The original variables are denoted X. In the simplest case our measure of accuracy of reconstruction is the sum ofp squared multiple correlations between X-variables and the predictions of X made froin the factors. In the more general case we can weight each squared multiple correlation by the variance of the corresponding X-variable.

Since we can set those variances ourselves by multiplying scores on each variable,by any constant we choose, this amounts to the ability to assign any weights we choose to the different variables.


Related Discussions:- Transformation of data

Corelation regrassion, the two regrassion line will pass through the point ...

the two regrassion line will pass through the point (x,y)

Different analyses of recurrent events data, Different analyses of recurren...

Different analyses of recurrent events data: The bladder cancer data listed in Wei, Lin, and Weissfeld (1989) is used in Example 54.8/49.8 of SAS to  illustrate different anal

Correlation analysis, Correlation Analysis Correlation Analysis is perf...

Correlation Analysis Correlation Analysis is performed to measure the degree of association between two variables. The measure is called coefficient of correlation. The coeffic

Control chart, construction of control chart,n chart

construction of control chart,n chart

Mode, Mode The mode is the value which occurs most frequ...

Mode The mode is the value which occurs most frequently in a set of observations on the point of maximum frequency and around which other items of the set cluste

Regression analysis, Of the 6,325 kindergarten students who participated in...

Of the 6,325 kindergarten students who participated in the study, almost half or 3,052 were eligible for a free lunch program. The categorical variable sesk (1 == free lunch, 2 = n

Time series analysis., how is a free hand graph secular trend method plotte...

how is a free hand graph secular trend method plotted

#title., Features of index numbers

Features of index numbers

Which average is to be used to describe statistical data?, There ar...

There are situations where none of the three averages is fully satisfactory. For example, if the number of items in a series is very small, none of these av

Write Your Message!

Captcha
Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd