Assumptions in regression, Applied Statistics

Assignment Help:

Assumptions in Regression

To understand the properties underlying the regression line, let us go back to the example of model exam and main exam. Now we can find an estimate of a student's main exam points, if we also know his or her points on the model exam. As we have stated, a student with score of 85 in the model exam should receive points for the main exam in the vicinity of 75 to 95.

If we knew the model exam scores of all students along with their main exam scores, we would then have the population of values. The mean and the variance of the population of the model exam would be μx and σx2 and respectively. The measurements for the main exam points are  μy  and  σy2 .

The assumptions in regression are:

  1. The relationship between the distributions X and Y is linear, which implies the formula E(Y|X=x) = A + Bx at any given value of X = x.

  2. At each X, the distribution of Yx is normal, and the variances  σx2  are equal. This implies that E's have the same variance,  σ2.

  3. The Y-values are independent of each other.

  4. No assumption is made regarding the distribution of X.

    Since we do not have all of the students' course points and main exam points we must estimate the regression line E(Y|X = x) = A + BX.

    The figure shows a line that has been constructed on the scatter diagram. Note that the line seems to be drawn through the collective mid-point of the plotted points. The term  2148_simple linear regression.png  is the estimate of the true mean of Y's at any particular X = x.

    Figure 8

    682_assumptions in regression.png

Related Discussions:- Assumptions in regression

Standard error, Standard Error The measure of reliability of the estima...

Standard Error The measure of reliability of the estimating equation that we have developed is given by standard error of estimate. The standard error of estimate represented b

Interaction of enviornment and gene , entropy test to measure interaction b...

entropy test to measure interaction between enviornmental factors and genes

Mode, Mode Mode is the value of the observation which occurs with the  ...

Mode Mode is the value of the observation which occurs with the   greatest  frequency and thus  it is the most fashionable value, Mode has been derived from French  word  La  m

Eigenvalue-based rules, Henry Kaiser suggested a rule for selecting a numbe...

Henry Kaiser suggested a rule for selecting a number of components m less than the number needed for perfect reconstruction: set m equal to the number of eigenvalues greater than I

Stratified random sampling, Stratified Random Sampling: This method of ...

Stratified Random Sampling: This method of sampling is used when the population is comprised of natural subdivision of units, The method consist in classifying the population u

Find the minimum constant workforce, Find the minimum constant workforce: ...

Find the minimum constant workforce: ABC Company, a manufacturer of roofing supplies, has developed monthly forecasts for roofing tiles. The forecasted demand and the expected

Stratified sampling, Stratified Sampling Stratified Sampling is ...

Stratified Sampling Stratified Sampling is generally used when the population is heterogeneous. In this case, the population is first subdivided into several parts (or s

Multiple correspondence analysis, Correspondence analysis is an exploratory...

Correspondence analysis is an exploratory technique used to analyze simple two-way and multi-way tables containing measures of correspondence between the rows and colulnns of an

Write Your Message!

Captcha
Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd