Assumptions in regression, Applied Statistics

Assumptions in Regression

To understand the properties underlying the regression line, let us go back to the example of the model exam and the main exam. If we know a student's points on the model exam, we can estimate his or her points on the main exam. As stated earlier, a student who scores 85 on the model exam should score somewhere in the vicinity of 75 to 95 on the main exam.

If we knew the model exam scores of all students along with their main exam scores, we would have the population of values. The mean and the variance of the population of model exam scores would be μ_x and σ_x², respectively. The corresponding parameters for the main exam scores are μ_y and σ_y².
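As a small illustration of these population quantities, the sketch below computes the mean and variance for a made-up "population" of six students. The scores and the variable names (`model_exam`, `main_exam`) are assumptions invented for this example and do not come from the text.

```python
from statistics import mean, pvariance

# Hypothetical scores for a complete (tiny) population of students.
# These numbers are invented for illustration only.
model_exam = [62, 70, 75, 85, 90, 95]
main_exam = [58, 72, 70, 84, 88, 97]

# Population means: averages over every member of the population.
mu_x = mean(model_exam)
mu_y = mean(main_exam)

# Population variances: pvariance divides by N (not N - 1),
# which is appropriate when the data are the whole population.
var_x = pvariance(model_exam)
var_y = pvariance(main_exam)

print(f"mu_x = {mu_x:.2f}, sigma_x^2 = {var_x:.2f}")
print(f"mu_y = {mu_y:.2f}, sigma_y^2 = {var_y:.2f}")
```

In practice the whole population is rarely available, which is why the regression line has to be estimated from a sample.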

The assumptions in regression are:

  1. The relationship between X and Y is linear: the mean of Y at any given value X = x is E(Y|X = x) = A + Bx.

  2. At each value X = x, the distribution of Y is normal, and its variance is the same for every x. This implies that the errors (the deviations of Y from its mean at that x) all have the same variance, σ².

  3. The Y-values are independent of each other.

  4. No assumption is made regarding the distribution of X.
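One way to see what the four assumptions say together is to generate data that obeys them. The sketch below is only an illustration: the function name `simulate_main_exam` and the parameter values A = 17, B = 0.8 and σ = 5 are assumptions, chosen so that a model exam score of 85 has a mean main exam score of 85 with roughly two standard deviations spanning 75 to 95.

```python
import random

def simulate_main_exam(x_values, A, B, sigma, seed=0):
    """Draw one Y for each x under the four regression assumptions.

    A, B and sigma are hypothetical population parameters.
    """
    rng = random.Random(seed)  # fixed seed so the run is reproducible
    ys = []
    for x in x_values:
        mean_y = A + B * x              # assumption 1: E(Y | X = x) is linear in x
        error = rng.gauss(0.0, sigma)   # assumption 2: normal errors, same sigma at every x
        ys.append(mean_y + error)       # assumption 3: each draw is made independently
    return ys

# Assumption 4: nothing is assumed about X, so the x-values are simply fixed by hand.
model_scores = [55, 60, 70, 70, 85, 85, 95]
main_scores = simulate_main_exam(model_scores, A=17.0, B=0.8, sigma=5.0)

for x, y in zip(model_scores, main_scores):
    print(f"model = {x:3d}  main = {y:6.1f}")
```

Setting `sigma=0.0` removes the error term, so every simulated point falls exactly on the line A + Bx.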

    Since we do not have the model exam and main exam points of all students, we must estimate the regression line E(Y|X = x) = A + Bx from a sample.
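The text does not say here how the line is estimated. A common choice is ordinary least squares, and the sketch below assumes that method. The helper `fit_line` and the sample scores are hypothetical and serve only as an illustration.

```python
def fit_line(xs, ys):
    """Return (a, b), the least-squares estimates of A and B."""
    n = len(xs)
    x_bar = sum(xs) / n
    y_bar = sum(ys) / n
    # Sum of cross-products and sum of squares about the means.
    s_xy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    s_xx = sum((x - x_bar) ** 2 for x in xs)
    b = s_xy / s_xx          # estimated slope
    a = y_bar - b * x_bar    # estimated intercept
    return a, b

# Hypothetical sample of (model exam, main exam) scores, invented for illustration.
model_exam = [62, 70, 75, 85, 90, 95]
main_exam = [58, 72, 70, 84, 88, 97]

a, b = fit_line(model_exam, main_exam)
y_hat_85 = a + b * 85  # estimated mean main exam score for a model exam score of 85

print(f"a = {a:.3f}, b = {b:.3f}")
print(f"estimated mean main exam score at x = 85: {y_hat_85:.1f}")
```

The estimates a and b vary from sample to sample, whereas A and B are fixed population values.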

    The figure shows a line that has been constructed on the scatter diagram. Note that the line seems to be drawn through the collective mid-point of the plotted points. The fitted value ŷ = a + bx, where a and b are sample estimates of A and B, is the estimate of the true mean of Y at any particular X = x.

    Figure 8: Scatter diagram with the estimated regression line.
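The impression that the line passes through the collective mid-point of the points can be made precise if the line is fitted by least squares, which is an assumption here since the text does not name the method. With sample estimates a and b (symbols introduced for this sketch) and sample means x̄ and ȳ, the fitted line passes exactly through the point (x̄, ȳ):

```latex
\begin{align*}
b &= \frac{\sum_{i=1}^{n}(x_i-\bar{x})(y_i-\bar{y})}{\sum_{i=1}^{n}(x_i-\bar{x})^2}, &
a &= \bar{y} - b\,\bar{x}, \\
\hat{y}(\bar{x}) &= a + b\,\bar{x} = (\bar{y} - b\,\bar{x}) + b\,\bar{x} = \bar{y}.
\end{align*}
```

So a student with an average model exam score is estimated to have an average main exam score.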
