Assumptions in regression, Applied Statistics

Assignment Help:

Assumptions in Regression

To understand the properties underlying the regression line, let us go back to the example of model exam and main exam. Now we can find an estimate of a student's main exam points, if we also know his or her points on the model exam. As we have stated, a student with score of 85 in the model exam should receive points for the main exam in the vicinity of 75 to 95.

If we knew the model exam scores of all students along with their main exam scores, we would then have the population of values. The mean and the variance of the population of the model exam would be μx and σx2 and respectively. The measurements for the main exam points are  μy  and  σy2 .

The assumptions in regression are:

  1. The relationship between the distributions X and Y is linear, which implies the formula E(Y|X=x) = A + Bx at any given value of X = x.

  2. At each X, the distribution of Yx is normal, and the variances  σx2  are equal. This implies that E's have the same variance,  σ2.

  3. The Y-values are independent of each other.

  4. No assumption is made regarding the distribution of X.

    Since we do not have all of the students' course points and main exam points we must estimate the regression line E(Y|X = x) = A + BX.

    The figure shows a line that has been constructed on the scatter diagram. Note that the line seems to be drawn through the collective mid-point of the plotted points. The term  2148_simple linear regression.png  is the estimate of the true mean of Y's at any particular X = x.

    Figure 8

    682_assumptions in regression.png

Related Discussions:- Assumptions in regression

Pattie-lynns utility function, Pattie-Lynn's utility function for total as...

Pattie-Lynn's utility function for total assets is, in which A represents total assets in thousands of dollars. (a) Graph Pattie-Lynn's utility function. How would y

Diversity of data , The box plot displays the diversity of data for the tot...

The box plot displays the diversity of data for the totexp; the data ranges from 30 being the minimum value and 390 being the maximum value. The box plot is positively skewed at 1.

Mean and median, The amounts of money won by the top ten finishers in a fam...

The amounts of money won by the top ten finishers in a famous car race are listed below. $1,172,246    $163,659    $440,584    $350,634     $290,596 $186,731    $145,809     $143,2

Cluster sampling, Cluster Sampling This method is also known as multi s...

Cluster Sampling This method is also known as multi stage sampling .Under this method random selection is made of the ultimate or final units from a given stratum. The sampling

Find the mean and standard deviation, Problem : A company supplying ele...

Problem : A company supplying electrical products, places a rush order for electric wires. Consignments of wires are to be sent immediately when they are available. Previous

Agreement, Agreement The degree to which different observers, raters or ...

Agreement The degree to which different observers, raters or diagnostic the tests agree on the binary classification. Measures of agreement like that of the kappa coefficient qu

Methods of forecasting, Methods of Forecasting  Various techniques whic...

Methods of Forecasting  Various techniques which are generally used in business forecasting are as under: 1.      Forecasting  through the opinion of heads  of department

Critique 2, prepare a critical analysis of a quantitative study focusing on...

prepare a critical analysis of a quantitative study focusing on protection of human participants data collection data management and analysis problem statement and interpretation o

Determine nash equilibria, Two students are sitting in a lecture and consid...

Two students are sitting in a lecture and considering whether to ask a question from the professor (both of them are considering the same question). If they both ask, the questi

Different analyses of recurrent events data, Different analyses of recurren...

Different analyses of recurrent events data: The bladder cancer data listed in Wei, Lin, and Weissfeld (1989) is used in Example 54.8/49.8 of SAS to  illustrate different anal

Write Your Message!

Captcha
Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd