Scatter diagram - correlation analysis, Applied Statistics

Assignment Help:

Scatter Diagram

The first step in correlation analysis is to visualize the relationship. For each unit of observation in correlation analysis there is a pair of numerical values. One is considered the independent variable; the other is considered dependent upon it and is called the dependent variable. One of the easiest ways of studying the correlation between the two variables is with the help of a scatter diagram.

A scatter diagram can give us two types of information. Visually, we can look for patterns that indicate whether the variables are related. Then, if the variables are related, we can see what kind of line, or estimating equation, describes this relationship.

The scatter diagram gives an indication of the nature of the potential relationship between the variables.

Example 

A sample of 10 employees of the Universal Computer Corporation was examined to relate the employees' score on an aptitude test taken at the beginning of their employment and their monthly sales volume. The Universal Computer Corporation wishes to estimate the nature of the relationship between these two variables

Aptitude Test Score

Monthly Sales (Thousands of Rupees)

Aptitude Test Score

Monthly Sales (Thousands of Rupees)

X

Y

X

Y

50

30

70

60

50

35

70

45

60

40

80

55

60

50

80

50

70

55

90

65

To determine the nature of the relationship for example, we initially draw a graph to observe the data points.

Figure 1

2406_scatter diagram.png

On the vertical axis, we plot the dependent variable monthly sales. On the horizontal axis we plot the independent variable aptitude test score. This visual display is called a scatter diagram.

In the figure given above, we see that larger monthly sales are associated with larger test scores. If we wish, we can draw a straight line through the points plotted in the figure. This hypothetical line enables us to further describe the relationship. A line that slopes upward to the right indicates that a direct, or a positive relation is present between the two variables. In the figure given above we see that this upward-sloping line appears to approximate the relationship being studied.

The figures below show additional relations that may exist between two variables. In figure 2(a), the nature of the relationship is linear. In this case, the line slopes downward. Thus, smaller values of Y are associated with larger values of X. This relation is called an inverse (linear) relation.

Figure 2

705_scatter diagram1.png

 

Figure 2(b) represents a relationship that is not linear. The nature of the relationship is better represented by a curve than by a straight line - that is, it is a curvilinear relation. The relationship is inverse since smaller values of Y are associated with larger values of X.

Figure 2(c) is another curvilinear relation. In this case, however, larger values of Y are associated with larger values of X. Hence, the relation is direct and curvilinear.

In figure 2(d), there is no relation between X and Y. We can draw neither a straight line nor a curve that adequately describes the data. The two variables are not associated.


Related Discussions:- Scatter diagram - correlation analysis

#regression, #regression line drawn as Y=C+1075x, when x was 2, and y was 2...

#regression line drawn as Y=C+1075x, when x was 2, and y was 239, given that y intercept was 11. calculate the residual

Which average is to be used to describe statistical data?, There ar...

There are situations where none of the three averages is fully satisfactory. For example, if the number of items in a series is very small, none of these av

Assumptions in anova, Assumptions in ANOVA The various populations f...

Assumptions in ANOVA The various populations from which the samples are drawn should be normal and have the same variance. The requirement of normality can be discarded if t

Multiple correspondence analysis, Correspondence analysis is an exploratory...

Correspondence analysis is an exploratory technique used to analyze simple two-way and multi-way tables containing measures of correspondence between the rows and colulnns of an

Determine that the events are mutually exclusive or not, In a study of outc...

In a study of outcomes for patients who had been in the Intensive care Unit (ICU) at a large hospital, the records from last 150 patients who had been in the ICU for more than one

BIVARIATE FREQUENCY , MARKS IN LAW :10 11 10 11 11 14 12 12 13 10 MARKS IN ...

MARKS IN LAW :10 11 10 11 11 14 12 12 13 10 MARKS IN STATISTICS :20 21 22 21 23 23 22 21 24 23 MARKS IN LAW:13 12 11 12 10 14 14 12 13 10 MARKS IN STATISTICS:24 23 22 23 22 22 24 2

Median, The median, as the name suggests, is the middle value of a series a...

The median, as the name suggests, is the middle value of a series arranged in any of the orders of magnitude i.e. ascending or descending order. As distinct from the arithmetic

Diversity of data , The box plot displays the diversity of data for the tot...

The box plot displays the diversity of data for the totexp; the data ranges from 30 being the minimum value and 390 being the maximum value. The box plot is positively skewed at 1.

Its a portfolio assignment, i m doing MBA in singapore and i want a good wo...

i m doing MBA in singapore and i want a good work. i want a data for 200 observations and then answers for some questions. and i need the data to be approved by our professor first

The weekly treatment , A researcher is interested in comparing the effectiv...

A researcher is interested in comparing the effectiveness of three different parts of therapy for anger problems. 8 participants are randomly assigned to 3 treatment conditions: Co

Write Your Message!

Captcha
Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd