Develop an estimated regression equation

Assignment Help Applied Statistics
Reference no: EM131512553

Group Assignment

Task 1 -

Consumer Research, Inc., is an independent agency that conducts research on consumer attitudes and behaviours for a variety of firms. In one study, a client asked for an investigation of consumer characteristics that can be used to predict the amount charged by credit card users. Data were collected on annual income, household size, and annual credit card charges for a sample of 50 consumers. The following data are recorded for Consumer information.

Income ($1000s)

Household Size

Amount Charged ($)

Income ($1000s)

Household Size

Amount Charged ($)

54

3

4016

54

6

5573

30

2

3159

30

1

2583

32

4

5100

48

2

3866

50

5

4742

34

5

3586

31

2

1864

67

4

5037

55

2

4070

50

2

3605

37

1

2731

67

5

5345

40

2

3348

55

6

5370

66

4

4764

52

2

3890

51

3

4110

62

3

4705

25

3

4208

64

2

4157

48

4

4219

22

3

3579

27

1

2477

29

4

3890

33

2

2514

39

2

2972

65

3

4214

35

1

3121

63

4

4965

39

4

4183

42

6

4412

54

3

3720

21

2

2448

23

6

4127

44

1

2995

27

2

2921

37

5

4171

26

7

4603

62

6

5678

61

2

4273

21

3

3623

30

2

3067

55

7

5301

22

4

3074

42

2

3020

46

5

4820

41

7

4828

66

4

5149

Required:

1. Use methods of descriptive statistics to summarize the data. Comment on the findings.

2. Develop estimated regression equations, first using annual income as the in- dependent variable and then using household size as the independent variable. Which variable is the better predictor of annual credit card charges? Discuss your findings.

3. Develop an estimated regression equation with annual income and household size as the independent variables. Discuss your findings.

4. What is the predicted annual credit card charge for a three-person household with an annual income of $40,000?

5. Discuss the need for other independent variables that could be added to the model. What additional variables might be helpful?

Task 2 -

The data set for group assignment you can find on Blackboard in the folder assignment.

Required:

Activity 01:

Enter all data from the spreadsheet "Data for Assignment "into Excel. You will need to set up the variable view with the following 11 variables and then enter the data in excel:

a) Student_ID,

b) Year_Enrolled,

c) HI001_Final_Exam,

d) HI001_Assignment_01,

e) HI001_Assignment_02,

f) HI002_Final_Exam,

g) HI002_Assignment_01,

h) HI002_Assignment_02,

i) HI003_Final_Exam,

j) HI003_Assignment_01,

k) HI003_Assignment_02.

Activity 02:

a) Draw a histogram for each one of the 11 variables?

b) Do descriptive statistics (mean, standard deviation, minimum, maximum) for each one of the 11 variables.

Activity 03:

a) Do at least 10 different correlations between the any pairs of variables: For example:

  • HI001_Final_Exam and HI002_Final_Exam
  • HI001_Assignment_01 and HI001_Assignment_02

b) For each correlation discuss the results:

  • Are they are positive/negatively correlated?
  • Are they weak or strong correlations?
  • What is the significance value?
  • What does the significance value reveal about the data we have used?

Required:

a) Copy -paste the result from your Excel file to a Word document.

b) Copy-paste ALL the output from all the activities requested in Activity 01 to 03 in Excel and put the answers in the same Word document.

c) Answer all discussion questions requested in Activity 01 to 03 and put the answers in the same Word document.

d) Submit a soft copy of the Excel files used in Excel and the Assignment Word document online under Assignment final submission.

Task 3 -

As part of a long-term study of individuals 65 years of age or older, sociologists and physicians at the Wentworth medical Center in upstate New York investigated the relationship between geographic location and depression. A sample of 60 individuals, all in reasonably good health, was selected; 20 individuals were residents of Florida, 20 were residents of New York, and 20 were residents of North Carolina. Each of the individuals sampled was given a standardized test to measure depression. The data collected follow; higher test scores indicate higher levels of depression. These data are available on the website that accompanies this text in the file named medical1. A second part of the study considered the relationship between geographic location and depression for individuals 65 years of age or older who had a chronic health condition such as arthritis, hypertension, and/or heart ailment. A sample of 60 individuals with such conditions was identified. Again, 20 were residents of Florida, 20 were residents of New York, and 20 were residents of North Carolina. The levels of depression recorded for this study follow. These data are available on the website that accompanies this text in the file named medical.

Florida

New York

North Carolina

Florida

New York

North Carolina

3

8

10

13

14

10

7

11

7

12

9

12

7

9

3

17

15

15

3

7

5

17

12

18

8

8

11

20

16

12

8

7

8

21

24

14

8

8

4

16

18

17

5

4

3

14

14

8

5

13

7

13

15

14

2

10

8

17

17

16

6

6

8

12

20

18

2

8

7

9

11

17

6

12

3

12

23

19

6

8

9

15

19

15

9

6

8

16

17

13

7

8

12

15

14

14

5

5

6

13

9

11

4

7

3

10

14

12

7

7

8

11

13

13

3

8

11

17

11

11

Required:

1. Use descriptive statistics to summarize the data from the two studies. What are your preliminary observations about the depression scores?

2. Use analysis of variance on both data sets. State the hypotheses being tested in each case. What are your conclusions?

3. Use inferences about individual treatment means where appropriate. What are your conclusions?

Attachment:- Data.rar

Reference no: EM131512553

Questions Cloud

Construct a confidence interval for the difference : Construct a 95% confidence interval for the difference in proportions of women who deliver preterm - What proportion of children living in a US urban neighborhood is overweight?
Create an apa formatted annotated bibliography : Spend time researching 10 references related to your selected topic that you can use in your research paper. Create an APA formatted annotated bibliography.
Principles of management : To access ProQuest articles, you MUST first open a Web browser window to the Ashworth College Library; otherwise, you will be denied access to the articles.
How the iron triangle can be used to assess health care : Describe how the Iron Triangle can be used to assess health care. Give specific examples. Write at least 500 words. Support your though by vivid sources.
Develop an estimated regression equation : HI6007 Group Assignment. Develop an estimated regression equation with annual income and household size as the independent variables. Discuss your findings
Find eigenvalues and associated eigenvector of give matrix a : Find the eigenvalues and associated eigenvectors of the given matrix A. Apply the eigenvalue method to find a general solution of the given system.
Example of a misleading statistical visualization : Find an example of a misleading statistical visualization - there's unfortunately plenty online and in print.
Attractiveness of and competitive pressures : Describe how Porter's Five Forces Model is used to evaluate the attractiveness of and competitive pressures in an industry. Provide an example.
Benefits director at a newly formed organization : You are the new Benefits Director at a newly formed organization with 150 employees privately owned business that's not publicly traded.

Reviews

len1512553

5/31/2017 12:15:11 PM

Australian student, need it as per the guidelines. Copy –paste the result from your Excel file to a Word document. Copy-paste ALL the output from all the activities requested in Activity 01 to 03 in Excel and put the answers in the same Word document. Answer all discussion questions requested in Activity 01 to 03 and put the answers in the same Word document. Submit a soft copy of the Excel files used in Excel and the Assignment Word document online under Assignment final submission.

Write a Review

Applied Statistics Questions & Answers

  One of the nation''s biggest regional airlines

One of the nation's biggest regional airlines has tracked 4,000 landings and take-offs during the past month. Treating these data as the population of interest, the company found that the average time the planes spent on the ground (called the turn t..

  What is the difference between the mean of the two groups

What is the difference between the mean of the two groups? What is the difference is standard deviation? What is the null and alternative hypothesis? Do the data results lead you to reject or fail to reject the null hypothesis?

  Calculate the correlation between pretest and posttest score

Calculate the correlation between pretest and posttest scores separately for each gruop and determine sperately for each group whether performance improved from pretest to posttest

  What is the median

109000, 109000, 109000, 170000, 170000, 275000, 325000, 500000, 600000, 9237500 What is the median ?  what is The interquartile range ?

  Test hypothesis - all five categories have same probability

For a one-way contingency table, the following frequencies are observed: 23, 34, 43, 53, 16. Using α = .01, test the hypothesis that all five categories have the same probability.

  Examine the traced and medicalaid relationship

Examine the relationship between Traced by MedicalAid. Is there evidence that whether or not a child was traced is independent of whether the mother had medical aid

  Draw an updated process map

AYN443 - Electronic Commerce Cycles MYOB Assignment. Identify the major 2 control issues and suggest a solution. Your solution should state what control activity(s) it uses (See Lecture 3). Draw an updated Process Map with your solutions included

  The authors found that in 31 patients tested negative by sr

We examined the use of heparin-PF4 ELISA screening for heparin-induced thrombocytopenia (HIT) in critically ill persons. Using C-serotonin release assay (SRA) as the way of validating HIT, the authors found that in 31 patients tested negative by SRA,..

  Can you explain why you added the e

A few days ago you answered a question I had on probability. The original equation was R(t)=a^(-bt). In your answer you changed it to R(t) = a* e^(-bt). Can you explain why you added the e?

  College do not differ in height from apex men

Consider the following: Your friend wishes to know if men at his college are 72 inches tall, on average. He randomly sampled 10 men on his campus, measured their height, and calculated the sample mean to be 69 inches. He concluded that men at his col..

  Statistics for categorical data-odds ratios and chi-square

To test the hypothesis that there is no association between use of postmenopausal hormones and risk of MI, chi-square statistics need to be calculated.

  Giddens solve the problem of agency versus structure

An essays of at least  600 words respond, How does Giddens solve the problem of agency versus structure? To answer this question, you need to explain Gidden's theory of structuration. Your essay should integrate the following concepts:

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd