MATH 4044 Statistics for Data Sciences Assignment

Assignment Help Other Subject
Reference no: EM133157571 , Length: 25 pages

MATH 4044 Statistics for Data Sciences - University Of South Australia

To achieve maximum marks for each question, you should aim to:
- Complete the requested statistical analysis in SAS using appropriate tasks or procedures.
- Include only the output most relevant to the question and interpret all key results. Do not include every piece of output produced by SAS!
- Discuss the results more broadly in the context of the given scenario.

Introduction
Use the data to study the factors that affect the beneficiary's insurance charges.

Data Description
Machine Learning with R by Brett Lantz is a book that provides an introduction to ma- chine learning using R. The dataset is used as an example for regression in the book. The data is downloaded from Some post-processing was carried out for the purpose of the case study.
The data file for this case study is called insurancev2.sas7bdat. The dataset con- tains insurance charges to the beneficiary, together with their demographic information. Variables in that file are as follows:

Assignment Tasks

Question 1
(a) Carry out a one-way analysis relating log_charges to agegroup. Use contrasts to test at least one a-priori hypothesis of your choice. Examine and comment on residuals. Also carry out appropriate post-hoc comparisons and discuss your results.
(b) Use SAS to perform a one-way ANCOVA relating log_charges to agegroup and bmi with bmi as a covariate, including appropriate post-hoc comparisons:
- Confirm that there is a linear relationship between the response variable and the covariate (a scatterplot and correlation coefficient plus a comment will suffice);
- Check the two additional ANCOVA assumptions (report and comments only on the parts of the output most directly relevant to condition check- ing):
∗ Independence of the covariate and the treatment effect (perform a one-way ANOVA test);
∗ Equality of slopes (add and check significance of the interaction term);
- Report and briefly discuss your results.
Technical note: Make sure you obtain and examine Type III Sum of Squares (ss3). Also obtain estimates of 'least squares means' (lsmeans) which are means by treatment adjusted for the covariate.

Question 2
(a) Carry out a one-way analysis of variance relating log_charges and weight_range. Examine and comment on residuals. Use contrast to test at least one a-priori hypothesis of your choice. Also carry out appropriate post-hoc comparison and discuss your results.
(b) Extend your analysis in part a to test whether there is evidence of interaction between weight_range and smoker. Examine and comment on residuals. Carry out appropriate post-hoc comparisons and discuss your results.

Question 3 Carry out an additional ANCOVA or factorial ANOVA of your choice to find other factors that may have significant impact on insurance charges.

Question 4 Write a summary of your findings from Questions 1-3. Keep the technical details of the analyses that led you to these conclusions to the absolute minimum. Rather, focus on practical significance and present your findings in non-specialist terms. One to two paragraphs (up to a page) will be sufficient.

Attachment:- Statistics for Data Sciences.rar

Reference no: EM133157571

Questions Cloud

Strong understanding of professional ethics : Create and maintain professional relationships with colleagues as well as the wider workplace community - understanding of professional ethics
What are the annual carrying? cost : Tinnendo, Inc. believes it will sell 4 million? zen-zens, What are annual carrying? cost, annual ordering? cost, and optimal order quantity for the? zen-zens
What do you personally think of completing a resume : What do you personally think of completing a Resume? What is one area of your resume you need to improve on? Job experience, objective, awards, etc
Determine the contribution margin ratio : Fixed costs are $239,400, and operating income is $1,675,800. Determine the following: Contribution margin ratio and Variable cost per unit
MATH 4044 Statistics for Data Sciences Assignment : MATH 4044 Statistics for Data Sciences Assignment Help and Solution, University Of South Australia - Assessment Writing Service
How much should be recorded in the machine account : The machine's depreciation expense for the year is $40,000. How much should be recorded in the machine account on December 31 (net of depreciation)
Record the transactions for prada : An independent appraisal valued the land at $90,000 and the communication equipment at $10,000. Record the transactions for Prada
Which statement is true of fredrick corporation : Fredrick Corporation issues a $1,000 bond with a stated rate of interest of 4%. Which statement is true of Fredrick Corporation
How much will be paid to common stockholders : How much of the $24,000 dividend will be paid to preferred stockholders and how much will be paid to common stockholders in 2024

Reviews

Write a Review

Other Subject Questions & Answers

  Cross-cultural opportunities and conflicts in canada

Short Paper on Cross-cultural Opportunities and Conflicts in Canada.

  Sociology theory questions

Sociology are very fundamental in nature. Role strain and role constraint speak about the duties and responsibilities of the roles of people in society or in a group. A short theory about Darwin and Moths is also answered.

  A book review on unfaithful angels

This review will help the reader understand the social work profession through different concepts giving the glimpse of why the social work profession might have drifted away from its original purpose of serving the poor.

  Disorder paper: schizophrenia

Schizophrenia does not really have just one single cause. It is a possibility that this disorder could be inherited but not all doctors are sure.

  Individual assignment: two models handout and rubric

Individual Assignment : Two Models Handout and Rubric,    This paper will allow you to understand and evaluate two vastly different organizational models and to effectively communicate their differences.

  Developing strategic intent for toyota

The following report includes the description about the organization, its strategies, industry analysis in which it operates and its position in the industry.

  Gasoline powered passenger vehicles

In this study, we examine how gasoline price volatility and income of the consumers impacts consumer's demand for gasoline.

  An aspect of poverty in canada

Economics thesis undergrad 4th year paper to write. it should be about 22 pages in length, literature review, economic analysis and then data or cost benefit analysis.

  Ngn customer satisfaction qos indicator for 3g services

The paper aims to highlight the global trends in countries and regions where 3G has already been introduced and propose an implementation plan to the telecom operators of developing countries.

  Prepare a power point presentation

Prepare the power point presentation for the case: Santa Fe Independent School District

  Information literacy is important in this environment

Information literacy is critically important in this contemporary environment

  Associative property of multiplication

Write a definition for associative property of multiplication.

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd