Comment on the overall adequacy of the final model

Assignment Help Applied Statistics
Reference no: EM131505866

Assessment Item - Research Report

Data

The file:Birthweights.xlsx contains data on the following variables for a sample of 1000 births recorded in a large local hospital in 2015:

Variable

Description

Birthweight

Birthweight in grams

Gestation

Length of pregnancy in days

Smoke

Whether the mother is a smoker or not

Pre-pregnancy weight

Mother's pre-pregnancy weight in kilograms

Height

Mothers height in centimetres

Status

Mother's indigenous status

Age

Mother's age in years

Background

Management at the hospital is interested in being able to better manage room allocations and bookings in their maternity ward. They are keen to identify mothers at risk of having low birthweight babies who may require additional hospital resources during their stay in the hospital.

The hospital has collected data for a number of previous births at the hospital. The data contains information on the variables outlined in the table above. As a consultant, they have approached you and asked if you could analyse this dataset.

Tasks

Part 1 - Analysis

1. Past records (2004) show that the average birthweight was 3500 grams. Test at 5% if the average birthweight in 2015 has increased with the improvement in general nutrition.

(Include all six steps for hypothesis testing.)

2. Performa two-sample t-test for each of the following tasks. (Include all six steps for hypothesis testing in each.)

(a) Determine if there is evidence that on average the weight of a baby of a mother who smokes is less than that of a mother who does not.(α= 5%)

(b) Determine if being indigenous is a disadvantage in terms of birthweight. (α= 5%)

The hospital managementis particularly interested in whether you can develop a regression model to help them to predict the birthweight of a baby based on the variables in the data supplied. The model could then be used to predict birthweight to identify babies at risk in future.

3. By using the forward stepwise method, develop a multiple regressionmodel to predict the birthweight.
Step 1: Gestation only
Step 2: Gestation and Smoke
Step 3: Gestation, Smoke and Pre-pregnancy Weight
Step 4: Gestation, Smoke, Pre-pregnancy Weight and Height
Step 5: Gestation, Smoke, Pre-pregnancy Weight, Height and Status
Step 6: Gestation, Smoke, Pre-pregnancy Weight, Height, Status and Age

(a) Interpret the regression coefficients of all six (6) independent variables in the model obtained in Step 6, and comment on the statistical significance of each.

(b) Use Excel to obtain the correlation matrix for the following variables: Gestation, Pre-pregnancy Weight,Height,Ageand Birthweight. Do you think multi-collinearity is a problem in the regression model? Are the correlation coefficients consistent with the regression coefficients obtained in the model in Step 6? Discussbriefly.

(c) Focusing on Steps 3 and 4, discuss fully how the introduction of Height in Step 4 affects the regression coefficient of Pre-pregnancy Weight.

(d) Based on the results in (a) to (c), explain which independent variables should be includedor excluded to formulate the final model. State the final model.

(e) Comment on the overall adequacy of the final model.

(f) Consider an indigenous mother who is a smoker, 20 years of age, and 160cm tall with a pre-pregnancy weight of 58kg and gestational age of 267 days.What is the expected weight of the child, using the final model you have developed in (d)?

4. Compute the difference in the average birthweight of babies of indigenousandnon-indigenous mothers (called thebirthweight difference, for simplicity). Discuss fullyif there is any discrepancy between the regression coefficient of Statusobtained in the regression model and the birthweight difference.

Part 2 -Report

You are required to submit a concise report (word limit: 400) presenting any important features or relationships in the data. The content of your report should be based on, but not restricted to, insights gleaned fromyour analyses conducted in Part 1.

Part 1 - Analysis

- For presentation and ease of marking, it is advisable to include relevantExcel output in your answer to each question in this part instead of placing them in appendices.
- There is no word limit in Part 1.
Part 2 - Report
- The report is primarily based on the data provided. If, however, you wish to include, and refer to, additional information, you can use any referencing system as long as it is used consistently.
- You can include relevantcharts and Excel objects in your report.
- Use 1& ½ spacing and font size of 11.
- Theword limitof 400 (with a tolerance of 10%) is exclusive of words in tables, appendices and reference list (if any).

Attachment:- Birthweights RAW DATA.xlsx

Reference no: EM131505866

Questions Cloud

Describe the role of department of homeland security : Describe role of Department of Homeland Security in cyber security for US citizens and US corporations with regards to attacks from WikiLeak supporting hackers.
Identify which side of the argument you agree : Identify which side of the argument you agree with and provide at least two reasons for your position. Use lessons and arguments from your entire study.
Compute the equivalent uniform cr amount : Given that the purchase price of a machine is $1,000 and its market value at EOY four is $300, complete Table P5-24 below [values (a) through (f)].
Examine state and local law enforcement agencies authority : Assignment: Immigration Enforcement- Examine state and local law enforcement agencies' authority to create and enforce their own immigration policies.
Comment on the overall adequacy of the final model : Compute the difference in the average birthweight of babies of indigenousandnon-indigenous mothers - Comment on the overall adequacy of the finalmodel.
What do i want to accomplish in life : You must answer the following: What is my purpose in life? What really counts? What do I want to accomplish in life?
Should the new system be purchased : A simple, direct space heating system is currently being used in a professional medical office complex.
Describe key elements of the role that congress plays : Describe key elements of the role that Congress plays within the U.S. federal system, with particular focus on Congress' ability to reflect the will of the peop
What is the aw method : A company is considering constructing a plant to manufacture a proposed new product. The land costs $300,000, the building costs $600,000.

Reviews

len1505866

5/24/2017 3:44:27 AM

PC (3.1): Use information literacy skills, and communicate effectively and professionally in written forms and using media appropriate for diverse purposes and contexts Written expression and integration ofrelevant statistical findings [Part 2] Writes fluently and clearly using language, format, and structure that always adheres to the report genre; meaning is clearly articulated and effectively expressed, and relevant to task

len1505866

5/24/2017 3:44:17 AM

HO(2.1): Investigate real world business issues and situations through the effective analysis, evaluation and synthesis of theory and practice Interpretation and explanation of research findings [Qs 3(b) (c) & 4] Results are presented clearly and interpreted correctly and comprehensively; research findings are critically discussed in depth and are coherently related to all aspects of the analysis and research problem Results are presented clearly and interpreted correctly in some detail; research findings are well discussed in detail in relation to most parts of the analysis and research problem Results are mostly presented clearly, though minor errors of interpretation are evident; research findings are well discussed in relation to some aspects of the analysis and research problem, though explanation is lacking in detail in parts Some results have been presented and interpreted correctly though substantive errors in explanation and/or interpretation are present; research findings do not sufficiently address the research question and/or analysis,and contain minimal explanation

len1505866

5/24/2017 3:42:58 AM

Criteria 7 KS (1.1): Demonstrate and apply integrated discipline (including technical) knowledge across the broad field of business with depth in one or more core business disciplines Application of statistical knowledge [Qs 1 and 2] Selects and correctly uses relevant graphs and statistical concepts throughout the report KS (1.2): Apply technical and technological skills appropriate and effective for real world business purposes and contexts Analysis of data [Qs 3(a), (d), (e)& (f)] Analysis methodsappropriate for comprehensively and critically investigating the research question were selected; all analyses and calculations were correctly performed

len1505866

5/24/2017 3:42:17 AM

• You should submit your response to both parts as a single pdf document saved in the format: BSB123 Report_StudentName.pdf • After uploading your research report, it is your responsibility to go back to the Assignment Upload page to check that your report was properly uploaded. • Due: 11:59 pm 28 via Blackboard Part 1 - Analysis • For presentation and ease of marking, it is advisable to include relevantExcel output in your answer to each question in this part instead of placing them in appendices. • There is no word limit in Part 1. Part 2 - Report • The report is primarily based on the data provided. If, however, you wish to include, and refer to, additional information, you can use any referencing system as long as it is used consistently. • You can include relevantcharts and Excel objects in your report. • Use 1& ½ spacing and font size of 11. • Theword limitof 400 (with a tolerance of 10%) is exclusive of words in tables, appendices and reference list (if any).

Write a Review

Applied Statistics Questions & Answers

  Build model for predicting survival of passengers on titanic

Build a model for predicting the survival of passengers on the Titanic using a decision tree in RapidMiner using the two data sets, titanic3_train.csv and titanic3_score.csv.

  Discuss difference of cross-sectional and time series data

Discuss the difference between cross-sectional data and time series data. If we record the total number of cars sold in 2011 by each of 10 car salespeople, are the data cross-sectional or time series data

  Calculate a confidence interval for unknown population

Calculate a 95% confidence interval for the unknown population mean business age - Test, at the 5% level of significance, whether location (Parramatta or Sydney CBD) of the business is related to type of business (privately held, publicly traded or..

  Investing in u.s. treasury notes

Investing in U.S. Treasury notes (T-notes) can produce interest income. However, investors can face risk of capital losses when selling Treasury securities because of changing market interest rates over time. The following contains some weekly ..

  The number of cars that pass through.

An attendant at a car was is paid according to the number of cars that pass through.

  Forecasting with time series analysis objective crusty

forecasting with time series analysis objective crusty pizza executives have to forecast december sales for the 10

  A sample of price of 16 different models of mobile in a stor

1) A sample of price of 16 different models of mobile in a store are as follows: 900 300 340 450 280 220 340 290 370 400 310 340 430 270 380 910 a) Calculate the mean, median, mode, first and third quartile. b) Calculate the variance, standard deviat..

  An internet service provider provides internet connections

Problem 1. An internet service provider (ISP) provides internet connections to 100,000 customers. 10,000 of the customers have high-speed connections and 90,000 of the customers have low-speed connections. The ISP wants to know whether, on the ..

  Mike dreskin manages a large los angeles movie theater

8-3 Mike Dreskin manages a large Los Angeles movie theater complex called Cinema I, II, III, and IV. Each of the four auditoriums plays a different film; the schedule is set so that starting times are staggered to avoid the large crowds that would oc..

  What is the interpretation of r-square

What is the interpretation of R-square (just use the latest output) and how to calculate correlation based on it?

  Addition rule to determine the probability

A department store manager has decided that dress code is necessary for team coherence. Team members are required to wear wither blue shirts or red shirts. There are 9 men and 7 women in the team. On a particular day, 5 men wore blue shirts and 4 oth..

  Creating a frequency distribution

Individual data values or grouped data when creating a frequency distribution?

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd