Develop a model to predict an interval variable

Assignment Help Applied Statistics
Reference no: EM131314752 , Length:

UNIT - DATA ANALYSIS PROJECT

All Excel output should be copied into a single Word document where you must enter all of your responses to the questions below. Format the document professionally so it flows well.

Include a table of contents.

- Choose any published database from the internet or Bethel library (such as those from the Census Bureau or any financial sites). You may opt to use one of the data files provided by the instructor if applicable.

- Get advanced approval from the instructor on your chosen database.

- If the file is large, randomly choose 200 of the observations from the data.

- Explain each variable in the file that you are analyzing. Be sure your file includes at least

3 scale variables and at least 2 nominal variables.

- Conduct a descriptive analysis on any 2 interval / ratio variables you wish using

Descriptive_Statistics.xls and Frequency_Distribution.xls. Explain the output.

- Conduct 3 different hypothesis tests of your choice using appropriate variables from the file (note: you must use 3 different tests and not run one test on 3 different variables). In each case, state the variables being tested as well as the hypothesis, decision and conclusion. Use 3 of the following (1-Sample Test for Means, 1-Sample Test for Proportions, 2-Sample Test for Means - Independent Samples, 2-Sample Test for Means

- Paired Samples, 2-Sample Test for Proportions, Analysis of Variance, Chi Square Goodness of Fit Test, Chi Square Test of Independence, Correlation Test).

- Develop a model to predict an interval/ratio variable using at least 2 other variables.

Use Multiple_Regression.xls and state the regression model and which variables are or are not significant. Also, use the model to make a prediction by making up values for each of the independent variables.

- Write a one to two page summary of your findings. Include the data file in the appendix.

The project is due at the close of Week 8. You may work alone or in a team of 2 (you choose your own partners and both of you must let your instructor know of your intent to work together).

You may use a data set from the internet or from your workplace, or you may use one of the files provided on www.drjimmirabella.com/bethel The files are described here, and the variables are described within the files. If there is anything confusing about these data files, please ask your instructor.

BASEBALL: This file includes actual team by team data for the 1997 MLB season. The key variable to predict in Multiple Regression Analysis is the number of wins (or possibly the attendance). Lots of interesting analysis possibilities here, including how team salary relates to a team making the playoffs, or whether money buys wins, or how wins relate to attendance, or how performance on the field relates to the field surface, etc. If you know something about baseball, this file should make sense to you.

Dr. Jim Mirabella

CARS: This file is self-explanatory after you open it. Several variables describe the car (sports car, SUV, engine size, horsepower, etc.), and several describe the car's performance (CityMPG and Highway MPG). It also includes the Dealer Cost and Suggested Retail Price.

The key

variable to predict in Multiple Regression Analysis is the Suggested Retail Price. Lots of rosstabulation options for Chi Square Analysis, lots of ANOVA and t-test options in which you analyze miles per gallon or price as a function of any of the many variables included.

LOW BIRTH WEIGHT: This file looks at factors that might predict a baby being born with low birth weight. Birth weights of 5.5 pounds or less are considered low in this file. Use the actual birth weight as the key predicted variable in Multiple Regression Analysis. Lots of variables about the mother regarding her weight, race, medical problems, and doctor's visits can be used for Chi Square analysis or as factors in ANOVA's or t-tests.

MUTUAL FUNDS: This file looks at Large Cap, Mid Cap and Small Cap funds with either Growth or Value objectives. Some funds have fees. Funds are either high, average or low risk. Assets range from 50.7 million dollars to 66.5 billion dollars. For Multiple Regression Analysis, you can choose to predict any of the three Return rates (measured in percents). Lots of categorical variables to choose from in a Chi Square Analysis or as factors to analyze differences in mean return rates.

TIPS: This file includes data on 75 patrons at the Spaghetti Warehouse on a given day. The key variable here is the Tip Rate or the Tip Total. If you wait tables there, under what circumstances are you most likely to get a better tip? You can compute mean Bills or Tips or Tip Rates as a function of the meal time, the party size or the size of the party at the table. Note that you should not use a nominal variable with 3 or more values in the Multiple Regression Analysis (unless you convert to dummy variables, but that is unnecessary here).

No of Pages/Words : as needed to answer

Verified Expert

This task provides a clear brief description on the classification of three types of wines. the difference in mean alcohol content among the three types of wines is calculated using one way ANOVA..

Reference no: EM131314752

Questions Cloud

Determine its bulk modulus of elasticity : A liquid in a cylinder has a volume of 1200 cm3 at 1.25 MPa and a volume of 1188 cm3 at 2.50 MPa. Determine its bulk modulus of elasticity.
Draft a high-level project plan using a computer application : Using the scenario linked below, draft a high-level project plan (Gantt Chart) using a computer application of your choice (e.g., Excel or, MS Project) Be sure to: include project checkpoints needed to ensure project success.
Determine the bulk modulus of elasticity : A pressure of 10 MPa is applied to 0.25 m3 of a liquid, causing a volume reduction of 0.005 m3. Determine the bulk modulus of elasticity.
Determine the percentage decrease in its volume : Water in a container is originally at 100 kPa. The water is subjected to a pressure of 120 MPa. Determine the percentage decrease in its volume.
Develop a model to predict an interval variable : Develop a model to predict an interval / ratio variable using at least 2 other variables - Choose any published database from the internet or Bethel library (such as those from the Census Bureau or any financial sites). You may opt to use one of th..
Determine the upward force on the glass : A glass tube having an inside diameter of 0.25 mm and an outside diameter of 0.35 mm is inserted into a pool of mercury at 20°C such that the contact angle is 13°8. Determine the upward force on the glass.
How bcs is used for sustaining performance as described : Research and describe a real-world implementation of balanced score card. Discuss how BCS is used for sustaining performance as described in this chapter.
Determine how far the column of mercury in the tube : Determine the difference in pressure between the inside and outside of a soap film bubble at 20 °C if the diameter of the bubble is 4 mm.
What are the post-money and pre-money valuations : Onset VC invested $750,000 to purchase preferred shares at $1 per share in return for 31.58% ownership. What are the post-money and pre-money valuations? What is the number of shares that Onset has? What is the total number of shares in company?

Reviews

inf1314752

12/19/2016 7:39:23 AM

You are amazing. This fair demonstrates what a distinction working with ExpertsMind is versus your rivals. I have been blazed a few times by them and was concerned rashly as I didn't realize what's in store from ExpertsMind.com. I have now discovered that you were definitely justified even despite the hold up and the cash and I will go to no other service however ExpertsMind.com later on. Much obliged once more. You are an aggregate lifeline........

inf1314752

12/19/2016 7:37:18 AM

I need to check data set that needs to be used for analysis. There is mention of few data sets in given file. Also document says use Descriptive_Statistics.xls and Frequency_Distribution.xls and Multiple_Regression.xls. so are there any templates for these procedures or we just need to use Excel functions for analysis.This link has several free world based data sets, many with 200 sample sizes. such as All forms of TB, detection rate (%) for tuberculosis from the world health organization or nearly any other topic. https://www.gapminder.org/data/

mai1314752

12/17/2016 12:28:25 PM

If you would rather not use a data file of your choosing from any internet database I can request one from the instructor. Thanks

mai1314752

12/17/2016 12:28:13 PM

All Excel output should be copied into a single Word document where you must enter all of your responses to the questions below. Format the document professionally so it flows well. Include a table of contents.

mai1314752

12/17/2016 12:27:34 PM

Hello, I'm sorry I'm at work and just got your email question. Yes you can use the excel function for analysis and the data set can be from any internet source just as healthcare, census bureau or some other topic that you may be familiar with. The instructor didn't assign a data set but just requested that we inform her of our choice. I assumed it would be easier if you could use any set of your own choosing. I will share the instructors assignment request again in this email. Thank you and I will watch more closely if you have any further questions.

Write a Review

Applied Statistics Questions & Answers

  Hypothesis testing

What assumptions about the number of pedestrians passing the location in an hour are necessary for your hypothesis test to be valid?

  Calculate the maximum reduction in the standard deviation

Calculate the maximum reduction in the standard deviation

  Calculate the expected value, variance, and standard deviati

Calculate the expected value, variance, and standard deviation of the total income

  Determine the impact of social media use on student learning

Research paper examines determine the impact of social media use on student learning.

  Unemployment survey

Find a statistics study on Unemployment and explain the five-step process of the study.

  Statistical studies

Locate the original poll, summarize the poling procedure (background on how information was gathered), the sample surveyed.

  Evaluate the expected value of the total number of sales

Evaluate the expected value of the total number of sales

  Statistic project

Identify sample, population, sampling frame (if applicable), and response rate (if applicable). Describe sampling technique (if applicable) or experimental design

  Simple data analysis and comparison

Write a report on simple data analysis and comparison.

  Analyze the processed data in statistical survey

Analyze the processed data in Statistical survey.

  What is the probability

Find the probability of given case.

  Frequency distribution

Accepting Manipulation or Manipulating

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd