Build a multiple regression model from your data

Reference no: EM131068431


For this Course Project you will collect data, perform preliminary data analysis, build and analyze a model, and use the results of your analysis to make predictions, draw conclusions, and support decisions.

The Project will be conducted in three phases:

Phase I:  Collect data and describe your data set. Please include: a description of what the data is, how it was collected (if known), the types of variables (categorical/continuous), the unit of analysis, and the business scenario.

Phase II:  Perform preliminary analysis of your data, using descriptive statistics.  Please include: central tendencies, variability, normality, and a visual representation for each variable, as well as correlations between variables and your preliminary thoughts on which variables will be included in the regression model (i.e., which are the independent variables and which is the dependent variable; these can and probably will change!).  If you would like to include a regression analysis for me to review, I will give you feedback.

The first two phases will be graded based on a satisfactory submission.  If the first submission is on time and satisfactory, then full credit will be awarded.  In the case of an unsatisfactory submission, five points may be deducted for each required redo.

Phase III:  Build a multiple regression model from your data, and prepare a business report that includes all of your previous work, and that presents a recommendation to a decision-maker based on your model and analysis.


Phase I, Data Collection

You may collect your data from (almost) any source(s).  The objective is to include a numerical response (dependent) variable that can be predicted from some number of other (independent) variables.  These data do not have to come from the same source, but should be compatible as data sets.  Data should be cross-sectional (no time-series data).

The minimum requirement is 50 observations (50 countries, 50 companies, 50 counties, whatever) with ten independent variables and one dependent variable (11 overall). Numerical independent variables are better, but up to 3 may be categorical (max 3 categories) or binary. If you combine sources, the data sets must be compatible (i.e., if your response is a monthly result over a ten-year period, your other data should cover the same time period and increments). It is best not to have the variables measured at 50 points in time, unless the points in time are quite close to each other. It would be better to have one or a few points in time with lots of observations at each. [Beware the tautology:  do not collect temperature and humidity to "predict" the heat index!]  Ensure that your data set will allow you to draw relevant conclusions about something that matters. The data may be from any field (preferably business-related) and should be collected so that you can establish relationships among your data to support some sort of conclusion or recommendation. Please explain your planned business scenario, i.e., who would need to predict this dependent variable (DV), and what would they use the results for?
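The size requirements above can be checked mechanically before submitting. The following is a minimal sketch, not part of the assignment: it assumes the data has been loaded as a list of dictionaries (one per observation), and it reads the "up to 3 may be categorical" allowance as applying to the independent variables. The function name and column names are hypothetical.

```python
def check_requirements(rows, dependent, categorical=()):
    """Return a list of problems; an empty list means the stated minimums are met.

    rows        -- list of dicts, one per observation (hypothetical layout)
    dependent   -- name of the dependent variable column
    categorical -- names of the independent variables treated as categorical
    """
    problems = []
    if len(rows) < 50:
        problems.append(f"only {len(rows)} observations; need at least 50")

    columns = set(rows[0]) if rows else set()
    if dependent not in columns:
        problems.append(f"dependent variable {dependent!r} not found")

    independents = columns - {dependent}
    if len(independents) < 10:
        problems.append(
            f"only {len(independents)} independent variables; need at least 10"
        )

    if len(categorical) > 3:
        problems.append("more than 3 categorical independent variables")
    for name in categorical:
        levels = {row[name] for row in rows}
        if len(levels) > 3:
            problems.append(f"{name!r} has {len(levels)} categories; max is 3")

    return problems


# Made-up data set: 50 observations, one response "y", ten predictors x1..x10.
rows = [{"y": float(i), **{f"x{j}": i * j for j in range(1, 11)}} for i in range(50)]
print(check_requirements(rows, "y") or "minimum requirements met")
```

A check like this only confirms the counts; whether the variables support a meaningful business conclusion still has to be argued in the Phase I summary.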

The submission will be in the form of an Excel file submitted in Canvas with a summary of what it is and where it came from.

Phase II, Preliminary Data Analysis

Apply descriptive statistics to your data set.  This can include graphical depictions as well as some basic calculated statistics.  Since you will be building a multivariate model, the correlations between your independent variables should be included.  You should, at this point, be able to make some preliminary observations about your data.  These observations (and any others you come up with later) should make their way into your business report, but will generally appear in appendices unless you determine them to be critical to the decision you are recommending. This submission should be a Word document with your Excel file also attached.
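As an illustration of the correlation screening described above, here is a minimal Python sketch that computes Pearson correlations for every pair of independent variables and flags the pairs at or above the 0.7 level this brief treats as concerning. The variable names and values are made up, and the assignment itself expects the work in Excel, so treat this as a cross-check rather than the required method.

```python
from itertools import combinations
from math import sqrt


def pearson(xs, ys):
    """Pearson correlation coefficient between two equal-length lists."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / sqrt(sxx * syy)


def high_correlations(columns, threshold=0.7):
    """Return (name_a, name_b, r) for every pair with |r| >= threshold.

    columns -- dict mapping variable name to a list of numeric values
    """
    flagged = []
    for a, b in combinations(columns, 2):
        r = pearson(columns[a], columns[b])
        if abs(r) >= threshold:
            flagged.append((a, b, round(r, 3)))
    return flagged


# Hypothetical independent variables (five observations only, for brevity).
columns = {
    "ad_spend": [1, 2, 3, 4, 5],
    "store_size": [2, 4, 6, 8, 11],
    "staff_turnover": [3, 1, 4, 1, 5],
}
for a, b, r in high_correlations(columns):
    print(f"{a} and {b} are highly correlated (r = {r})")
```

A flagged pair does not have to be dropped automatically; it is a prompt to decide which of the two variables to keep in the model, and to explain why.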

For each variable separately:

- Variable Name

- Description (what is it?)

- Units

- Central Tendency (mean, median, mode - use the appropriate one!)

- Variability (range, standard deviation)

- Normal distribution?  (continuous variables only)

- Outliers?  What did you decide to do with the outliers?

- Correlation with your dependent variable

- Correlations of concern with other independent variables (0.7 or higher)

- Visual representation of variable
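The per-variable checklist above can be sketched for one continuous variable as follows. This is a minimal example using only the Python standard library. The 1.5 x IQR outlier rule and the use of skewness as a rough normality screen are assumptions, since the brief does not prescribe a specific method, and the sample values are made up.

```python
import statistics


def describe(values):
    """Summarize one continuous variable: center, spread, skewness, outliers."""
    ordered = sorted(values)
    mean = statistics.mean(ordered)

    # Quartiles for the 1.5 x IQR outlier rule (an assumed convention).
    q1, _, q3 = statistics.quantiles(ordered, n=4)
    iqr = q3 - q1
    low, high = q1 - 1.5 * iqr, q3 + 1.5 * iqr

    # Moment-based skewness: near 0 for symmetric data, positive for a
    # long right tail. A rough screen only, not a formal normality test.
    pop_sd = statistics.pstdev(ordered)
    skewness = sum(((v - mean) / pop_sd) ** 3 for v in ordered) / len(ordered)

    return {
        "mean": mean,
        "median": statistics.median(ordered),
        "mode": statistics.mode(ordered),
        "range": ordered[-1] - ordered[0],
        "std_dev": statistics.stdev(ordered),  # sample standard deviation
        "skewness": skewness,
        "outliers": [v for v in ordered if v < low or v > high],
    }


# Hypothetical variable with one unusually large observation.
summary = describe([12, 15, 14, 10, 13, 15, 11, 40])
for name, value in summary.items():
    print(f"{name}: {value}")
```

When the mean sits well above the median, as it does for this made-up variable, the median is usually the more appropriate measure of central tendency to report, and the flagged observation needs an explicit keep-or-remove decision.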

Overview of Data

- After running all descriptive statistics, do you have any thoughts on which variables may be better predictors of your dependent variable, or on how things look overall?

Phase III:  Model Construction and Business Report

You will build a multiple regression model from your data using the techniques we have learned in the course.  You should decide here how you intend to use your model to conduct analysis, make predictions, and support decisions. 

You will wrap all of your work up in a business report.  Remember that the target is an executive who you will ask for a decision based on your recommendation.  Perform analysis with your model, interpret your model, include your calculations and the original data (in appendices) but present the bottom line to the decision-maker up front.  The report will be submitted in paper copy at the beginning of class.  The clear plastic binder is highly discouraged.

While many organizations suggest a format for a business report, there are as many that do not, so the presentation is up to you.  However, the following page may be used as a guideline.

Business Report Format

Cover Sheet

Title.  Indicate who the report is for, and what the report is about.  (Use this to establish the "setting" for your instructor to grade your submission.)

Your name and position. (Again to establish context for the grader.)

Executive Summary.  A single paragraph that an executive can read and immediately know what decision you are recommending and why.

Main Body

A 2-3 page report that tells the executive what decision should be made, and why the decision should be what it is.  This should reference (and may include) the model you are using to support the decision-making process, and may also describe how confident the executive should be when making this decision.  (In extreme cases the report can go up to 5 pages.  Business reports not intended for senior executives may be longer, based on the organization's needs.)

BLUF! (Bottom Line Up Front!)  The decision should be clear after the first few sentences, and definitely by the end of the first paragraph.

Include only that information that will be critical to the executive's decision-making process.

Refer to all supporting data and analyses that are included in appendices.  Appendices should appear in order of importance, and should be referenced in that order.


Appendices (no page limit; whatever is appropriate to describe the following)

A. Model and Interpretation

Show the final model (Y = ...) you developed to support the decision, and interpret it, including the effects of the ranges of your input variables.  This is where you discuss the meaning of the relationship between predictors and outcome (i.e., when X increases, what happens to Y?). There does not need to be "stats language" here.  It can be very helpful to plug in values to demonstrate how the model works.
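Plugging values into the model can be shown in a few lines. The sketch below uses a purely hypothetical fitted equation (sales = 12.0 + 3.5 x ad_spend - 0.8 x competitors); the coefficients and variable names are invented for illustration and do not come from any real data set.

```python
# Hypothetical fitted model: sales = 12.0 + 3.5 * ad_spend - 0.8 * competitors
INTERCEPT = 12.0
COEFFICIENTS = {"ad_spend": 3.5, "competitors": -0.8}


def predict(inputs):
    """Evaluate the regression equation for one set of predictor values."""
    return INTERCEPT + sum(
        coefficient * inputs[name] for name, coefficient in COEFFICIENTS.items()
    )


baseline = predict({"ad_spend": 10.0, "competitors": 5.0})
more_ads = predict({"ad_spend": 11.0, "competitors": 5.0})

print(f"Baseline prediction: {baseline:.1f}")
print(f"With one more unit of ad spend: {more_ads:.1f}")
print(f"Change attributable to ad spend: {more_ads - baseline:.1f}")
```

Holding the other predictor fixed and changing one input by one unit reproduces that input's coefficient, which is the plain-language interpretation an executive needs. Inputs should stay within the range observed in the data, since the model says nothing reliable outside it.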

B. Model Statistical Analysis

Discuss the strength of the model in terms of how it supports the decision-making process.  Include the relevant Excel output that supports the quality of the model.

  • Correlation and multiple regression analyses were conducted to examine the relationship between Y and X(s)....
  • Discuss normality, missing data problems (if any), outliers (if any), and correlations between Y and X(s) - strength, direction, and r^2.
  • Explain the multiple regression (MR) output - r, r^2, F, p. Explain significant beta weights - t, p, relationship.
  • Include final tables here to refer to when discussing results.
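To make the quantities in the list above concrete, here is a minimal sketch of ordinary least squares in plain Python that reproduces the coefficients, r^2, the F statistic, and the t statistic for each coefficient. It is an illustrative cross-check under stated assumptions, not the course's method: the data values are made up, the function names are hypothetical, and p-values are omitted because the Python standard library has no F or t distribution (Excel's regression output reports them).

```python
from math import sqrt


def invert(matrix):
    """Invert a square matrix by Gauss-Jordan elimination with partial pivoting."""
    n = len(matrix)
    aug = [
        [float(v) for v in row] + [1.0 if i == j else 0.0 for j in range(n)]
        for i, row in enumerate(matrix)
    ]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        if abs(aug[pivot][col]) < 1e-12:
            raise ValueError("X'X is singular; check for collinear predictors")
        aug[col], aug[pivot] = aug[pivot], aug[col]
        scale = aug[col][col]
        aug[col] = [v / scale for v in aug[col]]
        for r in range(n):
            if r != col:
                factor = aug[r][col]
                aug[r] = [a - factor * b for a, b in zip(aug[r], aug[col])]
    return [row[n:] for row in aug]


def fit_ols(xs, y):
    """Fit y = b0 + b1*x1 + ... + bk*xk; xs is a list of predictor rows."""
    n, k = len(y), len(xs[0])
    p = k + 1  # number of estimated coefficients, including the intercept
    design = [[1.0] + list(row) for row in xs]
    xtx = [[sum(r[i] * r[j] for r in design) for j in range(p)] for i in range(p)]
    xty = [sum(r[i] * yi for r, yi in zip(design, y)) for i in range(p)]
    inv = invert(xtx)
    beta = [sum(inv[i][j] * xty[j] for j in range(p)) for i in range(p)]

    fitted = [sum(b * v for b, v in zip(beta, r)) for r in design]
    residuals = [yi - fi for yi, fi in zip(y, fitted)]
    mean_y = sum(y) / n
    sse = sum(e ** 2 for e in residuals)          # unexplained variation
    sst = sum((yi - mean_y) ** 2 for yi in y)     # total variation
    mse = sse / (n - p)
    std_err = [sqrt(mse * inv[i][i]) for i in range(p)]
    return {
        "coefficients": beta,
        "r_squared": 1 - sse / sst,
        "f_statistic": ((sst - sse) / k) / mse,
        "t_statistics": [b / s for b, s in zip(beta, std_err)],
        "residuals": residuals,
    }


# Made-up data: eight observations, two predictors.
xs = [[1, 2], [2, 1], [3, 4], [4, 3], [5, 6], [6, 5], [7, 7], [8, 9]]
y = [5.1, 5.4, 9.2, 9.6, 13.8, 13.9, 17.6, 20.9]
result = fit_ols(xs, y)
print("coefficients:", [round(b, 3) for b in result["coefficients"]])
print("r^2:", round(result["r_squared"], 4))
print("F:", round(result["f_statistic"], 2))
print("t:", [round(t, 2) for t in result["t_statistics"]])
```

Comparing these figures against the Excel regression output is a quick way to confirm that the table included in the report was produced from the intended columns.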

C. Model Development

Explain the process you used to turn the data into a model.  Explain, with rationale, which predictors you started with but did not include in your final model.  Discuss how you checked the regression assumptions.  Discuss variable elimination and transformation, as well as any other clever modeling techniques you used.  You do not have to include every step of your process, but you should show the critical analyses that led to important modeling decisions.

D. Data Analysis

Show your descriptive and graphical analysis of the data, to include all the observations that might contribute to the modeling process.

E. Data

Briefly describe the data set and include the sources.  For very small data sets you may include the data itself.  For larger data sets (hundreds of observations or more), do not waste your company's paper.

