Find the actual values of dependent variable

Reference no: EM132660085

Business Analytics - Assignment

Question 1. Open worksheet "Q1" of the Excel file (provided for this assignment) and develop the following visualizations:
a. A figure showing whether there is a relationship between age and income.
b. A figure showing distribution of income.
c. A figure illustrating the registration dates and the number of people registered on each date.

Question 2. What are outliers in a dataset, how can we detect them, and how can we deal with them?
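One common detection approach is the interquartile-range (IQR) rule. The sketch below is only an illustration: the sample values are made up, the function name `iqr_outliers` is hypothetical, and the multiplier 1.5 is a convention rather than something the assignment specifies.

```python
import statistics

def iqr_outliers(values, k=1.5):
    """Return the values lying outside [Q1 - k*IQR, Q3 + k*IQR]."""
    # quantiles(..., n=4) returns the three quartile cut points.
    q1, _, q3 = statistics.quantiles(values, n=4)
    iqr = q3 - q1
    lower, upper = q1 - k * iqr, q3 + k * iqr
    return [v for v in values if v < lower or v > upper]

# Made-up sample: seven ordinary readings and one extreme value.
sample = [10, 12, 11, 13, 12, 11, 10, 95]
outliers = iqr_outliers(sample)
print(outliers)
```

Flagged values can then be checked for data-entry errors, removed, capped, or kept, depending on whether they are genuine observations.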

Question 3. Assume there are two explanatory variables (X1 and X2) in a logistic regression model.
• X1 is a categorical variable with the levels very low, low, average, high and very high.
• X2 is a categorical variable with the levels Sydney, Melbourne, Hobart and Brisbane.
Explain how you will use these variables in developing a logistic regression model. How many coefficients will you have in the final model?
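One possible treatment is dummy (indicator) coding with one reference level per variable, sketched below. The choice of the first level as the reference and the helper names `dummy_encode` and `design_row` are assumptions for illustration; since X1 is ordered, coding it as a single ordinal score would be an alternative and would change the coefficient count.

```python
# Levels as listed in the question; the first level of each variable is
# (arbitrarily) treated as the reference category.
X1_LEVELS = ["very low", "low", "average", "high", "very high"]
X2_LEVELS = ["Sydney", "Melbourne", "Hobart", "Brisbane"]

def dummy_encode(value, levels):
    """Return k-1 indicator values for a k-level categorical variable."""
    if value not in levels:
        raise ValueError(f"unknown level: {value!r}")
    return [1 if value == level else 0 for level in levels[1:]]

def design_row(x1, x2):
    """Intercept term followed by the dummy columns for X1 and X2."""
    return [1] + dummy_encode(x1, X1_LEVELS) + dummy_encode(x2, X2_LEVELS)

row = design_row("high", "Hobart")
# One coefficient per column: 1 intercept + 4 dummies (X1) + 3 dummies (X2).
n_coefficients = len(row)
print(row, n_coefficients)
```

Under this coding the model has eight coefficients, each dummy coefficient being a log-odds difference relative to the reference level.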

Question 4. The data presented in worksheet "Q4" are the results of a 4-year study conducted to assess how age, weight, and gender influence the risk of diabetes. Risk is interpreted as the probability (times 100) that the patient will have diabetes over the next 4-year period. What predictive model would you suggest to relate the risk of diabetes to the person's age, weight, and gender? Why?

Question 5. Worksheet "Q5" of the Excel file (provided for this assignment) contains 500 records of clients who have bought special products from an e-business website. Each record includes data on the types of product purchased (between 1 and 5), the purchase amount ($), the age, gender, and family size of the customer, whether the client has a membership, and whether the customer has a discount card.

a) Develop a multiple regression model to predict the purchase amount based on the other variables in the data set. Write the final equation, interpret the coefficients of the model, and explain the accuracy of the model.
b) What would be your recommendation to improve the accuracy of this model? What is your recommendation to improve the simplicity of the model?

Question 6. The following screenshot is taken from the logistic regression output for the data set "credit card". You can find the data set here. The response variable, called "card", is a binary variable which is considered a success (yes or 1) if the customer's application for a credit card is accepted.
a. Write the logistic regression equation based on the output.
b. In the Excel sheet "Q6" you can find the actual values of the dependent variable versus the predicted values. Calculate the overall error, sensitivity, and specificity. Explain the steps of the calculations.
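The calculation in part b can be sketched as follows. The actual and predicted labels here are made-up placeholders, not the contents of sheet "Q6", and the function name `classification_metrics` is hypothetical; the formulas are the standard confusion-matrix definitions.

```python
def classification_metrics(actual, predicted):
    """Overall error, sensitivity and specificity for 0/1 labels."""
    pairs = list(zip(actual, predicted))
    tp = sum(1 for a, p in pairs if a == 1 and p == 1)  # true positives
    tn = sum(1 for a, p in pairs if a == 0 and p == 0)  # true negatives
    fp = sum(1 for a, p in pairs if a == 0 and p == 1)  # false positives
    fn = sum(1 for a, p in pairs if a == 1 and p == 0)  # false negatives
    return {
        "overall_error": (fp + fn) / len(pairs),
        "sensitivity": tp / (tp + fn),  # share of actual 1s predicted as 1
        "specificity": tn / (tn + fp),  # share of actual 0s predicted as 0
    }

# Placeholder data for illustration only.
actual = [1, 1, 1, 0, 0, 1, 0, 1]
predicted = [1, 0, 1, 0, 1, 1, 0, 1]
metrics = classification_metrics(actual, predicted)
print(metrics)
```

The steps are: count the four cells of the confusion matrix, then divide the misclassified cases by the total (overall error), the true positives by all actual positives (sensitivity), and the true negatives by all actual negatives (specificity).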

Question 7. Consider the following confusion matrix as the result of a logistic regression model that aims to classify a dependent variable y (whether or not the patient is having a heart attack) based on a given set of independent variables. In this model, y = 1 indicates having a heart attack and y = 0 indicates not having a heart attack. The cut-off value is set at 50%. Do you think we need to change the cut-off value? Justify your answer.
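To see how the cut-off drives the trade-off between missed heart attacks (false negatives) and false alarms (false positives), a small sketch may help. The probabilities and labels below are invented for illustration and are not the confusion matrix referred to in the question; `rates_at_cutoff` is a hypothetical helper.

```python
def rates_at_cutoff(actual, probs, cutoff):
    """Return (sensitivity, specificity) when predicting 1 if prob >= cutoff."""
    preds = [1 if p >= cutoff else 0 for p in probs]
    tp = sum(1 for a, p in zip(actual, preds) if a == 1 and p == 1)
    fn = sum(1 for a, p in zip(actual, preds) if a == 1 and p == 0)
    tn = sum(1 for a, p in zip(actual, preds) if a == 0 and p == 0)
    fp = sum(1 for a, p in zip(actual, preds) if a == 0 and p == 1)
    return tp / (tp + fn), tn / (tn + fp)

# Invented example: four patients with y = 1 and six with y = 0.
actual = [1, 1, 1, 1, 0, 0, 0, 0, 0, 0]
probs = [0.9, 0.6, 0.45, 0.3, 0.55, 0.4, 0.35, 0.2, 0.1, 0.05]

at_50 = rates_at_cutoff(actual, probs, 0.5)
at_30 = rates_at_cutoff(actual, probs, 0.3)
print(at_50, at_30)
```

In this invented data, lowering the cut-off from 0.5 to 0.3 raises sensitivity and lowers specificity, which is the usual direction of the trade-off when a missed positive is considered costlier than a false alarm.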

Question 8. Worksheet "Q8" presents a dataset from a blood bank. The data record blood donations made by a group of donors over a period of time. The donor ID is unique for each donor. A donor might have donated more than once in this period. At each donation, the blood total protein level of the donor has been recorded. There are some missing values for blood type. How can you fill in the missing values? Explain your approach (step by step) and apply it to fill in as many missing values as possible. (Save the results in an Excel worksheet and name it "Question 8".)
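Because a donor's blood type does not change between donations, one natural approach is to copy the type recorded at any of a donor's other donations into the rows where it is missing. The sketch below assumes hypothetical field names (`donor_id`, `blood_type`) and made-up records; the real column names in worksheet "Q8" may differ.

```python
from collections import Counter, defaultdict

def fill_blood_types(records):
    """Fill missing blood types from other donations by the same donor."""
    # Step 1: collect the blood types already recorded for each donor.
    known = defaultdict(Counter)
    for rec in records:
        if rec["blood_type"]:
            known[rec["donor_id"]][rec["blood_type"]] += 1
    # Step 2: fill each gap with the donor's most frequently recorded type.
    filled = []
    for rec in records:
        blood_type = rec["blood_type"]
        if not blood_type and rec["donor_id"] in known:
            blood_type = known[rec["donor_id"]].most_common(1)[0][0]
        filled.append({**rec, "blood_type": blood_type})
    return filled

# Made-up records; None marks a missing blood type.
records = [
    {"donor_id": 101, "blood_type": "A+"},
    {"donor_id": 101, "blood_type": None},
    {"donor_id": 102, "blood_type": None},
]
filled = fill_blood_types(records)
print(filled)
```

Donors with no recorded type at any donation (donor 102 here) stay missing, which is why the question asks to fill the values only "as much as possible".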

Question 9. Open the Excel worksheet "Q9". This data set contains 12 observations, in which x and y represent the independent and dependent variables, respectively. Develop a regression model which best represents the data set. You can develop a linear or a polynomial regression model, but you cannot exceed a degree 3 polynomial. In other words, your options are limited to degree 1, 2 and 3 regression models.

a. Develop a scatter chart and explain, based on the chart, what kind of regression model is potentially suitable.
b. Develop potential regression models and choose the best one. Write the regression equation which you think best represents your data set. Explain why the model you picked is the best one.
c. Develop residual plots for all of the regression models and explain whether they can help you in choosing an appropriate regression model.
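A minimal sketch of comparing the three candidate degrees is given below. It fits each polynomial by solving the least-squares normal equations directly and compares adjusted R-squared; the 12 data points are invented (not worksheet "Q9"), the helper names are hypothetical, and adjusted R-squared is only one possible selection criterion alongside the residual plots in part c.

```python
def fit_polynomial(xs, ys, degree):
    """Least-squares fit; returns coefficients [b0, b1, ..., b_degree]."""
    n = degree + 1
    # Normal equations (X^T X) b = X^T y with columns x^0 .. x^degree.
    a = [[sum(x ** (i + j) for x in xs) for j in range(n)] for i in range(n)]
    b = [sum(y * x ** i for x, y in zip(xs, ys)) for i in range(n)]
    # Gaussian elimination with partial pivoting.
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(a[r][col]))
        a[col], a[pivot] = a[pivot], a[col]
        b[col], b[pivot] = b[pivot], b[col]
        for r in range(col + 1, n):
            factor = a[r][col] / a[col][col]
            for c in range(col, n):
                a[r][c] -= factor * a[col][c]
            b[r] -= factor * b[col]
    # Back substitution.
    coeffs = [0.0] * n
    for i in reversed(range(n)):
        s = sum(a[i][j] * coeffs[j] for j in range(i + 1, n))
        coeffs[i] = (b[i] - s) / a[i][i]
    return coeffs

def adjusted_r2(xs, ys, coeffs):
    """Adjusted R-squared, penalising the number of polynomial terms."""
    n, p = len(ys), len(coeffs) - 1
    mean_y = sum(ys) / n
    preds = [sum(c * x ** k for k, c in enumerate(coeffs)) for x in xs]
    ss_res = sum((y - f) ** 2 for y, f in zip(ys, preds))
    ss_tot = sum((y - mean_y) ** 2 for y in ys)
    return 1 - (ss_res / (n - p - 1)) / (ss_tot / (n - 1))

# Invented data: a quadratic trend with a small alternating disturbance.
xs = list(range(1, 13))
ys = [2 + 0.5 * x * x + (0.3 if x % 2 else -0.3) for x in xs]

scores = {d: adjusted_r2(xs, ys, fit_polynomial(xs, ys, d)) for d in (1, 2, 3)}
print(scores)
```

Adjusted R-squared is used instead of plain R-squared because the latter never decreases when a higher-degree term is added, so it would always favour degree 3.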

Question 10. Assume that you are a business analyst in a manufacturing company. Your manager has given you the task of optimising the scheduling of a project, and you are asked to minimise the overall manufacturing time. There are three items to be manufactured, and three different machines are available for the task. For operational reasons, each machine can be allocated to the manufacturing of only one item, and each item can be manufactured by only one machine. The machines work at different speeds on different items. Let i and m represent item and machine, respectively: i1 represents the first item, i2 the second item, and so on; m1 represents the first machine, m2 the second machine, and so on. The time needed to manufacture each item on each machine is presented in the table below.

Item  m1  m2  m3
i1     5   5   3
i2     4   9   4
i3     3   6   6

In this problem the sequence of manufacturing is important. Items i1 and i2 should be manufactured before manufacturing item i3.

a) Write the linear optimisation model for the company to make the best decision.
b) Solve the model, present the results and interpret them.

Hint: you can use a binary variable such as x_im, which can take the values zero and one. Let x_im = 1 if machine m is engaged to manufacture item i; otherwise x_im = 0.
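With only three items and three machines, the binary model can be sanity-checked by enumerating every one-to-one assignment. The sketch below rests on one interpretation that the question does not spell out: i1 and i2 run in parallel on their own machines, and i3 starts only when both are finished, so the overall time is max(time of i1, time of i2) + time of i3. The names `TIME` and `makespan` are hypothetical.

```python
from itertools import permutations

# TIME[i][m]: duration of manufacturing item i on machine m (from the table).
TIME = [
    [5, 5, 3],  # i1 on m1, m2, m3
    [4, 9, 4],  # i2 on m1, m2, m3
    [3, 6, 6],  # i3 on m1, m2, m3
]

def makespan(assignment):
    """Overall time when assignment[i] is the machine index used for item i."""
    t1, t2, t3 = (TIME[i][m] for i, m in enumerate(assignment))
    # i1 and i2 run in parallel; i3 starts once both have finished.
    return max(t1, t2) + t3

# Each permutation gives every item a different machine, which enforces
# "one item per machine, one machine per item".
best = min(permutations(range(3)), key=makespan)
best_time = makespan(best)
print(best, best_time)
```

Under this interpretation the enumeration selects i1 on m2, i2 on m3, and i3 on m1, for an overall time of 8; the same assignment also minimises the plain sum of the three durations (12). A solver-based answer to part a would express the same constraints with x_im and linearise the max term with an auxiliary variable.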

Attachment:- Business Analytics.rar

Attachment:- Datasets.rar
