Reference no: EM133116496
Data Analytics For Business Assignment
Question 1
A lost-time injury is defined by Australian Workplace Standards as an occurrence that resulted in a fatality, permanent disability or time lost from work of one day/shift or more. The data is provided in the file "Question1.csv": Columns A and B contain the causes of lost-time injuries and their percentage of occurrence across the previous year at a mining site.
(i) What type of variable (Continuous, Discrete, Ordinal or Nominal) is Cause and justify your answer?
(ii) Which is the appropriate graphical display to use for the variable type you have identified in part (i)?
(iii) Use R to create an appropriate chart to graphically display the data provided.
(iv) Comment on the key finding from this chart.
Question 2
The Australian Bureau of Statistics regularly reports on large percentages of small businesses failing. In a bid to identify potential indicators, or symptoms, of business failure, a national study of small businesses was undertaken. A random sample of 100 small businesses was obtained and characteristics measured. One of the recorded variables was the ratio of current assets to current liabilities (variable name "Asset_Liability_Ratio"); roughly speaking, this is the amount that the firm is worth divided by what it owes.
Five years later these same small businesses were revisited. Among the variables collected was whether the small business was still operating or not; the latter meaning the business had failed or closed. This is an example of what is known as a longitudinal study.
The study was interested in, amongst many measures of performance, assessing whether the previously recorded ratio of current assets to liabilities differed between small businesses which were still operating five years later and those that were not.
The data is provided in the file "Question2.csv": Columns A and B contain the two variables.
(i) Use R to construct a histogram of Asset Liability Ratio for 100 small businesses. How would you describe the shape? Include your histogram.
(ii) There are two common graphical presentations used to compare the "Asset Liability Ratio" for the "still operational" and "now-closed" small businesses. Which one is preferred for this study? Name, justify your answer and provide the visual display using R.
(iii) Use R to find the mean, median, standard deviation and interquartile ranges of the "Asset Liability Ratio" for the "still operational" and "now-closed" small businesses.
(iv) Using your output created in parts (i)-(iii), give a brief report comparing the "Asset Liability Ratio" for the "still operational" and "now-closed" small businesses. (Hint: Think 3 S's for each group and provide a comparison summary.)
Question 3
The TCS Management Group selected 100 clients randomly and sent them a survey to complete regarding their satisfaction with dealings with TCS. In this survey people were asked about the level of satisfaction where a higher score was indicative of a higher level of satisfaction with possible scores ranging from 0 to 100. The average scores of the sample was 72.44 with a standard deviation of 8.18. Management expects the mean survey score to be at least 70. Provide a hypothesis test, at a 5% significance level, to test if the population mean survey score is at least 70. (Hint: Implement all steps.)
Note: Do the assignment using R. software
Attachment:- Data Analytics For Business.rar