Analysis of single variable in dataset

Assignment Help Basic Statistics
Reference no: EM132112520

Statistical Modelling Assignment

OVERVIEW OF THE ASSIGNMENT

This assignment will test your skills of collecting and analysing data to answer a specific business problem. It also gives you the opportunity to apply the theories you have learned in this course such as finding numerical summaries, displaying with appropriate graphs and using statistical inferences to solve business problems, including constructing hypotheses, test them and interpret the findings. You may have to use two Data sets. One Data set will be sent to you via KOI student email individually and you need to find or collect another dataset.

Suppose you are working for an agency who analyse NSW transport system data to make a recommendation to improve public transport system. You will be given series of research questions. Use your knowledge that you gain from this course to answer these questions by displaying appropriate outputs of Excel, StatKey or Wolfram alpha. Use these answers to write an executive summary which might be a valuable recommendation to Transport NSW.

TASK DESCRIPTION: WRITTEN REPORT

There are two datasets involved in this assignment: Dataset 1 and Dataset 2, detailed below.

Dataset 1: You will receive an email that contains a dataset that is specifically allocated to you. This dataset is a subset of a data Opal Tap on and Tap Off Location - 8th to 14th August 2016 individual sample file, provided by the Transport for NSW Open Data and has been edited to only include a subset of the cases and variables.

The original dataset can be obtained and it is under the license of Creative Commons Attribution 3.0 Australia. Data dictionary of the edited dataset is given in the following table.

Variable

Description

Values

mode

Type of the public transport

Bus, Train, Ferry and Light Rail

date

Date of the tap on/off held

Date/month/year

tap

It is a tap on or off

On and Off

loc

Locations of stops. For bus

postcodes and others name of the stations

Postcodes and names of the stations

count

Total number tap on or off on the certain location and

the certain date

Number

Dataset 2: Collect data (e.g. via a survey) that will answer research question given in section 3. There is no requirement about the number of variables, sampling methods and sample size, but you need to justify your approaches in Section 1 (see below).

Both datasets should be saved in an Excel file (one file, separate worksheets). All data processing should be performed in Excel or Statkey.

Prepare a report in a document file (.doc or .docx) which includes all relevant tables and figures, using the following structure:

1. Section 1: Introduction
a. Give a brief introduction about the assignment and search related article and write a paragraph of summary which supports your assignment. You need to give the full citation of the article.
b. Dataset 1: Give a short description about this dataset. Is this primary or secondary data? What are types of variables involved? Explain briefly what are the possible cases used in this study.
c. Dataset 2: Explain how you collect the data and discuss its limitation (e.g. whether your sample is biased). Is this primary or secondary data? What is/are the type(s) of variable(s) involved? Give a description of cases you consider for this data set.

2. Section 2: Analysis of single variable in Dataset 1
a. To answer research question "Which type of public transport was most used by the NSW people during 8th to 14th of August 2016?", provide a suitable numerical summary and graphical display for the variables mode of Dataset 1. Give a detailed comment to answer the research question.
b. Now to answer research question "Are there more than 50% of public transport users in NSW use the particular mode of transport found in Part a?" setup an appropriate hypotheses, perform hypotheses test and answer the research question by writing the conclusion of the test.

3. Section 3: Analysis of two variables in Dataset 1
NSW Government need to decide on whether they have to build an underground Railway line from either Parramatta, Bankstown or Gosford to central. To prepare a recommendation for this;
a. Give a numerical summary and an appropriate graphical display for the variables location, by only considering those three stations; and the variable count by considering the data with trains only.
b. Perform a suitable hypothesis test at a 5% level of significance to test whether there is difference between mean counts of taps on and off.
c. Use the conclusion of the test in part b and the outputs in part a to write a recommendation to NSW government.

4. Section 4: Collect and analysis Dataset2
You are interested in finding whether there is a difference in preference between different gender in terms of their transport mode (Bus, Train, Ferry and Light Rail). by considering appropriate number of cases and variable, give a proper graphical display and use it to write a comments.

Section 5: Discussion & Conclusion

Write an executive summary by combining all your findings in the previous sections which must be a valuable recommendation for NSW Transport. Give a suggestion for further research

TASK DESCRIPTION: PRESENTATION/INTERVIEW

A presentation/interview for the assignment is scheduled on Week 11, in your allocated tutorial.

You do NOT need to prepare a presentation material (e.g. power-point slides), instead, you will be asked to demonstrate and/or explain how you summarised the data and how you performed the analysis. You may be asked to reproduce what you have made in your written report (e.g. generate a chart or numerical summary using Excel or Statkey).

Attachment:- Data 15.rar

Verified Expert

The study design is an example of exploratory study design in which the research that is performed mainly to identify a solution for the solution for which the solution is yet to be derived. Initially descriptive statistics will be performed and it is general procedure to understand the distribution of the data

Reference no: EM132112520

Questions Cloud

Do you think that they should have access to direct lobbying : Do you think that they should have access to “Direct Lobbying?” Is this process enhancing or diminishing the government’s Bureaucratic behavior?
Holistic medicine center is opening in mixed urban community : A new holistic medicine center is opening in a mixed urban community that is starting to attract young professionals.
What type of study would be most appropriate : What type of study would be most appropriate to determine the economic value of the goods listed in question 1? Explain fully.
True regarding inventory turnover : Which statement is true regarding inventory turnover, In a PEST, how would a growing religious movement be categorized?
Analysis of single variable in dataset : brief introduction about the assignment and search related article and write a paragraph of summary which supports your assignment
Do some research on the given issue : In American negligence cases, if the plaintiff is successful, the plaintiff's attorney receives a contingency fee, i.e. a percentage of the damages awarded.
Describe situation that caused you broaden perspective : Describe a situation that caused you a broaden your perspective. Discuss Apple Inc ethical policy which include: trade secrets, discrimination, OSHA, marketing.
Which type of inventory consists of finished goods : Which type of inventory consists of finished goods? In the three factors of success model, what two components comprise the "acceptability” factor?
What are the counterpoints : Now in your final assignment, you will combine these writing techniques to write a stance essay. A stance essay takes a position on a topic and argues.

Reviews

inf2112520

11/1/2018 3:57:54 AM

Dataset 2, I was asked to "find whether there is a difference in preference between different gender in terms of their transport mode (Bus, Train, Ferry and Light Rail).by considering appropriate number of cases and variable, give a proper graphical display and use it to write a comments. thanks for making this assignment very simple and explained me all the aspects for the same..

inf2112520

11/1/2018 3:56:07 AM

Section 5: Discussion and Conclusion 5.a 5 Executive summary: 5 Write an executive summary by combining all your findings in the previous sections which must be a valuable recommendation for NSW Transport. 4.b. 2 Giving further research: 2 1.7 Written presentation

inf2112520

11/1/2018 3:56:01 AM

Correct Choice of Numerical summary: 1 Correct numerical values for all three Categories: 2 Comment: 2 a. Using appropriate numerical summary, describe the variables location with only categories Bankstown, Gosford and Parramatta and Count. 3.b. 6 Correct hypotheses: 1 Correct ANOVA table: 2 Correct p-value: 1 Correct conclusion: 2 c. Perform a suitable hypothesis test at a 5% level of significance to perform a hypoptheses test that there is a difference between the means of these categories. 3.c 5 Usage of conclusion in part b: 1 Usage of graph and numerical values: 2 Good recommondation: 2 d. Use conclusion in part b and outputs in part a appropriately to write a recommondation. Section 4: Collect and Analysis a data set 2 4.a 5 Correct Choice of graph: 1 Correct graph based on data:1 Title/label/legends:1 Use graph to answer the research question:2 a. Using appropriate graphical display, describe the variables in data set 2

inf2112520

11/1/2018 3:55:28 AM

2.a. 5 Correct choice of numerical summary: 1 Correct numerical values for all four Categories of the variable Mode: 2 Use graph and numerical summary to answer the research question: 2 a. Using suitable numerical summary, to answer the research question. 2.b. 5 Correct Hypotheses: 1 Checking Assumptions: 2 Correct Test Statistics: 2 b. Perform the hypotheses test for proportion with first three steps 2.b. 3 Correct P- Value: 1 Correct conclusion: 2 b. Perform the hypotheses test for proportion with last two steps Section 3: Analysis of two variables 3.a. 4 Correct Choice of graph: 1 Correct graph based on data: 1 Title/label/legends: 1 comment: 1 a. Using appropriate graphical display, describe the variables location with only categories Bankstown, Gosford and Parramatta and Count. 3.a. 5

inf2112520

11/1/2018 3:55:09 AM

Primary/secondary: 1 Types of variables: 1 Description of cases : 1 c. Dataset 2: Explain how you collect the data and discuss its limitation (e.g. whether your sample is biased). Is this primary or secondary data? What type of variable(s) is/are involved? You don’t need to display your data in this section. Section 2: Analysis of single variable 2.a. 5 Correct choice of graph: 1 Correct graph based on data: 1 Title/label/legends: 1 Comments: 2 a. Using suitable graphical display, describe the variable Mode for Dataset 1. Make sure your graph shows the appropriate features.

inf2112520

11/1/2018 3:28:18 AM

Clear description: 2 Primary/secondary: 1 Types of variables: 1 Description of cases: 1 b. Dataset 1: Give a short description about this dataset. Is this primary or secondary data? What types of variable(s) is involved? Describe the cases. 1.c. 5 Clear data collection description: 1 Limitation: 1

inf2112520

11/1/2018 3:27:39 AM

Section Mark Criteria Question Section 1: Introduction 1.a. 5 Clear and concise intro: 2 Proper citation: 1 A summary of a related article: 2 a. Give a brief introduction about the assignment, including your research question. Include a short summary of a related article with a proper citation. 1.b. 5

inf2112520

11/1/2018 3:27:17 AM

The assignment is correct There is a very important part I have uploaded a document that outlines all the requirements short summary (8-10 lines preferably) of related article to this topic along with citations and references It is 30 percent of my grade I am attaching an additional file that I have collected from my tutor. It is a marking rubric. Please check against this rubric to see if all the marking criteria have been met.

len2112520

9/14/2018 1:58:28 AM

The first file has requirements. Second file contains data set 1. Data set 2 needs to be created through surveys. you can just make it up. samplpe size needs to be atleast 20. and both word and excel files has to be there 1. Main report, in a Microsoft Word document file (this is the file that will be marked, it should contain all necessary tables and figures) 2. Dataset, in a Microsoft Excel file (this is just a supporting file) Main report (word document): 1. Size: A4 2. Use Assignment Cover Page (download from Moodle) with your details and signature 3. Single space 4. Font: Calibri, 11pt Dataset (excel document): 1. Dataset 1 in Sheet 1 2. Dataset 2 in Sheet 2 3. Data processing for each section in other sheets (rename the sheet appropriately)

Write a Review

Basic Statistics Questions & Answers

  Anthropological report on status of women

Based on anthropological reports in which the status of women is scored on a 10-point scale, the mean and standard deviation across many cultures are known. A new culture is found in which there is an unusual family arrangement.

  What is the probability that alice actually sent a one

Suppose for this part that she sends only one bit (a 0 or 1), with equal probabilities. If she sends a 0, there is a 5% chance of an error occurring.

  Calculate average age of all current first-time mothers

A May 8, 2008, report on National Public Radio (www.npr.org) noted that the average age of firsttime mothers in the United States is slightly higher.

  The wilcoxon signed rank statistic

The Wilcoxon signed rank statistic, The mean for the Wilcoxon signed rank test and The P-value for the Wilcoxon signed rank test

  A sample of 100 families was drawn for analysis of the

a sample of 100 families was drawn for analysis of the average grocery spending. assume that the individual weekly

  In the game of blackjack determine the odds of dealing

in the game of blackjack determine the odds of dealing yourself a blackjack ace face-card or ten from a standard

  Hypothesis testing-one-tailed test

A rental agent claims that the mean monthly rent,u, for apartments on the east side of town is less than$725 . A random sample of 13 monthly rents for apartments on the east side has a mean of$714

  Compare the answers obtained by using the normal

Compare the answers obtained by using the normal and Poisson approximations to the binomial law.

  In the clinical trials of the allergy medicine clarinex 50

in the clinical trials of the allergy medicine clarinex 50 out of 1655 individuals reported having dry mouth. in the

  Explain why the graph is deceptive

Hormone Replacement Therapy Again The bar chart shows a comparison of breast cancer rates for those who took HRT and those who took a placebo.

  Quantitative methods-power stations in south africa

The following table gives the coal usage and electricity generated by ten randomly selected power stations in South Africa in 1992:

  Calculate the usual measure of the level of uncertainty

Find the usual measure of the level of uncertainty in the percentage of loans you authorized that will never be repaid. Briefly interpret this number.

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd