Analyse nsw transport system

Assignment Help Basic Statistics
Reference no: EM132110208

Statistical Modelling Assignment

OVERVIEW OF THE ASSIGNMENT

This assignment will test your skills of collecting and analysing data to answer a specific business problem. It also gives you the opportunity to apply the theories you have learned in this course such as finding numerical summaries, displaying with appropriate graphs and using statistical inferences to solve business problems, including constructing hypotheses, test them and interpret the findings. You may have to use two Data sets. One Data set will be sent to you via KOI student email individually and you need to find or collect another dataset.

Suppose you are working for an agency who analyse NSW transport system data to make a recommendation to improve public transport system. You will be given series of research questions. Use your knowledge that you gain from this course to answer these questions by displaying appropriate outputs of Excel, StatKey or Wolfram alpha. Use these answers to write an executive summary which might be a valuable recommendation to Transport NSW.

TASK DESCRIPTION: WRITTEN REPORT

There are two datasets involved in this assignment: Dataset 1 and Dataset 2, detailed below.

Dataset 1: You will receive an email that contains a dataset that is specifically allocated to you. This dataset is a subset of a data Opal Tap on and Tap Off Location - 8th to 14th August 2016 individual sample file, provided by the Transport for NSW Open Data and has been edited to only include a subset of the cases and variables.

The original dataset can be obtained and it is under the license of Creative Commons Attribution 3.0 Australia. Data dictionary of the edited dataset is given in the following table.

Variable

Description

Values

mode

Type of the public transport

Bus, Train, Ferry and Light Rail

date

Date of the tap on/off held

Date/month/year

tap

It is a tap on or off

On and Off

loc

Locations of stops. For bus

postcodes and others name of the stations

Postcodes and names of the stations

count

Total number tap on or off on the certain location and

the certain date

Number

Dataset 2: Collect data (e.g. via a survey) that will answer research question given in section 3. There is no requirement about the number of variables, sampling methods and sample size, but you need to justify your approaches in Section 1 (see below).

Both datasets should be saved in an Excel file (one file, separate worksheets). All data processing should be performed in Excel or Statkey.

Prepare a report in a document file (.doc or .docx) which includes all relevant tables and figures, using the following structure:

1. Section 1: Introduction
a. Give a brief introduction about the assignment and search related article and write a paragraph of summary which supports your assignment. You need to give the full citation of the article.
b. Dataset 1: Give a short description about this dataset. Is this primary or secondary data? What are types of variables involved? Explain briefly what are the possible cases used in this study.
c. Dataset 2: Explain how you collect the data and discuss its limitation (e.g. whether your sample is biased). Is this primary or secondary data? What is/are the type(s) of variable(s) involved? Give a description of cases you consider for this data set.

2. Section 2: Analysis of single variable in Dataset 1
a. To answer research question "Which type of public transport was most used by the NSW people during 8th to 14th of August 2016?", provide a suitable numerical summary and graphical display for the variables mode of Dataset 1. Give a detailed comment to answer the research question.
b. Now to answer research question "Are there more than 50% of public transport users in NSW use the particular mode of transport found in Part a?" setup an appropriate hypotheses, perform hypotheses test and answer the research question by writing the conclusion of the test.

3. Section 3: Analysis of two variables in Dataset 1
NSW Government need to decide on whether they have to build an underground Railway line from either Parramatta, Bankstown or Gosford to central. To prepare a recommendation for this;
a. Give a numerical summary and an appropriate graphical display for the variables location, by only considering those three stations; and the variable count by considering the data with trains only.
b. Perform a suitable hypothesis test at a 5% level of significance to test whether there is difference between mean counts of taps on and off.
c. Use the conclusion of the test in part b and the outputs in part a to write a recommendation to NSW government.

4. Section 4: Collect and analysis Dataset2
You are interested in finding whether there is a difference in preference between different gender in terms of their transport mode (Bus, Train, Ferry and Light Rail). by considering appropriate number of cases and variable, give a proper graphical display and use it to write a comments.

Section 5: Discussion & Conclusion

Write an executive summary by combining all your findings in the previous sections which must be a valuable recommendation for NSW Transport. Give a suggestion for further research

TASK DESCRIPTION: PRESENTATION/INTERVIEW

A presentation/interview for the assignment is scheduled on Week 11, in your allocated tutorial.

You do NOT need to prepare a presentation material (e.g. power-point slides), instead, you will be asked to demonstrate and/or explain how you summarised the data and how you performed the analysis. You may be asked to reproduce what you have made in your written report (e.g. generate a chart or numerical summary using Excel or Statkey).

Attachment:- Assignment Description.rar

Reference no: EM132110208

Questions Cloud

Write your proof for each part in stepwise fashion : Write your proof for each part in stepwise fashion and provide justification for each step.
Write a rule to find facts of who will be pinched : Write a rule (and then test it) to find facts of who will be pinched on Saint Patrick's Day for not wearing green.
How many frames can be sent continuously : State your reason by explaining a scenario that will result in protocol failure if we send 16 frames continuously.
Develop the simulation model and make six runs : The probability of revisiting a workstation is independent in that same part could be sent back many times with no change in the probability.
Analyse nsw transport system : BUS708 Statistics and Data Analysis - Statistical Modelling Assignment - significance to test whether there is difference between mean counts of taps on
What are the pros and cons of having a database language : What are the pros and cons of having a database language (like SQL) based on an industry accepted standard?
Which protocol are used in wlan : Which Protocol are used in WLAN (Wireless local area network). and describe functionality of these protocol in wireless local area network.
Discuss the trade-offs between sharing and security : Provide an instance that comes close to your ideal balance between resource sharing and protection against unauthorized resource access.
Design a local area network for the given case study : MN621 Advanced Network Design Report Assignment - Local Area Network Design and Setup, MIT Australia. Design a local area network for the given case study

Reviews

len2110208

9/11/2018 5:22:27 AM

Deadline to submit written report: Week 10 Friday (21), 11:59 pm You need to submit: 1. Main report, in a Microsoft Word document file (this is the file that will be marked, it should contain all necessary tables and figures) 2. Dataset, in a Microsoft Excel file (this is just a supporting file) Main report (word document): 1. Size: A4 2. Use Assignment Cover Page (download from Moodle) with your details and signature 3. Single space 4. Font: Calibri, 11pt Dataset (excel document): 1. Dataset 1 in Sheet 1 2. Dataset 2 in Sheet 2 3. Data processing for each section in other sheets (rename the sheet appropriately)

Write a Review

Basic Statistics Questions & Answers

  The birthday problem

Suppose there are C people, each of whose birthdays (month and day only) are equally likely to fall on any of the 365 days of a normal (i.e., non-leap) year

  Write regression equation that can be used to predict sales

Use the above results and write the regression equation that can be used to predict sales. Estimate the sales volume for an advertising expenditure of 3.5 million dollars and 45 salespeople. Give your answer in dollars

  Normal distribution of blood protoplasm

Porphyrin is a pigment in blood protoplasm and other body fluids that is significant in body energy and storage. Let x be a random variable that represents the number of milligrams of porphyrin per deciliter of blood.

  Detecting the difference in proportions

The trial failed to show significance. How many subjects would be required to detect the difference in proportions observed in the trial with 80% power?

  A company that manufacturers coffee for use in commercial

a company that manufacturers coffee for use in commercial machines monitors the caffeine content in its coffee. the

  How many calories should it have using model

Build a predictive model for number of calories using fat grams. If a pizza has 15 grams of fat, how many calories should it have, using your model?

  How to Interpret the Values of Correlations

What is Correlation and How to Interpret the Values of Correlations

  Economic tensions that appeared throughout colonial america

What were the more important political, social, religious, and economic tensions that appeared throughout colonial America in the seventeenth century?

  How many broken eggs do you expect to get

How many broken eggs do you expect to get?- What's the standard deviation?- What assumptions did you have to make about the eggs in order to answer this question?

  Concept of weighted average cost of capital

What is the average cost of capital per dollar raised (this is similar to the concept of weighted average cost of capital in your finance classes)?

  Short hedger or a long hedger in corn futures

Should the company be a short hedger or a long hedger in corn futures?

  Information about probability and decision making

One of the problems encountered by corporations in America is finding an adequate number of employees who want to move into management. Recent surveys of workers in America taken by the Department of Labor in Washington D. C.

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd