Perform a suitable statistical analysis on dataset

Reference no: EM131994615

BUS708 Statistics and Data Analysis Statistical Modelling Assignment -

OVERVIEW OF THE ASSIGNMENT -

This assignment will test your skill in collecting and analysing data to answer a specific business problem. It will also test your understanding of, and skill in using, statistical methods to make inferences about business data and solve business problems, including constructing hypotheses, testing them and interpreting the findings.

The gender gap is the difference between the salaries of men and the salaries of women. It arises not only from discrimination in hiring but also from the different industries in which women and men work, as well as many other factors. Using an edited subset of the sample file from the Australian Taxation Office (ATO), your task is to summarise and analyse several aspects of salary and occupation by gender. In addition, you are asked to propose one relevant research question and then collect and analyse a dataset that will answer it.

TASK DESCRIPTION: WRITTEN REPORT -

There are two datasets involved in this assignment, Dataset 1 and Dataset 2, as detailed below.

Dataset 1: You will receive an email that contains a dataset that is specifically allocated to you. This dataset is a subset of the 2013-2014 individual sample file provided by the ATO, and has been edited to include only a subset of the cases and variables.

Dataset 2: Collect data (e.g. via a survey) that will answer your research question. There is no requirement about the number of variables, sampling methods and sample size, but you need to justify your approaches in Section 1 (see below).

Both datasets should be saved in one Excel file, on separate worksheets. All data processing should be performed in Excel or StatKey.

Prepare a report in a document file (.doc or .docx) which includes all relevant tables and figures, using the following structure:

1. Section 1: Introduction

a. Give a brief introduction to the assignment, including your research question. Include a short summary of a related article with a proper citation.

b. Dataset 1: Give a short description of this dataset. Is this primary or secondary data? What type(s) of variable are involved? Display the first 5 cases of your dataset.

c. Dataset 2: Explain how you collected the data and discuss its limitations (e.g. whether your sample is biased). Is this primary or secondary data? What type(s) of variable are involved? You don't need to display your data in this section.

2. Section 2: Descriptive Statistics

Use Dataset 1

a. Using a suitable graphical display, describe the relationship between the variables Gender and Occ_code for Dataset 1. Make sure your graph shows the distribution of Gender for each Occ_code.

b. Using a suitable graphical display, describe the relationship between the variables Gender and Sw_amt.

c. Using a suitable numerical summary, describe the relationship between the variables Gender and Sw_amt.

d. Using a suitable graphical display, describe the relationship between the variables Sw_amt and Gift_amt.
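As an illustration of the kind of numerical summary item c asks for, the sketch below groups a salary column by gender and reports the count, mean, median, standard deviation, minimum and maximum for each group. The assignment itself requires the processing to be done in Excel or StatKey, so this is only a way to see the arithmetic. The rows are made-up values, not ATO data, and the function name `summarise_by_group` is hypothetical.

```python
from statistics import mean, median, stdev

# Hypothetical (Gender, Sw_amt) rows; illustrative values only, not ATO data.
rows = [
    ("Male", 52000), ("Female", 48000), ("Male", 61000),
    ("Female", 55000), ("Male", 45000), ("Female", 50000),
]

def summarise_by_group(rows):
    """Return {group: summary statistics} for (group, value) pairs."""
    groups = {}
    for group, value in rows:
        groups.setdefault(group, []).append(value)
    return {
        g: {
            "n": len(v),
            "mean": mean(v),
            "median": median(v),
            # Sample standard deviation needs at least two values.
            "sd": stdev(v) if len(v) > 1 else 0.0,
            "min": min(v),
            "max": max(v),
        }
        for g, v in groups.items()
    }

summary = summarise_by_group(rows)
for gender, stats in sorted(summary.items()):
    print(gender, stats)
```

Because salary data is often right-skewed, comparing the median alongside the mean for each gender is usually more informative than the mean alone.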

3. Section 3: Inferential Statistics

Use Dataset 1

a. List the top 4 occupations based on median salary and find the proportion of each gender in those top 4 occupations.

b. Perform a suitable hypothesis test at a 5% level of significance to test whether the proportion of machinery operators and drivers who are male is more than 80%.

c. Perform a suitable hypothesis test at a 5% level of significance to test whether there is a difference in salary amount between genders.

Use Dataset 2

d. Perform a suitable statistical analysis on Dataset 2 (the one you collected) that will answer your research question.
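The arithmetic behind items b and c can be sketched as follows. This is one possible approach under stated assumptions, not the prescribed method, and the assignment requires the actual analysis to be done in Excel or StatKey. For item b it assumes a right-tailed one-sample z-test for a proportion (H0: p = 0.80, Ha: p > 0.80), which relies on the normal approximation being reasonable, commonly checked as n*p0 and n*(1 - p0) both being at least 10. For item c it assumes a two-sided Welch two-sample t-test; the function returns the t statistic and degrees of freedom, and the p-value can then be looked up, for example with Excel's T.DIST.2T. All counts and function names are hypothetical.

```python
from math import sqrt
from statistics import NormalDist, mean, variance

def one_sample_proportion_z(successes, n, p0):
    """Right-tailed z-test of H0: p = p0 against Ha: p > p0."""
    p_hat = successes / n
    se = sqrt(p0 * (1 - p0) / n)       # standard error under H0
    z = (p_hat - p0) / se
    p_value = 1 - NormalDist().cdf(z)  # upper-tail area
    return p_hat, z, p_value

def welch_t(sample_a, sample_b):
    """Welch t statistic and Welch-Satterthwaite degrees of freedom."""
    na, nb = len(sample_a), len(sample_b)
    va = variance(sample_a) / na       # squared standard error, group A
    vb = variance(sample_b) / nb       # squared standard error, group B
    t = (mean(sample_a) - mean(sample_b)) / sqrt(va + vb)
    df = (va + vb) ** 2 / (va ** 2 / (na - 1) + vb ** 2 / (nb - 1))
    return t, df

# Hypothetical counts: 90 males among 100 machinery operators and drivers.
p_hat, z, p_value = one_sample_proportion_z(successes=90, n=100, p0=0.80)
print(f"p_hat={p_hat:.3f}, z={z:.3f}, p-value={p_value:.4f}")
print("Reject H0 at 5%" if p_value < 0.05 else "Do not reject H0 at 5%")

# Hypothetical salary samples for the two genders (not ATO data).
t, df = welch_t([1, 2, 3], [2, 3, 4])
print(f"t={t:.3f}, df={df:.1f}")
```

The Welch form is chosen here because it does not assume equal variances in the two groups; if the group variances are similar, a pooled two-sample t-test would give nearly the same result.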

4. Section 4: Discussion & Conclusion

a. What can you conclude from your findings in the previous sections?

b. Give a suggestion for further research.

TASK DESCRIPTION: PRESENTATION/INTERVIEW -

You do NOT need to prepare presentation material (e.g. PowerPoint slides). Instead, you will be asked to demonstrate and/or explain how you summarised the data and how you performed the analysis. You may be asked to reproduce what you produced in your written report (e.g. generate a chart or numerical summary using Excel or StatKey).

Attachment:- Assignment Files.rar

SUBMISSION REQUIREMENT -

Deadline to submit written report: Week 10 Wednesday (23), 5pm.

You need to submit 2 files to Turnitin:

a. Main report, in a Microsoft Word document file (this is the file that will be marked; it should contain all necessary tables and figures).

b. Dataset, in a Microsoft Excel file (this is just a supporting file).

Main report (Word document): size A4; use the Assignment Cover Page (download from Moodle) with your details and signature; single spacing; font Calibri, 11pt.

Dataset (Excel document): Dataset 1 in Sheet 1, Dataset 2 in Sheet 2, and data processing for each section in other sheets (rename the sheets appropriately).

DEDUCTION, LATE SUBMISSION AND EXTENSION -

Late submission penalty: 5% of the total available marks per calendar day unless an extension is approved. For the extension application procedure, please refer to Section 3.3 of the Subject Outline.
