Describe the variable lodgement method for dataset

Assignment Help Applied Statistics
Reference no: EM131991408

Statistics and Data Analysis - Statistical Modelling Assignment

1 OVERVIEW OF THE ASSIGNMENT

This assignment will test your skill to collect and analyse data to answer a specific business problem. It will also test your understanding and skill to use statistical methods to make inferences about business data and solve business problems, including constructing hypotheses, test them and interpret the findings.

In Australia, many people need to lodge a tax return after the end of the financial year. They can prepare and lodge their own return or pay a registered tax agent to do it for them. By using a subset of the sample file from the Australian Taxation Office (ATO), your task is to summarise and analyse several aspects of this lodgement method. We are interested to know the proportion of people who lodge a tax return using a tax agent; whether there is a difference among the age groups in terms of their lodgement method; whether there is a relationship between total income and the lodgement method; and the relationship between total income and deduction amount. In addition, you are also asked to collect and analyse a dataset about international students' preference of tax return lodgement method.

2 TASK DESCRIPTION: WRITTEN REPORT

There are two datasets involved in this assignment: Dataset 1 and Dataset 2, detailed below.

Dataset 1: You will receive an email that contains a dataset that is specifically allocated to you. This dataset is a subset of 2013-2014 individual sample file, provided by the ATO and has been edited to only include a subset of the cases and variables.
Data dictionary of the edited dataset is given in the following table.

Variable name

Description

Values

Gender

Gender (sex)

0 = males, 1 = females

age_range

Age in five years ranges

0 = 70 and over 1 = 65 to 69

2 = 60 to 64

3 = 55 to 59

4 = 50 to 54

5 = 45 to 49

6 = 40 to 44

7 = 35 to 39

8 = 30 to 34

9 = 25 to 29

10 = 20 to 24

11 = under 20

Lodgment_method

Lodgment method

A = Tax Agent

 

 

S = Self Preparer

Tot_inc_amt

Total income

All numeric

Tot_ded_amt

Total deductions

All numeric

Dataset 2: Collect data (e.g. via a survey) from international students about whether they would use a tax agent to lodge a tax return in the future. There is no requirement about sampling methods and sample size, but you need to justify your approaches in Section 1.

Both datasets should be saved in an Excel file (one file, separate worksheets). All data processing should be performed primarily in Excel or by using Statkey tool

Prepare a report in a document file (.doc or .docx) which includes all relevant tables and figures, using the following structure:

1. Section 1: Introduction
a. Give a brief introduction about the assignment, and include a short summary of a related article with a proper citation.
b. Dataset 1: Give a short description about this dataset. Is this primary or secondary data? What types of variable(s) is involved? Display the first 5 cases of your dataset.
c. Dataset 2: Explain how you collect the data and discuss whether your sample is biased. Is this primary or secondary data? What type of variable(s) is/are involved? You don't need to display your data in this section.

2. Section 2: Lodgement Method - Dataset 1
Use Dataset 1
a. Using suitable graphical displays, describe the variable lodgement method for Dataset 1.
b. Calculate a 95% confidence interval of the proportion of tax payers who lodge the tax return by using an Agent.
c. Give a short comment about your finding.

3. Section 3: Lodgement Method - Dataset 2
Use Dataset 2
a. Using suitable graphical displays, describe the variable lodgement method for Dataset 2.
b. Calculate a 95% confidence interval of the proportion of tax payers who lodge the tax return by using an Agent.
c. Compare this result with the result in Section 2 and make a comment whether there is a difference between dataset 1 and dataset 2 in terms of lodgement method.

4. Section 4: Lodgement Method and Age Group
Use Dataset 1
a. Describe the relationship between the age group and lodgement method using suitable graphical display and numerical summary.
b. Perform a suitable hypothesis test at a 5% level of significance to test whether the two variables are associated.
c. Give a short comment about your finding.

5. Section 5: Lodgement Method and Total Income Amount
Use Dataset 1
a. Describe the relationship between total income and lodgement method using suitable graphical display and numerical summary.

b. Provide a comment about your result in part a (include a comment about the shape of the distribution, centre, spread and outliers).

6. Section 6: Total Income Amount and Deduction Amount
a. Describe the relationship between total income and total deduction using suitable graphical display and numerical summary, for each type of lodgement method.
b. Provide a comment about your result in part a.

7. Section 7: Conclusion
a. What can you conclude from your findings in the previous sections?
b. Give a suggestion for further research

3 TASK DESCRIPTION: PRESENTATION/INTERVIEW

A presentation/interview for the assignment is scheduled on Week 11, in your allocated tutorial.

You do NOT need to prepare a presentation material (e.g. power-point slides), instead, you will be asked to demonstrate and/or explain how you summarised the data and how you performed the analysis. You may be asked to replicate what you have made in your written report (e.g. generate a chart or numerical summary using Excel or Statkey).

Attachment:- Assignment Dataset.rar

Verified Expert

A mutual fund is an investment vehicle made up of a pool of moneys collected from many investors for the purpose of investing in securities such as stocks, bonds, money market instruments and other assets.

Reference no: EM131991408

Questions Cloud

Global financial crisis : Could you please help me explain in what ways the US banking system behaved unethically in the years during the global financial crisis?
Small amount of radiation escapes : Elmer's Glue when a small amount of radiation escapes. It is not deadly, but causes about $100 in damages to everyone in the city with a population of 1 million
Social cost of crimes committed : Calculate the effect that hiring the new policemen would have on the social cost of crimes committed.
Without the company making the new investment : What is the price per share of PSI stock today without the company making the new investment?
Describe the variable lodgement method for dataset : BUS708 Statistics and Data Analysis - Statistical Modelling Assignment - collect and analyse data to answer a specific business problem. It will also test your
Continue with the same dividend pattern moving forward : You expect that Big Bob’s and Silly Sam’s will continue with the same dividend pattern moving forward.
Analyze role that learned helplessness plays in depression : Analyze the role that learned helplessness plays in depression. Discuss cognitive interventions that could be used in treating a client who is presenting.
What is the wisconsin act 10 : What is the Wisconsin Act 10 and explain whether you would or would not have supported the law when it passed.
Number of apples consumed and xo is the number : Jack and Phil both like apples and oranges. Jack's preferences over these two goods are represented by the utility function U(xa, xo) = log(xa) + log(xo)

Reviews

len1991408

5/22/2018 1:24:26 AM

Deadline to submit written report: Week 10 Wednesday (23rd), 5pm You need to submit 2 files to Turnitin: 1. Main report, in a Microsoft Word document file (this is the file that will be marked, it should contain all necessary tables and figures) 2. Dataset, in a Microsoft Excel file (this is just a supporting file) Main report (word document): 1. Size: A4 2. Use Assignment Cover Page (download from Moodle) with your details and signature 3. Single space 4. Font: Calibri, 11pt Dataset (excel document): 1. Dataset 1 in Sheet 1 2. Dataset 2 in Sheet 2 3. Data processing for each section in other sheets (rename the sheet appropriately)

Write a Review

Applied Statistics Questions & Answers

  Let x denote the mean of a random sample of size 128

Let X denote the mean of a random sample of size 128

  Construct a stem-and-leaf display for the data

List all the values in a table and then construct a stem-and-leaf display for the data and construct a relative frequency histogram for these data with equal class widths, the first class being "$4 to less than $6".

  Brief literature review of factors influencing sales

Brief literature review of factors influencing sales - investigate the relationship between advertising expenditure and sales.

  A typical incoming telephone call

A typical incoming telephone call to your catalog sales force results in a mean order of $ 28.63 with a standard deviation of $ 13.91. You may assume that orders are received independently of one another.

  Estimate the difference between the proportion of population

Estimate the difference between the proportion of the population of low birth weight children and the proportion of the population of normal birth weight children who graduate from high school. Report a standard error for your estimate.

  Regression of the rate of inflation

Using the data on the growth rate of Money and inflation, run a regression of the rate of inflation on the rate of growth of the money supply

  Write a ten pages term paper about radiation safety programs

Write a ten pages term paper about Radiation Safety Programs. Cover and Reference citation pages are required but do not count toward the narrative page count.

  Find the ppv and npv for a population

Are the events of being man-made and being polluted independent and write out the sample space - Find the PPV and NPV for a population where 2% of the people have the disease.

  Advantages and disadvantages of repeated measures

Define mixed designs and name two assumptions of mixed designs

  The standard deviation of systolic blood pressure

A doctor claims that the standard deviation of systolic blood pressure is 12 mmHg. A random sample of 24 patients found a standard deviation of 14 mmHg. Assume the variable is normally distributed.At a = 0.01, what are the critical X²

  Plot a scatterplot of evaluations against beauty and draw

Plot a scatterplot of evaluations against beauty and draw (by hand or computer), including the fitted line/curve from your preferred model

  Alomega pharmaceuticalsalomega pharmaceuticals is a small

alomega pharmaceuticalsalomega pharmaceuticals is a small to mid size pharmaceutical company. the accounting

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd