Calculate confidence interval of proportion of tax payers

Reference no: EM131828789

Statistics and Data Analysis Statistical Modelling Assignment

OVERVIEW OF THE ASSIGNMENT - This assignment will test your ability to collect and analyse data to answer a specific business problem. It will also test your understanding of, and skill in using, statistical methods to make inferences about business data and solve business problems, including constructing hypotheses, testing them and interpreting the findings.

In Australia, many people need to lodge a tax return after the end of the financial year. They can prepare and lodge their own return or pay a registered tax agent to do it for them. By using a subset of the sample file from the Australian Taxation Office (ATO), your task is to summarise and analyse several aspects of this lodgement method. We are interested to know the proportion of people who lodge a tax return using a tax agent; whether there is a difference among the age groups in terms of their lodgement method; whether there is a relationship between total income and the lodgement method; and the relationship between total income and deduction amount. In addition, you are also asked to collect and analyse a dataset about international students' preference of tax return lodgement method.

TASK DESCRIPTION: WRITTEN REPORT -

There are two datasets involved in this assignment: Dataset 1 and Dataset 2, detailed below.

Dataset 1: You will receive an email that contains a dataset that is specifically allocated to you. This dataset is a subset of the 2013-2014 individual sample file provided by the ATO, and has been edited to include only a subset of the cases and variables. The original dataset is attached, and it is under the Creative Commons Attribution 3.0 Australia licence. The data dictionary of the edited dataset is given in the following table.

Variable name | Description | Values
Gender | Gender (sex) | 0 = males, 1 = females
age_range | Age in five-year ranges | 0 = 70 and over; 1 = 65 to 69; 2 = 60 to 64; 3 = 55 to 59; 4 = 50 to 54; 5 = 45 to 49; 6 = 40 to 44; 7 = 35 to 39; 8 = 30 to 34; 9 = 25 to 29; 10 = 20 to 24; 11 = under 20
Lodgment_method | Lodgment method | A = Tax Agent; S = Self Preparer
Tot_inc_amt | Total income | All numeric
Tot_ded_amt | Total deductions | All numeric

Dataset 2: Collect data (e.g. via a survey) from international students about whether they would use a tax agent to lodge a tax return in the future. There is no requirement about sampling methods and sample size, but you need to justify your approaches in Section 1 (see below).

Prepare a report in a document file (.doc or .docx) which includes all relevant tables and figures, using the following structure:

1. Section 1: Introduction

a. Give a brief introduction about the assignment, and include a short summary of a related article with a proper citation.

b. Dataset 1: Give a short description of this dataset. Is this primary or secondary data? What type(s) of variable are involved? Display the first 5 cases of your dataset.

c. Dataset 2: Explain how you collected the data and discuss whether your sample is biased. Is this primary or secondary data? What type(s) of variable are involved? You don't need to display your data in this section.

2. Section 2: Lodgement Method - Dataset 1

Use Dataset 1

a. Using suitable graphical displays, describe the variable lodgement method for Dataset 1.

b. Calculate a 95% confidence interval of the proportion of tax payers who lodge the tax return by using an Agent.

c. Give a short comment about your finding.
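The interval asked for in part b can be sketched as follows. This is a minimal illustration that assumes the normal-approximation (Wald) interval, p ± z·sqrt(p(1 − p)/n) with z = 1.96 for 95% confidence; the brief does not prescribe a particular method. The function name `proportion_ci` and the counts (700 agent lodgements out of 1000 cases) are hypothetical and not taken from the dataset.

```python
import math


def proportion_ci(successes, n, z=1.96):
    """Wald (normal-approximation) confidence interval for a proportion.

    z=1.96 corresponds to a 95% confidence level.
    """
    if n <= 0 or not 0 <= successes <= n:
        raise ValueError("need 0 <= successes <= n and n > 0")
    p_hat = successes / n                      # sample proportion
    se = math.sqrt(p_hat * (1 - p_hat) / n)    # standard error of p_hat
    margin = z * se                            # margin of error
    # Clamp to [0, 1] because a proportion cannot fall outside that range.
    return p_hat, max(0.0, p_hat - margin), min(1.0, p_hat + margin)


# Hypothetical counts, for illustration only.
p_hat, lower, upper = proportion_ci(successes=700, n=1000)
print(f"p_hat = {p_hat:.3f}, 95% CI = ({lower:.4f}, {upper:.4f})")
```

The Wald interval is usually considered adequate when both n·p and n·(1 − p) are reasonably large; for small samples an alternative such as the Wilson interval may be preferable.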

3. Section 3: Lodgement Method - Dataset 2

Use Dataset 2

a. Using suitable graphical displays, describe the variable lodgement method for Dataset 2.

b. Calculate a 95% confidence interval of the proportion of tax payers who lodge the tax return by using an Agent.

c. Compare this result with the result in Section 2 and comment on whether there is a difference between Dataset 1 and Dataset 2 in terms of lodgement method.
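One possible way to support the comparison in part c is a confidence interval for the difference between the two proportions. The sketch below assumes two independent samples and the normal approximation; the brief does not require this method, and checking whether the two separate intervals overlap is another common, more conservative option. The name `diff_proportion_ci` and all counts are hypothetical.

```python
import math


def diff_proportion_ci(x1, n1, x2, n2, z=1.96):
    """Normal-approximation CI for p1 - p2 from two independent samples."""
    p1, p2 = x1 / n1, x2 / n2
    # Standard error of the difference: the two sampling variances add.
    se = math.sqrt(p1 * (1 - p1) / n1 + p2 * (1 - p2) / n2)
    diff = p1 - p2
    return diff, diff - z * se, diff + z * se


# Hypothetical counts: 700 of 1000 in Dataset 1, 20 of 50 in Dataset 2.
diff, lower, upper = diff_proportion_ci(700, 1000, 20, 50)
print(f"difference = {diff:.3f}, 95% CI = ({lower:.3f}, {upper:.3f})")
# If the interval excludes 0, the data suggest the proportions differ.
excludes_zero = lower > 0 or upper < 0
print("interval excludes zero:", excludes_zero)
```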

4. Section 4: Lodgement Method and Age Group

Use Dataset 1

a. Describe the relationship between age group and lodgement method using a suitable graphical display and numerical summary.

b. Perform a suitable hypothesis test at a 5% level of significance to test whether the two variables are associated.

c. Give a short comment about your finding.
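For two categorical variables such as age group and lodgement method, a chi-square test of independence is one commonly used choice for part b. The sketch below computes the test statistic and degrees of freedom from a contingency table of counts. The table is hypothetical and uses only three age groups for brevity (the real data has twelve), and the critical value 5.991 is the standard 5% chi-square value for 2 degrees of freedom; for the full 12 x 2 table the degrees of freedom would be 11.

```python
def chi_square_independence(table):
    """Return (statistic, degrees_of_freedom) for a table of observed counts.

    `table` is a list of rows; each row lists the counts per column category.
    """
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    grand_total = sum(row_totals)
    statistic = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            # Expected count under independence of rows and columns.
            expected = row_totals[i] * col_totals[j] / grand_total
            statistic += (observed - expected) ** 2 / expected
    dof = (len(table) - 1) * (len(col_totals) - 1)
    return statistic, dof


# Hypothetical counts: rows are age groups, columns are (Agent, Self).
observed = [
    [60, 40],
    [70, 30],
    [80, 20],
]
statistic, dof = chi_square_independence(observed)
critical_value = 5.991  # chi-square critical value for dof = 2 at the 5% level
reject_h0 = statistic > critical_value
print(f"chi-square = {statistic:.3f}, dof = {dof}, reject H0: {reject_h0}")
```

Here H0 states that the two variables are independent. The approximation is usually considered reliable when the expected counts are not too small (a common rule of thumb is at least 5 per cell).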

5. Section 5: Lodgement Method and Total Income Amount

Use Dataset 1

a. Describe the relationship between total income and lodgement method using a suitable graphical display and numerical summary.

b. Provide a comment about your result in part a (include a comment about the shape of the distribution, centre, spread and outliers).
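A numerical summary of total income by lodgement method could look like the sketch below. It reports the centre (mean, median), the spread (standard deviation, IQR) and flags outliers with the 1.5 x IQR rule, which is one common convention rather than something the brief specifies. The income values and the helper name `summarise` are hypothetical; quartiles use the default method of Python's `statistics.quantiles`, and other tools (for example Excel) may give slightly different quartile values.

```python
import statistics


def summarise(values):
    """Centre, spread and 1.5*IQR outliers for a list of numbers."""
    ordered = sorted(values)
    q1, median, q3 = statistics.quantiles(ordered, n=4)  # quartile cut points
    iqr = q3 - q1
    low_fence, high_fence = q1 - 1.5 * iqr, q3 + 1.5 * iqr
    return {
        "mean": statistics.mean(ordered),
        "median": median,
        "stdev": statistics.stdev(ordered),  # sample standard deviation
        "q1": q1,
        "q3": q3,
        "iqr": iqr,
        "outliers": [v for v in ordered if v < low_fence or v > high_fence],
    }


# Hypothetical total incomes grouped by lodgement method (A = agent, S = self).
incomes = {
    "A": [30000, 42000, 45000, 52000, 58000, 61000, 250000],
    "S": [20000, 28000, 31000, 35000, 41000, 44000, 48000],
}
summaries = {method: summarise(values) for method, values in incomes.items()}
for method, summary in summaries.items():
    print(method, summary)
```

A mean well above the median, as in the hypothetical group A, is a typical sign of a right-skewed distribution, which is common for income data.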

6. Section 6: Total Income Amount and Deduction Amount

a. Describe the relationship between total income and total deduction using a suitable graphical display and numerical summary, for each type of lodgement method.

b. Provide a comment about your result in part a.
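For two numeric variables, the Pearson correlation coefficient is one possible numerical summary to accompany a scatter plot. The sketch below computes it separately for each lodgement method. The records and the helper name `pearson_r` are hypothetical; the tuple layout (method, total income, total deduction) only mirrors the variables Lodgment_method, Tot_inc_amt and Tot_ded_amt from the data dictionary.

```python
import math


def pearson_r(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    if sxx == 0 or syy == 0:
        raise ValueError("correlation is undefined for a constant variable")
    return sxy / math.sqrt(sxx * syy)


# Hypothetical records: (lodgement method, total income, total deduction).
records = [
    ("A", 30000, 1000), ("A", 50000, 2000), ("A", 70000, 3000),
    ("S", 25000, 500), ("S", 40000, 300), ("S", 60000, 900), ("S", 80000, 1200),
]

correlations = {}
for method in ("A", "S"):
    incomes = [inc for m, inc, _ in records if m == method]
    deductions = [ded for m, _, ded in records if m == method]
    correlations[method] = pearson_r(incomes, deductions)

for method, r in correlations.items():
    print(f"{method}: r = {r:.3f}")
```

Pearson's r measures only linear association and is sensitive to outliers, so it is best read alongside the scatter plot rather than on its own.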

7. Section 7: Conclusion

a. What can you conclude from your findings in the previous sections?

b. Give a suggestion for further research.

Attachment:- Tax Return Dataset File.rar

Verified Expert

The assignment covers statistical modelling and data analysis. The report summarises the data and presents the analysis. It is organised into sections that explain the lodgement method, the total income amount and the deduction amount, and the relationship between total income and the type of lodgement method. The solution is provided as a Microsoft Word file.


Reviews

len1828789

1/23/2018 7:32:41 AM

SUBMISSION REQUIREMENT - Deadline to submit written report: Week 10 Wednesday (24), 5pm. You need to submit 2 files to Turnitin: the main report, in a Microsoft Word document file, and the dataset, in a Microsoft Excel file. Main report (Word document): Size: A4; use a cover page with your details and signature; single spacing; Font: Calibri, 11pt. Dataset (Excel document): Dataset 1 in Sheet 1, Dataset 2 in Sheet 2, and pivot tables and any other information in other sheets (rename the sheets appropriately).

len1828789

1/23/2018 7:32:34 AM

DEDUCTION, LATE SUBMISSION AND EXTENSION - There is a 2-mark deduction (out of 20) for students who do not address the specification in the submission requirement listed in Section 3 above. Late submission penalty: - 5% of the total available marks per calendar day unless an extension is approved. For extension application procedure, please refer to Section 3.2.1 of the Subject Outline.


All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd