Construct a frequency distribution of the expenditures

Assignment Help Basic Statistics
Reference no: EM131941947

Statistics for Business and Finance - Analysing Household Data

Have you been part of a national census? Privacy issues aside, a census provides lots of data that can inform government policies and actions - but to be useful, the data needs to be analysed and interpreted.

In this assignment, we will use statistical methods to analyse and interpret real-world demographic data.

The goal of this assignment is to
- Test your understanding of statistical methods and approaches
- Improve your ability to use Excel for manipulation of data (see here for some guides on using Excel)
- Understand the real-world applications and implications of statistics

To complete this assignment you must
- Complete a set of statistical analysis tasks on a unique data set (both tasks and data set will be provided to you)
- Submit a report in word detailing your response to each task (the final answer and reasoning including calculations that led to it)
- Submit an excel document that contains your data set and the calculations you used to complete the tasks

- Use the marking rubric as a guide when working on your assignment.
Instructions: Analysing Household Data
In this assignment you will be analysing and interpreting household data.

Step 1: Prepare Data Set
- You need to download and modify the generic data set to identify the sample you will be working on. More information on the data set is available below.
- Important: You do not need to analyse the entire data set - only draw a random sample of 250 households. This must be done first and the rest of your statistical analysis will be based on this data set.
- You will need to know how to manipulate data in Excel. You can use any other compatible spreadsheet tool (for example Numbers for Mac OS, OpenOffice, LibreOffice etc) but be aware that some of the functions differ slightly.
- There are plenty of resources online and in this LMS that will show you how to perform these tasks in Excel. Use these resources (and Google) to improve your skills as you progress. If you have a particularly difficult challenge, share it in the forum and your tutors or peers will help.

Step 2: Analyse Data & Submit Report

- You will be presented with a set of tasks to run on your unique data set (of 250 samples) - the full list of tasks are listed below.
- As much as possible, complete these tasks within the excel document. You will need to submit your excel document at the end of the assignment.

- You will also submit a brief report in word addressing all the tasks below. When responding to the tasks, please explain the reasoning behind your answer, and refer to your Excel sheet.

Data Set: Household data

The Data Set for this Assignment is available on LMS (Data Set for Assignment 01.xls). This includes information of 2000 households across the following variables.
- Income: Annual Income in AUD,
- ATaxlnc: After tax annual income in AUD
- Grocery: Annual expenditures on groceries in AUD
- Alcohol: Annual expenditures on alcohol in AUD
- Meals: Annual expenditures on meals eaten out in AUD
- Fuel: Annual expenditures on fuel in AUD
- Cloth: Annual expenditures on clothing in AUD
- Phone: Annual expenditures on phone in AUD
- Utilities: Annual expenditures on utilities (Water, Gas, Electricity) in AUD
- Texp: Annual total expenditures in AUD
- Children: Number of children in a household
- Adults: Number of adults in a household
- OwnHouse: This is a categorical variable and takes value 1 if a household owns a house and 0, otherwise.
- GHH: Gender of the Head of Household (M: Male, F: Female)
- Highest Degree: Highest Level of Education, where the Highest Level of Education is;

Task 1

A. Draw a random sample of two hundred (250) households as per the sample selection procedure. What sampling method have you used to select your sample data? In your opinion, is this the best method of sampling particularly when one is interested in characteristics like the gender of the household head, education levels etc., why or why not?

B. Compute the descriptive statistics and draw a Box-Whisker plot of Expenditures on the following variables (all series in one graph!);

(i) Alcohol (ii) Meals (iii) Fuel (iv) Phone

C. Use information from the descriptive statistics and the boxplots in part (B) above to present a summary of your findings by contrasting different features of these distributions.

Task 2
A. Construct a frequency distribution of the expenditures on Utilities, using the following classification (11 classes).
1 2 ... 10 11

Classes 0 - 300 300 - 600 ... 2700 - 3000 More than 3000
B. Using frequency distribution of the utilities above, what is the percentage of households who spend on Utilities
a. at the most $900 per annum
b. between $1500 and $2700 per annum, and
c. more than $3000 per annum.

Task 3
A. Find the top 5% value and the bottom 5% value of household's annual after-tax income (Ataxlnc). What do these two values imply?
B. The series OwnHouse represents whether a household owns a house or not. Let X be a random variable such that X = Number of households who own a house.
(i) Is this a quantitative or a qualitative variable?
(ii) What would be the probability distribution of this random variable if we choose randomly (a) Only 1 household? (b) 250 households? Provide any relevant condition(s) to justify your answer.
C. Draw a scatter plot of natural log of total expenditures against natural log of after-tax income, that is, In(texp) against In(ataxinc) and compute the coefficient of correlation. Express your finding of the relationship between the two variables.

Task 4
A. Construct a contingency table between the gender and the level of education.
B. What is the probability that the head of household is a male and her higher level of education is Intermediate?
C. What is the probability that the head of household is a female and has the Bachelor degree?
D. What is the proportion of having the Secondary as the highest degree from among males?
E. Do you think that the events "gender of household head is female" and "having the Master Degree" are independent?

Verified Expert

Starting from drawing a random sample, different characteristics of the data are explained through descriptive calculations and diagrams. Association between different variables are also examined graphically and distribution of few variables are also compared through box plots.

Reference no: EM131941947

Questions Cloud

What are your steps in securing the scene and evidence : What are your steps in securing the scene and evidence? What steps will you take to correctly process this scene?
Why is effective communication so important in a group : Why is effective communication so important in a group? Why is listening such an important attribute of communication?
Discuss steps in the career development process : Each individual will create a fictional character who may seek career counseling. The fictional character will need to include as follows from syllabus.
Population mean in the null hypothesis : Consider the following test Ho: m = 20 Ha: m ? 20 The population standard deviation is 10. Use a = .05. How large of a sample should be taken
Construct a frequency distribution of the expenditures : Construct a frequency distribution of the expenditures on Utilities, using the classification - What would be the probability distribution of this random
What is the amount of gain recognized on the exchange : Dennis exchanges business equipment with $50,000 adjusted basis for $5000 cash, What is the amount of gain recognized on the exchange
Operations and manufacturing environments : Control charts are monitoring schemes, widely used in operations and manufacturing environments, to determine when a process
What is the amount of forest gain or loss : Eugene Forest sold the following assets: stocks acquired 5 year ago (cost $10,000) for $15,000; What is the amount of Forest sec.1231 gain or loss
Significant difference exists between the true average : We did not find enough evidence to say a significant difference exists between the true average electric bill and $30.20

Reviews

len1941947

4/14/2018 4:38:52 AM

D) Professional presentation and communication (Includes language, structure and where necessary, appendix & references) (10% of total mark) Very clearly structured, with excellent use of language relevant to an academic and/or professional context. All references and support materials supplied. Clearly structured, with good use of language relevant to an academic and/or professional context. Most references and support materials supplied. Mostly structured, with generally clear use of language relevant to an academic and/or professional context, but crucial problems in some areas. Key references and support materials missing Poorly structured, and/or poor use of language relevant to an academic and/or professional context. Key references and support materials missing Poorly structured or unstructured. Poor use of language relevant to an academic and/or professional context. Key references and support materials missing

len1941947

4/14/2018 4:38:44 AM

C) Effectively applying results of statistical inference in a manner that is consistent with context and theory (Includes where necessary - analysing impacts and recommending decisions - understanding of theory and how it applies to specific context) (35% of total mark) Results applied within context, using accurate theory. Best possible outcomes reached, with detailed & well-reasoned conclusions. Key results applied mostly within context and theory. Some results not applied, or some context / theory mis-applied. Some conclusions not completed detailed or reasoned. Some results applied correctly but others missed. Some conclusions well- reasoned, but others missing or poorly reasoned Few results applied within appropriate context and theory. Context and theory largely inaccurate. Little attempt made at reasoning behind conclusions. Little or no results applied within any context, little or no effort at reasoning behind conclusions.

len1941947

4/14/2018 4:38:33 AM

B) Accurately applying appropriate statistical tools to analyse and infer data (Includes where necessary - Reasoning and work that leads to final results - Applying and calculating formulas - performing hypothesis testing - estimating the relationship between variables) (35% of total mark) Appropriate tools used accurately, with correct result. All reasoning provided, with clear linkages. Mostly appropriate tools used and mostly used correctly. Reasoning evident in most critical areas, if incomplete. Some correct tools used and used well – or most correct tools used, but not used well. Key aspects of reasoning missing or poor. Few tools applied and used accurately. Reasoning mostly missing or poor. Appropriate tools not used, and those that are used are not used correctly. No reasoning provided.

len1941947

4/14/2018 4:38:27 AM

Written Report (30 Marks) This report addresses the key points of the assignment. Where possible, you should include the data sets, formulae and diagrams you used to arrive at your conclusions. The report should be written in a tone that is suited to a professional or academic context. Excellent (> 80 %) Very good (70 – 79%) Good (60 – 69%) Fair (50 – 59%) Poor (<50%) MARK A) Collecting, manipulating and preparing data for statistical inference (Includes where necessary, - creating graphs, charts, tables - manipulating data in Excel, using appropriate formulas and functions) (20% of total mark) Complete and accurate. Data organised optimally, appropriate visualisation and organisation tools used. Data complete and accurate. Some manipulation and visualisation can be improved. Data mostly complete and accurate - key aspects done correctly. Data manipulation and visualisation can be improved. Incomplete – key parts are missing or done poorly. Manipulating and visualisation missing or done wrongly. Most of data set missing or done wrongly. Little or no manipulation and visualization.

Write a Review

Basic Statistics Questions & Answers

  Statistics-probability assignment

MATH1550H: Assignment:  Question:  A word is selected at random from the following poem of Persian poet and mathematician Omar Khayyam (1048-1131), translated by English poet Edward Fitzgerald (1808-1883). Find the expected value of the length of th..

  What is the least number

MATH1550H: Assignment:  Question:     what is the least number of applicants that should be interviewed so as to have at least 50% chance of finding one such secretary?

  Determine the value of k

MATH1550H: Assignment:  Question:     Experience shows that X, the number of customers entering a post office during any period of time t, is a random variable the probability mass function of which is of the form

  What is the probability

MATH1550H: Assignment:Questions: (Genetics) What is the probability that at most two of the offspring are aa?

  Binomial distributions

MATH1550H: Assignment:  Questions:  Let’s assume the department of Mathematics of Trent University has 11 faculty members. For i = 0; 1; 2; 3; find pi, the probability that i of them were born on Canada Day using the binomial distributions.

  Caselet on mcdonald’s vs. burger king - waiting time

Caselet on McDonald’s vs. Burger King - Waiting time

  Generate descriptive statistics

Generate descriptive statistics. Create a stem-and-leaf plot of the data and box plot of the data.

  Sampling variability and standard error

Problems on Sampling Variability and Standard Error and Confidence Intervals

  Estimate the population mean

Estimate the population mean

  Conduct a marketing experiment

Conduct a marketing experiment in which students are to taste one of two different brands of soft drink

  Find out the probability

Find out the probability

  Linear programming models

LINEAR PROGRAMMING MODELS

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd