Draw a random sample of two hundred households

Assignment Help Advanced Statistics
Reference no: EM131940568

Statistics for Business and Finance - Analysing Household Data

Have you been part of a national census? Privacy issues aside, a census provides lots of data that can inform government policies and actions - but to be useful, the data needs to be analysed and interpreted.

In this assignment, we will use statistical methods to analyse and interpret real-world demographic data.

The goal of this assignment is to
- Test your understanding of statistical methods and approaches
- Improve your ability to use Excel for manipulation of data (see here for some guides on using Excel)
- Understand the real-world applications and implications of statistics

To complete this assignment you must
- Complete a set of statistical analysis tasks on a unique data set (both tasks and data set will be provided to you)
- Submit a report in word detailing your response to each task (the final answer and reasoning including calculations that led to it)
- Submit an excel document that contains your data set and the calculations you used to complete the tasks

- Use the marking rubric as a guide when working on your assignment.
Instructions: Analysing Household Data
In this assignment you will be analysing and interpreting household data.

Step 1: Prepare Data Set
- You need to download and modify the generic data set to identify the sample you will be working on. More information on the data set is available below.
- Important: You do not need to analyse the entire data set - only draw a random sample of 250 households. This must be done first and the rest of your statistical analysis will be based on this data set.
- You will need to know how to manipulate data in Excel. You can use any other compatible spreadsheet tool (for example Numbers for Mac OS, OpenOffice, LibreOffice etc) but be aware that some of the functions differ slightly.
- There are plenty of resources online and in this LMS that will show you how to perform these tasks in Excel. Use these resources (and Google) to improve your skills as you progress. If you have a particularly difficult challenge, share it in the forum and your tutors or peers will help.

Step 2: Analyse Data & Submit Report

- You will be presented with a set of tasks to run on your unique data set (of 250 samples) - the full list of tasks are listed below.
- As much as possible, complete these tasks within the excel document. You will need to submit your excel document at the end of the assignment.

- You will also submit a brief report in word addressing all the tasks below. When responding to the tasks, please explain the reasoning behind your answer, and refer to your Excel sheet.

Data Set: Household data

The Data Set for this Assignment is available on LMS (Data Set for Assignment 01.xls). This includes information of 2000 households across the following variables.
- Income: Annual Income in AUD,
- ATaxlnc: After tax annual income in AUD
- Grocery: Annual expenditures on groceries in AUD
- Alcohol: Annual expenditures on alcohol in AUD
- Meals: Annual expenditures on meals eaten out in AUD
- Fuel: Annual expenditures on fuel in AUD
- Cloth: Annual expenditures on clothing in AUD
- Phone: Annual expenditures on phone in AUD
- Utilities: Annual expenditures on utilities (Water, Gas, Electricity) in AUD
- Texp: Annual total expenditures in AUD
- Children: Number of children in a household
- Adults: Number of adults in a household
- OwnHouse: This is a categorical variable and takes value 1 if a household owns a house and 0, otherwise.
- GHH: Gender of the Head of Household (M: Male, F: Female)
- Highest Degree: Highest Level of Education, where the Highest Level of Education is;

Task 1

A. Draw a random sample of two hundred (250) households as per the sample selection procedure. What sampling method have you used to select your sample data? In your opinion, is this the best method of sampling particularly when one is interested in characteristics like the gender of the household head, education levels etc., why or why not?

B. Compute the descriptive statistics and draw a Box-Whisker plot of Expenditures on the following variables (all series in one graph!);

(i) Alcohol (ii) Meals (iii) Fuel (iv) Phone

C. Use information from the descriptive statistics and the boxplots in part (B) above to present a summary of your findings by contrasting different features of these distributions.

Task 2
A. Construct a frequency distribution of the expenditures on Utilities, using the following classification (11 classes).
1 2 ... 10 11

Classes 0 - 300 300 - 600 ... 2700 - 3000 More than 3000
B. Using frequency distribution of the utilities above, what is the percentage of households who spend on Utilities
a. at the most $900 per annum
b. between $1500 and $2700 per annum, and
c. more than $3000 per annum.

Task 3
A. Find the top 5% value and the bottom 5% value of household's annual after-tax income (Ataxlnc). What do these two values imply?
B. The series OwnHouse represents whether a household owns a house or not. Let X be a random variable such that X = Number of households who own a house.
(i) Is this a quantitative or a qualitative variable?
(ii) What would be the probability distribution of this random variable if we choose randomly (a) Only 1 household? (b) 250 households? Provide any relevant condition(s) to justify your answer.
C. Draw a scatter plot of natural log of total expenditures against natural log of after-tax income, that is, In(texp) against In(ataxinc) and compute the coefficient of correlation. Express your finding of the relationship between the two variables.

Task 4
A. Construct a contingency table between the gender and the level of education.
B. What is the probability that the head of household is a male and her higher level of education is Intermediate?
C. What is the probability that the head of household is a female and has the Bachelor degree?
D. What is the proportion of having the Secondary as the highest degree from among males?
E. Do you think that the events "gender of household head is female" and "having the Master Degree" are independent?

Verified Expert

This task provides a brief description on household data. the descriptive statistics was performed for continuous variables and the frequency distribution was performed for categorical variable. Probability distribution such as normal probability distribution was used to determine the probability values of the requested data

Reference no: EM131940568

Questions Cloud

Chromosome for sex determination : In butterflies, sex is determined by the ZW sex-determination system. Female butterflies are heterogametic and have both a Z sex
How much will rebecca pay in finance charges : Rebecca wants to borrow $5,000 for 4 years. If the lender charges her 7% simple interest, how much will Rebecca pay in finance charges?
The perspective of business information systems : A brief narrative of how an IS/IT is realized, initiated, designed, and implemented in terms of what/when/where/how this happened.
Fashion in the united states : Do you think we could produce and use methane in a similar fashion in the United States? Explain.
Draw a random sample of two hundred households : BUS5SBF - Statistics for Business and Finance - Draw a random sample of two hundred (250) households as per the sample selection procedure
Baseballs to number of illnesses in faces diseases : Healthcare workers have the potential baseballs to any number of illnesses in their faces diseases provide samples of a show you can take it three different
Calculate the money multiplier and aggregate money supply : calculate the aggregate money supply. calculate the money multiplier.
Which section of the analysis would be toughest : If you were assigned to a committee performing a SWOT analysis, which section of the analysis do you think would be the toughest to investigate? Why?
What is the operating cash flow of the project : What is the aftertax salvage value of the fixed asset? What is the operating cash flow of the project?

Reviews

len1940568

4/13/2018 4:57:15 AM

D) Professional presentation and communication (Includes language, structure and where necessary, appendix & references) (10% of total mark) Very clearly structured, with excellent use of language relevant to an academic and/or professional context. Clearly structured, with good use of language relevant to an academic and/or professional context. Mostly structured, with generally clear use of language relevant to an academic and/or professional context, but crucial problems in some areas. Poorly structured, and/or poor use of language relevant to an academic and/or professional context. Poorly structured or unstructured. Poor use of language relevant to an academic and/or professional context. All references and support materials supplied. Most references and support materials supplied. Key references and support materials missing Key references and support materials missing Key references and support materials missing

len1940568

4/13/2018 4:57:08 AM

B) Accurately applying appropriate statistical tools to analyse and infer data (Includes where necessary - Reasoning and work that leads to final results - Applying and calculating formulas - performing hypothesis testing - estimating the relationship between variables) Appropriate tools used accurately, with correct result. All reasoning provided, with clear linkages. Mostly appropriate tools used and mostly used correctly. Reasoning evident in most critical areas, if incomplete. Some correct tools used and used well – or most correct tools used, but not used well. Key aspects of reasoning missing or poor. Few tools applied and used accurately. Reasoning mostly missing or poor. Appropriate tools not used, and those that are used are not used correctly. No reasoning provided. (35% of total mark) C) Effectively applying results of statistical inference in a manner that is consistent with context and theory (Includes where necessary - analysing impacts and recommending decisions - understanding of theory and how it applies to specific context)

len1940568

4/13/2018 4:56:42 AM

Report (30 Marks) This report addresses the key points of the assignment. Where possible, you should include the data sets, formulae and diagrams you used to arrive at your conclusions. The report should be written in a tone that is suited to a professional or academic context. Excellent (> 80 %) Very good (70 – 79%) Good (60 – 69%) Fair (50 – 59%) Poor (<50%) MARK A) Collecting, manipulating and preparing data for statistical inference (Includes where necessary, - creating graphs, charts, tables - manipulating data in Excel, using appropriate formulas and functions) Complete and accurate. Data organised optimally, appropriate visualisation and organisation tools used. Data complete and accurate. Some manipulation and visualisation can be improved. Data mostly complete and accurate - key aspects done correctly. Data manipulation and visualisation can be improved. Incomplete – key parts are missing or done poorly. Manipulating and visualisation missing or done wrongly. Most of data set missing or done wrongly. Little or no manipulation and visualization.

len1940568

4/13/2018 4:56:16 AM

• This is an individual Assignment worth 20%. Each student will use a unique data set to complete the assignment. • You should use Excel for all your computational work. • Please provide detailed calculations for all the tasks. Not explaining how you arrived at your conclusions may result in only partial marks being awarded for those tasks. • Present your computer results in appropriate tabular form. Also, include graphs where required to support your answers. • Please do not copy your data set or assignment questions on the word files you are submitting since this will trigger plagiarism detection and might affect your submission. Submit the data set you use as a separate excel document.

Write a Review

Advanced Statistics Questions & Answers

  Find expected number of transition between visits to state i

Find the expected number of transitions between visits to any given state i. Argue that, starting from any state i, an eventual return to state ioccurs with probability 1.

  Best statistic used to compare the volatility in WEF

Statistics for Business and Finance - Do you think that is the best method of sampling? and what is the best statistic used to compare the volatility in WEF, WI, and FS values? Why?

  Show that the pair of variables is statistically independent

Find Pr{Xn+1 = i, Dn+1 = j | Dn} and show that the pair of variables (Xn+1, Dn+1) is statistically independent of Dn. What do your results mean relative to Burke's theorem.

  What is the numerical value of the t-statistic

PSY 5013 - Write the squared partial correlation between Y and x3 controlling for x1 and x2 as a function of squared multiple correlations only and Write out the squared multiple correlation between Y and x2 and x3 in terms of a sum of squared sim..

  Basic accounting principles

You have been nominated by your institution for a seminar because of your proficiency in basic accounting concepts. The participants and audience include college professors, practicing CPAs, and fellow students.

  Critical analysis of scholarly article

Scholarly writers are aware of their audience and base their writing on solid evidence rather than on assumptions and/or opinions. In addition, scholarly writers must also utilize a scholarly voice.

  Determining wacc-cost of equity

Ortiz Motors has a target capital structure of 40% debt and 60% equity. The yield to maturity on the company's outstanding bonds is 9%, and the company's tax rate is 40%. The CFO has calculated the company's WACC as 9.96%.

  Find the steady-state probabilities for the embedded chainf

Find the steady-state probabilities {πi; i ≥ 0} for the embedded chain. Assume that the transition rate νi out of state i, for i ≥ 0, is given by νi = 2i.

  Find the time average of the given quantity

Sketch the lower bound E [N(t)] /t ≥ 1/E [X] - 1/t on the same graph with (c). Sketch E rSN(t)+1 - tl as a function of t and find the time average of this quantity.

  Problem 11 there is a formula for sample size n with given

problem 11. there is a formula for sample size n with given margin of error m and condence level c for population

  Problems on advanced computer networks

Identify and explain the events that can change the state of the system also determine the percent of time that this storage space will be adequate to accommodate newly arrived jobs-CS524 Advanced Computer Networks

  What are the labor hours productivity

What is the average number of customers waiting in line to purchase a ticket and what is the probability that there are at least two others waiting in line to buy a ticket?

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd