Compute the descriptive statistics

Assignment Help Basic Statistics
Reference no: EM131764184

Assignment 1

Instructions flickr photo shared by PaulSh under a Creative Commons ( BY-NC-ND ) license Analysing Australian Household Data Statistics for Business and Finance (BUS5SBF) Have you been part of a national census?

Privacy issues aside, a census provides lots of data that can inform a government policies and actions - but to be useful, the data needs to be analysed and interpreted.

In this assignment, we will use statistical methods to analyse and interpret real world demographic data. The goal of this assignment is to Test your understanding of statistical methods and approaches

Improve your ability to use Excel for manipulation of data Understand the real-world applications and implications of statistics To complete this assignment you must

Complete a set of statistical analysis tasks on a unique data set (both tasks and data set will be provided to you)

Please provide detailed calculations for all the tasks. Not explaining how you arrived at your conclusions may result in only partial marks being awarded for those tasks. Present your computer results in appropriate tabular form. Also include graphs where required to support your answers.

Don't do: Please do not copy your data set or assignment questions on the word files you are submitting since this will trigger plagiarism detection and might affect your submission. Submit the data set you use as a separate excel document. Copying someone else's work would result in plagiarism.

Please read LTU's policy on plagiarism stated in the Subject Learning Guide for this subject. Do not submit zip folders containing your work. Do: Submit the data set on excel file with your detailed working in the same file. You can use separate sheets in the same Excel file for each task with properly labelled parts. Submit your detailed report in word or pdf format only. Use the marking rubric as a guide when working on your assignment Instructions: Analysing Australian Household Data In this assignment you will be analysing and interpreting real data of two thousands (2000) Australian households. This data has been taken from 2005 HILDA (Household, Income and Labour Dynamics in Australia) Survey. We will treat these 2000 Households as population and would perform a statistical analysis by drawing a sample (which, indeed,would be a subsample!) of 200 households.

Step 1: Prepare Data Set You need to download and modify the generic data set to identify the sample you will be working on. More information on the data set is available below. Important: You do not need to analyse the entire data set - only draw a random sample of 200 households. This must be done first and the rest of your statistical analysis will be based on this data set. You will need to know how to manipulate data in Excel. You can use any other compatible spreadsheet tool (for example Numbers for Mac OS, OpenOffice, LibreOffice etc) but be aware that some of the functions differ slightly. There are plenty of resources online and in this LMS that will show you how to perform these tasks in Excel. Use these resources (and Google) to improve your skills as you progress. If you have a particularly difficult challenge, share it in the forum and your tutors or peers will help.

Step 2: Analyse Data & Submit Report You will be presented with a set of tasks to run on your unique data set (of 200 samples) - the full list of tasks are listed below. As much as possible, complete these tasks within the excel document. You will need to submit your excel document at the end of the assignment. You will also submit a brief report in word addressing all the tasks below. When responding to the tasks, please explain the reasoning behind your answer, and refer to your Excel sheet.

Data Set: Household data The Data Set for this Assignment is available on LMS (Data Set for Assignment 01.xls).

This includes information of 2000 households across the following variables.

Income: Annual Income in AUD, ATaxInc: After tax annual income in AUD Grocery: Annual expenditures on groceries in AUD Alcohol: Annual expenditures on alcohol in AUD Meals: Annual expenditures on meals eaten out in AUD Fuel: Annual expenditures on fuel in AUD Cloth: Annual expenditures on clothing in AUD Phone: Annual expenditures on phone in AUD Utilities: Annual expenditures on utilities (Water, Gas, Electricity) in AUD Texp: Annual total expenditures in AUD Children: Number of children in a household Adults: Number of adults in a household OwnHouse: This is a categorical variable and takes value 1 if a household owns a house and 0, otherwise. GHH: Gender of the Head of Household (M: Male, F: Female) Highest Degree: Highest Level of Education, where the Highest Level of Education is; P: Primary S: Secondary I: Intermediate B: Bachelor M: Master Tasks for Analysis of Data Set Complete the following tasks based on the unique data set you generated. These questions should be answered in a Word document, with brief reasoning to justify your answers.

Your answers and reasoning should correlate with the tables and graphs from the excel sheet.

Task 1

A. Draw a random sample of two hundred (200) households as per the sample selection procedure. What sampling method have you used to select your sample data? In your opinion, is this the best method of sampling, why or why not?

B. Compute the descriptive statistics and draw a Box-Whisker plot of Expenditures on the following variables (all series in one graph!); (i) Alcohol (ii) Meals (iii) Fuel (iv) Phone

C. Also, use an appropriate measure of variation to compare the variability in expenditures on these four variables. Explain, why is this an appropriate measure.

D. Present a summary of your findings about the shape and spread of the distribution of these variables using information from the boxplots and the descriptive statistics.

Task 2

A. Construct a frequency distribution of the expenditures on Utilities, using the following classification (9 classes); 1 2 ... 8 9 Classes 0 - 400 400 - 800 ... 2800 - 3200 More than 3200

B. What is the percentage of households that spend on Utilities at the most $ 1200 per annum between $1200 and $2400 per annum, and more than $2400 per annum.

C. Draw the histogram of the expenditures on Utilities by households in your sample. Do you think the utility expenditures are normally distributed? Provide the "statistical reason" for your answer?

Task 3

A. What is the top 10% value and the bottom 10 % value of household's annual after tax income (AtaxInc)? What does these two values imply?

B. What does the mean (average) of variable OwnHouse imply?

C. What is the probability that a randomly selected household will have a family size (FS= Adults + Children) equal to 5?

D. Draw a scatter plot of natural log of total expenditures against natural log of after tax income, that is, ln(texp) against ln(ataxinc) and compute the coefficient of correlation. Express your finding about the relationship between the two variables.

Task 4

A. Construct a contingency table between the gender and the level of education. Using information in this table, can we say that male and female heads of the households differ in their higher level of qualification?

B. What is the probability that the head of household is a female and her higher level of education is Intermediate?

C. What is the probability that the head of household is a male and has the Bachelor degree?

D. What is the proportion of having the Secondary as the highest degree from among females?

E. Do you think that the events "gender of household head is male" and "having the Master Degree" are independent?

Reference no: EM131764184

Questions Cloud

Identify at least three advantages of renting and leasing : Identify at least three advantages of renting/leasing an asset instead of purchasing
Discuss about the thermostat manufacturer claim : During the winter months, outside temperatures at the Starneses' cabin in Colorado can stay well below freezing (32°F, or 0°C) for weeks at a time.
Explain the law that affects posting rules and regulations : Explain the criteria a landlord must use when determining how many total people can reside in a unit?
Define german activities and makes no dividend distributions : GreenCo earns $1,500,000 in net profits from its German activities and makes no dividend distributions to BlueCo
Compute the descriptive statistics : Compute the descriptive statistics and draw a Box-Whisker plot of Expenditures - Construct a frequency distribution of the expenditures
Describe the internal control as preventive : Internal controls to address it. Describe the internal control as Preventive, Detective or Corrective.Organize your answer in a Risk/Control Matrix
Research Paper - Comparative advantage : Assignment - Topics for Research Paper - Comparative advantage: Explain the reasons, 2 to 3 countries and 1 to 2 industries and Some facts and figures
Which strategy do you think would be a good fit for you : There are many strategies that an investor can use to build their investment portfolio by investing in properties.
Measure the blood cholesterol levels of an srs : A sample of teens A study of the health of teenagers plans to measure the blood cholesterol levels of an SRS of 13- to 16-year-olds.

Reviews

len1764184

12/13/2017 6:40:35 AM

Marks Distribution (Total Marks = 30) Marks Distribution Task 1 3 + 3+ 1+ 2 = 9 Task 2 2 + 1.5 1.5 = 5 Task 3 2 + 1 + 1 + 2 = 6 Task 4 3 + 1 + 1 + 1 + 1 = 7 Report Organisation Proper Numbering of the tasks and questions, tables and graphs with captions, legends, and properly cited references (if any). 3

Write a Review

Basic Statistics Questions & Answers

  Statistics-probability assignment

MATH1550H: Assignment:  Question:  A word is selected at random from the following poem of Persian poet and mathematician Omar Khayyam (1048-1131), translated by English poet Edward Fitzgerald (1808-1883). Find the expected value of the length of th..

  What is the least number

MATH1550H: Assignment:  Question:     what is the least number of applicants that should be interviewed so as to have at least 50% chance of finding one such secretary?

  Determine the value of k

MATH1550H: Assignment:  Question:     Experience shows that X, the number of customers entering a post office during any period of time t, is a random variable the probability mass function of which is of the form

  What is the probability

MATH1550H: Assignment:Questions: (Genetics) What is the probability that at most two of the offspring are aa?

  Binomial distributions

MATH1550H: Assignment:  Questions:  Let’s assume the department of Mathematics of Trent University has 11 faculty members. For i = 0; 1; 2; 3; find pi, the probability that i of them were born on Canada Day using the binomial distributions.

  Caselet on mcdonald’s vs. burger king - waiting time

Caselet on McDonald’s vs. Burger King - Waiting time

  Generate descriptive statistics

Generate descriptive statistics. Create a stem-and-leaf plot of the data and box plot of the data.

  Sampling variability and standard error

Problems on Sampling Variability and Standard Error and Confidence Intervals

  Estimate the population mean

Estimate the population mean

  Conduct a marketing experiment

Conduct a marketing experiment in which students are to taste one of two different brands of soft drink

  Find out the probability

Find out the probability

  Linear programming models

LINEAR PROGRAMMING MODELS

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd