Determine the inter-quartile range

Assignment Help Basic Statistics
Reference no: EM131994541

Assignment

Unit Learning Outcomes

a. Assess situations in which statistical analysis may be useful.

b. Apply a range of statistical measures and techniques that may be useful in analysing various situations.

c. Solve statistical problems using analytical methods manually and interpret the results.

d . Solve and interpret appropriate output from statistical computer packages.

e. Demonstrate the ability to apply statistical tools and techniques to solve problems to improve business decision-making process.

Assignment Description

The assignment has a total of six tasks to complete.

For the assignment you need to create your own data set using the data provided as LGAData.xls excel file.

Presentation

Your answers must be presented in task number order and be clearly labelled with the appropriate task number. Your assignment must be presented in Microsoft (MS) Word. Copy and paste any relevant Excel outputs to this document immediately before (above) any relevant written answers to each task. If you are unfamiliar with the use of the MS Word Equations Editor, you may write algebraic/ mathematical or statistical symbols and notation in neat handwritten form.

Your answers must be clear. You must highlight relevant items on any required Excel outputs and make reference to them in your written answers. When asked to perform a manual calculation (i.e. the use of MS Excel is not specified) you must show all working. This must include intermediate steps where relevant. Failure to do so will result in a loss of marks. Do not include the assignment questions nor the data with your submitted assignment.

Introduction

The Assignment Data (LGAData.xls) file contains, LGA data (sourced from ABS) for a population of 400 LGA in Australia. You are required to select a random sample of 50 LGA from this population. Variables in the data set are as below:

PN

ID number

ID

V1

LGA name

Name

V2

Gender

Male=1; Female=0

V3

Age Group

30-34 =1; 35-39=2; 39-44 =3; 45-50=4

V4

Age1

number of people 30-34 years old

V5

Age2

number of people 35-39 years old

V6

Age3

number of people 39-44 years old

V7

Age4

number of people 45-50 years old

V8

Income category

$650-$800=0; $801-$850=1

V9

Income1

number of people with $650-$800 weekly income

V10

Income2

number of people with $801-$850 weekly income

V11

Occupation type

Managers=1; Professionals=2;Sales workers=3;Administration=4

V12

Occupation1

number of Managers

V13

Occupation2

number of Professionals

V14

Occupation3

number of Sales workers

V15

Occupation4

number of Administration

V16

Industry of occupation

Mining=1; Agriculture=2;Manufacturing=3;Education=4

V17

Industry1

number of people in Mining

V18

Industry2

number of people in Agriculture

V19

Industry3

number of people in Manufacturing

V20

Industry4

number of people in Education

V21

Transport

0=car; 1=public transport

V22

Living with children

0=with; 1=without

Column A (PN), contains the LGA identification numbers from 001 to 400 LGA.

Selecting Random Sample and Creating your Sample Data File

To select your random sample, you need to open the PopulationLGAData.xls file and create a SampleLGAData Excel file of 50 samples (rows) which will include 50 LGA. To select these samples randomly one member of the group starts with 2 last digits of their student ID and adds 50 to it to create the data set. For example a student with ID of MIT172256 will start with row 56 to add 50 rows to it which will include rows 56-106. Then the data set for this group will include LGA 56-106.

You will need to submit a SOFT copy of your sample LGAdata. You will also need to report what student ID was used for random data selection. This sample data set will form the basis of the statistical presentation and analysis tasks of the assignment.

Assignment requirements
Answers to the assignment tasks must be based on the sample data file that you have created. All tasks in this assignment require you to obtain an Excel output prior to performing some analysis. Copy and Paste these outputs to your assignment MS- Word document immediately preceding any subsequent analysis. Charts and tables must have appropriate titles and numerical values must be rounded to an appropriate number of decimal places and accompanied by the correct units of measure.

Task 1
Use Excel to produce a Frequency Column Chart and; a Relative Frequency Pie-Chart for your sample to show the number and proportion, respectively, of people under each age group. Use these graphical summaries to answer the following questions:

(a) How many LGA in your sample indicate use of public transport?

(b) Which occupation type occurs most frequently in your sample?

(c) What proportion of LGA in your sample consists of people within 30-34 years old age group?

Task 2

(a) Use Excel to sort your sample "occupation 4" data. Use the percentile location formula;

Lp = (n +1).(P/100), and the three associated rules to determine:

(i) The 70th percentile.
(ii) The first and third quartiles.

Remember to show all working!
(b) Briefly explain what the 70th percentile that you have determined informs you about your sample "occupation 4" data.
(c) Determine the Inter-Quartile Range of your sample "occupation 4" data and provide a brief explanation of what information this statistic provides about your sample data.

Task 3
(a) Use Excel to produce a Descriptive Statistics table for your sample "occupation 4" data.
(b) Use results from Task 3 to determine manually for this data, the upper and lower inner fence limits;

IFUL = Q3 + 1.5 x IQR

IFLL = Q1 - 1.5 x IQR

(c) Based on the limits calculated in (b), choose from the numerical summary measures provided in the Descriptive Statistics table, and/or measures calculated previously in Task 4;

(i) an appropriate measure of central tendency, and,
(ii) an appropriate measure of dispersion for your sample "occupation 4"data.

Remember to show all working!

Provide a brief explanation of the reasoning behind your choice in both cases.

(d) Write a brief report on the "occupation 4" data paying particular attention, on the mean, median, quartiles and measures of variation.

Task 4

Remember to show all working! Failure to do so will result in the loss of marks.

(a) From the Descriptive Statistics table obtained in Task 4, identity three pieces of evidence that indicate whether your sample "occupation 4" data has been obtained from a normally distributed population or not. What is your conclusion?

Note: Make sure only one piece of evidence relates to the shape of the sample data.

(b) Regardless of your conclusion in above, assume the "occupation 4" data is normally distributed. Applying the Standard Normal tables, calculate how many "occupation 4" observations in your sample would expect to lie within 1.5 standard deviations of the mean (i.e. between z = -1.5 and z = +1.5).

(c) Use the mean and standard deviation from the Descriptive Statistics table of Task 4 to calculate the bound for 1.5 standard deviation spread from the mean. Using the "occupation 4" sample data, manually count the number of observations fall within the bound. State whether this count matches, approximately, your answer to (b) and hence whether this result confirms (or not) your conclusion in (a).

Task 5

(a) Use Excel to produce a Descriptive Statistics table for the "Occupation 4" variable in your sample suitable for constructing an interval estimate of the population mean "Occupation 4"
Hence determine:
(i) A point estimate of the mean "Occupation 4" of the population.
(ii) A 90% confidence interval estimate of the mean "Occupation 4" of the population.
(iii) Make a brief verbal statement explaining the meaning of the confidence interval estimate obtained in (ii) in the context of the variable in this task.
(b) If the population mean "Occupation 4"is actually 59, would you consider the interval estimate obtained in (a), to be satisfactory? Explain why or why not.

Task 6

(a) Use Excel to produce a Descriptive Statistics table for the $650-$800 income earners in your sample suitable for constructing an interval estimate of the population proportion of income earners. Hence determine:

(i) A point estimate of the proportion of $650-$800 income earners in the population.
(ii) A 99% confidence interval estimate of the $650-$800 income earners.

(b) Using the following formula:
(sample statistic) ± (critical z or t) x (standard error of the sample statistic)
Use the Empirical Rule for a Normal distribution to determine a 95% confidence interval estimate of the
$650-$800 income earners in the population.

(c) Compare, in terms of the precision, the interval manually calculated in (b) with the interval obtained from the Descriptive Statistics table in (a). Explain why the direction of the change in precision is expected.

Attachment:- Data Variables.rar

Reference no: EM131994541

Questions Cloud

How much money does the firm expect to collect in the month : The remaining 2 percent of sales is never collected. How much money does the firm expect to collect in the month of July?
Provide a mission statement and vision statement : You are a project manager with a national nonprofit health organization specializing in a single class of disease.
What is the interest rate on this offer : A store is offering a diamond ring for sale for 60 months at $100 per month. What is the interest rate on this offer?
Explain how a leader could use this tool in guiding : Explain how a leader could use this tool in guiding an organizational change. Review several of your peers' posts.
Determine the inter-quartile range : BB108 Business Statistics - Contribution and Participation - Identity pieces of evidence that indicate whether your sample "occupation 4" data has been
Explain the impact you think the transition from steve jobs : Explain the impact you think the transition from Steve Jobs to Tim Cook has had on Apple's primary stakeholder groups: customers, employees, and investors.
Discuss national planning frameworks core capabilities : Discuss how the National Planning Frameworks core capabilities and guiding principles applied to the Hurricane Matthew response.
Type i and ii decision error costs : The HR department is trying to fill a vacant position for a job with a small talent pool. Valid applications arrive every week or so, and the applicants
Discuss the effects that these changes would have : In 3 - 5 pages, explain which change model you would follow for the short-term change and which you would follow for the long-term change.

Reviews

len1994541

5/24/2018 3:03:30 AM

As this is a graduate Bachelor of Business unit, regular and active participation in lecture and tutorial discussions is expected from every student each week. Participation in this unit will be compiled and noted for each week. Please contact your Lecturer/Tutor as soon as possible if you are unable to participate in class discussions for any length of time (e.g., more than two "working" days in a row, weekends not included). Students are expected to ask and answer questions in all discussions. Points for class participation and completion of other class assignments will be recorded. Attendance at 10 lectures and tutorials as a minimum per student, per trimester is expected. Students are expected to participate throughout the entire trimester, including during the training session presentations made by other groups.

len1994541

5/24/2018 3:03:19 AM

Class Participation and Attendance During the trimester students will discuss various Statistics issues and cases, which are noted in the prescribed textbook. Active student participation in the lectures and tutorials is essential to the learning process. All students are expected to be prepared for and contribute to the lecture and tutorial discussions. To assist in your understanding of Statistics issues, you are encouraged to contribute to in-class discussions. Your participation and interaction with other students will be a major component of the class instruction. Part of your class participation will also be based upon your ability to recognise contemporary statistical issues.

len1994541

5/24/2018 3:01:44 AM

Presentation of the Assignment ? layout ? formatting ? spelling/grammar ? proof read ? professional format Comment: 5% References & Bibliography ? appropriate use of in-text and end text referencing ? appropriate number of references used ? quality of information sources and appendices Comment: 5% Total 100% Overall Comments: Assessor Name: Assessor Signature: = Marks Date:

len1994541

5/24/2018 3:01:33 AM

Criteria Possible Marks Marks Allocated Introduction to the assignment. ? Brief introduction to the problem Comment: 10% Clarity and depth of Responses to tasks which includes: ? Task1: Data visualisation ? Task2: Data location ? Task3: Descriptive statistics, central location and dispersion ? Task4: Distribution type, z-standard, variation ? Task5: Interval estimate of the population mean ? Task6: Interval estimate of the population proportion Comment: 80%

len1994541

5/24/2018 3:01:22 AM

Submission Guidelines • All work must be submitted on Moodle by the due date along with a completed Assignment Cover Page. • The assignment must be in MS Word format, 1.5 spacing, 11-pt Calibri (Body) font and 2 cm margins on all four sides of your page with appropriate section headings. • Reference sources must be cited in the text of the report, and listed appropriately at the end in a reference list using APA or IEEE referencing style for School of Business and School of Information Technology and Engineering respectively.

Write a Review

Basic Statistics Questions & Answers

  Analyze the data to make conclusions

Analyze the data to make conclusions. Perform test(s), interaction plot and post hoc multiple comparisons if needed.

  The corrosive effects of various soils on coated and

the corrosive effects of various soils on coated and uncoated steel pipe was tested by using a dependent sampling plan.

  Clarification of standard deviation

According to the data from the College Entrance Examination Board, scores on the SAT-I test have a mean of 1017, and Q1 is 880.

  State which is which and how you know

One of the histograms is a histogram of a sample (from a population with a skewed distribution) one is the distribution of many means.

  Examine the stem and leaf plot below find the original

examine the stem and leaf plot below. find the original data from the stem and leaf

  Structural equations modelling

Please total 1500 words or five double-spaced typed pages for all the answers of each questions and 3 references. STRUCTURAL EQUATIONS MODELLING (SEM) Model and Article Critique Assessment

  Combine an integrated understanding about education

The main objective of the study is to combine an integrated understanding about education in four different areas such as Age, Sex, Marital Status, and Parental Education.

  Information about statistics-hypothesis testing

The average monthly cot for a studio apartment in NYC is reported to be $1,000. You want to prove that average monthly rent is not what is stated.

  Substantiate or rebuff your friend claim

What statistical analysis would you use to substantiate or rebuff your friend's claim? Do you need additional data? Why or why not?

  What is the cutoff for high leverage in the given scenario

What is the cutoff for high leverage in this scenario? Using the rule-of-thumb for leverages presented in class, state whether this point has high leverage?

  Define the margin of error for the entire sample

Are based on a report of a Field poll printed in the Sacramento Bee on July 7, 2000 (Dan Smith, p. A4), with the headline, "Ratings Up for Boxer, Feinstein.

  1there are two statistics classes the first has 250

1.there are two statistics classes. the first has 250 students and the second has 200 students. in the first class the

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd