Reference no: EM131063446
Instructions for the descriptive statistics section of the computing assignment, You must do this section first because the descriptive statistics will help you understand the assignment, When you write up the assignment the descriptive statistics section will go in the middle of the assignment, however you really need to do it first.
You need to Open up the assignment data set (the file Computing assignment data use the student number and the filter to find your own sample)
Instructions for the computing assignment,
- When writing the assignment just use the existing headings for each section answer each section as best you can. Do not worry about a word limit or changing the format.
- You need to do section 3 first this will help you understand the rest of the assignment
- Submit the assignment to turnitin, the answers to all of the sections should go in a single document and you submit that to turnitinyou will be given more details about this later.
- you do not need to use references
- before doing the hypothesis testing section (section 6) you need should do the assignment preparation quizzes on moodle
Section 0: Cover sheet
You do need a cover sheet with the following
Student number, (this is very important it lets the marker check you have used numbers based on your student
Name
Subject: bus105 business statistics
Assignment: Computing Assignment
Section 1: Goals and format of the assignment
Paraphrase the following, there will be a similar discussions in the lectures and tutorials you can paraphrase that instead
"This assignment is supposed to help students understand the following concepts.
* You can ask a sample of people simple questions and give a numerical summary of the results
*The summary is not reliable, different samples would give different answers , This lack of reliability can be well explained by theoretical distributions such as the z distribution.
*You can use Hypothesis testing to use sample to answer a questions about the population
*The first sections of the assignment involve explaining and summarizing a large data set of simple data and interpreting the results the later sections discuss Hypothesis testing, the issues of collecting and working with large sets of data, and the fact it is very hard to fully understand the applications of the z distribution."
Section 2: Description of the data set
a) Expand upon and paraphrase the following, definitely describe each of the variables,
For each variable answer the question is it categorical or numerical?
"The data set is the survey responses of 100,000 people every student has their own sample of 100 people , Each person is shown one of 3 possible endings for a tv show, ending 1, ending 2 or ending 3 and they are asked two questions,
question 1: Do they like the movie?
Question 2: How much would they pay for the DVD?"
b) Also do one or both of the following
*Give some more questions that could be asked
*Criticise the existing questions
Section 3: Summary of the data set
You must do this section first because the descriptive statistics will help you understand the assignment, When you write up the assignment the descriptive statistics section will go in the middle of the assignment, however you really need to do it first.
You need to Open up the assignment data set (the file computer assignment with filters)
You must only use your sample of 100 numbers use the filter given in the excel sheet to find your own 100 numbers (this should become clear if you actually open the excel file
Read instructions given in the excel file and answer the following questions.
Answer the following questions (refer to the excel file for more instructions) you must get excel to give you your data set, your numbers will be different to the other students.
Question 1
For the variable Do they like the tv show
a) find
p ^1 =phat1=proportion of people that liked the tv show with ending 1
p ^2 =phat2=proportion of people that liked the tv show with ending 2
p ^3 =phat3=proportion of people that liked the tv show with ending 3
b) find
phat1-phat2
phat2-phat3
c) Give a chart that compares the proportion of the 3 different tv show endings
Question 2
a) find
x ¯1=xbar1=the average amount people would pay for the dvd of the tv showwith ending 1
x ¯2 =xbar2=the average amount people would pay for the dvd of the tv show with ending 2
x ¯3=xbar3=the average amount people would pay for the dvd of the tv show with ending 3
b) find
xbar1-xbar2
xbar2-xbar3
Question 3
needs you to give complete descriptive statistics and histograms for the variable how much you would pay for each of the 3 different endings so you need to split the dataset into 3 different endings
(you can just go to the assignment data set , click the data tab)
click the filter for which ending.
and pick the ending you want examine, so pick ending 1 and answer question a and b below
and repeat the process for the other endings.
a) for ending 1 give a histogram of the variable how much they would pay
for ending 2 give a histogram of the variable how much they would pay
for ending 3 give a histogram of the variable how much they would pay
you can use the webpage
https://www.wessa.net/rwasp_varia1.wasp
(you do not need to give the qq plot just give the histogram)
b) for ending 1
find the average, standard deviation, min, q1, median ,q3, max
for the variable how much would you pay?
you can use the webpage
https://www.calculatorsoup.com/calculators/statistics/descriptivestatistics.php
for ending 2
find the average, standard deviation, min, q1, median ,q3, max
for the variable how much would you pay?
you can use the webpage
https://www.calculatorsoup.com/calculators/statistics/descriptivestatistics.php
for ending 3
find the average, standard deviation, min, q1, median ,q3, max
for the variable how much would you pay?
you can use the webpage
https://www.calculatorsoup.com/calculators/statistics/descriptivestatistics.php
Section 4:Discuss the results of section 3
ComCompare and contrast the different tv show endings using the results in section 3
For the variable "do they like the tv show" you only compare the proportions,
However for the "how much would they pay" you should start by discussing the distribution of the variable for each ending by looking at the histogram and also discuss if the data is skewed or not skewed before making comparisons.
(If the mean is close to the median the data is not skewed if the mean is different to the median the data is skewed)
When commenting on the Histogram you can mention if there is one group 2 groups or 3 groups, also mention the values that are common, you do this by explaining where the peaks of the histogram are. (the peaks are the high points on the histogram and the high points have a high frequency so they are common)
Section 5: the problems of getting survey data in the real world
Paraphrase the following, there will be a similar discussions in the lectures and tutorials, you can paraphrase that instead
"When a business analyses the results of a survey in the real world it must pay careful attention to who is responding to the survey, maybe the people that respond to the survey are not the same as the people in the target market. If the people answering the survey are different to the population the survey maybe be biased and using the sample will not give an accurate representation of the population even if the sample size is large. A particular issue is that usually only a certain type of person (a person with a lot of free time) will actually respond to the survey, The fact that most surveys have a low response rate is a problem"
Section 6: Hypothesis tests
Just condsidering ending 1 and ending 2
1a) Test the claim that variable ending type is independent of the variable like tv show yes/no
1b) Test the claim that there is a difference in the amount people would pay for ending 1 and ending 2
You may use the following information
Which ending where they shown?
|
Population standard deviation of variable how much they would pay
|
Ending 1
|
σ1 = 5
|
Ending 2
|
σ2 = 4.9
|
Just condsidering ending 2 and ending 3
2a) Test the claim that variable ending type is independent of the variable like tv show yes/no
2b) Test the claim that there is a difference in the amount people would pay for ending 2 and ending 3
You may use the following information
Which ending where they shown?
|
Population standard deviation of variable how much they would pay
|
Ending 1
|
σ2 = 4.9
|
Ending 2
|
σ3 = 4.6
|
Section 7: A discussion about the concept of "sampling distribution"
Paraphrase one the following options , be careful when you do this phrases such as
"normally distributed"
"normal QQ plot"
"z distribution"
"sampling distribution of sample estimates"
They should not be changed in any way, they have a very specific meaning so using synonyms is totally inappropriate.
Section 8: conclusion
Discuss what you learnt after thinking about the previous sections.
Hi experts this is my assignment regarding business stats. There are instructions in them which the expert has to follow to compute the assignment all together.
Attachment:- assignment data.rar
Attachment:- Assignment.rar