Reference no: EM132660250
Assignment: Describing the data
Download the dataset PH303_census_2017.sav and save it to your computer. Start SPSS and open the dataset.
1) What are the two variables in the dataset?
2) Run a simple histogram for each variable. Copy and paste each histogram into your assignment. There is no need to make the histograms "pretty" with titles, etc. How many individuals are included in our sample?
3) For each variable, which is the better measure of the central tendency of the variable, the mean or the median?
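As an illustration of the idea behind question 3, the sketch below compares the mean and the median on two small, made-up lists. The values are hypothetical and are not taken from PH303_census_2017.sav. It only shows the general pattern that an extreme value pulls the mean much more than the median, which is one reason to look at the histogram before choosing a measure.

```python
import statistics

# Hypothetical values for illustration only (not from the assignment dataset).
symmetric = [48, 49, 50, 51, 52]   # roughly symmetric around 50
skewed = [48, 49, 50, 51, 520]     # same values, but one extreme value on the right

for name, values in [("symmetric", symmetric), ("skewed", skewed)]:
    mean = statistics.mean(values)
    median = statistics.median(values)
    # In the symmetric list the two measures agree; in the skewed list
    # the mean is dragged toward the extreme value while the median is not.
    print(f"{name}: mean = {mean}, median = {median}")
```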
4) For each variable that is better measured by the mean, use Analyze -> Descriptive Statistics -> Descriptives to create a table including the mean, standard deviation, and variance. Copy and paste the table into your assignment.
a. Verify for yourself that the variance is equal to the square of the standard deviation (you don't need to include anything in your assignment for this).
b. Create a boxplot for this variable. Copy and paste the boxplot into your assignment.
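The check in 4a can also be done outside SPSS. The following is a minimal sketch on made-up numbers, assuming the sample (n - 1) versions of both statistics, which is what Python's statistics.stdev and statistics.variance compute. The list named values is hypothetical and stands in for a column of the dataset.

```python
import math
import statistics

# Hypothetical values for illustration only (not from the assignment dataset).
values = [2, 4, 4, 4, 5, 5, 7, 9]

sd = statistics.stdev(values)        # sample standard deviation (n - 1 denominator)
var = statistics.variance(values)    # sample variance (n - 1 denominator)

# Squaring the standard deviation should reproduce the variance,
# up to floating-point rounding.
print("standard deviation:", sd)
print("variance:", var)
print("sd squared equals variance:", math.isclose(sd ** 2, var))
```

With the SPSS table, the same check amounts to squaring the reported standard deviation and comparing it with the reported variance. Small differences in the last digit are expected because the table shows rounded values.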
5) For each variable that is better measured by the median, use Analyze -> Descriptive Statistics -> Frequencies to create a table including the median, quartiles, minimum and maximum. Copy and paste the table into your assignment.
a. Calculate the IQR (interquartile range) using these statistics.
b. Calculate the Tukey fences using these statistics.
c. Based on the Tukey fences, does this dataset have outliers? How do you know?
d. Create a boxplot for this variable, being sure to remove the observation number of any outliers. Copy and paste the boxplot into your assignment.
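The arithmetic in 5a to 5c can be sketched as follows, assuming the usual definitions: IQR = Q3 - Q1, lower fence = Q1 - 1.5 x IQR, upper fence = Q3 + 1.5 x IQR. The function name tukey_fences and all numbers are hypothetical. In practice the quartiles, minimum, and maximum would be read from the SPSS Frequencies table.

```python
def tukey_fences(q1, q3, k=1.5):
    """Return (iqr, lower_fence, upper_fence) from the first and third quartiles.

    k = 1.5 is the conventional multiplier for Tukey fences.
    """
    iqr = q3 - q1
    lower_fence = q1 - k * iqr
    upper_fence = q3 + k * iqr
    return iqr, lower_fence, upper_fence


# Hypothetical statistics for illustration only (not from the assignment dataset).
q1, q3 = 20.0, 40.0
minimum, maximum = 5.0, 95.0

iqr, lower_fence, upper_fence = tukey_fences(q1, q3)

# If the minimum is below the lower fence, or the maximum is above the
# upper fence, at least one observation lies outside the fences.
has_low_outliers = minimum < lower_fence
has_high_outliers = maximum > upper_fence

print("IQR:", iqr)
print("fences:", lower_fence, upper_fence)
print("outliers below / above:", has_low_outliers, has_high_outliers)
```

Comparing only the minimum and maximum against the fences shows whether any outliers exist, which is what 5c asks. It does not say how many there are; the boxplot in 5d shows the individual points.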