STAT 645 Biostatistics Assignment Problem

Reference no: EM132413758


1. For the income by degree and gender data set, contained in the file inc_deg_data.csv (Course Content/Data/incdeg):

(a) Make side-by-side box plots of income, with separate boxes for each of female arts (gender = 0, degree = 0), female science (gender = 0, degree = 1), male arts (gender = 1, degree = 0), and male science (gender = 1, degree = 1). Include labels on the x-axis to indicate which box goes with which category.
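
One possible way to build the four labelled boxes in part (a) is sketched below. The data frame here is a randomly generated placeholder, not the course data: in practice it would be replaced by something like read.csv("inc_deg_data.csv"). The column name income is an assumption (gender and degree are named in the problem), and the seed and placeholder values are purely illustrative.

```r
# Placeholder data standing in for inc_deg_data.csv (values are NOT real).
# Assumed column names: gender, degree, income (income in 1,000s of dollars).
set.seed(1)
inc <- data.frame(
  gender = rep(c(0, 0, 1, 1), each = 25),
  degree = rep(c(0, 1, 0, 1), each = 25),
  income = round(rnorm(100, mean = 50, sd = 10), 1)
)

# Combine the two 0/1 codes into one factor with readable labels,
# ordered as in the problem statement.
inc$group <- factor(
  paste(inc$gender, inc$degree),
  levels = c("0 0", "0 1", "1 0", "1 1"),
  labels = c("Female arts", "Female science", "Male arts", "Male science")
)

# Side-by-side box plots; the factor labels appear on the x-axis.
boxplot(income ~ group, data = inc,
        xlab = "Gender and degree",
        ylab = "Income (1,000s of dollars)")
```

Using a single labelled factor keeps the x-axis labels tied to the data, so the boxes cannot be mislabelled by a separately typed names vector.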

(b) Report the mean, median, standard deviation, and first and third quartiles of income.

(c) Report the mean, median, standard deviation, and first and third quartiles of income, now with income expressed in dollars (rather than 1,000s of dollars).

(d) Report the mean, median, standard deviation, and first and third quartiles of income (in 1,000s of dollars), now excluding the minimum and maximum values.
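
Parts (b) through (d) ask for the same five statistics on three versions of the income variable, so a small helper function may be convenient. The sketch below uses a short hypothetical vector in place of the real income column, and the helper name five_stats is made up for illustration. It relies on R's default quantile definition (type 7) and assumes that "excluding the minimum and maximum" means dropping one occurrence of each.

```r
# Hypothetical helper: mean, median, SD, and first/third quartiles.
five_stats <- function(x) {
  q <- quantile(x, probs = c(0.25, 0.75), names = FALSE)  # default type 7
  c(mean = mean(x), median = median(x), sd = sd(x), q1 = q[1], q3 = q[2])
}

# Hypothetical income values in 1,000s of dollars (NOT the course data).
income <- c(10, 20, 30, 40, 50)

# (b) Income in 1,000s of dollars.
stats_thousands <- five_stats(income)

# (c) Income in dollars: rescale by 1,000 before summarizing.
stats_dollars <- five_stats(income * 1000)

# (d) Drop one minimum and one maximum (assumed reading of the problem).
trimmed <- income[-c(which.min(income), which.max(income))]
stats_trimmed <- five_stats(trimmed)

print(rbind(thousands = stats_thousands,
            dollars   = stats_dollars,
            trimmed   = stats_trimmed))
```

Comparing the first two rows shows that every one of these statistics scales by the same factor of 1,000, whereas removing the extremes affects the standard deviation and quartiles differently from the mean and median.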

2. Set your random seed to be 101 (do set.seed(101)). Create a 100 × 5 matrix of random realizations from the standard normal distribution (normal with mean 0 and standard deviation 1).

(a) Report the column means (a vector of length 5). Demonstrate how you would do this (i) using the apply function and (ii) using vector/matrix arithmetic.
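
A minimal sketch of the two approaches in part (a) follows. The matrix name X and the column-by-column fill order (R's default) are assumptions, since the problem does not specify them. The matrix-arithmetic version uses the fact that pre-multiplying by a row vector of ones sums each column.

```r
set.seed(101)
X <- matrix(rnorm(100 * 5), nrow = 100, ncol = 5)

# (i) apply over columns (MARGIN = 2).
means_apply <- apply(X, 2, mean)

# (ii) Matrix arithmetic: (1' X) / n, where 1 is a vector of n ones.
ones <- rep(1, nrow(X))
means_matrix <- as.vector(t(ones) %*% X) / nrow(X)

print(means_apply)
print(means_matrix)
```

Both results should agree with the built-in colMeans(X) up to floating-point rounding.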

(b) Make a histogram of the row ranges; i.e., compute the range (maximum minus minimum) for each row, and make a histogram of the resulting 100 ranges.
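
For part (b), one way to get the row ranges is sketched below, again assuming the matrix is called X and is filled in R's default column order. Note that R's range() returns the pair (minimum, maximum) rather than their difference, so the sketch computes max minus min directly.

```r
set.seed(101)
X <- matrix(rnorm(100 * 5), nrow = 100, ncol = 5)

# Range of each row (MARGIN = 1): maximum minus minimum.
row_ranges <- apply(X, 1, function(r) max(r) - min(r))

hist(row_ranges,
     main = "Row ranges of a 100 x 5 standard normal matrix",
     xlab = "Range (max - min)")
```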

3. Consider the gamma distribution with shape and scale parameters both equal to 2; this corresponds to a mean of 4 and a variance of 8. Simulate samples of size n = 10, 30, and 90 from this distribution, repeating B = 1000 times. For each simulated data set, compute the sample mean. Thus, you will have B = 1000 sample means for each of the three sample sizes. For each sample size, draw a probability histogram (as opposed to a frequency histogram; you can do this by setting probability = TRUE as an option to the hist function). Overlay the normal curve that would apply if the central limit theorem could be assumed to hold. Report the resulting three figures as a single three-panel figure.
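
The simulation described above could be sketched as follows. Under the central limit theorem, the sample mean of n draws is approximately normal with mean 4 and variance 8/n, which is the curve overlaid here. The seed value and all variable names are assumptions made for illustration; the problem does not specify a seed for this question.

```r
set.seed(645)  # arbitrary seed, chosen only so the sketch is reproducible

shape <- 2
scale <- 2
B     <- 1000
sizes <- c(10, 30, 90)

mu     <- shape * scale    # mean of the gamma distribution: 4
sigma2 <- shape * scale^2  # variance of the gamma distribution: 8

# For each n, simulate B data sets and keep the B sample means.
sample_means <- lapply(sizes, function(n) {
  replicate(B, mean(rgamma(n, shape = shape, scale = scale)))
})
names(sample_means) <- paste("n =", sizes)

# Three panels side by side, one per sample size.
op <- par(mfrow = c(1, 3))
for (i in seq_along(sizes)) {
  hist(sample_means[[i]], probability = TRUE,
       main = names(sample_means)[i], xlab = "Sample mean")
  # Normal density implied by the CLT: N(mu, sigma2 / n).
  curve(dnorm(x, mean = mu, sd = sqrt(sigma2 / sizes[i])),
        add = TRUE, lwd = 2)
}
par(op)
```

Because the gamma distribution is right-skewed, the histogram for n = 10 would be expected to fit the normal curve less closely than the one for n = 90.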

4. In R, create a matrix named A with 5 rows and 4 columns, such that the first three rows are random numbers generated from the Normal(0, 1) distribution while the last two rows contain random numbers generated from the Uniform(2, 2) distribution. Create another matrix named B with 5 rows and 4 columns, such that all elements are random draws from the Beta(2, 1) distribution. For creating A and B, use set.seed(101) and set.seed(102), respectively.

(a) Provide the code to obtain the column sums of A (the sum of all entries in each column).

(b) Provide the code to obtain A + B, then print the (4, 2) and (4, 4) entries of this sum.

(c) Provide the code to obtain AB^T (A times the transpose of B), then print the (4, 2) and (4, 4) entries of this product.

(d) Obtain the inverse of B^T A (the transpose of B times A), and also obtain the determinant of B^T A.
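
A rough sketch of parts (a) through (d) is given below. Several choices are assumptions rather than requirements of the problem: each block of rows is filled in R's default column order, and the uniform bounds are kept in the variables unif_min and unif_max so they can be changed easily. As written in the problem, Uniform(2, 2) is degenerate (every draw equals 2), and the sketch follows the text literally.

```r
# Uniform bounds exactly as stated in the problem (degenerate: all draws are 2).
# These are variables so they can be adjusted if different bounds are intended.
unif_min <- 2
unif_max <- 2

set.seed(101)
A <- rbind(
  matrix(rnorm(3 * 4), nrow = 3),                               # rows 1-3
  matrix(runif(2 * 4, min = unif_min, max = unif_max), nrow = 2) # rows 4-5
)

set.seed(102)
B <- matrix(rbeta(5 * 4, shape1 = 2, shape2 = 1), nrow = 5)

# (a) Column sums of A.
col_sums <- colSums(A)
print(col_sums)

# (b) Elementwise sum and two of its entries.
S <- A + B
print(c(S[4, 2], S[4, 4]))

# (c) A times the transpose of B: (5 x 4) %*% (4 x 5) gives a 5 x 5 matrix.
P <- A %*% t(B)
print(c(P[4, 2], P[4, 4]))

# (d) Transpose of B times A: (4 x 5) %*% (5 x 4) gives a 4 x 4 matrix.
M     <- t(B) %*% A
M_inv <- solve(M)  # stops with an error if M is computationally singular
print(M_inv)
print(det(M))
```

Note that %*% is matrix multiplication, while * would multiply entry by entry. The product AB^T is 5 × 5 and has rank at most 4, so it cannot be inverted, whereas B^T A is 4 × 4 and is generally invertible.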




All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd