STAT 645 - Biostatistics Assignment Problem

Assignment Help Applied Statistics
Reference no: EM132414197

STAT 645 - Biostatistics Assignment, Texas A&M University, USA

Q1. For the income by degree and gender data set, contained in the file inc_deg_data.csv (Course Content/Data/incdeg):

(a) Make side-by-side box plots of income, with separate boxes for each of female arts (gender = 0, degree= 0), female science (gender= 0, degree= 1), male arts (gender= 1, degree = 0), and male science (gender= 1, degree= 1). Include labels on the x-axis to indicate which box goes with which category.

(b) Report the mean, median, standard deviation, and first and third quartiles of income.

(c) Report the mean, median, standard deviation, and first and third quartiles of income, now with income expressed in dollars (rather than 1,000s of dollars).

(d) Report the mean, median, standard deviation, and first and third quartiles of income (in 1,000s of dollars), now excluding the minimum and maximum values.

Q2. Set your random seed to be 101 (do set.seed(101)). Create a 100 x 5 matrix of random realizations from the standard normal distribution (normal with mean 0 and standard deviation 1).

(a) Report the column means (a vector of length 5). Demonstrate how you would do this (i) using the apply function and (ii) using vector/matrix arithmetic.

(b) Make a histogram of the row ranges; i.e., compute the range (maximum minus minimum) for each row, and make a histogram of the resulting 100 ranges.

Q3. Consider the gamma distribution with shape and scale parameters both equal 2; this corresponds to a mean of 4 and a variance of 8. Simulate samples of size n = 10, 30, 90 from this distribution, repeating B = 1000 times. For each simulated data set, compute the sample mean. Thus, you will have B = 1000 sample means for each of the three sample sizes. For each sample size, draw a probability histogram (as opposed to a frequency histogram, you can do this by setting probability = TRUE as an option to the hist function). Overlay the normal curve that would apply if the central limit theorem could be assumed to hold. Report the resulting three figures as a single three-panel figure.

Q4. In R create a matrix, named A, with 5 rows and 4 columns, such that the first three rows are random numbers generated from normal(0, 1) distribution while the last two rows contain random numbers generated from Uniform(2, 2). Create another matrix, named B, with 5 rows and 4 columns, such that the all elements are random draw from the Beta(2, 1) distribution. For creating A and B, use set.seed(101) and set.seed(102), respectively.

(a) Provide the code to obtain the column sum of A (sum of all entries for each column).

(b) Provide the code to obtain A + B, then print the (4, 2) and (4, 4)th entries of this sum.

(c) Provide the code to obtain ABT, then print the (4, 2) and (4, 4)th entries of this multiplication.

(d) Obtain the inverse of BTA, and also obtain the determinant of BTA.

Reference no: EM132414197

Questions Cloud

What is the firm wac : The firm pays taxes at the marginal rate of 40%. What is the firm's WAC? Round your answer to the nearest tenth of a percent.
What is the present value of the tax savings : If the firm is in the 35% tax bracket and the appropriate discount rate is 9%, what is the present value of the tax savings?
BSBINN801 Lead innovative thinking and practice Assignment : BSBINN801 Lead innovative thinking and practice Assignment Help and Solution, Alpha Institute - Assessment Writing Service - Identify and describe a creativity
What will the npv be on the proposed project : What will the NPV be on the proposed project based on a weighted average cost of capital of 12%? Additionally, what will the project's IRR be?
STAT 645 - Biostatistics Assignment Problem : STAT 645 - Biostatistics Assignment, Homework Help, Texas A&M University, USA. Compute the range (maximum minus minimum) for each row
What is the reported cost of goods sold on gsi : What is the reported cost of goods sold on GSI's income statement for 2007 under the speci?c identi?cation method?
What is the maximum loan a manager : If the average historical losses in the agricultural sector total 9 %, what is the maximum loan amanager can lend to a farmer as a percentage of total capital?
Calculate steve debt service ratio : His taxation and Medicare liability totals $46,500 pa. Calculate Steve's debt service ratio (to the nearest percentage point).
Calculate book value of an asset-genetic insights co : Genetic Insights Co. purchases an asset for $15,966. This asset qualifies as a seven-year recovery asset under MACRS.

Reviews

Write a Review

Applied Statistics Questions & Answers

  Write a concise manuscript

You are required to write a concise "manuscript" draft using all the information you have. All the data analyses should be done yourself

  Calculate the difference for each subject

Calculate the difference for each subject, then calculate the mean and standard deviation of these differences.

  A group of organic pollutants found in a variety

Polychlorinated biphenyl (PCB) is among a group of organic pollutants found in a variety of products, such as coolants, insulating materials, and lubricants in electrical equipment. Disposal of items containing less than 50 parts per million (ppm) PC..

  What is the probability of making a type i error

In an experiment, one group of research participants is given 10 pages of material to proofread for errors. Another group proofreads the same material on a computer screen. The dependent variable is the number of errors detected in a 5­ minute per..

  What your conclusions about gender equal pay for equal work

Pick any other variable you are interested in and do a simple 2-way ANOVA without replication. Why did you pick this variable and what do the results show? Using the results for this week, What are your conclusions about gender equal pay for equal ..

  Testing hypotheses for means

Research Design and Statistical Design and the Skill Builder: Hypothesis Testing for Independent Samples t-test, which you can find by navigating back to your

  An airline is deciding how many reservations to accept

An airline is deciding how many reservations to accept for a new flight from Indianapolis to Chicago. The plane has a capacity of 200 seats.  Because only 90 percent of passengers typically show up for this flight, they need to determine how many tic..

  Design an experiment comparing modalities of training

Design an experiment comparing modalities of training - Identify the variables within this training study and explain how they will be manipulated

  Write two object classes and helper function to read in text

ST114: Final Project. Instructions: In this project, you will write two object classes and a helper function to read in text data

  The reaction times of teenage drivers are normally distrib

1)* Suppose that the reaction times of teenage drivers are normally distributed with a mean of 0.53 seconds and a standard deviation of 0.11 seconds. A) What is the probability that a teenage driver chosen at random will have a reaction time less tha..

  Would the null hypothesis be rejected

Using the results you got from Question 3 and a level of significance at .05, what are the two-tailed critical values associated with each? Would the null hypothesis be rejected

  How biostatistics is used to quantify the evidence

Consider how biostatistics is used to quantify the evidence and explore the unknown in health issues, most particularly in public health.

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd