Completing a database lock and submission for analysis

Reference no: EM133325300

Analysis Division Assignment

You are the lead biostatistician for project 22-XYZ-01. The data management (DM) team has provided you with raw data from the lab vendor, Health Labs, and awaits your feedback before completing a database lock and submitting the data for analysis. The DM team has provided two documents:

- Raw_Data_LAB.xlsx: Raw laboratory data
- LAB_Coding_Dictionary.pdf: Variable code dictionary

PART I

Prepare a brief (1-2 page maximum) feedback report for the DM team to further investigate/clean/format the data(base).

- Elaborate on any data that raise concerns during your review, and provide a rationale for each concern.
- Request any additional information you may need from the DM team and/or the laboratory vendor in order to analyze the lab data.

Your analysis pipelines, written in R/SPSS, expect imported data in the tidy data format:

- Provide instructions in your report to the DM team to re-format this database in order to have the data ready for analysis.
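As an illustration of what the re-formatting instructions might ask for, the sketch below reshapes a wide extract (one column per lab parameter) into a tidy layout (one row per subject, visit, and parameter). It is written in Python for brevity rather than in R/SPSS, and the column names `SUBJID`, `VISIT`, `HGB`, `PLT`, `PARAM`, and `VALUE`, along with all values, are hypothetical; the actual layout of Raw_Data_LAB.xlsx is not described here.

```python
def wide_to_tidy(rows, id_cols, value_cols):
    """Reshape wide records (one column per lab parameter) into tidy
    records with one row per subject, visit, and parameter."""
    tidy = []
    for row in rows:
        for param in value_cols:
            # Carry the identifying columns onto every output row.
            record = {col: row[col] for col in id_cols}
            record["PARAM"] = param
            record["VALUE"] = row.get(param)  # None marks a missing result
            tidy.append(record)
    return tidy


# Hypothetical wide extract; names and values are illustrative only.
wide = [
    {"SUBJID": "001", "VISIT": "Baseline", "HGB": 13.9, "PLT": 250},
    {"SUBJID": "001", "VISIT": "Week 4", "HGB": 14.1, "PLT": None},
]

tidy = wide_to_tidy(wide, id_cols=["SUBJID", "VISIT"], value_cols=["HGB", "PLT"])
for record in tidy:
    print(record)
```

Missing results are kept as explicit rows with an empty value rather than dropped, so the DM team can see which subject, visit, and parameter combinations still need to be queried.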

PART II

Prepare a short (1-3 page maximum) summary report for the data division lead, who does NOT have access to the raw data, but is familiar with the variable codes and laboratory parameters.

- Provide a general descriptive summary of the laboratory data.
- Present an exploratory data analysis (EDA) of the laboratory data; if you perform outlier detection and/or imputation, provide a rationale for any modification to the raw database.
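One common way to screen for outliers during EDA is Tukey's 1.5 × IQR rule; the brief does not prescribe a method, so this is only one possible choice. The sketch below, in Python using the standard library, flags values rather than deleting them, so any later modification to the raw database can still be justified case by case. The platelet values are made up for illustration.

```python
import statistics


def flag_outliers_iqr(values, k=1.5):
    """Return one boolean per value: True if it lies outside
    [Q1 - k*IQR, Q3 + k*IQR]. Expects non-missing numeric values."""
    q1, _, q3 = statistics.quantiles(values, n=4)  # quartile cut points
    iqr = q3 - q1
    lower, upper = q1 - k * iqr, q3 + k * iqr
    return [v < lower or v > upper for v in values]


# Hypothetical platelet counts for one visit; not taken from the raw data.
platelets = [210, 225, 240, 250, 255, 260, 270, 900]
flags = flag_outliers_iqr(platelets)

for value, is_outlier in zip(platelets, flags):
    print(value, "flagged" if is_outlier else "ok")
```

A flagged value is a prompt for a data query (for example, a possible unit or transcription error), not by itself a reason to exclude the observation.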

The client/sponsor is really eager to know (informally) if there is any evidence that the IP had any effect on the lab parameters through the course of the study:

- Apply a simple model to explore any potential trends in the data and include this in your report to the data division lead.
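A simple model for an informal look at trends could be an ordinary least-squares line of a lab parameter against study week, fitted separately per treatment group. The sketch below is a minimal Python illustration under strong assumptions: the group labels `IP` and `Placebo`, the visit weeks, and the mean platelet values are all hypothetical, and fitting group means ignores within-subject correlation, so it is exploratory only.

```python
def ols_slope_intercept(x, y):
    """Fit y = intercept + slope * x by ordinary least squares."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


# Hypothetical mean platelet counts by visit week; illustrative values only.
weeks = [0, 4, 8, 12]
mean_plt = {
    "IP": [250.0, 242.0, 234.0, 226.0],
    "Placebo": [248.0, 249.0, 247.0, 248.0],
}

trends = {group: ols_slope_intercept(weeks, values)
          for group, values in mean_plt.items()}
for group, (slope, intercept) in trends.items():
    print(f"{group}: slope per week = {slope:.2f}, intercept = {intercept:.1f}")
```

Comparing the per-group slopes gives a first impression of whether a parameter drifts differently over the course of the study; a model that accounts for repeated measures per subject would be the natural next step before drawing any conclusion.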

Attachment: Analysis Division Assginment.rar

Verified Expert

This task provides a worked example of descriptive and inferential statistics that helps the researcher identify the lab factors that influence the platelet count. Descriptive statistics were computed for all lab parameters, and box plots were constructed to examine the distribution of each parameter. Apart from a few parameters, the lab parameters did not satisfy the normality assumption.



