Completing a database lock and submission for analysis

Assignment Help Advanced Statistics
Reference no: EM133325300

Analysis Division Assignment

You are the lead biostatistician for the project 22-XYZ-01. The data management team provides you raw data from the lab vendor Health Labs and await your feedback before completing a database lock and submission for analysis. The DM team provides you two documents:

- Raw_Data_LAB.xlsx: Raw laboratory data
- LAB_Coding_Dictionary.pdf: Variable code dictionary

PART I

Prepare a brief (1-2 page maximum) feedback report for the DM team to further investigate/clean/format the data(base).

- Elaborate on any data that may be of concern through your review - provide rationales for your concerns, if you have any.
- Request for any additional information you may need from the DM team and/or the laboratory vendor in order to analyze the lab data.

Your analysis pipelines, written in R/SPSS, expects imported data via the TIDY data format:

- Provide instructions in your report to the DM team to re-format this database in order to have the data ready for analysis.

PART II

Prepare a short (1-3 page maximum) summary report for the data division lead, who does NOT have access to the raw data, but is familiar with the variable codes and laboratory parameters.

- Provide a general descriptive summary of the laboratory data.
- Present an EDA of the laboratory data; if performing outlier detection and/or imputations, provide rationales to justify any modifications to the raw database.

The client/sponsor is really eager to know (informally) if there is any evidence that the IP had any effect on the lab parameters through the course of the study:

- Apply a simple model to explore any potential trends in the data and include this in your report to the data division lead.

Attachment:- Analysis Division Assginment.rar

Verified Expert

This task provides a clear working example of descriptive and inferential statistics that helps the researcher to predict the lab factors that influence the platelet count. The descriptive statistics were computed for all the lab parameters and a box plot was constructed to determine the distribution of all those lab parameters. Apart from a few parameters all other lab parameters failed to validate the normality assumption

Reference no: EM133325300

Questions Cloud

Levels of inequality : An important theme in this class is that levels of inequality in the US are heavily impacted by structural factors and policy choices,
Explanation of anti-war social movement : An in-depth explanation of the Anti-War Social Movement. Explain using social movement theories.
Describe how a deep learning model works : Describe how a deep learning model works. Explain how Fitbit used Twitter data to improve its business What are the major challenges to social media
Create a data storage capability within your cloud platform : Create a data storage capability within your cloud platform. What is provided? Ensure to put together the elements, the functionality, the benefits.
Completing a database lock and submission for analysis : Analysis Division Assignment Elaborate on any data that may be of concern through your review - provide rationales for your concerns, if you have any
What is ethical or legal dilemma : What is an ethical or legal dilemma that might arise while working with a client who has been diagnosed with HIV/AIDS?
Provide a detail example of how the min-max normalization : provide a detail example of how the min-max normalization and Z-score standardization values are computed and also explain why a data scientist might want
Why does organization need inclusion plan : Why does an organization need an inclusion plan? Isn't the presence of a diverse workforce sufficient for the organization to realize positive outcomes?
Define the table structures in the database using sql : In M05, you should have submitted the ERD draft to your instructor for a review and feedback. All recommendations provided should have been applied

Reviews

Write a Review

Advanced Statistics Questions & Answers

  Relationship between speed, flow and geometry

Write a project proposal on relationship between speed, flow and geometry on single carriageway roads.

  Logistic regression model

Compute the log-odds ratio for each group in Logistic regression model.

  Logistic regression

Foundations of Logistic Regression

  Probability and statistics

The tubes produced by a machine are defective. If six tubes are inspected at random , determine the probability that.

  Solve the linear model

o This is a linear model. If your model needs a different engine, then you need to rethink your approach to the model. Remember, there are no IF, Max, or MIN statements in linear models.

  Plan the analysis

Plan the analysis

  Quantitative analysis

State the hypotheses that you are going to test.

  Modelise as a markov chain

modelise as a markov chain

  Correlation and regression

What are the degrees of freedom for regression

  Construct a frequency distribution for payment method

Construct a frequency distribution for Payment method

  Perform simple linear regression

Perform simple linear regression

  Quality control analysis

Determining the root causes

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd