How many unique missing data patterns exist

Reference no: EM131222444

Using your software of choice and the data from Exercise 1, examine the missing data patterns that exist in the file.

a. How many unique missing data patterns exist?

b. Which variables have some missing data?

c. Which variables have full data?
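One way to approach parts a through c is sketched below in Python with pandas. This is only an illustration under stated assumptions, not the book's solution: the helper names `missing_patterns` and `split_by_missingness` are hypothetical, and the small `toy` data frame is made up so the code runs without the NHANES file. For the real data, the frame would be loaded with `pd.read_stata` from the exercise file, and the column names may need adjusting to match the case used in that file.

```python
import pandas as pd


def missing_patterns(df, columns):
    """Return one row per distinct missingness pattern, with its frequency.

    Each pattern is a combination of True/False flags, where True means
    the value is missing for that variable.
    """
    indicators = df[columns].isna()
    return (
        indicators.groupby(columns).size()
        .reset_index(name="count")
        .sort_values("count", ascending=False)
        .reset_index(drop=True)
    )


def split_by_missingness(df, columns):
    """Return (variables with some missing data, variables with full data)."""
    n_missing = df[columns].isna().sum()
    some_missing = n_missing[n_missing > 0].index.tolist()
    full_data = n_missing[n_missing == 0].index.tolist()
    return some_missing, full_data


# Made-up toy data for illustration only; these are NOT NHANES values.
# With the real file one would instead use something like:
#     df = pd.read_stata("c11_exercises_nhanes.dta")
toy = pd.DataFrame({
    "RIAGENDR": [1, 2, 1, 2, 1],
    "BMXBMI": [24.1, None, 30.2, None, 27.5],
    "BPXSY1": [120.0, 118.0, None, None, 131.0],
})
toy_columns = ["RIAGENDR", "BMXBMI", "BPXSY1"]

patterns = missing_patterns(toy, toy_columns)
some_missing, full_data = split_by_missingness(toy, toy_columns)

print(patterns)                        # one row per unique pattern
print("unique patterns:", len(patterns))
print("some missing:", some_missing)
print("full data:", full_data)
```

The number of rows in `patterns` answers part a, and the two lists answer parts b and c. Grouping on the boolean indicators rather than on the raw values is what makes each row a pattern instead of a distinct data record.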

Exercise 1:

These exercises consider data from the 2005-2006 NHANES. The objective of this set of exercises is to perform a typical imputation and analysis session. The logical steps include examination of missing data, imputation of missing data values using a multiple imputation software tool of choice, and analysis of the imputed data sets using a companion software tool of choice capable of handling multiply imputed data sets. Begin by downloading the following subset of data from the 2005-2006 NHANES: c11_exercises_nhanes.dta (available from the book Web site). Note that this data set is limited to adults 18+ years of age and those who completed the NHANES medical examination (n = 5,334). The variables used in the imputation and analysis are gender (RIAGENDR), body mass index (BMXBMI), race/ethnicity (RIDRETH1), age (RIDAGEYR), and systolic blood pressure (BPXSY1). The data set also contains the NHANES complex design variables and probability weight (SDMVSTRA, SDMVPSU, WTMEC2YR).

a. Examine simple descriptive statistics for these variables (means, proportions, ranges, and counts of missing values) keeping in mind that the full n is 5,334. Use a software tool of choice for this step.

b. Pay close attention to the types of the variables with missing data (continuous, ordinal, binary, or nominal) and the amount of missing data; that is, what percent are missing on each variable? Prepare a table including the type of each variable in the data set along with the missing data rate for that variable.
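The descriptive step and the table asked for in part b could be sketched as follows in Python with pandas. The function name `missing_rate_table` and the `sample` data are hypothetical and made up for illustration. The measurement levels in `VARIABLE_TYPES` are an assumed classification based on how the variables are described above (gender as binary, race/ethnicity as nominal, the rest as continuous) and should be checked against the codebook.

```python
import pandas as pd

# Assumed measurement level for each analysis variable (verify with the codebook).
VARIABLE_TYPES = {
    "RIAGENDR": "binary",      # gender
    "BMXBMI": "continuous",    # body mass index
    "RIDRETH1": "nominal",     # race/ethnicity
    "RIDAGEYR": "continuous",  # age in years
    "BPXSY1": "continuous",    # systolic blood pressure
}


def missing_rate_table(df, variable_types):
    """Build a table of variable type, missing count, and percent missing."""
    cols = list(variable_types)
    table = pd.DataFrame({
        "type": pd.Series(variable_types),
        "n_missing": df[cols].isna().sum(),
        "pct_missing": (df[cols].isna().mean() * 100).round(2),
    })
    return table.loc[cols]  # keep the variables in the order listed above


# Made-up rows for illustration only; these are NOT NHANES values.
sample = pd.DataFrame({
    "RIAGENDR": [1, 2, 2, 1],
    "BMXBMI": [24.1, None, 30.2, 27.5],
    "RIDRETH1": [3, 4, 1, 3],
    "RIDAGEYR": [34, 61, 18, 45],
    "BPXSY1": [120.0, None, None, 131.0],
})

rate_table = missing_rate_table(sample, VARIABLE_TYPES)
print(rate_table)

# Simple descriptives: mean and range for continuous variables,
# proportions for categorical ones.
continuous = [v for v, t in VARIABLE_TYPES.items() if t == "continuous"]
categorical = [v for v, t in VARIABLE_TYPES.items() if t != "continuous"]
print(sample[continuous].agg(["count", "mean", "min", "max"]))
for name in categorical:
    print(sample[name].value_counts(normalize=True, dropna=False))
```

These summaries are unweighted. Design-based estimates would also need the weight and design variables (WTMEC2YR, SDMVSTRA, SDMVPSU), which this sketch does not use.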




All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd