Identify the key characteristics of the data

Reference no: EM132210049

Assignment - 10 pages of solutions with descriptions are needed.

Learning objectives -

1: Demonstrate a practical understanding of core quantitative data analysis methods in data science applications and research.

2: Demonstrate skills in implementing these methods on real data using a software package and in critically evaluating and interpreting the results.

3: Evaluate the strengths and weaknesses of quantitative analysis methods alongside an understanding of how and when to use or combine methods.

Assignment - The data contains 3000 observations on the following 11 variables.

year - Year that wage information was recorded.

age - Age of worker.

maritl - A factor with five levels indicating marital status 1. Never Married 2. Married 3. Widowed 4. Divorced and 5. Separated.

race - A factor indicating race with levels 1. White 2. Black 3. Asian and 4. Other.

education - A factor indicating education level with levels 1. < HS Grad 2. HS Grad 3. Some College 4. College Grad and 5. Advanced Degree.

region - Region of the country (mid-atlantic only).

jobclass - A factor indicating type of job with levels 1. Industrial and 2. Information.

health - A factor indicating health level of worker with levels 1. <=Good and 2. >=Very Good.

health_ins - A factor indicating whether worker has health insurance with levels 1. Yes and 2. No.

logwage - Log of worker's wage.

wage - Worker's raw wage.

1. Explore the data. Plot and produce summary statistics to identify the key characteristics of the data (for some of the variables listed above) and produce a report of your findings. 5 - 10 tables or figures are expected accompanied by a description of your main findings. The topics that you might choose to discuss include: possible issues with the data collection, identification of possible outliers or mistakes in the data, role of missing data (if any), distribution of the variables provided, relationships between variables.
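The summary statistics and outlier check described above can be sketched as follows. The assignment itself asks for R commands, so this is only a Python illustration of the same calculations using the standard library. The wage values are hypothetical and are not taken from the data file, and the 1.5 x IQR rule is one common convention for flagging outliers, assumed here rather than required by the brief.

```python
import statistics

# Hypothetical wage values for illustration only; NOT taken from the data file.
wages = [75.0, 82.5, 90.1, 95.3, 101.8, 104.9, 110.2, 118.9, 127.1, 134.7, 318.3]


def iqr_outliers(values):
    """Return the values lying more than 1.5 * IQR outside the quartiles."""
    q1, _, q3 = statistics.quantiles(values, n=4)
    spread = q3 - q1
    low, high = q1 - 1.5 * spread, q3 + 1.5 * spread
    return [v for v in values if v < low or v > high]


# Basic location and spread measures for one numeric variable.
summary = {
    "n": len(wages),
    "mean": statistics.mean(wages),
    "median": statistics.median(wages),
    "stdev": statistics.stdev(wages),
    "min": min(wages),
    "max": max(wages),
}

for name, value in summary.items():
    print(f"{name:>6}: {value:.2f}")
print("flagged:", iqr_outliers(wages))
```

A flagged value is a candidate for inspection, not automatically a mistake: a very high wage may be genuine, which is one reason the report should discuss outliers rather than simply delete them.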

2. What are the pairwise associations between variables in the dataset? Use correlation analysis, scatter plots, box plots, and a chi-squared test to test for associations between pairs. You can choose 3-4 associations to test for. What are the underlying assumptions of the statistical test that you applied? Are the assumptions satisfied? What do these test results mean?
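As an illustration of the chi-squared test mentioned above, the sketch below computes the test of independence by hand for a 2x2 table of two factors (for example jobclass against health_ins). It is written in Python for illustration, whereas the assignment expects R. The counts are hypothetical and not taken from the data file, the function name is made up for this example, and no continuity correction is applied, so R's `chisq.test`, which applies one to 2x2 tables by default, would report a somewhat different statistic.

```python
import math

# Hypothetical 2x2 counts; NOT taken from the data file.
# Rows: jobclass (Industrial, Information). Columns: health_ins (Yes, No).
observed = [
    [60, 40],
    [80, 20],
]


def chi_squared_2x2(table):
    """Pearson chi-squared test of independence for a 2x2 table (no correction)."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    total = sum(row_totals)
    # Expected count under independence: row total * column total / grand total.
    expected = [[r * c / total for c in col_totals] for r in row_totals]
    stat = sum(
        (table[i][j] - expected[i][j]) ** 2 / expected[i][j]
        for i in range(2)
        for j in range(2)
    )
    # With 1 degree of freedom, P(chi2 > x) = erfc(sqrt(x / 2)).
    p_value = math.erfc(math.sqrt(stat / 2))
    return stat, p_value, expected


stat, p_value, expected = chi_squared_2x2(observed)
smallest_expected = min(min(row) for row in expected)

print(f"chi-squared = {stat:.3f}, p-value = {p_value:.4f}")
print(f"smallest expected count = {smallest_expected:.1f}")
```

The smallest expected count is printed because a common rule of thumb for the chi-squared approximation is that every expected cell count should be at least 5. The test also assumes independent observations.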

3. Use multiple linear regression to establish which variables affect the level of wages. Why might one focus on predicting log-wage rather than wage directly? Which variables can be used to predict wages?

1. Carry out a descriptive analysis and draw plots aimed at finding the answer to the question above.

2. Perform a multiple linear regression of logwage on some or all of the other variables.

3. Discuss the interpretation of the results and check the residuals plot. Discuss any weaknesses of this analysis and how effectively it answers the question above.
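The regression steps above can be sketched as follows, as a Python illustration only, since the assignment expects R (where `lm` would normally be used). The sketch fits log-wage on two predictors by solving the normal equations, computes residuals, and converts a coefficient into a percentage effect on wage. The data are synthetic and built from assumed coefficients, not taken from the data file, and the helper names `solve` and `ols` are hypothetical.

```python
import math


def solve(a, b):
    """Solve a x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [b[i]] for i, row in enumerate(a)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    x = [0.0] * n
    for r in range(n - 1, -1, -1):
        x[r] = (m[r][n] - sum(m[r][c] * x[c] for c in range(r + 1, n))) / m[r][r]
    return x


def ols(rows, y):
    """Least-squares coefficients for y ~ intercept + predictors (normal equations)."""
    x = [[1.0] + list(r) for r in rows]
    k = len(x[0])
    xtx = [[sum(xi[p] * xi[q] for xi in x) for q in range(k)] for p in range(k)]
    xty = [sum(xi[p] * yi for xi, yi in zip(x, y)) for p in range(k)]
    return solve(xtx, xty)


# Synthetic predictors: (age, college-graduate dummy). NOT from the data file.
rows = [(25, 0), (30, 1), (35, 0), (40, 1), (45, 0), (50, 1), (55, 0), (60, 1)]
# Assumed "true" coefficients and a small fixed perturbation, for illustration.
true_beta = (4.0, 0.01, 0.30)
noise = [0.02, -0.01, -0.02, 0.01, 0.01, -0.02, 0.01, 0.00]
logwage = [
    true_beta[0] + true_beta[1] * age + true_beta[2] * grad + e
    for (age, grad), e in zip(rows, noise)
]

beta = ols(rows, logwage)
fitted = [beta[0] + beta[1] * age + beta[2] * grad for age, grad in rows]
residuals = [yi - fi for yi, fi in zip(logwage, fitted)]

# On the log scale, a dummy coefficient b multiplies wage by exp(b).
pct_effect = (math.exp(beta[2]) - 1) * 100

print("coefficients:", [round(b, 4) for b in beta])
print(f"college-graduate effect on wage: {pct_effect:.1f}%")
print("residuals:", [round(r, 4) for r in residuals])
```

One reason to model log-wage is visible in the last step: coefficients read as approximate percentage changes in wage, and the log transform typically reduces the right skew of wage data, which tends to make residuals look closer to the constant-variance pattern that the residuals plot is meant to check.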

Attachment:- Data File.rar

1/7/2019 10:43:05 PM

Need 10 pages of solutions with description. All learning outcomes must be demonstrated. Technical Content (40%) - Choice of appropriate methods and Implementation. Interpretation (40%) - Justification of methods used, discussion of model assumptions and Interpretation of results. Presentation (20%) - Written presentation and Clear / appropriate choice of graphs.


1/7/2019 10:42:39 PM

FORMAT OF THE ASSESSMENT - The coursework should include the R commands used to produce the results, the plots and the tables. Marks will be given for writing the R commands and obtaining relevant results, but full marks will be given only if the assignment also includes the justification for the methods used, where appropriate, and the interpretation of the results. Maximum number of pages: 10. The assessment should be submitted as a doc, docx or pdf file.


All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd