Create a new dataset which has shorter name

Assignment Help Applied Statistics
Reference no: EM131437292

Logistic Regression Assignment

There are 3 parts to this assignment that must be done in R-studio, please complete ALL parts.

This script will look a little bit different than the last one -- it is going to build upon your R skills that you've already developed. 

I strongly recommend using the code we reviewed in class as an example for this -- use that to check your code and make sure you are on the right track.

a. Create a new dataset which has shorter name and same information.

b. Run frequencies and look at the following variables.

c. Show how the various variables differ by levels of the DFREE variable. We use CrossTable () to show the relationship between two categorical variables.

The IV Drug Use History at Admission variable has been created as a dummy variable as opposed to categorical. Look for ivhx2 and ivcx 3 in your dataset, you can use these dummy variables separately when you build your model.

Let us create a Naive model, meaning what are the probabilities of staying drug free without any predictors.

CONTINUE TO BUILD YOUR MODEL, ADDING THE VARIABLES YOU'VE DECIDED TO INCLUDE ONE BY ONE.

There are some interaction variables that are already written in the dataset. Try to apply those existed interaction variables or the interaction variable you created.

keep track of the odds radios, the significance of the variables you add, the model significance (modelsig), and the diagnostics (logisticPseudoR2s).

Attachment:- Assignment.rar

Reference no: EM131437292

Questions Cloud

Discuss the inappropriateness of addressing spiritual needs : They may also worry about inappropriateness of addressing spiritual needs, hold the belief that spirituality is synonymous with religion; they may also have difficulty separating the client's spiritual dimension from their psychosocial dimension.
Completing a research appraisal about a women : For this week, you will be completing a research appraisal about a women's health topic in which you may be interested. The purpose is to complete only a synthesis of the topic. To start, only look at studies that are specific to your women's heal..
Suppose the system has two stages : They can produce 7.5 units per hour, there are two machines, and the yield is 70%. Stage 2: They can produce 6 units per hour, there are three machines, and the yield is 85%. How many units can this system produce in one year, if it operates 8 hours ..
European data privacy-what is the price for free internet : What is the "price" for a free internet? Describe the new privacy law that was unveiled by the European Commission that deals with this issue. Briefly discuss three technology related privacy issues that we face in the U.S. now.
Create a new dataset which has shorter name : This script will look a little bit different than the last one -- it is going to build upon your R skills that you've already developed. Create a new dataset which has shorter name and same information. Run frequencies and look at the following vari..
Done in case of an accident : What are the ways in which first aid is done in case of an accident?
What is the arr of investment opportunity : Sinclair Wholesalers plc is currently considering opening a new sales outlet in Coventry. Two possible sites have been identified for the new outlet. Site A has a capacity of 30,000 sq. metres. It will require an average investment of £6 million, ..
Experiment a high school student : You live in a rural are. Your local radio station reports the results of an experiment a high school student conducted to test the effects of a healthy diet on the saturated fat content of cow's milk.
Identify why each section in this chapter fits the sections : Identify why each section in this chapter fits the sections discussed in class.Discuss each item that you fit under these sections and point out the differences.Write about the limitations (if any) you can think of using the book approach.

Reviews

len1437292

3/23/2017 1:30:13 AM

Subject- R-Studio. There are 3 parts to this assignment that must be done in R-studio, please complete ALL parts. I will send the other files (.cvs, .txt) for this assignment and the .cvs file, and the .txt file to be used with R-studio assignment.

Write a Review

Applied Statistics Questions & Answers

  Specific motors manufactures three different car

Specific Motors manufactures three different car models, Model X, Model Y, and Model Z, withrespective net revenues of$1000, $3000, and $6000. Each Model X requires 40 labor-hours and 1 tonof steel to produce, each Model Y requires 65 labor-hours and..

  A decision maker worst option has an expected value

A decision maker's worst option has an expected value

  1 length of pregnancies the length of human pregnancies

1. length of pregnancies the length of human pregnancies from conception to birth varies according to a distribution

  The normal approximation to the binomial

1. The J.O. Supplies Company buys calculators from a Korean supplier. The probability of a defective calculator is 20%. If 14 calculators are selected at random, what is the probability that less than 3 of the calculators will be defective?

  Find the probability of the average price

The cost of unleaded gasoline in the Bay Area once followed an unknown distribution with a mean of $4.59 and a standard deviation of $0.10. Sixteen gas stations from the Bay Area are randomly chosen. We are interested in the average cost of gasoline ..

  Four different types of solar energy collectors were tested.

Four different types of solar energy collectors were tested. Each was tested at five randomly chosen times, and the power (in watts) was measured. The results were as follows.

  Expected value is within one standard deviation

The expected value is within one standard deviation of themean(so you must calculate both the standard deviation and expected value for your game).

  The mean sales per customer µ for all of the sales

The mean sales per customer µ for all of the sales for your company last month is not known. Based on your past experience, you are willing to assume that the population standard deviation of sales, σ, is about $220. If you take a random sample of 10..

  Compute the mean and variance as functions of time

Compute the mean and variance as functions of time of the standard Ornstein-Uhlenbeck process with initial value Xt0 = 1. Show for the European put option under the BS model that the corresponding P&L process is zero

  Find point estimates of confidence intervals

Find point estimates of and 95 percent confidence intervals for the true total number, t, and the true proportion, p, of overstated accounts among all of the investment firm's accounts.

  Find probability that at least fourty of randomly selected

Assuming that the company's claim is valid, use the normal approximation to the binomial to calculate the probability that at least 40 of the 250 randomly selected consumers would say that they would never shop at that store again if they were sub..

  Construct a hypothesis test

Continuing with the previous example, the hat company now wants to get a sense of how many hats men and women will purchase. Construct a hypothesis test to determine whether these sample have different means while assuming different population varian..

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd