Build a logistic regression model that can predict loan approval

Reference no: EM133130703

Assessment Description

It is important to have experience implementing one of the most common applications of regression currently used in business, finance, and healthcare. Questions such as whether a loan should be approved, whether a driver is entitled to a discount, and whether a patient will survive are all answered with a form of logistic regression (i.e., with a yes/no answer).

Using a dataset representing applications for a bank loan, the task will be to build a logistic regression model that can predict whether or not a loan will be approved.

Useful R functions for this assignment are:
1. Data exploration: is.na(), summary()
2. Split data into train/test: sample()
3. Build the model: glm(), summary()
4. Model performance evaluation: predict()
5. Model validation: library(gains), gains(), plot(), lines(), dim(), library(car), vif(), glm(), summary(), predict(), ifelse()
6. Validate prediction: table(), mean()
7. Results interpretation: library(ROCR), predict(), prediction(), performance()

For this activity, perform the following:
Load "application_record.csv" (located in the topic Resources; source: Credit Card Approval Prediction | Kaggle) into a data frame and perform initial exploratory tasks:
1. Display representative portions of the data.
2. Check for missing values and clean the data.
3. Check for outliers and decide if and how to process them.
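
The three exploratory tasks above could be sketched as follows. This is only an illustration on synthetic data: the column names are assumptions modelled on the Kaggle file, and the choices shown (recoding missing occupations as "Unknown", capping incomes at the 1.5 * IQR fence) are one option among several, not a required method. With the real file, the data frame would come from read.csv("application_record.csv") instead.

```r
# Synthetic stand-in for application_record.csv.
# Column names are ASSUMPTIONS; replace this block with
# applications <- read.csv("application_record.csv") for the real data.
set.seed(42)
n <- 200
applications <- data.frame(
  ID               = seq_len(n),
  AMT_INCOME_TOTAL = round(rlnorm(n, meanlog = 12, sdlog = 0.5)),
  CNT_CHILDREN     = rpois(n, 0.5),
  OCCUPATION_TYPE  = sample(c("Laborers", "Managers", NA), n,
                            replace = TRUE, prob = c(0.5, 0.2, 0.3)),
  stringsAsFactors = FALSE
)

# 1. Display representative portions of the data
print(head(applications))
str(applications)
print(summary(applications))

# 2. Count missing values per column, then clean.
#    Here a missing occupation becomes its own category instead of
#    dropping the row (an illustrative choice).
missing_counts <- colSums(is.na(applications))
print(missing_counts)
applications$OCCUPATION_TYPE[is.na(applications$OCCUPATION_TYPE)] <- "Unknown"

# 3. Flag high-income outliers with the 1.5 * IQR rule
q <- quantile(applications$AMT_INCOME_TOTAL, c(0.25, 0.75))
upper_fence <- unname(q[2] + 1.5 * diff(q))
is_outlier <- applications$AMT_INCOME_TOTAL > upper_fence
cat("Incomes above the upper fence:", sum(is_outlier), "\n")

# One way to process them: cap at the fence rather than delete rows
applications$AMT_INCOME_CAPPED <- pmin(applications$AMT_INCOME_TOTAL, upper_fence)
```

Whatever treatment is chosen for outliers, it is worth stating the reason in the report, since deleting rows and capping values can lead to different fitted coefficients.
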
Formally state what your model will predict using the variables in the data.
Split the data into a training set and a testing set with a split ratio of 70:30.
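
A 70:30 split with sample() might look like the sketch below. The data frame `loans` and its columns `income` and `approved` are hypothetical placeholders, not columns of the real dataset.

```r
set.seed(123)  # fix the seed so the random split is reproducible

# Hypothetical stand-in data; 'approved' is an assumed 0/1 outcome column
n <- 1000
loans <- data.frame(
  income   = rlnorm(n, meanlog = 11, sdlog = 0.5),
  approved = rbinom(n, 1, 0.7)
)

# Draw 70% of the row indices without replacement for training
train_idx <- sample(seq_len(nrow(loans)), size = floor(0.7 * nrow(loans)))
train <- loans[train_idx, ]
test  <- loans[-train_idx, ]   # the remaining 30%

print(dim(train))
print(dim(test))

# Check that the outcome is similarly balanced in both sets
print(prop.table(table(train$approved)))
print(prop.table(table(test$approved)))
```
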

Build the Predictive Model:
1. Define the formula for the glm().
2. Run the model.
3. Interpret the results, referring to the p-values.
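
As a rough sketch of these three steps, the example below fits a logistic regression with glm() on simulated data and pulls out the p-values from the coefficient table. The predictors `income_k`, `debt_ratio`, and `noise_var` are invented for illustration; the real formula would use variables from the loan dataset.

```r
set.seed(1)
n <- 1000

# Simulated predictors (all names are hypothetical)
income_k   <- rnorm(n, mean = 60, sd = 15)  # income in thousands
debt_ratio <- runif(n, 0, 1)
noise_var  <- rnorm(n)                      # unrelated to the outcome by construction

# Outcome generated from a known logistic model
approved <- rbinom(n, 1, plogis(-1 + 0.05 * income_k - 3 * debt_ratio))
train <- data.frame(approved, income_k, debt_ratio, noise_var)

# 1. Define the formula
loan_formula <- approved ~ income_k + debt_ratio + noise_var

# 2. Run the model; family = binomial gives logistic regression
fit <- glm(loan_formula, data = train, family = binomial(link = "logit"))
print(summary(fit))

# 3. Extract the p-values (Wald z-tests) for interpretation
p_values <- coef(summary(fit))[, "Pr(>|z|)"]
print(round(p_values, 4))

# Exponentiated coefficients read as odds ratios per one-unit increase
print(round(exp(coef(fit)), 3))
```

A small p-value suggests the coefficient differs from zero given the other predictors in the model; it does not by itself measure how large or practically important the effect is.
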

Evaluate the Model Performance:
1. Compare the predicted versus actual values.
2. Search for any predictions that differ significantly from the actual values.
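
One possible way to carry out this comparison is to place predicted probabilities next to the actual outcomes on the test set and list the cases where they disagree most. The data below are simulated, and the 0.8 cutoff for a "significant" difference is an arbitrary assumption chosen for illustration.

```r
set.seed(2)
n <- 1000

# Simulated data with one hypothetical predictor x and a 0/1 outcome y
x <- rnorm(n)
y <- rbinom(n, 1, plogis(0.5 + 1.5 * x))
dat <- data.frame(x, y)

idx   <- sample(n, round(0.7 * n))
train <- dat[idx, ]
test  <- dat[-idx, ]

fit <- glm(y ~ x, data = train, family = binomial)

# type = "response" returns probabilities rather than log-odds
test$prob <- predict(fit, newdata = test, type = "response")

# 1. Predicted versus actual, side by side
comparison <- data.frame(actual = test$y, predicted = round(test$prob, 3))
print(head(comparison, 10))

# 2. Cases where the prediction is far from the actual value
test$abs_error <- abs(test$y - test$prob)
big_misses <- test[test$abs_error > 0.8, ]   # 0.8 is an arbitrary cutoff
cat("Predictions off by more than 0.8:", nrow(big_misses), "\n")
print(head(big_misses[order(-big_misses$abs_error), ]))
```
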

Validate the Model:
1. Produce a Gain and Lift chart and use it to describe the performance of the model.
2. Measure the Variance Inflation Factor (VIF) to test for multicollinearity. If changes to the model are necessary based on the VIF, state and implement them.
3. Has the formula, as defined in the previous section, changed? Why or why not?
4. If changes to the model occurred, repeat the validation steps on the new model.
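
The validation steps can be illustrated in base R, as sketched below on simulated data. This computes by hand a decile gains and lift table of the kind the gains package reports, and a textbook VIF, 1 / (1 - R^2), from regressing each predictor on the others. The assignment itself points to gains() and car::vif(); values from car::vif() on a glm may differ somewhat from the textbook figure because it works from the fitted model's coefficient covariance. The predictors x1, x2, and x3 are hypothetical, and x2 is deliberately built as a near copy of x1.

```r
set.seed(3)
n <- 1000

x1 <- rnorm(n)
x2 <- x1 + rnorm(n, sd = 0.1)   # nearly a copy of x1: deliberate collinearity
x3 <- rnorm(n)
y  <- rbinom(n, 1, plogis(0.3 + 1.2 * x1 - 0.8 * x3))
dat <- data.frame(y, x1, x2, x3)

fit  <- glm(y ~ x1 + x2 + x3, data = dat, family = binomial)
prob <- fitted(fit)

# Gain and lift by decile: sort cases from highest to lowest predicted
# probability, cut into 10 equal groups, and accumulate the positives.
ord      <- order(prob, decreasing = TRUE)
decile   <- ceiling(seq_len(n) / (n / 10))
hits     <- tapply(dat$y[ord], decile, sum)
cum_gain <- cumsum(hits) / sum(dat$y)       # share of all positives captured
lift     <- cum_gain / (seq_len(10) / 10)   # relative to random selection
print(round(cbind(cum_gain, lift), 2))

plot(seq(10, 100, by = 10), 100 * cum_gain, type = "b",
     xlim = c(0, 100), ylim = c(0, 100),
     xlab = "% of cases targeted", ylab = "% of positives captured",
     main = "Cumulative gains chart")
lines(c(0, 100), c(0, 100), lty = 2)        # baseline: no model

# Textbook VIF for each predictor
predictors <- c("x1", "x2", "x3")
vif_manual <- sapply(predictors, function(v) {
  others <- setdiff(predictors, v)
  r2 <- summary(lm(reformulate(others, response = v), data = dat))$r.squared
  1 / (1 - r2)
})
print(round(vif_manual, 1))

# A common response to a very high VIF: drop one of the collinear
# predictors, refit, and repeat the validation on the new model.
fit_reduced <- glm(y ~ x1 + x3, data = dat, family = binomial)
print(summary(fit_reduced))
```

Common rules of thumb treat a VIF above 5 or 10 as a sign of problematic multicollinearity, though the threshold is a convention rather than a formal test.
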

Make Predictions:
1. Demonstrate a few examples of predictions your model can make.
2. Validate the predictions by calculating the misclassification error.
3. Interpret the results.
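
A minimal sketch of these steps, using predict(), ifelse(), table(), and mean() on simulated data, is shown here. The applicant values, the column names, and the 0.5 classification cutoff are all assumptions for illustration; a different cutoff may suit a lender that weighs false approvals and false rejections differently.

```r
set.seed(4)
n <- 1000

# Simulated data; column names are hypothetical
income_k   <- rnorm(n, mean = 60, sd = 15)
debt_ratio <- runif(n)
approved   <- rbinom(n, 1, plogis(-1 + 0.05 * income_k - 3 * debt_ratio))
dat <- data.frame(approved, income_k, debt_ratio)

idx   <- sample(n, 700)
train <- dat[idx, ]
test  <- dat[-idx, ]

fit <- glm(approved ~ income_k + debt_ratio, data = train, family = binomial)

# 1. Example predictions for three made-up applicants
new_applicants <- data.frame(income_k   = c(30, 60, 90),
                             debt_ratio = c(0.8, 0.4, 0.1))
new_applicants$prob <- predict(fit, newdata = new_applicants, type = "response")
new_applicants$decision <- ifelse(new_applicants$prob > 0.5, "Approve", "Reject")
print(new_applicants)

# 2. Misclassification error on the held-out test set
test_prob <- predict(fit, newdata = test, type = "response")
test_pred <- ifelse(test_prob > 0.5, 1, 0)

confusion <- table(Predicted = test_pred, Actual = test$approved)
print(confusion)

misclass_error <- mean(test_pred != test$approved)
cat("Misclassification error:", round(misclass_error, 3), "\n")
cat("Accuracy:", round(1 - misclass_error, 3), "\n")
```

When interpreting the error rate, it helps to compare it with the rate obtained by always predicting the majority class, since a model can look accurate on imbalanced data without being useful.
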
State a few suggestions for improving the model.

Submit a professionally written and formatted R Markdown document knitted as a PDF. Make sure the documentation contains the R code, relevant plots, your analysis, and the appropriate citations and references.

While APA style is not required for the body of this assignment, solid academic writing and documentation of sources are expected.


All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd