Calculate the average amount of loans granted by all states

Reference no: EM132629719

Assignment: You are given several CSV files. Start with the Word document to understand the nature of the data and the broad expectations for the final case analysis. You are expected to perform exploratory data analysis first and then the final analysis.

Data Details: You are given six years of lending data (2012 - 2017) in CSV format. The data files are larger than those you have used during this course so far. The size of each file is different and depends on the number of loans the company issued in that year. Note that the files are larger from 2015 onward, which is when the company went public and started issuing more loans. Each file has 31 columns (variables), and the description of each column is provided in the DataDictionary.xls file.

In addition, you are given state characteristics in a file called states.csv. This file contains demographic information such as population size, median income, and unemployment rate.

Lastly, you are given a regions file called states_regions.csv that contains the larger region and division that each state falls in. For example, New Hampshire is in the Northeast region and the New England division.

There are three sections to this case: Merging and Cleaning (15 points), Data Analysis (60 points), and Visualization (25 points), totaling 100 points.

Merging and Cleaning: Stack all six Lending Club files on top of each other. Then join the states.csv file with the stacked file using the state name as the key. Finally, merge the states_regions.csv file with the combined file so that you have one large file containing the Lending Club data along with the geographic and demographic information for each state.
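The stacking and joining steps described above can be sketched in pandas roughly as follows. This is a minimal illustration on toy data, not the expected solution: the column names (`state`, `loan_amnt`, `population`, `region`, `division`) are assumptions, and the real names should be taken from DataDictionary.xls and the two state files. In practice each yearly frame would be loaded with `pd.read_csv`.

```python
import pandas as pd

# Toy stand-ins for two of the six yearly files (assumed column names).
loans_2012 = pd.DataFrame({"state": ["NH", "CA"], "loan_amnt": [5000, 12000]})
loans_2013 = pd.DataFrame({"state": ["CA", "TX"], "loan_amnt": [8000, 15000]})

# Toy stand-ins for states.csv and states_regions.csv (assumed column names).
states = pd.DataFrame({
    "state": ["NH", "CA", "TX"],
    "population": [1_350_000, 39_000_000, 28_000_000],
})
regions = pd.DataFrame({
    "state": ["NH", "CA", "TX"],
    "region": ["Northeast", "West", "South"],
    "division": ["New England", "Pacific", "West South Central"],
})

# Step 1: stack the yearly files on top of each other.
stacked = pd.concat([loans_2012, loans_2013], ignore_index=True)

# Step 2: left-join the state demographics, then the regions, on the state key.
# validate="many_to_one" raises an error if a state appears twice in a lookup
# table, which would otherwise silently duplicate loan rows.
combined = (
    stacked
    .merge(states, on="state", how="left", validate="many_to_one")
    .merge(regions, on="state", how="left", validate="many_to_one")
)

print(combined)
```

A left join keeps every loan row even when a state has no match in the lookup files, so unmatched keys show up as missing values in the joined columns rather than as dropped loans. If the loan files and the state files identify states differently (for example abbreviations in one and full names in the other), the keys would need to be harmonized before merging.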

Analysis: Use the above file to analyze and answer the following questions:

1) Find the distribution of the number of loans by state, region, and division. Describe in your own words the geographic differences in the number of loans. Also, analyze your results by comparing the number of loans per capita. Did you notice any states missing from the Lending Club data? If yes, find out why.

2) Compare the average amount of loans granted by all states and divisions. Which states and divisions have the highest and lowest average loan amounts?

3) Compare the average interest rate charged and average loan amount by the loan Grade. Do you notice any patterns?

4) Run a frequency distribution of number of loans, average loan amount and average interest rate for each state by year (2012 through 2017). Describe the changing patterns in those numbers.

5) Is there a relationship between the population size of a state and the average loan amount given? Is there a relationship between the Grade of loans and the median income level in a state?

6) This is an open-ended question where you are asked to share an interesting fact that you found through data analysis.
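Questions 1 through 5 above mostly reduce to group-by aggregations on the merged file. The sketch below shows one possible shape for them on toy data. All column names (`state`, `division`, `year`, `grade`, `loan_amnt`, `int_rate`, `population`) are assumptions, as is the presence of a `year` column (it could be added to each yearly file before stacking). The numbers are invented for illustration and say nothing about the real data.

```python
import pandas as pd

# Toy stand-in for the merged file; every column name here is an assumption.
combined = pd.DataFrame({
    "state": ["CA", "CA", "CA", "TX", "TX", "NH"],
    "division": ["Pacific", "Pacific", "Pacific",
                 "West South Central", "West South Central", "New England"],
    "year": [2012, 2012, 2013, 2012, 2013, 2013],
    "grade": ["A", "B", "C", "B", "C", "A"],
    "loan_amnt": [10000, 14000, 21000, 8000, 12000, 5000],
    "int_rate": [6.0, 11.0, 16.0, 13.0, 18.0, 8.0],
})
# Toy stand-in for states.csv; IA has no loans here by construction.
states = pd.DataFrame({
    "state": ["CA", "TX", "NH", "IA"],
    "population": [40_000_000, 30_000_000, 1_000_000, 3_000_000],
})

# Q1: loan counts by state, loans per 100,000 residents, states with no loans.
n_loans = combined.groupby("state").size().rename("n_loans")
per_capita = states.set_index("state").join(n_loans)
per_capita["n_loans"] = per_capita["n_loans"].fillna(0).astype(int)
per_capita["loans_per_100k"] = (
    per_capita["n_loans"] / per_capita["population"] * 100_000
)
missing_states = sorted(set(states["state"]) - set(combined["state"]))

# Q2: average loan amount by state and by division, highest first.
avg_by_state = (
    combined.groupby("state")["loan_amnt"].mean().sort_values(ascending=False)
)
avg_by_division = (
    combined.groupby("division")["loan_amnt"].mean().sort_values(ascending=False)
)

# Q3: average interest rate and average loan amount per grade.
by_grade = combined.groupby("grade").agg(
    avg_int_rate=("int_rate", "mean"),
    avg_loan_amnt=("loan_amnt", "mean"),
)

# Q4: number of loans, average amount, and average rate per state and year.
by_state_year = combined.groupby(["state", "year"]).agg(
    n_loans=("loan_amnt", "size"),
    avg_loan_amnt=("loan_amnt", "mean"),
    avg_int_rate=("int_rate", "mean"),
)

# Q5: Pearson correlation between state population and average loan amount,
# computed over states that have at least one loan.
state_level = per_capita.join(avg_by_state.rename("avg_loan_amnt"))
state_level = state_level.dropna(subset=["avg_loan_amnt"])
pop_vs_loan = state_level["population"].corr(state_level["avg_loan_amnt"])

print(avg_by_state.idxmax(), avg_by_state.idxmin())
print(by_grade)
print(by_state_year)
```

The same pattern extends to regions (group by the region column) and to the second part of question 5, where one option is to map grades to ordinal scores, average them per state, and compare the result with median income. If the interest rate is stored as text with a percent sign in the real files, it would need to be converted to a number before averaging.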

Visualization: 1) Create a plot of interest rates by the Grade of a loan and describe the pattern.

2) Create a map of US states and color code the map with the average amount of loans given.

3) Show visually the relationship between the annual income of the recipient and the loan amount obtained from Lending Club.

4) Create a plot that shows the relationship between the length of employment and the amount of loan obtained.

5) Create a "regional" map and show an interesting relationship of your liking.
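As a rough starting point for the non-map plots (items 1, 3, and 4), a matplotlib sketch on toy data might look like the following. The column names (`grade`, `int_rate`, `annual_inc`, `loan_amnt`, `emp_length_years`) are assumptions, and the numeric `emp_length_years` column in particular is hypothetical: the real employment-length field may be stored as text and need cleaning first. The two maps are not covered here because they depend on a mapping library of your choice.

```python
import matplotlib
matplotlib.use("Agg")  # render off-screen so the sketch runs without a display
import matplotlib.pyplot as plt
import pandas as pd

# Toy stand-in for the merged file; all column names are assumptions.
loans = pd.DataFrame({
    "grade": ["A", "A", "B", "B", "C", "C"],
    "int_rate": [6.0, 8.0, 11.0, 13.0, 15.0, 17.0],
    "annual_inc": [90000, 60000, 75000, 50000, 40000, 55000],
    "loan_amnt": [20000, 10000, 15000, 9000, 6000, 12000],
    "emp_length_years": [10, 3, 7, 1, 0, 5],
})

fig, (ax_grade, ax_income, ax_emp) = plt.subplots(1, 3, figsize=(15, 4))

# Item 1: mean interest rate per grade as a bar chart.
mean_rate = loans.groupby("grade")["int_rate"].mean()
ax_grade.bar(mean_rate.index, mean_rate.values)
ax_grade.set_xlabel("Grade")
ax_grade.set_ylabel("Mean interest rate (%)")

# Item 3: annual income against loan amount as a scatter plot.
ax_income.scatter(loans["annual_inc"], loans["loan_amnt"])
ax_income.set_xlabel("Annual income")
ax_income.set_ylabel("Loan amount")

# Item 4: mean loan amount for each length of employment.
mean_by_emp = loans.groupby("emp_length_years")["loan_amnt"].mean()
ax_emp.plot(mean_by_emp.index, mean_by_emp.values, marker="o")
ax_emp.set_xlabel("Length of employment (years)")
ax_emp.set_ylabel("Mean loan amount")

fig.tight_layout()
```

Calling `fig.savefig("plots.png")` afterwards would write the figure to disk. With the full data set, a box plot per grade or a sampled scatter plot may show the patterns more clearly than raw points.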



