Build a model to predict a countrys life expectancy

Assignment Help Python Programming

Reference no: EM132937043

301312 Applied Machine Learning Report

1 Report Overview

Given the "Life Expectancy Data.csv" dataset, build a model to predict a country's "Life expectancy" using some of the following features from the "Life Expectancy Data.csv" dataset1:

• Year;
• Status;
• Adult Mortality;
• infant deaths;
• Alcohol;
• percentage expenditure;
• Hepatitis B;
• Measles;
• BMI;
• under-five deaths;
• Polio;
• Total expenditure;

• Diphtheria;
• HIV/AIDS;
• GDP;
• Population;
• thinness 1-19 years;
• thinness 5-9 years;
• Income composition of resources;
• Schooling.

2 Report's Contents
Ideally, your report should contain the following contents corresponding to the machine learning project checklist we discussed during Week 2's lecture.

2.1 Frame the Problem
At this initial step, you may first consider what type of machine learning solution would the problem take, e.g.:
• supervised or unsupervised learning;
• batch or mini-batch/online learning;
• instance-based or model based,
etc.

2.2 Get the Data
Preferably, the data can be loaded automatically from a fixed folder within your local machine 2, e.g., see the download script from Slide No. 134 of Week 2 lecture. It is also a good idea to convert the dataset into a panda frame format.
Examine the general dataset structure and perhaps consider missing (null) values within the columns (attributes) of some instances (you may also profile you data set using the info() method of panda data frame objects). Recall also that it is at this step where you should create your test set.

2.3 Explore the Data to Gain Insights
Visualise the data to look for possible correlations 3. You may also want to experiment with different attribute combinations.
2.4 Prepare the Data for Machine Learning Algorithms
At this step, you may consider:
Data cleansing: null/missing values cannot be handled by some machine learning algorithms.
Handling non-numerical data: convert text/categorical data into numerical.
Custom transformers: creating your own custom transformers, e.g., see the code in Slide No.

322 in Week 2's lecture that introduces combined attributes as new features.
Feature scaling: some machine learning algorithms (e.g., SVMs) are sensitive to unscaled fea- tures, perhaps you may consider scaling the features for these algorithms.

Transformation pipelines: ideally, automate the whole data transformation and training pro- cesses, e.g., see Slide No. 348 of Week 4's lecture.

2.5 Select and Train a Model
2.5.1 Consider several models and evaluate using cross-validation
For this step, you may further consider training several models, e.g.:
• Linear/logistic/softmax regression;
• Polynomial regression;
• SVM regression;
• Decision trees/random forests;
• Ensemble learning;
• Artificial neural networks,
etc. Each model can be further evaluated using cross-validation, e.g., see Slides No. 378-415. Preferably, you should also discuss why you have not considered some of the models above in your machine learning solution. Also, consider the computation cost of training and generating the predictions from your models.

2.5.2 Fine-tuning the model
You may further consider fine-tuning your model using:
• Grid/Randomized search;
• Performance measures, e.g.: accuracy, precision, f1 scores, mean square error, etc;
• Ensemble methods;
• Evaluating on the test set.

3 Machine Learning Solution Format

Your machine learning solution should be coded under Python and where the machine learning algorithm classes are from the scikit-learn library. You should submit a zipped folder containing both your Report document and your Python codes. Your report should contain enough empirical evaluations and arguments to show that your machine learning model is indeed fit- enough.

Attachment:- Applied Machine Learning Report.rar

Reference no: EM132937043

Questions Cloud

What is the market requiring as a return on the company : If dividends are expected to grow at a constant rate of 4% indefinitely, what is the market requiring as a return on this company's stock?

What total factory overhead costs would be : What total factory overhead costs would be? Beginning raw materials inventory. PhP6,000. Maintenance of factory equipment 2,800

What is the total manufacturing cost for the period : What is the total manufacturing cost for the period? If beginning and ending work in process inventories are PhP5,000 and PhP15,000, respectively.

Explain the main financial goal that business leaders have : Contrast the forms of business organization according to their legal structure, scope, owners, responsibilities, advantages and disadvantages

Build a model to predict a countrys life expectancy : Build a model to predict a country's "Life expectancy" using some of the following features from the Life Expectancy Data.csv

Describe a static criminal history : Can you describe a static criminal history, and provide examples

Which project should be chosen based on the incremental irr : Based on the incremental IRR, which project should be chosen? Comparing Investment Criteria- Wii Brothers, a game manufacture, has a new idea.

Find a new personal physician : Why is it important to ask healthcare professionals to find a new personal physician who will meet all your needs rather than a satisfied friend or relative

Creating life work balance : Discuss any 4 issues and their solutions in creating life work balance as well as role of HR proffessionals in address to target organization?

User Account

All Pages