Reference no: EM132237925
Assignment - The aims of this assignment are to put into practice the concepts covered in lectures, apply these to a real dataset, and to demonstrate your ability to use Python to carry out machine learning tasks.
The main steps you need to carry out and report on are:-
- Identifying and providing a clear statement of the problem you wish to explore.
- Summarising and visualising the data set.
- Preparing your dataset for analysis (data cleansing, choosing appropriate features to model) etc.
- Choosing appropriate models for your chosen problem and building, running and evaluating these.
- Presenting and interpreting the results of your models and relating these to your initial problem.
The data you are going to be working on comes from the Airbnb data for Bristol. The data contains information about all the Airbnb properties in Bristol as of November 2018 (2,375 and 28 attributes), and problem you choose to work on is entirely up to you.
You can work on either a classification or regression tasks but should aim to apply a range (around 2-4) of techniques including the basic ones (which might be useful as baseline comparisons) and also the more sophisticated ones covered in this class, but the emphasis should be on using the techniques appropriately and interpreting the results.
Your assignment needs to be submitted in Jupyter notebook format (and submitted as a .ipynb file), and should include the Python code used and developed interleaved with explanations of the steps you are taking and interpretations and analysis of the outcomes. I would recommend that you still use Spyder (or similar) for developing the code, but then use Jupyter notebook to create the final presentation. Below are some illustrations of reports which use this approach:
- Using Python to see how the Times writes about men and women
- An open science approach to a recent false-positive between solar activity and the Indian monsoon
- Kaggle Competition | Titanic Machine Learning from Disaster
- An example machine learning notebook
- An exploratory statistical analysis of the 2014 World Cup Final
Your report should be approximately 10 pages in length (tricky to say, given the format, but based on the generated pdf or printed html), but the emphasis should be on the appropriate application of techniques and critical interpretation of results.
Attachment:- Assignment Files.rar