Summarising and visualising the data set

Assignment Help Other Subject
Reference no: EM132237925

Assignment - The aims of this assignment are to put into practice the concepts covered in lectures, apply these to a real dataset, and to demonstrate your ability to use Python to carry out machine learning tasks.

The main steps you need to carry out and report on are:-

  • Identifying and providing a clear statement of the problem you wish to explore.
  • Summarising and visualising the data set.
  • Preparing your dataset for analysis (data cleansing, choosing appropriate features to model) etc.
  • Choosing appropriate models for your chosen problem and building, running and evaluating these.
  • Presenting and interpreting the results of your models and relating these to your initial problem.

The data you are going to be working on comes from the Airbnb data for Bristol. The data contains information about all the Airbnb properties in Bristol as of November 2018 (2,375 and 28 attributes), and problem you choose to work on is entirely up to you.

You can work on either a classification or regression tasks but should aim to apply a range (around 2-4) of techniques including the basic ones (which might be useful as baseline comparisons) and also the more sophisticated ones covered in this class, but the emphasis should be on using the techniques appropriately and interpreting the results.

Your assignment needs to be submitted in Jupyter notebook format (and submitted as a .ipynb file), and should include the Python code used and developed interleaved with explanations of the steps you are taking and interpretations and analysis of the outcomes. I would recommend that you still use Spyder (or similar) for developing the code, but then use Jupyter notebook to create the final presentation. Below are some illustrations of reports which use this approach:

  • Using Python to see how the Times writes about men and women
  • An open science approach to a recent false-positive between solar activity and the Indian monsoon
  • Kaggle Competition | Titanic Machine Learning from Disaster
  • An example machine learning notebook
  • An exploratory statistical analysis of the 2014 World Cup Final

Your report should be approximately 10 pages in length (tricky to say, given the format, but based on the generated pdf or printed html), but the emphasis should be on the appropriate application of techniques and critical interpretation of results.

Attachment:- Assignment Files.rar

Reference no: EM132237925

Questions Cloud

Does this company use centralization or decentralization : Does this company use centralization or decentralization? Have they chosen the best method? Should they go with the opposite structure?
What are some formal and informal linkages : What are some formal and informal linkages that you have encountered at your college or university?
Discuss the benefits and drawbacks of regional integration : Discuss the benefits and drawbacks of regional integration. Introduction body conclusion and reference
Discuss the most beneficial pricing strategies : Choose the most beneficial pricing strategies and suggest two ways in which this selection could potentially affect consumer adoption of the new product.
Summarising and visualising the data set : Identifying and providing a clear statement of the problem you wish to explore. Summarising and visualising the data set
Analyze the three levels of employee engagement : Create an optimal motivational approach by combining two motivational theories that you as a leader of the organization would use to build.
How does validating a selection test strengthen : How does validating a selection test strengthen the defense of that test against a claim of disparate impact?
Motivation for implementing knowledge management system : What was the organization's prime motivation for implementing a knowledge management system
Malcolm baldrige school of business to demonstrate : Who are about to complete their programs of study at the Malcolm Baldrige School of Business to demonstrate their familiarity with the Baldrige Core Values

Reviews

len2237925

2/19/2019 11:21:30 PM

Think carefully about the problem you want to work on and the main question you are trying to answer. And take you time to make sure you understand the data. It is also not necessary to use every attribute - you may find yourself working with many or just a few. The emphasis in this assignment is also much on the process: if you find that the techniques you have chosen don't work very well or fail to produce particularly interesting results, then this is not a problem provided you followed the appropriate steps to understand and prepare the data and select appropriate models and can provide some insights or explanations into why your model failed.

len2237925

2/19/2019 11:21:24 PM

You can work on either a classification or regression tasks but should aim to apply a range (around 2-4) of techniques including the basic ones (which might be useful as baseline comparisons) and also the more sophisticated ones covered in this class, but the emphasis should be on using the techniques appropriately and interpreting the results. Your report should be approximately 10 pages in length (tricky to say, given the format, but based on the generated pdf or printed html), but the emphasis should be on the appropriate application of techniques and critical interpretation of results.

len2237925

2/19/2019 11:21:18 PM

This assignment is out of 25 and will follow the following marking scheme: Identifying and providing a clear statement of the problem you wish to explore (3) Summarising and visualising your data set (5) Preparing your dataset for analysis (data cleansing, choice of features) etc. (5) Choice of models, application, evaluation and validation (8) Interpretation and explanation of the results of your models and implications of these for your initial problem (4).

Write a Review

Other Subject Questions & Answers

  What role does national culture play

Some would say that countries are becoming more similar due to globalization. Would you agree or disagree? Why? What role does national culture play?

  What is the function of a myth

What is the function of a myth? What is its value? Analyze in some detail what accounts for the power of myths in religion, providing examples from a variety of traditions. What is the relationship between myth and doctrine?

  Changes mean to health care industry

Could you help me evaluate the impact technological changes have had on the economics of health care and what these changes mean to the health care industry? I need to address least two (2) changes.

  Compare the break-even point of each company

"Bert Company and Ernie Company are competitors in the same industry. Compare the break-even point of each company

  What did you learn from the film

Did the interaction with the person change your view of discrimination? If so, explain how the interaction has affected you either positively or negatively. If it did not change your view of discrimination, explain why.

  Describe the costume design vs fashion design

You have thought through the readings, please explain what you think the main difference is between "Set Design" versus "Interior Design"?

  What is the labor content associated with serving customer

Four employees at a fast-food restaurant each perform one of the four activities in serving a customer: greet customer, take order, process order.

  What has fed doing to address current economic situation

What has the Fed been doing to address the current economic situation? Do you think it is doing the right thing? Should it do more or less

  What internal aspects of an organization need

There are several factors to consider before implementation of a HRIS. What internal aspects of an organization need to be examined before choosing a HRIS? What external factors need to be considered before choosing a HRIS

  Describe two contributions made to nursing

Describe two contributions made to nursing, provide a brief description of Florence Nightingale

  What is the class width of the histogram

Construct a frequency distribution with 5 classes - create a histogram of the data using the frequency and develop a certain program.

  Countries with high profit-investment potential

A panel of political scientists diplomats and business experts from the US and Europe has been convened to assess the risks and opportunities of investing

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd