Apply learning algorithms to perform machine learning tasks

Reference no: EM133770262

Machine Learning - Regression Analysis

Learning Outcome 1: Apply learning algorithms to perform machine learning tasks.
Learning Outcome 2: Implement practical machine learning: data pre-processing, analysis, model selection, and interpret the results.
Learning Outcome 3: Communicate clearly and effectively using the technical language of machine learning to a range of stakeholders.

Task Summary
In this Assessment, you will use a linear regression Machine Learning (ML) algorithm to analyse data and draw conclusions. To help you create and document the ML model and the results, you will follow the end-to-end Cross-Industry Standard Process for Data Mining (CRISP-DM) (Chapman et al., 2000) methodology. Further, to guide you through the analysis and the writing of the report, a template for your Jupyter Notebook has been provided.

The CRISP-DM template is a necessary resource for the completion of this Assessment. You must consult the CRISP-DM template for further details and information.

Context

In this Assessment, you will complete an end-to-end ML exercise using real-world data. In your future workplaces, you will often be expected to undertake similar exercises using suitable data sets. The template and the use of a methodology will ensure that you do not simply perform an analysis and present your results; rather, you will be able to share the output of the analysis and discuss why and how you adopted the methodology with your work colleagues.

The data set for this Assessment provides wine quality data across 11 traits, including acidity, residual sugar and alcohol concentration.

You must consider the CRISP-DM from the outset. As the report template indicates, the first stage is Business Understanding. This stage requires consideration of the project problem.

Task Instructions
You will use your Jupyter Notebook on the Microsoft Azure ML platform or Google Colab and Python 3.6 as the language for all three assessments.
Ultimately, the Notebook will contain your ML code, data and report documentation.
Your Assessment will be evaluated based on the major stages of the CRISP-DM process as set out in the Notebook template with prompts. The process comprises:
Business Understanding;
Data Understanding;
Data Preparation;
Modelling;
Evaluation; and
Deployment.
The six multi-step stages of the CRISP-DM must be undertaken to complete this Assessment. Note: For ease of working and to complete this Assessment, you should document what you are doing in your Notebook as you progress through the activities (e.g., the steps undertaken and the rationale for the selection of the code). The template will prompt you on how to work through the end-to-end ML process.

Stage 1: Business Understanding
This section serves as an introduction. You should write a clear and concise narrative, expressing what you are trying to achieve with regard to your evaluation criteria. Think in terms of ML: for example, the prediction algorithm, the data set selected, what you are seeking from the data set and how you intend to understand the value of your prediction capability.
Assess the current situation, following Section 1.1 of the CRISP-DM template. (1.1).

Stage 2: Data Understanding
Acquire the relevant wine quality data set from the UCI repository for your prediction model.

Explicitly specify the data source by providing a specific link and the name of the data set (e.g., red wine, white wine or both) and the method of acquisition (e.g., direct from the URL or a download of the .csv file). The steps taken need to be clearly stated. (2.1).
Read this data set into your Notebook. (2.1).
Describe the data set inclusive of variables, units and levels. (2.2).
Verify the data quality by analysing the data set for structure and missing data. (2.3).
Conduct an initial data exploration using data visualisation, reporting and querying the data. (2.4).
Use the pairplot function in seaborn to determine the relationship, if any, between the variables. Include the output or the visualisation of the pairplot function in your Notebook and comment on it. (2.4.2).
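
The Data Understanding steps above could be sketched roughly as follows. This is only an illustration, not the code prescribed by the template. It assumes pandas is available and that the UCI wine quality files are semicolon-delimited. To run without a download, it reads a tiny inline sample whose values are made up and whose columns are only a subset of the real ones. The helper names `load_wine`, `quality_report` and `plot_pairs` are hypothetical.

```python
import io

import pandas as pd

# Illustrative stand-in for the UCI file: the values below are made up and
# only the layout (semicolon-delimited, header row) mirrors the real data.
SAMPLE_CSV = """fixed acidity;volatile acidity;residual sugar;alcohol;quality
7.4;0.70;1.9;9.4;5
7.8;0.88;2.6;9.8;5
11.2;0.28;1.9;9.8;6
7.4;0.66;;9.4;5
"""


def load_wine(source):
    """Read a semicolon-delimited wine quality file (path, URL or buffer)."""
    return pd.read_csv(source, sep=";")


def quality_report(df):
    """Summarise column types and missing values per column (step 2.3)."""
    return pd.DataFrame({
        "dtype": df.dtypes.astype(str),
        "missing": df.isna().sum(),
    })


def plot_pairs(df):
    """Draw the pairplot requested in step 2.4.2 (requires seaborn)."""
    import seaborn as sns  # local import so the rest runs without seaborn
    return sns.pairplot(df)


df = load_wine(io.StringIO(SAMPLE_CSV))
report = quality_report(df)

print(df.shape)       # number of rows and columns
print(report)         # dtype and missing-value count per column
print(df.describe())  # basic descriptive statistics (step 2.2)
```

In a Notebook, `load_wine` would instead be given the path or URL of the file you document in step 2.1, and `plot_pairs(df)` would be called in its own cell so the figure is rendered alongside your commentary.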

Stage 3: Data Preparation
Select the data that you will use for the analysis. (3.1).
Clean the data you have selected to improve the quality of the data. (3.2).
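
One possible shape for the selection and cleaning steps is sketched below. It assumes pandas, NumPy and scikit-learn are installed and uses synthetic stand-in data rather than the real wine file. Dropping duplicate and incomplete rows is one cleaning choice among several, and the hold-out split is a common practice that the brief itself does not mandate. The function name `prepare` is hypothetical.

```python
import numpy as np
import pandas as pd
from sklearn.model_selection import train_test_split

# Synthetic stand-in data; replace with the DataFrame read in Stage 2.
rng = np.random.default_rng(0)
raw = pd.DataFrame({
    "alcohol": rng.uniform(8, 14, 50),
    "volatile acidity": rng.uniform(0.1, 1.2, 50),
    "quality": rng.integers(3, 9, 50),
})
raw.loc[3, "alcohol"] = np.nan                            # simulate a missing value
raw = pd.concat([raw, raw.iloc[[0]]], ignore_index=True)  # simulate a duplicate row


def prepare(df, target="quality"):
    """Drop duplicate and incomplete rows, then split features from target."""
    clean = df.drop_duplicates().dropna()
    X = clean.drop(columns=[target])
    y = clean[target]
    return X, y


X, y = prepare(raw)

# Hold back 20% of the rows for evaluation (an assumption, not a requirement).
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, random_state=42
)

print(len(raw), "rows before cleaning,", len(X), "rows after")
```

Whatever cleaning rule you choose, record the number of rows removed and the reason in the Notebook so the rationale for steps 3.1 and 3.2 is visible.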

Stage 4: Modelling
For this Assessment, you are required to use the linear regression model.
Import the linear regression model into your code. (4.1).
Record any modelling assumptions. (4.2).
Run your model over the data set. (4.3).
Record the parameter settings, your rationale for your choice of values and the actual model generated. (4.3).
Revise any parameter settings for subsequent model runs. Document all the revisions until the best model is reached. (4.4).
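
As a hedged illustration of steps 4.1 to 4.3, the sketch below fits scikit-learn's `LinearRegression` to synthetic data with a known linear relationship. The data, the coefficients used to generate it and the choice of `fit_intercept=True` are assumptions made for the example; the brief does not name a specific library or parameter values.

```python
import numpy as np
from sklearn.linear_model import LinearRegression

# Synthetic data with a known linear relationship, used only to show the API.
rng = np.random.default_rng(0)
X = rng.uniform(0, 10, size=(200, 2))
y = 3.0 + 0.5 * X[:, 0] - 1.2 * X[:, 1] + rng.normal(0, 0.1, 200)

# Step 4.1 / 4.3: create the model with explicit parameter settings and fit it.
model = LinearRegression(fit_intercept=True)
model.fit(X, y)

# Step 4.3: record the parameter settings and the model actually generated.
print(model.get_params())
print("intercept:", model.intercept_)
print("coefficients:", model.coef_)
```

Printing `get_params()` together with the fitted intercept and coefficients after each run gives a simple record of the revisions asked for in step 4.4.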

Stage 5: Evaluation
Assess the ML results. Ensure you include a statement as to whether the model meets the evaluation criteria.
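
A minimal sketch of how the results might be assessed is shown below, assuming scikit-learn's metrics module. The true and predicted quality scores are made-up numbers, and `MAX_RMSE` is a hypothetical evaluation criterion; in your Notebook the threshold should come from the criteria you set out in Stage 1.

```python
import numpy as np
from sklearn.metrics import mean_absolute_error, mean_squared_error, r2_score

# Made-up true and predicted quality scores, for illustration only.
y_true = np.array([5, 6, 5, 7, 4])
y_pred = np.array([5.2, 5.6, 5.1, 6.4, 4.5])

mae = mean_absolute_error(y_true, y_pred)           # average absolute error
rmse = np.sqrt(mean_squared_error(y_true, y_pred))  # root mean squared error
r2 = r2_score(y_true, y_pred)                       # share of variance explained

# Hypothetical evaluation criterion; replace with the one from Stage 1.
MAX_RMSE = 0.5
meets_criteria = rmse <= MAX_RMSE

print(f"MAE={mae:.3f}  RMSE={rmse:.3f}  R2={r2:.3f}")
print("Meets evaluation criteria:", meets_criteria)
```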

Stage 6: Deployment
For this Assessment, you are not required to deploy your model. For this stage, simply include any lessons that you learned and that you wish to share in relation to the things that went right and wrong, the areas in which you did well and in which you could improve. You can also detail any of your other experiences in completing this Assessment.

Course: MLN 601 Machine Learning, Regression Analysis, Torrens University Australia