Contrast at least three different data mining algorithms

Assignment Help Other Subject
Reference no: EM133148809

Assignment: Analytics Report

Overview

The purpose of this task is to provide students with practical experience in writing a data analytical report to provide useful insights, pattern and trends in a chosen dataset in the light of a set of tasks required within this document. This dataset will be chosen from the UC Irvine Machine Learning Repository1. This activity will give students the opportunity to show innovation and creativity in applying the WEKA data mining software, and designing useful visualization and data mining solutions presented as an analytics report.

Project Details

You will use an analytical tool (i.e. WEKA) to explore, analyse and visualise a dataset of your choosing. An important part of this work is preparing a good quality report, which details your choices, content, and analysis, and that is of an appropriate style.
The dataset should be chosen from the following repository:

UC Irvine Machine Learning Repository

The aim is to use the data set allocated to provide interesting insights, trends and patterns amongst the data. Your intended audience is the CEO and middle management of the Company for whom you are employed, and who have tasked you with this analysis.

Tasks

Task 1 - Data choice. Choose any dataset from the repository that has at least five attributes, and for which the default task is classification. Transform this dataset into the ARFF format required by WEKA.

Task 2 - Background information. Write a description of the dataset and project, and its importance for the organization. Provide an overview of what the dataset is about, including from where and how it has been gathered, and for what purpose. Discuss the main benefits of using data mining to explore datasets such as this. This discussion should be suitable for a general audience. Information must come from at least two appropriate sources be appropriately referenced.

Task 3 - Data description. Describe how many instances does the dataset contain, how many attributes there are in the dataset, their names, and include which is the class attribute. Include in your description details of any missing values, and any other relevant characteristics. For at least 5 attributes, describe what is the range of possible values of the attributes, and visualise these in a graphical format.

Task 4 - Data preprocessing. Preprocess the dataset attributes using WEKA's filters. Useful techniques will include remove certain attributes, exploring different ways of discretizing continuous attributes and replacing missing values. Discretizing is the conversion of numeric attributes into "nominal" ones by binning numeric values into intervals2. Missing values in ARFF files are represented with the character "?"3. If you replaced missing values explain what strategy you used to select a replacement of the missing values. Use and describe at least three different preprocessing techniques.

Task 5 - Data mining. Compare and contrast at least three different data mining algorithms on your data, for instance:. k-nearest neighbour, Apriori association rules, decision tree induction. For each experiment you ran describe: the data you used for the experiments, that is, did you use the entire dataset of just a subset of it. You must include screenshots and results from the techniques you employ.

Task 6 - Discussion of findings. Explain your results and include the usefulness of the approaches for the purpose of the analysis. Include any assumptions that you may have made about the analysis. In this discussion you should explain what each algorithm provides to the overall analysis task. Summarize your main findings.

Task 7 - Report writing. Present your work in the form of an analytics report.

Attachment:- Analytics Report.rar

Reference no: EM133148809

Questions Cloud

Benefits to receiving constructive feedback : What are the benefits to receiving constructive feedback?
Prevent a courier service from committing theft or fraud : What policies, trainings, penalties, regulations, security measures, or contracts should a company set to prevent a courier service from committing theft or fra
How much did the corporation save in employer premiums : Employee Employment Insurance premiums for a Corporation were $50,000.00 for 2021. How much did the Corporation save in Employer premiums
Should college athletes be paid : Should College Athletes be paid? Why or why not? What do college presidents think about this? What does athletic directors think? What do coaches think?
Contrast at least three different data mining algorithms : Compare and contrast at least three different data mining algorithms on your data, for instance:. k-nearest neighbour, Apriori association rules, decision tree
What is the current lump-sum investment required : What is the current lump-sum investment required to fund his child's education? Assume that after-tax rate of return that Matthew is able to earn
Analyze the location of a small business in area : 1. Analyze the location of a small business in your area. Discuss your findings in terms of how much success/failure can be attributed to its physical location.
Discuss an article about worldcom involving audit failure : The purpose of the audit is to provide assurance as to the accuracy of financial statements. Discuss an article about WORLDCOM involving audit failure
Mentoring and coaching opportunities : A) Describe how you would ensure a new employee was exposed to mentoring and coaching opportunities.

Reviews

Write a Review

Other Subject Questions & Answers

  What are core assumptions of the biopsychological approach

What are the core assumptions of the biopsychological approach

  What have you done to prepare for your certification

What have you done to prepare for your certification? Have you completed the scheduled tasks assigned on your timeline? If not, what are your plans to stay.

  What is the probability that he could fulfill all prophecies

If Jesus of Nazareth was just an ordinary man, what is the probability that he could fulfill all the prophecies by chance?"

  Create an online community for collaboration

Throughout this course you will own and operate The Broadway Cafe taking advantage of business practices discussed in this text to increase profits.

  The bsn nurse''s role in palliative care

What is the BSN nurse's role in palliative care and how does that role differ from the role of the AND or diploma-prepared nurse?

  Explain how you intend to use the resources

Explain how you intend to use these resources, and how they might benefit you academically and professionally. The paper follows correct APA format for title.

  Make informed decisions about interventions

Why is it important to collaborate with families and other team members in nonjudgmental ways to make informed decisions about interventions and life planning?

  Explain the aspect of social media use in the workplace

Potential examples include the importance of companies embracing social media, advertising through social media, policies involving social media.

  Budget narrative and an itemized budget summary

this part of a Grant Proposal has two sections: the Budget Narrative and an itemized Budget Summary- The Narrative is written first by reviewing the activities associated with the Project Objectives.

  JGR 300 Performing Under Pressure Assignment

JGR 300 Performing Under Pressure Assignment Help and Solution, Strayer University - Assessment Writing Service

  What name has been given to this phenomenon

Changes in memory can occur as a result of specific influences that operate after a memory is first formed. In a classic experiment, researchers showed subjects a filmed traffic accident. What name has been given to this phenomenon

  Why did the authors use multiple regression

Why did the authors use multiple regression? Do you think it is the most appropriate choice? Why or why not?

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd