Discuss the main benefits of using data mining

Reference no: EM133150020

Assignment: Analytics Report

Overview

The purpose of this task is to give students practical experience in writing a data analytics report that provides useful insights, patterns and trends in a chosen dataset, in light of the tasks set out in this document. The dataset will be chosen from the UC Irvine Machine Learning Repository. This activity will give students the opportunity to show innovation and creativity in applying the WEKA data mining software and in designing useful visualization and data mining solutions, presented as an analytics report.

Project Details

You will use an analytical tool (i.e. WEKA) to explore, analyse and visualise a dataset of your choosing. An important part of this work is preparing a good-quality report, in an appropriate style, that details your choices, content, and analysis.
The dataset should be chosen from the following repository:

UC Irvine Machine Learning Repository

The aim is to use the dataset to provide interesting insights, trends and patterns in the data. Your intended audience is the CEO and middle management of the company that employs you, who have tasked you with this analysis.

Tasks

Task 1 - Data choice. Choose any dataset from the repository that has at least five attributes, and for which the default task is classification. Transform this dataset into the ARFF format required by WEKA.
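
The transformation in Task 1 can be sketched in a few lines of Python. This is a minimal illustration, not a required method: the function name `csv_to_arff`, the sample data, and the relation name `flowers` are all made up for the example. It assumes the source file is a CSV with a header row, that attribute names contain no spaces, and that any column whose present values all parse as numbers is numeric (everything else is treated as nominal). Empty cells are written as "?", the ARFF marker for a missing value.

```python
import csv
import io


def csv_to_arff(csv_text, relation):
    """Convert CSV text (header row first) into ARFF text.

    Assumption: attribute names and values contain no spaces or commas,
    so no quoting is needed in the ARFF output.
    """
    rows = list(csv.reader(io.StringIO(csv_text)))
    header, data = rows[0], rows[1:]

    def is_number(value):
        try:
            float(value)
            return True
        except ValueError:
            return False

    lines = [f"@relation {relation}", ""]
    for col, name in enumerate(header):
        # Ignore missing cells when deciding the attribute type.
        values = [row[col] for row in data if row[col] not in ("", "?")]
        if values and all(is_number(v) for v in values):
            lines.append(f"@attribute {name} numeric")
        else:
            labels = ",".join(sorted(set(values)))
            lines.append(f"@attribute {name} {{{labels}}}")

    lines += ["", "@data"]
    for row in data:
        # Empty cells become "?", the ARFF missing-value marker.
        lines.append(",".join(v if v != "" else "?" for v in row))
    return "\n".join(lines) + "\n"


# Hypothetical sample data; the second row has a missing first value.
SAMPLE_CSV = (
    "sepal_length,petal_width,species\n"
    "5.1,0.2,setosa\n"
    ",1.3,versicolor\n"
    "6.3,2.5,virginica\n"
)

arff = csv_to_arff(SAMPLE_CSV, "flowers")
print(arff)
```

In a real dataset the class attribute is usually declared as a nominal attribute, as `species` is here, so that WEKA treats the task as classification.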

Task 2 - Background information. Write a description of the dataset and project, and its importance for the organization. Provide an overview of what the dataset is about, including from where and how it has been gathered, and for what purpose. Discuss the main benefits of using data mining to explore datasets such as this. This discussion should be suitable for a general audience. Information must come from at least two appropriate sources and be appropriately referenced.

Task 3 - Data description. State how many instances and attributes the dataset contains, list the attribute names, and identify the class attribute. Include details of any missing values and any other relevant characteristics. For at least five attributes, describe the range of possible values and visualise these in a graphical format.
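
The figures asked for in Task 3 can be cross-checked outside WEKA with a short script. The sketch below is illustrative only: the function name `describe`, the attribute names, and the sample rows are hypothetical, and it assumes missing values are already encoded as "?". For each attribute it reports the number of missing values and either the numeric range or the set of distinct nominal values.

```python
def describe(rows, names):
    """Summarise each attribute: missing count and range of values."""
    summary = {}
    for col, name in enumerate(names):
        present = [row[col] for row in rows if row[col] != "?"]
        missing = len(rows) - len(present)
        try:
            # Numeric attribute: report (minimum, maximum).
            numbers = [float(v) for v in present]
            value_range = (min(numbers), max(numbers))
        except ValueError:
            # Nominal attribute (or no values at all): list distinct values.
            value_range = sorted(set(present))
        summary[name] = {"missing": missing, "range": value_range}
    return summary


# Hypothetical sample: three instances, two attributes, one missing value.
names = ["sepal_length", "species"]
rows = [
    ["5.1", "setosa"],
    ["?", "versicolor"],
    ["6.3", "setosa"],
]

summary = describe(rows, names)
print("instances:", len(rows))
for name, info in summary.items():
    print(name, info)
```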

Task 4 - Data preprocessing. Preprocess the dataset attributes using WEKA's filters. Useful techniques include removing certain attributes, exploring different ways of discretizing continuous attributes, and replacing missing values. Discretizing is the conversion of numeric attributes into "nominal" ones by binning numeric values into intervals. Missing values in ARFF files are represented with the character "?". If you replace missing values, explain the strategy you used to select the replacement values. Use and describe at least three different preprocessing techniques.
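
To make two of these techniques concrete, the following sketch replaces missing values with the attribute mean and then discretizes the result into equal-width bins. It is a plain-Python illustration of the ideas, not WEKA's own filter code. The function names, the bin labels, and the sample `ages` list are assumptions made for the example, and mean replacement is only one of several possible strategies.

```python
def replace_missing_with_mean(values):
    """Replace None entries with the mean of the values that are present."""
    present = [v for v in values if v is not None]
    mean = sum(present) / len(present)
    return [mean if v is None else v for v in values]


def equal_width_bins(values, bins):
    """Map each numeric value to a nominal label by equal-width binning."""
    low, high = min(values), max(values)
    width = (high - low) / bins
    labels = []
    for v in values:
        if width > 0:
            # The maximum value falls on the upper edge, so clamp it
            # into the last bin.
            index = min(int((v - low) / width), bins - 1)
        else:
            index = 0  # all values identical: a single bin
        labels.append(f"bin{index + 1}")
    return labels


# Hypothetical numeric attribute with one missing value.
ages = [20.0, None, 40.0, 60.0]

filled = replace_missing_with_mean(ages)
labels = equal_width_bins(filled, bins=4)
print(filled)
print(labels)
```

Whichever strategy is used, the report should state it explicitly, since replacing missing values with a mean shifts the distribution differently than, say, dropping the affected instances.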

Task 5 - Data mining. Compare and contrast at least three different data mining algorithms on your data, for instance k-nearest neighbour, Apriori association rules, and decision tree induction. For each experiment you run, describe the data you used, that is, whether you used the entire dataset or just a subset of it. You must include screenshots and results from the techniques you employ.
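
As an illustration of what comparing classifiers means in practice, the sketch below scores a k-nearest neighbour classifier against a majority-class baseline on a held-out test set. It is a toy example under stated assumptions: the data points, the class labels, the choice of k = 3, and all function names are invented for the illustration, and the actual experiments for this task are expected to be run in WEKA.

```python
import math
from collections import Counter


def majority_class(train):
    """Baseline: always predict the most frequent class in the training data."""
    return Counter(label for _, label in train).most_common(1)[0][0]


def knn_predict(train, point, k):
    """Predict the most common class among the k nearest training points."""
    nearest = sorted(train, key=lambda item: math.dist(item[0], point))[:k]
    return Counter(label for _, label in nearest).most_common(1)[0][0]


def accuracy(predictions, test):
    """Fraction of test instances whose predicted label is correct."""
    correct = sum(p == label for p, (_, label) in zip(predictions, test))
    return correct / len(test)


# Hypothetical two-attribute dataset, split into training and test subsets.
train = [
    ((1.0, 1.0), "a"),
    ((1.2, 0.8), "a"),
    ((0.9, 1.1), "a"),
    ((5.0, 5.0), "b"),
    ((5.2, 4.9), "b"),
]
test = [((1.1, 1.0), "a"), ((4.8, 5.1), "b")]

baseline_label = majority_class(train)
baseline_acc = accuracy([baseline_label for _ in test], test)
knn_acc = accuracy([knn_predict(train, point, k=3) for point, _ in test], test)

print("baseline accuracy:", baseline_acc)
print("3-NN accuracy:", knn_acc)
```

Reporting a simple baseline alongside each algorithm makes it easier to judge whether an accuracy figure reflects real structure in the data or just the class distribution.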

Task 6 - Discussion of findings. Explain your results and include the usefulness of the approaches for the purpose of the analysis. Include any assumptions that you may have made about the analysis. In this discussion you should explain what each algorithm provides to the overall analysis task. Summarize your main findings.

Task 7 - Report writing. Present your work in the form of an analytics report.

Attachment:- Assignment-Analytics Report.rar

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd