Classification and clustering methods

Assignment Help Other Subject
Reference no: EM132128155

Assignment Project: Data Mining using R

The goal of this project is to applying association rule mining, classification and clustering methods on theMushroom or Ionosphere and groceriesdata sets. For detailed information about the mush room or Ionosphere data set, refer to the Machnie Learning Repositoryprovided by the University of California, Irvine. You can download and read more about the data there.

The groceries Dataset
Imagine 10000 receipts sitting on your table. Each receipt represents a transaction with items that were purchased. The receipt is a representation of stuff that went into a customer's basket. That is exactly what the Groceries Data Set contains: a collection of receipts with each line representing 1 receipt and the items purchased. Each line is called a transaction and each column in a row represents an item.

Task 1: Data Pre-processing

Read the data in R. There are many ways to read in csv tables in R. For more details, please refer to data import/export in R

For the clustering experiments, the column for class labels need to be removed. Refer to lecture Module 10 to see how to do so.

Verify if any other pre-processing is beneficial for the analysis. For example, replacing missing values, attribute range normalization, converting numerical or string to nominal values etc.

Task 2: Data Mining

- Association Rule Mining experiments: Using R to explorer "association rules" on the groceries dataset.Try out different algorithms. Visualize the result you found. Report any interesting association rules discovered in the experiments and explain why they are interesting.

- Classification experiments: Using to construct classifiers on the mushroom or Ionosphere dataset. Randomly split the data set in the training and test data set (80% v.s. 20%). Select at least one classifier from each of the following two categories of classifiers: Tree-based models, Bayes classifiers, and Rule-based classifiers. Compare the result of the chosen classifers.

- Clustering experiments: Using R explorer clusters on the mushroom or Ionospheredataset.Select and compare two clustering algorithms from R(e.g. k-means v.s. density-based). Use R to visually explore the resulting clusters.

- For all the above experimentations, try different parameter settings to fine tune the outcome. In principle select methods that work well on the given data set.

Task 3: Prepare a report

Your report should contain the following:

- Theoretical Discussion: Limited to two pages discussing about data preprocessing steps, the motivation for selecting a particular method, and how the parameters are chosen.

- Results: Include results and screenshots of the above experimentations.

- Discussion and error analysis: Try to interpret the results of your model. Discuss intuitions or hypothesis that can be obtained by visual inspections of the resulting classes or clusters. Mention about assumptions if any, discuss issues that might have affected the model's performance.

- References: If you are using information from other sources apart from R manual and official website, you should cite them.

Attachment:- Assignment.zip

Verified Expert

Data mining is a process used by companies to turn raw data into useful information. By using software to look for patterns in large batches of data, businesses can learn more about their customers to develop more effective marketing strategies, increase sales and decrease costs. Data mining depends on effective data collection, warehousing and computer processing.

Reference no: EM132128155

Questions Cloud

What is greece global health issues : What is Greece's global health issues and how can they be combated?
Morphed into the concept of diversity : Do you think there is some type of diversity we really aren't interested in? Or, perhaps what we really are looking for is an end to discrimination
Assess the effectiveness of countermeasures : Explain the changes in technology as they have affected the United States intelligence community and their enemies.
What caused these changes : Create a 500-word essay depicting the evolution of democracy from the time of President Jefferson to President Jackson. Be sure to include the following.
Classification and clustering methods : Discuss intuitions or hypothesis that can be obtained by visual inspections of the resulting classes or clusters - In principle select methods that work well
Post a brief description of an ethical conflict : Post a brief description of an ethical conflict that the human services professional in the media presentation is facing. Explain why it is an ethical conflict.
What are costco key success factors : What are Costco's key success factors (KSFs) ? Which of the 11 sociotechnical principles can be seen in Costco?
Review of research-tested intervention programs : Choose a topic or topics that you'd like to learn more about. You may select additional criteria such as age and setting to narrow your results.
Impact professional relationships : Explaining how professional etiquette can impact professional relationships. Consistently displaying proper etiquette is a reflection of one

Reviews

inf2128155

11/20/2018 12:46:28 AM

Good work.. Really appreciate the this service. I used ExpertsMind so many times and from the beginning to end its a really good communication and service. When talking about the assignment its wonderful and hope I will get really good mark on it. Thanks.

len2128155

10/1/2018 10:04:18 PM

Submission Instructions This section is intended for submission instructions in learning systems. Grading Report Section Max. points Theoretical discussion and data-preprocessing 5% Results 10% Error analysis & references 5% Total 20%

Write a Review

Other Subject Questions & Answers

  Cross-cultural opportunities and conflicts in canada

Short Paper on Cross-cultural Opportunities and Conflicts in Canada.

  Sociology theory questions

Sociology are very fundamental in nature. Role strain and role constraint speak about the duties and responsibilities of the roles of people in society or in a group. A short theory about Darwin and Moths is also answered.

  A book review on unfaithful angels

This review will help the reader understand the social work profession through different concepts giving the glimpse of why the social work profession might have drifted away from its original purpose of serving the poor.

  Disorder paper: schizophrenia

Schizophrenia does not really have just one single cause. It is a possibility that this disorder could be inherited but not all doctors are sure.

  Individual assignment: two models handout and rubric

Individual Assignment : Two Models Handout and Rubric,    This paper will allow you to understand and evaluate two vastly different organizational models and to effectively communicate their differences.

  Developing strategic intent for toyota

The following report includes the description about the organization, its strategies, industry analysis in which it operates and its position in the industry.

  Gasoline powered passenger vehicles

In this study, we examine how gasoline price volatility and income of the consumers impacts consumer's demand for gasoline.

  An aspect of poverty in canada

Economics thesis undergrad 4th year paper to write. it should be about 22 pages in length, literature review, economic analysis and then data or cost benefit analysis.

  Ngn customer satisfaction qos indicator for 3g services

The paper aims to highlight the global trends in countries and regions where 3G has already been introduced and propose an implementation plan to the telecom operators of developing countries.

  Prepare a power point presentation

Prepare the power point presentation for the case: Santa Fe Independent School District

  Information literacy is important in this environment

Information literacy is critically important in this contemporary environment

  Associative property of multiplication

Write a definition for associative property of multiplication.

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd