Data mining using unsupervised and supervised learning

Reference no: EM13779816

Objectives: Data Mining using Unsupervised and Supervised Learning Approaches

Assume that a local company has collected a data set from its e-commerce website and asks you to analyze it. The company did not provide much background information about the data, e.g., the nature of the attributes. However, based on discussions with the people who collected the data and your own observation of the data set, you suspect that the first or second column, X1 or X2, may be the decision column.

The basic strategy is first to determine the decision column (or class attribute) using the K-means clustering algorithm (an unsupervised learning approach), verifying whether the clustering result is consistent with attribute X1, X2, or both. Once the decision column(s) are determined, you build a model (or concepts) using a supervised learning approach, in the hope of offering advice to the company for its business. To complete the data analysis using this strategy, perform the following tasks:

(a) Use the K-means algorithm (unsupervised learning) to cluster the data set and verify the class field(s).

(b) Using the class field(s) determined in step (a), perform supervised learning using any of the learning algorithms discussed in class, such as Version Space, Decision Tree, or Neural Network, and build a model.
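For task (a), one possible way to check whether a clustering is consistent with X1 or X2 is to cluster on the remaining attributes and measure how well each cluster lines up with one value of the candidate column (cluster purity). The sketch below is a minimal pure-Python illustration under several assumptions that do not come from the assignment: the non-candidate attributes are numeric, the toy rows and the attribute names X3 and X4 are invented, K is fixed at 2, and deterministic farthest-first seeding is used only to keep the run reproducible.

```python
from collections import Counter

def sq_dist(p, q):
    """Squared Euclidean distance between two equal-length numeric tuples."""
    return sum((a - b) ** 2 for a, b in zip(p, q))

def kmeans(points, k, max_iters=100):
    """Lloyd's algorithm; returns one cluster index per point."""
    # Deterministic farthest-first seeding (chosen here for reproducibility;
    # random seeding with several restarts is a common alternative).
    centroids = [points[0]]
    while len(centroids) < k:
        centroids.append(
            max(points, key=lambda p: min(sq_dist(p, c) for c in centroids))
        )
    labels = None
    for _ in range(max_iters):
        # Assignment step: each point joins its nearest centroid.
        new_labels = [
            min(range(k), key=lambda c: sq_dist(p, centroids[c])) for p in points
        ]
        if new_labels == labels:
            break  # assignments stopped changing
        labels = new_labels
        # Update step: move each centroid to the mean of its members.
        for c in range(k):
            members = [p for p, lab in zip(points, labels) if lab == c]
            if members:
                centroids[c] = tuple(sum(col) / len(members) for col in zip(*members))
    return labels

def purity(cluster_labels, candidate):
    """Share of rows whose candidate value is the majority value in their cluster."""
    per_cluster = {}
    for c, y in zip(cluster_labels, candidate):
        per_cluster.setdefault(c, Counter())[y] += 1
    return sum(cnt.most_common(1)[0][1] for cnt in per_cluster.values()) / len(candidate)

# Hypothetical toy rows in the order (X1, X2, X3, X4); real data would be
# loaded from the company's file instead.
rows = [
    (0, 0, 1.0, 1.2), (0, 1, 1.5, 0.8), (0, 0, 0.9, 1.1), (0, 1, 1.2, 1.4),
    (1, 0, 8.0, 8.2), (1, 1, 8.5, 7.9), (1, 0, 9.0, 8.8), (1, 1, 8.3, 9.1),
]
features = [r[2:] for r in rows]  # cluster on everything except X1 and X2
labels = kmeans(features, k=2)
purity_x1 = purity(labels, [r[0] for r in rows])
purity_x2 = purity(labels, [r[1] for r in rows])
print(f"purity vs X1: {purity_x1:.2f}, purity vs X2: {purity_x2:.2f}")
```

On this toy data the clusters match X1 exactly and X2 only at chance level, which would point to X1 as the class field. Purity tends to rise as K grows, so it is best compared across candidate columns at the same K.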

To perform the above tasks, you may use either an existing system or a program you implemented yourself. However, to receive the maximum bonus points, your program must work properly and be powerful enough for effective data analysis; otherwise, only partial bonus points may be given. Completing tasks (a) and (b) is therefore more important than implementing your own program.
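For task (b), if you choose to implement the learner yourself, a decision tree is one of the algorithms the assignment names. The following is a minimal ID3-style sketch, assuming categorical attributes and information gain as the split criterion; the attribute names X3 and X4, their values, and the training rows are hypothetical, with X1 standing in for the verified class field.

```python
import math
from collections import Counter

def entropy(labels):
    """Shannon entropy (in bits) of a list of class labels."""
    total = len(labels)
    return -sum(n / total * math.log2(n / total) for n in Counter(labels).values())

def information_gain(rows, labels, attr):
    """Entropy reduction obtained by splitting on one categorical attribute."""
    remainder = 0.0
    for value in {r[attr] for r in rows}:
        subset = [y for r, y in zip(rows, labels) if r[attr] == value]
        remainder += len(subset) / len(labels) * entropy(subset)
    return entropy(labels) - remainder

def build_tree(rows, labels, attributes):
    """Return a class label (leaf) or an (attribute, branches, majority) node."""
    majority = Counter(labels).most_common(1)[0][0]
    if len(set(labels)) == 1 or not attributes:
        return majority
    best = max(attributes, key=lambda a: information_gain(rows, labels, a))
    remaining = [a for a in attributes if a != best]
    branches = {}
    for value in {r[best] for r in rows}:
        keep = [i for i, r in enumerate(rows) if r[best] == value]
        branches[value] = build_tree(
            [rows[i] for i in keep], [labels[i] for i in keep], remaining
        )
    return (best, branches, majority)

def predict(tree, row):
    """Walk the tree; fall back to the node's majority class on unseen values."""
    while isinstance(tree, tuple):
        attr, branches, majority = tree
        if row[attr] not in branches:
            return majority
        tree = branches[row[attr]]
    return tree

# Hypothetical training rows; X1 is treated as the class field.
data = [
    {"X1": 1, "X3": "high", "X4": "yes"},
    {"X1": 1, "X3": "high", "X4": "no"},
    {"X1": 0, "X3": "low", "X4": "yes"},
    {"X1": 0, "X3": "low", "X4": "no"},
    {"X1": 1, "X3": "mid", "X4": "yes"},
    {"X1": 0, "X3": "mid", "X4": "no"},
]
labels = [d["X1"] for d in data]
tree = build_tree(data, labels, ["X3", "X4"])
accuracy = sum(predict(tree, d) == y for d, y in zip(data, labels)) / len(data)
print("root attribute:", tree[0], "training accuracy:", accuracy)
```

The learned tree doubles as the "model (or concepts)" the report asks for, since each root-to-leaf path reads as a rule. Training accuracy alone overstates quality, so a held-out split would be needed for an honest estimate on real data.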

Write a brief report that summarizes your data analysis activities and results, including:

(1) your name(s) and contact email addresses, plus each member's percentage contribution if the assignment was completed by a team. If a team cannot reach a consensus on individual contributions, include each individual's claimed percentage contribution with a brief description of the specific tasks performed;

(2) the language used for the K-means implementation or the source of the software used, parameter settings such as K (specifying how you determined the best K), the clustering results, the verified class field(s), and other information relevant to the task;

(3) the name of the supervised learning algorithm used, the source of the implementation or software, parameter settings (if any), and the result of learning, including the learned model and other relevant information;

(4) the results of your data analysis, useful advice for the company's business, etc.; and

(5) other relevant discussion of your experience and data analysis results.
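The assignment does not prescribe how to determine the best K. One common option is to compare the mean silhouette coefficient of the clusterings obtained for different K and prefer the K with the highest score. The sketch below only illustrates the scoring step: the points are invented, and the two labelings are hand-written stand-ins for the output of K-means runs with K=2 and K=4, not real results.

```python
import math

def silhouette(points, labels):
    """Mean silhouette coefficient using Euclidean distance.

    Assumes at least two clusters and that the points are not all identical.
    """
    clusters = {}
    for p, lab in zip(points, labels):
        clusters.setdefault(lab, []).append(p)
    scores = []
    for p, lab in zip(points, labels):
        own = clusters[lab]
        if len(own) == 1:
            scores.append(0.0)  # usual convention for singleton clusters
            continue
        # a: mean distance to the other members of the point's own cluster
        # (the point itself contributes a distance of zero to the sum).
        a = sum(math.dist(p, q) for q in own) / (len(own) - 1)
        # b: smallest mean distance to the members of any other cluster.
        b = min(
            sum(math.dist(p, q) for q in members) / len(members)
            for other, members in clusters.items()
            if other != lab
        )
        scores.append((b - a) / max(a, b))
    return sum(scores) / len(scores)

# Hypothetical 2-D points forming two tight groups.
points = [
    (1.0, 1.2), (1.5, 0.8), (0.9, 1.1), (1.2, 1.4),
    (8.0, 8.2), (8.5, 7.9), (9.0, 8.8), (8.3, 9.1),
]
# Stand-ins for cluster assignments produced with K=2 and K=4.
labels_k2 = [0, 0, 0, 0, 1, 1, 1, 1]
labels_k4 = [0, 0, 1, 1, 2, 2, 3, 3]

sil_k2 = silhouette(points, labels_k2)
sil_k4 = silhouette(points, labels_k4)
print(f"K=2: {sil_k2:.3f}  K=4: {sil_k4:.3f}")
```

Here K=2 scores far higher than K=4, because splitting each tight group in half puts near neighbours into different clusters. Whatever criterion is used, the report should state it alongside the chosen K.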

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd