Apply the Apriori algorithm on the given dataset

Reference no: EM131460847

Data Organization for Data Analysts

Data Mining Concepts

1. On describing discovered knowledge using association rules

One of the major techniques in data mining involves the discovery of association rules. These rules correlate the presence of a set of items with another range of values for another set of variables. The database in this context is regarded as a collection of transactions, each involving a set of items, as shown below.

Trans ID    Items Purchased
101         milk, bread, eggs
102         milk, juice
103         juice, butter
104         milk, bread, eggs
105         coffee, eggs
106         coffee
107         coffee, juice
108         milk, bread, cookies, eggs
109         cookies, butter
110         milk, bread

1.1 Apply the Apriori algorithm on this dataset.

Note that the set of items is {milk, bread, cookies, eggs, butter, coffee, juice}. You may use 0.2 as the minimum support value.

1.2 Show two rules that have a confidence of 0.7 or greater for an itemset containing three items.
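One way to approach parts 1.1 and 1.2 is sketched below in Python. It is a minimal illustration, not the expected solution: it assumes support is the fraction of transactions containing an itemset and confidence is count(itemset) / count(left-hand side). The function names `apriori` and `rules` are hypothetical helpers introduced here.

```python
from itertools import combinations

# Transactions copied from the table above (Trans ID -> items purchased).
transactions = {
    101: {"milk", "bread", "eggs"},
    102: {"milk", "juice"},
    103: {"juice", "butter"},
    104: {"milk", "bread", "eggs"},
    105: {"coffee", "eggs"},
    106: {"coffee"},
    107: {"coffee", "juice"},
    108: {"milk", "bread", "cookies", "eggs"},
    109: {"cookies", "butter"},
    110: {"milk", "bread"},
}


def count(itemset, transactions):
    """Number of transactions that contain every item in itemset."""
    return sum(1 for items in transactions.values() if itemset <= items)


def apriori(transactions, min_support):
    """Return {frequent itemset: transaction count}, built level by level."""
    n = len(transactions)
    items = sorted({i for t in transactions.values() for i in t})
    candidates = [frozenset([i]) for i in items]
    frequent = {}
    k = 1
    while candidates:
        # Keep only the candidates that meet the minimum support.
        level = {}
        for c in candidates:
            hits = count(c, transactions)
            if hits / n >= min_support:
                level[c] = hits
        frequent.update(level)
        k += 1
        # Join step: merge frequent (k-1)-itemsets into k-itemsets.
        joined = {a | b for a in level for b in level if len(a | b) == k}
        # Prune step: every (k-1)-subset of a candidate must be frequent.
        candidates = [
            c for c in joined
            if all(frozenset(s) in level for s in combinations(c, k - 1))
        ]
    return frequent


def rules(itemset, frequent, min_confidence):
    """Rules lhs -> rhs drawn from one frequent itemset."""
    found = []
    for size in range(1, len(itemset)):
        for lhs in combinations(sorted(itemset), size):
            lhs = frozenset(lhs)
            confidence = frequent[itemset] / frequent[lhs]
            if confidence >= min_confidence:
                found.append((lhs, itemset - lhs, confidence))
    return found


frequent = apriori(transactions, min_support=0.2)
triple = frozenset({"milk", "bread", "eggs"})
for lhs, rhs, confidence in rules(triple, frequent, min_confidence=0.7):
    print(sorted(lhs), "->", sorted(rhs), round(confidence, 2))
```

Under these assumptions the only frequent three-item set is {milk, bread, eggs}, which appears in 3 of the 10 transactions. Rules such as {milk, eggs} -> {bread} and {bread, eggs} -> {milk} each have a confidence of 3/3 = 1.0, so either pair would satisfy part 1.2.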

2. On describing discovered knowledge using classification

Classification is the process of learning a model that describes different classes of data; the classes must be predetermined. Consider the following set of data records:

RID    Age       City    Gender    Education      Repeat Customer
101    20..30    NY      F         College        YES
102    20..30    SF      M         Graduate       YES
103    31..40    NY      F         College        YES
104    51..60    NY      F         College        NO
105    31..40    LA      M         High school    NO
106    41..50    NY      F         College        YES
107    41..50    NY      F         Graduate       YES
108    20..30    LA      M         College        YES
109    20..30    NY      F         High school    NO
110    20..30    NY      F         College        YES

2.1 Assuming that the class attribute is Repeat Customer, apply a classification algorithm to this dataset.
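The question does not name a particular algorithm. As one possible starting point, the sketch below assumes an ID3-style decision tree and computes the information gain of each attribute to choose the root split. The helper names `entropy` and `information_gain` are introduced here for illustration, and the Education values are written with a single capitalisation.

```python
from collections import Counter
from math import log2

ATTRIBUTES = ("Age", "City", "Gender", "Education")

# Records 101..110 from the table above; the last field is the class
# attribute Repeat Customer.
records = [
    ("20..30", "NY", "F", "College", "YES"),
    ("20..30", "SF", "M", "Graduate", "YES"),
    ("31..40", "NY", "F", "College", "YES"),
    ("51..60", "NY", "F", "College", "NO"),
    ("31..40", "LA", "M", "High school", "NO"),
    ("41..50", "NY", "F", "College", "YES"),
    ("41..50", "NY", "F", "Graduate", "YES"),
    ("20..30", "LA", "M", "College", "YES"),
    ("20..30", "NY", "F", "High school", "NO"),
    ("20..30", "NY", "F", "College", "YES"),
]


def entropy(labels):
    """Shannon entropy (in bits) of a list of class labels."""
    total = len(labels)
    return -sum((n / total) * log2(n / total) for n in Counter(labels).values())


def information_gain(records, attr_index):
    """Entropy of the class minus the weighted entropy after splitting."""
    labels = [r[-1] for r in records]
    groups = {}
    for r in records:
        groups.setdefault(r[attr_index], []).append(r[-1])
    remainder = sum(len(g) / len(records) * entropy(g) for g in groups.values())
    return entropy(labels) - remainder


gains = {name: information_gain(records, i) for i, name in enumerate(ATTRIBUTES)}
best = max(gains, key=gains.get)
for name, gain in sorted(gains.items(), key=lambda kv: -kv[1]):
    print(f"{name:10s} {gain:.3f}")
print("Root split:", best)
```

Under this assumption Education has the highest gain: its Graduate branch contains only YES records, its High school branch contains only NO records, and the College branch (5 YES, 1 NO) would be split further on one of the remaining attributes.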

3. On describing discovered knowledge using clustering

Consider the following set of two-dimensional records:

RID    Dimension 1    Dimension 2
1      8              4
2      5              4
3      2              4
4      2              6
5      2              8
6      8              6

3.1 Use the K-means algorithm to cluster this dataset. You can use a value of 3 for K and can assume that the records with RIDs 1, 3, and 5 are used for the initial cluster centroids (means).
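The K-means procedure in 3.1 can be sketched as follows. This is a minimal Python illustration that assumes Euclidean distance and breaks ties in favour of the lowest-numbered cluster; the question does not specify a tie-breaking rule, and record 2 is initially equidistant from two centroids, so a different rule may give a different result. The name `k_means` is a helper introduced here.

```python
# Records from the table above: RID -> (Dimension 1, Dimension 2).
points = {1: (8, 4), 2: (5, 4), 3: (2, 4), 4: (2, 6), 5: (2, 8), 6: (8, 6)}


def sq_dist(p, q):
    """Squared Euclidean distance (enough for comparing distances)."""
    return (p[0] - q[0]) ** 2 + (p[1] - q[1]) ** 2


def k_means(points, initial_rids, max_iter=100):
    """Run K-means starting from the records named in initial_rids."""
    centroids = [points[r] for r in initial_rids]
    assignment = None
    for _ in range(max_iter):
        # Assignment step: each record joins its nearest centroid.
        # min() returns the first minimum, so ties go to the lower index.
        new_assignment = {
            rid: min(range(len(centroids)), key=lambda i: sq_dist(p, centroids[i]))
            for rid, p in points.items()
        }
        if new_assignment == assignment:
            break  # no record changed cluster: converged
        assignment = new_assignment
        # Update step: move each centroid to the mean of its members.
        for i in range(len(centroids)):
            members = [points[r] for r, c in assignment.items() if c == i]
            if members:
                centroids[i] = (
                    sum(x for x, _ in members) / len(members),
                    sum(y for _, y in members) / len(members),
                )
    clusters = [
        sorted(r for r, c in assignment.items() if c == i)
        for i in range(len(centroids))
    ]
    return clusters, centroids


clusters, centroids = k_means(points, initial_rids=(1, 3, 5))
print(clusters)
print(centroids)
```

With this tie-breaking rule the assignments stop changing after the second pass, giving the clusters {1, 2, 6}, {3, 4}, and {5}.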

3.2 What is the difference between describing discovered knowledge using clustering and describing it using classification?

The paper is about data mining, a growing field concerned with handling large collections of data. In it, the Apriori algorithm is applied to the given table to predict which combinations of products are most likely to sell together. This kind of prediction draws on ideas from artificial intelligence used in emerging technologies. The paper also covers classification and clustering algorithms for grouping similar data. The paper was prepared in Microsoft Word.

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd