Describe how data mining can help the company

Assignment Help Other Subject
Reference no: EM132287881

Question 1: Suppose that you are employed as a data mining consultant for an Internet search engine company. Describe how data mining can help the company by giving specific examples of how techniques, such as clustering, classification, association rule mining, and anomaly detection can be applied.

Question 2: Identify at least two advantages and two disadvantages of using color to visually represent information.

Question 3: Consider the XOR problem where there are four training points: (1, 1, -),(1, 0, +),(0, 1, +),(0, 0, -). Transform the data into the following feature space:

Φ = (1, √ 2x1, √ 2x2, √ 2x1x2, x2 1, x2 2).

Find the maximum margin linear decision boundary in the transformed space.

Question 4: Consider the following set of candidate 3-itemsets: {1, 2, 3}, {1, 2, 6}, {1, 3, 4}, {2, 3, 4}, {2, 4, 5}, {3, 4, 6}, {4, 5, 6}

Construct a hash tree for the above candidate 3-itemsets. Assume the tree uses a hash function where all odd-numbered items are hashed to the left child of a node, while the even-numbered items are hashed to the right child. A candidate k-itemset is inserted into the tree by hashing on each successive item in the candidate and then following the appropriate branch of the tree according to the hash value. Once a leaf node is reached, the candidate is inserted based on one of the following conditions:

Condition 1: If the depth of the leaf node is equal to k (the root is assumed to be at depth 0), then the candidate is inserted regardless of the number of itemsets already stored at the node.

Condition 2: If the depth of the leaf node is less than k, then the candidate can be inserted as long as the number of itemsets stored at the node is less than maxsize. Assume maxsize = 2 for this question.

Condition 3: If the depth of the leaf node is less than k and the number of itemsets stored at the node is equal to maxsize, then the leaf node is converted into an internal node. New leaf nodes are created as children of the old leaf node. Candidate itemsets previously stored in the old leaf node are distributed to the children based on their hash values. The new candidate is also hashed to its appropriate leaf node.

How many leaf nodes are there in the candidate hash tree? How many internal nodes are there?

Consider a transaction that contains the following items: {1, 2, 3, 5, 6}. Using the hash tree constructed in part (a), which leaf nodes will be checked against the transaction? What are the candidate 3-itemsets contained in the transaction?

Question 5: Consider a group of documents that has been selected from a much larger set of diverse documents so that the selected documents are as dissimilar from one another as possible. If we consider documents that are not highly related (connected, similar) to one another as being anomalous, then all of the documents that we have selected might be classified as anomalies. Is it possible for a data set to consist only of anomalous objects or is this an abuse of the terminology?

You will need to ensure to use proper APA citations with any content that is not your own work.

with zero plagiarism needed.

Reference no: EM132287881

Questions Cloud

Major raw materials or locate near the major customers : There are two alternatives under consideration: locate near the major raw materials or locate near the major customers.
Difference between using correlation as opposed to cosine : What is the conceptual difference between using the correlation as opposed to cosine similarities? [Hint: how are the missing values in the matrix handled
Determine the standard time for job : A worker-machine operation was found to involve 3.3 minutes of machine time per cycle in course of 40 cycles of stopwatch study. determine standard time for job
Who should be on the district curriculum advisory council : What kind of needs assessment will you need to do? List at least 3 questions or items that you would include on a needs assessment.
Describe how data mining can help the company : Suppose that you are employed as a data mining consultant for an Internet search engine company. Describe how data mining can help the company.
Determine the isotropic free space loss : Determine the isotropic free space loss at 4 GHz for the shortest path to a synchronous satellite from earth (35,863 km)
Compare and contrast the two evidence-based strategies : Explain the importance of using evidence-based strategies to assist all children's success in an inclusive learning environment.
How user interface-based design influence user adoption : Implementing a new system, or modifying an existing one, can create organizational change. This change can impact how employees work, how information technology
General category for categorizing transformations : Which of the following is not a general category for categorizing transformations?

Reviews

Write a Review

Other Subject Questions & Answers

  Choose an organization you have worked for or any

select an organization you have worked for or any organization of interest and discuss how decision analysis could be

  Narrative description of approach

Develop a narrative description of an approach to meaning and value. Draw from experience and outlook, including ethical standards and values, career plans and ambitions, or views of growth and self-development.

  Developing field in psychology

A developing field in psychology is called Positive Psychology, which is exploring ways to help people become happier and productive in life. Research the Internet to learn more about this type of psychology.

  Discuss the major arguments made regarding globalization

Discuss the major arguments made by your authors regarding globalization. As a global manager are you for -OR- against globalization? Why?

  An additional five unacceptable behaviors

Social Media- Portraying yourself or the department in a negative light should be prohibited.

  Anglo-american and african-american styles

The interweaving of Anglo-American and African-American styles and influences may be considered the single most important factor in the development of American music.

  Minority only in quebec and nunavut

In the cases of Quebec and New Brunswick, the vast majority of the non-Anglo phone population speaks French

  Differences between the republic and democratic parties

The Republican and Democratic parties stand for and support different reasons. Identify three differences between the Republic and Democratic parties. Next, discuss in three separate sentences how each of these differences impact you as a citizen

  Summarize history of your countrys sponsorship of terrorism

Describe the level and type of support the chosen country gives to the group(s) it sponsors (i.e. money, training, equipment, intelligence, etc.).

  Explain the investigating methods of increasing productivity

The Hawthorne Effect was a research focused on investigating methods of increasing productivity in an electrical company (McCarney, 2007).

  Discuss how does social security interacts

How does Social Security interacts with the Criminal Justice System

  Between 2009 and 2014 british unemployed has been reduced

you must write an essay of 500 words on the following topicbetween 2009 and 2014 british unemployed has been reduced

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd