What problems if any do you run into

Assignment Help Computer Engineering
Reference no: EM131372443

Introduction to Data Mining Problems

Use R to devise a book recommendation system for the data uploaded to Blackboard.  In particular, develop a system that can recommend up to three books for an arbitrary user that can be entered into R after sourcing your code.  Develop such a system using both a:

(a) User-based collaborative filtering approach. Use Euclidean, Manhattan, correlational, and cosine similarity distance measures. What problems (if any) do you run into?

(b) Item-based collaborative filtering approach. Use an adjusted cosine similarity approach as discussed in class. How does this approach compare to the user-based approach?

To load the data into R you will need to use the read.csv function.  (i.e. read.csv(filename,header=TRUE)).  Please type in ?read.csv" to the R console to see the syntax if you would like further info regarding the function's syntax.  

Make your programs functions, where the names of users, can be entered into the R prompt.  

(c) What are some general problems with both approaches? Conceptually speaking, how can these issues be ameliorated?

Reference no: EM131372443

Questions Cloud

What is probability that average will exceed given value : Assume this is an average from a population with standard deviation of $1.5. If a random sample of 30 months is selected, what is the probability that its average will exceed $4.00?
Probability that a batch will be acceptable to the consumer : What is the probability that a batch will be acceptable to the consumer? Is the probability large enough to be an acceptable level of performance?
Perform research utilizing resources : To complete the current event activity, perform research utilizing resources such as the Internet, magazine publications, newspapers, and journals on a current event that illustrates the implementation of a new Information Systems technology.
Two key strategic decisions made by your current team : Identify two key strategic decisions made by your current team, department, or organization. How could those decisions have been enhanced by optimization models? Support your rationale with evidence from readings or external research.
What problems if any do you run into : DATS 6103: Introduction to Data Mining Problems. User-based collaborative filtering approach. Use Euclidean, Manhattan, correlational, and cosine similarity distance measures. What problems (if any) do you run into
What are the leadership qualities she possesses : Referring to the Cheung Yan: China’s Paper Queen Case Study (Text, pp 675-683),How has Cheung Yan (Zhang Yin) seen such success as a strategic leader? What are the leadership qualities she possesses? do you contribute Cheung Yan’s success to characte..
Enterprise systems consist of several : Enterprise systems consist of several carefully selected modules that are integrated together as a solution to ensure alignment and synchronisation of operational and business processes.
Evaluate business strategies for quality management : HA540-1: Evaluate business strategies for quality management and continuous improvement of operations. Instructions: In this Assignment, you will be evaluating dimensions of quality in healthcare and how various industries can apply these concepts..
What is a confidence interval and why is it useful : What is a confidence interval, and why is it useful? What is a confidence level?- Explain why in classical statistics describing a confidence interval in terms of probability makes no sense.

Reviews

len1372443

1/27/2017 1:06:22 AM

There is some flexibility with respect to how you construct the details of your recommendation system beyond your nearest neighbor algorithm. For example, you may use more than one nearest neighbor to make your algorithm better and you can weight the distances appropriately as discussed in class. Please feel free to discuss what your code is doing in a Word document or PDF and submit that along with your assignment. This will make it easier for the grader to understand the logic behind your algorithm. Make sure your program ignores zero values for the purposes of computing distances. Otherwise your recommendation system will be influenced by unrated books. Use an estimated rating of above 5 as a threshold for the recommendation system. If your model cannot provide any recommendations for a particular individual, then please have it say so. You can discuss this in (c).

Write a Review

Computer Engineering Questions & Answers

  Make use of automated tools to check web site

make Use of automated tools to check Web site: you can validate your site compliance with HTML/CSS/Dublin Core metadata standards and broken links.

  Questionarrays and control structures are important tools

questionarrays and control structures are important tools while programming. an array contains a number of variables

  Addressing and naming model

Sketch a plan for development of the addressing and the naming model in an environment of following given scenario: Ten (10) departments in the 1,000-employee organization. Equal separation by geography

  Transfering the power over ethernet

A recent article in an industry magazine discussed the ability to transfer the Power over Ethernet (PoE) and an emerging technology which is able to transfer the Power over Fiber (PoF).

  Explain the difference between object-oriented programming

define the difference between object-oriented programming and procedural (or structural or processual) programming. What, if anything, does the OO model bring to the table and improve upon what was out there pre-OO.

  Assembly program to find out the price of a car rental

Write down an Assembly program in order to find out the price of a car rental. The car being rented costs $45 per day and frequent renters get a $15 discount on the total bill.

  Design a full adder circuit which adds three binary digits

Design a full-adder circuit which adds three binary digits xi, yi and carry in ci. Your circuit should compute the sum out si, and carry out ci as shown in given Figure.

  Categorizing the threat

Download a password cracker developed for your operating system. Run the cracker on your system. Describe the results from cracker.

  Consider the following snapshot of a system

Problem 1: Consider the following snapshot of a system:

  Linux versus microsoft windows server

Develop an 8- to 10-slide Microsoft PowerPoint presentation for the executive team at your selected Virtual Organization. The presentation must do the following

  What is the role of a pilot project in information systems

jim watanabe was in his new car driving down i-5 on his way to work. he dreaded the phone call he knew he was going to

  Questionthink about a cellular system with a total

questionthink about a cellular system with a total bandwidth of 30 mhz. each full duplex voice or control channel uses

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd