Difference between using correlation as opposed to cosine

Assignment Help Basic Statistics
Reference no: EM132287884

Overview

The Institute for Statistics Education at Statistics.com asks students to rate a variety of aspects of a course as soon as the student completes it. The Institute is contemplating instituting a recommendation system that would provide students with recommendations for additional courses as soon as they submit their rating for a completed course. Consider the excerpt from student ratings of online statistics courses shown in Table 1 below, and the problem of what to recommend to student E.N.

Table 1

Ratings of online statistics courses: 4 = Best, 1 = worst, blank = not taken association table week 6.png

In R Your Job is To:

Consider a user-based collaborative filter. This requires computing correlations between all student pairs. For which students is it possible to compute correlations with E.N.? Compute them.

Then, tell me:

Which single course should we recommend to E.N. based on the single nearest student to E.N.? Explain why.

Based on the cosine similarities of the nearest students to E.N., which course should be recommended to E.N.?

What is the conceptual difference between using the correlation as opposed to cosine similarities? [Hint: how are the missing values in the matrix handled in each case?]

Then:

With large datasets, it is computationally difficult to compute user-based recommendations in real time, and an item-based approach is used instead. Returning to the rating data (not the binary matrix), let's now take that approach.

If the goal is still to find a recommendation for E.N., for which course pairs is it possible and useful to calculate correlations?
Just looking at the data, and without yet calculating course pair correlations, which course would you recommend to E.N., relying on item-based filtering? Calculate two course pair correlations involving your guess and report the results.

Finally:

Apply item-based collaborative filtering to this dataset (using R) and based on the results, recommend a course to E.N.

Reference no: EM132287884

Questions Cloud

What would the price elastic of demand before this product : A cut in price from Br 1.50 to Br 1.20 leads demand for a product rise by 10% What would the price elastic of demand before this product ? interpret the result
Discuss compensation equity issues : Discuss compensation equity issues. Explain both actual inequity-perceived inequity in compensation as it relates to both the internal and external environment
How you will approach the design for the welovevideo : Explain why this approach was chosen over others. Highlight the steps and processes that will be adhered to, along with the output expected from the approach.
Major raw materials or locate near the major customers : There are two alternatives under consideration: locate near the major raw materials or locate near the major customers.
Difference between using correlation as opposed to cosine : What is the conceptual difference between using the correlation as opposed to cosine similarities? [Hint: how are the missing values in the matrix handled
Determine the standard time for job : A worker-machine operation was found to involve 3.3 minutes of machine time per cycle in course of 40 cycles of stopwatch study. determine standard time for job
Who should be on the district curriculum advisory council : What kind of needs assessment will you need to do? List at least 3 questions or items that you would include on a needs assessment.
Describe how data mining can help the company : Suppose that you are employed as a data mining consultant for an Internet search engine company. Describe how data mining can help the company.
Determine the isotropic free space loss : Determine the isotropic free space loss at 4 GHz for the shortest path to a synchronous satellite from earth (35,863 km)

Reviews

Write a Review

Basic Statistics Questions & Answers

  Statistics-probability assignment

MATH1550H: Assignment:  Question:  A word is selected at random from the following poem of Persian poet and mathematician Omar Khayyam (1048-1131), translated by English poet Edward Fitzgerald (1808-1883). Find the expected value of the length of th..

  What is the least number

MATH1550H: Assignment:  Question:     what is the least number of applicants that should be interviewed so as to have at least 50% chance of finding one such secretary?

  Determine the value of k

MATH1550H: Assignment:  Question:     Experience shows that X, the number of customers entering a post office during any period of time t, is a random variable the probability mass function of which is of the form

  What is the probability

MATH1550H: Assignment:Questions: (Genetics) What is the probability that at most two of the offspring are aa?

  Binomial distributions

MATH1550H: Assignment:  Questions:  Let’s assume the department of Mathematics of Trent University has 11 faculty members. For i = 0; 1; 2; 3; find pi, the probability that i of them were born on Canada Day using the binomial distributions.

  Caselet on mcdonald’s vs. burger king - waiting time

Caselet on McDonald’s vs. Burger King - Waiting time

  Generate descriptive statistics

Generate descriptive statistics. Create a stem-and-leaf plot of the data and box plot of the data.

  Sampling variability and standard error

Problems on Sampling Variability and Standard Error and Confidence Intervals

  Estimate the population mean

Estimate the population mean

  Conduct a marketing experiment

Conduct a marketing experiment in which students are to taste one of two different brands of soft drink

  Find out the probability

Find out the probability

  Linear programming models

LINEAR PROGRAMMING MODELS

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd