Creating a movie recommendation system

Reference no: EM132411823

Assignment -

For this project, you will be creating a movie recommendation system using the MovieLens dataset. The version of movielens included in the dslabs package (which was used for some of the exercises in PH125.8x: Data Science: Machine Learning) is just a small subset of a much larger dataset with millions of ratings. The entire latest MovieLens dataset is available online. You will be creating your own recommendation system using all the tools we have shown you throughout the courses in this series. We will use the 10M version of the MovieLens dataset to make the computation a little easier.

You will download the MovieLens data and run code we will provide to generate your datasets.

First, there will be a short quiz on the MovieLens data. You can view this quiz as an opportunity to familiarize yourself with the data in order to prepare for your project submission.

Second, you will train a machine learning algorithm using the inputs in one subset to predict movie ratings in the validation set.
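As a rough illustration of that second step, the sketch below fits a simple baseline on a training subset and scores it on held-out ratings. It is written in Python for illustration only (the course itself works in R), and everything in it is an assumption rather than the provided course code: the `(user_id, movie_id, rating)` tuple layout, the function names, the toy data, and the choice of root mean squared error (RMSE) as the score.

```python
from collections import defaultdict
from math import sqrt


def fit_baseline(train):
    """Fit an overall mean plus per-movie and per-user effects.

    train: list of (user_id, movie_id, rating) tuples (assumed layout).
    """
    mu = sum(r for _, _, r in train) / len(train)

    # Movie effect: average deviation of a movie's ratings from the mean.
    movie_resid = defaultdict(list)
    for _, m, r in train:
        movie_resid[m].append(r - mu)
    b_movie = {m: sum(v) / len(v) for m, v in movie_resid.items()}

    # User effect: average of what is left after removing the movie effect.
    user_resid = defaultdict(list)
    for u, m, r in train:
        user_resid[u].append(r - mu - b_movie[m])
    b_user = {u: sum(v) / len(v) for u, v in user_resid.items()}

    return mu, b_movie, b_user


def predict(model, user, movie):
    """Predict a rating; unseen users or movies fall back to a zero effect."""
    mu, b_movie, b_user = model
    return mu + b_movie.get(movie, 0.0) + b_user.get(user, 0.0)


def rmse(model, rows):
    """Root mean squared error of the model on held-out (user, movie, rating) rows."""
    squared_errors = [(predict(model, u, m) - r) ** 2 for u, m, r in rows]
    return sqrt(sum(squared_errors) / len(squared_errors))


# Toy data, invented for this example only.
train = [(1, 10, 4.0), (1, 20, 2.0), (2, 10, 5.0), (2, 20, 3.0)]
validation = [(1, 10, 4.0), (3, 20, 2.5)]

model = fit_baseline(train)
print(rmse(model, validation))
```

The held-out rows are used only for scoring, never for fitting, which mirrors the split between the training subset and the validation set described above.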

SECOND PROJECT -

For this project, you will be applying machine learning techniques that go beyond standard linear regression. You will have the opportunity to use a publicly available dataset to solve the problem of your choice. You are strongly discouraged from using well-known datasets, particularly ones that have been used as examples in previous courses or are similar to them (such as the iris, titanic, mnist, or movielens datasets, among others) - this is your opportunity to branch out and explore some new data! The UCI Machine Learning Repository and Kaggle are good places to seek out a dataset. Kaggle also maintains a curated list of datasets that are cleaned and ready for machine learning analyses. Your dataset must be automatically downloaded in your code or included with your submission.
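One way to meet the automatic-download requirement is a small helper that fetches the file only when it is not already on disk. The sketch below is in Python for illustration (a report submitted as Rmd would do the equivalent in R); the function name `fetch_dataset` and the URL in the usage comment are hypothetical.

```python
from pathlib import Path
from urllib.request import urlretrieve


def fetch_dataset(url, dest):
    """Download url to dest unless dest already exists; return the local path."""
    dest = Path(dest)
    if not dest.exists():
        # Create the target folder on first use, then download the file.
        dest.parent.mkdir(parents=True, exist_ok=True)
        urlretrieve(url, str(dest))
    return dest


# Usage (hypothetical URL, replace with the real location of your dataset):
# path = fetch_dataset("https://example.org/my-dataset.csv", "data/my-dataset.csv")
```

Skipping the download when the file already exists keeps repeated runs of the analysis fast and avoids re-fetching the data each time the report is rebuilt.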

The ability to clearly communicate the process and insights gained from an analysis is an important skill for data scientists. You will submit a report that documents your analysis and presents your findings, with supporting statistics and figures. The report must be written in English and uploaded as both a PDF document and an Rmd file. Although the exact format is up to you, the report should include the following at a minimum:

an introduction/overview/executive summary section that describes the dataset and summarizes the goal of the project and key steps that were performed;

a methods/analysis section that explains the process and techniques used, such as data cleaning, data exploration and visualization, any insights gained, and your modeling approach;

a results section that presents the modeling results and discusses the model performance; and

a conclusion section that gives a brief summary of the report, its limitations, and future work (the last two are recommended but not necessary).

Your project submission will be graded both by your peers and by a staff member. The peer grading will give you an opportunity to check out the projects done by other learners.



