Big Data Small Project using R Language

Reference no: EM131502121

The work will combine research on a specific problem/technique related to Big Data Processing, implementation of a computing solution and presentation of the results.

There is an initial set of proposed topics. However, you are allowed (and encouraged) to propose alternative small projects or variants, which must be discussed with the lecturers.

The expected work includes researching the subject topic, selecting and implementing a computing solution based on technologies covered in the lectures, such as Map/Reduce, MongoDB and data analytics techniques, and obtaining the desired results from the processed data. The topic provides some general guidelines on what the coursework will consist of; you are expected to take these guidelines and put forward a specific proposal of what you aim to achieve in the project.
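As a rough illustration of the Map/Reduce pattern named above, the sketch below totals play counts per song in plain Python; the same steps carry over to R or to a Hadoop job. The record layout (user, song, play count), the sample values and the function names are all assumptions made for illustration and do not come from the coursework material.

```python
from collections import defaultdict

# Hypothetical input records: (user_id, song_id, play_count).
records = [
    ("u1", "songA", 3),
    ("u2", "songA", 1),
    ("u1", "songB", 2),
    ("u3", "songB", 5),
    ("u3", "songC", 1),
]


def map_phase(record):
    """Emit one (key, value) pair per record: song_id -> play_count."""
    _user, song, plays = record
    yield song, plays


def shuffle(pairs):
    """Group values by key, as a framework does between map and reduce."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_phase(key, values):
    """Combine all values for one key into a single total."""
    return key, sum(values)


mapped = [pair for record in records for pair in map_phase(record)]
totals = dict(reduce_phase(key, values) for key, values in shuffle(mapped).items())
print(totals)
```

In a real Hadoop job the shuffle step is carried out by the framework; it is written out here only to make the data flow visible.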

Music Recommendation System

Music recommendation systems are becoming a hot topic due to the increase in the number of online listeners on services like Spotify. Recommending relevant songs to users and predicting which songs a particular user will like is a valuable feature for any music application. You are to develop a music recommendation system based on the Million Song Dataset.
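One possible starting point, shown here only as a minimal sketch, is item-based collaborative filtering with cosine similarity between songs. The play counts below are toy values invented for illustration, not taken from the Million Song Dataset, and the helper names (`song_vector`, `cosine`, `recommend`) are hypothetical. The sketch is in Python; the same logic can be written in R.

```python
from math import sqrt

# Hypothetical play counts: user -> {song: plays}. Toy data for illustration only.
plays = {
    "u1": {"songA": 5, "songB": 3},
    "u2": {"songA": 4, "songB": 2, "songC": 4},
    "u3": {"songB": 1, "songC": 5},
}


def song_vector(song):
    """Return the play counts of one song, keyed by the users who played it."""
    return {user: songs[song] for user, songs in plays.items() if song in songs}


def cosine(a, b):
    """Cosine similarity between two sparse vectors stored as dicts."""
    shared = set(a) & set(b)
    dot = sum(a[u] * b[u] for u in shared)
    norm = sqrt(sum(v * v for v in a.values())) * sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0


def recommend(user, top_n=1):
    """Rank songs the user has not played by similarity to songs they have played."""
    heard = plays[user]
    all_songs = {song for songs in plays.values() for song in songs}
    scores = {}
    for candidate in all_songs - set(heard):
        cand_vec = song_vector(candidate)
        scores[candidate] = sum(
            cosine(cand_vec, song_vector(song)) * count
            for song, count in heard.items()
        )
    return sorted(scores, key=scores.get, reverse=True)[:top_n]


print(recommend("u1"))
```

On the full dataset the user-song matrix is far too large for nested loops like these, which is where the Map/Reduce or NoSQL technologies from the lectures would come in.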

Predict short term movements in stock prices

The basic assumption is that the stock price depends largely on both inside and outside factors. Inside factors include company performance (earnings and profits) and company news (introducing new products, securing a new large contract, etc.). Outside factors include industry performance, investor sentiment (bull market or bear market, news sentiment) and the economic environment (interest rates, economic outlook, inflation, etc.).

Twitter to predict the next best restaurant

Yelp has a data set that includes restaurant rankings and reviews. One idea for this project is to use tweets to predict restaurant star ratings. This would enable you to combine Yelp data with Twitter data.
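To make the idea of combining the two sources concrete, the sketch below scores each tweet with a tiny word list, averages the scores per restaurant, and pairs that average with the star rating as a candidate feature and target. Everything in it is an assumption for illustration: the word lists, the restaurant identifiers, the tweets and the star values are invented, and a real project would use a proper sentiment method and the actual Yelp and Twitter data. It is written in Python; the same steps can be done in R.

```python
from collections import defaultdict

# Hypothetical sentiment word lists; far too small for real use.
POSITIVE = {"great", "delicious", "friendly", "love"}
NEGATIVE = {"slow", "cold", "rude", "bad"}

# Hypothetical tweets already matched to a restaurant id.
tweets = [
    ("cafe_x", "Great coffee and friendly staff"),
    ("cafe_x", "Love the delicious pastries"),
    ("diner_y", "Slow service and cold food"),
    ("diner_y", "Friendly waiter but bad burger"),
]

# Hypothetical star ratings keyed by the same restaurant id.
yelp_stars = {"cafe_x": 4.5, "diner_y": 2.0}


def sentiment(text):
    """Count positive words minus negative words in one tweet."""
    words = text.lower().split()
    return sum(w in POSITIVE for w in words) - sum(w in NEGATIVE for w in words)


def build_features(tweets, stars):
    """Return restaurant -> (mean tweet sentiment, star rating)."""
    scores = defaultdict(list)
    for restaurant, text in tweets:
        scores[restaurant].append(sentiment(text))
    return {
        restaurant: (sum(values) / len(values), stars[restaurant])
        for restaurant, values in scores.items()
        if restaurant in stars
    }


features = build_features(tweets, yelp_stars)
print(features)
```

The resulting pairs could then feed a regression or classification model in the modelling stage.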

Have you provided a context for the project? Have you provided a description of the data? Have you loaded the data? Have you explored/processed the data? Have you provided script(s) for pre-analysis? Have you identified the objective of the analysis and the technique to be used?

How are you presenting the results of the analysis?

Compulsory Requirement

The topic must be approved first, unless it is taken from the suggestions above. You MUST include the full analytics process (exploration, cleaning, modelling). You also need to show that you can apply NoSQL and Hadoop (including a related technology).

I also need to see that you have obtained the data set, loaded it into R, and keep the scripts on your computer as evidence of your ownership of the work.

Reflective Critique

You should keep a reflective diary of your progress during the assignment. It should cover your activities and how you collected other material on the methods used. This should be submitted as an appendix to the report developed for part two and is subject to the same submission criteria.

You also need to evaluate the solution that you are proposing and explain how you would improve it.

Assignment Files -

https://www.dropbox.com/s/n6c871v0rc862wa/R%20Programming%20Assignment.rar?dl=0

Reviews

len1502121

5/20/2017 2:46:31 AM

Content focus and reflection topic: The entire reflection is well written, clear, highly accurate and consistent. The topic is relevant, clear and detailed. The content demonstrates knowledge of reflective thinking and reflection. Reflection displays critical thinking about the topic. The standard is excellent to exceptional.

len1502121

5/20/2017 2:46:26 AM

Organisation of information: All aspects of the reflection are well presented in a logical manner, following the cycle of reflection. Excellent to exceptional standard.

Reflection, analysis and self-disclosure: The reflection demonstrates excellent analytical skills together with self-disclosure to reveal insight and well developed critical thinking skills. Topic of reflection is clear and highly detailed. Excellent to exceptional standard.

Reflection evaluation and outcomes: The reflection demonstrates excellent critical thinking skills related to topic and leads to an outcome that is comprehensive, thoughtful and relevant to future nursing practice. Excellent to exceptional standard.

Referencing, formatting, paragraphing, fluency, and style of writing: Free or almost free from referencing, formatting, spelling, and/or grammatical errors. Clear, succinct, and effective language used throughout the paper. Content flows smoothly and logically.

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd