Determine a new data-driven business process

Assignment Help Other Subject
Reference no: EM133360684

Assessment: Sentiments Expressed in Hotel Reviews

The main idea behind gauging sentiment is based on the notion that certain words are known to convey negative sentiments, while others are positive. For example, "sad," "angry," or "disappointment" convey negativity, while in contrast, "happy," "celebrate," or "pleased" represent positivity. This assignment represents a combination of two key tasks in data mining: sentiment analysis and text mining. The text mining is the first step, the output of which is used in sentiment analysis. In addition, while this course is using the Python programming language, this assignment uses R. The reason is primarily skill maintenance as both Python and R are used in this area.

Given a dataset of reviews of three hotels, identify which one is most positively reviewed to help managers prioritize renovation plans and upcoming marketing campaigns. Refer to the hotel reviews datasets included in this topic, hotel1.csv, hotel2.csv, and hotel3.csv.

The algorithm for sentiment analysis used in this model consists of a few steps:

1. Acquire a set of text-based data, known to contain expressions of opinions about a common topic.
2. Parse the text to extract a list of all the sentences.
3. Traverse the sentences and search for words associated with a list of words labeled as positive or
4. Calculate the ratio of positive to negative words.
5. Use the positive/negative ratio to quantify the sentiment expressed in the entire dataset.

Complete the following specific steps in R:

1. Install and load the syuzhet
2. Load the three hotel reviews datasets into data frames.
3. Explore and clean the data.
4. Convert each set of reviews into sets of sentences using the get_sentences()
5. Verify the output of the last step.

6. Build the sentiment analysis model.

1. Extract sentiments for each hotel using the get_sentiment()
2. Examine the first 10 values in each of the resulting numeric vectors for each hotel. What are the most positive and the most negative sentences for each hotel (among the first 10 sentences)? Explain. Since the results of this analysis are actionable items, the model calculates the ratio of positive to negative. There are other sentiment analysis data, in which neutral sentiments are valuable, like those expressed towards an artist or a politician. Does your model calculate neutral sentiments as well? If yes, how are you processing these results? If not, why not?
3. Calculate measures of central tendencies for each hotel's reviews, then summarize your findings and their meaning.
4. Visualize the sentiments using the plot()
5. Plot the trendlines of the sentiments for each hotel using
the simple_plot() function and examine the resulting normalized normative time curves.
6. Use the zoo library and the rollmean() function to compute the moving averages of sentiments for the three hotels.
7. Rescale the curves by using the x component of the (x,y,z) vector with values (0,1) returned by the rescale_x_2()
8. Plot the rescales curves.
7. Interpret the results.
1. Compare the reviews by focusing on the shape of the vectors that represent the reviews. Use the method of cosine similarity to compare the vectors, more specifically the discrete cosine transform (DCT).
2. Use the get_dct_transform() function, which produces smoothed results on a scale of 0 to 100.
3. Plot the DCT smoothing and time normalization for each hotel using the plot()
4. Verify the length of each vector to confirm that it is 100 using the length()
5. Plot all three curves in one graph for easier visual comparison of their DCT smoothing and time normalization.
6. Calculate the correlation of each pair of vectors using the cor()
7. Discuss the significance of these results to managers of the hotels reviewed.
8. Ethical practices:
1. Reflect on the possible abuses that might occur during the analysis, interpretation, and use of data and results.
2. Substantiate your reflection with concrete examples from your analysis and interpretation in a "what if" scenario.

Prepare this assignment according to the guidelines found in the APA Style Guide.

Benchmark Information

Question: Determine whether or not a new data-driven business process is using customer data in an ethically sustainable way.

Attachment:- Sentiment_Expressed.rar

Reference no: EM133360684

Questions Cloud

Essay comparing spontaneous and biogenesis theories : Write an essay comparing spontaneous and biogenesis theories. Proponents of each theory carried experiments to prove their hypotheses right.
How many networks are possible in classes a, b and c : How many networks are possible in classes A, B and C? How many hosts are possible in each network in classes A, B and C? How many IP addresses are possible
Bacterial quantification by culture lab reporting worksheet : Many lessons learned because of scientific experiments come from the reporting and analysis of data.
What is your back-up plan in the event your computer : What is(are) your back-up plan(s) in the event your computer and/or your ISP goes down for a short or long period of time? What is your plan to access
Determine a new data-driven business process : Determine whether or not a new data-driven business process is using customer data in an ethically sustainable way
Calculate some basic statistics, create an employee lookup : calculate some basic statistics, and create an employee lookup section. As you construct formulas, make sure you use absolute, relative, and mixed cell
Describe various components of light microscope : Describe the various components of the light microscope and how they contribute to both changes in resolution and magnification.
State media used to grow colonies in laboratories : State the media used to grow colonies in laboratories today. Define the media used today. How did Walther Hesse know that the media he introduced would work?
Why people who consume large amounts of sugar : why people who consume large amounts of sugar. Define homeostasis and explain how your pancreatic hormones work to maintain this.

Reviews

Write a Review

Other Subject Questions & Answers

  Cross-cultural opportunities and conflicts in canada

Short Paper on Cross-cultural Opportunities and Conflicts in Canada.

  Sociology theory questions

Sociology are very fundamental in nature. Role strain and role constraint speak about the duties and responsibilities of the roles of people in society or in a group. A short theory about Darwin and Moths is also answered.

  A book review on unfaithful angels

This review will help the reader understand the social work profession through different concepts giving the glimpse of why the social work profession might have drifted away from its original purpose of serving the poor.

  Disorder paper: schizophrenia

Schizophrenia does not really have just one single cause. It is a possibility that this disorder could be inherited but not all doctors are sure.

  Individual assignment: two models handout and rubric

Individual Assignment : Two Models Handout and Rubric,    This paper will allow you to understand and evaluate two vastly different organizational models and to effectively communicate their differences.

  Developing strategic intent for toyota

The following report includes the description about the organization, its strategies, industry analysis in which it operates and its position in the industry.

  Gasoline powered passenger vehicles

In this study, we examine how gasoline price volatility and income of the consumers impacts consumer's demand for gasoline.

  An aspect of poverty in canada

Economics thesis undergrad 4th year paper to write. it should be about 22 pages in length, literature review, economic analysis and then data or cost benefit analysis.

  Ngn customer satisfaction qos indicator for 3g services

The paper aims to highlight the global trends in countries and regions where 3G has already been introduced and propose an implementation plan to the telecom operators of developing countries.

  Prepare a power point presentation

Prepare the power point presentation for the case: Santa Fe Independent School District

  Information literacy is important in this environment

Information literacy is critically important in this contemporary environment

  Associative property of multiplication

Write a definition for associative property of multiplication.

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd