How many sentences are generated after tokenization

Assignment Help Programming Languages
Reference no: EM131666074

Assignment

Please follow the steps and answer the questions:

Load the necessary packages first:
library(tm)
library(openNLP)
library(textstem)

1. Import the twitter dataset using readreadline() function.

2. Inspect the first 10 tweets from the data.

3. We will specifically look at a tweet at index number 8. Assign a variable name to this tweet. For example: tweet8.

4. Check the data type of tweet8. Before changingthe tweet8 to String type, firstly allow tweet8 to go through stemming. How many words in this sentence has been stemmed? What are the original form and base form respectively?

5. Remove the stop words from stemmed tweet8. Compare the original tweet8 and transformed tweet8. What are the stop words removed?

6. Reassign the 8th tweet to tweet8. Lemmatize the tweet8. What are the words has been lemmatized? What are the original forms and base forms respectively?

7. Change tweet8 to String type.

8. Use sentence tokenization function to segment tweet8. How many sentences are generated after tokenization?

9. Use word tokenization function to divide the words from the sentences. How many words have been generated? Display the words and sentences.

10. Use part of speech tagging function to assign POS tag to each word. Check the word and POS frequency. How many words have been assigned POS tags "VBD" (verb past tense)? What are the words being assigned with POS tags "VBD"?

11. Use name entity recognition function to detect name entities from this tweet. Does this function detect any name entities?

12. Use parsing function to parse this tweet. How many verb phrases (VP) are there? What components compose the last verb phrase? (If your parser does not work, you could skip this question)

Submission:

Create a R script file and write the R commands for each question. Write down the answers to the questions such as "How many words in this sentence has been stemmed?" as R script comments in the same R script file. Submit the file on blackboard.

Attachment:- Lab-Tweets.rar

Reference no: EM131666074

Questions Cloud

Discuss computer was correctly figured for the program : Seller alleges that it was Buyer's responsibility to check whether his computer was correctly figured for the program
How is plastic made into a sustainable product in panama : How is plastic made into a sustainable product in Panama? What companies have utilized it to create either cleaner water,
Marriott and starwood hotels-resorts worldwide merger : Read this article and then find an article about the Marriott and Starwood Hotels & Resorts Worldwide merger.
Write a report on Exporting Wine to the UK : Write a report on given topic. Topic - Exporting Wine to the UK. Filter 4 - Target Markets and Corporate Factors. Likely Target Market in the Allocated Country
How many sentences are generated after tokenization : Use sentence tokenization function to segment tweet8. How many sentences are generated after tokenization? How many words have been generated?
Determining the non-profit organization : Which of these do you think are the most vital or most useful in a non-profit organization? Why?
Calculate customer-level operating income for customers : All customers except E ordered units in the same order size. Customer E's order quantity varied, so E got a discount part of the time but not all the time.
What role do the international monetary fund : What role do the International Monetary Fund (IMF) and the World Bank play? And what are some of the potential benefits or harms of these organizations?
Economic order quantity if price is not consideration : M. P. VanOyen Manufacturing has gone out on bid for a regulator component. What is the economic order quantity if price is not a? consideration?

Reviews

Write a Review

Programming Languages Questions & Answers

  Write a haskell program to calculates a balanced partition

Write a program in Haskell which calculates a balanced partition of N items where each item has a value between 0 and K such that the difference b/w the sum of the values of first partition,

  Create an application to run in the amazon ec2 service

In this project you will create an application to run in the Amazon EC2 service and you will also create a client that can run on local machine and access your application.

  Explain the process to develop a web page locally

Explain the process to develop a Web page locally

  Write functions

These 14 questions covers java class, Array, link list , generic class.

  Programming assignment

If the user wants to read the input from a file, then the output will also go into a different file . If the user wants to read the input interactively, then the output will go to the screen .

  Write a prolog program using swi proglog

Write a Prolog program using swi proglog

  Create a custom application using eclipse

Create a custom Application Using Eclipse Android Development

  Create a application using the mvc architecture

create a application using the MVC architecture. No scripting elements are allowed in JSP pages.

  Develops bespoke solutions for the rubber industry

Develops bespoke solutions for the rubber industry

  Design a program that models the worms behavior

Design a program that models the worm's behavior.

  Writing a class

Build a class for a type called Fraction

  Design a program that assigns seats on an airplane

Write a program that allows an instructor to keep a grade book and also design and implement a program that assigns seats on an airplane.

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd