State of the union code, Other Subject

Assignment Help:

1)  The average number of sentences per word over time

Do this two ways:

First, using a simple regex that splits on punctuation. Second, using the Natural Language Toolkit's (NLTK) sentence tokenizer.
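
The two approaches above might be sketched as follows. This is a minimal illustration, not the course's reference solution: the function names and the word pattern are assumptions, and the NLTK variant needs the `nltk` package plus its sentence tokenizer data to be installed.

```python
import re


def split_sentences_regex(text):
    """Split after '.', '!' or '?' when followed by whitespace."""
    parts = re.split(r"(?<=[.!?])\s+", text.strip())
    return [p for p in parts if p]


def split_sentences_nltk(text):
    """Same job with NLTK (assumes nltk and its tokenizer data are installed)."""
    from nltk.tokenize import sent_tokenize
    return sent_tokenize(text)


def sentences_per_word(text, splitter=split_sentences_regex):
    """Return (number of sentences) / (number of words) for one speech."""
    sentences = splitter(text)
    words = re.findall(r"[A-Za-z']+", text)  # assumed definition of a word
    return len(sentences) / len(words) if words else 0.0
```

Passing `splitter=split_sentences_nltk` swaps in the second method without changing the rest of the calculation. The regex version will miscount abbreviations such as "Mr.", which is one reason to compare the two.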
2)  The average number of unique words per 100 words over time

    - Remove common words (see the lecture slide for a list)

Do this two ways:

First: assume that different words are all unique, even if they share a stem (like run and running).

Second: using the stemming code from NLTK.
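
One way to compute this statistic both ways is to make the normalization step a parameter. The sketch below is hedged: `STOP_WORDS` is a tiny placeholder rather than the list from the lecture slide, and it counts unique words relative to the words left after common words are removed, which is only one reading of the task.

```python
import re

# Placeholder list; replace with the common words from the lecture slide.
STOP_WORDS = {"the", "a", "an", "and", "of", "to", "in", "is"}


def tokenize(text):
    """Lowercase the text and pull out runs of letters and apostrophes."""
    return re.findall(r"[a-z']+", text.lower())


def unique_per_100(text, normalize=lambda w: w, stop_words=STOP_WORDS):
    """Unique words per 100 words, after dropping stop words."""
    words = [normalize(w) for w in tokenize(text) if w not in stop_words]
    if not words:
        return 0.0
    return 100.0 * len(set(words)) / len(words)


def nltk_stemmer():
    """Return NLTK's Porter stemming function (assumes nltk is installed)."""
    from nltk.stem import PorterStemmer
    return PorterStemmer().stem
```

Calling `unique_per_100(text)` gives the first variant; `unique_per_100(text, normalize=nltk_stemmer())` gives the stemmed variant.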

Make sure that you sort the speeches by their date, and write the data into a text file with two columns: date and statistic.
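
A minimal sketch of the output step, assuming the statistics are held in a dictionary keyed by `datetime.date`. The tab separator, the four-decimal format, and the function names are choices made for illustration, not requirements from the assignment.

```python
import datetime


def format_rows(stats):
    """Turn {date: value} into 'YYYY-MM-DD<TAB>value' lines, oldest first."""
    rows = sorted(stats.items())  # date objects sort chronologically
    return [f"{day.isoformat()}\t{value:.4f}" for day, value in rows]


def write_stats(stats, path):
    """Write the two-column file: date and statistic."""
    with open(path, "w", encoding="utf-8") as handle:
        for line in format_rows(stats):
            handle.write(line + "\n")
```

Sorting on real date objects rather than on date strings avoids ordering mistakes when the source dates are written as, say, "January 8, 1790".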

EC) Build a word cloud, as seen in the slides from lectures 11 and 12
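
A word cloud is usually drawn from word frequencies, so a reasonable first step is to count them. The sketch below stops at the counts and does not draw anything; the function name and the `top_n` cutoff are assumptions, and the rendering method shown in the lecture slides is not reproduced here.

```python
import re
from collections import Counter


def word_frequencies(text, stop_words=frozenset(), top_n=50):
    """Return the top_n (word, count) pairs, most frequent first."""
    words = [
        w for w in re.findall(r"[a-z']+", text.lower())
        if w not in stop_words
    ]
    return Counter(words).most_common(top_n)
```

Each count can then be mapped to a font size by whatever drawing tool is used.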

Part 2: Collect an interesting web statistic.

Use urlweb and a website of your choice to collect a statistic. Write a one-paragraph description of your statistic at the top of the code file.

Examples include sports game data, weather statistics, name statistics, etc.
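
As a hedged sketch of Part 2, the code below uses Python's standard `urllib.request` module, which is an assumption about what the assignment means by "urlweb". The statistic (the number of links on a page) and all function names are illustrative only.

```python
"""Statistic: the number of hyperlinks on a web page.

This paragraph stands in for the one-paragraph description the
assignment asks for at the top of the code file.
"""
from html.parser import HTMLParser
from urllib.request import urlopen


class LinkCounter(HTMLParser):
    """Count <a> tags that carry an href attribute."""

    def __init__(self):
        super().__init__()
        self.count = 0

    def handle_starttag(self, tag, attrs):
        if tag == "a" and any(name == "href" for name, _ in attrs):
            self.count += 1


def count_links(html):
    """Return the number of links found in an HTML string."""
    parser = LinkCounter()
    parser.feed(html)
    return parser.count


def fetch(url):
    """Download a page and decode it as text (needs network access)."""
    with urlopen(url, timeout=10) as response:
        return response.read().decode("utf-8", errors="replace")
```

With a site chosen, the statistic would be collected with `count_links(fetch(url))`. Keeping the download separate from the parsing makes the parsing easy to check without a network connection.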




All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd