What is the map value for each system

Assignment Help Computer Engineering
Reference no: EM131500263

Assignment

1. Instead of returning nothing for a query, a search engine should return some results even if they are incorrect." Do you agree or disagree? Explain.

2. What are the differences between static and dynamic summaries? Describe a scenario where each type of summary would be the best solution to a search query.

3. Consider an information need for which there are 6 relevant documents in the collection. Contrast two systems run on this collection. Their top 10 results are judged for relevance as follows:

a. Complete this table with numerical values:

 

System 1

System 2

 

 

Recall

Precision

 

Recall

Precision

1

R

 

 

N

 

 

2

N

 

 

R

 

 

3

R

 

 

N

 

 

4

N

 

 

N

 

 

5

N

 

 

R

 

 

6

R

 

 

R

 

 

7

N

 

 

R

 

 

8

N

 

 

N

 

 

9

R

 

 

N

 

 

10

R

 

 

N

 

 

b. What is the MAP value for each system? (equation okay)

c. What is F β=1 for each system with 10 documents returned?

4. True/False

a) _____ The tf-idf weight increases with the number of occurrences of a term within a document
b) _____ The tf-idf weight increases with the rarity of the term in the collection.
c) _____ The summary information displayed by a search engine must come from the "description" meta tag in a html file.
d) _____ Hard clustering is more common and easier than soft clustering.
e) _____ Pseudo relevance feedback is the same as indirect relevance feedback
f) _____ Query expansion means to double the number of results shown to the user
g) _____ Hierarchical agglomerative clustering is a "top down" clustering technique.
h) _____ Linear classifiers partition the dataspace into overlapping regions.
i) _____ K-means clustering is an example of unsupervised learning.
j) _____ Hub sites should be scored higher than authoritative sites.

0.80

0.70

0.90

0.00

0.50

0.10

0.50

0.75

0.25

5.The following probabilities have been Long Sweet Green determined from a training set of 1000. Cucumber

Given a sample that is long, sweet and green; Jalapeno what is the probability that it would be classified Other as a Cucumber using a naïve bayes classifier? Show your work.

6. Apply the KNN algorithm to classify the data item (14) with this known set of data and classes: { (10,1), (11,1), (15,2), (12,1), (18,2), (9,1), (20,2), (17,2) }. Show your work for K = 3 and K = 5.

7. Given this portion of a web graph

1968_Web graph.jpg

a) Show the node adjacency matrix

b) Convert the adjacency matrix to a transition probability matrix (i.e. Markov chain) for PageRank.

9. Complete the table below so that the cosine similarity for the query "brown cat" against document three is 1.0 ( SMART nnc.nnc ).

 

q

qq

d1

d2

d3

dd1

dd2

dd→3

qq•dd1

qq•dd2

qq•dd3

Brown

1

.707

2

1

 

.632

.277

 

.447

.196

 

Cat

1

.707

1

1

 

.316

.277

 

.223

.196

 

how

0

0

1

1

 

.316

.277

 

0

0

 

meow

0

0

0

3

 

0

.832

 

0

0

 

now

0

0

2

1

 

.632

.277

 

0

0

 

Reference no: EM131500263

Questions Cloud

How does this observation relate to the tax incentives : A financial reporter recently commented that McDonalds went through a period when it bought up many of its own franchises.
Establish a web-service business : BGB plans to establish a Web-service business five years from today. The business is expected to last forever once it is established.
Determine the expected growth rate for dividends : Determine the expected growth rate for dividends. What is the stock price using the dividend discount model?
What is difference between the expected returns of stocks : What is the difference between the expected returns of these stocks?
What is the map value for each system : What is the MAP value for each system? What is F ß=1 for each system with 10 documents returned? Show the node adjacency matrix.
Faraway moving company is involved in major plant expansion : The Faraway Moving Company is involved in a major plant expansion that involves the expenditure of ?$219 million in the coming year.
Writing technology in ancient india hs 4006 science and : Write a report on the topic "Writing Technology in Ancient India - HS 4006 - Science and Technology in 20th Century".
What is average accounting rate of return : Delta Mu Delta is considering purchasing some new equipment costing $393,000. What is the average accounting rate of return?
Two mutually exclusive machines : The Perez Company has the opportunity to invest in one of two mutually exclusive machines that will produce a product it will need for the foreseeable future.

Reviews

Write a Review

Computer Engineering Questions & Answers

  Bourne shell and design suitable functions

Bourne shell and design suitable functions

  What is structured programming

Suppose f is a function that returns the result of reversing the string of symbols given as its input, and g is a function that returns the concatenation of the two strings given as its input. If x is the string abcd, what is returned by g(f(x),x)..

  While a typical database is created the structure

When a typical database is created the structure is created before the data is actually loaded into the database. What problems exist when someone wants to add or delete from existing structure? What methods can you see for accomplishing the restr..

  What is the format of main memory address

What is the format of main memory address.

  Dma is executed by a dma controller that doesnt capture

write a 200- to 300-word short-answer response to the followingthe idea behind it is to free up the cpu so it can do

  List five applications of personal computers

List five applications of personal computers. Is there a limit to the applications of computers? Do you envision any radically different and exciting applications in the near future? If so, what?

  Terminate and cause the zombie tasks to be deallocated

while a child process is fork()ed, a parent may wait for the successful completion of the child via the wait() service (or one of its variants) so that the return result of that application can be read from the process descriptor block.

  Design a powerpoint presentation based on the scenario

design a PowerPoint presentation based on the scenario. You have been asked to present tips on time management skills to new students at an online university. Your group will work together to organize and create a presentation with your advice.

  In this exercise you are required to write a computer

in this exercise you are required to write a computer program which will calculate the voltage across a resistor in the

  Find out a website with obvious usability issues

define addressing why you think the site you selected is usable or not. Be sure to include the URL of the website you are referring to.

  Make a 3-4 page paper not comprising title page and

prepare a 3-4 page paper not including title page and references about 350 words per page comparing and contrasting two

  What is the maximum rate

assume an 802.11b station is configured to always reserve the channel with the RTS/CTS sequence. Suppose this station suddenly wants to transmit 1,000 bytes of data, and all other stations are idle at this time.

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd