Calculate the kappa measure between the two judges

Assignment Help Theory of Computation
Reference no: EM131713405

A. Consider these documents:

Consider the table of term frequencies for 3 documents denoted Doc1, Doc2, Doc3 in Figure 6.9. Compute the tf-idf weights for the terms car, auto, insurance, best, for each document, using the idf values from Figure 6.8.

 

Doc1

Doc2

Doc3

car

27

4

24

auto

3

33

0

insurance

0

33

29

best

14

0

17

Figure 6.9 Table tf value

Term

dft

idft

car

18,165

1.65

auto

6723

2.08

insurance

19,241

1.62

best

25,235

1.5

Figure 6.8 Table idf value. The idf's of terms with various frequencies in the Reuters collection of 806,791 documents.

B. Consider these documents:

Compute the vector space similarity between the query "digital cameras" and the document "digital cameras and video cameras" by filling out the empty columns in Table 6.1. Assume N = 10,000,000, logarithmic term weighting (wf columns) for query and document, idf weighting for the query only and cosine normalization for the document only. Treat and as a stop word. Enter term counts in the tf columns. What is the final similarity score?      (Please provide the details of the calculation.)

 

Query

Document

Word

tf

wf

df

idf

qi =wf-idf

tf

wf

di =normalized wf

digital

 

 

10,000

 

 

 

 

 

video

 

 

100,000

 

 

 

 

 

cameras

 

 

50,000

 

 

 

 

 

C. Why is the idf of a term always finite?

D. Sketch the frequency-ordered postings for the data in Figure 6.9.

E. Let the static quality scores for Doc1, Doc2 and Doc3 in Figure 6.11 be respectively 0.25, 0.5 and 1. Sketch the postings for impact ordering when each postings list is ordered by the sum of the static quality score and the Euclidean normalized tf values in Figure 6.11.

F. Derive the equivalence between the two formulas for F measure shown in the following Equation, given that α = 1/(β2 + 1).

F = 1/[α(1/p)+ (1- α)1/R]+= ((β2+1)PR)/(β2P+R)

G. What is the relationship between the value of F1 and the break-even point?

H. Below is a table showing how two human judges rated the relevance of a set of 12 documents to a particular information need (0 = nonrelevant, 1 = relevant). Let us assume that you've written an IR system that for this query returns the set of documents {4, 5, 6, 7, 8}.

Document ID

Judge 1

Judge 2

1

0

0

2

0

0

3

1

1

4

1

1

5

1

0

6

1

0

7

1

0

8

1

0

9

0

1

10

0

1

11

0

1

12

0

1

a. Calculate the kappa measure between the two judges.

b. Calculate precision, recall, and F1 of your system if a document is considered relevant only if the two judges agree.

c. Calculate precision, recall, and F1 of your system if a document is considered relevant if either judge thinks it is relevant.

Reference no: EM131713405

Questions Cloud

Discuss human rights issues where national biases : authority to be effective especially in environmental and human rights issues where national biases or irresponsibility
Develop formal scope statement : Develop a formal Scope Statement. Develop a Communication Plan.
Supreme court exercise the power of judicial review : What is the power of judicial review and how does the Supreme Court exercise the power of judicial review?
Discuss economic structuralism or english school ration : Summarize the advantages and disadvantages of one of the following four schools of thought: Realism, Liberalism, Economic Structuralism or English School Ration
Calculate the kappa measure between the two judges : Let us assume that you've written an IR system that for this query returns set of documents {4, 5, 6, 7, 8}. Calculate the kappa measure between the two judges
What role does the executive play in our government : In considering these specific powers, what role does the Executive play in our government?
Define what are the various roles of the participants : what are the various roles of the participants, and how does power affect both your organization as an entity and the people in your organization
Human resource selection and development across cultures : Human Resource Selection and Development Across Cultures.
Discuss nothing to do with the scale of the cross-section : The contour interval has nothing to do with the scale of the cross-section. If you place a ruler with inches against the graph

Reviews

Write a Review

Theory of Computation Questions & Answers

  Finite-state machine design

Create a finite-state machine design to turn your FPGA development board into a simple programmable music box.

  Redundant sequence identi cation

Redundant sequence identi cation

  Compute a shortest superstring

Dynamic programming algorithm to compute a shortest superstring.

  Propositional and predicate logic

Write down a structural induction principle for the PlayTree free type

  Design a syntactic analyzer

Design a syntactic analyzer for the language specified by the grammar

  Design unambiguous grammar to parse expressions

Write a program would read two numbers and then print all numbers between the first and the second, inclusive. Design unambiguous grammar to parse expressions

  Consider a logic function with three outputs

Consider a logic function with three outputs,  A ,  B , and  C , and three inputs,  D ,  E , and  F . The function is defined as follows:  A  is true if at least one input is true,  B  is true

  Considering a single programmed operating system

Considering a single programmed operating system, what is the minimal total time required to complete executions of the two processes? You should explain your answer with a diagram.

  How to construct an nfa

Give a construction that assumes you are given a DFA for L and show how to construct an NFA (with or without ε-moves) to recognize sort(L).

  Equivalence classes to construct minimal dfa for language

How many equivalence classes does this relation have and what are they? Use these equivalence classes to construct the minimal DFA for the language.

  Impact of moore-s law on data center costs

Discuss the impact of Moore's law on data center costs on such things as servers and communications equipment. List at least 3 steps or recommendations your data center can take to offset some or all of the effect of Moore's law.

  Problem encountered in statements in predicate logic

How the problem would be encountered in attempting to represent the following statements in Predicate logic. it should be possible to: John only likes to see French movies.

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd