Doc1: Database is a collection of information that is organized so that it can b
ID: 3747785 • Letter: D
Question
Doc1: Database is a collection of information that is organized so that it can be easily accessed, managed and updated. Most databases contain multiple tables and several fields
Doc2: Search Engine refers to a huge database of internet resources such as web pages, newsgroups, programs, images etc. It helps to locate information on World Wide Web
Doc3: We tend to associate search engines with computer technology. Privacy issues emerges because search engine companies can collect personal information
Based on above ducuments please answer
Boolean and Vector queries.
a.Please design three Boolean queries, (for example, web AND search) and list therelevant documents for each query.
b.Please use the Vector model to query on the inverted index, and compare theresult with the Boolean model. (Hint: you can use cosine similarity and set asimilarity threshold
Explanation / Answer
1. a. database and engine - Doc1 , Doc2 and Doc3
b. newsgroup and organized - Doc 1, Doc 2
c. locate and privacy - Doc2 , Doc3
2. Term DocId Frequency IDF tf = fXidf
database 1,2 2 3/2 - 1.5 3
engine 2,3 2 3/2 - 1.5 3
newsgroup 2 1 3/1 - 3 3
organized 1 1 3/1 - 3 3
locate 2 1 3/1 - 3 3
privacy 3 1 3/1 - 3 3
Related Questions
drjack9650@gmail.com
Navigate
Integrity-first tutoring: explanations and feedback only — we do not complete graded work. Learn more.