CS 609 Database Management and Exploration on the Web
Information Retrieval
Describe the term frequency (TF) and the inverse document frequency
(IDF) in IR system. Compute the similarity of documents based on TF/IDF.
XML
Explain the structure model of semi-structured/XML) databases.
XML schema
Explain the schema model of XML databases. Describe how to check the validity of
given XML databases against their schemas.
XML queries
Explain the basic concepts of XML query models.
Information integration
Describe (at a high-level) the schema-level matching techniques.