Advanced Database Management Systems - Pune University December 2010 Questions / ADBMS November December 2010 Question Paper / B.Tech Information Technology ADBMS Pune University question paper with answer
Total No. of Questions : 12] [Total No. of Pages : 7 P1319
[3864]-417
B.E. (IT)
ADVANCED DATABASE MANAGEMENT
(2003 Course) (414442)
Time : 3 Hours] [Max. Marks : 100
Instructions to the candidates :
1) Answers to the two sections should be written in separate books.
2) Neat diagrams must be drawn wherever necessary.
3) Assume suitable data, if necessary.
4) Section I : Q. 1 or Q. 2, Q. 3 or Q. 4, Q. 5 or Q. 6.
5) Section II : Q. 7 or Q. 8, Q. 9 or Q. 10, Q. 11 or Q. 12.
SECTION - I
Q1) a) Explain Parallel Hash Join with suitable example. [6]
b) Histograms are used for constructing load balanced range partition. [6]
i) Suppose a histogram where values are between 1 and 100, and are partitioned into 10 ranges, 1-10, 11-20, ........, 91-100, with frequencies 15, 5, 20, 10, 10, 5, 5, 20, 5 and 5, respectively. Express a load balanced range partitioning function to divide the values into 5 partitions.
ii) Write an algorithm for computing a balanced range partition with p partitions, given a histogram of frequency distributions containing n ranges.
c) Explain a nonuniform memory architecture (NUMA). [5]
OR
Q2) a) Describe interoperation parallelism, left-deep trees versus bushy trees, and query cost estimation. [6]
b) Evaluate how well partitioning techniques support the following types of data access. [6]
i) Scanning the entire relation.
ii) Locating tuple associatively.
iii) Locating all tuples such that the value of given attribute lies within a specified range.
c) Explain cache-coherency protocol. [5]
Q3) a) State different types of failures in distributed systems and explain failure handling in distributed database using 2 Phase Commit Protocol. [5]
b) Consider the relations: [7]
Employee (name, address, salary, plant_number)
Machine (machine_number, type, plant_number)
Assume that the employee relation is fragmented horizontally by plant_number, and that each fragment is stored locally at its corresponding plant site. Assume that machine relation is stored in its entirely at the Armonk site. Describe a good strategy for processing each of the following queries.
i) Find all employees at the plant that contains machine number 101.
ii) Find all machines at the Almaden plant.
iii) Find employee ⋈ machine.
c) Explain the technique that the database system you are using provides for dealing with inconsistent states that can be reached with lazy propagation of updates. [5]
OR
Q4) a) Define semi-join. Compute semi-join r ⋉s for the relations r and s. [5]
Relation r | | Relation s | ||||
A | B | C | | C | D | E |
1 | 2 | 3 | | 3 | 4 | 5 |
4 | 5 | 6 | | 3 | 6 | 8 |
1 | 2 | 4 | | 2 | 3 | 2 |
5 | 3 | 2 | | 1 | 4 | 1 |
8 | 9 | 7 | | 1 | 2 | 3 |
b) Consider multiple-granularity locking protocol. In distributed databases, the site containing the root object in the hierarchy can become a bottleneck. Modify the protocol to allow only intension locks on the root and implicitly grant all possible intension locks to every transaction. [7]
i) Explain why this modification works correctly, in that transactions continue to be able to set locks on desired parts of the hierarchy.
ii) Explain how it reduces the demand on the root.
c) Explain how LDAP can be used to provide multiple hierarchical view of data, without replicating the base-level data. [5]
Q5) a) Consider following DTD for bibliography. [12]
<!ELEMENT bib (book*)>
<!ELEMENT book (title, (author+ | editor+), publisher, price)>
<!ATTLIST book year CDATA #REQUIRED>
<!ELEMENT author (last, first)>
<!ELEMENT editor (last, first, affiliation)>
<!ELEMENT title (#PCDATA)>
<!ELEMENT last (#PCDATA)>
<!ELEMENT first (#PCDATA)>
<!ELEMENT affiliation (#PCDATA)>
<!ELEMENT publisher (#PCDATA)>
<!ELEMENT price (#PCDATA)>
Create XML document, XML Schemas and solve the following queries in XQuery on the bibliography fragment.
i) List books published by Addison-Wesley after 1991, including their year and title.
ii) Find pairs of books that have different titles but the same set of authors (possibly in a different order).
iii) For each book in the bibliography, list the title and authors, grouped inside a “result” element.
b) Describe the various issues for efficient evaluation of XML Queries. [4]
OR
Q6) a) <?xml version = "1.0" encoding = "UTF-8"?> [8]
<!ELEMENT bids (bid_tuple*)>
<!ELEMENT bid_tuple (userid, itemno, bid, bid_date)>
<!ELEMENT userid (#PCDATA)>
<!ELEMENT itemno (#PCDATA)>
<!ELEMENT bid (#PCDATA)>
<!ELEMENT bid_date (#PCDATA)>
Create XML document, XML Schemas and solve the following queries in XQuery on the bibliography fragment.
i) List the item number and description of the item(s) that received the largest number of bids, and the number of bids it (or they) received.
ii) List item numbers and average bids for items that have received three or more bids, in descending order by average bid.
b) Explain XML schemas restrictions and facets. [4]
c) Write a short note on SOAP. [4]
SECTION – II
Q7) a) Explain Binning method and Regression method to handle noisy data in Data Warehouse. [6]
b) Explain Data Reduction strategies in Data Warehouse. [6]
c) Write a short note on Materialized view. [5]
OR
Q8) a) Design conceptual model for Financial Services data warehouse. [6]
b) List various features of fact data and explain the guidelines to be followed while determining facts from dimensions. [6]
c) Explain how meta data can be used for data transformation and loading, and query generation in data warehouse. [5]
Q9) a) Consider following training set : [10]
Outlook | Temperature | Humidity | Wind | Class Attribute |
Sunny | Hot | High | FALSE | N |
Sunny | Hot | High | TRUE | N |
Overcast | Hot | High | FALSE | P |
Rain | Mild | High | FALSE | P |
Rain | Cool | Normal | FALSE | P |
Rain | Cool | Normal | TRUE | N |
Overcast | Cool | Normal | TRUE | P |
Sunny | Mild | High | FALSE | N |
Sunny | Cool | Normal | FALSE | P |
Rain | Mild | Normal | FALSE | P |
Sunny | Mild | Normal | TRUE | P |
Overcast | Mild | High | TRUE | P |
Overcast | Hot | Normal | FALSE | P |
Rain | Mild | High | TRUE | N |
Write ID3 Classification algorithm. Construct a decision tree based on above training set using ID3.
b) Explain how to handle candidate item sets using hash tree with suitable example. [7]
OR
Q10) a) Consider following dataset: [9]
Object | Attribute 1 | Attribute 2 | Attribute 3 |
A | 1 | 1 | 2 |
B | 3 | 2 | 4 |
C | 3 | 4 | 6 |
D | 4 | 6 | 3 |
Write K-means clustering algorithm. Find the cluster for the objects in data set with K = 2.
b) Consider following training data set : [8]
Income | Student | Credit_rating | Buys_Computer | |
<=30 | high | No | Fair | no |
<=30 | high | No | Excellent | no |
31...40 | high | No | Fair | yes |
>40 | medium | No | Fair | yes |
>40 | low | Yes | Fair | yes |
>40 | low | Yes | Excellent | no |
31...40 | low | Yes | Excellent | yes |
<=30 | medium | No | Fair | no |
<=30 | low | Yes | Fair | yes |
>40 | medium | Yes | Fair | yes |
<=30 | medium | Yes | Excellent | yes |
31...40 | medium | No | Excellent | yes |
31...40 | high | Yes | Fair | yes |
>40 | medium | No | Excellent | no |
Write Naïve Bayesian Classifier algorithm. Consider Buys_Computer as a Class Attribute with values yes and no classes. Find the class label for data sample.
X = (age < = 30, Income = medium, Student = yes Credit_rating = Fair) using Naïve Bayesian Classifier.
Q11) a) Define Information Retrieval System. Describe how it is differ from database system. [6]
b) Write short notes on Signature Files. [5]
c) Explain the following terms in Information Retrieval with suitable example. [5]
i) Synonyms
ii) Homonyms
iii) Proximity
iv) TF-IDF
OR
Q12) a) Explain any two techniques that support the evaluation of Boolean and Ranked queries. [6]
b) Write short notes on: [10]
i) Web Crawler.
ii) Document Indexing.
***************
No comments:
Post a Comment