Sunday, March 22, 2015

Advanced Database Management Systems - Pune University December 2010 Questions


Total No. of Questions : 12]                                          [Total No. of Pages : 7 P1319
[3864]-417
B.E. (IT)
ADVANCED DATABASE MANAGEMENT
(2003 Course) (414442)
Time : 3 Hours]                                                                       [Max. Marks : 100
Instructions to the candidates :
1) Answers to the two sections should be written in separate books.
2) Neat diagrams must be drawn wherever necessary.
3) Assume suitable data, if necessary.
4) Section I : Q. 1 or Q. 2, Q. 3 or Q. 4, Q. 5 or Q. 6.
5) Section II : Q. 7 or Q. 8, Q. 9 or Q. 10, Q. 11 or Q. 12.

SECTION - I

Q1) a) Explain Parallel Hash Join with suitable example. [6]
b) Histograms are used for constructing load balanced range partition. [6]
i) Suppose a histogram where values are between 1 and 100, and are partitioned into 10 ranges, 1-10, 11-20, ........, 91-100, with frequencies 15, 5, 20, 10, 10, 5, 5, 20, 5 and 5, respectively. Express a load balanced range partitioning function to divide the values into 5 partitions.
ii) Write an algorithm for computing a balanced range partition with p partitions, given a histogram of frequency distributions containing n ranges.
c) Explain a nonuniform memory architecture (NUMA). [5]
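
One possible approach to Q1 b) is sketched below in Python. It assumes values are spread uniformly inside each histogram range, which the question does not state. The name `balanced_cuts` and its parameters are illustrative only.

```python
def balanced_cuts(bounds, freqs, p):
    """Return p-1 cut points for a load-balanced range partition.

    bounds: list of inclusive (lo, hi) histogram ranges, in ascending order
    freqs:  number of values falling in each range
    p:      desired number of partitions
    Assumption: values are uniformly distributed within each range.
    """
    target = sum(freqs) / p      # ideal number of values per partition
    cuts = []
    cum = 0.0                    # values seen in the ranges before this one
    k = 1                        # index of the next cut to place
    for (lo, hi), f in zip(bounds, freqs):
        width = hi - lo + 1
        # Place every cut whose cumulative target falls inside this range.
        while k < p and cum + f >= k * target:
            need = k * target - cum          # values to take from this range
            cuts.append(lo - 1 + need / f * width)
            k += 1
        cum += f
    return cuts


# Histogram from Q1 b) i): ranges 1-10, 11-20, ..., 91-100.
bounds = [(lo, lo + 9) for lo in range(1, 100, 10)]
freqs = [15, 5, 20, 10, 10, 5, 5, 20, 5, 5]
cuts = balanced_cuts(bounds, freqs, 5)
print(cuts)
```

Under the uniformity assumption the cut points are 20, 30, 50 and 75, giving partitions 1-20, 21-30, 31-50, 51-75 and 76-100 with about 20 values each.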

OR

Q2) a) Describe interoperation parallelism, left-deep trees versus bushy trees, and query cost estimation. [6]
b) Evaluate how well partitioning techniques support the following types of data access. [6]
i) Scanning the entire relation.
ii) Locating tuple associatively.
iii) Locating all tuples such that the value of given attribute lies within a specified range.
c) Explain cache-coherency protocol. [5]

Q3) a) State different types of failures in distributed systems and explain failure handling in distributed database using 2 Phase Commit Protocol. [5]
b) Consider the relations: [7]
Employee (name, address, salary, plant_number)
Machine (machine_number, type, plant_number)
Assume that the employee relation is fragmented horizontally by plant_number, and that each fragment is stored locally at its corresponding plant site. Assume that the machine relation is stored in its entirety at the Armonk site. Describe a good strategy for processing each of the following queries.
i) Find all employees at the plant that contains machine number 101.
ii) Find all machines at the Almaden plant.
iii) Find employee ⋈ machine.
c) Explain the technique that the database system you are using provides for dealing with inconsistent states that can be reached with lazy propagation of updates. [5]

OR

Q4) a) Define semi-join. Compute the semi-join r ⋉ s for the relations r and s. [5]

Relation r          Relation s
A  B  C             C  D  E
1  2  3             3  4  5
4  5  6             3  6  8
1  2  4             2  3  2
5  3  2             1  4  1
8  9  7             1  2  3

b) Consider the multiple-granularity locking protocol. In distributed databases, the site containing the root object in the hierarchy can become a bottleneck. Modify the protocol to allow only intention locks on the root and implicitly grant all possible intention locks to every transaction. [7]
i) Explain why this modification works correctly, in that transactions continue to be able to set locks on desired parts of the hierarchy.
ii) Explain how it reduces the demand on the root.
c) Explain how LDAP can be used to provide multiple hierarchical views of data, without replicating the base-level data. [5]
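
For Q4 a), a semi-join keeps the tuples of r that join with at least one tuple of s. The Python sketch below assumes the intended operation is the semi-join of r with s on their shared attribute C. The helper name `semi_join` is hypothetical.

```python
# Tuples of r are (A, B, C); tuples of s are (C, D, E), as given in Q4 a).
r = [(1, 2, 3), (4, 5, 6), (1, 2, 4), (5, 3, 2), (8, 9, 7)]
s = [(3, 4, 5), (3, 6, 8), (2, 3, 2), (1, 4, 1), (1, 2, 3)]


def semi_join(left, right, left_pos, right_pos):
    """Return the tuples of `left` whose join attribute appears in `right`."""
    keys = {t[right_pos] for t in right}        # distinct join values in s
    return [t for t in left if t[left_pos] in keys]


# C is position 2 in r and position 0 in s.
result = semi_join(r, s, left_pos=2, right_pos=0)
print(result)
```

Only the join column of s has to be shipped to the site holding r, which is why semi-joins are attractive in distributed query processing.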

Q5) a) Consider the following DTD for a bibliography. [12]
<!ELEMENT bib (book*)>
<!ELEMENT book (title, (author+ | editor+), publisher, price)>
<!ATTLIST book year CDATA #REQUIRED>
<!ELEMENT author (last, first)>
<!ELEMENT editor (last, first, affiliation)>
<!ELEMENT title (#PCDATA)>
<!ELEMENT last (#PCDATA)>
<!ELEMENT first (#PCDATA)>
<!ELEMENT affiliation (#PCDATA)>
<!ELEMENT publisher (#PCDATA)>
<!ELEMENT price (#PCDATA)>
Create XML document, XML Schemas and solve the following queries in XQuery on the bibliography fragment.
i) List books published by Addison-Wesley after 1991, including their year and title.
ii) Find pairs of books that have different titles but the same set of authors (possibly in a different order).
iii) For each book in the bibliography, list the title and authors, grouped inside a “result” element.
b) Describe the various issues for efficient evaluation of XML Queries. [4]

OR

Q6) a) <?xml version = "1.0" encoding = "UTF-8"?> [8]
<!ELEMENT bids (bid_tuple*)>
<!ELEMENT bid_tuple (userid, itemno, bid, bid_date)>
<!ELEMENT userid (#PCDATA)>
<!ELEMENT itemno (#PCDATA)>
<!ELEMENT bid (#PCDATA)>
<!ELEMENT bid_date (#PCDATA)>
Create XML document, XML Schemas and solve the following queries in XQuery on the bibliography fragment.
i) List the item number and description of the item(s) that received the largest number of bids, and the number of bids it (or they) received.
ii) List item numbers and average bids for items that have received three or more bids, in descending order by average bid.
b) Explain XML Schema restrictions and facets. [4]
c) Write a short note on SOAP. [4]

SECTION - II

Q7) a) Explain Binning method and Regression method to handle noisy data in Data Warehouse. [6]
b) Explain Data Reduction strategies in Data Warehouse. [6]
c) Write a short note on Materialized view. [5]

OR

Q8) a) Design conceptual model for Financial Services data warehouse. [6]
b) List various features of fact data and explain the guidelines to be followed while determining facts from dimensions. [6]
c) Explain how meta data can be used for data transformation and loading, and query generation in data warehouse. [5]

Q9) a) Consider following training set : [10]

Outlook    Temperature  Humidity  Wind   Class Attribute
Sunny      Hot          High      FALSE  N
Sunny      Hot          High      TRUE   N
Overcast   Hot          High      FALSE  P
Rain       Mild         High      FALSE  P
Rain       Cool         Normal    FALSE  P
Rain       Cool         Normal    TRUE   N
Overcast   Cool         Normal    TRUE   P
Sunny      Mild         High      FALSE  N
Sunny      Cool         Normal    FALSE  P
Rain       Mild         Normal    FALSE  P
Sunny      Mild         Normal    TRUE   P
Overcast   Mild         High      TRUE   P
Overcast   Hot          Normal    FALSE  P
Rain       Mild         High      TRUE   N

Write the ID3 classification algorithm. Construct a decision tree based on the above training set using ID3.
b) Explain how to handle candidate item sets using hash tree with suitable example. [7]
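
The first step of ID3 in Q9 a) is choosing the root attribute by information gain. The Python sketch below shows only that step, not the full recursive tree construction. The names `entropy` and `info_gain` are illustrative.

```python
from collections import Counter
from math import log2

ATTRS = ["Outlook", "Temperature", "Humidity", "Wind"]
# Training set from Q9 a); the last field of each row is the class (P or N).
ROWS = [
    ("Sunny", "Hot", "High", "FALSE", "N"),
    ("Sunny", "Hot", "High", "TRUE", "N"),
    ("Overcast", "Hot", "High", "FALSE", "P"),
    ("Rain", "Mild", "High", "FALSE", "P"),
    ("Rain", "Cool", "Normal", "FALSE", "P"),
    ("Rain", "Cool", "Normal", "TRUE", "N"),
    ("Overcast", "Cool", "Normal", "TRUE", "P"),
    ("Sunny", "Mild", "High", "FALSE", "N"),
    ("Sunny", "Cool", "Normal", "FALSE", "P"),
    ("Rain", "Mild", "Normal", "FALSE", "P"),
    ("Sunny", "Mild", "Normal", "TRUE", "P"),
    ("Overcast", "Mild", "High", "TRUE", "P"),
    ("Overcast", "Hot", "Normal", "FALSE", "P"),
    ("Rain", "Mild", "High", "TRUE", "N"),
]


def entropy(rows):
    """Shannon entropy (in bits) of the class labels in `rows`."""
    counts = Counter(row[-1] for row in rows)
    total = len(rows)
    return -sum(c / total * log2(c / total) for c in counts.values())


def info_gain(rows, col):
    """Entropy reduction obtained by splitting `rows` on column `col`."""
    groups = {}
    for row in rows:
        groups.setdefault(row[col], []).append(row)
    remainder = sum(len(g) / len(rows) * entropy(g) for g in groups.values())
    return entropy(rows) - remainder


gains = {name: info_gain(ROWS, i) for i, name in enumerate(ATTRS)}
best = max(gains, key=gains.get)
print(best, round(gains[best], 3))
```

ID3 would then repeat the same computation on each subset produced by the chosen attribute until every subset is pure or no attributes remain.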

OR

Q10) a) Consider following dataset: [9]

Object  Attribute 1  Attribute 2  Attribute 3
A       1            1            2
B       3            2            4
C       3            4            6
D       4            6            3

Write the K-means clustering algorithm. Find the clusters for the objects in the data set with K = 2.

b) Consider following training data set : [8]

Age      Income  Student  Credit_rating  Buys_Computer
<=30     high    No       Fair           no
<=30     high    No       Excellent      no
31...40  high    No       Fair           yes
>40      medium  No       Fair           yes
>40      low     Yes      Fair           yes
>40      low     Yes      Excellent      no
31...40  low     Yes      Excellent      yes
<=30     medium  No       Fair           no
<=30     low     Yes      Fair           yes
>40      medium  Yes      Fair           yes
<=30     medium  Yes      Excellent      yes
31...40  medium  No       Excellent      yes
31...40  high    Yes      Fair           yes
>40      medium  No       Excellent      no

Write the Naïve Bayesian Classifier algorithm. Consider Buys_Computer as the class attribute with classes yes and no. Find the class label for the data sample
X = (age <= 30, Income = medium, Student = yes, Credit_rating = Fair) using the Naïve Bayesian Classifier.
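
A minimal Python sketch of the computation asked for in Q10 b) follows. It assumes the first value in each training row is the age band (the sample X refers to age), lower-cases all values for consistency, and uses plain relative frequencies with no smoothing. The name `nb_scores` is illustrative.

```python
from collections import Counter

# Columns: age, income, student, credit_rating, buys_computer (class).
DATA = [
    ("<=30", "high", "no", "fair", "no"),
    ("<=30", "high", "no", "excellent", "no"),
    ("31...40", "high", "no", "fair", "yes"),
    (">40", "medium", "no", "fair", "yes"),
    (">40", "low", "yes", "fair", "yes"),
    (">40", "low", "yes", "excellent", "no"),
    ("31...40", "low", "yes", "excellent", "yes"),
    ("<=30", "medium", "no", "fair", "no"),
    ("<=30", "low", "yes", "fair", "yes"),
    (">40", "medium", "yes", "fair", "yes"),
    ("<=30", "medium", "yes", "excellent", "yes"),
    ("31...40", "medium", "no", "excellent", "yes"),
    ("31...40", "high", "yes", "fair", "yes"),
    (">40", "medium", "no", "excellent", "no"),
]


def nb_scores(data, sample):
    """Return P(class) * product of P(attribute value | class) per class."""
    class_counts = Counter(row[-1] for row in data)
    scores = {}
    for label, n in class_counts.items():
        score = n / len(data)                      # prior P(class)
        for i, value in enumerate(sample):
            matches = sum(
                1 for row in data if row[-1] == label and row[i] == value
            )
            score *= matches / n                   # P(value | class)
        scores[label] = score
    return scores


x = ("<=30", "medium", "yes", "fair")
scores = nb_scores(DATA, x)
prediction = max(scores, key=scores.get)
print(scores, prediction)
```

The class with the larger score is the predicted label; the scores are proportional to the posterior probabilities, so they do not need to be normalised for the comparison.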

Q11) a) Define Information Retrieval System. Describe how it differs from a database system. [6]
b) Write short notes on Signature Files. [5]
c) Explain the following terms in Information Retrieval with suitable example. [5]
i) Synonyms
ii) Homonyms
iii) Proximity
iv) TF-IDF
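
As an illustration of the TF-IDF term in Q11 c), the Python sketch below uses one common variant: term frequency divided by document length, multiplied by the natural log of (number of documents / documents containing the term). Other weighting variants exist. The three tiny documents and the name `tf_idf` are made up for the example.

```python
from math import log

# Hypothetical mini-collection, already tokenised into words.
docs = {
    "d1": "database systems store structured data".split(),
    "d2": "information retrieval systems rank documents".split(),
    "d3": "web crawler systems fetch documents".split(),
}


def tf_idf(term, doc_id, docs):
    """TF-IDF weight of `term` in document `doc_id` (one common variant)."""
    words = docs[doc_id]
    tf = words.count(term) / len(words)                 # term frequency
    df = sum(1 for w in docs.values() if term in w)     # document frequency
    if df == 0:
        return 0.0                                      # term never occurs
    return tf * log(len(docs) / df)


print(tf_idf("systems", "d1", docs))     # appears in every document
print(tf_idf("retrieval", "d2", docs))   # appears in one document only
```

A term that occurs in every document gets weight zero, while a term confined to few documents is weighted more heavily.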

OR

Q12) a) Explain any two techniques that support the evaluation of Boolean and Ranked queries. [6]
b) Write short notes on: [10]
i) Web Crawler.
ii) Document Indexing.

***************