Tribhuwan University

Institute of Science and Technology

2075

Bachelor Level / Fourth Year / Seventh Semester / Science

B.Sc in Computer Science and Information Technology (CSC420)

(Data Warehousing and Data Mining)

Full Marks: 60

Pass Marks: 24

Time: 3 Hours

Candidates are required to give their answers in their own words as for as practicable.

The figures in the margin indicate full marks.

Section A

Long Answers Questions

Attempt any TWO questions.
[2*10=20]
1.
List some issues of multimedia mining. Describe how back propagation is used in classification.[10]
2.
Describe how bitmap and join indexing are used to represent OLAP data. Explain the different components of data warehouse.[10]
3.
Give any two types of association rules with example. Trace the results of using the Apriori algorithm on the grocery store example with support threshold 2 and confidence threshold 60%. Show the candidate and frequent itemsets for each database scan. Enumerate all the final frequent itemsets. Also indicate the association rules that are generated.

$\begin{array}{|c|l|}\hline \text{Transaction\_ID} & \text{Items} \\ \hline \text{T1} & \text{HotDogs, Buns, Ketchup} \\ \text{T2} & \text{HotDogs, Buns} \\ \text{T3} & \text{HotDogs, Coke, Chips} \\ \text{T4} & \text{Chips, Coke} \\ \text{T5} & \text{Chips, Ketchup} \\ \text{T6} & \text{HotDogs, Coke, Chips} \\ \hline \end{array}$
[10]
Section B

Short Answers Questions

Attempt any Eight questions.
[8*5=40]
4.
What is the purpose of cluster analysis in data mining? Explain. [5]
5.
How does KDD differ with data mining? Describe the stages of data mining. [5]
6.
Explain OLAP operations with examples. [5]
7.
Explain the primitives of data mining query language. [5]
8.
How different schema are used to model data warehouse? Explain. [5]
9.
Describe the significances of pre-computation of data cube. [5]