Tribhuwan University

Institute of Science and Technology

2081

Bachelor Level / Fourth Year / Eighth Semester / Science

Bachelors in Information Technology (BIT454)

(Data Warehousing and Data Mining)

Full Marks: 60

Pass Marks: 24

Time: 3 Hours

Candidates are required to give their answers in their own words as for as practicable.

The figures in the margin indicate full marks.

Section A

Long Answers Questions

Attempt any TWO questions.
[2*10=20]
1.
List different clustering approaches. Write the algorithm of K-means algorithm. What are its limitations?[10]
2.
What do you understand by classification by back propagation? How is it different from Bayesian classification? Explain.[10]
3.
Generate the frequent itemset from the following data using the Apriori algorithm and find the strong association rules. Minimum Support = 60%, Minimum Confidence = 75%.

$\begin{array}{|c|c|}\hline \text{TID} & \text{Items} \\ \hline 1 & \{A, C, D\} \\ 2 & \{B, C, D\} \\ 3 & \{A, B, C, D\} \\ 4 & \{B, D\} \\ 5 & \{A, B, C, D\} \\ \hline \end{array}$
[10]
Section B

Short Answers Questions

Attempt any Eight questions.
[8*5=40]
4.
What do you mean by text mining? Explain with an example. [5]
5.
How do you carry out social network analysis? List two of its uses. [5]
6.
Illustrate the use of outlier analysis with an example. [5]
7.
Why is data preprocessing required? Explain any two data preprocessing methods. [5]
8.
Differentiate between data warehouse and operational database. [5]
9.
What are the data mining functionalities? Explain. [5]
10.
Write short notes on: (a) OLAP (b) Laplace Smoothing [5]
11.
Define full cube. Discuss the working mechanism of beam search. [5]
12.
Explain any two conceptual modeling techniques of data warehousing. [5]