CSIT 7th Semester
Data Warehousing And Data Mining Board Question Paper 2080


CSC 410-2080 ✡
Tribhuvan University
Institute of Science and Technology
2080
Bachelor Level/Fourth Year/Seventh Semester/Science
Computer Science Information Technology (CSC 410)
(Data Warehousing And Data Mining)
(New Course)
Full Marks:60 Pass Marks:24 Time:3 hours

Candidates are required to give their answers in their own words as for as practicable.
The figures in the margin indicate full marks

Section A
Long Answer Questions
Attempt any Two question.
[2x10=20]
1.

State Apriori property. Find frequent itemsets and association rules from the transaction database given below using Apriori algorithm. Assume min. support is 50% and min confidence is 75%.

Transaction IDItems Purchased
1Juice
2Juice
3Bread, Butter
4Juice, Milk
5Cheese, Juice, Milk

2.

How classification differs from regression. Train ID3 classifier using the dataset given below. Then predict class label for the data

AgeCompetitionTypeProfit (Class Label)
oldYesSWDown
oldNoSWDown
oldNoHWDown
MidYesSWDown
MidYesHWDown
MidNoHWUp
MidNoSWUp
NewYesSWUp
NewNoHWUp
NewNoSWUp

3.

Why the concept of data mart is important? Discuss different data warehouse scheme with examples.

Section B

Attempt any Eight questions

[8x5=40]
4.

How OLAP differ from data mining? Explain various stages of KDD with suitable block diagram.

5.

Discuss different ways of smoothing noisy data along with suitable examples.

6.

How many cuboids are possible from 5-dimensional data? Discuss the concept of full cube and iceberg cube.

7.

How K-mediods clustering differs from K-means clustering? Divide the following data points into two clusters using k-mediods algorithm. Show computation upto 2 iterations.
{(70,85), (65,80), (72,88), (75,90), (60,50), (64,55), (62,52), (63,58)}

8.

Discuss working of DBSCAN algorithm.

9.

Which algorithm is used for training multi-layer perceptron? Discuss the algorithm in detail.

10.

Explain the OLAP operations with example.

11.

Discuss the concept of multimedia data mining along with the concept of similarity search.

12.

Write down short notes on
a. Support Vector Machine
b. Multi-dimensional Data Model