State Apriori property. Find frequent itemsets and association rules from the transaction database given below using Apriori algorithm. Assume min. support is 50% and min confidence is 75%.
| Transaction ID | Items Purchased |
| 1 | Juice |
| 2 | Juice |
| 3 | Bread, Butter |
| 4 | Juice, Milk |
| 5 | Cheese, Juice, Milk |
How classification differs from regression. Train ID3 classifier using the dataset given below. Then predict class label for the data
| Age | Competition | Type | Profit (Class Label) |
| old | Yes | SW | Down |
| old | No | SW | Down |
| old | No | HW | Down |
| Mid | Yes | SW | Down |
| Mid | Yes | HW | Down |
| Mid | No | HW | Up |
| Mid | No | SW | Up |
| New | Yes | SW | Up |
| New | No | HW | Up |
| New | No | SW | Up |
Why the concept of data mart is important? Discuss different data warehouse scheme with examples.
Attempt any Eight questions
[8x5=40]How OLAP differ from data mining? Explain various stages of KDD with suitable block diagram.
Discuss different ways of smoothing noisy data along with suitable examples.
How many cuboids are possible from 5-dimensional data? Discuss the concept of full cube and iceberg cube.
How K-mediods clustering differs from K-means clustering? Divide the following data points into two clusters using k-mediods algorithm. Show computation upto 2 iterations.
{(70,85), (65,80), (72,88), (75,90), (60,50), (64,55), (62,52), (63,58)}
Discuss working of DBSCAN algorithm.
Which algorithm is used for training multi-layer perceptron? Discuss the algorithm in detail.
Explain the OLAP operations with example.
Discuss the concept of multimedia data mining along with the concept of similarity search.
Write down short notes on
a. Support Vector Machine
b. Multi-dimensional Data Model