Page 48 - DCAP603_DATAWARE_HOUSING_AND_DATAMINING
P. 48

Data Warehousing and Data Mining




                    notes          Fill in the blanks:
                                   3.   A ........................ is a collection of tables, each of which is assigned a unique name.

                                   4.   Data  warehouses  are  constructed  via  a  process  of  data  cleaning,  data  integration,  data
                                       transformation, data loading and ........................
                                   5.   A ........................ is a set of records representing transactions, each with a time stamp, an
                                       identifier and a set of items.
                                   6.   ................................... databases include video, images, audio and text media.
                                   7.   Time-series databases contain time related data such ........................
                                   8.   Data ........................ is a summarisation of general features of objects in a target class, and
                                       produces what is called characteristic rules.
                                   9.   ........................ is based on the association rules.
                                   10.   Clustering is also called ........................ classification.
                                   11.   ........................ is a term used to describe the “process of discovering patterns and trends in
                                       large data sets in order to find useful decision-making information.”

                                   2.17 review Questions

                                   1.   What is data mining? How does data mining differ from traditional database access?
                                   2.   Discuss, in brief, the characterization of data mining algorithms.

                                   3.   Briefly explain the various tasks in data mining.
                                   4.   What is visualization? Discuss, in brief, the different visualization techniques.
                                   5.   Discuss the evolution of data mining as a confluence of disciplines.
                                   6.   What issues should be addressed by data mining algorithms and products? Why are they
                                       relevant?
                                   7.   Discuss the need for metrics in data mining.
                                   8.   “Data mining can often have far reaching social implications.” Discuss this statement.
                                   9.   Discuss, in brief, important implementation issues in data mining.
                                   10.   Distinguish between the KDD process and data mining.

                                   11.   Discuss how database and OLTP systems are related to data mining.
                                   12.   Write a short note on the ER model. What advantage does it offer over a trivial DBMS?

                                   answers: self assessment

                                   1.   (b)                              2.   (c)
                                   3.   relational database              4.   periodic data refreshing

                                   5.   transaction database             6.   Multimedia
                                   7.   stock market data                8.   characterization
                                   9.   Association analysis             10.  unsupervised
                                   11.  Data mining






          42                               LoveLy professionaL university
   43   44   45   46   47   48   49   50   51   52   53