Page 47 - DCAP603_DATAWARE_HOUSING_AND_DATAMINING
P. 47

Unit 2: Data Mining Concept




          2.14 summary                                                                          notes


          l z  The information and knowledge gained can be used for applications ranging from business
               management, production control, and market analysis, to engineering design and science
               exploration.

          l z  Data mining can be viewed as a result of the natural evolution of information technology.
          l z  An evolutionary path has been witnessed in the database industry in the development of
               data collection and database creation, data management and data analysis functionalities.

          l z  Data  mining  refers  to  extracting  or  “mining”  knowledge  from  large  amounts  of  data.
               Some other terms like knowledge mining from data, knowledge extraction, data/pattern
               analysis, data archaeology, and data dredging are also used for data mining.

          l z  Knowledge discovery as a process and consists of an iterative sequence of the data cleaning,
               data integration, data selection data transformation, data mining, pattern evaluation and
               knowledge presentation.
          2.15 keywords


          Data cleaning: To remove noise and inconsistent data.
          Data integration: Multiple data sources may be combined.
          Data mining: It refers to extracting or “mining” knowledge from large amounts of data.
          Data selection: Data relevant to the analysis task are retrieved from the database.
          Data transformation: Where data are transformed or consolidated into forms appropriate for
          mining by performing summary or aggregation operations.
          KDD: Many people treat data mining as a synonym for another popularly used term.
          Knowledge  presentation:  Visualisation  and  knowledge  representation  techniques  are  used  to
          present the mined knowledge to the user.
          Pattern evaluation: To identify the truly interesting patterns representing knowledge based on
          some interestingness measures.
          2.16 self assessment


          Choose the appropriate answers:
          1.   KDD stands for
               (a)   Knowledge Design Database
               (b)   Knowledge Discovery Database

               (c)   Knowledge Discovery Design
               (d)   Knowledge Design Development
          2.   DBMS stands for
               (a)   Database Management Spot

               (b)   Design Management System
               (c)   Database Management System
               (d)   Database Manager System




                                           LoveLy professionaL university                                    41
   42   43   44   45   46   47   48   49   50   51   52