Page 259 - DCAP603_DATAWARE_HOUSING_AND_DATAMINING
P. 259

Unit 13: Metadata and Data Warehouse Quality




             the results                                                                        notes
             Preparation was highly efficient. The Trillium Software System made it easy to modularize
             pattern-matching  logic.  Its  patternmatching  techniques  gave  us  the  ability  to  quickly
             identify that one scenario out of thousands of possible combinations that applied to an
             individual.
             With  all  preparation  completed,  the  team  used  the  Trillium  Software  System’s  batch
             processing facility to cleanse and load data. The process was completed in half the estimated
             time due to the adequate preparation and match logic debugging.
             AT&T relies on the cleansed data in every customer-based business process and system. As
             a result, the company knows its customers better and consequently provides better service.
             The Trillium Software System gives the company a cleansing platform that makes it easier
             to refine the data cleansing process for optimal results.


          13.3 summary


          l z  We have extended our meta modeling framework for data warehouse architecture and
               quality by a model for data warehouse processes and have specialized this model to the
               case of data warehouse evolution. In detail, we have addressed the problem of evolution
               of data warehouse views.
          l z  The management of the metadata in our repository system.
          l z  ConceptBase  allows  us  to  query  and  analyze  the  stored  metadata  for  errors  and
               deficiencies.
          l z  In addition, features like client notification and active rules of ConceptBase support the
               maintenance of the data warehouse components and keep data warehouse users up-to-
               date on the status of the data warehouse.

          13.4 keywords

          Data Quality: Data quality (DQ) is an extremely important issue since quality determines the
          data’s usefulness as well as the quality of the decisions based on the data.

          Data Warehouse Staging Area: The Data Warehouse Staging Area is temporary location where
          data from source systems is copied.
          Logical Schema: The logical schema provides an understandable view of the data in the Data
          Warehouse, and supports an efficient import process.
          Physical Store: The physical store for the Data Warehouse includes one database that you can
          query using SQL queries.

          13.5 self assessment


          Choose the appropriate answers:
          1.   HTML stands for:
               (a)   Hypertext Makeup Language
               (b)   Hypertext Markup Language
               (c)   Heterogeneous Markup Language

               (d)   Homogenous Markup Language



                                           LoveLy professionaL university                                   253
   254   255   256   257   258   259   260   261   262   263   264