Page 259 - DCAP603_DATAWARE_HOUSING_AND_DATAMINING
P. 259
Unit 13: Metadata and Data Warehouse Quality
the results notes
Preparation was highly efficient. The Trillium Software System made it easy to modularize
pattern-matching logic. Its patternmatching techniques gave us the ability to quickly
identify that one scenario out of thousands of possible combinations that applied to an
individual.
With all preparation completed, the team used the Trillium Software System’s batch
processing facility to cleanse and load data. The process was completed in half the estimated
time due to the adequate preparation and match logic debugging.
AT&T relies on the cleansed data in every customer-based business process and system. As
a result, the company knows its customers better and consequently provides better service.
The Trillium Software System gives the company a cleansing platform that makes it easier
to refine the data cleansing process for optimal results.
13.3 summary
l z We have extended our meta modeling framework for data warehouse architecture and
quality by a model for data warehouse processes and have specialized this model to the
case of data warehouse evolution. In detail, we have addressed the problem of evolution
of data warehouse views.
l z The management of the metadata in our repository system.
l z ConceptBase allows us to query and analyze the stored metadata for errors and
deficiencies.
l z In addition, features like client notification and active rules of ConceptBase support the
maintenance of the data warehouse components and keep data warehouse users up-to-
date on the status of the data warehouse.
13.4 keywords
Data Quality: Data quality (DQ) is an extremely important issue since quality determines the
data’s usefulness as well as the quality of the decisions based on the data.
Data Warehouse Staging Area: The Data Warehouse Staging Area is temporary location where
data from source systems is copied.
Logical Schema: The logical schema provides an understandable view of the data in the Data
Warehouse, and supports an efficient import process.
Physical Store: The physical store for the Data Warehouse includes one database that you can
query using SQL queries.
13.5 self assessment
Choose the appropriate answers:
1. HTML stands for:
(a) Hypertext Makeup Language
(b) Hypertext Markup Language
(c) Heterogeneous Markup Language
(d) Homogenous Markup Language
LoveLy professionaL university 253