Page 9 - DCAP606_BUSINESS_INTELLIGENCE
P. 9

Business Intelligence




                    Notes          1.2.2 Offline Extract, Transform and Load (ETL)

                                   Early on, the one common interface that was provided between the disparate systems in an
                                   association was magnetic tape. Tape formats were standardized, and any system could compose
                                   tapes that could be read by other systems. Thus, the first data warehouses were fed by magnetic
                                   tapes prepared by the various systems inside the association. However, that left the difficulty of
                                   data disparity. The data written by the different systems reflected their native data associations.




                                     Notes  The data written to tape by one system often had little relation to the similar data
                                     written by another system.

                                   Even more important was that the data warehouse’s database was designed to support the
                                   analytical functions needed for the business intelligence function. This database design was
                                   typically a highly organised database with complex indices to support Online Analytical
                                   Processing (OLAP). Databases configured for OLAP allowed complex analytical and ad hoc
                                   queries with rapid execution time. The data fed to the data warehouse from the enterprise
                                   systems was converted to a format significant to the data warehouse.
                                   To explain the difficulty of initially stacking this data into a data warehouse, holding it updated,
                                   and resolving discrepancies, Extract, Transform and Load (ETL) utilities were evolved. As their
                                   name suggests, these utilities extract data from source databases, change/transform them into
                                   the widespread data warehouse format, and load them into the data warehouse, as shown in
                                   Figure 1.2.

                                                       Figure 1.2: Extract/Transform/Load Structure


























                                                           ETL – Extract/Transofrm/Load
                                   Source: http://www.gravic.com/shadowbase/images/uses/etl.png
                                   The transform function is the key to the achievement of this approach. Its job is to request a
                                   series of rules to extracted data so that it is properly formatted for loading into the data warehouse.
                                   An example of transformation rules includes:

                                       The selection of data to load.
                                       The translation of encoded items (for example, 1 for male, 2 for female to M, F).



          4                                 LOVELY PROFESSIONAL UNIVERSITY
   4   5   6   7   8   9   10   11   12   13   14