Page 19 - DCAP603_DATAWARE_HOUSING_AND_DATAMINING
P. 19

Unit 1: Data Warehouse Practice




          1.6.4 oLap server architectures                                                       notes

          We describe here the physical implementation of an OLAP server in a Data Warehouse. There are
          three different possible designs:
          1.   Relational OLAP (ROLAP)
          2.   Multidimensional OLAP (MOLAP)
          3.   Hybrid OLAP (HOLAP)

          roLap

          ROLAP stores the data based on the already familiar relational DBMS technology. In this case,
          data  and  the  related  aggregations  are  stored  in  RDBMS,  and  OLAP  middleware  is  used  to
          implement handling and exploration of data cubes. This architecture focuses on the optimisation
          of the RDBMS back end and provides additional tools and services such as data cube navigation
          logic. Due to the use of the RDBMS back end, the main advantage of ROLAP is scalability in
          handling large data volumes.

                 Example: ROLAP engines include the commercial IBM Informix Metacube (www.ibm.
          com) and the Micro-strategy DSS server (www.microstrategy.com), as well as the open-source
          product Mondrian (mondrian.sourceforge.net).

          MoLap

          In contrast to ROLAP, which uses tuples as the data storage unit, the MOLAP uses a dedicated
          n-dimensional array storage engine and OLAP middleware to manage data. Therefore, OLAP
          queries are realised through a direct addressing to the related multidimensional views (data
          cubes). Additionally, this architecture focuses on pre-calculation of the transactional data into
          the aggregations, which results in fast query execution performance. More specifically, MOLAP
          precalculates and stores aggregated measures at every hierarchy level at load time, and stores
          and indexes these values for immediate retrieval. The full precalculation requires a substantial
          amount of overhead, both in processing time and in storage space. For sparse data, MOLAP
          uses sparse matrix compression algorithms to improve storage utilisation, and thus in general is
          characterised by smaller on-disk size of data in comparison with data stored in RDBMS.

                 Example: MOLAP products are the commercial Hyperion Ebasse (www.hyperion.com)
          and the Applix TM1 (www.applix.com), as well as Palo (www.opensourceolap.org), which is an
          open-source product.

          hoLap

          To  achieve  a  tradeoff  between  ROLAP’s  scalability  and  MOLAP’s  query  performance,  many
          commercial OLAP servers are based on the HOLAP approach. In this case, the user decides which
          portion of the data to store in the MOLAP and which in the ROLAP. For instance, often the low-
          level data are stored using a relational database, while higher-level data, such as aggregations,
          are stored in a separate MOLAP. An example product that supports all three architectures is
          Microsoft’s OLAP Services (www.microsoft.com/), which is part of the company’s SQL Server.

          1.7 getting heterogeneous Data into the Warehouse


          All data warehouses store their data grouped together by subject areas that reflect the general
          usage of the data (Customer, Product, Finance etc.). The general principle used in the majority




                                           LoveLy professionaL university                                    13
   14   15   16   17   18   19   20   21   22   23   24