Page 160 - DCAP606_BUSINESS_INTELLIGENCE
P. 160

Unit 11: Data Mining




          of product selection and placement conclusions, coupon offers etc. Companies such as phone  Notes
          service providers and music clubs can use data mining to create a “churn analysis,” to assess
          which customers are expected to stay as subscribers and which ones are likely to switch to a
          competitor.
          In the public sector, data mining applications were initially used as a means to detect fraud and
          waste, but now they are used for purposes such as measuring and improving program
          performance.

          Self Assessment


          Fill in the blanks:
          4.   Information can be converted into .................................
          5.   ............................ refers to removal noise and inconsistent data.
          6.   .......................... method uses some variable to predict unknown or future values of other
               variables.

          11.4 Data Mining Issues


          Privacy

          One of the key matters raised by data mining technology is not an enterprise or technological
          one, but a social one. It is the issue of individual privacy. Data mining makes it possible to
          investigate routine enterprise transactions and glean a significant amount of information about
          persons buying habits and preferences.

          Data Integrity

          Clearly, data analysis can only be as good as the data that is being analysed. A key implementation
          dispute is integrating inconsistent or redundant data from distinct sources.


                 Example: A bank may sustain credit cards accounts on several distinct databases. The
          addresses (or even the titles) of a single cardholder may be different in each. Software should
          convert data from one system to another and choose the recently entered address.

          Confusion

          Another issue of concern is to decide is if it is better to set up a relational database structure or
          a multidimensional one. In a relational structure, data is retained in tables, allowing ad hoc
          queries. In a multidimensional structure, on the other hand, groups of cubes are arranged in
          arrays, with subgroups created according to category. While multidimensional organisations
          facilitate multidimensional data mining, relational structures perform better in client/server
          environments.

          Cost

          Finally, there is the issue of cost. While system hardware costs have fallen spectacularly inside
          the past five years, data mining and data warehousing are inclined to be self-reinforcing. The
          more mighty the data mining queries, the larger the utility of the data being gleaned from the





                                           LOVELY PROFESSIONAL UNIVERSITY                                   155
   155   156   157   158   159   160   161   162   163   164   165