Professional Documents
Culture Documents
Characteristics
Relational OLAP
Multidimensional OLAP
10/15/08 Sudarshan 2
OLAP
(Online Analytical Processing)
10/15/08 Sudarshan 3
Multi-Tiered Architecture
Monitor
& OLAP Server
other Metadata
source Integrator
s Analysis
Operational Extract Query
Transform Data Serve Reports
DBs Load
Refresh
Warehouse Data mining
Data Marts
Data Sources Data Storage OLAP Engine Front-End Tools
10/15/08 Sudarshan 4
What is OLAP?
10/15/08 Sudarshan 5
OLAP
Create an advanced data analysis environment
that supports decision making, business
modeling and operation research activities.
Characteristics of OLAP
• Use multidimensional data analysis technique
• Provide advance database support
• Provide easy-to-use end user interfaces.
• Support client/server architecture.
10/15/08 Sudarshan 6
Two types of database activity
OLTPand OLAP
OLTP: On-Line Transaction Processing
Short transactions, both queries and updates
(e.g., update account balance, enroll in course)
Queries are simple
(e.g., find account balance, find grade in course)
Updates are frequent
(e.g., concert tickets, seat reservations, shopping
carts)
10/15/08 Sudarshan 7
OLAP: On-Line Analytical Processing
Long transactions, usually complex queries
(e.g., all statistics about all sales, grouped by
dept and month)
“Data mining” operations
Infrequent updates
10/15/08 Sudarshan 8
OLTP Compared With OLAP
10/15/08 Sudarshan 11
Multidimensional OLAP
MOLAP extends OLAP functionality to MDBMS
Best suited to manage, store or analyze
multidimensional data.
Proprietary techniques used in MDBMS.
MDBMS and users visualize the stored data as a 3-
dimensional cube i.e data cube.
MOLAP data bases are known to be much faster
than their ROLAP counter parts.
Data cubes are held in memory called “cube cache”.
10/15/08 Sudarshan 12
ROLAP vs MOLAP
Characteristics ROLAP MOLAP
10/15/08 Sudarshan 13
ROLAP vs MOLAP
Characteristics ROLAP MOLAP
10/15/08 Sudarshan 14
Implementation of the OLAP Server
ROLAP: Relational OLAP – data is stored in
tables in relational database or extended
relational databases. They use an RDBMS to
manage the warehouse data and aggregations
using often a star schema.
• They support extensions to SQL.
Advantage: Scalable.
Disadvantage: No direct access to cells.
10/15/08 Sudarshan 15
Implementation of the OLAP Server
MOLAP:Multidimensional OLAP - implements the
multidimensional view by storing data in special
multidimensional data structures.
10/15/08 Sudarshan 16
Characteristics of OLAP
Fast - means that the system targeted to deliver most
responses
to user within about five second, with the simplest
analysis taking no more than one second and very few
taking more than 20 sec.
Share - means that the system implements all the
security requirements for confidentiality and, if multiple
write access is needed, concurrent update location at
an appropriated level not all applications need users to
write data back, but for the growing number that do,
the system should be able to handle multiple updates in
a timely, secure manner.
10/15/08 Sudarshan 17
Analysis - means that the system can cope with any
business logic and statistical analysis that it relevant for
the application and the user, keep it easy enough for
the target user. Although some pre programming may
be needed we do not think it acceptable if all
application definitions have to be allow the user to
define new adhoc calculations as part of the analysis
and to report on the data in any desired way, without
having to program so we exclude products (like Oracle
Discoverer) that do not allow the user to define new
adhoc calculation as part of the analysis and to report
on the data in any desired product that do not allow
adequate end user oriented calculation flexibility.
10/15/08 Sudarshan 18
Multidimensional - is the key requirement.
OLAP system must provide a multidimensional
conceptual view of the data, including full
support for hierarchies, as this is certainly the
most logical way to analyze business and
organizations.
Information - are all of the data and derived
information needed? Wherever it is and
however much is relevant for the application.
We are measuring the capacity of various
products in terms of how much input data they
can handle, not how many gigabytes they take
to store it.
10/15/08 Sudarshan 19
What appears to be
the end may really
be a new beginning.
10/15/08 Sudarshan 20