Professional Documents
Culture Documents
Scenario 1
ABC Pvt Ltd is a company with branches at
Mumbai, Delhi, Chennai and Banglore. The
Sales Manager wants quarterly sales report.
Each branch has a separate operational system.
Delhi
Sales per item type per branch
for first quarter.
Chennai
Banglore
Sales
Manager
Report
Delhi
Data
Warehouse
Chennai
Banglore
Query &
Analysis tools
Sales
Manager
Scenario 2
One Stop Shopping Super Market has huge
operational database.Whenever Executives wants
some report the OLTP system becomes
slow and data entry operators have to wait for
some time.
Operational
Database
Management
Solution 2
Extract
Solution 2
Data Entry
Operator
Report
Transaction
Data Entry
Operator
Operational
database
Extract
data
Data
Warehouse
Manager
Scenario 3
Cakes & Cookies is a small,new company.President
of the company wants his company should grow.He
needs information so that he can make correct
decisions.
Solution 3
Improve
Solution 3
Expansio
n
sales
Data
Warehouse
President
time
Improvemen
t
Inmonss definition
A data warehouse is
-subject-oriented,
-integrated,
-time-variant,
-nonvolatile
collection of data in support of managements
decision making process.
Subject-oriented
Data
Integration
Data Warehouse
is constructed by integrating
multiple heterogeneous sources.
Data Preprocessing are applied to ensure
consistency.
RDBMS
Legacy
System
Flat File
Data
Warehouse
Data Processing
Data Transformation
Integration
In
terms of data.
encoding structures.
Measurement of
attributes.
physical attribute.
of data
naming conventions.
Data type format
remarks
Time-variant
Provides
Nonvolatile
Data
load
access
Operational
Information
Characteristics
Operational processing
Informational processing
Orientation
Transaction
Analysis
User
Clerk,DBA,database
professional
Knowledge workers
Function
Decision support
Data
Current
Historical
View
Detailed,flat relational
Summarized,
multidimensional
DB design
Application oriented
Subject oriented
Unit of work
Access
Read/write
Mostly read
Operational
Information
Focus
Data in
Information out
Number of records
accessed
tens
millions
Number of users
thousands
hundreds
DB size
100MB to GB
100 GB to TB
Priority
Metric
OLAP Servers
Metadata
Repository
Reconciled data
External
Sources
Extract
Transform
Load
Refresh
Analysis
Serve
Query/Reporting
Operational
Dbs
Data Mining
DATA SOURCES
TOOLS
DATA MARTS
Warehouse server
almost always a relational DBMS,rarely flat
files
OLAP servers
to support and operate on multi-dimensional
data structures
Clients
Query and reporting tools
Analysis tools
Data mining tools
Schema
Fact Constellation Schema
Snowflake Schema
Star Schema
A single,large
Fact Table
Store Key
Period Key
Store Name
Product Key
Year
City
Period Key
Quarter
State
Units
Month
Region
Price
Time Dimension
Product Key
Product Desc
Product
Dimension
Benefits: Easy to understand, easy to define hierarchies,
reduces no. of physical
joins.
SnowFlake Schema
Variant
Fact Table
Store Key
Period Key
Product Key
Year
Period Key
Quarter
Units
Month
Price
Product Key
Product Desc
Product
Dimension
Drawbacks: Time consuming joins,report generation slow
Region
Time Dimension
Fact Constellation
Multiple
Shipping
Fact Table
Product Key
Product
Dimension
Product Key
Shipper Key
Period Key
Product Desc
Product Key
Store Key
Units
Store Key
Period Key
Price
Units
Store
Dimension
Store Key
Store Name
City
State
Region
Price
Selection
Data Preprocessing
Fill missing values
Remove inconsistency
Data
Case Study
Afco
Sales Information
Report: The number of units sold.
113
Report: The number of units sold over time
January
February
March
April
14
41
33
25
Sales Information
Report : The number of items sold for each product with
time
Wheat Bread
17
8
Cheese
16
Swiss Rolls
25
21
Time
Product
Sales Information
Report: The number of items sold in each City for each
product with time
Feb Mar
Pune
Cheese
16
Swiss Rolls
16
Wheat Bread
Cheese
Swiss Rolls
Apr
10
7
8
15
City
Time
Jan
Product
Sales Information
Report: The number of items sold and income in each region for
each product with time.
Jan
Rs
Feb
U
Rs
Mar
U
Pune
Apr
Rs
Rs
7.44
24.80
10
17.36
21.20
Cheese
7.95
42.40
16
15.90
Swiss Rolls
7.32
29.98
16
10.98
7.44
Wheat Bread
Cheese
7.95
Swiss Rolls
7.32
16.47
27.45
15
Product
Mumbai
Month
Units
Rupees
7.95
Mumbai
Cheese
January
7.32
Pune
7.95
Pune
Cheese
January
7.32
Mumbai
Swiss Rolls
February
16
42.40
Month
Units
Rupees
589
1/1/1998
7.95
1218
1/1/1998
7.32
589
1/1/1998
7.95
1218
1/1/1998
7.32
589
2/1/1998
16
42.40
Product_Name
Product_Category_ID
589
Wheat Bread
590
White Bread
288
Coconut Cookies
Product_Category_Id Product_Category
1
Bread
Cookies
City
Region
Country
Mumbai
West
India
Pune
NorthWest
India
Sales Fact
Region
Product
Product
Category
Region
Time
OLAP Cube
City
Product
Time
Units
Dollars
All
All
All
113
251.26
Mumbai
All
All
64
146.07
Mumbai
White Bread
All
38
98.49
Mumbai
13
32.24
Mumbai
7.44
Mumbai
7.44
OLAP Operations
Drill Down
Product
Category e.g Electrical Appliance
Region
Time
OLAP Operations
Drill Up
Product
Category e.g Electrical Appliance
Region
Time
OLAP Operations
Slice and Dice
Product
Region
Region
Product=Toaster
Time
Time
OLAP Operations
Pivot
Product
Time
Region
Product
Time
Region
OLAP Server
An
Presentation
Product
Region
Reporting
Tool
Report
Time
Data Warehouse
Online analysis processing(OLAP).
Presentation.
Cleaning ,Selection &
Integration
Presentation
RDBMS
Flat File
Client
business perspective
-it is latest marketing weapon
-helps to keep customers by learning more about
their needs .
-valuable tool in todays competitive fast evolving
world.
tools
MS Excel Pivot Chart
VB Applications
References
Thank You