For example, a query that requests the total sales income and quantity sold
for a range of products in a specific geographical region for a specific time
period can typically be answered in a few seconds or less regardless of how
many hundreds of millions of rows of data are stored in the data warehouse
database.
To make it easier, on a regular basis, to query and report data from multiple
transaction processing systems and external data sources.
To prevent persons who only need to query and report transaction
processing system data from having any access whatsoever to transaction
processing system databases and logic used to maintain those databases.
Data Warehouse – Overview
Entity–Relationship Modeling
[Figure: entity-relationship diagram. Entities: Customer, Demographics, CustomerSubscriptions, Salesperson, Zones, City]
Dimensional Modeling
[Figure: star schema for Subscription Sales]
Fact table: Subscription Sales
Keys: EffectiveDateKey, CustomerKey, SubscriptionsKey, PaymentKey, CampaignKey, SalesPersonKey, RouteKey, DemographicsKey
Measures: UnitsSold, DollarsSold, DiscountCost, PremiumCost
Dimensions: Customer, Date, Payment, Subscriptions, Campaign, Salesperson, Route, Demographics
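A minimal sketch of how a star schema like this is queried: the fact table is joined to its dimensions, filtered on dimension attributes, and the measures are summed. It uses Python's built-in sqlite3 module rather than SQL Server. Only two dimensions are shown, and the table names FactSubscriptionSales and DimRoute, the attributes CalendarYear and Zone, and all sample rows are assumptions made for illustration.

```python
import sqlite3

# In-memory database standing in for the warehouse (illustration only).
conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE DimDate  (DateKey  INTEGER PRIMARY KEY, CalendarYear INTEGER);
CREATE TABLE DimRoute (RouteKey INTEGER PRIMARY KEY, Zone TEXT);
CREATE TABLE FactSubscriptionSales (
    EffectiveDateKey INTEGER REFERENCES DimDate(DateKey),
    RouteKey         INTEGER REFERENCES DimRoute(RouteKey),
    UnitsSold        INTEGER,
    DollarsSold      REAL
);
""")

# Hypothetical sample rows.
conn.executemany("INSERT INTO DimDate VALUES (?, ?)",
                 [(20050101, 2005), (20060101, 2006)])
conn.executemany("INSERT INTO DimRoute VALUES (?, ?)",
                 [(1, "North"), (2, "South")])
conn.executemany("INSERT INTO FactSubscriptionSales VALUES (?, ?, ?, ?)",
                 [(20050101, 1, 3, 30.0),
                  (20050101, 2, 1, 10.0),
                  (20060101, 1, 2, 25.0),
                  (20060101, 1, 4, 50.0)])


def sales_by_zone(connection, year):
    """Total units and dollars sold per zone for one calendar year."""
    return connection.execute("""
        SELECT r.Zone, SUM(f.UnitsSold), SUM(f.DollarsSold)
        FROM FactSubscriptionSales AS f
        JOIN DimDate  AS d ON d.DateKey  = f.EffectiveDateKey
        JOIN DimRoute AS r ON r.RouteKey = f.RouteKey
        WHERE d.CalendarYear = ?
        GROUP BY r.Zone
        ORDER BY r.Zone
    """, (year,)).fetchall()


print(sales_by_zone(conn, 2005))
```

Every query against the schema has the same shape: one fact table, a join per dimension used, a filter on dimension attributes, and an aggregate over the measures.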
Kimball Dimensional Life Cycle Diagram
Project Planning
Business Requirement Definition
Parallel tracks after requirement definition:
Technical Architecture Design, then Product Selection & Installation
Dimensional Modeling, then Physical Design, then Data Staging Design & Development
End-User Application Specification, then End-User Application Development
Deployment
Maintenance and Growth
Project Management (spans the whole life cycle)
Sales Data Warehouse
Business Users Requirements
Technical Architecture
Product (Software) Selection
Dimensional Modeling
Logical Design
Data Staging Design & Development
Building the Data Cube using SQL Server Analysis Services
End User Application Specification & Development
Deployment
Maintenance & Growth
Requirements Gathering
This phase involves the following steps:
Collect the business questions the users want answered.
Gather details and requirements from the business users.
Get user sign-off on the business questions.
Business Questions:
Can we profile our "best subscribers" to pull lists of "like" non-subscribers
that we could touch in some way?
Who exists in the marketplace and have we touched them?
Can we build a loyalty model based on a subscriber's payment history?
Sales Data Warehouse High Level Technical Architecture
[Figure: high-level technical architecture, four areas from left to right]
Source Systems: Marketing/Sales, Demographics, Name Phone
Data Staging Area
Services: Transform from source to target. Maintain conformed dimensions.
Data Storage: Flat files or relational tables.
Design Goals: Staging throughput. Integrity and consistency.
Presentation Area
Subscription Sales: Dimensional. Atomic and summary data. Business process.
Design Goals: Ease-of-use. Query performance.
Dimensional Bus: Conformed facts and dimensions.
Data Access Tools: SQL Reporting Services, Excel, Access
Data flows: Extract (source systems to staging area), Load (staging area to presentation area), Access (presentation area to data access tools)
Product Selection
Hardware Specs:
AMD Opteron Processor 252
2.6 GHz, 3.83 GB RAM
Operating System: Windows Server 2003
Software Specs:
Kimball Data Warehouse Tool [create staging and production databases]
Microsoft® SQL Server™ 2000 [ETL (Extract, Transform, Load)]
Microsoft® SQL Server™ 2005 Integration Services [nightly automation]
Microsoft® SQL Server™ 2005 Analysis Services [create OLAP data cube]
Microsoft® SQL Server™ 2005 Reporting Services [end-user reports]
Internet Information Services (IIS 6.0) [web server to host the reports]
Dimensional Modeling
Design Dimensions
Attributes of the dimension
Hierarchy in the dimension
Dimensional Bus Matrix
Design Fact Tables
Dimension Hierarchy – Subscriptions
[Figure: dimension hierarchy diagram. Labels recovered: Subscription, Name, Date, Sales, Customer, Conditions, Subscription Sales, Address, Salesperson, Route]
Logical Design
Fact Table Design
CustomerSubscriptionKey int N PK ID Surrogate Primary Key
StopDateKey int FK DimDate The most recent stop date for this customer 2
Extracting: Reading and understanding the source data and copying the parts
that are needed to the staging area.
Transforming: Possible transformation steps in the data staging area:
Cleaning the data: correcting misspellings, dealing with missing data elements,
parsing into standard formats.
Purging selected data that is not required.
Combining data sources by matching exactly on key values or performing fuzzy
matches on non-key attributes.
Creating surrogate keys for each dimensional record.
Building aggregates to boost the performance of common queries.
Loading: Loading the transformed data into the production database.
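The transformation steps above can be sketched in a few lines of Python. This is an illustration under stated assumptions, not the ETL built for this project (which used SQL Server tooling): the function names, the 0.85 similarity threshold, and the (name, amount) row layout are all hypothetical.

```python
from difflib import SequenceMatcher


def clean_name(raw):
    """Cleaning: collapse whitespace and apply one standard capitalisation."""
    return " ".join(raw.split()).title()


def is_fuzzy_match(a, b, threshold=0.85):
    """Combining: fuzzy match on a non-key attribute (threshold is an assumption)."""
    return SequenceMatcher(None, a.lower(), b.lower()).ratio() >= threshold


def assign_surrogate_keys(names):
    """Surrogate keys: one sequential integer per distinct dimension record."""
    keys = {}
    for name in names:
        if name not in keys:
            keys[name] = len(keys) + 1
    return keys


def transform(source_rows):
    """Run purge, clean, key assignment and aggregation over (name, amount) rows."""
    # Purging: drop rows with a missing name or a missing amount.
    cleaned = [(clean_name(name), amount)
               for name, amount in source_rows
               if name and name.strip() and amount is not None]
    keys = assign_surrogate_keys(name for name, _ in cleaned)
    # Fact rows refer to the dimension only through its surrogate key.
    facts = [(keys[name], amount) for name, amount in cleaned]
    # Aggregates: pre-summed totals per key for common queries.
    totals = {}
    for key, amount in facts:
        totals[key] = totals.get(key, 0) + amount
    return keys, facts, totals


rows = [("  jane   DOE ", 10), ("Jane Doe", 5), ("", 7),
        ("john smith", None), ("Bob Ray", 2)]
keys, facts, totals = transform(rows)
print(keys, facts, totals)
```

In this sample the two spellings of the same customer collapse into one dimension record after cleaning, so both fact rows share one surrogate key. The loading step would then write the keyed rows to the production database.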
Data Staging – ETL Architecture
Source Schema: Creates the tables for the common source database.
Source Queries: Creates views (queries) that feed data to production.
Kimball Data Modeling tool: Creates the staging and production database tables and metadata.
DTS (Data Transformation Services) Package
Database Configuration File: Database connection information for the ETL process.
Dates Configuration File: Important date info for the ETL process.
Data Staging – Development for Sales Data Warehouse
SQL Server 2005 Analysis Services provides tools for developing OLAP applications.
Build the cube using SQL Server Analysis Services and deploy it to the Analysis
Services server.
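To make the idea of a cube concrete, the sketch below pre-computes a measure total for every combination of dimensions, which is the kind of roll-up an OLAP cube answers without rescanning the fact rows. It is plain Python for illustration and says nothing about how Analysis Services stores or builds cubes; the function build_cube, the dimension names, and the sample rows are assumptions.

```python
from itertools import combinations


def build_cube(rows, dimensions, measure):
    """Sum `measure` for every subset of `dimensions` and every value combination.

    Cells are keyed by (dimension names, dimension values); the empty
    subset holds the grand total.
    """
    cube = {}
    for size in range(len(dimensions) + 1):
        for dims in combinations(dimensions, size):
            for row in rows:
                cell = (dims, tuple(row[d] for d in dims))
                cube[cell] = cube.get(cell, 0) + row[measure]
    return cube


# Hypothetical fact rows, already joined to their dimension attributes.
rows = [
    {"Year": 2005, "Zone": "North", "UnitsSold": 3},
    {"Year": 2005, "Zone": "South", "UnitsSold": 1},
    {"Year": 2006, "Zone": "North", "UnitsSold": 6},
]
cube = build_cube(rows, ("Year", "Zone"), "UnitsSold")

print(cube[((), ())])                    # grand total
print(cube[(("Year",), (2005,))])        # roll-up by year
print(cube[(("Zone",), ("North",))])     # roll-up by zone
```

The trade-off is the usual one for aggregates: more storage and build time in exchange for fast answers to common summary questions.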
End User Application Development
[Figure: end-user report chart. Labels: DM Sales, Non-Solicited, Other]
Benefits to Marketing
Increased telemarketing close rates and increased direct mail response rates
Reduced cost and use of outside telemarketing services and reduced print and
mailing costs
Identification of new product bundling and distribution opportunities
Increased acquisition and retention rates, and reduced cost of acquisitions
Benefits to Advertising
An increase in the annual rate of revenue growth
An increase in new advertisers
Improved targeting capabilities
Questions?