You are on page 1of 45

AMTI 4

Data Management: Data,


Databases and Warehousing
Antonius Rachmat C
Data Life Cycle Process

Chapter 3 2
Data Mart
• D
Data
t yang tterdapat
d t dalam
d l d t warehouse
data h d
dapat
t
dibagi perbagian sesuai dengan kebutuhan dalam
informasi ->
> Data Mart
• Data mart memiliki karakteristik yang sama
dengan data warehouse
warehouse, perbedaannya hanya
terdapat pada jumlah data yang dimiliki.
• Dalam data mart,
mart data yang ada hanya berasal
dari satu bagian atau satu departemen saja,
sedangkan pada data warehouse, data yang ada
berasal dari seluruh bagian dalam perusahaan
tersebut.
Forms for Modeling Data (ERD)
• Entities : suatu kesatuan data yang unik yang
memisahkannya dari entitas lainnya

• Attributes : ciri khas yang menempel pada entitas,


biasa disebut field

• Relations : relasi yang terjadi antar entitas yang


dihubungkan oleh atribut masing-masing entitas

Chapter 3 4
E-R Diagrams

• Rectangles represent entity sets.


• Diamonds represent relationship sets.
• Lines link attributes to entity sets and entity sets to relationship sets.
• Ellipses represent attributes
• Double ellipses represent multivalued attributes.
attributes
• Dashed ellipses denote derived attributes.
• Underline indicates p
primary
y key
y attributes
E-R Diagram With Composite, Multivalued, and Derived
Attributes
Relationship Sets with Attributes
Example of primary and foreign keys
keys.

3-8
Hierarchy of Data

Chapter 3 9
Hierarchy of Data (cont’d)
(cont d)

Chapter 3 10
Indexed sequential access method
(ISAM)

3-11
Indexing
• Indexing sering dilupakan
• Indexing pada tabel biasanya dilakukan
otomatis pada primary key
– Tidak
Tid k d
demikian
iki d dengan fifield
ld llainnya
i
• Field y
yangg sering
g digunakan
g untuk q
query
y
select klause where SEBAIKNYA diindex!
– Index berdasarkan ASCII / Numerik
– Index berdasarkan binary
Computer based system

Computer-based files of this type cause problems such as redundancy,


inconsistency, and data isolation.

3-13
(a) Centralized database
database. (b) Distributed database with
complete or partial copies of the central database in
3-14
more than one location.
Data Problems

Copyright 2010 John Wiley & 3-15


Sons, Inc.
Adv of DBMS

3-16
DBMS

Ex: MySQL, FoxPro, SQLServer, Postgres, Access, SQLite, Oracle

Database management system provides access to all data in the


database.

3-17
DBMS connector
• Dibutuhkan juga DBMS connector yang
g
berguna untuk menghubungkan
g g antara
DBMS dengan bahasa pemrograman
yang sifatnya general
• Connector sangat ditentukan oleh bahasa
pemrogramannya
• Contoh: JDBC, ODBC, Connector for
.NET
The Data Warehouse & Data Management

Chapter 3 19
DW dan DM
• DW
DW: A generici tterm ffor a system
t off storing,
t i
retrieving and managing large amounts of data
– Software often includes sophisticated compression
and hashing techniques for fast searching and
g
filtering
• DM: type of data warehouse designed mainly to
address a specific function or department’s needs
– Often uses aggregation or summarization of the
data to enhance query performance
– Important,
I t t however,
h to
t maintain
i t i the
th ability
bilit tto access
the underlying base data to enable drilldown
y
analysis as necessary y
Model of an enterprise
p data warehouse.
ETL : Extraction, Transformation, and Loading

(Source: From Syncsort, synchsort.com. Used with


permission.) 3-21
Data Warehouse
• D
Datat di d
dalam
l d
data
t warehouse
h mempunyaii
karakter khusus berupa time-series dalam
b t kd
bentuk data
t historical
hi t i l
• Data terdiri dari suatu seri dari “pemotretan”
keadaan yang ditandai dengan waktu dan
nilai data pada saat “pemotretan” tersebut
• Hal ini dapat digunakan untuk melakukan
trend analysis dari data tersebut
• Data warehouse tidak diupdate secara terus
menerus,, tapip secara p
periodik
DW = OLAP
• O
On-Line
Li A Analytical
l ti l P
Processing
i
• Umumnya y digunakan
g oleh p
pembuat
kebijakan seperti analis dan manajer
• Data yang ada di dalam sebuah data
warehouse umumnya berasal dari
bermacam-macam OLTP (On-Line
Transaction Processing) systems
• Dikarakterisasikan oleh data reading
(pelaporan, queries) bukan data entry
The use of DW

Copyright 2010 John Wiley & 3-24


Sons, Inc.
Why DW Fails

Copyright 2010 John Wiley & 3-25


Sons, Inc.
Why Mine Data? Commercial Viewpoint
• Lots of data is being collected
and warehoused
– Web data, e-commerce
– purchases at department/
grocery stores
– Bank/Credit Card
transactions
i
• Data collected and stored at
enormous speeds (GB/hour)
– remote sensors on a satellite
– telescopes
t l scanning
i ththe skies
ki
– scientific simulations
generating terabytes of data
What is Data Mining?
• Exploration & analysis, by automatic or
semi-automatic means, of
large quantities of data
in order to discover
meaningful patterns
Data Mining
the extraction
th t ti off predictive
di ti information
i f ti
from large databases.
• Data trend, connection and behavior
pattern analysis
• Data quality -> data cleaning
• Data mining tools -> RapidMiner
• Use for predictive and description
• Statistical techniques and algorithms
Data Mining Tasks
Tasks...
• Classification [Predictive]
• Clustering [Descriptive]
• Association Rule Discovery [Descriptive]
• Sequential Pattern Discovery [Descriptive]
• Regression [Predictive]
• Deviation/Anomaly Detection [Predictive]
Data Visualization
• Visualisasi adalah konversi data ke dalam format visual
atau tabel sehingga karakteristik dari data dan relasi
diantara item data atau atribut dapat di analisis atau
dilaporkan

Suhu permukaan laut pada bulan Juli tahun 1982


Teknik Visualisasi
• Histogram
Teknik Visualisasi Data
• Blok Plot
Teknik Visualisasi data
• Scatter Plot
Teknik Visualisasi data
• Countour Plot
Apa itu Business Intelligence?
• B
Business
i IIntelligence
t lli iis th
the processes,
technologies, and tools that help us change data
into information, information to knowledge, and
knowledge into plans that guide organization.
• Technologies
g for g
gathering,g storing,
g analysing
y g
dan providing access to data to help enterprise
users make better business decisions.
• Business Intelligence menjelaskan tentang
suatu konsep dan metode bagiamana untuk
meningkatkan kualitas pengambilan keputusan
bisnis berdasarkan sistem yang berbasiskan
data.
Apa itu Business Intelligence?
Intelligence?...
Tiga Aspek Penting Business
Intelligence

1. Converting Data Into Information


2 Making Better Decisions Faster
2.
3. Rational Approach to Management
Arsitektur BI
BI Pyramid
Technologies Supporting BI
• D
Database
t b systems
t and
dddatabase
t b iintegration
t ti
• Data warehousing, data stores and data
marts
• Enterprise resource planning (ERP) systems
• Query
Q andd reportt technologies
t h l i
• Data mining and analytics tools
• Decision support systems
• Customer relation management software
• Product lifecycle and supply chain
management systems
Examples of BI Careers
BI careers cross over all ll industries:
i d ti
• BI solution architects and integration
g
specialists
• Business and BI analysts
• BI application developers and testers
• Data warehouse specialists
• Database analysts,
y , developersp and testers
• Database support specialists
Kategori Analisis Bisnis
Bagaimana BI bekerja?
BI Component
• Data
D t
– Data store collection by individual applications
• Production systems
– Gathering of all data stores
• Data warehousing
g
• Presentation
– Scorecards & Dashboards
– Enterprise Reporting
– OLAP Analysis
– Advanced & Predictive Analysis
– Alerts & Proactive Notification
NEXT
• Networking: Discovery, Communication,
Collaboration

You might also like