BIS 05 1 Foundation of BI
BIS 05 1 Foundation of BI
BIS 05 1 Foundation of BI
Intelligence
Databases and Information
System
Introduction
Redundant, inconsistent and
fragmented data across organization
Challenge in querying and reporting
Presence of data across various
systems and integrating them is a
challenge
Target
Getting timely, accurate and relevant
information
Better data management techniques
with support of good data model
Terminology
Terminology
Data Redundancy-data stored in
multiple places
Data Inconsistency Same attribute
has different values
Lack of Flexibility Support for adhoc
reports
Poor Security Lack of control on data
Lack of data sharing and availability
Sharing data even if available in
timely manner is challenge.
Database Management
System
Collection of data to server multiple
applications by centralizing data and
eliminating data redundancy &
incosistency
DBMS is a software that permits an
organization to centralize data,
manage them efficiently and provide
access to stored data
Relational DBMS
Stores data in 2 dimensional table
Each row represents a record/tuple
Each row can be uniquely identified
and retrieved by key field
One of the key is designated as
primary key
Relationship with other table
maintained by foreign key
Object-Oriented DBMS
Treat data as objects
Better suited for structured and
unstructured data like voice, video,
images etc
Relatively slow as compared to DBMS
Hybrid ORDBMS is a good solution
DBMS Capabilities
Data Definition capability to specify
structure of the content of database
Data Dictionary stores definition of data
elements and their characteristics
Benefits
Querying and reporting via SQL
CUD operations via DML
Data Mart
Subset of data warehouse in which a
summarized or highly focused portion of the
organizations data is placed in a separate
database
ETL
Extract, Transform and Load
BI Dashboard
BI Tools
Online Analytical Processing(OLAP)
Multidimensional data analysis to view
same data using multiple dimensions
Dimension may be
product,pricing,region,data etc
Data Mining
Discovery-driven, finding patterns and
relationship
Output includes association ,
sequences ,clusters ,
classifications and forecasts
BI Tools
Text Mining
Analyzing unstructured data like call
center transcripts , survey responses,
email, memo etc.
Web Mining
Finding patterns from contents of web
pages which includes text, image,audio
etc
ETL