Fundamentals of Data Warehouses

Download as pdf or txt
Download as pdf or txt
You are on page 1of 3

See discussions, stats, and author profiles for this publication at: https://www.researchgate.

net/publication/268315502

Fundamentals of Data Warehouses

Book · January 2000


DOI: 10.1007/978-3-662-04138-3

CITATIONS READS
336 4,478

4 authors:

Matthias Jarke Maurizio Lenzerini


RWTH Aachen University Sapienza University of Rome
750 PUBLICATIONS   16,090 CITATIONS    414 PUBLICATIONS   21,082 CITATIONS   

SEE PROFILE SEE PROFILE

Yannis Vassiliou Panos Vassiliadis


National Technical University of Athens University of Ioannina
140 PUBLICATIONS   2,980 CITATIONS    225 PUBLICATIONS   5,762 CITATIONS   

SEE PROFILE SEE PROFILE

Some of the authors of this publication are also working on these related projects:

Transkriptionstheorie View project

ConceptBase Development View project

All content following this page was uploaded by Yannis Vassiliou on 03 September 2014.

The user has requested enhancement of the downloaded file.


Fundamentals of Data Warehouses
2nd Revised and Extended Edition
by Matthias Jarke, Maurizio Lenzerini, Yannis Vassiliou, Panos Vassiliadis

Springer-Verlag, 2003
214 pages, list price EUR 39.95
ISBN: 3-540-42089-4

Review by:
Vernon Hoffner, Lawrence Technological University
College of Management
Hoffner@ltu.edu

During the last decade the field of data section introduces data warehousing. The first
warehousing has grown significantly. Many chapter provides a definition and overview of
organizations are either actively looking at this the data warehouse and its components. It also
technology or have currently implemented one introduces the subject areas that will be covered
or more data warehouses or data marts to in more detail in the later chapters. The second
support corporate decision making. In today’s chapter presents areas and issues of relevant
economic environment, the competitive edge research. This chapter introduces a focus on
frequently comes from the proactive use of the modeling and measuring data quality and data
information that companies have been collecting warehouse quality, one of the strengths of the
in their operational systems. They are realizing text. These topics are covered in more detail in
the significant potential this information can the last section of the book.
have for their organization. The data warehouse The next section looks at the process of
provides users with access to these large obtaining the data and loading it into the data
amounts of integrated, nonvolatile, time variant warehouse. In chapter three the authors define
data that can be used to track business trends, source integration and how it can be applied to
facilitate forecasting and improve strategic the process of integrating the schemas of the
decisions. source data to construct an integrated enterprise
schema. We are reminded that this is a
Summary of the Book continuing process in order to maintain a quality
The authors state “this book is an introduction collection of data from the changing source
and sourcebook for practitioners, graduate systems. Source integration is also the
students, and researchers interested in the state foundation of the transformation and loading of
of the art and the state of the practice in data the data into the data warehouse, the topics of
warehousing.” There have been a wide variety the next chapter. Data warehouse refreshment is
of books written for practitioners on the topic of the process of integrating, cleansing, and
data warehousing in the last few years. transforming the source data in preparation of
However, this is the first book I have seen that physically loading it into the data warehouse.
focuses on the data warehousing from the point The authors discuss many techniques of data
of interest of researchers. Jarke et al. present a cleaning and provide references to the literature
good starting point and foundation for someone for more details on data cleaning and
interested in data warehousing and the related transformation. The final portion of the chapter
research issues. covers the process of loading the data into the
Consisting of 178 pages of text, this data warehouse, including the quality factors and
book is organized into four sections, each design choices necessary to insure a timely and
consisting of two chapters. The introductory accurate refreshment process.

SIGMOD Record, Vol. 32, No. 2, June 2003 55


The third section focuses on the data data warehouse development and operation in
structures for the data warehouse and the the Foundation of Data Warehouse Quality
efficient access to that data. Chapter five begins project.
with a short review of the online transaction
processing data structure needs to provide a Target Audience
contrast with the data structure needs for online The extensive reference to supporting academic
analytical processing. This leads to a and research literature satisfies the authors’ goal
description of the multidimensional view of the of making this an excellent sourcebook for
data needed by the end users and the several graduate students and researchers. Practitioners
ways the multidimensional view can be with good modeling and conceptualization
physically implemented with the database. The capability will also appreciate this approach.
authors complete their presentation of the The text covers the essential areas for the
modeling of the data warehouse with a development of a quality data warehouse, but it
discussion of the role of aggregates in the data does not provide the cookbook solutions that are
warehouse. The role of aggregates is continued provided in the trade press. This style probably
in the next chapter, which concentrates on the will not be greatly appreciated by the majority of
optimization of query processing. From the practitioners who, in many instances will be
perspective of the end users, the response time to looking for the answer to their immediate
their queries is very important, and today problem.
everyone is expecting a “quick” reply. The
authors describe several methods of improving Reviewer’s Appreciation
query performance and reference the underlying The writing style and the presentation of the
research for readers who want or need to pursue material are a refreshing change for the plethora
additional information in this area. of popular press books that focus on the
The last section brings together the experiences of the consultant/writer. The book
topics of the previous chapters and develops an is easy to read and it provides a good graphical
integrated focus on quality in the design and presentation for the conceptual modeling
operation of a data warehouse. Chapter seven approach to the development of quality data
develops a model for integrating the components warehouses.
of the data warehouse into a unified architecture The extensive bibliography is also
for an overall perspective on quality of the data greatly appreciated. It will provide a good
warehouse. A major factor in supporting quality starting point for anyone interested in pursuing a
is the role played by metadata. The authors then more in-depth study of the supporting literature.
describe how their architecture with supporting In conclusion, I would recommend this
metadata can be used to implement a high book as one of the required texts for an
quality data warehouse. The last chapter advanced graduate course in data warehousing.
describes the application of this quality focus to

56 SIGMOD Record, Vol. 32, No. 2, June 2003


View publication stats

You might also like