D56261GC10 Appendix A DW
D56261GC10 Appendix A DW
D56261GC10 Appendix A DW
THESE eKIT MATERIALS ARE FOR YOUR USE IN THIS CLASSROOM ONLY. COPYING eKIT MATERIALS FROM THIS
COMPUTER IS STRICTLY PROHIBITED
Table of Contents
Practices for Lesson 1 ......................................................................................................... 3
Practice 1-1: Identifying the Available SQL Developer, Oracle Analytic Workspace
Manager (AWM) 11g, Oracle Warehouse Builder (OWB) 11g, Oracle Application
Express 3.1, and Oracle Database 11g Documentation Resources................................. 4
Practice 1-2: Creating and Using New SQL Developer Database Connections........... 11
Practice 1-3: Browsing Your sh and dm Schemas Tables ........................................... 12
Practices for Lesson 2 ....................................................................................................... 15
Practice 2-1: Data Warehousing and Business Intelligence ......................................... 16
Practice 2-2: Exploring the Analytic Workspace Manager (AWM) 11g ..................... 19
This is the first of many practices in this course. The solutions (if you require them) can
be found in Appendix A: Practices and Solutions. Practices are intended to cover most
of the topics that are presented in the corresponding lesson.
In practice 1-1, you review the available SQL Developer, Oracle Analytic Workspace
Manager (AWM) 11g, Oracle Warehouse Builder (OWB) 11g, and Oracle
Application Express 3.1 resources.
In practice 1-2, you create and use new SQL Developer database connections.
In practice 1-3, you browse some of the tables in the dm schema using SQL
Developer.
In this practice, you review the available SQL Developer, Oracle Analytic Workspace
Manager (AWM) 11g, Oracle Warehouse Builder (OWB) 11g, and Oracle Application
Express 3.1 resources.
1) Familiarize yourself with Oracle SQL Developer, as needed, using Appendix C:
Using SQL Developer.
3) Test the new connection. If the status is Success, connect to the database by using the
new connection.
a) Double-click the SH Schema icon on the Connections tabbed page.
b) Click the Test button in the New/Select Database Connection window. If the
status is Success, click the Connect button.
4) Create a new database connection named DM Schema.
a) Right-click the SH Schema connection in the Object Navigation tree and select
the Properties menu option.
b) Enter DM Schema as the connection name and dm as the username and oracle
as the password, and then click Save. This creates the new connection.
c) Repeat step 3 to test the new DM schema connection.
If the SQL Worksheet is not displayed, click the Open SQL Worksheet (Ctrl +
Shift + W) icon in the SQL Developer toolbar. Enter the code as displayed in the
Click the Run Script (F5) icon. The resulting rows are displayed on the Scripts
Output tabbed page as follows:
In practice 2-1, you answer some questions about what you learned in this lesson.
In practice 2-2, you perform the following tasks:
Start AWM.
Explore the SALESTRACK workspace.
Identify the dimensions and cubes in the workspace.
In practice 2-3, you log in to APEX as a developer and review an interactive report
1) Start the AWM tool by clicking the AWM icon on your desktop.
Click the AWM icon on your computer desktop to start the tool. The Analytic
Workspace Manager screen is displayed as follows:
2) Expand the Databases node. Expand oracle11g. The Connect to database dialog
box is displayed as follows:
4) Expand the Schemas > DM > Analytic Workspaces > SALESTRACK (attached
RW) nodes as follows:
6) Observe the data in the dimensions and cubes, and answer the following questions:
a) What are the members of the GEOGRAPHY dimension?
Right-click the GEOGRAPHY dimension, and then select the View data
GEOGRAPHY option from the pop-up menu. Click the Expand All
icon to view all members of the GEOGRAPHY dimension. There are 252
Geography values.
What are the levels and attributes of the GEOGRAPHY dimension?
What are the Measures and Calculated Measures in the SALES_CUBE cube?
Expand the node for the SALES_CUBE cube. Expand the Measures and
Calculated Measures nodes as follows:
1) Start your Web browser and navigate to the Application Express (APEX) login page:
http://localhost:8080/apex
2) The Application Express Login page is displayed. Log in using the following details
and then click Login.
a) Workspace: salestrack
b) Username: dm
c) Password: dm
3) Click the Application Builder icon. A list of the available applications is displayed.
1) Indicate whether the following statements about warehouse data are true or false.
2) What are the typical components of a data warehouse and what is their significance?
Consider a simplified portion of the Roy Independent School District (RISD) scenario.
You use this information in the next practice in this lesson:
The RISD wants to build a Data Warehouse environment that will provide decision
makers throughout the school district with information to help improve student
achievement. School districts are a form of special-purpose districts, which help to
operate the local public primary, middle, and high schools. Public education is a function
of these districts serving one or more towns. Data that is currently processed
independently in multiple applications will be integrated into a single environment to
facilitate data reporting and analysis across test areas.
Report data will be available online and can be downloaded into local applications
where appropriate (for example, spreadsheets and PC databases) to perform
additional analysis or for integration with local data.
User groups, such as students, parents, teachers, district officials, and school staff
(principals and school administrators), will be restricted to accessing information
associated with their responsibilities, needs, and skills.
The majority of end users will view simple reports. Dashboard reports will provide
summarized tabular or graphical information, or both, to various user groups. For
example, students will be able to view their own data. For the first phase, test results
will be kept only for the primary state mandated assessments.
Data will be kept from the year 2000. Each Year falls within a Time Period. Each
Time Period is associated with Year and Semester. There are two semesters: Semester
1 and Semester 2.
Students from grades 3 through 12 will take these assessment tests in Mathematics
and English.
Reports will show students scores (out of 100) in Mathematics and English for a
particular grade. Aggregate reports will show Time, Grade, and Location.
There are three levels of location: District, Location type (elementary, middle, high),
and Campus.
1) In this practice, you identify the facts, measures, hierarchies, slowly changing
dimensions, and the probable attributes for the dimensions and facts for the RISD
scenario given.
Consider a simplified portion of the RISD scenario that you reviewed in practice 4-1:
The RISD wants to build a Data Warehouse environment that will provide decision
makers throughout the school district with information to help improve student
achievement. School districts are a form of special-purpose districts, which help to
operate the local public primary, middle, and high schools. Public education is a
D.Access D.Time
F.Student
Scores
D.Student D.Location
Note: The most probable dimensions and facts are indicated in the diagram. You can
think of adding more dimensions or facts.
Note: If you have time, go through the following guided practice using Oracle
Warehouse Builder.
In this guided practice, you experiment with some of the basic functionality of the Oracle
Data Warehouse Builder. In this practice, you log in to Oracle Warehouse Builder and
create two dimensions: CHANNELS1_DIM and TIME1_DIM.
2) Before logging in with the username (if not already displayed) and password, you
must establish connect information. Click Show Details. The Design Center Logon
window expands as follows:
1 2
13) Click Next. The Create Dimension Step 7 of 8: Pre Create Settings window is
displayed. Observe the Pre Create Settings that you have selected.
Compare your results with the following screen capture:
3
19) Right-click the CHANNEL1_DIM dimension, and then select Validate from the pop-
up menu as follows:
20) Close the Validation Results window. Select Close from the File menu.
Create a new dimension named TIME1_DIM using the Create Dimension Wizard, by
performing the following steps:
1) In the Project Explorer panel of the Design Center window, expand the nodes in the
following order:
SH_HANDSON > Databases > Oracle > TARGET_MODULE
7) Click Next. The Create Time Dimension Step 4 of 6: Levels window is displayed
as follows:
18) Save your changes. Click the Save All icon from the toolbar. The following
warning window is displayed as follows.
Note: If you have time, go through the following guided practice using Oracle
Warehouse Builder.
In this guided practice, you create a new relational SALES1_CUBE cube and dimension
it using the CHANNEL1_DIM , TIME1_DIM, and the PRODUCT_DIM and
CUSTOMER_DIM prebuilt dimensions in the SH_HANDSON project.
Note: To add a dimension, select a dimension from the Available Dimensions: list, and
then click the Add button. The dimension is now displayed in the Selected
Dimensions: list.
11) Save your changes. Click the Save All icon from the toolbar. The following
warning window is displayed as follows.
In this practice, you identify the suitable indexes for the attributes and answer questions
related to indexing and parallelism. You also identify the best strategy to implement
security for the RISD scenario. Finally, you review the other concepts explained in the
lesson.
1) Review the following physical model that you completed in practice 4-2.
No formal solutions
2) You have the historical data from the year 2000 for students and a huge facts table
called Students Scores. What type of partitioning do you think would best suit this
scenario, and why?
In this scenario, range partitioning would suit the best because the data can be
partitioned based on the year.
Note: The preceding indexes are suggested based on the cardinality of these
columns. Also, note that for all primary and unique key columns, the Oracle
database generates B-tree indexes automatically.
4) For a query to use star transformation, which of the following conditions should be
met?
a) B-tree indexes should be created on foreign key columns of the fact table.
b) The STAR_TRANSFORMATION_ENABLED parameter should be set to TRUE.
c) Rule-based optimizer should be used.
d) Bitmap indexes should be created on foreign key columns of the fact table.
e) Cost-based optimizer should be used.
f) All of the above
6) Based on the scenario presented in practice 4-1, find out the best way to implement
the data access security. What are the user roles that you can think of?
The best way to implement the data access security in this case is using a
combination of role-based security and virtual private database (VPD). Some of
the possible roles to restrict access are:
Student
Parent
Teacher
Admin
Principal or Management
In this practice, you identify the tools and techniques that will aid in the ETL process for
the RISD. You also answer questions based on the given scenario.
Note: If you have time, go through the Oracle Warehouse Builder guided practice 6-2. In
this guided practice, you create maps for the following:
1) The RISD currently has huge data in an operational system, which has to be extracted
and placed in the data warehouse. What do you think is a good choice among the
following for extracting data into the data warehouse?
a) Full and online extraction
b) Incremental and online extraction
c) Incremental and offline extraction
d) Full and offline extraction
2) Which of the following features of Oracle Database11g do you think will aid the ETL
process for the RISD? (Select any two options.)
a) Change Data Capture
b) Query rewrite
c) Partition pruning
d) External tables
The correct answers are a) and d). (Other options are also useful for data
warehousing, but not for the ETL process.)
3) It is decided that the RISD Data Warehouse (RISD DW) project will support Student
Information System (SIS). The RISD will supply annual flat files for each subject
area such as State Developed Alternative Assessment (SDAA II), and will also
provide periodic supplementary data files for new students. To support the scenario,
which of the following features of OWB can you take advantage of?
a) OWB offers easy-to-use wizards to define the dimensional model.
b) OWB supports both Third Normal Form (3NF) and star schema designs.
c) SQL*Loader control files for loading data from flat files.
d) OWB supports both relational and multidimensional designs.
e) All of the above
The correct answer is c). Other options are also features of OWB, but the feature
that is more useful for the scenario here is c).
4) Explore the perf_etl1_viewlet_swf.html viewlet in the
/oracle/home/viewlets/perf_etl1 folder. This viewlet demonstrates the ETL
capabilities of OWB. Answer the interactive questions in the viewlet.
No formal solution
Note: If you have time, go through the following guided practice using Oracle
Warehouse Builder.
In this practice, you create the maps for the CHANNEL1_DIM dimension and the
SALES1_CUBE cube that you created in practices 4-3 and 4-4 respectively.
3) Right-click the Mappings node, and then select New from the pop-up menu as
follows:
10) In the Mapping Editor window, click the Auto-Layout icon from the toolbar to
bring the mapping to its default size. Maximize both objects by clicking the
Maximize icon. By using the drag-and-drop feature, move the CHANNELS table to
the left side of the canvas and the CHANNEL1_DIM dimension to the right. After you
finish, close the Mapping Editor window. Compare your results with the following
screen capture:
Hint: You can double-click CHANNEL_MAP from the Mappings node to open the
In this practice, you create a map between the SALES source table and the
SALES1_CUBE cube that you have created in practice 4-4.
1) In the Project Explorer pane, navigate to the Mappings node: Expand Databases >
Oracle > TARGET_MODULE > Mappings.
2) Right-click the Mappings node, and then select New from the pop-up menu. The
Create Mapping dialog box is displayed. Enter SALES1_MAP as the mapping name,
and then click OK. The Mapping Editor: SALES1_MAP window is displayed as
follows:
4) Next, drag the Table Operator from the Palette panel and drop it onto the
Mappings panel. In the Add Table Operator dialog box, select SALES from the
SOURCE_MODULE, and then click OK. Arrange the layout and resize the tables as
desired. Compare your results with the following screen captures.
7) Right-click the expression operator EXPRESSION header, and then select Open
Details from the pop-up menu. In the Expression Editor window, click the Output
Attributes tab and then click Add.
Click
OUTGRP1.TIME_ID.
2
10) In the Attribute Properties panel displayed on the left side of the Mapping Editor:
SALES1_MAP window, click the field next to the Expression property. Next, click
the ellipsis button to the right of the field that you clicked. The Expression Builder:
Expression for TIME_ID window is displayed.
11) In the Expression for TIME_ID text field, enter the following statement, and then
click OK:
TO_NUMBER(TO_CHAR(INGRP1.TIME_ID , 'YYYYMMDD'),
'99999999')
In this practice, you identify the suitable staging model for the RISD data warehouse.
You also identify the problems, and the best-suited transformation techniques for the
RISD data based on the given scenario. Answer the questions based on the given
scenario:
1) By now, you know that the RISD has huge data in its operational system. It is also
agreed that the RISD will be responsible for data cleansing before data is extracted
from the operational data store (ODS) to load the data warehouse. Which of the
staging models do you think would be best suited for this scenario?
a) Remote staging model
b) Model without any staging area
c) Onsite staging model
1) Select the factors that you would consider while planning the refresh strategy for the
RISD data warehouse:
a) The volume of historical data (The RISD has large volumes of historical data.)
b) The type of mechanism used to capture changes in the data
c) The frequency of refresh
d) The network traffic (Traffic is low during the nights after 9 PM.)
The correct answers are b), c), and d).
In this practice, you answer questions about some of the things you learned in this lesson.
1) Which of the following statements about the metadata are incorrect? (Select any two
options.)
a) Metadata is descriptive data about the warehouse data and ETL processes.
b) Metadata is useful only for the DBAs and data warehouse administrators.
c) Metadata does not contain information about algorithms used for creating
derived data.
d) Any successful implementation of a warehouse can be accredited to the good
THESE eKIT MATERIALS ARE FOR YOUR USE IN THIS CLASSROOM ONLY. COPYING eKIT MATERIALS FROM THIS
COMPUTER IS STRICTLY PROHIBITED