BIT 1201 Database Lesson 1
BIT 1201 Database Lesson 1
BIT 1201 Database Lesson 1
Lesson One
The Database Environment and Development
Process
The Database Environment and Development Process
Learning Objectives:
• Define terms
• File based systems
• Explain advantages and disadvantages of databases
• List components of database environment
• Identify categories of database applications
• Explain roles of individuals
• Explain differences between personal, multi-tiered, and
enterprise data management
• Explain three-schema architectures (external, conceptual,
internal)
Definitions
• Database: organized collection of logically related data
• Data: stored representations of meaningful objects and
events
– Structured: numbers, text, dates
– Unstructured: images, video, documents
• Information: data processed to increase knowledge in the
person using the data
• Metadata: data that describes the properties and context of
user data
Figure :Converting Data to Information (1 of 2)
(a) Data in context
Figure:Converting Data to Information (2 of 2)
(b) Summarized data
Table 1-1 Example Metadata for Class Roster
• Descriptions of the properties or characteristics of the data, including data
types, field sizes, allowable values, and data context
Name Type Length Mi Ma Description Source
n imum ximum
• Controlling Redundancy
• In file systems each application program has its own private files. In this
case, the duplicated copies of the same data is created in many places.
In DBMS, all data of an organization is integrated into a single database
file. The data is recorded in only one place in the database and it is not
duplicated.
• Sharing of Data
• In DBMS, data can be shared by authorized users of the organization.
The database administrator manages the data and gives rights to users
to access the data. Many users can be authorized to access the same
piece of information simultaneously. The remote users can also share
same data. Similarly, the data of same database can be shared between
different application programs.
• Data Consistency
• By controlling the data redundancy, the data consistency is
obtained. If a data item appears only once, any update to its
value has to be performed only once and the updated value is
immediately available to all users. If the DBMS has controlled
redundancy, the database system enforces consistency.
• Integration of Data
• In Database management system, data in database is stored in
tables. A single database contains multiple tables and
relationships can be created between tables (or associated data
entities). This makes easy to retrieve and update data.
• Integration Constraints
• Integrity constraints or consistency rules can be applied to database so
that the correct data can be entered into database. The constraints may
be applied to data item within a single record or the may be applied to
relationships between records.
• Data Security
• Form is very important object of DBMS. You can create forms very
easily and quickly in DBMS. Once a form is created, it can be used many
times and it can be modified very easily. The created forms are also
saved along with database and behave like a software component. A
form provides very easy way (user-friendly) to enter data into database,
edit data and display data from database. The non-technical users can
also perform various operations on database through forms without
going into technical details of a fatabase.
• Report Writers
• Most of the DBMSs provide the report writer tools used to create
reports. The users can create very easily and quickly. Once a report is
created, it can be used may times and it can be modified very easily. The
created reports are also saved along with database and behave like a
software component.
• Control Over Concurrency
• In a computer file-based system, if two users are allowed to access data
simultaneously, it is possible that they will interfere with each other. For
example, if both users attempt to perform update operation on the same
record, then one may overwrite the values recorded by the other. Most
database management systems have sub-systems to control the
concurrency so that transactions are always recorded with accuracy.
• Backup and Recovery Procedures
• In a computer file-based system, the user creates the backup of
data regularly to protect the valuable data from damage due to
failures to the computer system or application program. It is very
time consuming method, if amount of data is large. Most of the
DBMSs provide the 'backup and recovery' sub-systems that
automatically create the backup of data and restore data if
required.
• Data Independence
• The separation of data structure of database from the application
program that uses the data is called data independence. In DBMS,
you can easily change the structure of database without modifying
the application program.
The Main disadvantages of DBMS
• Cost of Hardware and Software
• A processor with high speed of data processing and memory of large
size is required to run the DBMS software. It means that you have to
up grade the hardware used for file-based system. Similarly, DBMS
software is also very costly,.
• Cost of Data Conversion
• When a computer file-based system is replaced with database
system, the data stored into data file must be converted to database
file. It is very difficult and costly method to convert data of data file
into database. You have to hire database system designers along
with application programmers. Alternatively, you have to take the
services of some software house. So a lot of money has to be paid
for developing software.
• Cost of Staff Training
• Most database management system are often complex systems so the training
for users to use the DBMS is required. Training is required at all levels, including
programming, application development, and database administration. The
organization has to be paid a lot of amount for the training of staff to run the
DBMS.
• Appointing Technical Staff
• The trained technical persons such as database administrator, application
programmers, data entry operations etc. are required to handle the DBMS. You
have to pay handsome salaries to these persons. Therefore, the system cost
increases.
• Database Damage
• In most of the organization, all data is integrated into a single database. If
database is damaged due to electric failure or database is corrupted on the
storage media, the your valuable data may be lost forever.
Components of the Database Environment
• Data modeling and design tools – automated tools used to design
databases and application programs
• Repository – centralized storehouse of metadata
• Database Management System (DBMS) – software for managing the
database
• Database – storehouse of the data
• Application Programs – software using the data
• User Interface – text, graphical displays, menus, etc. for user
• Data/Database Administrators – personnel responsible for maintaining
the database
• System Developers – personnel responsible for designing databases
and software
• End Users – people who use the applications and databases
Figure: Components of the Database
Environment
Evolution of Database Systems
• Driven by four main objectives:
– Need for program-data independence in order to reduce
maintenance
– Desire to manage more complex data types and structures
– Ease of data access for less technical personnel
– Need for more powerful decision support platforms
Examples of Database Applications:
• Banking: transactions
• Airlines: reservations, schedules
• Universities: registration, grades
• Sales: customers, products, purchases
• Online retailers: order tracking, customized
recommendations
• Manufacturing: production, inventory, orders, supply
chain
• Human resources: employee records, salaries, tax
deductions
The Range of Database Applications
• Personal Databases
– Typical size in the megabytes
– Intended for one user
• Departmental Multi-Tiered Client/Server Databases
– Typical size in the gigabytes
– Intended for several users, usually doesn’t exceed 100,
department-wide
• Enterprise Applications
– Typical size in the gigabytes, terabytes, or even petabytes
– Intended for a very large user base, company wide
Figure: Multi-Tiered Client/Server Database
Architecture
Types of Enterprise Applications
• Enterprise Systems (typically involve relational databases)
– Backbone of an organization
– Enterprise resource planning (ERP)
– Customer relationship management
– Supply chain management
– Human resource management and payroll
• Data Warehouses (typically involve relational databases)
– Integrates data from multiple data sources
– Maintain historical data
– Help identify patterns and trends
• Data Lakes (often don’t involve relational databases)
– Large integrated repository for internal and external data that does not follow
a predefined schema
Views
• A view mechanism.
– Provides users with only the data they want or
need to use.
• Allows each user to have his or her own view
of the database.
• A view is essentially some subset of the
database.
Benefits include:
– Reduce complexity;
– Provide a level of security;
– Provide a mechanism to customize the
appearance of the database;
– Present a consistent, unchanging picture of the
structure of the database, even if the underlying
database is changed.
ANSI SPARC 3 Level Database Architecture
- External level
- Conceptual level
- Internal level (includes physical data storage)
• The 3 Level Architecture has the aim of enabling
users to access the same data but with a
personalised view of it. The distancing of the
internal level from the external level means that
users do not need to know how the data is
physically stored in the database. This level
separation also allows the Database
Administrator (DBA) to change the database
storage structures without affecting the users'
views.
• External Level (User Views)
A user's view of the database describes a part of the database that is relevant to a
particular user. It excludes irrelevant data as well as data which the user is not
authorised to access.
Conceptual Level
The conceptual level is a way of describing what data is stored within the whole
database and how the data is inter-related. The conceptual level does not specify
how the data is physically stored.
Internal Level
The internal level involves how the database is physically represented on the
computer system. It describes how the data is actually stored in the database and
on the computer hardware.
Database Schema
• The database schema provide an overall description of the
database structure (not actual data). There are three types
of schema which relate to the 3 Level Database
Architecture.