ECEG-4191 Database Systems
ECEG-4191 Database Systems
ECEG-4191 Database Systems
Database Systems
Chapter 01
Introduction
Database Applications
• Businesses: Banking, Insurance, Retail, Transportation, Healthcare, Manufacturing
• Recently:
• Social Networks, Environmental and Scientific Applications, Medicine and Genetics
database.
Database System:
The DBMS software together with the data itself. Sometimes, the application programs
Entities
_____________ (e.g., students, courses)
Relationships (e.g., John is takes ECEG 4191)
_____________
File System v.s. DBMS
File System is a collection of individual ____
files accessed by application programs.
Example: Store the data in comma-separated value (CSV) files.
Use a separate file per entity.
Write applications to access and manipulate this data as they are needed
- Data Isolation
Multiple files and formats
- Integrity Problems
• Integrity constraints (e.g., account balance > 0) become “buried” in program code rather than being stated explicitly
• Hard to add new constraints or change existing ones
- Schema Change
• requires changing file formats
• need to rewrite virtually all applications
Limitation of Traditional File Based Systems (Cont.)
• Atomicity of updates
• Failures may leave database in an inconsistent state with partial updates carried out
• Example: Transfer of funds from one account to another should either complete or not happen at all
requirements.
Availability of up-to-date information:
Extremely important for on-line transaction systems such as airline, hotel, car
reservations.
Economies of scale:
Wasteful overlap of resources and personnel can be avoided by consolidating data and
Construct or Load the initial database contents on a secondary storage medium (typically
hard disk)
Processing and Sharing by a set of concurrent users and application programs – yet,
keeping all data valid and consistent
Those who actually use and control the database content, and those who design,
Those who design and develop the DBMS software and related tools, and the
computer systems operators.
Database Users: DB Administrators (DBAs) and Designers
Database Administrators:
Responsible for authorizing/controlling access to the database;
coordinating and monitoring its use; acquiring software and hardware
resources; and monitoring efficiency of operations.
Database Designers:
Responsible for defining database structure, constraints, and transactions;
communicate with users to understand their needs.
Database Users: End-users
End-users: use the database for queries, reports, and updating the database content. Can be
categorized into:
Casual end-users
Access database occasionally when needed
Naïve (or Parametric) end-users
Use previously implemented and tested programs (called “canned transactions”) to
Sophisticated end-users:
These include business analysts, scientists, engineers, others thoroughly familiar with the
system capabilities.
Many use tools in the form of software packages that work closely with the stored database.
Stand-alone end-users:
Mostly maintain personal databases using ready-to-use packaged applications.
An example is a tax program user that creates its own internal database.
• Internal schema at the internal level to describe physical storage structures and access paths (e.g
indexes).
• Typically uses a physical data model.
• Conceptual(logical) schema at the conceptual level to describe the structure and constraints for the
whole database for a community of users.
• Uses a conceptual or an implementation data model.
• External schemas at the external level to describe the various user views.
• Usually uses the same data model as the conceptual schema.
This is also known as the ANSI/SPARC (American National Standards Institute/ Standards Planning And
Requirements Committee) architecture, after the committee that proposed it (Tsichritzis & Klug, 1978).
The three-schema(ANSI-SPARC) architecture
• New technologies are emerging from the so-called non-SQL to manage vast
amounts of data generated on the web:
• Example: NOSQL (Non-SQL, Not Only SQL) systems (Chapter 24)
• A large amount of data now resides on the “cloud” which means it is in huge
data centers using thousands of machines.
Why is “big data” a “big deal”?
• Private Sector
• Walmart handles more than 1 million customer transactions every hour, which is imported
into databases estimated to contain more than 2.5 petabytes of data
• Facebook handles 40 billion photos from its user base
• Falcon Credit Card Fraud Detection System protects 2.1 billion active accounts world-wide
• Science
• Large Synoptic Survey Telescope will generate 140 Terabyte of data every 5 days