Topic 1 Introduction To Database

Download as pdf or txt
Download as pdf or txt
You are on page 1of 19

Topic X Introduction

to Database

LEARNING OUTCOMES
When you have completed Topic, you should be able to: 1. Identify the characteristics of file-based systems. 2. Identify the limitations of file-based systems. 3. Define database and database management systems (DBMS). 4. Identify the functions of a DBMS. 5. Identify the components of the DBMS environment. 6. Identify the advantages and disadvantages of DBMSs.

TABLE OF CONTENTS
Introduction 1.1 Introduction to Database 1.1.1 Basic Concepts and Definition 1.2 Traditional File-based Systems 1.2.1 File-based Approach 1.2.2 Limitations of File-based Approach 1.3 Database Approach 1.3.1 The database 1.3.2 The Database Management Systems (DBMS) 1.4 Roles in the Database Environment Summary Key Terms References ;

TOPIC 1 INTRODUCTION TO DATABASE

INTRODUCTION

Hi there everyone. Welcome to the Database Systems class. Have you heard of the words database or database system? If you have, then you will have a better understanding of these words by taking this course. But, if you havent heard of them, then, dont worry. By taking this course, you will be guided until you know, understand and able to apply it to the real world problem. You might ask yourself, why do you need to study database systems? Well, this is similar as asking youself why do you need to study programming, operating system or other IT-related subjects. The answer is that database system has become an important component of successful businesses and organisations. Since you might probably intend to be a manager, entrepreneur or IT professionals, it is vital to have a basic understanding of database systems. Topic 1 introduces the area of database management systems, examining the problems with the traditional file-based systems and discusses what database management systems can offer. In this first section, I will explain some uses of database systems that we can find in our everyday life. Then, in the next section, we will compare file-based system with database systems. Next, we will discuss the roles that people perform in the database environment and lastly we will discuss the advantages and disadvantages of database management systems.

TOPIC 1 INTRODUCTION TO DATABASE W

1.1

INTRODUCTION TO DATABASES

Now, let me start by asking you some questions regarding your common activities. I am sure that you always go to the supermarket to purchase your goods and go to the automated teller machine to withdraw or deposit some money. Have you ever wondered where all these data come from or how they are being stored? And have you ever wonder whether your account is being balanced correctly? For your information, all these activities are possible with the existence of database management systems (DBMSs). What this means is that our life is affected by database technology. Computerised databases are important to the functioning of modern organisations. Well, before we proceed further, let us take a look to the definition below. What is DBMS? DBMS is a software system that enables users to define, create, maintain, and control access to the database(Connoly and Begg, 2005). What is a database? A database is a shared collection of logically related data, and a description of this data, designed to meet the information needs of an organisation (Connoly and Begg, 2005). What is database application? A database application is a program that manages and controls access to the database (Connoly and Begg, 2005). What is database system? A database system is a collection of application programs that interact with the database along with the DBMS and database itself (Connoly and Begg, 2005). The number of database applications has increased tremendously over the past two decades (Jeffrey et. al. 2007). Use of database to support customer relationship management, online shopping and employee relationship management is growing. But, before we discuss any further on this topic, lets examine some applications of database systems that you have used but without realising that you are accessing a database system in your daily life such as: (a) Purchase from the supermarket When you purchase goods from a supermarket, you would notice that the checkout assistant would scan the bar codes of your purchases and the total of your payment would be calculated.

TOPIC 1 INTRODUCTION TO DATABASE

Basically, what has happened is that the bar code reader is linked to an application program that uses the bar code to find out the price of the item and the price will be displayed on the cash register. Then, the program would reduce the number of such item in stock. If the reorder level falls below a specified predefined value, the database system would automatically place an order to obtain more stocks of that item. In this case, the sales manager can keep track of the items that were sold and need to be ordered. (b) Purchase using your credit card When you purchase an item using a credit card, your credit card will be swiped using the card reader that is linked to a database that contains information about the purchase that you have made using your credit card. The database application program would use your credit card number to check if the price of the item that you wish to purchase together with the total purchase that you have made that month is within your credit limit. Once the purchase is confirmed, the information about your recent purchased would be added to the database.

So, now, do you realise that so far you are part of the user of database systems? The database technology not only improves the daily operations of organisations but also the quality of decisions made. For instance, with the database systems, a supermarket can keep track of its inventory and sales in a very short time. This may lead to a fast decision in terms of making new orders of products. In this case, the products will always be available for the customers. Thus, the business may grow as customers satisfaction is always met. In other words, it would be an advantage to those who collect, manage and interpret information effectively in todays world.

1.1.1

Basic Concepts and Definitions

Now, notice that in the previous discussion, I sometimes use the word data, but sometimes I use the word information. Do you think that there is a difference between data and information? If your answer is yes, then, you are correct. But, what is the difference between data and information? What is data? Data is collection of unprocessed items that may consists of text, numbers, images and video (Shelly et. al. 2007). Today, data can be represented in various forms like sound, images and video. For instance, you can record your speech into a computer using the computers microphone. Images taken using a digital camera or scanned using a scanner can also be

TOPIC 1 INTRODUCTION TO DATABASE W

transferred into a computer. So, actually, there are so many different types of data around us. Can you name some other data that you might have used or produced before? Now, the next thing that we will discuss is that how can we make our data meaningful and useful? This can be done by processing it. What is information? Information refers to the data that have been processed in such a way that the knowledge of the person who uses the data is increased (Jeffrey et. al. 2007). For instance, the speech that you have recorded and images that you have stored in a computer could be used as part of your presentation using any of your presentation software. The speech may represent some definitions of the terms that are included in your presentation sides. Thus, by including it into your presentation, the recorded speech has more meaning and usefulness. The images could also be sent to your friends through electronic mails for them to view. What this means is that you have transformed the data that you have stored into information once you have done something with it. In other words, computers process data into information. In this course, we are concerned with the organisation of data and information and how it can be used in analysis and decision making. The more data and information that you have, the better your analysis and decision making would be. But, how can you store all these large volume of data and information? This is where a database comes in. The next section will discuss about the traditional file-based system and to examine its limitations, and also to understand why database systems are needed. SELF-CHECK 1.1 1. Define database system and explain one example where database system can be used in your daily life. 2. Name a software system that enables users to define, create, maintain, and control access to the database. 3. Name a shared collection of logically related data, and a description of this data, designed to meet the information needs of an organisation.

TOPIC 1 INTRODUCTION TO DATABASE

1.2

TRADITIONAL FILE-BASED SYSTEMS

Now, lets talk about the traditional file-based system.

1.2.1

File-based Approach

What is a file-based system? A file-based system is a collection of application programs that perform services for the end-users such as students reports for the academic office and lecturers report for the deans office. Each program defines and manages its own data (Connoly and Begg, 2005). Traditionally, manual files are being used to store all internal and external data within an organisation. These files are being stored in cabinets and for security purposes, the cabinets are locked or located in a secure area. When any information is needed, you may have to search starting from the first page until you found the information that you are looking for. To speed up the searching process, you may create an indexing system to help you locate the information that you are looking for quickly. You may have such system that store all your results or important documents. The manual filing system works well if the number of items stored is not large. However, this kind of system may fail if you want to do a cross-reference or process any of the information in the file. Then, computer-based data processing emerge and it replaces the traditional filing system with computer-based data processing system or file-based system. However, instead of having a centralised store for the organisations operational access, a decentralised approach was taken. In this approach, each department would have their own file-based system where they would monitor and control separately. Lets refer to the following example. (a) File processing system at Make-Believe real estate company Make-Believe real estate company has three departments, that are, Sales, Contract and Personnel. Each of these departments were physically located in the same building, but in separate floors, and each has its own file-based system. The function of the Sales department is to sell and rent properties. The function of the Contract department is handle the lease agreement associated with properties for rent. The function of the Personnel department is to store the information about the staff. Figure 1.1 illustrates the file-based system for Make-Believe real estate company. Each department has its own

TOPIC 1 INTRODUCTION TO DATABASE W

application program that handles similar operations like data entry, file maintenance and generation of reports.

Figure 1.1: File-based system for Make-Believe real estate company

By looking at Figure 1.1, we can see that the sales executive can store and retrieve information from the sales files through sales application programs. The sales files may consist of information regarding the property, owner and client. Figure 1.2 illustrates examples of the content of these three files. Figure 1.3 shows the content of the Contract files while Figure 1.4 is for the Personnel File. Notice that the client file in the sales and contract departments are the same. What this means is that duplication occurs when using decentralised file-based system.

TOPIC 1 INTRODUCTION TO DATABASE

Property File
Property No. PH01 Street 23 Jln Tepak 11/9 4-2, Perdana Apt City Shah Alam Subang Jaya Postcode 40000 Type House Room 4 Bathroom 3 Rent 1000 Owner No. OH01

PA01

41500

Apt

800

OA01

Owner File
Owner No. OH01 OA01 First Name Maria Lim Last Name Rahim Ah Ling Address 9, Jln Tungsten 7/19, 40000 Shah Alam 12, Jln Sari 9/11, 40000 Shah Alam Tel No. 03-55551234 03-55554321

Client File
Client No. PH01 PA01 First Name Salina Foo Last Name Aiman Hong Leong City Shah Alam Subang Jaya Tel No. 0355559876 0355555678 Preference Type 1 House House Preference Type 2 Apt Apt Max Rent 1000 800

Figure 1.2: The Property, Owner and Client files used by sales department

TOPIC 1 INTRODUCTION TO DATABASE W

Lease File
Lease No 20011 20012 Property No PH01 PA01 Client No PH01 PA01 Payment Method Visa Mastercard Deposit 500 600 paid Y N Rent Start 1/1/07 10/10/07 Rent Finish 1/1/08 31/10/08

Property_for-Rent File
propertyNo PH01 PA01 street 23 Jln Tepak 11/9 4-2, Perdana Apt city Shah Alam Subang Jaya postcode 40000 41500 rent 1000 800

Client File
Client No. CL01 CL02 First Name Salina Foo Last Name Aiman Hong Leong City Shah Alam Subang Jaya Tel No. 0355559876 0355555678 Preference Type 1 House House Preference Type 2 Apt Apt Max Rent 1000 800

Figure 1.3: The Lease, Property and Client files used by contract department

Personnel File
Personnel No First Name Last Name Date of Birth

Street

City

Postcode

Qualification

Start

Figure1.4: The Personnel file used by the personnel department

By referring to Figures 1.2, 1.3 and 1.4, we can see that a file is simply a collection of records while a record is a collection of fields and a field is a collection of alphanumeric characters. Thus, the Personnel file in Figure 1.4 consists of two records and each record consists of nine fields. Now, can you list the number of records and fields in the Client file as shown in Figure 1.3?

10 X

TOPIC 1 INTRODUCTION TO DATABASE

Now, lets discuss about the limitations of the file-based system that we have discusses earlier. No doubt, file-based systems proved to be a great improvement over manual filing system. But, a few problems still occur with this system, especailly, if the volume of the data and information increases.

1.2.2

Limitations of File-based Approach

What are the disadvantages or limitations of file-based system? Can you identify one? Well, actually, there exists several limitations associated with the conventional file-based system, and they are explained below. Separation and isolation of data Now, suppose that you want to match the requirements of your clients with the available properties? How are you going to do this? Basically, what you will have to do is go to the sales department and access the property and client files to match the requirements with the properties available. How are you going to access the information from these two files? Well, you can create a temporary file of those clients who have house as the preferred type and search for the available house from the property file. Then, you may create another temporary file of those clients who have apartment as the preferred type and do the searching again. The search would be more complex if you have to access to more than two files and from different departments. In other words, the separation and isolation of data would make the retrieval process time consuming. Duplication of data If you were to look back at Figures 1.2 and 1.3, you will notice that both the sales and contract departments have the property and client files. This duplication would waste time as the data would be entered twice even though in two different departments. The data may be entered incorectly which leads to different information from both departments. Besides that, more storage is being used and this can be associated with cost as extra storage is needed, meaning the cost will be increased. Another disadvantage of duplication of data is that there may be no consistency when updating the files. Suppose that the rental cost is being updated in the property file of the sales department but not in the contract department. Then, problems may occur as the client may be informed with two different costs. You can imagine the problem that may arise due to this. Program-Data dependence The physical structure of the files like the length of the text for each field is defined in the application program. Thus, if the property department decides to change the clients first name from ten characters to twenty

TOPIC 1 INTRODUCTION TO DATABASE W

11

characters, then, the file description of the first name for all the affected files need to be modified. What this means is that the length of the first name for the owner and client file in the property department need to be changed also. It is often difficult to locate all affected programs by such changes. Try to imagine if you have a lot of files in your file-based system and you may have to check each file for such modification, dont you think that this would be very time consuming? Limited data sharing By looking back at Figures 2 and 3, we can see that the contract department does not have the owner file as sales department. What this means is that if the contract department would like to access information regarding the owner of a property, no direct access is allowed. This request may need to go through the management of both departments, and again, the overall process may be time consuming. Now, after understanding the limitations of file-based system, lets discuss a solution to the above limitations. The answer would be introducing database system. This will be explained in the next section. SELF-CHECK 1.2 1. What is file-based system? 2. List two limitations of file-based system.

1.3

DATABASE APPROACH

How can database approach improve the limitations of file-based system? Can you identify at least one advantage of database approach compared to file-based approach? Well, the database approach emphasizes the integration and sharing of data throughout the organisation which means that all departments should be able to integrate and share the same data. The detail advantages of database approach is explained below. 1. Program-data independence With database approach, data descriptions are stored in a central location called the repository, separately from the application program. Thus, it allows an organisations data to change and evolve without changing the application programs that process the data. What this means is that the changing of data would be easier and faster.

12 X

TOPIC 1 INTRODUCTION TO DATABASE

2.

Planned data redundancy and improved data consistency Ideally, each data should be recorded in only one place in the database. Thus, a good database design would integrate redundant data files into a single logical structure. In this case, any updates of data would be easier and faster. In fact, we can avoid wasted storage space that results from redundant data storage. By controlling data redundancy, the data would also be consistent. Increased productivity of application development A database approach reduces the cost and time for developing new database applications. What this means is that with the same database, different applications can be developed. Thus, there is no need to design and develop a new database for different applications (Hoffer, et. al. 2007).

3.

1.3.1

The Database

Can you recall the definition of database? It is a shared collection of logically related data, and a description of this data, designed to meet the information needs of an organisation (Connoly and Begg, 2005). In other words, it is a large repository of data that can be used by many users at the same time. It is also defined as a self-describing collection of integrated records because it consists of a description of the data. The description of the data is called system catalog or data dictionary of metadata (Connoly and Begg, 2005). The database approach separates the structure of the data from the application programs and this approach is known as data abstraction. Thus, we can change the internal definition of an object in the database without affecting the users of the object, provided that the external definition remains the same. For instance, if we were to add a new field to a record or create a new file, then the existing applications are unaffected. More examples of this will be shown in the next Topic. Some other terms that you need to understand are entity, attribute and relationships. An entity is a specific object (for example a department, place, or event) in the organisation that is to be represented in the database. An attribute is a property that explains some characteristics of the object that we wish to record. A relationship is an association between entities (Connoly and Begg, 2005). Figure 1.5 illustrates an example of an EntityRelationship (ER) diagram for part of a department in an organisation.

TOPIC 1 INTRODUCTION TO DATABASE W

13

Figure 1.5: An example of Entity-Relationship diagram

By referring to Figure 1.5, we can see that it consists of two entities (the rectangles), that are, Department and Staff. It has one relationsip, that is, has, where it indicates that a department has many staffs. For each entity, there is one attribute, that is, Department No and StaffNo. In other words, the database holds data that is logically related. More explanations on this will be discussed in later Topics. SELF-CHECK 1.3 1. What is metadata?

2. Define entity, attribute and relationships.

1.3.2

The Database Management Systems (DBMS)

Now, what about DBMS? Can you recall the definition of DBMS from earlier explanation? A DBMS is a software that interacts with the users application programs and the database (Connoly and Begg, 2005). Initially, DBMSs provide efficient storage and retrieval of data. But, as the marketplace and innovation demands increase, DBMSs have evolved to provide broad range of features for data acquisition, storage, dissemination, maintenance, retrieval and formatting which make the DBMSs more complex. Let us now discuss in detail some of the common features of DBMS. Database definition In defining a database, the entities stored in tables (an entity is defined as a cluster of data usually about a single item or object that can be accessed) and relationships that indicate the connections among the tables must be specified. Most DBMSs provide several tools to define databases. The Structured Query Language (SQL) is an industry standard language supported by most DBMSs that can be used to define tables and

14 X

TOPIC 1 INTRODUCTION TO DATABASE

relationships among tables (Mannino 2001). More discussions on SQL will be in later Topics. Nonprocedural access The most important feature of DBMSs is the ability to answer queries. A query is a request to extract useful data. For instance, in a student DBMS where a few tables may have been defined, like personal information table and result table and a query might be a request to list the names of the students who will be graduating next semester. Nonprocedural access allows users to submit queries by specifying what parts of a database to retrieve (Mannino 2001). More discussions on queries will be in later Topics. Application development Most DBMSs provide graphical tools for building complete applications using forms and reports. For instance, data entry forms provide an easy way to enter and edit data. Report forms provide easy to view results of a query (Mannino 2001). Transaction processing Transaction processing allows a DBMSs to process large volumes of repetitive work. A transaction is a unit of job that should be processed continously without any interruptions from other users and without loss of data due to failures. An example of a transaction is making an airline reservation. The user does not know the details about the transaction processing other than the assurance that the process is reliable and safe (Mannino 2001). Database tuning include a few monitoring processing that could improve the performance. Utility programs can be used to reorganize a database, select physical structures for better performance and repair damaged parts of a database. This feature is important for DBMSs that support large databases with many simultaneous users and usually known as Enterprise DBMSs. On the other hand, desktop DBMSs run on personal computers and small servers that support limited transaction processing features usually use by small businesses (Mannino 2001).

TOPIC 1 INTRODUCTION TO DATABASE W

15

1.4

ROLES IN THE DATABASE ENVIRONMENT

Now, this section will explain the people involved in the DBMS environment. Basically, there are four types of people that are involved in the DBMS environment, that are, data and database administrators database designers application developers end-users Now, lets talk about them in detail. Data and database administrators The data and database administrators are those who manage the data resources in a DBMS environment. This include database planning, development and maintenance of standards, policies and procedures, and conceptual/logical database design where they work together with senior managers. In other words, some of their roles are as follows : production of proprietary and open-source technologies and databases on diverse platforms that must be managed simultaneously in many organisations; rapid growth in the size of databases; the expansion of applications that require linking corporate databases to the Internet. Database designers There exists two types of database designers, namely, logical database designer and physical database designer. The logical database designer is responsible to identify the data, relationships between the data and the constraints on the data that is to be stored in the database. He/she needs to have a thorough understanding of the organisations data. On the other hand, a physical database designer needs to decide how the logical database design can be physically developed. He or she is responsible to map the logical database design into a set of tables, selecting specific storage structures and access methods for the data to produce good performance and design the security measures needed for the data (Connoly and Begg 2005).

16 X

TOPIC 1 INTRODUCTION TO DATABASE

Application Developers An application developer is responsible to provide the required implementation for the end-users. Usually, an application developer works on the specification produced by the system analysts. The applications may be written in a third-generation or fourth-generation programming language. End-users The end-users are the customers for the database that have been designed to serve their information needs. End users can be categorized as naive users or sophisticated users. Naive users usually do not know much about DBMS where they would only use simple commands or select from a list of options provided by the application. On the other hand, sophisticated users usually have some knowledge about the structure and facilities offered by the DBMS. They would use high-level query language to retrieve their needs. Some may even write their own application programs. SELF CHECK 1.4 Who are the people involved in the database environment? Briefly explain their responsibilities.

The Database Management System (DBMS) is currently an important component of an information system and has changed the way many organisations operate. The predecessor to the DBMS was the file-based system where each program defines and manages its own data. Thus, data redundancy and data dependence become major problems. The database approach was introduced to resolve the problems with file-based system. All access to the database can be made through the DBMS. Some advantages of the database approach are control of data redundancy, data consistency, sharing of data and improvement of security and integrity. Some disadvantages are complexity, and cost.

TOPIC 1 INTRODUCTION TO DATABASE W

17

Data Database Database application Database system Database Management System (DBMS)

Entity File-based system Information Metadata Relationship SQL

Review Questions
1. Define each of the following key terms: a. Data b. Information c. Database d. Database application e. Database system f. Database Management System List two disadvantages of file-based systems. List two examples of database systems other than that have been discussed in this Topic. Discuss the main components of the DBMS environment and they are related to each other. Discuss the roles of the following personnel in the database environment: a. Database administrator b. Logical database designer c. Physical database designer d. Application developer e. End-user

2. 3.

4.

5.

18 X

TOPIC 1 INTRODUCTION TO DATABASE

Study the University Student Affairs case study presented below. In what ways would a DBMS help this organisation? What data can you identify that needs to be represented in the database? What relationships exist between the data items? Data requirements : Students Student identification number First and last name Home address Date of birth Sex Semester of study Nationality Program of study Recent Cumulative Grade Point average (CGPA) College (A college is an accommodation provided for the students. Each college in the university has the following information) College name College address College office number College manager Number of rooms Room number Sample query transactions List the names of students who are staying in the colleges List the number of empty rooms in the colleges List the names of students within specific CGPA

TOPIC 1 INTRODUCTION TO DATABASE W

19

Connoly, T. & Begg, C. (2005). Database systems: A practical approach to design, implementation, and management, (4th ed.). Harlow: Addison Wesley. About.com: Databases (n.d.). Retrieved December 29, 2009, from http://databases.about.com/ Hoffer, J., Prescott, M. & McFadden, F. (2007). Modern database management (8th ed.). Saddle River, NJ: Prentice-Hall. Mannino, M. V. (2001). Database: Application development & design. New York: McGraw-Hill.

You might also like