The Era of Big Data: Databases, Information Systems, & Artificial Intelligence
The Era of Big Data: Databases, Information Systems, & Artificial Intelligence
The Era of Big Data: Databases, Information Systems, & Artificial Intelligence
Databases,
Information Systems,
& Artificial Intelligence
Chapter 8
© 2015 by McGraw-Hill Education. This proprietary material solely for authorized instructor use. Not authorized for sale or distribution in any manner. This document may not be copied, scanned, duplicated, forwarded, distributed, or posted on a website, in whole or part.
Chapter Topics
UNIT8A:
UNIT 8A:Files
Files&&Databases
Databases
8.1 Managing Files: Basic Concepts
8.1 Managing Files: Basic Concepts
8.2 Database Management Systems
8.2 Database Management Systems
8.3 Database Models
Using Information Technology, 11e
3
© 2015 by McGraw-Hill Education. This proprietary material solely for authorized instructor use. Not authorized for sale or distribution in any manner. This document may not be copied, scanned, duplicated, forwarded, distributed, or posted on a website, in whole or part.
Using Information Technology, 11e
4
© 2015 by McGraw-Hill Education. This proprietary material solely for authorized instructor use. Not authorized for sale or distribution in any manner. This document may not be copied, scanned, duplicated, forwarded, distributed, or posted on a website, in whole or part.
• A database is a logically organized collection of related data
designed and built for a specific purpose.
• Data is stored hierarchically for easier storage and retrieval.
Using Information Technology, 11e
5
© 2015 by McGraw-Hill Education. This proprietary material solely for authorized instructor use. Not authorized for sale or distribution in any manner. This document may not be copied, scanned, duplicated, forwarded, distributed, or posted on a website, in whole or part.
Data Storage
Hierarchy
Using Information Technology, 11e
6
© 2015 by McGraw-Hill Education. This proprietary material solely for authorized instructor use. Not authorized for sale or distribution in any manner. This document may not be copied, scanned, duplicated, forwarded, distributed, or posted on a website, in whole or part.
A key field (primary key) is a field (or fields) in a record that holds
unique data that identifies that record from all the other records in
the table and in the database.
Using Information Technology, 11e
Systems
8
© 2015 by McGraw-Hill Education. This proprietary material solely for authorized instructor use. Not authorized for sale or distribution in any manner. This document may not be copied, scanned, duplicated, forwarded, distributed, or posted on a website, in whole or part.
Database Management System (DBMS): software that enables users to
store, modify, and extract information from a database
• DBMS benefits:
• Reduced data redundancy (redundant data is stored in multiple places, which
Using Information Technology, 11e
areas that can affect sales and marketing managers’ decisions as well as the decisions of
administrators of educational institutions, hospitals, and other organizations
• Increased security—Although various departments may share data, access to specific
information can be limited to selected users—called authorization control.
10
© 2015 by McGraw-Hill Education. This proprietary material solely for authorized instructor use. Not authorized for sale or distribution in any manner. This document may not be copied, scanned, duplicated, forwarded, distributed, or posted on a website, in whole or part.
3 Principal Database Components
• Data Dictionary
• Repository that stores the data definitions and descriptions of the structure of the
data and the database
Using Information Technology, 11e
• DBMS Utilities
• Programs that allow you to maintain the database by creating, editing, deleting data,
records, and files
• Also include automated backup and recovery
• Report Generator
• Program for producing on-screen or printed readable documents from all or part of a
database
11
© 2015 by McGraw-Hill Education. This proprietary material solely for authorized instructor use. Not authorized for sale or distribution in any manner. This document may not be copied, scanned, duplicated, forwarded, distributed, or posted on a website, in whole or part.
Database Administrator (DBA)
• Coordinates all related activities and needs for an organization’s
database
Using Information Technology, 11e
12
© 2015 by McGraw-Hill Education. This proprietary material solely for authorized instructor use. Not authorized for sale or distribution in any manner. This document may not be copied, scanned, duplicated, forwarded, distributed, or posted on a website, in whole or part.
Using Information Technology, 11e
13
© 2015 by McGraw-Hill Education. This proprietary material solely for authorized instructor use. Not authorized for sale or distribution in any manner. This document may not be copied, scanned, duplicated, forwarded, distributed, or posted on a website, in whole or part.
A database model determines the information a database will contain and
how it will be used and how the items in the database relate to one another.
Hierarchical Database
• Fields or records are arranged in related groups resembling a family tree
Using Information Technology, 11e
14
© 2015 by McGraw-Hill Education. This proprietary material solely for authorized instructor use. Not authorized for sale or distribution in any manner. This document may not be copied, scanned, duplicated, forwarded, distributed, or posted on a website, in whole or part.
Hierarchical Database
Using Information Technology, 11e
15
© 2015 by McGraw-Hill Education. This proprietary material solely for authorized instructor use. Not authorized for sale or distribution in any manner. This document may not be copied, scanned, duplicated, forwarded, distributed, or posted on a website, in whole or part.
Network Database: created to represent a more complex data
relationship effectively, improve database performance, and impose
a database standard.
Using Information Technology, 11e
16
© 2015 by McGraw-Hill Education. This proprietary material solely for authorized instructor use. Not authorized for sale or distribution in any manner. This document may not be copied, scanned, duplicated, forwarded, distributed, or posted on a website, in whole or part.
Network
Database
Using Information Technology, 11e
17
© 2015 by McGraw-Hill Education. This proprietary material solely for authorized instructor use. Not authorized for sale or distribution in any manner. This document may not be copied, scanned, duplicated, forwarded, distributed, or posted on a website, in whole or part.
• Relational Database: grew out of the hierarchical and network
database models
• Relates or connects data in different files through the use of primary
Using Information Technology, 11e
19
© 2015 by McGraw-Hill Education. This proprietary material solely for authorized instructor use. Not authorized for sale or distribution in any manner. This document may not be copied, scanned, duplicated, forwarded, distributed, or posted on a website, in whole or part.
Relational Database (continued)
• Users employ SQL (structured query language) to create, modify, maintain, and
query the database
• Query by Example uses sample record forms to allow users to define the
Using Information Technology, 11e
20
© 2015 by McGraw-Hill Education. This proprietary material solely for authorized instructor use. Not authorized for sale or distribution in any manner. This document may not be copied, scanned, duplicated, forwarded, distributed, or posted on a website, in whole or part.
Object-Oriented Database
• Uses “objects,” software written in small, reusable chunks, as elements
within data files
• An object consists of:
Using Information Technology, 11e
21
© 2015 by McGraw-Hill Education. This proprietary material solely for authorized instructor use. Not authorized for sale or distribution in any manner. This document may not be copied, scanned, duplicated, forwarded, distributed, or posted on a website, in whole or part.
Multidimensional Database
• Models data as facts, dimensions, or numerical
answers for use in the interactive analysis of large
amounts of data for decision-making purposes
Using Information Technology, 11e
22
© 2015 by McGraw-Hill Education. This proprietary material solely for authorized instructor use. Not authorized for sale or distribution in any manner. This document may not be copied, scanned, duplicated, forwarded, distributed, or posted on a website, in whole or part.
Brief Database Model Overview
Hierarchical database Fields or records are arranged in a family tree, with child records subordinate to
parent or higher-level records
Using Information Technology, 11e
Network database Like a hierarchical database, but each child record can have more than one
parent record
Relational database Relates, or connects, data in different files (tables) through the use of a key, or
common data element
Object-oriented database Uses objects (software written in small, reusable chunks) as elements within
database files; multimedia
Multidimensional database Models data as facts, dimensions, or numerical measures for use in the
interactive analysis of large amounts of data
23
© 2015 by McGraw-Hill Education. This proprietary material solely for authorized instructor use. Not authorized for sale or distribution in any manner. This document may not be copied, scanned, duplicated, forwarded, distributed, or posted on a website, in whole or part.
Using Information Technology, 11e
24
© 2015 by McGraw-Hill Education. This proprietary material solely for authorized instructor use. Not authorized for sale or distribution in any manner. This document may not be copied, scanned, duplicated, forwarded, distributed, or posted on a website, in whole or part.
Data mining is the computer-assisted process of sifting through and analyzing
vast amounts of data to extract hidden patterns and meaning and to discover new
knowledge.
• Data is fed into a data warehouse through the following steps:
Using Information Technology, 11e
25
© 2015 by McGraw-Hill Education. This proprietary material solely for authorized instructor use. Not authorized for sale or distribution in any manner. This document may not be copied, scanned, duplicated, forwarded, distributed, or posted on a website, in whole or part.
Data Mining
Using Information Technology, 11e
26
© 2015 by McGraw-Hill Education. This proprietary material solely for authorized instructor use. Not authorized for sale or distribution in any manner. This document may not be copied, scanned, duplicated, forwarded, distributed, or posted on a website, in whole or part.
Methods for searching for patterns in the data and interpreting the
results
• Regression analysis
Using Information Technology, 11e
• Develops mathematical formula to fit patterns in the data that has been extracted
• Formula is then applied to other data sets of the same type to predict future trends
• Classification analysis
• Statistical pattern-recognition process that is applied to data sets with more than just
numerical data
27
© 2015 by McGraw-Hill Education. This proprietary material solely for authorized instructor use. Not authorized for sale or distribution in any manner. This document may not be copied, scanned, duplicated, forwarded, distributed, or posted on a website, in whole or part.
UNIT 8B: Big Data, Information Systems, & Artificial
Intelligence
Using Information Technology, 11e
Big Data aims to tap all the web data and other data that is
outside corporate databases. Big Data typically means applying
the tools of artificial intelligence to vast new sources of data
beyond that captured in standard databases. The new data
sources include web-browsing data trails, social network
communications, sensor data, and surveillance data.
28
© 2015 by McGraw-Hill Education. This proprietary material solely for authorized instructor use. Not authorized for sale or distribution in any manner. This document may not be copied, scanned, duplicated, forwarded, distributed, or posted on a website, in whole or part.
8.5 The Evolving World of
Using Information Technology, 11e
Big Data
29
© 2015 by McGraw-Hill Education. This proprietary material solely for authorized instructor use. Not authorized for sale or distribution in any manner. This document may not be copied, scanned, duplicated, forwarded, distributed, or posted on a website, in whole or part.
Three Implications of Big Data:
1. Big Data derives from a bundle of old & new data sources, both old
and new—web pages, sensor signals, GPS location data from smartphones,
browsing habits, genetic information, and surveillance videos. To make
Using Information Technology, 11e
down to the cellular level. In business management, for example, new kinds
of measurement could replace old ideas, organizations, and ways of
thinking about the world.
© 2015 by McGraw-Hill Education. This proprietary material solely for authorized instructor use. Not authorized for sale or distribution in any manner. This document may not be copied, scanned, duplicated, forwarded, distributed, or posted on a website, in whole or part.
3. Big data could lead to better decision making: Not only can
data-driven insights be used to make sense of incredibly complex
situations, Big Data “can help compensate for our
Using Information Technology, 11e
© 2015 by McGraw-Hill Education. This proprietary material solely for authorized instructor use. Not authorized for sale or distribution in any manner. This document may not be copied, scanned, duplicated, forwarded, distributed, or posted on a website, in whole or part.
Uses of Big Data:
• Big Data is finding major uses in medical research, marketing,
politics, and even entertainment programming, to name just a few
Using Information Technology, 11e
areas.
© 2015 by McGraw-Hill Education. This proprietary material solely for authorized instructor use. Not authorized for sale or distribution in any manner. This document may not be copied, scanned, duplicated, forwarded, distributed, or posted on a website, in whole or part.
8.6 Information Systems in
Using Information Technology, 11e
Organizations
Using Databases to Help Make Decisions
34
© 2015 by McGraw-Hill Education. This proprietary material solely for authorized instructor use. Not authorized for sale or distribution in any manner. This document may not be copied, scanned, duplicated, forwarded, distributed, or posted on a website, in whole or part.
An information system is a combination of people, hardware,
software, communication devices, and databases that processes data
and information for a specific purpose.
Using Information Technology, 11e
35
© 2015 by McGraw-Hill Education. This proprietary material solely for authorized instructor use. Not authorized for sale or distribution in any manner. This document may not be copied, scanned, duplicated, forwarded, distributed, or posted on a website, in whole or part.
Most organizations have 6 departments within which information
must flow, horizontally:
• Research and development
Using Information Technology, 11e
• Production (operations)
• Marketing and sales
• Accounting and finance
• Human resources (personnel)
• Information systems (IS)
36
© 2015 by McGraw-Hill Education. This proprietary material solely for authorized instructor use. Not authorized for sale or distribution in any manner. This document may not be copied, scanned, duplicated, forwarded, distributed, or posted on a website, in whole or part.
Besides the 6 departments, many organizations also have 3 levels of
management, where information flows vertically:
• Strategic-level management
• Top managers (CEOs, COOs, CFOs, CIOs) concerned with long-term, or
Using Information Technology, 11e
37
© 2015 by McGraw-Hill Education. This proprietary material solely for authorized instructor use. Not authorized for sale or distribution in any manner. This document may not be copied, scanned, duplicated, forwarded, distributed, or posted on a website, in whole or part.
• A Newer Information Flow: Decentralized Organizations
• The pyramid management structure is flattened somewhat as employees are
given more authority to make day-to-day decisions.
• Employees increasingly linked to a central database.
Using Information Technology, 11e
38
© 2015 by McGraw-Hill Education. This proprietary material solely for authorized instructor use. Not authorized for sale or distribution in any manner. This document may not be copied, scanned, duplicated, forwarded, distributed, or posted on a website, in whole or part.
Computer-based information systems: information systems that are a
combination of hardware, software, and telecommunications networks
that people build and use to collect, create, and distribute data.
Using Information Technology, 11e
40
© 2015 by McGraw-Hill Education. This proprietary material solely for authorized instructor use. Not authorized for sale or distribution in any manner. This document may not be copied, scanned, duplicated, forwarded, distributed, or posted on a website, in whole or part.
OIS
Using Information Technology, 11e
41
© 2015 by McGraw-Hill Education. This proprietary material solely for authorized instructor use. Not authorized for sale or distribution in any manner. This document may not be copied, scanned, duplicated, forwarded, distributed, or posted on a website, in whole or part.
2. Transaction Processing System (TPS)
• Transactions are recorded events of routine business activities, such as
bills, orders, and inventory
• TPS systems keep track of the transactions needed to conduct a business
Using Information Technology, 11e
• Features of a TPS:
• Input and output: transaction data
• For operational (low-level) managers
• Produces detail reports (specific information about routine activities)
• One TPS for each department
• Basis for management information systems (MIS) and decision support systems
(DSS)
42
© 2015 by McGraw-Hill Education. This proprietary material solely for authorized instructor use. Not authorized for sale or distribution in any manner. This document may not be copied, scanned, duplicated, forwarded, distributed, or posted on a website, in whole or part.
3. Management Information System (MIS)
• Computer-based information system that uses data recorded by a TPS as
input to programs that produce routine reports as output
• Features
Using Information Technology, 11e
• Inputs are processed transaction data; outputs are summarized, structured reports
• Designed for tactical (mid-level) managers
• Draws from all departments
• Produces several kinds or reports: summary, exception, periodic, and demand
43
© 2015 by McGraw-Hill Education. This proprietary material solely for authorized instructor use. Not authorized for sale or distribution in any manner. This document may not be copied, scanned, duplicated, forwarded, distributed, or posted on a website, in whole or part.
4. Decision Support System (DSS)
• Computer information system that provides a flexible tool for analysis
and helps management focus on the future
• Features
Using Information Technology, 11e
• Inputs are external data and internal data such as summarized reports and processed
transaction data; outputs are demand reports from top managers
• Assists tactical (mid-level) managers in decision making
• Produces analytic models
• Developed to support the types of decisions faced by managers in
specific industries
44
© 2015 by McGraw-Hill Education. This proprietary material solely for authorized instructor use. Not authorized for sale or distribution in any manner. This document may not be copied, scanned, duplicated, forwarded, distributed, or posted on a website, in whole or part.
5. Executive Support System
• Easy-to-use DSS made especially for strategic (top-level) managers
to support strategic decision making
•
Using Information Technology, 11e
45
© 2015 by McGraw-Hill Education. This proprietary material solely for authorized instructor use. Not authorized for sale or distribution in any manner. This document may not be copied, scanned, duplicated, forwarded, distributed, or posted on a website, in whole or part.
Using Information Technology, 11e
47
© 2015 by McGraw-Hill Education. This proprietary material solely for authorized instructor use. Not authorized for sale or distribution in any manner. This document may not be copied, scanned, duplicated, forwarded, distributed, or posted on a website, in whole or part.
Using Information Technology, 11e
48
© 2015 by McGraw-Hill Education. This proprietary material solely for authorized instructor use. Not authorized for sale or distribution in any manner. This document may not be copied, scanned, duplicated, forwarded, distributed, or posted on a website, in whole or part.
Artificial intelligence is the branch of computer science concerned with
making computers behave like humans.
• Two approaches to AI are conventional AI, based on machine learning, and
computational intelligence, based on experimental and trial-and-error methods.
Using Information Technology, 11e
• Pattern recognition
• Virtual reality and simulation devices
• Robotics
• Fuzzy logic
• Neural networks
51
© 2015 by McGraw-Hill Education. This proprietary material solely for authorized instructor use. Not authorized for sale or distribution in any manner. This document may not be copied, scanned, duplicated, forwarded, distributed, or posted on a website, in whole or part.
Expert Systems
• Built by knowledge engineers
• Include surface knowledge and deep knowledge
Using Information Technology, 11e
52
© 2015 by McGraw-Hill Education. This proprietary material solely for authorized instructor use. Not authorized for sale or distribution in any manner. This document may not be copied, scanned, duplicated, forwarded, distributed, or posted on a website, in whole or part.
Expert
System
Using Information Technology, 11e
53
© 2015 by McGraw-Hill Education. This proprietary material solely for authorized instructor use. Not authorized for sale or distribution in any manner. This document may not be copied, scanned, duplicated, forwarded, distributed, or posted on a website, in whole or part.
Natural language processing
• Allows users to interact with a system using normal language
• The study of ways for computers to recognize and understand human
language
Using Information Technology, 11e
Intelligent agents
• A form of software with built-in intelligence that monitors work patterns,
asks questions, and performs work tasks on your behalf; shop bots are
intelligent agents
54
© 2015 by McGraw-Hill Education. This proprietary material solely for authorized instructor use. Not authorized for sale or distribution in any manner. This document may not be copied, scanned, duplicated, forwarded, distributed, or posted on a website, in whole or part.
Pattern recognition
• Involves a camera and software that identify recurring visual patterns by
mapping them against similar patterns stored in a database (e.g., visual
surveillance and ID of suspicious people)
Using Information Technology, 11e
55
© 2015 by McGraw-Hill Education. This proprietary material solely for authorized instructor use. Not authorized for sale or distribution in any manner. This document may not be copied, scanned, duplicated, forwarded, distributed, or posted on a website, in whole or part.
Robotics
• The development and study of machines that can perform actions that
are normally done by people
• Robots grouped by locomotion system: grouped according to their means
Using Information Technology, 11e
© 2015 by McGraw-Hill Education. This proprietary material solely for authorized instructor use. Not authorized for sale or distribution in any manner. This document may not be copied, scanned, duplicated, forwarded, distributed, or posted on a website, in whole or part.
Neural network: consists of a network of processors that are
interconnected in a way that is similar to the connections
between neurons, or nerve cells, in the human body. The neural
Using Information Technology, 11e
© 2015 by McGraw-Hill Education. This proprietary material solely for authorized instructor use. Not authorized for sale or distribution in any manner. This document may not be copied, scanned, duplicated, forwarded, distributed, or posted on a website, in whole or part.
8.8 Artificial Life, the Turing
Using Information Technology, 11e
© 2015 by McGraw-Hill Education. This proprietary material solely for authorized instructor use. Not authorized for sale or distribution in any manner. This document may not be copied, scanned, duplicated, forwarded, distributed, or posted on a website, in whole or part.
Artificial intelligence leads to the question of how can we know a machine is
truly intelligent, which figures in the Turing test.
Turing Test: In 1950 Allen Turing predicted computers would eventually be
able to mimic human thinking.
Using Information Technology, 11e
60
© 2015 by McGraw-Hill Education. This proprietary material solely for authorized instructor use. Not authorized for sale or distribution in any manner. This document may not be copied, scanned, duplicated, forwarded, distributed, or posted on a website, in whole or part.
Smarter-Than-Human Computers
The Singularity
• A moment when humans would have created self-aware, smarter-than-human
machines capable of designing computers and robots that are better than humans
Using Information Technology, 11e
61
© 2015 by McGraw-Hill Education. This proprietary material solely for authorized instructor use. Not authorized for sale or distribution in any manner. This document may not be copied, scanned, duplicated, forwarded, distributed, or posted on a website, in whole or part.
Ethics in A.I.
• Ethics underlies everything having to do with AI.
• Computer software is subtly shaped by the ethical judgments and
assumptions of its creators; there is no human-values-free / bias-free
Using Information Technology, 11e
software.
• Will AI cause humans to lose control of computer systems?
62
© 2015 by McGraw-Hill Education. This proprietary material solely for authorized instructor use. Not authorized for sale or distribution in any manner. This document may not be copied, scanned, duplicated, forwarded, distributed, or posted on a website, in whole or part.
Databases: Concerns about Privacy & Identity Theft
• Databases have facilitated loss of privacy and identity theft, which have
Using Information Technology, 11e
© 2015 by McGraw-Hill Education. This proprietary material solely for authorized instructor use. Not authorized for sale or distribution in any manner. This document may not be copied, scanned, duplicated, forwarded, distributed, or posted on a website, in whole or part.