Effective Alarm Management For Dynamic and Vessel Control Systems
Effective Alarm Management For Dynamic and Vessel Control Systems
Steve Savoy
Operations Session
OPERATIONS SESSION
11 Oct 2010
Steve Savoy
Operations Session
1. Introduction
Marine control systems are always fitted with an alarming function. Most of the existing systems
have tabular alarm displays that often do not provide operators with a meaningful interface. The
existing alarm management guidelines mainly address applications for process control and aviation.
These same techniques can be applied to marine control systems even as these systems normally do
not have the large number of alarms that are associated with these processes. This paper will outline
these same techniques showing how they can be used to improve operator interfaces for marine
systems.
2. Alarm Management
An alarm is defined as a mechanism for informing an operator of an abnormal process condition for
which an operator action is required. The operator is alerted in order to prevent or mitigate process
upsets and disturbances1 Alarms must be used for events that require operator action. The operator
action is not just acknowledgement but also a response to the event. Alarms provide an indication of a
system fault, required action, degraded system operation or compromised operating capabilities.
Alarm management is the processes and practices for determining, documenting, designing,
monitoring, and maintaining alarm messages1. Good alarm management leads to more efficient
operation, better operator situation awareness and mitigation of system faults. The process starts with
a defined alarm philosophy.
3. Alarm History
Almost all major industrial accidents have had an alarm component as a contributing cause. Poor
alarm management has been identified as one of the major causal factors.
The beginning of basic alarm management started during WWII. During this time period airplanes
were rapidly evolving so that the pilots were faced with more instrumentation and less time to react in
an abnormal situation. The military started to develop instrumentation and displays that provided
better pilot situational awareness by the use of better displays.
In 1979 the Three Mile Island nuclear power plant accident had poor alarm displays as a contributing
factor. The Presidents Commission said The Control Panel is huge, with hundreds of alarms, and
there are some key indicators placed in locations where the operator cannot see them2. After this
accident the nuclear power industry began to improve on operator displays and console layouts. This
was a result of studying the human factors of plant operation. Factors such as maximum number of
alarms that an operator can process, prioritizing and grouping of critical related alarms allow
operators to better handle abnormal situations.
As a result of this and other catastrophic accidents, the process industry has developed guidelines and
standards over the past 30+ years. The first alarm displays were lighted mimic panels where each
lighted square indicated an alarm condition. In the 1960 the Instrument Society of America
11 Oct 2010
Steve Savoy
Operations Session
(International Society of Automation) (ISA) released ISA_RP18 Specifications and Guides for the
Use of General Purpose Annunciators. This was followed by ISA-18.1-1979 Annunciator Sequences
and Specifications. The latest evolution of this document is ANSI/ISA-18.2-2009 Management of
Alarm Systems for the Process Industries. Another revision of this document is planned to be
released in October of 2010.
The Abnormal Situation Management Consortium was formed in 1994. The consortiums purpose
was to study a number of plant incidents where alarms had been identified as a causal factor. The
goal was to develop a better response to alarms. The Engineering Equipment and Materials Users
Association issued Publication 191, Alarm Systems: A Guide to Design, Management and
Procurement. This was one of the first comprehensive documents on alarm management.
4. Marine Regulatory
The marine regulatory documents give some guidance to alarm system implementation. The ABS
Guidance Notes on Ergonomic Design of Navigation Bridges 2003 has a section on alarms. It
recommends that alarms be prioritized, grouped properly, nuisance ones avoided and ambiguities
eliminated. The majority of the alarm references in the ship or MODU classification sections specify
required equipment alarm but they provide little guidance on alarm management.
5. Alarm Management
There are formalized steps for implementing alarm management for any system. The process starts
with a definition document. There are a number of common steps to implement an alarm management
system.
4.1 Develop Alarm Philosophy
This process develops a guidance and specification document that defines overall design and
implementation of an alarm management system. It should address alarm selection and
justification. It serves as a design specification document. It defines operator interfaces and
response, classes of alarms that are to be prioritized, and rationalization of all alarms with the
alarm philosophy. Many of these alarms would be defined in the system design specifications.
4.2 Rationalization and Documentation
This step defines all alarms in detail. Each alarm should be examined for consistency and
priority. System conditions that generate the alarm should be defined. This step would generate
an alarm list or database with all information related to each alarm. There should also be FMEA
information incorporated into the list.
11 Oct 2010
Steve Savoy
Operations Session
11 Oct 2010
Steve Savoy
Operations Session
11 Oct 2010
Steve Savoy
Operations Session
8.3. Prioritization
There should be a prioritization or hierarchy that lends itself to grouping. This will give the operator
a good display so critical alarms can be identified separately from less serious ones.
8.4. Minimize Nuisance
A good rationalization process should be able to minimize these types of alarms. Good equipment
maintenance or set points will keep measured parameters correct so that these alarms are not
generated.
8.5. Minimize Floods
This is more difficult to control since system upsets can cause other valid alarms. An example would
be thruster alarms generated in response to a faulty thruster. A properly functioning thruster can
generate alarms due to the fact that it cannot azimuth or meet power requirements causing compare
alarms.
9. Dynamic Positioning and VMS Alarm Systems
Presently most systems have one alarm list presented in time sequence. Normally alarm displays are a
small portion of the the overall operator HMI displays. Most systems have an alarm history page(s) that
can display all events for over a day or more. Data Loggers can display events for longer time periods but
these are not fully utilized for alarm management.
Alarms can be due to an actual event e.g. a motor temperature or the failure of a portion of the control
system in which multiple alarms are generated.
9.1. General DP alarm groups
Dynamic positioning systems lend themselves to natural groupings according to function. The groups
can be classed as follows;
Power -All alarms relate to power generation and power management functions.
Thrusters
Hardware failure - Alarms related to the hardware control system failures e.g. azimuth or
power drive systems.
Operational limits - Alarms that alert the operator of azimuth or a power compare
parameter that may be out of bounds.
11 Oct 2010
Steve Savoy
Operations Session
Sensors - Alarms generated by sensor systems including hardware failures, control parameter
limits or median test.
Consequence - Alarms generated by the system consequence analysis function. These types warn
of improper power configuration, insufficient power in the event of a failure, etc.
Control - Alarms related to the control system function. These would include position parameter
limit alarms.
Hardware - These are alarms related to a hardware failure of a device e.g. loss of power or invalid
data from a sensor.
Network Data Communications - Alarms generated by communication errors or controller
failures.
9.2. Analysis of DP System Alarms
A typical system alarm list can be analyzed for the number of percentages of alarms in each of the
general groups. This may give guidance as to possible grouping for HMI displays. By segregating
the system alarms into two or more groups for display will give the operator better differentiation of
alarm displays. A typical system has the following percentages of alarm groups;
Consequence
Control
Hardware
Power
Sensors
Thrusters
2%
12%
20%
6%
42%
20%
If the above alarms are ranked by severity of 1 to 3, with 1 being most critical then the following numbers
are generated;
Priority 1
Priority 2
Priority 3
12%
70%
18%
In this case only 12% of the alarms are ranked as critical e.g. position excursion, exceeding watch circles
or insufficient power available. Normally these would occur after multiple priority 2 alarms occur.
11 Oct 2010
Steve Savoy
Operations Session
20
40
60
80
100
120
140
The following graph shows the number of alarms per day over one month. The large spike during
the 20-June-10 date is due to a rig move where the acoustics system was being setup as well as
additional equipment testing.
Number of Alarms
11 Oct 2010
24-Jun-10
22-Jun-10
20-Jun-10
18-Jun-10
16-Jun-10
14-Jun-10
12-Jun-10
10-Jun-10
8-Jun-10
6-Jun-10
4-Jun-10
2-Jun-10
31-May-10
29-May-10
27-May-10
25-May-10
23-May-10
Steve Savoy
Operations Session
11 Oct 2010
Steve Savoy
Operations Session
00:00
T4 Azimuth Compare
00:04
Heading Excursion
00:13
Position Excursion
00:53
00:53
00:55
01:16
01:49
03:06
T4 Offline
03:09
03:18
03:18
03:18
03:29
T3 Thrust Compare
03:41
03:41
04:01
T4 Azimuth Alarm
04:12
T4 Offline
04:17
04:55
Heading Excursion
05:17
Position Excursion
05:29
06:35
07:38
11 Oct 2010
Steve Savoy
09:11
Operations Session
11 Oct 2010
Steve Savoy
Operations Session
References
1
11 Oct 2010