Incident Report Assignment SAW AUNG THU HEIN
Incident Report Assignment SAW AUNG THU HEIN
Incident Report Assignment SAW AUNG THU HEIN
2. Executive Summary
3. Impact of Incident
1
b. Others
Describe other possible implications, e.g., financial and legal, that the incident will / may have on
the institution.
As data integrity is considered a higher priority than availability, the storage system is
designed to automatically cease communicating under these conditions. In doing so, the
system preserved full data integrity.
In spite of the machine’s high availability and redundancy, these incorrect procedures
caused the outage.
Chronology of Events
One Solutions Pte Ltd determined that a repeated failure to apply the correct procedure
when addressing instability in the communications link of the storage subsystem resulted
in the service outage on 5 July 2019.
One Solutions Pte Ltd’s immediate priority was to ensure that customer data was not in
any way compromised while services were being restored as quickly as possible. BAN
BANK' services were restored the same morning with full and complete data integrity.
Prior to the outage, the following events took place:
3 July 2019, The cable in question was replaced. The One Solutions Pte Ltd field
7.50pm engineer did not use the machine’s maintenance interface but used the
instructions given by the support centre. Although this was done using an
incorrect step, the error message ceased. The storage system was still
functioning.
2
4 July 2019, The error message reappeared. This time, it indicated instability in the cable
2.55pm and associated electronic cards. The One Solutions Pte Ltd field engineer
was despatched for the second time to the data centre. He diagnosed and
escalated the issue to the regional One Solutions Pte Ltd support centre.
4 July 2019, Based on instructions from the regional One Solutions Pte Ltd support
5.16pm centre, the cable was removed for inspection and reseated, using the same
incorrect step. The error message ceased. The storage system continued
functioning.
4 July 2019, The error message reappeared. Over the next five hours and 22 minutes, the
6.14pm regional One Solutions Pte Ltd support centre analysed the log from the
machine and recommended to the field engineer that he unplug the cable
and check for a bent pin. The storage system continued functioning.
4 July 2019,
The One Solutions Pte Ltd field engineer did not find a bent pin and
11.38pm
reseated the cable. The error message persisted. The storage system was
still functioning and able to communicate with the mainframe. The
regional One Solutions Pte Ltd support centre and the One Solutions Pte
Ltd field engineer continued diagnosing the issue, including reseating the
cable for a second time.
• Subsequently, BAN BANK was contacted and authorised a cable change
at 2.50am, a quiet period, which is standard operating procedure. While
waiting to replace the cable, the One Solutions Pte Ltd field engineer
decided to inspect the cable again to ensure that it was not defective and that
it was installed properly. He then unplugged the cable for inspection using
the previous incorrect procedure recommended by the regional One
Solutions Pte Ltd support centre.
5 July 2019,
The cable was replaced using the same procedures. This caused errors that
2.58am
threatened data integrity. As a result, the storage system ceased
communicating in order to protect the data.
At this point, BAN BANK banking services were disrupted.
If the correct procedures had been used, the storage system would have
automatically suspended the communications link and the machine would
have instructed the engineer to replace the cable and both cards together
and maintain redundancy of the system.
As data integrity is considered a higher priority than availability, the
storage system is designed to automatically cease communicating under
these conditions. In doing so, the system preserved full data integrity.
3
In spite of the machine’s high availability and redundancy, these incorrect
procedures caused the outage.
5. Conclusion
4
Declaration
1. I declare that all information given in this report and in the attached annexes (if any) are
true and accurate.
Signature
Name of Approver
Date