How To Build A Disaster Recovery Plan: Best Practices, Templates and Tools
How To Build A Disaster Recovery Plan: Best Practices, Templates and Tools
How To Build A Disaster Recovery Plan: Best Practices, Templates and Tools
EXECUTIVE SUMMARY
How do you start building a DR plan? While there are lots of tools from
vendors, it's hard to find a practical approach that comes from first-hand
experience.
No amount of money or planning can stop some IT disasters from happening. But a good disaster
recovery plan can reduce your downtime from a week or a day to hours or even minutes.
Like any important project, DR starts with planning, followed by best-practice templates and procedures, which in turn are
implemented, in part, by the right tools.
In addition to identifying mission-critical applications and any infrastructure they rely on, you should also identify the data
these applications and tasks need to have access to.
This can include recent email, customer databases, and any documents, spreadsheets, presentations and other
"unstructured" files used by project/product management, development, sales, manufacturing, etc.
Your company has accumulated a substantial amount of data over time hundreds of gigabytes, perhaps terabytes or even
petabytes. But only some often a small fraction of this data has to be made available again quickly.
DR Planning 101
What Causes IT Disasters
The cause of an IT disaster may be small and specific. A power supply, CPU, network interface card, RAM, fan, or other component
on an individual server may fail. A brief power fluctuation may scramble data or disrupt a program's activity.
An entire data center going down is rare, but can happen. Weather may take down external power or network service. The resulting
fire, flood, or building damage may bring down your entire computer room or data center.
For real world examples, Rich Webster, Director of Operations at Zetta, recalls, While at Zetta, I've had a backhoe break one of the
network connections to my data center, a code upgrade to edge routers not go well, and PDU breakers at a hosting company fail.
DR Planning 101
Recovery Point Objective (RPO) and Recovery Time Objective (RTO)
The data that you want to regain availability to in a timely fashion is called the Recovery Point Objective (RPO).
How soon you want this data available again is called the Recovery Time Objective (RTO).
How much IT downtime for mission-critical applications and data is acceptable depends on many factors (notably cost), and will
vary from one company to the next -- but in general, acceptable downtime today is minutes-to-hours, compared to days to a week
or more from a decade ago.
www.zetta.net
Redundancy is one popular approach to avoiding or minimizing many IT disaster events. For example, servers, storage
and network gear can be configured with two power supplies, connected in turn to separate power sources. Servers,
firewalls, UPSs and other gear, even entire sites, can be duplicated. Network and electrical service can be supplied by two
separate utilities, on separate cables. Data can be stored across multiple hard drives.
DR Planning 101
Hosting Applications vs. Outsourcing
Another critical component of managing the risk of data disasters is assessing whether it's time to outsource any of your IT
applications and services, and move them to the cloud. At Zetta, says Webster, We had been running our own phone PBX and
email in-house, on-site. By moving these to cloud-based services, we now have redundant phone service centers via the cloud, so
sales and support stays up if we have an 'IT event' at our office.
www.zetta.net
DR Planning 101
Make DR Documentation Available
There's a lot of information associated with a disaster recovery plan. Contact information for vendors, your employees, utility
companies, and other organizations you may need to talk with. IT equipment inventories including serial numbers and warrantee
information, circuit IDs, building maps, etc.
Make sure that you have copies of this information that you can access even if all your regular IT and possibly the wired and
wireless phone networks is offline. Consider storing a copy online, and also keeping a secured copy on your smartphone, tablet,
or notebook, and on a flash drive.
Link
TechTarget
http://searchdisasterrecovery.techtarget.com/tip/Top-five-free-disaster-recovery-plan-templates
IBM
http://publib.boulder.ibm.com/iseries/v5r1/ic2924/index.htm?info/rzaj1/rzaj1sampleplan.htm
http://www.drp.msu.edu/documentation/stepbystepguide.htm
http://www.tamuct.edu/departments/informationtechnology/extras/ITDisasterRecoveryPlan.pdf
www.zetta.net
4 Disaster Recovery Options Local Storage, Mounted Web Drive, Web Browser and
Software Client
1. Local Recovery: Recover data from local storage over
the LAN.
2. Mounted Drive: Data backed up to Zetta can be
mounted as a drive letter, enabling full file-system
recoveries, as easily as accessing a network share.
3. Web-Based Recovery: If your data center has been damaged by a fire or tornado, for example, all that is needed
to retrieve lost data is a web browser. Simply login to you System Mangement Portal and browser the file system to
most critical files.
4. ZettaMirror: ZettaMirror is award-winning backup software that replicates data creating a second copy that is
always available online. This option is most common for Exchange & SQL data base recoveries.
If youre lucky and have fortified your IT infrastructure your company may escape major IT-impacting
disaster events.
During my 20 years in IT, I haven't yet had to invoke a full DR plan -- although I have come close, says
Webster. But I have had to invoke parts of my DR plan, about once a quarter. You have to be ready to do
some level of DR periodically.
CONTACT US
WWW.ZETTA.NET