In-Depth
ATA-Class Storage -- The End of Tape Backups?
DRBC #4: With a solid DRBC plan in place, it might be time to consider replacing your backup plan with more reliable technology that doesn't call for the janitor at a remote site to occasionally change tapes.
- By Bill Heldman
- 06/27/2006
Hey buddy, do you happen to have a floppy? What if someone asked you
such a ridiculous question today? It's absurd, isn't it? You'd probably
reply back: "Where's your flash drive pal? Floppies are so 20th century!"
The same is true of enterprise-class tape backup systems -- only on a
vastly larger scale.In the old days of DRBC we used tape backup devices
and racks upon racks of backup tapes in order to protect our enterprise.
In one (fairly current) job I was in, the operations department still
used five 25-year-old Storage Tek tape backup units. On a monthly basis
they would routinely rotate 5500 tapes. All this for a little over a gigabyte
of mainframe storage! One tape would be used for a little 1MB file, another
for a few hundred kilobytes and so on.
Job security, yes. Practicality, absolutely not.
Think of all of the little one-off 4mm, 8mm, DAT and LDAT tape systems
that shipped with the servers you or someone before you ordered. Administrators
everywhere are still chasing after those tapes! And, people are still
buying robotic tape changing machines to routinely replace the LDAT tapes
in the backup system -- expensive tapes they've procured over the years.
Here's the million dollar DRBC question: Are your tapes routinely going
offsite, using some sort of rotation system? Answer: Doubtful. The sad
truth is that the majority of organizations have elaborate tape back-up
systems but do nothing with the tapes once they've recorded backups onto
them. There is no routine "ship them offsite" mechanism. It
is quite likely that the tapes sit on shelves or in tape containers in
a back room and are not thought of again until the next time they're needed.
Is this good DRBC practice? Of course you know the answer to that. Absolutely
not!
Accident
Waiting to Happen |
It is not unreasonable to say that many administrators
may have never even run a restoration test on their
tape backups to see if they can actually restore a file
from tape.
This happened to me one time. I was using Arcserve
over Windows NT (yes, I'm dating myself). I saw in the
tape backup administration software that I had successfully
backed up files -- at least I thought I had. But I never
practiced actually restoring the files to make sure
that I could do it -- or at the least, to work out the
details of exactly how to restore the files, should
I need to. Instead, I just naively assumed that since
I'd validated the files were out there on tape, they
were ripe for the taking.
Not so! On the fateful day when I actually needed to
perform a SQL Server database restoration, guess what?
Even though the tape back-up software said the files
were available, there was no way I could persuade the
restoration to actually do its thing. The result? Many
hours of lost time recreating the database. Fortunately,
there wasn't much data in it yet. What if it had been
loaded with data? I would've probably been let go.
|
|
|
"But", you say, "I can't give up the incredibly expensive
investment I have in my tape back-up units. And what about the back-up
software I use? That software cost us an arm and a leg! You've gone too
far. You're talking a major capital investment in software and hardware,
not to mention an Operations and Maintenance investment in professional
services to get it installed and operational."
True. You do have a capital investment in your tape back-up gear. But,
realistically speaking, how long has it been in service? If the answer
is two years or more, odds are you are close to the end of the depreciation
cycle for that equipment and it's time to venture into greener pastures.
You should begin to think of purchasing ATA-class disk array equipment
to replace your backup units. We'll talk more about these devices later
on.
The software you'll likely be able to keep, as it will probably be able
to converse with today's ATA disk array equipment. And if not, there is
software available that will make it easy for you to transport data from
your servers to the disk array without using tape backup software. But
for the time being keep "I'll probably be able to leverage my old
tape back-up software across the new ATA-class disk arrays" in the
back of your mind for later.
The bottom line is that the new breed of ATA-class storage arrays are
relatively inexpensive, they have awesome companion software available
for them, they are manufactured by world-class manufacturers, and they
provide you with a safe way to produce backups and restorations much more
quickly and reliably than before. Plus, they juxtapose nicely with DRBC
plans.
You want ATA-class back-up operations because the speed with which you
can do your backups is no longer limited by the groggy speed of the tapes.
Now your primary bottleneck will be your network bandwidth, and we all
know that today's networks are usually quite robust.
Used to be that performing a backup was an overnight thing. Now you can
plan your backups to fit your daytime schedule. Restores were a problem
because they required navigating the software until you found the volume
you wanted, then actually finding the tape or tapes that you needed in
order to retrieve the data. This slow, painful operation is now quick
and much easier than before.
The Gear
Let's talk about some of the manufacturers -- folks you may have
already dealt with. You may even possess some of their equipment. Please
note that I've just listed three of my favorite players. There are many
others to choose from.
Network Appliance -- This
company has an amazing lineup of SAN and NAS gear for almost any undertaking.
This company's products work well, and the company stands behind them.
The NearStore Virtual Tape Library (VTL) is a disk-to-disk SATA-based
backup appliance that appears as a tape library to your backup software.
Available in capacities from 4.5 Terabytes (TB) to 168TB, there is plenty
of room for you to back up old data and get it off of your servers. The
VTL is visible by today's enterprise-class backup products such as Legato,
Symantec, HP, et al.
If you've been hankering to get rid of your clunky old tape backup software
program, you can combine the VTL with NetApp's SnapVault software. This
combination immediately pops you into a modern disk-based backup paradigm
that allows you to reach out and quickly back up anything on the net.
NetApp also has other enterprise-class SAN and NAS arrays that can be
utilized in conjunction with the NearStore. Architecturally, you can probably
envision SAN arrays sitting at your organization's main points of operation,
fanning out offsite to one or more NearStores that pull in data from the
enterprise and store it out of harm's way, all over a fast wire. No more
hauling tapes around (which few of us did anyway), or worrying about Father,
Son, Grandfather or Towers of Hanoi backup strategies.
LeftHand Networks --
LeftHand is in the business of iSCSI SANs. Because they're iSCSI, you
don't have to mess around with complicated Fibre Channel connections,
and the equipment is thus less expensive to operate. Talk to the big players
like EMC and you might hear stories about iSCSI not being standardized
across vendors -- scary interoperability stories. That's because the big
players have a huge investment in FC arrays. But the truth is, iSCSI works
as a standard and LeftHand has been shipping gear for a long time now.
LeftHand has some software called "SAN iQ/Snap". The key word
here is "snap", meaning, more or less, a snapshot of the data
that you want to back up. The idea is straightforward: You designate the
files that you want to snap and allow iQ/Snap to do its work. Note that
these files are on the SAN, not necessarily on a local server. Once the
snap has been taken, it's an easy thing to move it to another device out
of harm's way. Note that iQ/Snap will work with some implementations of
Windows-based backup software.
Snaps typically require a dedicated piece of space on the SAN in which
to hold the snap. Not so in LeftHand's case, which can place the snapshot
copy elsewhere.
LeftHand is a Microsoft Certified Partner, and this shows up in their
products. Note that alongside iQ/Snap, LeftHand has an implementation
of Microsoft's Volume Shadow Copy Service (VSS), allowing your server
VSS implementations to initiate and manage snaps from the LeftHand SAN,
ostensibly for movement elsewhere. Additionally, LeftHand has leveraged
Microsoft's Multipath I/O (MPIO) framework to optimize the ability of
a Windows Server 2003 server to work with SAN data. LeftHand's SAN iQ/Remote
Copy software allows you to pick up data from remote locations, copying
it to the SAN for safekeeping. This would be tremendously useful in those
locations where you had no administrator and wanted to be sure you were
adequately and accurately backing up the site.
It seems to me that the LeftHand implementations are proprietary, single-stop-shopping
environments. This might mean extra effort in getting a LeftHand device
to "play nicely in the sandbox" with another device on the network.
Be that as it may, the upsides are the cost-effectiveness of the software
and hardware -- all wrapped up in an iSCSI framework.
All
in a Day's Work |
In another not so long ago job, we had a little over
50 geographically disparate sites -- single offices,
really -- with just one or two people in them. We had
locations in Chicago, Atlanta, Houston and so forth.
Each site had a server, with a little tape drive in
it, and no dedicated administrator. Instead, we had
the sites broken out into four or five regions, each
of which had assigned to it what I'd basically call
a PC Technician on steroids. This person would routinely
fly around to all of his sites, solving user problems,
helping prepare reports, and, most importantly, managing
those all-important backups.
You can imagine how easily the backups did not happen
once he left the office and things settled down
for a week or two. This PC Technician was expecting
the office manager, or the administrative assistant
(in one case, the janitor) to rotate the nightly tapes.
You can guess how effective the process was.
We accessed the servers remotely with Microsoft Systems
Management Server Remote Tools (this was a year or two
before RDP was available). One time I accessed a site
and was startled to see the banner of a CD-based game
on the screen! Evidently the office manager decided
he wanted to install this game to see how it worked
and left the CD in the caddy at the installation banner.
Fortunately, he or she did not get the opportunity to
go through with the installation, but it was freaky
to think that someone hired to sell product was able
to nonchalantly walk into a server closet and insert
a CD.
Now imagine that you have a snap copy software product
that can reach out and touch these volumes, quickly
grabbing a snapshot of each for storage on the disk
array. How much easier would it be to restore the server
if that manager had, for instance, gone through with
the game installation and caused a BSOD on the machine?
|
|
|
EMC -- One of my favorite companies
is EMC. I take a ribbing from a lot of people; I suppose because EMC is
like Microsoft, one of those companies everyone loves to hate. But my
experience has been that all EMC people I've been in contact with -- salespersons,
technical reps, installers and service personnel -- have been top-grade.
I like the products, and I like the way I'm treated when I'm an EMC customer.
EMC's ATA-class box is called the Clariion. The flagship backup and restoration
appliance is currently the Clariion disk library series. Using a device
in this series allows you guaranteed interoperability with other elements
of your network that need backed up, and provides you with a path from
Small to Medium Business (SMB), all the way up to large-scale enterprises.
EMC's software for backups is called EMC Networker. There is a Networker
module for most enterprise ERP, database and email implementations.
EMC has an Insignia line of software and hardware designed for SMBs.
So How Do I Start?
Clearly, these are just three examples in a universe of vendors
who are in the backup-to-disk market. The technology is tried and proven.
It works, and it gives time back to administrators -- most of whom typically
spend their days as though their hair is on fire. It is time to consider,
architect, plan and implement a backup-to-disk scenario for your DRBC
plan.
So how do you start? Well, I think the best place is at the top, with
the CIO. You need to understand four basic things in order to prepare
a document that's going to get you what you want:
Outline the need with statistics -- Most administrators try to
sell their managers on products by simply insisting that this is what
they absolutely must have to continue operation. Folks, let me be frank.
Your CIO didn't just fall off of a load of pumpkins. She knows that up
till now, you've gotten along just fine without all the latest and greatest
software and hardware. And, she has a variety of hands out, all begging
for funds.
What you need to do is document the time and money that is wasted using
current methodologies. You can show how things have gotten busier over
time -- where once the situation might've been OK, now it is a nightmare.
You must illustrate to your CIO how the situation has gotten out of hand
so that she clearly understands the problem you face.
Understand and talk about capitalization -- Once you've prepared
a sound problem statement -- one that is going to get noticed and get
some airplay -- the next thing to do is to make sure that you understand
how big equipment is capitalized. You must understand financing, depreciation,
the projected length of operation of the new equipment, the project plan
needed to implement the system, and any additional costs that are needed
to install it.
You want to be able to illustrate to your CIO why it's OK to bag those
old tape back-up devices, and how you propose financing the new backup-to-disk
arrays and software. You want to tell her how long you anticipate keeping
the devices, because this denotes the depreciation value of the equipment.
Also, it would be wise to point out anticipated Mean Time Between Failures
(MTBF), as this illustrates any expected downtime you might encounter.
By simply talking in capitalization terms, your CIO grasps that you basically
understand what she is up against when it comes to budget planning and
spending.
Understand and talk about O&M -- Big-league gear like this
doesn't come as a standalone box. There are ongoing maintenance costs
you have to pay for things like 24x7x365 monitoring, as well as software
upgrades. You should be able to talk about the operations costs -- those
people in your shop that will actively monitor and maintain the equipment,
so that she understands what today's gear looks like as contrasted with
what you're proposing. It could be that full-time employees (FTE) that
are currently working backup operations could be redeployed elsewhere,
thanks to the deployment of the new solution.
By putting things into a dollars and cents mentality, it will help you
rationalize the expenditure you're proposing. She will realize that you
are trying to think like she thinks, and not just demanding that you must
have the latest and greatest thing.
Risk assessment -- You also want to provide a full risk assessment.
You start by outlining the risks that the organization faces today, given
the current situation. Then you move into how the new implementation will
reduce those risks. You should also talk about the risks you expect to
encounter with the new proposed solution. Don't be a "pie-in-the-sky-guy".
There will be risks that arise with the new system. You should spend some
time thinking about what they are and documenting a mitigation strategy.
Note how this new solution fits into your over all DRBC architecture
-- By this time, you've doubtless been dialoging with your CIO regarding
DRBC. Your segue moment comes when you can show how the current tape backup
paradigm really isn't DRBC -- if anything it's a loose fit disguised as
DRBC. And you can point out how the new system will map very nicely into
your DRBC architecture because one or more of the disk arrays will live
elsewhere, away from headquarters.
When you're done preparing your presentation, schedule an appointment,
meet with your CIO and other dignitaries and make your case. You start
with why the current situation is no longer palpable, given the current
infrastructure. Next you go into the research that you did with regard
to the latest solutions -- including a discussion of acceptable industry
standards. Finally, you introduce your proposal, including your risk assessment
and end with the proposed cost of the new solution, along with financing
plans, and DRBC architecture fit.
Then you ask for the dough.
Risky
Business |
In
my second article in this series, I talked about
Risk Management, and said that it consists of: Analysis,
Assessment and Mitigation. What I neglected to mention
is that one form of Mitigation is something we call
"Risk Transfer."
What this means is that you recognize a risk and decide
you won't fool with it. You'll outsource the problem
to someone else.
Example: You determine that you need a DRBC
offsite location. After assessing build-out costs and
all of the other headaches, you determine that your
friendly local offsite vendor could just as easily handle
the load (for a fee, of course), offloading you from
the risk of maintaining a DRBC offsite location.
Risk Transfer is a very viable choice to make when
going through a risk assessment methodology. Don't rule
it out at any stage, any risk.
|
|
|
Let's Rewind
The point to this is that if you have not already replaced your current
tape backup system, why not? There is literally no need for such gear
in the enterprise any longer. Disks are fast and reliable, tapes are not.
You can take snapshots all day long, and when you are finally ready to
get rid of very old data, you can simply burn it to DVD for safe, long-term
storage.