Tag Archive: carbonite

More On the Carbonite Backup Failures

David Friend, the CEO of Carbonite, left a comment on my blog entry about Carbonite.  I’d like to applaud his efforts for taking the time to do that, but the comment raised some ugly questions.  It appears that they were putting data on 15-drive RAID 5 arrays.  RAID 5 is the most cost effective array setup (other than RAID 0, which offers no data protection).

RAID 5 will only tolerate a single drive failure – if you lose more than one drive in an array, the whole array is gone and must be restored from backup.  As SATA drives grow larger and larger in capacity, they take longer and longer to rebuild when one goes bad, because so much data must be copied over to the new drive.  If a second drive fails while the first one is being rebuilt, you’re completely out of luck, and must restore from backup.  The more drives you add in a RAID 5 array, the riskier it gets, because it’s more likely that one of the drives will fail in the time span of the rebuild.  That’s pretty dangerous for a company that makes its living off your backups being available.

Worse, it isn’t clear from the interviews I’ve seen that Carbonite actually backed up your data somewhere other than those RAID 5 arrays.  It appears that their attitude towards backup was that YOUR machine held THEIR backup: when they ran into problems with their RAID arrays, Friend commented that:

“Carbonite automatically restarted all 7,500 backups…”

Meaning, they started getting data again from their clients’ machines, not restored them from tapes or other arrays.  This is further evidenced by interviews that the Enterprise Storage Group conducted with Friend last year before the lawsuit came out, as blogged by Steve Duplessie of the Enterprise Storage Group. When asked how Carbonite protected their backups, Friend replied that:

“This is backup – not archiving, so if that ever happened you’d still have your data on your PC.”

Like Bryan Oliver says, the only reason we do backups is to do restores.  If the answer to restore problems is to use your live server, that’s a failure.  Carbonite didn’t protect against regional internet outages, either:

“Regional internet outages (we use multiple redundant carriers) would take us offline if they all failed. But again, unless you were in the middle of a restore when it happened, you’d probably never notice.”

Translation: if you aren’t using our services, you’ll never notice when we’re down.  How about replication from one site to another?

“…we don’t replicate data across multiple sites. The likelihood of losing data because of software bugs or human error is probably orders of magnitude greater.”

Errr, not sure why he’d say that, since Carbonite wasn’t protecting against data loss due to bugs either.

I can see where Carbonite’s coming from: they view themselves as a cheap way to protect data, and it works most of the time.  That’s the same approach I take with Amazon S3 cloud storage for my personal backups, incidentally: for a few bucks a month, it’s a cheap insurance policy.  I have my data replicated across my laptop, my Time Machine external hard drive, and my VMware server.  If there’s a house fire, my stuff might be on Amazon S3.  It also might not.  Amazon hasn’t made me any promises about the safety and security of my data.

My problem with Carbonite’s approach is that they seemed to take my personal data backups even less seriously than I do, maintaining a single copy in a single place.  For all I know, maybe Amazon’s using that exact same approach.  Only time and lawsuits will tell.

For more commentary on the Carbonite problems, check out Steve’s blog post, “Head in the Cloud? Or Just up your……..?” (And yes, he apparently stole that from Wilbur.)

Brent Ozar

Brent specializes in performance tuning for SQL Server, VMware, and storage. He's one of the very few Microsoft Certified Masters of SQL Server, a published author, and a Microsoft MVP. He likes travel, Jeeps, Apple gear, jokes, and writing about himself in the third person. Read more and contact Brent.

Website - Twitter - Facebook - More Posts

Another backup failure: Carbonite

TechCrunch reports that Carbonite, an online backup company, lost customer data.

But wait, this is different: it’s not their fault.  They’re suing Promise Technology, makers of popular storage gear, for selling them bogus equipment.  Bogus equipment?  You mean, like hard drives that fail?  That’s horrible!  Who could expect something like that?  Who could know about the dangers that lurk around every corner?

The Statistics are Staggering Alright

The Statistics are Staggering Alright

Carbonite’s web site warns, “You need to be aware that losing your most valuable files is a very real possibility.  You need to take proper precautions.”

Who knew they were referring to their own services?

Don’t point and laugh and say it could never happen to you because you do your own backups in-house, because I’ve seen too many backup strategies fail for too many reasons.  For the love of your own job, never mind your company’s revenue stream, take some time this week to:

  • Automate your backup testing – build a set of T-SQL scripts to automatically restore your production databases onto another server.  Restore a different server every day onto the same target testbed box.
  • Test your backups manually – if you don’t have the time to script the tests, just go run a restore of your largest backup.  Ideally, check the ones that hit tape, because those are the most risky.
  • Check every server’s job logs – I’ve seen so many cases where backups stopped working on a SQL Server, and alerting had also long ago stopped alerting.  These two failures are a 1-2 punch to the jaw of your career.
  • Find your single points of failure – if you’re relying on a single cloud vendor for all of your data protection, that’s a risk.  If you’re backing up straight to tape and you’ve only got one tape jukebox in-house, that’s a risk.
  • Figure out who you’re going to sue – because hey, work is hard.  If you can’t do it right, get rich trying.

Brent Ozar

Brent specializes in performance tuning for SQL Server, VMware, and storage. He's one of the very few Microsoft Certified Masters of SQL Server, a published author, and a Microsoft MVP. He likes travel, Jeeps, Apple gear, jokes, and writing about himself in the third person. Read more and contact Brent.

Website - Twitter - Facebook - More Posts