Hope none of y'all were affected by this

ThePlanet.com is a huge series of data centers that host thousands of servers.  This was in just one of their Houston data centers and they also have DCs in the Dallas area.  Yesterday they had a transformer explode that took out 3 walls and power to more than 9,000 servers.  Fortunately none of the servers appeared to be physically damaged, but until power gets restored they're basically paperweights. 

It's a good reminder that even if your host keeps their own backups, you need to be keeping a set too. 

http://www.theregister.co.uk/2008/06/01/the_planet_houston_data_center_fire/
Quote
Fire at The Planet takes down thousands of websites

Faulty transformer blamed

A fire at The Planet's H1 data center in Houston, Texas on Saturday has taken out thousands of websites.

In messages posted on the web hosting firm's forum, the company blamed a faulty transformer for the fire. No servers or networking equipment were damaged, but the data centre remains without power, after The Planet shut down all generators "as instructed by the fire department.

About 9,000 servers and 7,500 customers are affected by the outage. B3ta.com, a popular British comedy site, is a high profile casualty.

The Planet is posting hourly updates on its forum. It has five other data centers, which remain up and running.



http://forums.theplanet.com/index.php?showtopic=90185
Quote
From Doug Erwin:
This evening at 4:55pm CDT in our H1 data center, electrical gear shorted, creating an explosion and fire that knocked down three walls surrounding our electrical equipment room. Thankfully, no one was injured. In addition, no customer servers were damaged or lost.

We have just been allowed into the building to physically inspect the damage. Early indications are that the short was in a high-volume wire conduit. We were not allowed to activate our backup generator plan based on instructions from the fire department.

This is a significant outage, impacting approximately 9,000 servers and 7,500 customers. All members of our support team are in, and all vendors who supply us with data center equipment are on site. Our initial assessment, although early, points to being able to have some service restored by mid-afternoon on Sunday. Rest assured we are working around the clock.

We are in the process of communicating with all affected customers. we are planning to post updates every hour via our forum and in our customer portal. Our interactive voice response system is updating customers as well.

There is no impact in any of our other five data centers.

I am sorry that this accident has occurred and I apologize for the impact.
Logged


I am a moderator for the support board for a huge forum hosting service, and their servers are run by ThePlanet. They have over 80 servers and they're slowly coming back online. Lots of "why isn't my board running" type questions from the members. It's kept the moderators quite busy yesterday and today.

As you said, a good reminder.
Logged

Comments and Harsh Critiques gladly accepted. My photos are ok to edit.

My photos and art: http://wildmaven.org

It looks like there are a couple of big free forum providers that were affected. 

We were about 10 minutes away from being affected.  I was just about to sign up with ThePlanet before giving one more look around when I found the host I'm using.  Not that it couldn't happen at the data center my server is in, but I'm sure glad it didn't. 
Logged


One of the servers looks to be completely unrecoverable! Tongue
Logged

Comments and Harsh Critiques gladly accepted. My photos are ok to edit.

My photos and art: http://wildmaven.org

Ouch
Logged

-John
Sarcasm, frustrating the clueless since 3000 b.c.

Quote
One of the servers looks to be completely unrecoverable!
One of yours or just one of the 9,000?  I'm guessing one of yours, although I'm hoping that's not what you meant. 

Reading about this has made me go back and look at my backup procedures.  While I'm backing up everything daily, it's not in a format that I can get back up quickly.  Had my server been affected by this it would take me at a minimum several hours, and more likely a few days, to get everything back online after the server became available. 

I spent several hours yesterday righting a script to sync backups from my web server to my remote backup server.  WHM (the server software that runs on cPanel servers) makes this fairly easy by including a script that will copy everything from an account to a single gzipped file that can then be uploaded to any other cPanel server.  It's made for moving accounts from server to server, but it works really well as a backup.  The problem is that there wasn't an easy way to have it do every account on the server and gzipped files don't synchronize very well and I had to copy the entire file every time.   My server only has about 4.5gb of stuff on it right now, but wanting to backup every night that would be almost 10% of my monthly bandwidth allotment just for the backups.  I did find a way around both of these problems though. 
Logged