Friday, June 26, 2009
« CSLA .NET for Silverlight video series i... | Main | If you can’t reach me… »

I’ve had quite the experience over the past couple weeks.

Three weeks ago I was in Las Vegas speaking at VS Live. While there, I realized I’d forgotten to copy some key files to my laptop before leaving home, but Windows Home Server made that a non-issue, since it provides a secure web interface to my files. Awesome!

Then I got home and discovered that one of the two additional hard drives I added to my WHS machine was failing. This was unpleasant, but not cause for alarm since all my key files are set to duplicate.

(I only discovered the failure because WHS started crashing, and I looked in the Windows system event log to find the drive failure notifications – they’d been occurring for several days, but I don’t check my system event log daily, so I didn’t know – this is the one place where WHS really let me down – I still don’t know why Windows knew the drives were going to fail, but WHS blindly ignored this clear intelligence…)

Unfortunately I couldn’t get WHS to dismount (remove) the failing hard drive. After 3-4 tries, it finally did remove the drive. This took 2.5 days, since each failure took 12-24 hours, as did the final success.

I should also note that I was under serious time pressure, because I was flying out to Norway for the NDC conference and only had about 3.5 days to solve the problem!

After the failed drive was removed, things were obviously not right on the WHS machine. Clearly the remove didn’t work right or something. Poking around a bit further, I found that the second additional hard drive was also failing. What are the odds of two drives failing at once? Small, but yet there I was.

I quickly bought and installed a brand new hard drive (Seagate this time, since the dual failures were Western Digitals) and tried to remove the second failing drive. The attempt was still running when I flew to Norway.

Fortunately Live Mesh allowed me to use remote desktop to get back into my network, and I kept trying to remove the drive (failure after failure) while in Norway.

When I returned from Norway I manually removed the drive. Clearly it wasn’t going to remove through software. I can’t say this made matters worse, but it sure didn’t make them better either. Now WHS still wouldn’t remove the drive even though it was shown as “missing”. It had “locked files” and couldn’t be removed.

Thanks to some excellent help from the Microsoft WHS forum (thanks Ken!) I came to realize that my only option at this point was to repair the WHS OS – basically do a reinstall. I have the cute little HP appliance, and it comes with a server restore disk – pop it into my desktop machine, run the wizard and in very little time I had my server back, just like when I bought it originally.

OK, so now I have a functioning WHS again, but it is empty, blank – all my data is gone!

I’ve been here before (a couple times) with other servers though, so I have backups for my backups. All “critical” data is always in 3 places. So I just restored my server backup and got back my “critical” files – everything for my work, all the family photos and home videos of the kids, etc.

Here’s the catch though – I rapidly discovered that my “non-critical” data is actually pretty critical. Things like music, videos and miscellaneous files.

The music I was able to recover from a Zune device. I tried my Zune device, but that was a mistake. As soon as I connected it to my desktop machine it synced – and it discovered I’d “deleted” all my music and so it cleared the device. Damn!

Fortunately my son also has a fully-synced Zune, and I connected his to my desktop machine as a guest. No automatic sync, and so I was able to highlight all music on his device and say “copy to my collection”. Just like that all our music was back on the server.

I still don’t have any videos or miscellaneous files. They are gone. Arguably this isn’t the end of the world, as technically I can get back anything that really matters – by re-downloading, or getting files from friends, etc. But that’s all a pain in the butt and a waste of time, so it is unfortunate.

(it might be that I can recover some of them from the two defunct hard drives – using various data recovery tools I may be able to connect them to my desktop machine and retrieve some of the files – but that’s also a big hassle and may not be worth the effort)

So what did I learn out of all this?

  1. WHS is awesome, and I still really love it
  2. WHS can’t handle two hard drives failing at once – if that happens you better have a backup for your server
  3. “Critical” files include things that aren’t really critical like music and maybe videos – external hard drives to backup the server are relatively cheap – just get a 2 TB external drive and back up everything – that’s my new motto!

Oh, and I’m now using IDrive to get offsite backups for my truly critical files. I know, I didn’t need it in this case, but the whole experience got me thinking about floods, tornadoes, fire, etc. What if I did lose my family photos or home videos? The last 15 years of my life is digital, and nearly all record would be lost in such a case. Having automatic backups of that data, along with other important documents and files seems really wise.

So now my super-critical files are in at least 4 places (one offsite). My critical files (using my newly expanded definition) are in at least 3 places. And my non-critical files are in 2 places. I’m so redundant I’m starting to feel like NASA :)

Friday, June 26, 2009 3:37:24 PM (Central Standard Time, UTC-06:00)  #    Disclaimer  |  Comments [9]  | 
Saturday, June 27, 2009 4:50:58 AM (Central Standard Time, UTC-06:00)
You are anything but redundant Rocky, just ask the CSLA.Net community !

;-)
Steve Roberts
Saturday, June 27, 2009 8:12:08 AM (Central Standard Time, UTC-06:00)
I feel for you! Failed HDDs (or even ones just removed from the OS by surprise) can be a right pain, and yes failing HDD notification is far too well hidden!

I've had a few hard drives fail on me over the years as well, always WD drives - so I no longer purchase WD HDDs! I've found the Samsung drives to be really good and also very quiet which is great for a machine running 24x7 at home. I also don't purchase 2 drives at the same time if their for a mirror type config, if their from the same batch they could well have similar faults and fail at the same time.

I'm using a combination of TK8 and JungleDisk to send my critical files to the cloud (due to my stuff being many small files TK8 is good to create a single file which costs less to send), this is working out well for me and not to expensive and much easier than trying to remember to find a blank DVD, label it, store it safely and so on.
Saturday, June 27, 2009 8:17:07 AM (Central Standard Time, UTC-06:00)
Also...

I wasn't able to submit my comments using FireFox (3.0.11) - I kept getting "Please enter the code shown" errors (yes I was entering the right code!) so I switch to IE8 and it worked fine (with the exception of a JavaScript error "Line: 1 Error: 'coco' is undefined" when comments are shown).
Monday, June 29, 2009 4:03:27 PM (Central Standard Time, UTC-06:00)
Bummer! I have to admit I did get a chuckle out of the comedy of errors unfolding as you deleted your Zune archive in a sync. I've done something similar before. Ouch! I'm glad to hear you were able to recover using another device.

I've been seriously considering buying a WHS box because I've amassed a ton of critical data in the modern internet age. Since online backup space is still rather expensive, I'm considering cycling a synced drive that I could bring to work with me every week to achieve an off-site backup that isn't too old.

...btw I'm having the same Firefox issue as Stephen.
Tuesday, June 30, 2009 10:34:52 AM (Central Standard Time, UTC-06:00)
Now is the time to go out and buy a copy of Ghost and a big external HDD :-) I've always found bit for bit backup / restore to work far better than trying to restore just the "important files" because you will always miss something and it takes ages to configure all the many settings to get the OS back to just the way you want it anyway!
Simon Geering
Thursday, July 02, 2009 1:48:49 PM (Central Standard Time, UTC-06:00)
Rocky,

You mention that you were able to use Live Mesh to get into your network/RDP, can you shed some light on how you added the WHS to the Mesh? I've seen some articles on the web, but you have first hand experince and I sort of know you...

Regards,
Stephen
Stephen Patten
Friday, July 10, 2009 12:08:56 PM (Central Standard Time, UTC-06:00)
My WHS is not on the Mesh. My desktop is on the Mesh, and I used Live Mesh to RDP back into my desktop. From the desktop I have access to my network, including WHS and my other servers, etc.
Thursday, July 23, 2009 8:03:17 AM (Central Standard Time, UTC-06:00)
I recently came across your blog and have been reading along. I thought I would leave my first comment. I don't know what to say except that I have enjoyed reading. Nice blog. I will keep visiting this blog very often.

Susan

http://8080proxy.com
Thursday, July 30, 2009 11:01:56 AM (Central Standard Time, UTC-06:00)
Excuse me. The louder he talked of his honor, the faster we counted our spoons. Help me! Need information about: Estate ragazzi torino. I found only this - dirty sexy money. Resurgent auto finance, llc provides a full range of risk management and cash flow alternatives to the auto lending and consumer credit granting industry. Columbia credit union offers competitively priced business mastercard platinum plus credit cards through fia card services. With respect :cool:, Iain from Islands.
Comments are closed.