AcronymFilter: SATA RAID on PCI. I had one, now it's broken.
I've been needing more space for all my layered photoshop files, so I picked up a couple of Seagate SATA 300G drives. I already have a couple of 300s on the motherboard SATA ports, so I purchased a Belkin PCI card to give me two more ports. (while I was at it, I bought a case with a lot more power, so that's not the problem.) This setup worked flawlessly for a little over two weeks - heck, the RAID worked before I even ran the driver install.
Two nights ago, I left the computer running some Photoshop scripts (layered, masked sharpening) on giant stitched files, and woke up to the computer off, and refusing to reboot. After the safe mode prompt, the list of system files being loaded stopped at mup.sys, and if I tried to restart windows normally, it would start to fade into the Windows XP splash screen with progress bar, then hang.
Not having time for bullshit yesterday morning, I poked around during work and found something online that suggested I remove my wireless mouse dongle. When I got home, I went into BIOS, and all my IDE drives except the boot drive had been un-recognized, so I got those back, and booted the machine. The belkin RAID text-screen then told me that one of the drives was offline, so my RAID was incomplete. I continued, and pulled the mouse dongle after the safe mode selector screen, and after a minute or two, the logon screen showed up. I quickly put in a PS2 mouse, and was able to run windows normally.
The raid log says one of the two drives took a nap at 1:30 in the am (about what time I expected the scripts to end). I turned off the machine, removed the PCI RAID control from the slot, and the damn thing works as it did two weeks ago, before I added the drives. Can anyone suggest either 1) a way to make the SATA card I've got work, or 2) a PCI RAID SATA card that won't send me into conniptions?
If you hooked up the drives as a RAID 0 array, for performance reasons, you were alternately striping data first to one drive, then the other. The failure of any drive in a RAID 0 set breaks the set, and data is unrecoverable. RAID 0 is never worth using for data storage, since it cuts reliability significantly. And if you are using 32 bit Windows XP, you can't get the performance benefits of even RAID 0, due to internal limitations of Windows storage drivers.
If you hooked up the drives and put them in RAID 1, you should be able to recover all the data you had, as each drive contained a mirror copy of the contents of the other. You just need to put the Belkin card back in, and use its utilities to break the RAID set back to individual volumes, then replace the failed drive, and re-RAID the set.
It's also possible the Belkin card supports compound RAID levels for 2 drive sets, such as RAID 0+1 or RAID 10 (1+0), but if you don't understand RAID, it doesn't seem probable you would have set up those advanced modes.
posted by paulsc at 11:59 AM on March 21, 2007