mdadm -A /dev/md0 /dev/sd{a,b,c,e,f}1
mdadm: /dev/md0 assembled from 3 drives - not enough to start the array.
Then I ran:
mdadm --examine /dev/sd*1And receive the following output:
/dev/sda1:
Magic : a92b4efc
Version : 0.90.00
UUID : 1ef7f14c:daf5731e:26644315:e48bd8ad
Creation Time : Sun Sep 5 21:10:15 2010
Raid Level : raid5
Used Dev Size : 1953511936 (1863.01 GiB 2000.40 GB)
Array Size : 7814047744 (7452.06 GiB 8001.58 GB)
Raid Devices : 5
Total Devices : 4
Preferred Minor : 0
Update Time : Sun May 20 21:30:36 2012
State : clean
Active Devices : 4
Working Devices : 4
Failed Devices : 1
Spare Devices : 0
Checksum : beba01c0 - correct
Events : 65642
Layout : left-symmetric
Chunk Size : 64K
Number Major Minor RaidDevice State
this 1 8 33 1 active sync /dev/sdc1
0 0 8 17 0 active sync /dev/sdb1
1 1 8 33 1 active sync /dev/sdc1
2 2 0 0 2 faulty removed
3 3 8 1 3 active sync /dev/sda1
4 4 8 65 4 active sync /dev/sde1
/dev/sdb1:
Magic : a92b4efc
Version : 0.90.00
UUID : 1ef7f14c:daf5731e:26644315:e48bd8ad
Creation Time : Sun Sep 5 21:10:15 2010
Raid Level : raid5
Used Dev Size : 1953511936 (1863.01 GiB 2000.40 GB)
Array Size : 7814047744 (7452.06 GiB 8001.58 GB)
Raid Devices : 5
Total Devices : 5
Preferred Minor : 0
Update Time : Fri Feb 10 18:55:25 2012
State : clean
Active Devices : 5
Working Devices : 5
Failed Devices : 0
Spare Devices : 0
Checksum : be35dd2b - correct
Events : 58402
Layout : left-symmetric
Chunk Size : 64K
Number Major Minor RaidDevice State
this 2 8 49 2 active sync /dev/sdd1
0 0 8 17 0 active sync /dev/sdb1
1 1 8 33 1 active sync /dev/sdc1
2 2 8 49 2 active sync /dev/sdd1
3 3 8 1 3 active sync /dev/sda1
4 4 8 65 4 active sync /dev/sde1
/dev/sdc1:
Magic : a92b4efc
Version : 0.90.00
UUID : 1ef7f14c:daf5731e:26644315:e48bd8ad
Creation Time : Sun Sep 5 21:10:15 2010
Raid Level : raid5
Used Dev Size : 1953511936 (1863.01 GiB 2000.40 GB)
Array Size : 7814047744 (7452.06 GiB 8001.58 GB)
Raid Devices : 5
Total Devices : 4
Preferred Minor : 0
Update Time : Mon May 21 00:16:09 2012
State : clean
Active Devices : 3
Working Devices : 3
Failed Devices : 2
Spare Devices : 0
Checksum : beba28c8 - correct
Events : 65646
Layout : left-symmetric
Chunk Size : 64K
Number Major Minor RaidDevice State
this 4 8 65 4 active sync /dev/sde1
0 0 8 17 0 active sync /dev/sdb1
1 1 0 0 1 faulty removed
2 2 0 0 2 faulty removed
3 3 8 1 3 active sync /dev/sda1
4 4 8 65 4 active sync /dev/sde1
mdadm: No md superblock detected on /dev/sdd1.
/dev/sde1:
Magic : a92b4efc
Version : 0.90.00
UUID : 1ef7f14c:daf5731e:26644315:e48bd8ad
Creation Time : Sun Sep 5 21:10:15 2010
Raid Level : raid5
Used Dev Size : 1953511936 (1863.01 GiB 2000.40 GB)
Array Size : 7814047744 (7452.06 GiB 8001.58 GB)
Raid Devices : 5
Total Devices : 4
Preferred Minor : 0
Update Time : Mon May 21 00:16:09 2012
State : clean
Active Devices : 3
Working Devices : 3
Failed Devices : 2
Spare Devices : 0
Checksum : beba2886 - correct
Events : 65646
Layout : left-symmetric
Chunk Size : 64K
Number Major Minor RaidDevice State
this 3 8 1 3 active sync /dev/sda1
0 0 8 17 0 active sync /dev/sdb1
1 1 0 0 1 faulty removed
2 2 0 0 2 faulty removed
3 3 8 1 3 active sync /dev/sda1
4 4 8 65 4 active sync /dev/sde1
/dev/sdf1:
Magic : a92b4efc
Version : 0.90.00
UUID : 1ef7f14c:daf5731e:26644315:e48bd8ad
Creation Time : Sun Sep 5 21:10:15 2010
Raid Level : raid5
Used Dev Size : 1953511936 (1863.01 GiB 2000.40 GB)
Array Size : 7814047744 (7452.06 GiB 8001.58 GB)
Raid Devices : 5
Total Devices : 4
Preferred Minor : 0
Update Time : Mon May 21 00:16:09 2012
State : clean
Active Devices : 3
Working Devices : 3
Failed Devices : 2
Spare Devices : 0
Checksum : beba2890 - correct
Events : 65646
Layout : left-symmetric
Chunk Size : 64K
Number Major Minor RaidDevice State
this 0 8 17 0 active sync /dev/sdb1
0 0 8 17 0 active sync /dev/sdb1
1 1 0 0 1 faulty removed
2 2 0 0 2 faulty removed
3 3 8 1 3 active sync /dev/sda1
4 4 8 65 4 active sync /dev/sde1
Sorry for the length on that. Any ideas on what to do next? I read this guide but it is a bit over my head and seems to indicate the original order of the disks being important, but I think I might have jacked with it enough that I lost that information, or maybe not? Given that the drives themselves report as healthy, I really think this is a weird software issue.mdadm --create --assume-clean --level=5 --raid-devices=5 /dev/md0 /dev/sda1 /dev/sdb1 /dev/sdc1 missing /dev/sde1Like this, without line breaks?
mdadm --assemble --scan to see if mdadm can automatically detect the array.mdadm --assemble /dev/md0 /dev/sda1 /dev/sdb1 /dev/sdc1 /dev/sdd1 /dev/sde1You are not logged in, either login or create an account to post comments
When you do your examine report, the drives think they are different drives than they used to be. a thinks it is c, b thinks it is d, c thinks it is e, e thinks it is a, f thinks it is b
You also have different versions of the superblock- some of the drives think two drives are bad, some of the drives think one is bad, and one thinks they are all good.
So I'm thinking the drives did in fact get reordered. Could be because the failing drive caused the controller to stall on POST and the drives got reported to the OS in the wrong order.
Meanwhile, one of the drives (the original sdd, I think) developed bad sectors or completely dropped out of the array. Then when the machine rebooted, it tried to auto assemble with the reordered drives and tried to use the bad/defunct one as the good one and got completely confused.
What I would do is pull the current sdb (that thinks it was sdd in the array). That's very likely the one that originally went bad and is probably the one with bad sectors. Double check your logs to make sure.
So after that, you'll have to do the create thing listed in the guide you mention. It looks correct to me. I would do it readonly until you are sure about the correct order.
Once you get it going, in the future, save the contents of your mdadm --examine command and also the results of a smartctl -i /dev/sd*1 so that you know the serial number of the drives in the array and what letter they are. So if in the future this happens again, you'll know the correct order based on serial number.
I think there is a way to create/assemble raid devices using disk labels instead of physical positions. This might be something to try, since your system seems to like to reorder drives.
posted by gjc at 2:30 PM on January 20