Jump to content

Hey guys, I am not new to RAID, although I am definitely a noob when it comes to it. IE: I know all about how it works and the different variants, but I don't use it much and just have a solid general understanding but nothing in depth like an IT person would who deals with it often.

 

Issue is, I just acquired some Seagate 3 TB drives *not RAID specific, just consumer drives* 7200.14's, and I decided hell, I will upgrade my old RAID solution on my HTPC that was pretty crap. I had to update my mobo's BIOS since it wasn't happy at all about anything, so nbd. Deleted my old RAID after moving my data to an external, set up new RAID, and away we go. Load into windows 10, Intel RST shows evrything is cool, start the copy, and first issue is I am only getting ~25 mbps. So that kinda sux, but I figure the intel RAID controller on my Z67 board isn't top notch, so thus is life. I let it go overnight, only to wake up to a FAILED RAID array. The lowest drive in Intel RST showed as dropped out. So I decide maybe something weird happened, so I cancel the transfer, delete and redo the RAID just to be safe, and this morning it happened again, lowest drive in Intel RST was dropped out.

 

So I am not sure how I RST works, but I assume this means the same drive failed, I will have to investigate more later when I get home from work to see if the same drives always show up in the same order in I RST based on SATA port on the mobo, if so, the same drive has dropped out twice in 1.5 days.... I have a few RAID setups and have never had this issue, so I am not sure what is causing it. Only thing I can think of is the drive being bad. This mobo had a RAID 5 on it before and it never had a hiccup, and if I do remember right when these drives where not in RAID before in my parents PC, once or twice over a year span one of the drives was no longer showing in Windows - I assume it was this drive since its having issues.

 

Would a possible firmware update help? Or is there just something wrong with the drive? And if so, being a consumer drive would Seagate even warranty it if I say it has RAID issues, probably not. At least I can fall back on it just disappeared in Windows while not in RAID...

Rig: i7 13700k +Contact Frame - - Asus Z790-P Wifi - - RTX 4080 - - 4x16GB 6000MHz - - Samsung 990 Pro 2TB NVMe Boot + Main Programs - - Crucial P3 2TB NVMe for photo work - - Corsair RM850x - - Sound BlasterX EA-5 - - Corsair XC8 JTC Edition - - Corsair GPU Full Cover GPU Block - - PTM 7950 - - XT45 X-Flow 420 + UT60 280 rads externally mounted - - EK XRES RGB PWM - - Fractal Define S2 - - DellAlienware AW3423DWF 34" -- Logitech Pro X Superlight - - Logitech G710+ - - LTT Northern Lights Deskpad

 

Headphones/amp/dac: Schiit Bifrost Multibit - -  Schiit Lyr 3 - - Fostex TR-X00 - - Sennheiser HD 6xx

 

Homelab/Media Server: Proxmox VE host - - 512 NVMe Samsung 980 RAID Z1 for VM's/Proxmox boot - - Xeon e5 2660 V4- - Supermicro X10SRF-i - - 128 GB ECC 2133 - - 10x8TB WD Red RAID Z2 - - 2x 800 GB SAS SSD’s (1 SLOG, 1 L2Arc) - - 45 HomeLab HL15 15 Drive 4U - - Corsair RM650i - - LSI 9305-16i HBA - - TreuNAS + many other VM’s

 

Unifi UDM Pro in front of full unifi network infrastructure

 

iPhone 17 Pro - - MacBook Air M3

Link to comment
https://linustechtips.com/topic/479469-raid-5-issues/
Share on other sites

Link to post
Share on other sites

@{EAC} Shoot em UP Test the drive on its own? Do some benchmarks compared to the other ones and see if it is slower/faulty. If not, firmware upgrade.

CPU: i5 4670k @ 3.4GHz + Corsair H100i      GPU: Gigabyte GTX 680 SOC (+215 Core|+162 Mem)     SSD: Kingston V300 240GB (OS)      Headset: Logitech G930 

Case: Cosair Vengance C70 (white)                RAM: 16GB TeamGroup Elite Black DDR3 1600MHz       HDD: 1TB WD Blue                              Mouse: Logitech G602

OS: Windows 7 Home Premium                       PSUXFX Core Edition 750w                                                Motherboard: MSI Z97-G45               Keyboard: Logitech G510

Link to comment
https://linustechtips.com/topic/479469-raid-5-issues/#findComment-6430914
Share on other sites

Link to post
Share on other sites

@{EAC} Shoot em UP Test the drive on its own? Do some benchmarks compared to the other ones and see if it is slower/faulty. If not, firmware upgrade.

I could try that, but I am thinking I will just try and RMA it. It dropped out of normal windows environment on me a few times but I never thought anything of it, but now after thinking about it more that tells me there has to be an issue with the drive. I would try firmware first, I suppose I will see if they have new firmware for the 7200.14's. I have A LOT of them deployed in RAID and not in RAID environments and this is the first issue I have had, so I don't think its firmware since most of them are all on the same or older firmware... But worth a shot.

Rig: i7 13700k +Contact Frame - - Asus Z790-P Wifi - - RTX 4080 - - 4x16GB 6000MHz - - Samsung 990 Pro 2TB NVMe Boot + Main Programs - - Crucial P3 2TB NVMe for photo work - - Corsair RM850x - - Sound BlasterX EA-5 - - Corsair XC8 JTC Edition - - Corsair GPU Full Cover GPU Block - - PTM 7950 - - XT45 X-Flow 420 + UT60 280 rads externally mounted - - EK XRES RGB PWM - - Fractal Define S2 - - DellAlienware AW3423DWF 34" -- Logitech Pro X Superlight - - Logitech G710+ - - LTT Northern Lights Deskpad

 

Headphones/amp/dac: Schiit Bifrost Multibit - -  Schiit Lyr 3 - - Fostex TR-X00 - - Sennheiser HD 6xx

 

Homelab/Media Server: Proxmox VE host - - 512 NVMe Samsung 980 RAID Z1 for VM's/Proxmox boot - - Xeon e5 2660 V4- - Supermicro X10SRF-i - - 128 GB ECC 2133 - - 10x8TB WD Red RAID Z2 - - 2x 800 GB SAS SSD’s (1 SLOG, 1 L2Arc) - - 45 HomeLab HL15 15 Drive 4U - - Corsair RM650i - - LSI 9305-16i HBA - - TreuNAS + many other VM’s

 

Unifi UDM Pro in front of full unifi network infrastructure

 

iPhone 17 Pro - - MacBook Air M3

Link to comment
https://linustechtips.com/topic/479469-raid-5-issues/#findComment-6430946
Share on other sites

Link to post
Share on other sites

I could try that, but I am thinking I will just try and RMA it. It dropped out of normal windows environment on me a few times but I never thought anything of it, but now after thinking about it more that tells me there has to be an issue with the drive. I would try firmware first, I suppose I will see if they have new firmware for the 7200.14's. I have A LOT of them deployed in RAID and not in RAID environments and this is the first issue I have had, so I don't think its firmware since most of them are all on the same or older firmware... But worth a shot.

Yeah fair enough.

CPU: i5 4670k @ 3.4GHz + Corsair H100i      GPU: Gigabyte GTX 680 SOC (+215 Core|+162 Mem)     SSD: Kingston V300 240GB (OS)      Headset: Logitech G930 

Case: Cosair Vengance C70 (white)                RAM: 16GB TeamGroup Elite Black DDR3 1600MHz       HDD: 1TB WD Blue                              Mouse: Logitech G602

OS: Windows 7 Home Premium                       PSUXFX Core Edition 750w                                                Motherboard: MSI Z97-G45               Keyboard: Logitech G510

Link to comment
https://linustechtips.com/topic/479469-raid-5-issues/#findComment-6431009
Share on other sites

Link to post
Share on other sites

I guess the main reason for this thread was to find out if most of the time it is just a bad drive. Or do consumer drives "tend" to fall out of RAID arrays? I assume not.... I am pretty sure that isn't a normal thing to happen and there must be something wrong with the drive to cause it to just quite talking to the RAID controller and cause it to drop. Putting the computer to sleep and waking it fixed it which is interesting.

Rig: i7 13700k +Contact Frame - - Asus Z790-P Wifi - - RTX 4080 - - 4x16GB 6000MHz - - Samsung 990 Pro 2TB NVMe Boot + Main Programs - - Crucial P3 2TB NVMe for photo work - - Corsair RM850x - - Sound BlasterX EA-5 - - Corsair XC8 JTC Edition - - Corsair GPU Full Cover GPU Block - - PTM 7950 - - XT45 X-Flow 420 + UT60 280 rads externally mounted - - EK XRES RGB PWM - - Fractal Define S2 - - DellAlienware AW3423DWF 34" -- Logitech Pro X Superlight - - Logitech G710+ - - LTT Northern Lights Deskpad

 

Headphones/amp/dac: Schiit Bifrost Multibit - -  Schiit Lyr 3 - - Fostex TR-X00 - - Sennheiser HD 6xx

 

Homelab/Media Server: Proxmox VE host - - 512 NVMe Samsung 980 RAID Z1 for VM's/Proxmox boot - - Xeon e5 2660 V4- - Supermicro X10SRF-i - - 128 GB ECC 2133 - - 10x8TB WD Red RAID Z2 - - 2x 800 GB SAS SSD’s (1 SLOG, 1 L2Arc) - - 45 HomeLab HL15 15 Drive 4U - - Corsair RM650i - - LSI 9305-16i HBA - - TreuNAS + many other VM’s

 

Unifi UDM Pro in front of full unifi network infrastructure

 

iPhone 17 Pro - - MacBook Air M3

Link to comment
https://linustechtips.com/topic/479469-raid-5-issues/#findComment-6431046
Share on other sites

Link to post
Share on other sites

@{EAC} Shoot em UP

For your speed issue, yeah that will be your controller. Remember that your CPU has to calculate parity on-the-fly while doing parity RAID. It's a potential bottleneck too.

This is why I vastly prefer RAID 10. No parity to calculate. Much more speed and equal to or better redundancy. You just get less space (which most people consider the deal breaker). 

I was looking for Linus' video on RAID 5 vs RAID 6, but I can't find it (may have forgotten the title).

TL;DR: With larger HDDs, RAID 5 is bad because an Unrecoverable Read Error will make a drive drop out of the array even if the rest of the drive is fine. 1 bit being unreadable makes the rest useless (in a RAID anyway). 

This may not be your problem, but that one drive you have could be having this issue where it's unable to write/read from a portion of the disk.

Usually, if your RAID array is failing, it isn't just 1 drive with this issue since RAID 5 is redundant for 1 drive failure. It's usually two or more that have it and cause a RAID array to fail. 

It's a sinister issue because it isn't cut and dry. It could work once, you think everything is fine, then it fail again once you've started using it. 

A drive dropping out and the RAID failing after it tried to rebuild or you dumped a ton of data on it is what happened to me at my workplace with our 7 drive RAID 5 array. One drive would say it failed, and the rest would seem fine, but it was always a different drive (this is likely because we had 7 drives in the array which makes that more likely to happen). 

Anyway, I'd either RMA or at least check the SMART data on the failing drive(s). It usually lists UREs (Unrecoverable Read Errors) that occur or something like it. Super high seek time is the issue basically. 

To give more regarding that, when your drive has a URE occur, it keeps seeking that bit of data, unable to get it. Your RAID controller is waiting on the drive, but the drive is still trying to find what it needs. Eventually, your RAID controller gives up on waiting (like 45 seconds usually?) and just goes "your array failed". 

Western Digital Reds and other large/enterprise storage oriented drives have shorter "we give up" times set in them for UREs so that they don't cause this. i.e. a single file is corrupt, but the rest are fine and accessible, as compared to the whole drive being considered "failed". Regular consumer drives have longer seek times for UREs before giving up because it's more important for regular consumers who don't use RAID to get that bit of data back as they are far less likely to have backups. 

You could just use RAID 6, because the extra redundancy will basically prevent this from happening (two drives have to have a URE or fail in RAID 5 for it to happen, and that's somewhat likely, but three drives have to do this for the RAID 6 to fail, and that's much less likely). 

It's just a natural issue with RAID 5. Only 1 drive of redundancy and you are using 3TB drives (bigger the drive = more likely to have a URE). There's no way around it other than using different HDDs or a different kind of RAID.

† Christian Member †

For my pertinent links to guides, reviews, and anything similar, go here, and look under the spoiler labeled such. A brief history of Unix and it's relation to OS X by Builder.

 

 

Link to comment
https://linustechtips.com/topic/479469-raid-5-issues/#findComment-6431062
Share on other sites

Link to post
Share on other sites

I guess the main reason for this thread was to find out if most of the time it is just a bad drive. Or do consumer drives "tend" to fall out of RAID arrays? 

 

Consumer drives to tend to fall out of RAID arrays. Basically if a read/write request takes longer than normal and the drive doesn't report back the raid controller assumes the drive isn't responding so it drops out. 

RAID drives get around this with a function thats built into the firmware that's called TLER (I think seagate has a different name for it). The shortest explanation is it basically allows the drive to say 'hey I'm here, I'm just getting some read errors and it's taking me a little longer than normal, don't drop me'. It's obviously more complicated than that, but you get the idea. 

Link to comment
https://linustechtips.com/topic/479469-raid-5-issues/#findComment-6431111
Share on other sites

Link to post
Share on other sites

@{EAC} Shoot em UP

For your speed issue, yeah that will be your controller. Remember that your CPU has to calculate parity on-the-fly while doing parity RAID. It's a potential bottleneck too.

This is why I vastly prefer RAID 10. No parity to calculate. Much more speed and equal to or better redundancy. You just get less space (which most people consider the deal breaker). 

I was looking for Linus' video on RAID 5 vs RAID 6, but I can't find it (may have forgotten the title).

TL;DR: With larger HDDs, RAID 5 is bad because an Unrecoverable Read Error will make a drive drop out of the array even if the rest of the drive is fine. 1 bit being unreadable makes the rest useless (in a RAID anyway). 

This may not be your problem, but that one drive you have could be having this issue where it's unable to write/read from a portion of the disk.

Usually, if your RAID array is failing, it isn't just 1 drive with this issue since RAID 5 is redundant for 1 drive failure. It's usually two or more that have it and cause a RAID array to fail. 

It's a sinister issue because it isn't cut and dry. It could work once, you think everything is fine, then it fail again once you've started using it. 

A drive dropping out and the RAID failing after it tried to rebuild or you dumped a ton of data on it is what happened to me at my workplace with our 7 drive RAID 5 array. One drive would say it failed, and the rest would seem fine, but it was always a different drive (this is likely because we had 7 drives in the array which makes that more likely to happen). 

Anyway, I'd either RMA or at least check the SMART data on the failing drive(s). It usually lists UREs (Unrecoverable Read Errors) that occur or something like it. Super high seek time is the issue basically. 

To give more regarding that, when your drive has a URE occur, it keeps seeking that bit of data, unable to get it. Your RAID controller is waiting on the drive, but the drive is still trying to find what it needs. Eventually, your RAID controller gives up on waiting (like 45 seconds usually?) and just goes "your array failed". 

Western Digital Reds and other large/enterprise storage oriented drives have shorter "we give up" times set in them for UREs so that they don't cause this. i.e. a single file is corrupt, but the rest are fine and accessible, as compared to the whole drive being considered "failed". Regular consumer drives have longer seek times for UREs before giving up because it's more important for regular consumers who don't use RAID to get that bit of data back as they are far less likely to have backups. 

You could just use RAID 6, because the extra redundancy will basically prevent this from happening (two drives have to have a URE or fail in RAID 5 for it to happen, and that's somewhat likely, but three drives have to do this for the RAID 6 to fail, and that's much less likely). 

It's just a natural issue with RAID 5. Only 1 drive of redundancy and you are using 3TB drives (bigger the drive = more likely to have a URE). There's no way around it other than using different HDDs or a different kind of RAID.

 

Yes, this all makes sense. Except for the speed of the CPU issue, this is on a 3770k lol. I fully understand the SATA controller isn't exactly top notch here, but the CPU sure shouldn't be the issue, I admit the communication lines of the mobo are not set up for super RAID 5 performance though, so that is fine and I accept it.

 

I may try RAID 10 (if my mobo even supports it, it should) as your right, this would solve all the speed issues, although in HDtune I was getting like 500+ mbps read out of the RAID 5 which is plenty fine, but 25 mbps writes kinda suck, although I feel like 25 is REALLY low, maybe has to do with this drive being bad and its seek times being all over the map causing the entire array to slow down. I suppose I will delete the RAID once more and see if there are any smart errors on any of the drives after loading them up with a bunch of DATA, my main drive in that rig is an SSD, so I will just copy a bunch of data onto all 4 drives at once from my SSD and see what happens.

 

I am thinking RMA, but I will try this first I suppose.

Consumer drives to tend to fall out of RAID arrays. Basically if a read/write request takes longer than normal and the drive doesn't report back the raid controller assumes the drive isn't responding so it drops out. 

RAID drives get around this with a function thats built into the firmware that's called TLER (I think seagate has a different name for it). The shortest explanation is it basically allows the drive to say 'hey I'm here, I'm just getting some read errors and it's taking me a little longer than normal, don't drop me'. It's obviously more complicated than that, but you get the idea. 

Correct, I agree with this, but I feel like it shouldn't happen 100% of the time lol

Rig: i7 13700k +Contact Frame - - Asus Z790-P Wifi - - RTX 4080 - - 4x16GB 6000MHz - - Samsung 990 Pro 2TB NVMe Boot + Main Programs - - Crucial P3 2TB NVMe for photo work - - Corsair RM850x - - Sound BlasterX EA-5 - - Corsair XC8 JTC Edition - - Corsair GPU Full Cover GPU Block - - PTM 7950 - - XT45 X-Flow 420 + UT60 280 rads externally mounted - - EK XRES RGB PWM - - Fractal Define S2 - - DellAlienware AW3423DWF 34" -- Logitech Pro X Superlight - - Logitech G710+ - - LTT Northern Lights Deskpad

 

Headphones/amp/dac: Schiit Bifrost Multibit - -  Schiit Lyr 3 - - Fostex TR-X00 - - Sennheiser HD 6xx

 

Homelab/Media Server: Proxmox VE host - - 512 NVMe Samsung 980 RAID Z1 for VM's/Proxmox boot - - Xeon e5 2660 V4- - Supermicro X10SRF-i - - 128 GB ECC 2133 - - 10x8TB WD Red RAID Z2 - - 2x 800 GB SAS SSD’s (1 SLOG, 1 L2Arc) - - 45 HomeLab HL15 15 Drive 4U - - Corsair RM650i - - LSI 9305-16i HBA - - TreuNAS + many other VM’s

 

Unifi UDM Pro in front of full unifi network infrastructure

 

iPhone 17 Pro - - MacBook Air M3

Link to comment
https://linustechtips.com/topic/479469-raid-5-issues/#findComment-6431158
Share on other sites

Link to post
Share on other sites

Create an account or sign in to comment

You need to be a member in order to leave a comment

Create an account

Sign up for a new account in our community. It's easy!

Register a new account

Sign in

Already have an account? Sign in here.

Sign In Now

×