interesting unraid failure

porina

Of course, when it comes to data storage, interesting is a bad thing. I was doing a routine backup when I noticed a little red X on one of the drives. I poked around the web interface and apparently there were write errors. I did a short SMART test and that reported back ok, and the SMART attributes were ok too.

 

I'm about to see if I can salvage a spare disk from an old server as a temporary replacement and let the array rebuild. The 4 drives in the backup server are all WD reds, and my old server used non-NAS Toshiba drives. While not ideal, it's still better than nothing.

 

Once extracted, I'll connect the suspect drive to another machine and run a full test on it. The machine that houses the Unraid array is an HP MicroServer Gen8, so I would hope it is not failing in some way.

Gaming system: R7 7800X3D, Asus ROG Strix B650E-F Gaming Wifi, Thermalright Phantom Spirit 120 SE ARGB, Corsair Vengeance 2x 32GB 6000C30, RTX 4070, MSI MPG A850G, Fractal Design North, Samsung 990 Pro 2TB, Acer Predator XB241YU 24" 1440p 144Hz G-Sync + HP LP2475w 24" 1200p 60Hz wide gamut
Productivity system: i9-7980XE, Asus X299 TUF mark 2, Noctua D15, 64GB ram (mixed), RTX 3070, NZXT E850, GameMax Abyss, Samsung 980 Pro 2TB, random 1080p + 720p displays.
Gaming laptop: Lenovo Legion 5, 5800H, RTX 3070, Kingston DDR4 3200C22 2x16GB 2Rx8, Kingston Fury Renegade 1TB + Crucial P1 1TB SSD, 165 Hz IPS 1080p G-Sync Compatible


WD's own tool doesn't report anything in SMART, but using another tool I see the raw value for seek error rate is non-zero, which I don't think is normal. Doing a long scan on it now. I also note the other hard disk in the system I connected it to (old WD green) is showing pending sectors so that's probably on the way out too...
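For anyone who wants to check the same attributes from a script, here is a minimal sketch. It assumes the attribute table looks like the one printed by smartmontools' `smartctl -A /dev/sdX`. The tool actually used above isn't named, so that layout is an assumption. The sample text and its numbers are made up for illustration, and the `WATCHED` list is just one choice of attributes to keep an eye on. A non-zero raw seek error rate is normal on some vendors' drives, so treat a hit as a prompt to look closer rather than a verdict.

```python
import re

# Hypothetical sample in the column layout printed by `smartctl -A`.
# The values are invented for illustration, not taken from the drive in this thread.
SAMPLE = """\
ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
  1 Raw_Read_Error_Rate     0x002f   200   200   051    Pre-fail  Always       -       0
  5 Reallocated_Sector_Ct   0x0033   200   200   140    Pre-fail  Always       -       0
  7 Seek_Error_Rate         0x002e   200   200   000    Old_age   Always       -       12
197 Current_Pending_Sector  0x0032   200   200   000    Old_age   Always       -       3
"""

# Attributes whose raw value is worth a second look when it is non-zero
# (an illustrative choice, not an official list).
WATCHED = {
    "Reallocated_Sector_Ct",
    "Seek_Error_Rate",
    "Current_Pending_Sector",
    "Offline_Uncorrectable",
}


def parse_attributes(text):
    """Return {attribute_name: raw_value} for each attribute row in the table."""
    attrs = {}
    for line in text.splitlines():
        fields = line.split()
        # Attribute rows start with a numeric ID and have at least 10 columns;
        # the header row starts with "ID#" and is skipped.
        if len(fields) >= 10 and fields[0].isdigit():
            match = re.match(r"\d+", fields[9])  # leading integer of RAW_VALUE
            if match:
                attrs[fields[1]] = int(match.group())
    return attrs


def suspicious(attrs):
    """Keep only watched attributes that have a non-zero raw value."""
    return {name: raw for name, raw in attrs.items() if name in WATCHED and raw > 0}


if __name__ == "__main__":
    for name, raw in suspicious(parse_attributes(SAMPLE)).items():
        print(f"{name}: raw value {raw}")
```

To use it on a real drive, replace `SAMPLE` with the captured text output of the SMART tool, provided it uses the same column layout.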

 

With the replacement drive in the server, it is estimating 21 hours for a rebuild... maybe I should seriously look at double-redundancy.
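As a rough sanity check on a rebuild estimate like that, the time is roughly drive capacity divided by average sustained throughput, since a rebuild has to write the whole replacement disk. A small sketch follows. The 4 TB capacity and the throughput figure are placeholders, because the drive size isn't stated in this thread, and both function names are made up for illustration.

```python
def rebuild_hours(capacity_tb, avg_mb_per_s):
    """Hours needed to write a whole drive at a given average throughput.

    Uses decimal units (1 TB = 10**12 bytes, 1 MB = 10**6 bytes).
    """
    seconds = (capacity_tb * 10**12) / (avg_mb_per_s * 10**6)
    return seconds / 3600


def implied_throughput_mb_s(capacity_tb, hours):
    """Average MB/s implied by a rebuild of the whole drive taking `hours`."""
    return (capacity_tb * 10**12) / (hours * 3600) / 10**6


if __name__ == "__main__":
    # Placeholder values: a hypothetical 4 TB drive.
    print(f"{rebuild_hours(4, 100):.1f} h at 100 MB/s")
    print(f"{implied_throughput_mb_s(4, 21):.1f} MB/s implied by a 21 h estimate")
```

If the implied throughput comes out far below what the drives can sustain, something other than raw disk speed (controller, parity calculation, other activity on the array) is probably the limit.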


bad SATA cable or port?

never seen a failure like that....

it's always a good idea to have double-redundancy

****SORRY FOR MY ENGLISH IT'S REALLY TERRIBLE*****

Been married to my wife for 3 years now! Yay!


On 11/6/2016 at 4:51 AM, porina said:

WD's own tool doesn't report anything in SMART, but using another tool I see the raw value for seek error rate is non-zero, which I don't think is normal. Doing a long scan on it now. I also note the other hard disk in the system I connected it to (old WD green) is showing pending sectors so that's probably on the way out too...

 

With the replacement drive in the server, it is estimating 21 hours for a rebuild... maybe I should seriously look at double-redundancy.

I have had something similar a few times, but it was also affecting read speeds; the culprit was the SATA cable.

If you have them in a RAID, most RAID cards don't pass through the SMART status. (I know the Gen7's onboard controller doesn't.)


7 hours ago, samiscool51 said:

bad SATA cable or port?

never seen a failure like that....

it's always a good idea to have double-redundancy

 

1 hour ago, Blake said:

I have had something similar a few times, but it was also affecting read speeds; the culprit was the SATA cable.

If you have them in a RAID, most RAID cards don't pass through the SMART status. (I know the Gen7's onboard controller doesn't.)

It is a HP microserver so everything is set up already. I could reseat the master connector on the mobo but there's not much I can do about the rest.

 

It took about 24h but the rebuild with another drive went smoothly and I have a working array again. The suspect drive passed a full surface read, then a full surface write so far. The SMART value for seek error rate has gone back to zero again now. I'm hesitant to put this drive back in the array, given how long it takes to do a rebuild. So I'll reuse it elsewhere as an ancient WD green is now showing SMART warnings with pending sectors.

 

I would add more redundancy but the enclosure has 4 bays and they're all used. I looked for systems with more drive bays but haven't identified one yet at the right price/performance I'm looking for.
