Everything posted by Windows7ge
I only had to fight tooth and nail to get it but my replacement SSD turned into an upgraded SSD.
Gen3 -> Gen4
7300 PRO -> 7400 PRO
Just an FYI: be careful buying from 3rd party marketplace sellers. I just went through Hell getting this replacement. Amazon couldn't do anything for me, and the seller ghosted me. I had to talk to Micron, who just said they aren't the distributor. I gave them the S/N, and they directed me to Crucial. Crucial processed my RMA. Told me they didn't have 7300s anymore. Offered me a 7400. Sent me a confirmation for replacement. Sent me another email saying they were out of stock. Then a couple days ago I got another email that my replacement finally shipped.
So that sucked.
Worse, I'm starting to suspect the motherboard is killing the SSDs but I don't know why. This is the second SSD I've replaced in this slot.
Each time I replace the SSD I start getting this error in the log:
Nov 11 08:46:42 intel kernel: [1346300.041503] {62}[Hardware Error]: Hardware error from APEI Generic Hardware Error Source: 0
Nov 11 08:46:42 intel kernel: [1346300.042451] {62}[Hardware Error]: It has been corrected by h/w and requires no further action
Nov 11 08:46:42 intel kernel: [1346300.042893] {62}[Hardware Error]: event severity: corrected
Nov 11 08:46:42 intel kernel: [1346300.043288] {62}[Hardware Error]: Error 0, type: corrected
Nov 11 08:46:42 intel kernel: [1346300.043683] {62}[Hardware Error]: section_type: PCIe error
Nov 11 08:46:42 intel kernel: [1346300.044068] {62}[Hardware Error]: port_type: 4, root port
Nov 11 08:46:42 intel kernel: [1346300.044452] {62}[Hardware Error]: version: 3.0
Nov 11 08:46:42 intel kernel: [1346300.044842] {62}[Hardware Error]: command: 0x0547, status: 0x0010
Nov 11 08:46:42 intel kernel: [1346300.045228] {62}[Hardware Error]: device_id: 0000:17:00.0
Nov 11 08:46:42 intel kernel: [1346300.045620] {62}[Hardware Error]: slot: 21
Nov 11 08:46:42 intel kernel: [1346300.045998] {62}[Hardware Error]: secondary_bus: 0x18
Nov 11 08:46:42 intel kernel: [1346300.046369] {62}[Hardware Error]: vendor_id: 0x8086, device_id: 0x2030
Nov 11 08:46:42 intel kernel: [1346300.046747] {62}[Hardware Error]: class_code: 060400
Nov 11 08:46:42 intel kernel: [1346300.047113] {62}[Hardware Error]: bridge: secondary_status: 0x0000, control: 0x0013
Nov 11 08:46:42 intel kernel: [1346300.055863] pcieport 0000:17:00.0: AER: aer_status: 0x00001000, aer_mask: 0x00002000
Nov 11 08:46:42 intel kernel: [1346300.056233] pcieport 0000:17:00.0: [12] Timeout
Nov 11 08:46:42 intel kernel: [1346300.056606] pcieport 0000:17:00.0: AER: aer_layer=Data Link Layer, aer_agent=Transmitter ID
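For anyone reading along, here's a rough Python sketch of how the aer_status and aer_mask values in that log break down. It assumes the standard bit layout of the PCIe AER Correctable Error Status register; the table and the function name decode_correctable are just mine for illustration, not anything from the kernel.

```python
# Bit positions in the PCIe AER Correctable Error Status/Mask registers
# (assumed from the PCIe spec layout; names here are illustrative).
CORRECTABLE_BITS = {
    0: "Receiver Error",
    6: "Bad TLP",
    7: "Bad DLLP",
    8: "REPLAY_NUM Rollover",
    12: "Replay Timer Timeout",
    13: "Advisory Non-Fatal Error",
    14: "Corrected Internal Error",
    15: "Header Log Overflow",
}


def decode_correctable(value):
    """Return the names of all bits set in a correctable status/mask value."""
    names = []
    for bit in range(32):
        if value & (1 << bit):
            names.append(CORRECTABLE_BITS.get(bit, f"bit {bit} (reserved/unknown)"))
    return names


# Values copied from the log above.
print("aer_status:", decode_correctable(0x00001000))
print("aer_mask:  ", decode_correctable(0x00002000))
```

So the status is bit 12, a replay timer timeout on the link (the "[12] Timeout" line), and the mask only hides advisory non-fatal errors.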
When I look up what component is at address 17:00.0:
17:00.0 PCI bridge: Intel Corporation Sky Lake-E PCI Express Root Port A (rev 04)
So this is either part of the CPU or C621 PCH and might be killing the SSDs. I'm not sure. If I lose a 3rd SSD I'm just going to stop using that slot and assume the motherboard has a problem.
-
22 minutes ago, TopHatProductions115 said:
Why not test a replacement motherboard? Just curious...
It's a dual socket LGA3647 board w/ 7 active PCI_e slots, and it's my primary hypervisor server with multiple things passed through.
I don't really have it in me to find a temporary replacement board without taking it offline for a long period of time.
Reading the motherboard schematic, the PCH only gets one whopping PCI_e lane, so the current SSD in that slot, a Kingston DC1000B, is getting 1/4th the bandwidth and sharing it with other devices. I'm better off putting it on a riser card in a CPU2 slot. The Intel UPI link between the sockets (QPI's successor on LGA3647) has much higher bandwidth. It'd be worth it. I might just put them on the Supermicro 4x22110 riser card I have if that happens.
-
Actually, now that you've really brought it to my attention, I think I have to swap my SSDs with my HDDs. There's no way all of this is sharing a single PCI_e Gen3 lane. That caps multi-gig performance at about 1GB/s.
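Quick back-of-the-envelope for where that roughly 1GB/s figure comes from, as a small Python sketch. It assumes the usual Gen3 numbers (8 GT/s per lane, 128b/130b encoding) and ignores packet/protocol overhead, so real-world throughput will be a bit lower. The function name is just made up for the example.

```python
def pcie_gen3_gbytes_per_s(lanes):
    """Theoretical PCIe Gen3 bandwidth in GB/s, before protocol overhead."""
    raw_gt_per_s = 8.0          # Gen3 signalling rate per lane (GT/s)
    encoding = 128 / 130        # 128b/130b line encoding efficiency
    bits_per_byte = 8
    return raw_gt_per_s * encoding / bits_per_byte * lanes


# Compare a single shared lane against a full x4 link for an NVMe SSD.
for lanes in (1, 4):
    print(f"Gen3 x{lanes}: {pcie_gen3_gbytes_per_s(lanes):.3f} GB/s")
```

That works out to just under 1 GB/s for x1 and just under 4 GB/s for x4, which is why an x4 NVMe drive behind a single shared lane only sees about a quarter of its link.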