
This problem has somewhat followed me through the entire ship-of-Theseus that is my main rig.

 

Whenever I'm doing something with very high I/O utilization (i.e. hundreds of MB/s), the machine gets remarkably unstable and crashes at random. It flashes a BSoD, but the screen gets garbled on my ultrawide and it never writes a dump. Nothing meaningful shows up in the Windows Event Log. For example, I've tried doing a full write/format on three USB HDDs and, after several attempts, made it at most about 1TB into 8TB.

 

The first few times this came up, it was because I was flying too close to the sun trying to run 3200 with a 1600X, but since then I've never been able to conclusively figure out what's causing the instability. Sometimes simple file copies result in data corruption upon verification (verifying is a habit I got into from the first iteration). Sometimes it goes away on its own for a while. My hunch is that it's related to Infinity Fabric, but I have very little to go on.
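
For anyone wanting to reproduce the "verify after copy" habit mentioned above, here is a minimal sketch in Python. It is only an illustration, not the tool actually used in this thread: the helper names `sha256_of` and `copy_and_verify` are made up for the example, and it assumes a plain file-to-file copy is what needs checking.

```python
import hashlib
import shutil


def sha256_of(path, chunk_size=1024 * 1024):
    """Hash a file in 1 MiB chunks so large files don't need to fit in RAM."""
    digest = hashlib.sha256()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def copy_and_verify(src, dst):
    """Copy src to dst, then compare SHA-256 digests of both files.

    Returns True when the digests match. Hypothetical helper for illustration.
    """
    shutil.copyfile(src, dst)
    return sha256_of(src) == sha256_of(dst)
```

One caveat with this approach: a read-back right after the copy may be served from the OS file cache rather than from the disk, so a matching hash is weaker evidence than one taken after the cache has been flushed or the drive has been reconnected.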

 

I have over the lifetime of the machine replaced:

  • The CPU (1600X > 3800XT > 5800X).
  • The motherboard (going from Asus B350-F to Asus B550-Creator)
  • The RAM (going from 16GB G.Skill 3200CL14 to 32GB G.Skill 3600CL16) (I haven't tried with XMP off, but I also don't want to waste nice RAM, so we'll save that for absolute last)
  • The GPU (GTX770 > RX 5600 XT > RTX 3060 Ti > RX 6600)
  • The PSU (CX650M to RM750x)
  • The case (now a Hyte Y60)
  • The OS storage (Samsung 960 PRO to 990 PRO)
  • The OS (reinstalled several times, though not since I replaced the RAM)
  • The cooling...
  • [literally no longer contains any functional part of the original machine]

 

I've run Memtests, changed out USB cards and cables, run it on a UPS, the whole shebang.

 

The weird part to me is I have a pretty similar machine (Asrock B450, different 5800X, the old 3200CL14 RAM kit (at 3600, no less), GTX 1070Ti, Samsung 9A1) that does these tasks perfectly.

 

My next steps are going to be to reseat the memory and reinstall Windows (which was on the todo list because of Win11 anyways), but is there some other step I'm missing? I might even swap the 5800Xs (and RMA with AMD?).

Main System (Byarlant): Ryzen 7 5800X | Asus B550-Creator ProArt | EK 240mm Basic AIO | 32GB G.Skill DDR4 3600MT/s CL16 | XFX Speedster SWFT 210 RX 6600 | Samsung 990 PRO 2TB / Samsung 960 PRO 512GB / 4× Crucial MX500 2TB (RAID-0) | Corsair RM750X | Silicom (Intel) X540-AT2 10G NIC | Inateck USB 3.0 Card | Hyte Y60 Case | Dell U3415W Monitor | Keychron K4 Brown (white backlight)

 

Laptop (Narrative): Lenovo Flex 5 81X20005US | Ryzen 5 4500U | 16GB DDR4 3200MT/s (soldered) | Vega II 384SP Graphics | SKHynix P31 1TB NVMe SSD | Intel AX200 Wifi | Asus 2.5G USB NIC | Asus ProArt PA278QV | Keychron K12 Blue (RGB backlight)

 

Proxmox Server (Veda): Ryzen 7 3800XT | ASRock Rack X470D4U | Corsair H80i v2 | 64GB Micron DDR4 ECC 3200MT/s | 4× WD 10TB / 4× Seagate 14TB Exos / 8× WD 12TB (custom external SAS enclosure) / 2× Samsung PM963a 960GB SSD | Seasonic Prime Fanless 500W | Intel X550-T2 10G NIC | LSI 9300-8i HBA | Adaptec 82885T SAS Expander | Fractal Design Node 804 Case

 

Proxmox Server (La Vie en Rose): GMKtec Mini PC | Ryzen 7 5700U | 32GB Lexar DDR4 (SODIMM) | Vega II 512SP Graphics | Lexar 1TB 610 Pro SSD | 2× Realtek 8125 2.5G NICs


Media Center/Video Capture (Jesta Cannon): Ryzen 5 1600X | ASRock B450M Pro4 R2.0 | Noctua NH-L12S | 16GB Crucial DDR4 3200MT/s | EVGA GTX750Ti SC | UMIS NVMe SSD 256GB / TEAMGROUP MS30 1TB | Corsair CX450M | Viewcast Osprey 260e Video Capture | TrendNet (Aquantia AQC107) 10G NIC | LG UH12NS30 BD-ROM | Silverstone Sugo SG-11 Case | Sony XR65A80K

 

Workbench (Doven Wolf): Lenovo m715q | Ryzen Pro 3 2200GE | 16GB Crucial DDR4 3200MT/s (SODIMM) | Vega 8 Graphics | SKHynix (OEM) 256GB NVMe SSD | uni 2.5G USB NIC | HDMI add-in module

 

Network:

                       ┌─────────────── Office/Rack ───────────────────────────────────────────────────────┐
Google Fiber Webpass ── Cloud Gateway Max ═╦════ Flex 2.5-8 ═╦════ Flex XG ═╦═ Veda
                           La Vie en Rose ═╣ La Vie en Rose ═╬═ Doven Wolf  ╠═ Veda-NAS
                                     Veda ─╜      Narrative ═╝              ╟─ Switch 8-60W ─┬─ Veda
╔═══════════════════════════════════════════════════════════════════════════╝                └─ Veda (IPMI)
║    ┌ Closet ┐     ┌───────── Bedroom ─────────┐
╚════ Flex XG ═╦╤═══ Flex XG ═╤╦═ Byarlant
        (PoE)  ║│             │╠═ Narrative 
Kitchen Jack ══╣└─ Dual PoE ┐ │╚═ Jesta Cannon*
   (Testing)   ║┌─ Injector ┘ └── Work Laptop
     Bedroom ══╝│
        Jack #2 │        ┌──────── Media Center ───────────────────────────┐
                └──────── Switch 8 ────────────┬─ nanoHD Access Point (PoE)
Notes:                                         ├─ Sony PlayStation 4 
─── is Gigabit / ═══ is Multi-Gigabit          ├─ Pioneer VSX-S520
* = cable passed from Bedroom to Media Center  └─ Sony XR65A80K (Google TV)

 

https://linustechtips.com/topic/1615467-ryzen-am4-unstable-under-high-io/

10 hours ago, AbydosOne said:

It flashes a BSoD, but the screen gets garbled on my ultrawide and it never writes a dump.

That glitch can happen on any monitor with a resolution higher than 1080p, not just ultrawides. Because you don't get dump files, let's at least check what the crash errors are. In Event Viewer, find the Kernel-Power event ID 41 crashes. (If they are 6008, they don't have any info; that event comes from it crashing during the crash, Inception style.) Select the Details tab and screenshot 3-5 of them. You can also right click → save them (highlighting multiple works) and upload the .evtx file.
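
If clicking through Event Viewer is tedious, the same events can probably be pulled from a script. The sketch below is an assumption-laden alternative to the GUI steps above, not something from this thread: it builds a `wevtutil qe` query for Kernel-Power event ID 41 in the System log and, on Windows only, runs it. The function names are hypothetical.

```python
import subprocess


def kernel_power_41_command(count=5):
    """Build a wevtutil command listing the newest Kernel-Power ID 41 events as XML."""
    query = (
        "*[System[Provider[@Name='Microsoft-Windows-Kernel-Power']"
        " and (EventID=41)]]"
    )
    return [
        "wevtutil", "qe", "System",  # query-events against the System log
        f"/q:{query}",               # XPath filter for the events of interest
        "/f:xml",                    # XML output, same fields as the Details tab
        f"/c:{count}",               # limit the number of events returned
        "/rd:true",                  # reverse direction: newest first
    ]


def fetch_events(count=5):
    """Run the query and return the raw XML text (Windows only)."""
    result = subprocess.run(
        kernel_power_41_command(count),
        capture_output=True, text=True, check=True,
    )
    return result.stdout

# Usage on the affected Windows machine:
# print(fetch_events(5))
```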


10 hours ago, AbydosOne said:

but is there some other step I'm missing?

What have you tried? Different BIOS versions?

Note: Users receive notifications after Mentions & Quotes. 

Feel free: To ask any question, no matter what question it is, I will try to answer. I know a lot about PCs but not everything.

current PC:

Ryzen 5 5600 |16GB DDR4 3200Mhz | B450 | GTX 1080 ti [further details on my profile]

PC configs I used before:

  1. Pentium G4500 | 4GB/8GB DDR4 2133Mhz | H110 | GTX 1050
  2. Ryzen 3 1200 3,5Ghz / OC:4Ghz | 8GB DDR4 2133Mhz / 16GB 3200Mhz | B450 | GTX 1050
  3. Ryzen 3 1200 3,5Ghz | 16GB 3200Mhz | B450 | GTX 1080 ti

11 hours ago, Bjoolz said:

Because you don't get dump files, let's at least check what the crash errors are. In Event Viewer, find the Kernel-Power event ID 41 crashes

TIL: you can get the BSoD code from the error 41 entries... I thought those only indicated the unexpected shutdown and didn't contain any info on why. All of the recent ones are 0x124 WHEA_UNCORRECTABLE_ERROR.

 

10 hours ago, podkall said:

What have you tried? Different BIOS versions?

I updated the BIOS a few months ago (after realizing it had a BIOS from 2021 still). I believe it's on 3802 now.

 

I went through and reset the BIOS last night and set everything back up from scratch. (Why are fan profiles part of the BIOS config? Why is fan control so hard in the first place?) I also updated the chipset drivers, per some other discussions about Zen 3 and 0x124 BSoDs.

 

I tossed a couple of things at it that would have previously caused issues and nothing happened, but it wasn't exactly the workflow that was causing issues before (I was working in the kitchen not the bedroom; the HDD docks are in the bedroom). Tried y-cruncher 5B and it was stable.

 

 

On a fun aside I forgot to add: when it crashes, it removes my 10G switches (and only the 10G switches) from under my Unifi controller. I swear I have the weirdest edge-case bugs. Took it crashing like three times before I put those two things together.


57 minutes ago, AbydosOne said:

TIL: you can get the BSoD code from the error 41 entries... I thought those only indicated the unexpected shutdown and didn't contain any info on why. All of the recent ones are 0x124 WHEA_UNCORRECTABLE_ERROR.

I need the crash parameters. BugCheckParameter 1-4. 

 

Because you aren't getting dump files, I'm guessing parameter 1 is 0x0000000000000010, which is an NVMe error.


8 hours ago, Bjoolz said:

I need the crash parameters. BugCheckParameter 1-4. 

 

Because you aren't getting dump files, I'm guessing parameter 1 is 0x0000000000000010, which is an NVMe error.

Here's a couple (slightly sanitized of personal info):

 

<Event xmlns="http://schemas.microsoft.com/win/2004/08/events/event">
  <System>
    <Provider Name="Microsoft-Windows-Kernel-Power" Guid="{331c3b3a-2005-44c2-ac5e-77220c37d6b4}" />
    <EventID>41</EventID>
    <Version>8</Version>
    <Level>1</Level>
    <Task>63</Task>
    <Opcode>0</Opcode>
    <Keywords>0x8000400000000002</Keywords>
    <TimeCreated SystemTime="2025-06-17T22:10:18.2765351Z" />
    <EventRecordID>252333</EventRecordID>
    <Correlation />
    <Execution ProcessID="4" ThreadID="8" />
    <Channel>System</Channel>
    <Computer>AbydosOne-Byarlant</Computer>
    <Security UserID="S-1-5-18" />
  </System>
  <EventData>
    <Data Name="BugcheckCode">292</Data>
    <Data Name="BugcheckParameter1">0x10</Data>
    <Data Name="BugcheckParameter2">0x0</Data>
    <Data Name="BugcheckParameter3">0x0</Data>
    <Data Name="BugcheckParameter4">0x0</Data>
    <Data Name="SleepInProgress">0</Data>
    <Data Name="PowerButtonTimestamp">0</Data>
    <Data Name="BootAppStatus">0</Data>
    <Data Name="Checkpoint">0</Data>
    <Data Name="ConnectedStandbyInProgress">false</Data>
    <Data Name="SystemSleepTransitionsToOn">0</Data>
    <Data Name="CsEntryScenarioInstanceId">0</Data>
    <Data Name="BugcheckInfoFromEFI">true</Data>
    <Data Name="CheckpointStatus">0</Data>
    <Data Name="CsEntryScenarioInstanceIdV2">0</Data>
    <Data Name="LongPowerButtonPressDetected">false</Data>
  </EventData>
</Event>

 

<Event xmlns="http://schemas.microsoft.com/win/2004/08/events/event">
  <System>
    <Provider Name="Microsoft-Windows-Kernel-Power" Guid="{331c3b3a-2005-44c2-ac5e-77220c37d6b4}" />
    <EventID>41</EventID>
    <Version>8</Version>
    <Level>1</Level>
    <Task>63</Task>
    <Opcode>0</Opcode>
    <Keywords>0x8000400000000002</Keywords>
    <TimeCreated SystemTime="2025-06-16T23:58:45.3226881Z" />
    <EventRecordID>252198</EventRecordID>
    <Correlation />
    <Execution ProcessID="4" ThreadID="8" />
    <Channel>System</Channel>
    <Computer>AbydosOne-Byarlant</Computer>
    <Security UserID="S-1-5-18" />
  </System>
  <EventData>
    <Data Name="BugcheckCode">0</Data>
    <Data Name="BugcheckParameter1">0x0</Data>
    <Data Name="BugcheckParameter2">0x0</Data>
    <Data Name="BugcheckParameter3">0x0</Data>
    <Data Name="BugcheckParameter4">0x0</Data>
    <Data Name="SleepInProgress">0</Data>
    <Data Name="PowerButtonTimestamp">0</Data>
    <Data Name="BootAppStatus">0</Data>
    <Data Name="Checkpoint">0</Data>
    <Data Name="ConnectedStandbyInProgress">false</Data>
    <Data Name="SystemSleepTransitionsToOn">0</Data>
    <Data Name="CsEntryScenarioInstanceId">0</Data>
    <Data Name="BugcheckInfoFromEFI">false</Data>
    <Data Name="CheckpointStatus">0</Data>
    <Data Name="CsEntryScenarioInstanceIdV2">0</Data>
    <Data Name="LongPowerButtonPressDetected">false</Data>
  </EventData>
</Event>

 

<Event xmlns="http://schemas.microsoft.com/win/2004/08/events/event">
  <System>
    <Provider Name="Microsoft-Windows-Kernel-Power" Guid="{331c3b3a-2005-44c2-ac5e-77220c37d6b4}" />
    <EventID>41</EventID>
    <Version>8</Version>
    <Level>1</Level>
    <Task>63</Task>
    <Opcode>0</Opcode>
    <Keywords>0x8000400000000002</Keywords>
    <TimeCreated SystemTime="2025-06-14T20:34:02.8659892Z" />
    <EventRecordID>252038</EventRecordID>
    <Correlation />
    <Execution ProcessID="4" ThreadID="8" />
    <Channel>System</Channel>
    <Computer>AbydosOne-Byarlant</Computer>
    <Security UserID="S-1-5-18" />
  </System>
  <EventData>
    <Data Name="BugcheckCode">292</Data>
    <Data Name="BugcheckParameter1">0x10</Data>
    <Data Name="BugcheckParameter2">0x0</Data>
    <Data Name="BugcheckParameter3">0x0</Data>
    <Data Name="BugcheckParameter4">0x0</Data>
    <Data Name="SleepInProgress">0</Data>
    <Data Name="PowerButtonTimestamp">0</Data>
    <Data Name="BootAppStatus">0</Data>
    <Data Name="Checkpoint">0</Data>
    <Data Name="ConnectedStandbyInProgress">false</Data>
    <Data Name="SystemSleepTransitionsToOn">9</Data>
    <Data Name="CsEntryScenarioInstanceId">0</Data>
    <Data Name="BugcheckInfoFromEFI">true</Data>
    <Data Name="CheckpointStatus">0</Data>
    <Data Name="CsEntryScenarioInstanceIdV2">0</Data>
    <Data Name="LongPowerButtonPressDetected">false</Data>
  </EventData>
</Event>

 

One has param 1 = 0x0, but the other two are 0x10.
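
Note that `BugcheckCode` in these events is decimal while the parameters are hex: 292 decimal is 0x124, which matches the WHEA_UNCORRECTABLE_ERROR code mentioned earlier. A minimal Python sketch for reading the fields out of an event like the ones above follows. The `SAMPLE` text is a trimmed copy of the first event, and `summarize_event` is a hypothetical helper name, not part of any Windows tooling.

```python
import xml.etree.ElementTree as ET

# Namespace used by the Windows event XML shown above.
NS = {"e": "http://schemas.microsoft.com/win/2004/08/events/event"}

# Trimmed copy of the first event, keeping only the bugcheck fields.
SAMPLE = """<Event xmlns="http://schemas.microsoft.com/win/2004/08/events/event">
  <System>
    <EventID>41</EventID>
  </System>
  <EventData>
    <Data Name="BugcheckCode">292</Data>
    <Data Name="BugcheckParameter1">0x10</Data>
    <Data Name="BugcheckParameter2">0x0</Data>
    <Data Name="BugcheckParameter3">0x0</Data>
    <Data Name="BugcheckParameter4">0x0</Data>
  </EventData>
</Event>"""


def summarize_event(xml_text):
    """Return a one-line summary of the bugcheck fields in one event."""
    root = ET.fromstring(xml_text)
    data = {
        d.get("Name"): (d.text or "").strip()
        for d in root.iterfind("./e:EventData/e:Data", NS)
    }
    code = int(data.get("BugcheckCode", "0"))  # stored as decimal
    params = [
        int(data.get(f"BugcheckParameter{i}", "0x0"), 16)  # stored as hex
        for i in range(1, 5)
    ]
    if code == 0:
        return "no bugcheck recorded"
    return f"bugcheck 0x{code:X}, parameters " + ", ".join(f"0x{p:X}" for p in params)


print(summarize_event(SAMPLE))  # bugcheck 0x124, parameters 0x10, 0x0, 0x0, 0x0
```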


9 hours ago, AbydosOne said:

Here's a couple (slightly sanitized of personal info):

 


One has param 1 = 0x0, but the other two are 0x10.

The one that shows 0x0 isn't from a BSOD, so the value isn't really 0, it's just empty. Based on the other two, the NVMe drive would be the main suspect. I don't know if you had this exact issue with the old SSD, but because storage errors are reported by the driver (all other WHEA errors are reported by the CPU), it doesn't have to be the drive itself. It could be the driver, the M.2 slot, or the motherboard.


4 hours ago, Bjoolz said:

Based on the other two, the NVMe drive would be the main suspect. I don't know if you had this exact issue with the old SSD, but because storage errors are reported by the driver (all other WHEA errors are reported by the CPU), it doesn't have to be the drive itself. It could be the driver, the M.2 slot, or the motherboard.

This is an intriguing possibility... I reseated both NVMe drives. Both are Samsung PRO drives, so I'm not jumping to the conclusion the drives are dodgy yet.

 

After more redditing, I bumped the SoC voltage up to 1.15V. It was stable for >1.2TB of data transfer at ~350MB/s (combined) over several hours, so I guess we'll revisit this if/when it recurs. Could Infinity Fabric instability manifest as an NVMe error?

 

I'm half curious whether just unplugging it from AC (to move it for testing) was enough to reset something...


22 minutes ago, Tetras said:

Have you been using risers in each case?

Old case didn't have one. I investigated that possibility when my GPU was the culprit by rebuilding the entire machine into the old case and didn't come up with anything (the since-RMAed-and-replaced GPU has been fine, near as I can tell).


On 6/18/2025 at 12:04 PM, AbydosOne said:

This problem has somewhat followed me through the entire ship-of-Theseus that is my main rig.

 


Do you still have your older CPUs and/or motherboards (swap one at a time to find the problematic part!) to try with?

AMD Ryzen™ 5 5600g w/ Radeon Graphics | 16GB DDR4-3200 RAM | 256GB NVME SSD + 2TB HDD | Amazon Basics 2.0 Speakers

                                                                                            I'M JUST A REAL-LIFE TOM SAWYER


1 minute ago, KidKid said:

Do you still have your older CPUs and/or motherboards (swap one at a time to find the problematic part!) to try with?

Some parts I do, but most are allocated to other machines right now.

 

Besides, if the problem has followed me despite my having replaced all the parts, how would putting the old ones back in help? Either changing the part didn't affect anything, or both the old and new parts contributed to the issue, which makes it a moot point.

 

The only variable that seems to make a meaningful difference is the motherboard brand: my ASRock AM4 boards have been nothing if not perfectly reliable, while my Asus boards have always had weird quirks like this.


 


4 hours ago, AbydosOne said:

After more redditing, I bumped the SoC voltage up to 1.15V. It was stable for >1.2TB of data transfer at ~350MBps (combined) for several hours, so I guess we'll revisit this if/when it reoccurs. Could Infinity Fabric instability manifest as an NVMe error?

Not sure what you'd expect to see from Infinity Fabric instability, but the I/O die (which I think the SoC voltage supplies) has a number of dependents, including the memory controller, the PCIe bus and (some of) the USB ports. The primary M.2 slot (above the primary PCIe slot) is usually connected to the CPU, and the secondary one to the chipset.


5 hours ago, AbydosOne said:

This is an intriguing possibility... I reseated both NVMe drives. Both are Samsung PRO drives, so I'm not jumping to the conclusion the drives are dodgy yet.

Try updating the NVMe drivers.

 

Sometimes outdated drivers cause problems that magically go away after an update.

Note: Users receive notifications after Mentions & Quotes. 

Feel free: To ask any question, no matter what question it is, I will try to answer. I know a lot about PCs but not everything.

current PC:

Ryzen 5 5600 | 16GB DDR4 3200MHz | B450 | GTX 1080 Ti [further details on my profile]

PC configs I used before:

  1. Pentium G4500 | 4GB/8GB DDR4 2133MHz | H110 | GTX 1050
  2. Ryzen 3 1200 3.5GHz / OC: 4GHz | 8GB DDR4 2133MHz / 16GB 3200MHz | B450 | GTX 1050
  3. Ryzen 3 1200 3.5GHz | 16GB 3200MHz | B450 | GTX 1080 Ti

9 hours ago, AbydosOne said:

This is an intriguing possibility... I reseated both NVMe drives. Both are Samsung PRO drives, so I'm not jumping to the conclusion the drives are dodgy yet.

 

After more redditing, I bumped the SoC voltage up to 1.15V. It was stable for >1.2TB of data transfer at ~350MBps (combined) for several hours, so I guess we'll revisit this if/when it reoccurs. Could Infinity Fabric instability manifest as an NVMe error?

 

I'm half curious if just removing from AC (to move it for testing) was enough to reset something...

We can check whether you have any WHEA events in addition to the BSODs. The events would tell us which drive is involved, which could help with troubleshooting if the issue continues. Informational and warning events from WHEA (gray and yellow icons) are usually not helpful, but I'll take a look at them anyway.

Open Event Viewer → Windows Logs → System. On the right-hand side, select "Filter Current Log". In the window that pops up, find the Event sources dropdown menu and select "WHEA-Logger". Click OK to apply the filter. If you have any WHEA events, highlight them, right-click, and save them. Upload the .evtx file to the forum, or if you'd rather copy just the important bit, it's the RawData field in the Details tab.
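
If clicking through Event Viewer is a pain, the same filter can probably be applied from a script. This is only a rough sketch, not a tested procedure for this machine: it assumes the built-in `wevtutil` tool and assumes the provider is registered as `Microsoft-Windows-WHEA-Logger` (shown as "WHEA-Logger" in the dropdown). The helper names `whea_xpath`, `query_command` and `export_command`, and the output file name `whea.evtx`, are made up for illustration. It may need an elevated prompt on some systems.

```python
import subprocess
import sys

# Assumed provider name in the System log; Event Viewer lists it as "WHEA-Logger".
WHEA_PROVIDER = "Microsoft-Windows-WHEA-Logger"


def whea_xpath(provider: str = WHEA_PROVIDER) -> str:
    """Build an XPath filter that selects events from a single provider."""
    return f"*[System[Provider[@Name='{provider}']]]"


def query_command(max_events: int = 50) -> list[str]:
    """wevtutil command that prints the newest matching events as text."""
    return [
        "wevtutil", "qe", "System",
        f"/q:{whea_xpath()}",
        "/f:text",
        f"/c:{max_events}",
        "/rd:true",  # read newest events first
    ]


def export_command(out_path: str) -> list[str]:
    """wevtutil command that saves the matching events to an .evtx file."""
    return ["wevtutil", "epl", "System", out_path, f"/q:{whea_xpath()}"]


if __name__ == "__main__":
    if sys.platform == "win32":
        result = subprocess.run(query_command(), capture_output=True, text=True)
        print(result.stdout or "No WHEA-Logger events found.")
    else:
        # Not on Windows: just show the command that would be run.
        print(" ".join(query_command()))
```

Running `export_command("whea.evtx")` through `subprocess.run` in the same way should produce a file that can be uploaded here in place of the one saved manually from Event Viewer.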

Link to post
Share on other sites
