Jump to content

Hard Resetting - Kernel Power

Go to solution Solved by Bjoolz,
1 hour ago, mrtherm said:

Here we go,

 

A fatal hardware error has occurred.

Reported by component: Processor Core
Error Source: Machine Check Exception
Error Type: Cache Hierarchy Error
Processor APIC ID: 7 (and a 6 some of them)

mrtherm-filtered-whea.evtx 68 kB · 0 downloads

The info in the general tab is what reported the error, not what caused the error. To see what caused the error you have to decode the error packet in the RawData fields using the UEFI documentation.

 

The CPU is ordering the shutdowns. These show up as "Previous Error", meaning it's not from the actual error, just a notification that it shut down the PC from the previous error. We can't see the original error with these issues. 90% of the time it's the CPU, but it can also be a PCIe device or even the motherboard is the CPU/PCIe section is faulty.

 

We have seen a lot of voltage issues on 5000 series so we could try setting a static voltage which has fixed a lot of 5900x and 5950x CPUs that have had lots of different CPU related crashes. In the BIOS, set a static voltage to the CPU of 1.3v and to the SoC set a static voltage of 1.1v (CPU Vcore and VCORE SOC on this motherboard). 

Alright folks i need some help trying to identify a very annoying problem.

 

The Problem:

- Hard PC reset with no warnings or BSOD

 

Occurs:

- Heavy games - Palworld & Red Dead Redemption 2

- Does not do it on Destiny 2, BeamNG Drive

- Has done it randomly on idle

- Hard to manually trigger the Problem.

 

Troubleshooting I've Done

- turned off Automatic Restart to try see info

- Memtest & Windows Memory Check Passed

- Brand new PSU

- Stress test GPU & CPU

- Hard drive checks

- Updated & Reset BIOS

- Fresh Windows Install

- I've unplugged an RGB strip and turned off 1 of 4 Noctua case fans (in case there's some sort of power limit reach?)

- Stock clocks on all hardware

- Thermals all seem fine. CPU max temps (celsius) ~80 / GPU max ~70

 

Event Viewer

- Critical Kernel Power error, Incorrectly shut down etc.

 

PC Specs

Ryzen 5900X

Gigabyte X570 AORUS PRO WIFI

MSI RTX 2080 Super

Noctua NH-U12S

2x Corsair Vengeance (4x16GB) 3200MHz CL16 DDR4
CORSAIR RM850x

Focusrite Scarlett 2i4 USB Interface

 

My Thoughts

- at first i thought a dodgy capacitor on the PSU under load, brand new PSU this weekend did the same thing.

- Could it be dodgy VRAM? How can i run a test for that?

- will reseat cpu cooler & refresh thermal paste (my understanding is a thermal shudown, would shutdown, not restart)

- Would a dying usb audio interface trigger a hard reset?

 

I'm open to ideas on how to identify this.

Link to comment
https://linustechtips.com/topic/1614737-hard-resetting-kernel-power/
Share on other sites

Link to post
Share on other sites

Open Event Viewer → Windows Logs → System. On the right hand side, select "Filter Current Log". In the new window that pops up find the Event Sources dropdown menu and select "WHEA-logger". Click Ok to apply the filter. If you have any WHEA events, highlight them, right click and save. Upload the .evtx file to the forum.

 

If you don't have any WHEA events, the main suspect would be power. 

Link to post
Share on other sites

PSU test in OCCT? Maybe a combined CPU load + GPU load like Core Cycler or Prime95 combined with Furmark?

Doesn't crash under Furmark? How about 3Dmark Stability test?

Normal 3DMark scores?

Any undervolts or overclocks applied?

Triggering VRM protections can cause both shutdowns or hard resets without a BSOD.

Latest SSD firmware? SSD temperatures?

M.S.C.E. (M.Sc. Computer Engineering), IT specialist in a hospital, 32+ years of gaming, 20+ years of computer enthusiasm, Geek, Trekkie, anime fan

  • Main PC: AMD Ryzen 7 5800X3D - EK AIO 360 D-RGB - Arctic Cooling MX-4 - Asus Prime X570-P - 4x8GB DDR4 3200 HyperX Fury CL16 - Sapphire AMD Radeon 6950XT Nitro+ - 1TB Kingston Fury Renegade - 2TB Kingston Fury Renegade - 512GB ADATA SU800 - 960GB Kingston A400 - Seasonic PX-850 850W  - custom black ATX and EPS cables - Fractal Design Define R5 Blackout - Windows 11 x64 23H2 - 3 Arctic Cooling P14 PWM PST - 5 Arctic Cooling P12 PWM PST
  • Peripherals: LG 32GK650F - Dell P2319h - Logitech G Pro X Superlight with Tiger Ice - Madlions MAD 68HE Pro - EndGame Gear MPC890 - Genius HF 1250B - Akliam PD4 - Sennheiser HD 560s - Tripowin Vivace - Simgot EM6L - Truthear Zero - QKZ x HBB - 7Hz Salnotes Zero - Logitech C270 - Behringer PS400 - BM700  - Colormunki Smile - Speedlink Torid - Jysk Stenderup - LG 24x External DVD writer - Konig smart card reader
  • Laptop: Acer E5–575G-386R 15.6" 1080p (i3 6100U + 12GB DDR4 (4GB+8GB) + GeForce 940MX + 256GB nVME) Win 10 Pro x64 22H2 - Logitech G305 + AAA Lithium battery
  • Networking: Asus TUF Gaming AX6000 - Huawei OptiXstar EG8145X6-10 - 1000/500 Mbps fiber optic Internet access
  • TV and gadgets: TCL 50EP680 50" 4K LED + Sharp HT-SB100 75W RMS soundbar - Samsung Galaxy Tab A8 10.1" - OnePlus 13 16GB/512GB - OnePlus 9 256GB - Olymous Cameda C-160 - GameBoy Color - Miyoo A30 Spruce
  • Streaming/Server/Storage PC: AMD Ryzen 5 3600 - LC-Power LC-CC-120 - MSI B450 Tomahawk Max - 2x4GB ADATA 2666 DDR4 - 120GB Kingston V300 - Toshiba DT01ACA100 1TB - Toshiba DT01ACA200 2TB - 2x WD Green 2TB - Sapphire Pulse AMD Radeon R9 380X - 550W EVGA G3 SuperNova - Chieftec Giga DF-01B - White Shark Spartan X keyboard - Roccat Kone Pure Military Desert strike - Logitech S-220 - Philips 226L
  • Livingroom PC (dad uses): AMD FX 8300 - Arctic Freezer 64 - Asus M5A97 R2.0 Evo - 2x4 GB DDR3 1833 Kingston - MSI Radeon HD 7770 1GB OC - 120GB Adata SSD - 500W Fractal Design Essence - DVD-RW - Samsung SM 2253BW - Logitech G710+ - wireless vertical mouse - MS 2.0 speakers
Link to post
Share on other sites

8 hours ago, Bjoolz said:

Open Event Viewer → Windows Logs → System. On the right hand side, select "Filter Current Log". In the new window that pops up find the Event Sources dropdown menu and select "WHEA-logger". Click Ok to apply the filter. If you have any WHEA events, highlight them, right click and save. Upload the .evtx file to the forum.

 

If you don't have any WHEA events, the main suspect would be power. 

Here we go,

 

A fatal hardware error has occurred.

Reported by component: Processor Core
Error Source: Machine Check Exception
Error Type: Cache Hierarchy Error
Processor APIC ID: 7 (and a 6 some of them)

mrtherm-filtered-whea.evtx

Link to post
Share on other sites

1 hour ago, mrtherm said:

Here we go,

 

A fatal hardware error has occurred.

Reported by component: Processor Core
Error Source: Machine Check Exception
Error Type: Cache Hierarchy Error
Processor APIC ID: 7 (and a 6 some of them)

mrtherm-filtered-whea.evtx 68 kB · 0 downloads

The info in the general tab is what reported the error, not what caused the error. To see what caused the error you have to decode the error packet in the RawData fields using the UEFI documentation.

 

The CPU is ordering the shutdowns. These show up as "Previous Error", meaning it's not from the actual error, just a notification that it shut down the PC from the previous error. We can't see the original error with these issues. 90% of the time it's the CPU, but it can also be a PCIe device or even the motherboard is the CPU/PCIe section is faulty.

 

We have seen a lot of voltage issues on 5000 series so we could try setting a static voltage which has fixed a lot of 5900x and 5950x CPUs that have had lots of different CPU related crashes. In the BIOS, set a static voltage to the CPU of 1.3v and to the SoC set a static voltage of 1.1v (CPU Vcore and VCORE SOC on this motherboard). 

Link to post
Share on other sites

1 hour ago, Bjoolz said:

The info in the general tab is what reported the error, not what caused the error. To see what caused the error you have to decode the error packet in the RawData fields using the UEFI documentation.

 

The CPU is ordering the shutdowns. These show up as "Previous Error", meaning it's not from the actual error, just a notification that it shut down the PC from the previous error. We can't see the original error with these issues. 90% of the time it's the CPU, but it can also be a PCIe device or even the motherboard is the CPU/PCIe section is faulty.

 

We have seen a lot of voltage issues on 5000 series so we could try setting a static voltage which has fixed a lot of 5900x and 5950x CPUs that have had lots of different CPU related crashes. In the BIOS, set a static voltage to the CPU of 1.3v and to the SoC set a static voltage of 1.1v (CPU Vcore and VCORE SOC on this motherboard). 

Tried those values and failed on boot. Had to load optimized defaults.

Link to post
Share on other sites

16 minutes ago, mrtherm said:

Tried those values and failed on boot. Had to load optimized defaults.

I don't have any suggestions then. I'm not into overclocking or anything so messing with voltages is a bit outside my wheelhouse.

 

I did see that I forgot telling you to make sure that Precision Boost Overdrive (PBO) is set as Disabled in the BIOS. You could also try the voltage tweak again after disabling it if it was enabled. 

Link to post
Share on other sites

1 minute ago, Bjoolz said:

I don't have any suggestions then. I'm not into overclocking or anything so messing with voltages is a bit outside my wheelhouse.

 

I did see that I forgot telling you to make sure that Precision Boost Overdrive (PBO) is set as Disabled in the BIOS. You could also try the voltage tweak again after disabling it if it was enabled. 

yeah will do, thanks anyway.

It seems like a very annoying issue and it's very random.

Link to post
Share on other sites

4 hours ago, mrtherm said:

yeah will do, thanks anyway.

It seems like a very annoying issue and it's very random.

And just to clarify, the main suspect from the get-go was a faulty CPU so this was just for trying to bring it back into stability. So if nothing works, the CPU is likely faulty. Though it could be a PCIe device (And remember that most components on the motherboard connect over PCIe internally. Audio, network, storage controllers, etc).

Link to post
Share on other sites

22 hours ago, mrtherm said:

Error Source: Machine Check Exception
Error Type: Cache Hierarchy Error

I once had this kind of self restart error too, I blamed my then new Ryzen 3600 to the point of RMA-ing it. It still happened with the replacement CPU.

 

The ultimate culprit turned out to be my GPU at the time, an RX 5600 XT bought used. See if you or a friend has a GPU to spare, swap out your current card, see if it still restarts.

PhiLia093

 

CPU: AMD Ryzen 5 3600 3.6 GHz 6-Core Processor
CPU Cooler: Deepcool GAMMAXX 400 V2 64.5 CFM CPU Cooler
Motherboard: ASRock B450M Steel Legend Micro ATX AM4 Motherboard, BIOS P4.60
Memory: ADATA XPG 32GB GB (2 x 16GB) DDR4-3200 CL16 Memory
Storage: HP EX900 500 GB M.2-2280 PCIe 3.0 X4 NVME Solid State Drive, PNY CS900 1 TB 2.5" Solid State Drive
Video Card: Colorful iGame RTX 4060 Ti 16GB
Power Supply: Cooler Master MWE Bronze V2 650 W 80+ Bronze Certified ATX Power Supply
Operating System: Microsoft Windows 11 Pro
Monitor: Acer QG240Y S3 24.0" 1920 x 1080 180Hz Monitor

Headphones: Superlux HD668B

IEMs: Moondrop Chu 2, Tangzu Wan'er Studio Edition

Speakers: Logitech Z313

Phone: Poco F6

Link to post
Share on other sites

On 6/12/2025 at 6:26 PM, emothxughts said:

I once had this kind of self restart error too, I blamed my then new Ryzen 3600 to the point of RMA-ing it. It still happened with the replacement CPU.

 

The ultimate culprit turned out to be my GPU at the time, an RX 5600 XT bought used. See if you or a friend has a GPU to spare, swap out your current card, see if it still restarts.

I had a Ryzen 2600X that i'm currently testing with.

Backup GPU is ready as well.

 

Will pretty much swap out as many hardware i have backup to accurately figure it out.

So far hasn't crashed. Will keep testing/running games.

 

On a side note, still getting this with the changed CPU??

"The device driver for the Trusted Platform Module (TPM) encountered a non-recoverable error in the TPM hardware, which prevents TPM services (such as data encryption) from being used. For further help, please contact the computer manufacturer."

Link to post
Share on other sites

Create an account or sign in to comment

You need to be a member in order to leave a comment

Create an account

Sign up for a new account in our community. It's easy!

Register a new account

Sign in

Already have an account? Sign in here.

Sign In Now

×