Jump to content

Hello,

 

I am experiencing a large amount of hardware errors in Windows 11 as of today, which has caused the operating system to lock up twice today displaying an error message that Windows isn't responding and force closing will cause data loss. It's not a complete lock up, the taskbar and cursor still works but the usable space above the taskbar goes completely black or white and nothing is displayed with the cursor displaying an hourglass. The first "lock up" occured when I opened the widgets panel after updating NVIDIA drivers and It became unresponsive even though I could interact with the taskbar, and the second during a Twitch stream on chrome (same story, could interact but cant see anything and "not responding" message from windows itself). BTW I was updating the GPU drivers because of the errors, hoping that it would fix it but it of course didn't.

 

When I enter reliability history located in Control Panel, I can see a total of six errors that occured today, and looking back 4 weeks there are occasional errors (up to 3 that I can see) spread around fairly consistently but not six in one day. I initially thought that these errors was caused by MSI afterburner, so I stopped using it temporarily and no errors for six days, until today. I also suspect that a couple of the errors didn't get reported as just now I encountered another lock up at 10:20ish pm and its currently 10:38 pm as of writing. When I enter the advanced view in the errors they are all listed as "Event Name: LiveKernelEvent" and all of them have one of the following codes: 1a8, 1b8, 141, 193, 117.

 

Screenshot2023-05-10223222.thumb.png.a4f4bc9158555d372c8d3d522e7f5b72.png

The system is entirely stock except for XMP (DDR5, 5200) and an undervolt on my CPU (13700k, -60mv offset + MSI lite load 8), but when I look up these codes, I can find some information online suggesting they are GPU (RTX 3070 Ti) related but I cannot be sure. For the mean time I have reverted CPU to stock voltage even though I am almost certain that it is not the culprit as there are no WHEA errors, and the system never experienced a BSOD or complete freeze/stutter. I also ran a few XTU stress tests for around 45 mins and the CPU was completely fine.

 

I have spent the whole day troubleshooting, reinstalling chipset and network drivers, validating integrity of Windows files (DISM, SFC), install of new Nvidia driver (531.79), manual reinstall of said Nvidia driver, multiple windows diagnostics, switched to GPU secondary BIOS and none of this has worked. The only recent change today to the system is iCUE 5 downloaded and installed itself, and since then I have experienced 6-8 of these errors, but looking back in reliability history I can see that over the past four weeks there has been between 1 and 3 errors roughly anywhere between 2-4 days apart. Although, none of them caused the system to hang up, and yes they are all the same error codes as listed above.

 

I don't know where to go from here and I just need my system usable because right now it's heavily unreliable. I still don't know what the exact problem is, if it's the GPU itself, one of the drivers or Windows/Windows setting. I would appreciate any help and will reply as soon as I can, thanks.

 

Hardware:

13700k, Stock frequencies

2x16gb DDR5, XMP 5200mhz

MSI Pro z790-A DDR5

Gigabyte GeForce RTX 3070 Ti Gaming OC 8gb

Link to post
Share on other sites

Update:

 

I have since been troubleshooting further and still no hope. I have swapped out the GPU for another Nvidia one and still getting errors, so that at least rules out the GPU itself, aslo tested with iGPU and there were no errors but I only was using it for a few hours so not really sure. I also did a clean reinstall of the Nvidia drivers using DDU in safe mode and I initially thought this worked as it was fine for a few hours, but just got what felt like another error, the driver reset itself causing screen to flash black temporarily but there are no logs in reliability history or event viewer. This happened just after I logged in after turning the PC on. Also forgot to mention when an error is about to occur the mouse cursor starts lagging behind and some UI elements/programs freeze up, but the keyboard still works and audio is still functional. The mouse also lags quite significantly when GeForce Experience is open in the foreground.

Link to post
Share on other sites

Create an account or sign in to comment

You need to be a member in order to leave a comment

Create an account

Sign up for a new account in our community. It's easy!

Register a new account

Sign in

Already have an account? Sign in here.

Sign In Now

×