
HungryHamster

Member
  • Posts: 313

Posts posted by HungryHamster

  1. 34 minutes ago, Agall said:

    RTX 4090 really only cares for a lower TDP. Usually an RTX 4090 will be power and/or voltage limited, even when running full bore at 133% TDP. Lowering that down to even 60% doesn't net that much loss in performance, but it's a tipping point. Anything like 80-90% is almost negligible compared to stock in actual games.

     

    It's probably staying at its boost clock because it's not TDP or voltage limited, to answer your question.

    OK, thanks for the info. If I understand you correctly, the only way I will be able to get to 2510MHz (-100MHz from the default) would be to power limit the card? I would do that by setting the power target slider to maybe like 80%? I have already tried setting that to 80% before and it still crashed. I was mainly worried that something was causing my card to overclock too much (I thought 2610MHz was the max I should be seeing).

  2. 5 minutes ago, PDifolco said:

    There's no reason to cap clocks. If you just want more efficiency, you should undervolt and limit power.

    On my 3080 I managed to have it run at max 270W (nearly 20% less power) for less than 10% performance loss

    Sorry, maybe I wasn't clear enough in my post, but I want to underclock because someone recommended trying this to resolve an issue with my graphics card crashing. They recommended lowering it by 100MHz. It's just a test to see if it will still crash. I still don't know why it's running at 2820MHz and why moving the boost clock slider down by 130MHz has no effect.

  3. 2 minutes ago, PDifolco said:

    Undervolting shouldn't be static; use Afterburner to set a voltage/clock curve.

    Thanks for your response. So I actually need to mess with the voltage to get a lower clock speed?

     

    Also, do you know if it's normal for my card to be running at 2820MHz without any overclocking?

  4. I'm trying to troubleshoot an issue with my Asus ROG Strix RTX 4090 OC edition crashing and it was recommended I try to underclock it. I downloaded Asus GPU Tweak III and moved the "GPU Boost Clock" slider to the left to about -130 MHz yet I'm still getting clock speeds not only higher than that, but also about +200MHz higher than the default of 2610MHz. Can someone explain what is going on here and if this is normal? I'm worried something is causing my card to OC when I actually want to underclock it. 

    [Attached screenshot: GPUclock.jpg]

  5. On 6/27/2023 at 11:54 AM, Agall said:

    How's it wired to your Thor P2 1000W? Are you using the adapter or an internally adapted cable?

     

    Also have you tested the other vBIOS that you can toggle to with the Quiet or Performance mode switch on the top of the card?

    I tested the other vBIOS that you can toggle to with the Quiet or Performance mode switch on the top of the card and unfortunately I was still getting this error. Thanks for the suggestion, though. I've added it to the growing list of troubleshooting steps I've tried.

  6. 3 minutes ago, BetteBalterZen said:

    Copy that. Have you tried running for a while with an older driver?

    Yes, I have tried running on an older driver that was working for me before I started having this issue.

  7. 6 hours ago, Bjoolz said:

    The BIOS of the motherboard, not the VBIOS.

     

    13 hours ago, Bjoolz said:

    It's the GPU driver, but the reason for the crash sounds more software related. The crash error is "An attempt was made to release a semaphore such that its maximum count would have been exceeded." As you have already tried DDU, could you update the BIOS? The board has flashback so just keep a BIOS on a USB stick in case of it crashing during the update. 

    Thanks for analyzing those .dmp files for me. It does seem like the crash is more software related, but I've run DDU and even wiped and reinstalled everything with a clean Windows 11 install, and the problem still persists. It's really strange. I will take up your suggestion to update my mobo BIOS even though I have never done that before.

  8. 12 hours ago, BetteBalterZen said:

    I had the same issue with my PC on drivers 528.49 through 531.79. Somehow, after 531.79 the issue has stopped on my system, even though the nvlddmkm error and higher latency are still listed as current issues in the driver changes. Over the last couple of months you can find various people reporting PC crashes, CTDs, etc., with nvlddmkm errors in the Event Viewer. I have a feeling yours is simply a driver issue.

    Thanks for your response. Unfortunately, I am still having the issue after using DDU to uninstall the drivers and I have tried both installing new drivers and reinstalling older drivers that used to work for me. In fact, I only started having this problem after 531.79.

  9. 11 minutes ago, Bjoolz said:

    It's the GPU driver, but the reason for the crash sounds more software related. The crash error is "An attempt was made to release a semaphore such that its maximum count would have been exceeded." As you have already tried DDU, could you update the BIOS? The board has flashback so just keep a BIOS on a USB stick in case of it crashing during the update. 

    Thanks again for your help. I have already updated the BIOS of the video card and the problem still persists. Are you suggesting I try updating the motherboard BIOS? 

  10. 2 hours ago, Raging_Storm said:

    Currently dealing with the exact same issue to a T. Our specs are very similar: same brand mobo, same brand RAM, same GPU, same CPU. I can't for the life of me figure it out. I sent the first card in for RMA, they sent me a different one, and it has the same problem. No issues if I put my EVGA 2080 Ti FTW in, though. I'm baffled.

    Sorry to hear you are dealing with this issue too. It's really frustrating and from my research it seems like many other 4090 users have the error:

     

    https://www.overclock.net/threads/massive-rtx-4090-problems-driver-or-hardware.1801381/page-65

     

    If you haven't already, you might try some of the troubleshooting steps that I have already tried, because these are all things that people on other forums have said worked for them. Unfortunately, it's my understanding that this EventViewer error code/message basically just means something went wrong with the GPU and could be indicative of any number of issues. If you do find some kind of resolution, please do let me know.

  11. 22 minutes ago, Agall said:

    How's it wired to your Thor P2 1000W? Are you using the adapter or an internally adapted cable?

     

    Also have you tested the other vBIOS that you can toggle to with the Quiet or Performance mode switch on the top of the card?

    Thanks for your response. I'm not exactly sure what you mean by how it is wired. I'm using the "CableMod RT-Series Pro ModFlex Sleeved 12VHPWR Cable Kit for ASUS and Seasonic with the 16-pin to 4 x 8-pin PCI-e Cable." I have had no issues with this for 6 months.

     

    I have not tried to toggle that switch from performance to quiet mode, but I will try that. I was under the impression that only changed the fan profile.

     

     

  12. 25 minutes ago, emosun said:

    do we get to know what the specs or temps are or is that it?

    My PC specs are in the PCpartpicker link in my signature. I checked CPU and GPU temperature while gaming and both are normal (under 70C).  

  13. Hello,

    I think I am having issues with my GPU (Asus ROG Strix RTX 4090) crashing. What happens is that after playing a game for about 1-10 minutes my display goes black and my GPU fans go to 100%. EventViewer sometimes produces the following errors together:

     

    1.       “The description for Event ID 14 from source nvlddmkm cannot be found. Either the component that raises this event is not installed on your local computer or the installation is corrupted. You can install or repair the component on the local computer.

    If the event originated on another computer, the display information had to be saved with the event.

    The following information was included with the event:

    \Device\Video3

    0000(0000) 00000000 00000000

    Element not found”

    Note: I have also seen this error with “\Device\0000016d

     

    badfbadf(badfbadf) 00000000 00000000” listed instead.

     

     

    And

     

    2.       “The description for Event ID 0 from source nvlddmkm cannot be found. Either the component that raises this event is not installed on your local computer or the installation is corrupted. You can install or repair the component on the local computer.

    If the event originated on another computer, the display information had to be saved with the event.

    The following information was included with the event:

    \Device\Video3

    UCodeReset TDR occurred on GPUID:100

    Element not found”

     

    I have also seen this error in EventViewer:

    3.       "Display driver nvlddmkm stopped responding and has successfully recovered."

     

    A couple times a .dmp file has been created, but I don’t know how to analyze that or if it would be helpful.

    After researching this issue it appears that nvlddmkm errors are very generic and could be any number of things. Here are the troubleshooting steps I have already tried based on researching this issue:

     

    ·        Ran DDU in safe mode (with the internet disconnected), then installed either the latest driver or an older one.

    ·        Changed power management mode to “Prefer maximum performance.”

    ·        Changed Hardware accelerated GPU scheduling to off.

    ·        Completely uninstalled all third-party lighting controllers (Asus Armoury Crate, Corsair iCUE, Razer Synapse)

    ·        Tried using a DisplayPort cable instead of HDMI and also different HDMI ports.

    ·        Scanned SSDs with Samsung Magician. Both are in good health.

    ·        Checked CPU and GPU temperature while gaming and both are normal (under 70C).  

    ·        Disabled XMP profile for my RAM

    ·        Changed Link State Power Management to OFF

    ·        Changed ECC state to On

    ·        Updated the GPU BIOS

    ·        Clean reinstall of Windows 11

    ·        Power-limited the GPU to 80%

    ·        Flipped the switch on the graphics card that sets it to Quiet instead of Performance vBIOS profile 

    ·        Changed user permission of nvlddmkm.sys to full control

    ·        Disabled "Fast startup" for Windows 11

     

    While 90% of the time this crash occurs within the first 10 minutes of gaming, I have also had it occur a few times while I was away from my computer (with nothing running). Another time I was able to game for hours, and within 10 minutes of stopping and just browsing the internet, the same crash occurred. One time it occurred right after booting into Windows.

     

    I have had this PC for over 6 months and to the best of my knowledge my issues started only after updating to the latest version of ASUS Armoury Crate about 1 month ago. Since then, the issue has gone from occurring maybe once a week to every day.   

     

    This community has always been a huge help to me, and I would really appreciate your thoughts or suggestions. I'm really desperate for help.

    Thanks in advance!
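
For anyone triaging a pile of these entries, here is a minimal Python sketch (not part of the troubleshooting above) that buckets Event Viewer messages by the three nvlddmkm signatures quoted in this post. It assumes the messages have already been exported from Event Viewer as plain strings. The bucket labels and the function names `classify` and `summarize` are hypothetical, chosen only for illustration.

```python
import re
from collections import Counter

# Hypothetical bucket labels. The patterns are copied from the messages
# quoted in the post. Order matters: the "Event ID 0" entry also contains
# the UCodeReset TDR text, so the more specific TDR pattern is checked first.
PATTERNS = [
    ("tdr_reset", re.compile(r"UCodeReset TDR occurred")),
    ("driver_recovered", re.compile(r"stopped responding and has successfully recovered")),
    ("event_14", re.compile(r"Event ID 14 from source nvlddmkm")),
    ("event_0", re.compile(r"Event ID 0 from source nvlddmkm")),
]


def classify(message: str) -> str:
    """Return the label of the first pattern found in the message, else 'other'."""
    for label, pattern in PATTERNS:
        if pattern.search(message):
            return label
    return "other"


def summarize(messages) -> Counter:
    """Count how many messages fall into each bucket."""
    return Counter(classify(m) for m in messages)
```

Calling `summarize` on a list of exported messages gives a quick count per signature, which may help show whether the crashes are mostly TDR resets or the generic Event ID 14 entries. It does not diagnose the cause.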

  14. 19 hours ago, johnno23 said:

    Without some info on what event viewer was showing no one has any idea what to look for?

    just copy and paste the errors into a post so that people can then try to assist you.

     

    Just guessing right now but try removing armoury crate and see if the issue persists ?

    I completely uninstalled Armoury Crate and that seems to have fixed the issue. No crashes since I uninstalled it. If for some reason it starts happening again, I will definitely try the other things you recommended. Thanks again for your help.

  15. 2 minutes ago, johnno23 said:

    Without some info on what event viewer was showing no one has any idea what to look for?

    just copy and paste the errors into a post so that people can then try to assist you.

     

    Just guessing right now but try removing armoury crate and see if the issue persists ?

    Thanks for your response. Should I be looking in system events or application events?

     

    It looks like I got this error event in system:

     

    The description for Event ID 14 from source nvlddmkm cannot be found. Either the component that raises this event is not installed on your local computer or the installation is corrupted. You can install or repair the component on the local computer.

     

    If the event originated on another computer, the display information had to be saved with the event.

     

    The following information was included with the event:

    \Device\Video3
    CMDre 0000004c 00003ffc ffffffff 00000007 ffffffff

     

     

     

  16. Problem:

     

    - Starting today my PC will disconnect from DisplayPort and the fans on my gfx card will go to 100%. I then have to reboot. This usually happens 1 minute after gaming or after about 20 mins of watching video.

     

    I hadn't changed anything before this started happening, except for updating my ASUS Armoury Crate to the newest version yesterday, which went through without issue. Since this issue started happening today I have tried updating Windows and gfx card drivers.

     

    I'm not sure what the issue could be. It seems like probably something with the gfx card? I'm looking in Event Viewer and have found a few things that happen before the crash, but I'm not sure which one I'm looking for.

     

    Any help would be greatly appreciated!
