Jump to content

My 2080 ti is showing it's age... maybe?

MuchachoSal

Howdy all, I've had an ASUS 2080 ti for about two years (this one: https://www.amazon.com/gp/product/B07GJYS2WM/ref=ppx_yo_dt_b_search_asin_title?ie=UTF8&psc=1) and for more or less, it's worked out pretty well.   I had some fan trouble back in July that caused some massive throttling but I was able to use some machine oil to grease the bearings - everything went well after that.

 

Now, I have what sort of seems to be a new problem that's only crept up within the last three weeks, although I wouldn't be surprised if it's related to the fans.   In short - the card now crashes when it's "pushed to the limit".   Games like Overwatch, Assassin's Creed Odyssey, Horizon Zero Dawn, Control, etc. - all games that can push the limit - crash around the time that the card reaches 69-70c.  In the past, I've been able to run them at 1440p (~60 fps, whatever) with max settings on each without much problem, but anytime I do that now, the games crash pretty quickly after starting up - the renderer crashing is the error that I get almost 100% of the time now.   


In order to prevent crashing, I have to force a 60 fps limit and also dial down the internal resolution to around 1080p.  I also have to use MSI to crank up the fans to 100%.  Even then, I still get some crashes.  I've reloaded windows, I've gone to previous Nvidia drivers, etc.   Same problems.   It's really perplexing.     

 

The 1.png screenshot shows GPU-Z while running Horizon Zero Dawn with 1152p resolution.  The black box at the end is when the game crashes.  I noticed that the sole Perf Cap reason switches to Pwr (Green) before the crash, which is something that I saw similarly when I had my fan problem back in July.  Also - and this is hard to tell from the screenshot, so I've attached the log -  Fan 2's speed, just right before the crash, drops about 2.5% in relation to Fan 1's speed.   When the speed (slightly) drops, the temp and gpu load both go up to the point where the crash takes place (around 16:59:11).

 

So, something in my card is basically saying "nope, no can do".    My fans *seem* to be fine.   I'd be open to getting a new cooling system for my card but I honestly wouldn't know how to pull that off without one of Linus' hacks, and even then I'm not confident it would fix the real problem. Any ideas here?  Need any more info?

 

Thanks in advance!

 

1.png

GPU-Z Sensor Log.txt

Link to comment
Share on other sites

Link to post
Share on other sites

32 minutes ago, Grabhanem said:

Have you tried something like Furmark to isolate the problem to GPU load?

i wouldn't recommend this, it might push his gpu off a cliff, if he cant game normally after a ddu or windows install that's enough evidence imho.

 

@MuchachoSal if it's related more to temps and not load, u can try taping or clipping on a big 3000rpm 120mm fan (on top of the faulty fan) and pull air from it to prevent it from ever getting to 70C (janky but cheap solution, squeezed an extra year out of my 970 with very low temps), if that doesn't work, time to hit up warranty 

5950x 1.33v 5.05 4.5 88C 195w ll R20 12k ll drp4 ll x570 dark hero ll gskill 4x8gb 3666 14-14-14-32-320-24-2T (zen trfc)  1.45v 45C 1.15v soc ll 6950xt gaming x trio 325w 60C ll samsung 970 500gb nvme os ll sandisk 4tb ssd ll 6x nf12/14 ippc fans ll tt gt10 case ll evga g2 1300w ll w10 pro ll 34GN850B ll AW3423DW

 

9900k 1.36v 5.1avx 4.9ring 85C 195w (daily) 1.02v 4.3ghz 80w 50C R20 temps score=5500 ll D15 ll Z390 taichi ult 1.60 bios ll gskill 4x8gb 14-14-14-30-280-20 ddr3666bdie 1.45v 45C 1.22sa/1.18 io  ll EVGA 30 non90 tie ftw3 1920//10000 0.85v 300w 71C ll  6x nf14 ippc 2000rpm ll 500gb nvme 970 evo ll l sandisk 4tb sata ssd +4tb exssd backup ll 2x 500gb samsung 970 evo raid 0 llCorsair graphite 780T ll EVGA P2 1200w ll w10p ll NEC PA241w ll pa32ucg-k

 

prebuilt 5800 stock ll 2x8gb ddr4 cl17 3466 ll oem 3080 0.85v 1890//10000 290w 74C ll 27gl850b ll pa272w ll w11

 

Link to comment
Share on other sites

Link to post
Share on other sites

Two years, eh? Have taken the card out to look at the heatsink? I mean after two years, the heatsink may have accumulated dust or dust bunnies that may impede proper cooling. 

Main Rig: AMD AM4 R9 5900X (12C/24T) + Tt Water 3.0 ARGB 360 AIO | Gigabyte X570 Aorus Xtreme | 2x 16GB Corsair Vengeance DDR4 3600C16 | XFX MERC 310 RX 7900 XTX | 256GB Sabrent Rocket NVMe M.2 PCIe Gen 3.0 (OS) | 4TB Lexar NM790 NVMe M.2 PCIe4x4 | 2TB TG Cardea Zero Z440 NVMe M.2 PCIe Gen4x4 | 4TB Samsung 860 EVO SATA SSD | 2TB Samsung 860 QVO SATA SSD | 6TB WD Black HDD | CoolerMaster H500M | Corsair HX1000 Platinum | Topre Type Heaven + Seenda Ergonomic W/L Vertical Mouse + 8BitDo Ultimate 2.4G | iFi Micro iDSD Black Label | Philips Fidelio B97 | C49HG90DME 49" 32:9 144Hz Freesync 2 | Omnidesk Pro 2020 48" | 64bit Win11 Pro 23H2

2nd Rig: AMD AM4 R9 3900X + TR PA 120 SE | Gigabyte X570S Aorus Elite AX | 2x 16GB Patriot Viper Elite II DDR4 4000MHz | Sapphire Nitro+ RX 6900 XT | 500GB Crucial P2 Plus NVMe M.2 PCIe Gen 4.0 (OS)2TB Adata Legend 850 NVMe M.2 PCIe Gen4x4 |  2TB Kingston NV2 NVMe M.2 PCIe Gen4x4 | 4TB Leven JS600 SATA SSD | 2TB Seagate HDD | Keychron K2 + Logitech G703 | SOLDAM XR-1 Black Knight | Enermax MAXREVO 1500 | 64bit Win11 Pro 23H2

 

 

 

 

 

 

Link to comment
Share on other sites

Link to post
Share on other sites

So, a few more things:

 

1. I know the CPU isn't the problem.  Last week, I removed the 19% overclock I had on my 10600k and I ended up having the same problem.

2. I just ran Furmark. (before seeing xg32's post, Oh well :))  The first time I ran a 1440p benchmark it crashed at 80% finished.  The second time it didn't crash at all 🤷‍♂️  I've never actually ran Furmark before.  Reminds me of the the Eye of Sauron, but a cuddly Eye of Sauron.  Interestingly enough the GPU temps didn't get that high before the crash took place. 

3. I'll look at the heatsink - it might need some TLC.  Like I mentioned earlier, I did fix one of the fans so I have done at least some operations before. 

4. And sure, I'll see what I can do about pulling more air via another fan.  And then warranty as a last resort.  Heck, I'd even buy a 3080 if there was one available at retail price right now. 🤦‍♂️

 

Thanks for the tips all!

Link to comment
Share on other sites

Link to post
Share on other sites

2 hours ago, MuchachoSal said:

So, a few more things:

 

Heck, I'd even buy a 3080 if there was one available at retail price right now. 🤦‍♂️

Good luck with that!!! 😈

Main Rig: AMD AM4 R9 5900X (12C/24T) + Tt Water 3.0 ARGB 360 AIO | Gigabyte X570 Aorus Xtreme | 2x 16GB Corsair Vengeance DDR4 3600C16 | XFX MERC 310 RX 7900 XTX | 256GB Sabrent Rocket NVMe M.2 PCIe Gen 3.0 (OS) | 4TB Lexar NM790 NVMe M.2 PCIe4x4 | 2TB TG Cardea Zero Z440 NVMe M.2 PCIe Gen4x4 | 4TB Samsung 860 EVO SATA SSD | 2TB Samsung 860 QVO SATA SSD | 6TB WD Black HDD | CoolerMaster H500M | Corsair HX1000 Platinum | Topre Type Heaven + Seenda Ergonomic W/L Vertical Mouse + 8BitDo Ultimate 2.4G | iFi Micro iDSD Black Label | Philips Fidelio B97 | C49HG90DME 49" 32:9 144Hz Freesync 2 | Omnidesk Pro 2020 48" | 64bit Win11 Pro 23H2

2nd Rig: AMD AM4 R9 3900X + TR PA 120 SE | Gigabyte X570S Aorus Elite AX | 2x 16GB Patriot Viper Elite II DDR4 4000MHz | Sapphire Nitro+ RX 6900 XT | 500GB Crucial P2 Plus NVMe M.2 PCIe Gen 4.0 (OS)2TB Adata Legend 850 NVMe M.2 PCIe Gen4x4 |  2TB Kingston NV2 NVMe M.2 PCIe Gen4x4 | 4TB Leven JS600 SATA SSD | 2TB Seagate HDD | Keychron K2 + Logitech G703 | SOLDAM XR-1 Black Knight | Enermax MAXREVO 1500 | 64bit Win11 Pro 23H2

 

 

 

 

 

 

Link to comment
Share on other sites

Link to post
Share on other sites

Sounds to me like memory or some power component overheating.

 

As a last case I’d try the card in a different PC or a different power supply. Might even be your RAM and not the graphics card at all. 
 

I’d just send it in for warranty repair if it still has one. Clearly a manufacturing defect since the card is so new still. 

That's an F in the profile pic

 

 

Link to comment
Share on other sites

Link to post
Share on other sites

Ddu reinstall

msi afterburner custom fan curve 

max power and voltage slide and dial back ur normal oc by 20mhz unless ur already at stock 

give that a go 

 

keep eye on temps ect

-13600kf 

- 4000 32gb ram 

-4070ti super duper 

Link to comment
Share on other sites

Link to post
Share on other sites

Remove one memory stick, do the test and see what happens. Then use the stick you've just removed by itself on another test run. Report. This will troubleshoot the RAM in case that's the problem. Also, what PSU are you using, and for how long have you used it?

Link to comment
Share on other sites

Link to post
Share on other sites

Normally I'd say RMA the card, but I wouldn't be surprised if throwing machine oil at it voided the warranty. I'd start DDU'ing the drivers, then I'd check for problems with the RAM and the PSU, since I don't see crazy temperatures in that screenshot. If nothing works you can try to throw an aftermarket cooler on there. 

For example:

https://www.arctic.de/en/Accelero-Xtreme-IV/DCACO-V800001-GBA01

PSU tier list // Motherboard tier list // Community Standards 

My System:

Spoiler

AMD Ryzen 5 3600, Gigabyte RTX 3060TI Gaming OC ProFractal Design Meshify C TG, 2x8GB G.Skill Ripjaws V 3200MHz, MSI B450 Gaming Plus MaxSamsung 850 EVO 512GB, 2TB WD BlueCorsair RM850x, LG 27GL83A-B

Link to comment
Share on other sites

Link to post
Share on other sites

Create an account or sign in to comment

You need to be a member in order to leave a comment

Create an account

Sign up for a new account in our community. It's easy!

Register a new account

Sign in

Already have an account? Sign in here.

Sign In Now

×