Jump to content

There is no 970 'memory bug'

The card does have 4gb vram, and it does have a real 256 bit memory bus, its just that with the cut down sm's, the 970 cannot access all 4 gb of ram as efficiently as the full 980 chip, and this isn't really suprising, since the gtx 970 is a cut down version of the gtx 980...

 

 

 

The GeForce GTX 970 is equipped with 4GB of dedicated graphics memory. However the 970 has a different configuration of SMs than the 980, and fewer crossbar resources to the memory system. To optimally manage memory traffic in this configuration, we segment graphics memory into a 3.5GB section and a 0.5GB section. The GPU has higher priority access to the 3.5GB section. When a game needs less than 3.5GB of video memory per draw command then it will only access the first partition, and 3rd party applications that measure memory usage will report 3.5GB of memory in use on GTX 970, but may report more for GTX 980 if there is more memory used by other commands. When a game requires more than 3.5GB of memory then we use both segments.

 

Source: http://www.pcper.com/news/Graphics-Cards/NVIDIA-Responds-GTX-970-35GB-Memory-Issue

 

 

 

 

tl;dr/captain obvious

cut down version of gtx 980 doesn't perform as well as gtx 980

Link to comment
https://linustechtips.com/topic/297691-there-is-no-970-memory-bug/
Share on other sites

Link to post
Share on other sites

PSU Tier List | CoC

Gaming Build | FreeNAS Server

Spoiler

i5-4690k || Seidon 240m || GTX780 ACX || MSI Z97s SLI Plus || 8GB 2400mhz || 250GB 840 Evo || 1TB WD Blue || H440 (Black/Blue) || Windows 10 Pro || Dell P2414H & BenQ XL2411Z || Ducky Shine Mini || Logitech G502 Proteus Core

Spoiler

FreeNAS 9.3 - Stable || Xeon E3 1230v2 || Supermicro X9SCM-F || 32GB Crucial ECC DDR3 || 3x4TB WD Red (JBOD) || SYBA SI-PEX40064 sata controller || Corsair CX500m || NZXT Source 210.

Link to post
Share on other sites

Sounds... as a very well made excuse. If that WAS the reason, they'd have replied that right away.

Want to help researchers improve the lives on millions of people with just your computer? Then join World Community Grid distributed computing, and start helping the world to solve it's most difficult problems!

 

Link to post
Share on other sites

Sorry but I am not letting Nvidia off the hook for this one.  The GTX 970 should be using all of the 4 GB of memory when you need it and when people with some cards can't do it they deserve a refund.

Too many ****ing games!  Back log 4 life! :S

Link to post
Share on other sites

Sorry but I am not letting Nvidia off the hook for this one.  The GTX 970 should be using all of the 4 GB of memory when you need it and when people with some cards can't do it they deserve a refund.

Then ask both AMD and NVIDIA for refunds for 256-bit 4GB cards in the past not using all of the VRAM due to bus limitations?

 

I don't disagree with you, but people didn't flip their shit so much over that back then like they are over this now.

Main Rig: CPU: AMD Ryzen 7 5700X3D | RAM: 32GB (2x16GB) KLEVV CRAS XR RGB DDR4-3600 | Motherboard: Gigabyte B550I AORUS PRO AX | Storage: 500GB Crucial P3 Plus, 4TB Silicon Power UD90 | GPU: AsRock Radeon RX 9070 XT Steel Legend | Cooling: ThermalTake Floe 280mm w/ be quiet! Pure Wings 3 | Case: Sliger SM580 (Black) | PSU: Corsair SF850

Main Server: CPU: AMD Ryzen 9 5950X | RAM: 64GB (2x32GB) Corsair Vengeance LPX DDR4-3200 | Motherboard: ASUS Crosshair VII Hero WiFi | Storage: 512GB SKHynix NVMe | GPUs: NVIDIA TITAN Xp 2-way SLI | Cooling: Thermalright Frozen Prism 360mm | Case: Corsair 5000D Airflow (White) | PSU: Seasonic Focus GM850

File and Media Server (AOOSTAR WTR Pro): CPU: AMD Ryzen 7 5825U | RAM: 32GB (2x16GB) Silicon Power DDR4-3200 SODIMMs | Storage: 1TB Samsung 970 EVO Plus, 2x14TB Western Digital Ultrastar DC HC530

Link to post
Share on other sites

Sorry but I am not letting Nvidia off the hook for this one.  The GTX 970 should be using all of the 4 GB of memory when you need it and when people with some cards can't do it they deserve a refund.

Nothing's changed since the card's release. the card performs just as well as indicated in reviews and benchmarks, and the card does legitimately have 4gb of vram, and the card can utilize all 4gb of vram. there is nothing to refund. This would be far from the only video card with some kind of performance bottleneck somehwere.

Link to post
Share on other sites

This will likely be Nvidias line, or something similar. The fanboys are already making this excuse without Nvidia prompting. In the end Nvidia will either ignore the issue and keep saying we are looking into it if prompted, or come up with the line that TECHNICALLY it does have 4 GB of memory. Unless of course this can be fixed via software, but if its a hardware problem expect this to be how they handle it.

 

Edit: Having read the link, I guess that IS Nvidias line. In other words they are saying there is no fix, it is what it is, and we think the card is fine this way.

 

So basically if you want to access a full 4 gb of VRAM, get a 980 or an r9 290/x.

CPU: Ryzen 7 3700x,  MOBO: ASUS TUF X570 Gaming Pro wifi, CPU cooler: Noctua U12a RAM: Gskill Ripjaws V @3600mhz,  GPU: Asus Tuf RTX OC 3080 PSU: Seasonic Focus GX850 CASE: Lian Li Lancool 2 Mesh Storage: 500 GB Inland Premium M.2,  Sandisk Ultra Plus II 256 GB & 120 GB

Link to post
Share on other sites

It IS the reason

Proof? "Because Nvidia said it?"

 

Conspiracy theories.....

 

Sorry but I am not letting Nvidia off the hook for this one.  The GTX 970 should be using all of the 4 GB of memory when you need it and when people with some cards can't do it they deserve a refund.

But.... it IS using all the memory?

Want to help researchers improve the lives on millions of people with just your computer? Then join World Community Grid distributed computing, and start helping the world to solve it's most difficult problems!

 

Link to post
Share on other sites

Nothing's changed since the card's release. the card performs just as well as indicated in reviews and benchmarks, and the card does legitimately have 4gb of vram, and the card can utilize all 4gb of vram. there is nothing to refund. This would be far from the only video card with some kind of performance bottleneck somehwere.

This doesn't hurt performance, it adds visual stuttering among other things that can't be measured by an FPS counter.

Spoiler

Prometheus (Main Rig)

CPU-Z Verification

Laptop: 

Spoiler

Intel Core i3-5005U, 8GB RAM, Crucial MX 100 128GB, Touch-Screen, Intel 7260 WiFi/Bluetooth card.

 Phone:

 Game Consoles:

Spoiler

Softmodded Fat PS2 w/ 80GB HDD, and a Dreamcast.

 

If you want my attention quote my post, or tag me. If you don't use PCPartPicker I will ignore your build.

Link to post
Share on other sites

Nothing's changed since the card's release. the card performs just as well as indicated in reviews and benchmarks, and the card does legitimately have 4gb of vram, and the card can utilize all 4gb of vram. there is nothing to refund. This would be far from the only video card with some kind of performance bottleneck somehwere.

We're talking huge performance tanking when going over 3.5GB though, which is a significant problem which needs a fix. It has 4GB of VRAM and can't use it without tanking it's performance, it's an issue.

Main Rig: CPU: AMD Ryzen 7 5700X3D | RAM: 32GB (2x16GB) KLEVV CRAS XR RGB DDR4-3600 | Motherboard: Gigabyte B550I AORUS PRO AX | Storage: 500GB Crucial P3 Plus, 4TB Silicon Power UD90 | GPU: AsRock Radeon RX 9070 XT Steel Legend | Cooling: ThermalTake Floe 280mm w/ be quiet! Pure Wings 3 | Case: Sliger SM580 (Black) | PSU: Corsair SF850

Main Server: CPU: AMD Ryzen 9 5950X | RAM: 64GB (2x32GB) Corsair Vengeance LPX DDR4-3200 | Motherboard: ASUS Crosshair VII Hero WiFi | Storage: 512GB SKHynix NVMe | GPUs: NVIDIA TITAN Xp 2-way SLI | Cooling: Thermalright Frozen Prism 360mm | Case: Corsair 5000D Airflow (White) | PSU: Seasonic Focus GM850

File and Media Server (AOOSTAR WTR Pro): CPU: AMD Ryzen 7 5825U | RAM: 32GB (2x16GB) Silicon Power DDR4-3200 SODIMMs | Storage: 1TB Samsung 970 EVO Plus, 2x14TB Western Digital Ultrastar DC HC530

Link to post
Share on other sites

The card does have 4gb vram, and it does have a real 256 bit memory bus, its just that with the cut down sm's, the 970 cannot access all 4 gb of ram as efficiently as the full 980 chip, and this isn't really suprising, since the gtx 970 is a cut down version of the gtx 980...

 

 

 

 

Source: http://www.pcper.com/news/Graphics-Cards/NVIDIA-Responds-GTX-970-35GB-Memory-Issue

 

 

 

 

tl;dr/captain obvious

cut down version of gtx 980 doesn't perform as well as gtx 980

Why was I sold a card that said 4GB? Shouldn't it have said "3.5GB With 512MB Slow Access Memory?" 256bit 4GB is in the product's title, and that implies that the card has a 256bit wide bus to access the mem. still feel ripped off and nothing will change that unless NVIDIA compensates for this.

4790k @ 4.6 (1.25 adaptive) // 2x GTX 970 stock clocks/voltage // Dominator Platnium 4x4 16G //Maximus Formula VII // WD Black1TB + 128GB 850 PRO // RM1000 // NZXT H440 // Razer Blackwidow Ultimate 2013 (MX Blue) // Corsair M95 + Steelseries QCK // Razer Adaro DJ // AOC I2757FH

Link to post
Share on other sites

Why was I sold a card that said 4GB? Shouldn't it have said "3.5GB With 512MB Slow Access Memory?" 256bit 4GB is in the product's title, and that implies that the card has a 256bit wide bus to access the mem. still feel ripped off and nothing will change that unless NVIDIA compensates for this.

the 970 does not have an asymetrical design like the gtx 660, the card does have a 256-bit bus. The card's memory access is bottlenecked by sm resources, not the bus.

Link to post
Share on other sites

Notice the r9 290 doesn't have this same problem versus the 290x.

 

Looks like Nvidia pulled a fast one on people. They already sold a shit ton of 970s so what do they care if a few more tech geeks pass on a new 970 because of this issue. As for disgruntled 970 owners, they know most of them are dedicated to Nvidia, and by the time they are ready to buy a new card a couple years from now they will be too excited about Nvidias next new thing to care or jump ship to AMD.

CPU: Ryzen 7 3700x,  MOBO: ASUS TUF X570 Gaming Pro wifi, CPU cooler: Noctua U12a RAM: Gskill Ripjaws V @3600mhz,  GPU: Asus Tuf RTX OC 3080 PSU: Seasonic Focus GX850 CASE: Lian Li Lancool 2 Mesh Storage: 500 GB Inland Premium M.2,  Sandisk Ultra Plus II 256 GB & 120 GB

Link to post
Share on other sites

This is what happens when you let marketing morons go wild. The 980 isn't a 165W card and the 970 can't use all of its VRAM properly. Also what the frack is wrong with Nvidia saying the Tegra X1 is a 1TFlop chip when they are actually counting the 16bit half precision performance which was abandoned over a decade ago. When in fact the single percision FP32 performance is 500GFlops at PEAK, so not even when it goes into actual products with real-world thermal and power limits.

Link to post
Share on other sites

It's not a memory bug but instead a misinformation/false advertising. 

| Intel i7-3770@4.2Ghz | Asus Z77-V | Zotac 980 Ti Amp! Omega | DDR3 1800mhz 4GB x4 | 300GB Intel DC S3500 SSD | 512GB Plextor M5 Pro | 2x 1TB WD Blue HDD |
 | Enermax NAXN82+ 650W 80Plus Bronze | Fiio E07K | Grado SR80i | Cooler Master XB HAF EVO | Logitech G27 | Logitech G600 | CM Storm Quickfire TK | DualShock 4 |

Link to post
Share on other sites

the 970 does not have an asymetrical design like the gtx 660, the card does have a 256-bit bus. The card's memory access is bottlenecked by sm resources, not the bus.

My bad. Can you elaborate on what a SM is, I feel kind of dumb.

4790k @ 4.6 (1.25 adaptive) // 2x GTX 970 stock clocks/voltage // Dominator Platnium 4x4 16G //Maximus Formula VII // WD Black1TB + 128GB 850 PRO // RM1000 // NZXT H440 // Razer Blackwidow Ultimate 2013 (MX Blue) // Corsair M95 + Steelseries QCK // Razer Adaro DJ // AOC I2757FH

Link to post
Share on other sites

My bad. Can you elaborate on what a SM is, I feel kind of dumb.

SM (also referred to as 'SMM' in maxwell) = Streaming multiprocessor

 

In maxwell Each SMM has 128 CUDA cores (aka stream processors).

 

With the GTX 970 they cut down the card by severing a trio of SMM units, while leaving the rest of the card pretty much the same as the gtx 980. This is the reason for the less efficient memory access.

 

This article does a good job explaining how the reduced SMM count in the 970 effects performance compared to the 980:

 

http://techreport.com/blog/27143/here-another-reason-the-geforce-gtx-970-is-slower-than-the-gtx-980

 

 

The pixel fillrate can be linked to the number of ROPs for some GPUs, but it’s been limited elsewhere for years for many Nvidia GPUs. Basically there are 3 levels that might have a say at what the peak fillrate is :

  • The number of rasterizers
  • The number of SMs
  • The number of ROPs

On both Kepler and Maxwell each SM appears to use a 128-bit datapath to transfer pixels color data to the ROPs. Those appears to be converted from FP32 to the actual pixel format before being transferred to the ROPs. With classic INT8 rendering (32-bit per pixel) it means each SM has a throughput of 4 pixels/clock. With HDR FP16 (64-bit per pixel), each SM has a throughput of 2 pixels/clock.

On Kepler each rasterizer can output up to 8 pixels/clock. With Maxwell, the rate goes up to 16 pixels/clock (at least with the currently released Maxwell GPUs).

So the actual pixels/cycle peak rate when you look at all the limits (rasterizers/SMs/ROPs) would be :

GTX 750 : 16/16/16

GTX 750 Ti  : 16/20/16

GTX 760 : 32/24/32 or 24/24/32 (as there are 2 die configuration options)

GTX 770 : 32/32/32

GTX 780 : 40/48/48 or 32/48/48 (as there are 2 die configuration options)

GTX 780 Ti : 40/60/48

GTX 970 : 64/52/64

GTX 980 : 64/64/64

Extra ROPs are still useful to get better efficiency with MSAA and so. But they don’t participate in the peak pixel fillrate.

That’s in part what explains the significant fillrate delta between the GTX 980 and the GTX 970 (as you measured it in 3DMark Vantage). There is another reason which seem to be that unevenly configured GPCs are less efficient with huge triangles splitting (as it’s usually the case with fillrate tests).

asus-strix-feature.jpgSo the GTX 970's peak potential pixel fill rate isn't as high as the GTX 980's, in spite of the fact that they share the same ROP count, because the key limitation resides elsewhere. When Nvidia hobbles the GTX 970 by disabling SMs, the effective pixel fill rate suffers.

Link to post
Share on other sites

SM (also referred to as 'SMM' in maxwell) = Streaming multiprocessor

 

In maxwell Each SMM has 128 CUDA cores (aka stream processors).

 

With the GTX 970 they cut down the card by severing a trio of SMM units, while leaving the rest of the card pretty much the same as the gtx 980. This is the reason for the less efficient memory access.

 

This article does a good job explaining how the reduced SMM count in the 970 effects performance compared to the 980:

 

http://techreport.com/blog/27143/here-another-reason-the-geforce-gtx-970-is-slower-than-the-gtx-980

So why isn't there a deticated memory controller? I feel like I'm missing something. 

4790k @ 4.6 (1.25 adaptive) // 2x GTX 970 stock clocks/voltage // Dominator Platnium 4x4 16G //Maximus Formula VII // WD Black1TB + 128GB 850 PRO // RM1000 // NZXT H440 // Razer Blackwidow Ultimate 2013 (MX Blue) // Corsair M95 + Steelseries QCK // Razer Adaro DJ // AOC I2757FH

Link to post
Share on other sites

So why isn't there a deticated memory controller? I feel like I'm missing something. 

There are four 64 bit memory controllers. What happens is each SM uses 'crossbars' to access memory channels. The issue here is that because of the disabled SMs there's an increased load on fewer crossbars and efficiency is lost.

Link to post
Share on other sites

Am i the only one right in thinking that Nvidia don't want to fix it because the 970 may gain a performance increase to the 980 therefore counting the 980 as an unusal buy and making the 970 power to price ratio higher than any other card?

 

Literally all it sounds like.

                                                      Professional Graphics Designer | Case: NZXT Phantom Orange and Black | Motherboard: MSI SLI PLUS X99S

                                                                                                        CPU: Intel i7 5820K | Graphics Card: Zotac nVidia 1080 AMP!

                                   RAM: Corsair Vengeance 16GB DDR4 2400Mhz | Storage: Samsung 850 PRO 256GB, Western Digital Black 3TB & Western Digital Red 3TB | 

                                                        Monitors: Acer Predator XB271HU 27", Acer Predator XB270HAbprz 27" and BenQ GL240 24" | PSU: Corsair AX860i |

Link to post
Share on other sites

There are four 64 bit memory controllers. What happens is each SM uses 'crossbars' to access memory channels. The issue here is that because of the disabled SMs there's an increased load on fewer crossbars and efficiency is lost.

I see. Thanks for explaining!

But this just sounds like a simple oversight to me, an unseen effect. 

cmon nvidia, get it toghether

4790k @ 4.6 (1.25 adaptive) // 2x GTX 970 stock clocks/voltage // Dominator Platnium 4x4 16G //Maximus Formula VII // WD Black1TB + 128GB 850 PRO // RM1000 // NZXT H440 // Razer Blackwidow Ultimate 2013 (MX Blue) // Corsair M95 + Steelseries QCK // Razer Adaro DJ // AOC I2757FH

Link to post
Share on other sites

There are four 64 bit memory controllers. What happens is each SM uses 'crossbars' to access memory channels. The issue here is that because of the disabled SMs there's an increased load on fewer crossbars and efficiency is lost.

If THIS causes a significant performance drop, then it is a hardware flaw / poor engineering / implementation.

I can ask my Engineering professors, and Engineers out in the field, and they would most likely say the same.

AMD Ryzen 9000 Rig

  • AMD R7 9800X3D + Alphacool CORE 1 w/ Performance Mount Kit + Thermal Grizzly AM5 Contact Frame
  • Gigabyte X870E Aorus Pro Ice
  • 32GB (16GB X2) G.Skill Trident Z5 Neo RGB DDR5-6400
  • Sapphire NITRO+ 6800 XT Special Edition + EKwb Full Cover Block
  • Custom Loop w/ 2x 360mm Radiators
  • WD SN850X + WD SN750 + Samsung 980
  • EVGA P2 850W + Red/White CableMod Cables
  • Lian-Li O11 Dynamic EVO XL

AMD Ryzen 5000 Rig

  • AMD R7-5800X
  • Gigabyte B550 Aorus Pro AC
  • 32GB (16GB X 2) Crucial Ballistix RGB DDR4-3600
  • Gigabyte Vision RTX 3060 Ti OC
  • EKwb D-RGB 360mm AIO
  • Intel 660p NVMe 1TB + Crucial MX500 1TB + WD Black 1TB HDD
  • EVGA P2 850W + White CableMod cables
  • Lian-Li LanCool II Mesh - White

Intel i7-8086K / Z390 Rig (Decommissioned Q2' 2025)

Intel i7-6800K / X99 Rig (Officially Decommissioned, Dead CPU returned to Intel)
Intel i5-4690K / Z97 Rig (Decommissioned)

AMD FX-8350 / 990FX Rig (Decommissioned)

AMD Phenom II X6 1090T / 890FX Rig (Decommissioned)

 

<> Electrical Engineer , B.Eng <>

<> Electronics & Computer Engineering Technologist (Diploma + Advanced Diploma) <>

<> Electronics Engineering Technician for the Canadian Department of National Defence <>

Link to post
Share on other sites

If THIS causes a significant performance drop, then it is a hardware flaw / poor engineering / implementation.

I can ask my Engineering professors, and Engineers out in the field, and they would most likely say the same.

It is a flaw in the design, the way they cut it down certainly isn't perfect, but overall the 970 is still a very efficient card, and this whole 'memorygate' thing is pretty overblown to say the least.

Link to post
Share on other sites

Create an account or sign in to comment

You need to be a member in order to leave a comment

Create an account

Sign up for a new account in our community. It's easy!

Register a new account

Sign in

Already have an account? Sign in here.

Sign In Now

×