Jump to content

Why has stacking vRAM been so difficult?

Because the way SLI/XFIRE work, both GPUs require the same information in memory as they both work together rendering the same scene (and there would be far too much latency for cross card memory access to occur).

 

/VRAM stacking is only an issue with SLI/XFIRE, not multi gpu configs in general. 

PSU Tier List | CoC

Gaming Build | FreeNAS Server

Spoiler

i5-4690k || Seidon 240m || GTX780 ACX || MSI Z97s SLI Plus || 8GB 2400mhz || 250GB 840 Evo || 1TB WD Blue || H440 (Black/Blue) || Windows 10 Pro || Dell P2414H & BenQ XL2411Z || Ducky Shine Mini || Logitech G502 Proteus Core

Spoiler

FreeNAS 9.3 - Stable || Xeon E3 1230v2 || Supermicro X9SCM-F || 32GB Crucial ECC DDR3 || 3x4TB WD Red (JBOD) || SYBA SI-PEX40064 sata controller || Corsair CX500m || NZXT Source 210.

Link to post
Share on other sites

Because the way SLI/XFIRE work, both GPUs require the same information in memory as they both work together rendering the same scene (and there would be far too much latency for cross card memory access to occur).

 

/VRAM stacking is only an issue with SLI/XFIRE, not multi gpu configs in general. 

Would some sort of proprietary connector that is faster than PCIE 3.0 be able to be used to connect the GPUs together to reduce latency? 

Link to post
Share on other sites

As the title says. 

Because after the 5th chip they topple over.

 

jk, but CFX/SLI function in the sense that the cards work together, but they don't share resources. A gpu will render a frame, then the other GPU renders the next frame, the this cycle repeats. This is called "Alternate Frame Rendering"(AFR).

Link to post
Share on other sites

Because after the 5th chip they topple over.

 

jk, but CFX/SLI function in the sense that the cards work together, but they don't share resources. A gpu will render a frame, then the other GPU renders the next frame, the this cycle repeats. This is called "Alternate Frame Rendering"(AFR).

Right, but as I posted above, can't you use a faster connection so the two GPUs can work on two different things. I mean right now, it sounds like a total waste to have multiple cards in a gaming rig. 

Link to post
Share on other sites

Would some sort of proprietary connector that is faster than PCIE 3.0 be able to be used to connect the GPUs together to reduce latency? 

not likely, the amount of bandwidth from the gpu to the vram is so much higher than any connector we have right now, that we will have to rely on DX12 to solve our problems :/

Link to post
Share on other sites

not likely, the amount of bandwidth from the gpu to the vram is so much higher than any connector we have right now, that we will have to rely on DX12 to solve our problems :/

even faster than the proposed PCIE 4.0?
Link to post
Share on other sites

even faster than the proposed PCIE 4.0?

yes. 

 

Think about the reasons behind HBM -- i.e. getting the memory as close to the core has possible to improve performance. There is a physical limitation on connection speeds, and distance actually plays a crucial factor.

PSU Tier List | CoC

Gaming Build | FreeNAS Server

Spoiler

i5-4690k || Seidon 240m || GTX780 ACX || MSI Z97s SLI Plus || 8GB 2400mhz || 250GB 840 Evo || 1TB WD Blue || H440 (Black/Blue) || Windows 10 Pro || Dell P2414H & BenQ XL2411Z || Ducky Shine Mini || Logitech G502 Proteus Core

Spoiler

FreeNAS 9.3 - Stable || Xeon E3 1230v2 || Supermicro X9SCM-F || 32GB Crucial ECC DDR3 || 3x4TB WD Red (JBOD) || SYBA SI-PEX40064 sata controller || Corsair CX500m || NZXT Source 210.

Link to post
Share on other sites

not likely, the amount of bandwidth from the gpu to the vram is so much higher than any connector we have right now, that we will have to rely on DX12 to solve our problems :/

TBH, Nvidia needs to start using a new connector since their SLI bridge is 9Gbps (that's if I'm remembering it correctly-either way its slow).

 

yes. 

 

Think about the reasons behind HBM -- i.e. getting the memory as close to the core has possible to improve performance. There is a physical limitation on connection speeds, and distance actually plays a crucial factor.

Though the shorter distance is mainly to help reduce latency, with the connection material, resistance and width (as in width of the track) affecting speed.

           .;ldkO0000Okdl;.                michael@SUSE-BlackBox
        .;d00xl:^''''''^:ok00d;.            OS: openSUSE 20260405
      .d00l'                'o00d.          Kernel: x86_64 Linux 6.19.11-1-default
    .d0K^'  Okxoc;:,.          ^O0d.        Uptime: 2d 21h 52m
   .OVVAK0kOKKKKKKKKKKOxo:,      lKO.       Packages: 6556
  ,0VVAKKKKKKKKKKKKK0P^,,,^dx:    ;00,      Shell: bash 5.3.9
 .OVVAKKKKKKKKKKKKKk'.oOPPb.'0k.   cKO.     Resolution: 3840x1080
 :KVAKKKKKKKKKKKKKK: kKx..dd lKd   'OK:     DE: KDE
 lKlKKKKKKKKKOx0KKKd ^0KKKO' kKKc   lKl     WM: KWin
 lKlKKKKKKKKKK;.;oOKx,..^..;kKKK0.  lKl     GTK Theme: Breeze-Dark [GTK2], Breeze [GTK3]
 :KAlKKKKKKKKK0o;...^cdxxOK0O/^^'  .0K:     Icon Theme: breeze-dark
  kKAVKKKKKKKKKKKK0x;,,......,;od  lKP      Disk: 13T / 22T (60%)
  '0KAVKKKKKKKKKKKKKKKKKK00KKOo^  c00'      CPU: AMD Ryzen 7 5800X3D 8-Core @ 16x 4.55295GHz
   'kKAVOxddxkOO00000Okxoc;''   .dKV'       GPU: AMD Radeon RX 6700 XT (radeonsi, navi22, ACO, DRM 3.64, 6.19.11-1-default)
     l0Ko.                    .c00l'        RAM: 13127MiB / 48094MiB
      'l0Kk:.              .;xK0l'          
         'lkK0xc;:,,,,:;odO0kl'             
             '^:ldxkkkkxdl:^'    

 

Link to post
Share on other sites

The speed of electricity is an issue at these sorts of speeds. Even a very wide connector would be 100,000x slower in latency than going to vram on device and transferring in the order of 500GB/s would be extremely difficult.

Link to post
Share on other sites

Create an account or sign in to comment

You need to be a member in order to leave a comment

Create an account

Sign up for a new account in our community. It's easy!

Register a new account

Sign in

Already have an account? Sign in here.

Sign In Now

×