Jump to content

VRAM Pooling will work in all or only supported applications?

phongle123

Hello, when pooling VRAM will it work in in general on all applications or will it only be supported in applications that are coded to support VRAM pooling?

 

What happens if it does not support it and you have the 2 GPUs connected. Will it simply only use 1 of the 2 GPUs VRAM or will it error out?

Link to comment
Share on other sites

Link to post
Share on other sites

NVLink does VRAM pooling when Nvidia says it can do VRAM pooling. Currently you need the pro workstation/compute cards with two NVLink connectors for VRAM pooling, and probably for Nvidia to flip the magic software switch that lets it happen.

¯\_(ツ)_/¯

 

 

Desktop:

Intel Core i7-11700K | Noctua NH-D15S chromax.black | ASUS ROG Strix Z590-E Gaming WiFi  | 32 GB G.SKILL TridentZ 3200 MHz | ASUS TUF Gaming RTX 3080 | 1TB Samsung 980 Pro M.2 PCIe 4.0 SSD | 2TB WD Blue M.2 SATA SSD | Seasonic Focus GX-850 Fractal Design Meshify C Windows 10 Pro

 

Laptop:

HP Omen 15 | AMD Ryzen 7 5800H | 16 GB 3200 MHz | Nvidia RTX 3060 | 1 TB WD Black PCIe 3.0 SSD | 512 GB Micron PCIe 3.0 SSD | Windows 11

Link to comment
Share on other sites

Link to post
Share on other sites

11 minutes ago, BobVonBob said:

NVLink does VRAM pooling when Nvidia says it can do VRAM pooling. Currently you need the pro workstation/compute cards with two NVLink connectors for VRAM pooling, and probably for Nvidia to flip the magic software switch that lets it happen.

Didnt know that you couldnt pool VRAM yet. It will also work for the TITAN RTX. It says so on the official product page. 

 

Since VRAM Pooling is not a real thing yet then I guess no one besides nvidia knows the answer to this question. 

Link to comment
Share on other sites

Link to post
Share on other sites

I would imagine it should be transparent to the application programmer and what they see is the combined total. However, they would still have to be aware of certain use cases that may cause performance issues since while NVLink is faster than PCIe 3.0 x16, it's still much slower than VRAM speeds. So if anything, all this would allow you to do if you were looking at this from a games perspective allow for a larger texture/asset pool since I imagine passing around render targets would be a bad idea over a slower bus.

Link to comment
Share on other sites

Link to post
Share on other sites

VRAM pooling is only available on Quadro level cards.  The application also has to be programed to support it.

 

Due to these limits, it will only be supported in CAD, offline rendering, and compute apps more than likely.

Link to comment
Share on other sites

Link to post
Share on other sites

36 minutes ago, KarathKasun said:

VRAM pooling is only available on Quadro level cards.  The application also has to be programed to support it.

 

Due to these limits, it will only be supported in CAD, offline rendering, and compute apps more than likely.

I already stated it works in Titan RTX since it states it on the official site. The whole point of why I created this thread is for offline rendering. Additionally, I wouldn't want to spend 5k on GPUs to not have an application support VRAM Pooling and according to @BobVonBob VRAM Pooling isnt even available right now. So, idk if VRAM pooling works out of the box for everything as in I'll have 2x the VRAM all around or it will only work in specifically coded softwares.

 

50 minutes ago, Mira Yurizaki said:

I would imagine it should be transparent to the application programmer and what they see is the combined total. 

From what I'm getting you're saying that the program/software should simply receive a minor update to adjust to VRAM Pooling to shows the 2x VRAM of a pooled NVLink SLI Card? This won't be for gaming so I don't much care for those results. 

Link to comment
Share on other sites

Link to post
Share on other sites

4 minutes ago, phongle123 said:

I already stated it works in Titan RTX since it states it on the official site. The whole point of why I created this thread is for offline rendering. Additionally, I wouldn't want to spend 5k on GPUs to not have an application support VRAM Pooling and according to @BobVonBob VRAM Pooling isnt even available right now.

 

From what I'm getting you're saying that the program/software should simply receive a minor update to adjust to VRAM Pooling to shows the 2x VRAM. This won't be for gaming so I don't much care for that results. 

https://www.chaosgroup.com/blog/what-does-the-new-nvidia-rtx-hardware-mean-for-ray-tracing-gpu-rendering-v-ray
 

V-Ray supports it.

Link to comment
Share on other sites

Link to post
Share on other sites

6 minutes ago, phongle123 said:

From what I'm getting you're saying that the program/software should simply receive a minor update to adjust to VRAM Pooling to shows the 2x VRAM. This won't be for gaming so I don't much care for that results. 

If I were a hardware manufacturer, that's how I would make it look like to the application. It would seem silly for the application developer to have to manage and worry about a low level detail like this. And if I ask them to be aware of it and code specifically for it, unless they're a huge customer and they need the performance, they're probably not going to care. The only thing that the application developer would need to be aware of are those certain situations where the slower NVLink could impact performance, but that's a matter of profiling the application and avoiding scenarios that lead to the hardware doing that. Basically, it'd be no different than multi-socket CPU systems. Applications see the system with a single pool of RAM (even though there's two or more) and a bank of logical processors.

Link to comment
Share on other sites

Link to post
Share on other sites

Create an account or sign in to comment

You need to be a member in order to leave a comment

Create an account

Sign up for a new account in our community. It's easy!

Register a new account

Sign in

Already have an account? Sign in here.

Sign In Now

×