Jump to content

According to the following article Nvidia has confirmed that 9xx cards will support asynchronous compute but that it is currently disabled in the driver. Once enabled we should expect performance to increase quite a bit. Hopefully this puts to rest all the DX12 claims about them falling so far behind AMD.

 

Source:

http://www.extremetech.com/gaming/223567-amd-clobbers-nvidia-in-updated-ashes-of-the-singularity-directx-12-benchmark

CPU: AMD 5950X    MB: Asus ROG Crosshair VIII Dark Hero    RAM: HyperX Predator 64GB    GPU: Nvidia RTX 3090 Ti FE    SSD: Seagate FireCuda 530 2TB    
PSU: EVGA 1200w P2    COOLING: EK AIO Elite 360    CASE: Fractal Design Torrent 
   DISPLAY: LG CX48 4k OLED    AUDIO: HIFIMAN Arya SE

Link to comment
https://linustechtips.com/topic/554642-nvidia-9xx-asynchronous-compute/
Share on other sites

Link to post
Share on other sites

4 minutes ago, Kryptyx said:

Hopefully this puts to rest all the DX12 claims about them falling so far behind AMD

Can I have some of what you're smoking 

 

As long as there is a bandwagon to jump on, people will jump on one. 

PSU Tier List | CoC

Gaming Build | FreeNAS Server

Spoiler

i5-4690k || Seidon 240m || GTX780 ACX || MSI Z97s SLI Plus || 8GB 2400mhz || 250GB 840 Evo || 1TB WD Blue || H440 (Black/Blue) || Windows 10 Pro || Dell P2414H & BenQ XL2411Z || Ducky Shine Mini || Logitech G502 Proteus Core

Spoiler

FreeNAS 9.3 - Stable || Xeon E3 1230v2 || Supermicro X9SCM-F || 32GB Crucial ECC DDR3 || 3x4TB WD Red (JBOD) || SYBA SI-PEX40064 sata controller || Corsair CX500m || NZXT Source 210.

Link to post
Share on other sites

1 minute ago, djdwosk97 said:

Can I have some of what you're smoking 

 

As long as there is a bandwagon to jump on, people will jump on one. 

I guess one can hope...

CPU: AMD 5950X    MB: Asus ROG Crosshair VIII Dark Hero    RAM: HyperX Predator 64GB    GPU: Nvidia RTX 3090 Ti FE    SSD: Seagate FireCuda 530 2TB    
PSU: EVGA 1200w P2    COOLING: EK AIO Elite 360    CASE: Fractal Design Torrent 
   DISPLAY: LG CX48 4k OLED    AUDIO: HIFIMAN Arya SE

Link to post
Share on other sites

Async compute is purely hardware based and Maxwell lacks the physical part needed to do Async compute. Driver based Async compute isn't real and is really Synchronous compute.

[Out-of-date] Want to learn how to make your own custom Windows 10 image?

 

Desktop: AMD R9 3900X | ASUS ROG Strix X570-F | Radeon RX 5700 XT | EVGA GTX 1080 SC | 32GB Trident Z Neo 3600MHz | 1TB 970 EVO | 256GB 840 EVO | 960GB Corsair Force LE | EVGA G2 850W | Phanteks P400S

Laptop: Intel M-5Y10c | Intel HD Graphics | 8GB RAM | 250GB Micron SSD | Asus UX305FA

Server 01: Intel Xeon D 1541 | ASRock Rack D1541D4I-2L2T | 32GB Hynix ECC DDR4 | 4x8TB Western Digital HDDs | 32TB Raw 16TB Usable

Server 02: Intel i7 7700K | Gigabye Z170N Gaming5 | 16GB Trident Z 3200MHz

Link to post
Share on other sites

3 minutes ago, DeadEyePsycho said:

Async compute is purely hardware based and Maxwell lacks the physical part needed to do Async compute. Driver based Async compute isn't real and is really Synchronous compute.

Unless I'm mistaken, Maxwell supports Async compute, there's just no hardware pipeline for it -- so just the pipeline needs to be implemented in drivers (which is manageable).

PSU Tier List | CoC

Gaming Build | FreeNAS Server

Spoiler

i5-4690k || Seidon 240m || GTX780 ACX || MSI Z97s SLI Plus || 8GB 2400mhz || 250GB 840 Evo || 1TB WD Blue || H440 (Black/Blue) || Windows 10 Pro || Dell P2414H & BenQ XL2411Z || Ducky Shine Mini || Logitech G502 Proteus Core

Spoiler

FreeNAS 9.3 - Stable || Xeon E3 1230v2 || Supermicro X9SCM-F || 32GB Crucial ECC DDR3 || 3x4TB WD Red (JBOD) || SYBA SI-PEX40064 sata controller || Corsair CX500m || NZXT Source 210.

Link to post
Share on other sites

9 minutes ago, djdwosk97 said:

Unless I'm mistaken, Maxwell supports Async compute, there's just no hardware pipeline for it -- so just the pipeline needs to be implemented in drivers (which is manageable).

Maxwell has compute shaders which can do Async but they rely heavily on context switching which is delaying a thread to be used later and as a result increases latency. AMD has compute engines that are specifically designed to do async and do so much better than Nvidia. Basically async is supposed to give higher FPS with lower latency which it does on AMD. Nvidia increases FPS but does not decrease latency. 

 

Basically it comes down to designing specifically for async or just tacking it on as an after thought.

[Out-of-date] Want to learn how to make your own custom Windows 10 image?

 

Desktop: AMD R9 3900X | ASUS ROG Strix X570-F | Radeon RX 5700 XT | EVGA GTX 1080 SC | 32GB Trident Z Neo 3600MHz | 1TB 970 EVO | 256GB 840 EVO | 960GB Corsair Force LE | EVGA G2 850W | Phanteks P400S

Laptop: Intel M-5Y10c | Intel HD Graphics | 8GB RAM | 250GB Micron SSD | Asus UX305FA

Server 01: Intel Xeon D 1541 | ASRock Rack D1541D4I-2L2T | 32GB Hynix ECC DDR4 | 4x8TB Western Digital HDDs | 32TB Raw 16TB Usable

Server 02: Intel i7 7700K | Gigabye Z170N Gaming5 | 16GB Trident Z 3200MHz

Link to post
Share on other sites

Maxwell has no hardware async compute, Nvidia has to do driver updates for it on the software level which isn't as effective in terms of performance boost if it were on the hardware level.

 

Tinfoil Hat Time: Nvidia purposely got rid of async compute at the hardware level to force Maxwell users to upgrade to Pascal.

 

 

Link to post
Share on other sites

If Async Compute is disabled in driver then how did they ran Async Compute benchmark with Nvidia card? Thru voodoo magic?

| Intel i7-3770@4.2Ghz | Asus Z77-V | Zotac 980 Ti Amp! Omega | DDR3 1800mhz 4GB x4 | 300GB Intel DC S3500 SSD | 512GB Plextor M5 Pro | 2x 1TB WD Blue HDD |
 | Enermax NAXN82+ 650W 80Plus Bronze | Fiio E07K | Grado SR80i | Cooler Master XB HAF EVO | Logitech G27 | Logitech G600 | CM Storm Quickfire TK | DualShock 4 |

Link to post
Share on other sites

34 minutes ago, AlexTheRose said:

D0UGGcX.jpg

funniest shit is, you're not totally wrong.

 

Async support takes quite a bit more than a driver update, the actual core design needs to be re-worked for it to take make a real difference, even if they "implement" it through a software patch, their still lacking the built-in support you find in amd's lineup. Plainly, even when maxwell sees async support the game will still play out with amd getting a much larger boost to performance than the boost nvidia will get.

 

 

Updated 2021 Desktop || 3700x || Asus x570 Tuf Gaming || 32gb Predator 3200mhz || 2080s XC Ultra || MSI 1440p144hz || DT990 + HD660 || GoXLR + ifi Zen Can || Avermedia Livestreamer 513 ||

New Home Dedicated Game Server || Xeon E5 2630Lv3 || 16gb 2333mhz ddr4 ECC || 2tb Sata SSD || 8tb Nas HDD || Radeon 6450 1g display adapter ||

Link to post
Share on other sites

On 25-2-2016 at 8:43 AM, xAcid9 said:

If Async Compute is disabled in driver then how did they ran Async Compute benchmark with Nvidia card? Thru voodoo magic?

Through some sort of software emulation probably. Unsuprising the performance declined by using it, and why there was a visible "wall" that nvidia hit up to a certain performance point. As the 980TI performs the same as the GTX 980. That wouldn't make sense, in any form of shape, unless there's a common denominator.

 

And can I just add that I really don't understand the hype. I still see the 390X and 980 butting heads like they have been doing on low-drawcall / CPU-unintensive games. And may I also remind people that all 390X's are aftermarket units running overclocks, and not running "reference" speeds, since there are none. And that most Nvidia cards in the tests are running reference values. Most aftermarket models run 1329-1400 mhz instead of 1216. 

 

The only real winner is the Fury X, the rest... really uninspiring leap in performance (in relative terms, yes they jump in performance from DX11 because their DX11 drivers are terrible). And the superlatives being thrown around like AMD is making mincemeat out of Maxwell (that is apparently disabling async support) are way out of proportion.

 

I mean it's great AMD's performing atleast up to what the hardware is supposed to do now, and not being limited by CPU speeds, meaning they can atleast compete again with Maxwell and possibly Pascal. But for the love of god people, try to contain your hypetrains.

 

EDIT: and can I also add that the CPU overhead AMD has still isn't really resolved? Only postponed due to the lower overhead from the new API.

 

aos_low_gpu3.png

 

aos_low_gpu4.png

 

The 970 is still saturated much earlier.

Link to post
Share on other sites

Good but this isn't this the exact same thing they said back in September?

http://www.guru3d.com/news-story/nvidia-will-fully-implement-async-compute-via-driver-support.html

 

Not saying they are lying but it's been about 6 months since AOTs alpha came out with the feature and even longer since DX12 spec came out. Their driver team better get on it... Oxide has been providing them with ashes of the singularity source code since 2014.

Link to post
Share on other sites

6 minutes ago, Humbug said:

Not saying they are lying but it's been about 6 months since AOTs alpha came out with the feature and even longer since DX12 spec came out. Their driver team better get on it...

To be honest, the stakes are only high because people are making it so. It's one early-access game, should they dump a good portion of their R&D onto a game subject to change? Or wait for some stable build to release and then optimize.

 

Not rationalizing, but from a grand perspective...it seems pretty silly for one developer to force this much investment out of the hardware manuf. It's not even a big budget AAA title that is going to be bundeled or anything. And the game seems pretty mediocre past the new tech. It's not gettin raving reviews on steam either. 

 

This just pretty much seems to boil down to marketing at this point, if it hadn't been hyped noone would've batted an eye about the game. Still, now that it is, they can't slack.

Link to post
Share on other sites

17 minutes ago, Majestic said:

To be honest, the stakes are only high because people are making it so. It's one early-access game, should they dump a good portion of their R&D onto a game subject to change? Or wait for some stable build to release and then optimize.

 

Not rationalizing, but from a grand perspective...it seems pretty silly for one developer to force this much investment out of the hardware manuf. It's not even a big budget AAA title that is going to be bundeled or anything. And the game seems pretty mediocre past the new tech. It's not gettin raving reviews on steam either. 

 

This just pretty much seems to boil down to marketing at this point, if it hadn't been hyped noone would've batted an eye about the game. Still, now that it is, they can't slack.

I think at least some of the hype is because it is the first real demo of DX12 in an actual game. It isn't a particularly long stretch to assume that if nVidia doesn't have Async-Compute and is quite far behind AMD's GPUs right now, that if nVidia doesn't implement it at some point then AMD will still wipe the floor with nVidia's cards in other titles that use it.

AMD Ryzen 7800 X3D, MSI B650 Project Zero, Antec C5, Gigabyte RTX 4080 Super Aero

 

Nikon D500 | Nikon 300mm f/4 PF  | Nikon 200-500 f/5.6 | Nikon 50mm f/1.8 | Tamron 70-210 f/4 VCII | Sigma 10-20 f/3.5 | Nikon 17-55 f/2.8 | Tamron 90mm F2.8 SP Di VC USD Macro | Neewer 750II

Link to post
Share on other sites

18 minutes ago, Fetzie said:

I think at least some of the hype is because it is the first real demo of DX12 in an actual game. It isn't a particularly long stretch to assume that if nVidia doesn't have Async-Compute and is quite far behind AMD's GPUs right now, that if nVidia doesn't implement it at some point then AMD will still wipe the floor with nVidia's cards in other titles that use it.

But it currently isn't "swiping the floor". That sort of superlative would be reserved if they beat them on a substantial margin, like 40-50% or higher. The 380 clearly beats the 960, no question. The 970, once on DX12, clearly loses to the 390. The 980 and 390X are pretty much tied. And the Fury X beats the 980TI. With the added caveat that current tests that are out are between factory overclocked AMD cards, versus tame/reference Nvidia cards (that are supposedly not unlocked, but i'm going to assume this is marketing BS).

 

aos_crazy_5.png

 

By significant margins, but not by landslides to warrant such superlatives. Nothing unlike we've seen the 980TI do in all the major DX11 titles in the past two years. And i'm going to wait for A. more games with DX12, B. nvidia's claimed "support" for async, C. what vulcan is doing, which I think is more important if we want to finally get rid of being forced to windows, D. how much async is going to be actually implemented by developers not being funded by AMD (c'mon, they've been advertising if for quite some time), before i'm really picking a side on this debate. Right now, the only thing i'm trying to do is provide a dampening factor on atleast some of the hype and superlatives being hurled around. 

 

As for the current online discussions;

All i'm seeing, is people that have been in denial about objective performance now suddently don't have to be in denial anymore, since they have objectivity on their side now (actually being better), yet are being extremely smug and obnoxious about it. They've completely jumped the shark and went from playing the victim card with gameworks, to mocking nvidia customers about Asynchonous Compute. Which is basically the same thing as Gameworks represented for AMD with Tessallation. It ran poorly because AMD lacked the drawcall throughput and Tessellation efficiency, not because it was "black box".

 

But somehow, this is supposed to be good for all people (eventhough 80% of the market has nvidia), as opposed to Tessellation being evil, black box and nefarious of intent. It's incredibly hypocritical if you ask me. And the worst thing is, is that the people being incredibly smug and unapologeticly hyping this, were the same people crying wolf with Gameworks.

 

We've done a full 180 now. And noone sees the irony of it.

Link to post
Share on other sites

10 minutes ago, Majestic said:

But it currently isn't "swiping the floor". That sort of superlative would be reserved if they beat them on a substantial margin, like 40-50% or higher. The 380 clearly beats the 960, no question. The 970, once on DX12, clearly loses to the 390. The 980 and 390X are pretty much tied. And the Fury X beats the 980TI. With the added caveat that current tests that are out are between factory overclocked AMD cards, versus tame/reference Nvidia cards (that are supposedly not unlocked, but i'm going to assume this is marketing BS).

 

aos_crazy_5.png

 

According to that graphic the nVidia cards have GPU Boost turned on (which will which will give them an overclock to something in the region of 1150ish MHz).

AMD Ryzen 7800 X3D, MSI B650 Project Zero, Antec C5, Gigabyte RTX 4080 Super Aero

 

Nikon D500 | Nikon 300mm f/4 PF  | Nikon 200-500 f/5.6 | Nikon 50mm f/1.8 | Tamron 70-210 f/4 VCII | Sigma 10-20 f/3.5 | Nikon 17-55 f/2.8 | Tamron 90mm F2.8 SP Di VC USD Macro | Neewer 750II

Link to post
Share on other sites

7 minutes ago, Fetzie said:

According to that graphic the nVidia cards have GPU Boost turned on (which will which will give them an overclock to something in the region of 1150ish MHz).

Aftermarket 970's boost from 1329 to 1400mhz, depending on the vendor. Now scaling isn't 100%, so I can't add that percentage to the score. 

Link to post
Share on other sites

18 minutes ago, Majestic said:

Aftermarket 970's boost from 1329 to 1400mhz, depending on the vendor. Now scaling isn't 100%, so I can't add that percentage to the score. 

So that's a 30% overclock over "stock"?

AMD Ryzen 7800 X3D, MSI B650 Project Zero, Antec C5, Gigabyte RTX 4080 Super Aero

 

Nikon D500 | Nikon 300mm f/4 PF  | Nikon 200-500 f/5.6 | Nikon 50mm f/1.8 | Tamron 70-210 f/4 VCII | Sigma 10-20 f/3.5 | Nikon 17-55 f/2.8 | Tamron 90mm F2.8 SP Di VC USD Macro | Neewer 750II

Link to post
Share on other sites

6 minutes ago, Fetzie said:

So that's a 30% overclock over "stock"?

Stock boost is 1178mhz. So ~19%-ish over stock. But 1500mhz isn't an uncommon OC for a 970, which is 27%. Now again, scaling isn't 100%, so the scores won't increase by 27%. But certainly to the degree of 10-15% or 20% for the OC, being very tame in expectations.

 

Especially the 980TI has some pretty substantial scaling, because it has a fairly low baseclock to start with, but can OC to the same hights as the other maxwell chips. 

Link to post
Share on other sites

This is simply Nvidia forcing the card to do something their card was not originally designed to do which is pure desperation... :(

Zen-III-X8-5900X (Gamestation 5)

Spoiler

Case: Medion Micro-ATX Case / Case Fan Front: SUNON MagLev PF70251VX-Q000-S99 70mm / Case Fan Rear: Fanner Tech(Shen Zhen)Co.,LTD. 80mm (Purple) / Controller: Sony Dualshock 4 Wireless (DS4Windows) / Cooler: AMD Near-silent 125w Thermal Solution / CPU: AMD Ryzen 5 3600, 6-cores, 12-threads, 4.2/4.2GHz, 35,3MB cache (T.S.M.C. 7nm FinFET) / CPU: AMD Ryzen 9 5900X, 12(8)-cores, 24(16)-threads, 4.5/4.8GHz, 70.5MB(68,35MB) cache (T.S.M.C. 7nm FinFET) / Display: HP 24" L2445w (64Hz OC) 1920x1200 / GPU: MSI GeForce GTX 970 4GD5 OC "Afterburner" @1450MHz (T.S.M.C. 28nm) / GPU: ASUS Radeon RX 6600 XT DUAL OC RDNA2 32CUs @2.6GHz 10.6 TFLOPS (T.S.M.C. 7nm FinFET) R.ID (NimeZ drivers) / Keyboard: HP KB-0316 PS/2 (Nordic) / Motherboard: ASRock B450M Pro4, Socket-AM4 (SAM enabled) / Mouse: Razer Abyssus 2014 / PCI-E: ASRock USB 3.1/A+C (PCI Express x4) / PSU: EVGA SuperNOVA G2, 550W / RAM A1 & B1: G.SKILL DDR4-3600MHz CL18-20-21-39-60-1T "Samsung 8Gbit C-Die" (2x8GB) / RAM A2 & B2: HyperX DDR4-3600MHz CL16-18-19-37-85-1T "SK Hynix 8Gbit CJR" (2x16GB) / Operating System: Windows 10 Home / Sound 1: Zombee Z500 / Sound 2: Logitech Stereo Speakers S-150 / Storage 1 & 2: Samsung 850 EVO 500GB SSD / Storage 3: Western Digital My Passport 2.5" 2TB HDD / Storage 4: Western Digital Elements Desktop 2TB HDD / Storage 5: Kingston A2000 1TB M.2 NVME SSD / Wi-fi & Bluetooth: ASUS PCE-AC55BT Wireless Adapter (Intel)

 Lake-V-X6-10600 (Gaming PC)

R23 score MC: 9190pts | R23 score SC: 1302pts

R20 score MC: 3529cb | R20 score SC: 506cb

Spoiler

Case: Cooler Master HAF XB Evo Black / Case Fan(s) Front: Noctua NF-A14 ULN 140mm Premium Fans / Case Fan(s) Rear: Corsair Air Series AF120 Quiet Edition (red) / Case Fan(s) Side: Noctua NF-A6x25 FLX 60mm Premium Fan / Controller: Sony Dualshock 4 Wireless (DS4Windows) / Cooler: Cooler Master Hyper 212 Evo / CPU: Intel Core i5-10600(ASUS Performance Enhancement), 6-cores, 12-threads, 4.4/4.8GHz, 13,7MB cache (Intel 14nm++ FinFET) / Display: ASUS 24" LED VN247H (67Hz OC) 1920x1080p / GPU: Gigabyte Radeon RX Vega 56 Gaming OC GCN5 56CUs @1.7GHz 12.19 TFLOPS (Samsung 14nm FinFET) R.ID (NimeZ drivers) / Keyboard: Logitech Desktop K120 (Nordic) / Motherboard: ASUS PRIME B460 PLUS, Socket-LGA1200 (SAM enabled) / Mouse: Razer Abyssus 2014 / PCI-E: ASRock USB 3.1/A+C (PCI Express x4) / PSU: EVGA SuperNOVA G2, 850W / RAM A1 & B1: HyperX DDR4-2666MHz CL13-15-15-30-45-2T "Samsung 8Gbit C-Die" (2x8GB) / RAM A2 & B2: Juhor DDR4-3200MHz CL16-20-20-38-72-2T "SK Hynix 8Gbit MFR" (2x16GB) / Operating System: Windows 10 Home / Sound: Zombee Z300 / Storage 1 & 2: Samsung 850 EVO 500GB SSD / Storage 3: Seagate® Barracuda 2TB HDD / Storage 4: Seagate® Desktop 2TB SSHD / Storage 5: Crucial P1 1000GB M.2 SSD/ Storage 6: Western Digital WD7500BPKX 2.5" HDD / Wi-fi: TP-Link TL-WN851N 11n Wireless Adapter (Qualcomm Atheros)

Vishera-X8-9370 | R20 score MC: 1476cb

Spoiler

Case: Cooler Master HAF XB Evo Black / Case Fan(s) Front: Noctua NF-A14 ULN 140mm Premium Fans / Case Fan(s) Rear: Corsair Air Series AF120 Quiet Edition (red) / Case Fan(s) Side: Noctua NF-A6x25 FLX 60mm Premium Fan / Case Fan VRM: SUNON MagLev KDE1209PTV3 92mm / Controller: Sony Dualshock 4 Wireless (DS4Windows) / Cooler: Cooler Master Hyper 212 Evo / CPU: AMD FX-8370 (Base: @4.4GHz | Turbo: @4.7GHz) Black Edition Eight-Core (Global Foundries 32nm) / Display: ASUS 24" LED VN247H (67Hz OC) 1920x1080p / GPU: MSI GeForce GTX 970 4GD5 OC "Afterburner" @1450MHz (T.S.M.C. 28nm) / GPU: Gigabyte Radeon RX Vega 56 Gaming OC @1501MHz (Samsung 14nm FinFET) / Keyboard: Logitech Desktop K120 (Nordic) / Motherboard: MSI 970 GAMING, Socket-AM3+ / Mouse: Razer Abyssus 2014 / PCI-E: ASRock USB 3.1/A+C (PCI Express x4) / PSU: EVGA SuperNOVA G2, 850W PSU / RAM 1, 2, 3 & 4: Corsair Vengeance DDR3-1866MHz CL8-10-10-28-37-2T (4x4GB) 16.38GB / Operating System 1: Windows 10 Home / Sound: Zombee Z300 / Storage 1: Samsung 850 EVO 500GB SSD (x2) / Storage 2: Seagate® Barracuda 2TB HDD / Storage 3: Seagate® Desktop 2TB SSHD / Wi-fi: TP-Link TL-WN951N 11n Wireless Adapter

Godavari-X4-880K | R20 score MC: 810cb

Spoiler

Case: Medion Micro-ATX Case / Case Fan Front: SUNON MagLev PF70251VX-Q000-S99 70mm / Case Fan Rear: Fanner Tech(Shen Zhen)Co.,LTD. 80mm (Purple) / Controller: Sony Dualshock 4 Wireless (DS4Windows) / Cooler: AMD Near-silent 95w Thermal Solution / Cooler: AMD Near-silent 125w Thermal Solution / CPU: AMD Athlon X4 860K Black Edition Elite Quad-Core (T.S.M.C. 28nm) / CPU: AMD Athlon X4 880K Black Edition Elite Quad-Core (T.S.M.C. 28nm) / Display: HP 19" Flat Panel L1940 (75Hz) 1280x1024 / GPU: EVGA GeForce GTX 960 SuperSC 2GB (T.S.M.C. 28nm) / GPU: MSI GeForce GTX 970 4GD5 OC "Afterburner" @1450MHz (T.S.M.C. 28nm) / Keyboard: HP KB-0316 PS/2 (Nordic) / Motherboard: MSI A78M-E45 V2, Socket-FM2+ / Mouse: Razer Abyssus 2014 / PCI-E: ASRock USB 3.1/A+C (PCI Express x4) / PSU: EVGA SuperNOVA G2, 550W PSU / RAM 1, 2, 3 & 4: SK hynix DDR3-1866MHz CL9-10-11-27-40 (4x4GB) 16.38GB / Operating System 1: Ubuntu Gnome 16.04 LTS (Xenial Xerus) / Operating System 2: Windows 10 Home / Sound 1: Zombee Z500 / Sound 2: Logitech Stereo Speakers S-150 / Storage 1: Samsung 850 EVO 500GB SSD (x2) / Storage 2: Western Digital My Passport 2.5" 2TB HDD / Storage 3: Western Digital Elements Desktop 2TB HDD / Wi-fi: TP-Link TL-WN851N 11n Wireless Adapter

Acer Aspire 7738G custom (changed CPU, GPU & Storage)
Spoiler

CPU: Intel Core 2 Duo P8600, 2-cores, 2-threads, 2.4GHz, 3MB cache (Intel 45nm) / GPU: ATi Radeon HD 4570 515MB DDR2 (T.S.M.C. 55nm) / RAM: DDR2-1066MHz CL7-7-7-20-1T (2x2GB) / Operating System: Windows 10 Home / Storage: Crucial BX500 480GB 3D NAND SATA 2.5" SSD

Complete portable device SoC history:

Spoiler
Apple A4 - Apple iPod touch (4th generation)
Apple A5 - Apple iPod touch (5th generation)
Apple A9 - Apple iPhone 6s Plus
HiSilicon Kirin 810 (T.S.M.C. 7nm) - Huawei P40 Lite / Huawei nova 7i
Mediatek Dimensity 700 (T.S.M.C 7nm) - Cherry Mobile Aqua S10 Pro 5G
Mediatek MT2601 (T.S.M.C 28nm) - TicWatch E
Mediatek MT6580 (T.S.M.C 28nm) - TECNO Spark 2 (1GB RAM)
Mediatek MT6592M (T.S.M.C 28nm) - my|phone my32 (orange)
Mediatek MT6592M (T.S.M.C 28nm) - my|phone my32 (yellow)
Mediatek MT6735 (T.S.M.C 28nm) - HMD Nokia 3 Dual SIM
Mediatek MT6737 (T.S.M.C 28nm) - Cherry Mobile Flare S6
Mediatek MT6739 (T.S.M.C 28nm) - my|phone myX8 (blue)
Mediatek MT6739 (T.S.M.C 28nm) - my|phone myX8 (gold)
Mediatek MT6750 (T.S.M.C 28nm) - honor 6C Pro / honor V9 Play
Mediatek MT6765 (T.S.M.C 12nm) - TECNO Pouvoir 3 Plus
Mediatek MT6797D (T.S.M.C 20nm) - my|phone Brown Tab 1
Qualcomm MSM8926 (T.S.M.C. 28nm) - Microsoft Lumia 640 LTE
Qualcomm MSM8974AA (T.S.M.C. 28nm) - Blackberry Passport
Qualcomm SDM710 (Samsung 10nm) - Oppo Realme 3 Pro

 

Link to post
Share on other sites

53 minutes ago, Majestic said:

 

. Which is basically the same thing as Gameworks represented for AMD with Tessallation. It ran poorly because AMD lacked the drawcall throughput and Tessellation efficiency, not because it was "black box".

 

But somehow, this is supposed to be good for all people (eventhough 80% of the market has nvidia), as opposed to Tessellation being evil, black box and nefarious of intent. It's incredibly hypocritical if you ask me.

 

thing is game developers were caught using over-tesselation, slowing down performance on all GPUs (including NVIDIA) for no added visual benefit. It's not just a gameworks concern it has happened to game assets which were even not part of gameworks. That is something that everybody should fight against because nobody wins from that. Even Nvidia users are getting short-changed.

 

personally I don't think that all DX12 and Vulkan games will perform like AOTS. Most games don't have so much going on and only some of them will implement asynchronous compute. I think NVIDIA will be competitive. And also asynchronous compute doesn't slow down Nvidia GPUs. It makes AMD faster. I.e. Nobody is getting slowed down by it, no losers.

Link to post
Share on other sites

Just now, Humbug said:

thing is game developers were caught using over-tesselation, slowing down performance on all GPUs (including NVIDIA) for no added visual benefit. It's not just a gameworks concern it has happened to game assets which were even not part of gameworks. That is something that everybody should fight against because nobody wins from that. Even Nvidia users are getting short-changed.

 

personally I don't think that all DX12 and Vulkan games will perform like AOTS. Most games don't have so much going on and only some of them will implement asynchronous compute. I think NVIDIA will be competitive. And also asynchronous compute doesn't slow down Nvidia GPUs. It makes AMD faster.

You're assumingly talking about The Witcher 3? Which is one isolated example, to which nefarious intent of Nvidia has never been attributed by outset. And to iterate, the 64x tessellation decision fell on CD Project RED to make. I'm not apologizing, I just think it could've just as easily been a mistake. I think it's just a bit too easy to partake in such broadstroking based on one example.

 

That last part I agree with.

Link to post
Share on other sites

3 minutes ago, Majestic said:

You're assumingly talking about The Witcher 3? Which is one isolated example, to which nefarious intent of Nvidia has never been attributed by outset. And to iterate, the 64x tessellation decision fell on CD Project RED to make.

 

not just witcher 3. Arkham Origins and Crysis 2 come to mind. I too blame game developers for this, they have to take responsibility for their code.

Link to post
Share on other sites

There is a misleading discussion that keeps happening around this issue. Nvidia's cards already support Asynchronous Compute, CUDA is based on it as is the entire DX12 API and its working even in DX11. The cards are already capable of taking asynchronous work and scheduling it there is no question of that at all. The only issue is whether Graphics can run concurrently with compute, rather than being scheduled serially. The point of which is usually to reduce latency but it can also allow more compute time per frame depending on how an engine is written. So its important to understand what is and is not in dispute, its not asynchronous compute its whether compute and graphics can work concurrently given a mixed workload involving both.

 

The official documentation on this from Nvidia says it can do one graphics task in parallel with some compute tasks, a bundle of amateur testers have sort of shown it currently can't but might be able to and Nvidia is still saying it needs to work on the drivers. They might be lying about it actually working concurrently or intentionally misleading people by using the word asynchronous instead of concurrent but we don't really know. What we do know is that Nvidia doesn't think its drivers do it yet so at the very least, the question only really remains if they will ever actually release something that does produce a reasonable performance boost in this game or not. We aren't going to know until Nvidia says its done doing what its doing and we see how it actually works.

 

Its worth knowing for example that Nvidia has hardware schedulers for the jobs that are a lot more sophisiticated than AMDs, but they might also be a problem. They may very well not be designed for doing compute and graphics in parallel and hence resulting in more software from their part than they would like to do, a workaround. But AMD uses a lot less hardware and does a lot more of the scheduling work on the CPU, which is where some of the DX11 overhead is likely coming from because they do a lot more in software than Nvidia does.

 

Its not an important benchmark its not a game I care about but it should at least shake the tree enough and we should be getting to the truth soon, its just a little too early to call it right now.

Link to post
Share on other sites

Create an account or sign in to comment

You need to be a member in order to leave a comment

Create an account

Sign up for a new account in our community. It's easy!

Register a new account

Sign in

Already have an account? Sign in here.

Sign In Now

×