Jump to content
59 minutes ago, DocSwag said:

Let's look at half precision first:

P100 has 21 tflops half precision

Radeon Pro Duo has 16 tflops single and supports half precision do I'm assuming it has 16 tflops half precision.

That's 1.33 times

 

At single:

P100 is 10 tflops

Radeon pro duo is 16 tflops

Radeon pro duo wins by 1.66 times

 

At double: 

P100 is 5 tflops

Firepro w9100 (or whatever the firepro 290x is called) is 2.5 tflops

P100 wins by 2 times

 

If those numbers are true, then it is the best floating-point ratio (2:1) for NVidia GPUs in quite a while.

Read the community standards; it's like a guide on how to not be a moron.

 

Gerdauf's Law: Each and every human being, without exception, is the direct carbon copy of the types of people that he/she bitterly opposes.

Remember, calling facts opinions does not ever make the facts opinions, no matter what nonsense you pull.

Link to post
Share on other sites

Just now, Castdeath97 said:

So Nvidia still haven't gotten bored from trying to fit tegra's in cars, *sigh*. All these rumours about mobile GPUs coming out turned out to be false then.

 

I think they're trying to put pascal(or tesla, I don't remember) inside it now o.O

i5 2400 | ASUS RTX 4090 TUF OC | Seasonic 1200W Prime Gold | WD Green 120gb | WD Blue 1tb | some ram | a random case

 

Link to post
Share on other sites

If there's one thing we can all take away from this... Deep Learning 

| MOBO MSI Z97 Gaming 3 | CPU i7-4790K @ 4.7GHz | RAM G.Skill 16GB 2133MHz | SSD Kingston 240GB / HDD Seagate 3TB | GPU MSI Gaming GTX 970 |

| Cooling CM Nepton 240M | Display Philips BDM4065UC 40in 4K | Keyboard Corsair K70 Cherry Blue | Mouse Logitech G502 | Case Fractal Design Core 2500 |

PSU Rankings

Link to post
Share on other sites

3 minutes ago, Pohernori said:

 

I think they're trying to put pascal(or tesla, I don't remember) inside it now o.O

What's next, Gameworks in cars!

If you want to reply back to me or someone else USE THE QUOTE BUTTON!                                                      
Pascal laptops guide

Link to post
Share on other sites

1 minute ago, ivhyn said:

There's driveworks iirc

Maybe we can have hairworks in cars!

If you want to reply back to me or someone else USE THE QUOTE BUTTON!                                                      
Pascal laptops guide

Link to post
Share on other sites

19 minutes ago, Castdeath97 said:

So can anyone give me a TL DR for the event since the OP hasn't updated his post with anything.

Deep Learning

| MOBO MSI Z97 Gaming 3 | CPU i7-4790K @ 4.7GHz | RAM G.Skill 16GB 2133MHz | SSD Kingston 240GB / HDD Seagate 3TB | GPU MSI Gaming GTX 970 |

| Cooling CM Nepton 240M | Display Philips BDM4065UC 40in 4K | Keyboard Corsair K70 Cherry Blue | Mouse Logitech G502 | Case Fractal Design Core 2500 |

PSU Rankings

Link to post
Share on other sites

10 minutes ago, i_build_nanosuits said:


yeah, one post above it's all there...these are the specs for the testla P100 that will start shipping soon for high performance computing...and will most likely be the next big pascal titan gaming GPU for next year of something.

And all the rest is about deep learning, neuro networks...and other skynet related stuff :)

no the tesla P100 will start shipping in Q1 2017, what they are shipping now is sample for ppl who make servers to prepare for next year, nvidia need a year of saving up chips to sell, because the yields of a new process with a 600mm² should be so bad, that most of them will be defective, and since the cost include the defects and the good chips, they cannot make it a consumer product like titan, unless you are prepared to pay 10-15k$ for a gpu.

Link to post
Share on other sites

3 minutes ago, alamox said:

no the tesla P100 will start shipping in Q1 2017, what they are shipping now is sample for ppl who make servers to prepare for next year, nvidia need a year of saving up chips to sell, because the yields of a new process with a 600mm² should be so bad, that most of them will be defective, and since the cost include the defects and the good chips, they cannot make it a consumer product like titan, unless you are prepared to pay 10-15k$ for a gpu.

so how long do you think before the big pascal chip meant for consumer grade cards (next titan) could be hitting the market?

2 years from now?!

| CPU: Core i7-8700K @ 4.89ghz - 1.21v  Motherboard: Asus ROG STRIX Z370-E GAMING  CPU Cooler: Corsair H100i V2 |
| GPU: MSI RTX 3080Ti Ventus 3X OC  RAM: 32GB T-Force Delta RGB 3066mhz |
| Displays: Acer Predator XB270HU 1440p Gsync 144hz IPS Gaming monitor | Oculus Quest 3 VR

Link to post
Share on other sites

The real question is: how many ACE's will Pascal have?

CPU: Intel Core i7 7820X Cooling: Corsair Hydro Series H110i GTX Mobo: MSI X299 Gaming Pro Carbon AC RAM: Corsair Vengeance LPX DDR4 (3000MHz/16GB 2x8) SSD: 2x Samsung 850 Evo (250/250GB) + Samsung 850 Pro (512GB) GPU: NVidia GeForce GTX 1080 Ti FE (W/ EVGA Hybrid Kit) Case: Corsair Graphite Series 760T (Black) PSU: SeaSonic Platinum Series (860W) Monitor: Acer Predator XB241YU (165Hz / G-Sync) Fan Controller: NZXT Sentry Mix 2 Case Fans: Intake - 2x Noctua NF-A14 iPPC-3000 PWM / Radiator - 2x Noctua NF-A14 iPPC-3000 PWM / Rear Exhaust - 1x Noctua NF-F12 iPPC-3000 PWM

Link to post
Share on other sites

10 minutes ago, zMeul said:

none

ACE's are AMD's territory, GeForces have CUDA Cores

ACE refers to  ''Asynchronous Compute Engine'', of which the AMD Fury x has 8. For the express purpose of having different tasks run... you guessed it, asynchronously. On the Fury x diagram, there is also a single ''Graphics Command Proccesor''.

 

Nvidias counterpart on the GM200 chip (titan / 980Ti) would be the ''Gigathread Engine''.

 

 

I doubt Nvidia would use the same name if they opted for running graphics task asynchronously, but i have not heard any information about it.

Motherboard: Asus X570-E
CPU: 3900x 4.3GHZ

Memory: G.skill Trident GTZR 3200mhz cl14

GPU: AMD RX 570

SSD1: Corsair MP510 1TB

SSD2: Samsung MX500 500GB

PSU: Corsair AX860i Platinum

Link to post
Share on other sites

Anything new about the ROP count?

If - and I say IF - AMD manages to increase the performance of single next-gen GCN cores (higher than Maxwell levels) and pack enough ROPs, Vega could beat GP100... I sincerely hope so.

On a mote of dust, suspended in a sunbeam

Link to post
Share on other sites

LOL... I seriously have to laugh at the failed attempts from AMD fangirls to try and discredit Pascal...

 

O and those complaining at no consumer stuff... try keep your panties on till Computex ;)

Spartan 1.0

Spoiler

CPU: Intel Core i7-4770K 3.5GHz Quad-Core Processor

CPU Cooler: Cooler Master Seidon 120XL 86.2 CFM Liquid CPU Cooler

Motherboard: Asus Maximus VI Extreme ATX LGA1150 Motherboard
Memory: Corsair Dominator 32GB (4 x 8GB) DDR3-1600 Memory
Storage: OCZ Vector Series 512GB 2.5" Solid State Drive
Storage: Seagate Desktop HDD 4TB 3.5" 7200RPM Internal Hard Drive

Video Card: EVGA GeForce GTX 980 4GB Classified ACX 2.0 Video Card
Case: Thermaltake Urban S41 ATX Mid Tower Case
Power Supply: Corsair 1200W 80+ Platinum Certified Fully-Modular ATX Power Supply
Optical Drive: LG BH16NS40 Blu-Ray/DVD/CD Writer
Optical Drive: LG BH10LS30 Blu-Ray/DVD/CD Writer
Operating System: Microsoft Windows 10 Pro 64-bit
Sound Card: Creative Labs ZXR 24-bit 192 KHz Sound Card
Monitor: 2x Asus VG278HE 27.0" 144Hz Monitor
Keyboard: Logitech G19s Wired Gaming Keyboard
Keyboard: Razer Orbweaver Elite Mechanical Gaming Keypad Wired Gaming Keyboard
Mouse: Logitech G700s Wireless Laser Mouse
Headphones: Creative Labs EVO ZxR 7.1 Channel  Headset
Speakers: Creative Labs GigaWorks T40 Series II 32W 2ch Speakers

Hades 1.0

Spoiler

Laptop: Dell Alienware 15 2015

CPU: i7-4720HQ CPU

Memory: 16GB DDR3 SODIMM RAM

Storage: 256GB M.2 SSD

Storage: 1TB 5400rpm 2.5" HDD

Screen: 15.6" FHD Display

Video Card: Nvidia GTX 970M with 3GB

Operating System: Windows 10 Pro

Project: Spartan 1.2 PLEASE SUPPORT ME NEW CHANNEL > Tech Inquisition

Link to post
Share on other sites

4 hours ago, Agost said:

Anything new about the ROP count?

If - and I say IF - AMD manages to increase the performance of single next-gen GCN cores (higher than Maxwell levels) and pack enough ROPs, Vega could beat GP100... I sincerely hope so.

it's a MONSTER:

 

http://wccftech.com/nvidia-pascal-gpu-gtc-2016/

 

The GP100 GPU is comprised of  3840 CUDA cores, 240 texture units and a 4096bit memory interface. The 3840 CUDA cores are arranged in six Graphics Processing Clusters, or GPCs for short. Each of these has 10 Pascal Streaming Multiprocessors. As mentioned earlier in the article the Tesla P100 features a cut down GP100 GPU. This cut back version has 3584 CUDA cores and 224 texture mapping units.

 

Each Pascal streaming multiprocessor includes 64 FP32 CUDA cores, half that of Maxwell. Within each Pascal streaming multirprocessor there are two 32 CUDA core partitions, two dispatch units, a warp scheduler and a fairly large instruction buffer, matching that of Maxwell.

 

The massive GP100 GPU has significantly more pascal streaming multiprocessors, or CUDA core blocks.  Because each of these has access to a register file that’s the same size of Maxwell’s 128 CUDA core SMM. This means that each Pascal CUDA core has access to twice the register files. In turn we should expect even more performance out of each Pascal CUDA cores compared to Maxwell.

  • 5.3 teraflops double-precision performance, 10.6 teraflops single-precision performance and 21.2 teraflops half-precision performance with NVIDIA GPU BOOST™ technology

  • 160GB/sec bi-directional interconnect bandwidth with NVIDIA NVLink

    Advertisements
  • 16GB of CoWoS HBM2 stacked memory

  • 720GB/sec memory bandwidth with CoWoS HBM2 stacked memory

  • Enhanced programmability with page migration engine and unified memory

  • ECC protection for increased reliability

  • Server-optimized for highest data center throughput and reliability

    Tesla P100 Boosts To Nearly 1.5Ghz

    Perhaps one of the most exciting, yet perhaps predictable, revaluations about the GP100 Pascal flagship GPU is that it can achieve clocks even higher than Maxwell. Despite Nvidia opting for very conservative clock speeds on its professional GPUs like the Tesla & Quadro products the P100 actually has a base clock speed of 1328mhz and a boost clock speed of 1480mhz. Considering that GPU Boost 2.0 actually allows these cards to operate at even higher clock speeds than the nominal boost clock.

    We’re looking at actual frequencies of upwards of 1500Mhz on the GeForce equivalent of the P100. What is inevitably going to launch as the next GTX Titan. This means boost clocks of even upwards of 1600Mhz on factory overclocked models, and perhaps 2Ghz+ manual overclocks. This should be extremely exciting news to all GeForce fans.

Nvidia Pascal – 2X Perf/Watt With 16nm FinFET, Stacked Memory ( HBM2 ), NV-Link And Mixed Precision Compute

| CPU: Core i7-8700K @ 4.89ghz - 1.21v  Motherboard: Asus ROG STRIX Z370-E GAMING  CPU Cooler: Corsair H100i V2 |
| GPU: MSI RTX 3080Ti Ventus 3X OC  RAM: 32GB T-Force Delta RGB 3066mhz |
| Displays: Acer Predator XB270HU 1440p Gsync 144hz IPS Gaming monitor | Oculus Quest 3 VR

Link to post
Share on other sites

37 minutes ago, i_build_nanosuits said:

it's a MONSTER:

 

http://wccftech.com/nvidia-pascal-gpu-gtc-2016/

 

The GP100 GPU is comprised of  3840 CUDA cores, 240 texture units and a 4096bit memory interface. The 3840 CUDA cores are arranged in six Graphics Processing Clusters, or GPCs for short. Each of these has 10 Pascal Streaming Multiprocessors. As mentioned earlier in the article the Tesla P100 features a cut down GP100 GPU. This cut back version has 3584 CUDA cores and 224 texture mapping units.

 

Each Pascal streaming multiprocessor includes 64 FP32 CUDA cores, half that of Maxwell. Within each Pascal streaming multirprocessor there are two 32 CUDA core partitions, two dispatch units, a warp scheduler and a fairly large instruction buffer, matching that of Maxwell.

 

The massive GP100 GPU has significantly more pascal streaming multiprocessors, or CUDA core blocks.  Because each of these has access to a register file that’s the same size of Maxwell’s 128 CUDA core SMM. This means that each Pascal CUDA core has access to twice the register files. In turn we should expect even more performance out of each Pascal CUDA cores compared to Maxwell.

  • 5.3 teraflops double-precision performance, 10.6 teraflops single-precision performance and 21.2 teraflops half-precision performance with NVIDIA GPU BOOST™ technology

  • 160GB/sec bi-directional interconnect bandwidth with NVIDIA NVLink

    Advertisements
  • 16GB of CoWoS HBM2 stacked memory

  • 720GB/sec memory bandwidth with CoWoS HBM2 stacked memory

  • Enhanced programmability with page migration engine and unified memory

  • ECC protection for increased reliability

  • Server-optimized for highest data center throughput and reliability

    Tesla P100 Boosts To Nearly 1.5Ghz

    Perhaps one of the most exciting, yet perhaps predictable, revaluations about the GP100 Pascal flagship GPU is that it can achieve clocks even higher than Maxwell. Despite Nvidia opting for very conservative clock speeds on its professional GPUs like the Tesla & Quadro products the P100 actually has a base clock speed of 1328mhz and a boost clock speed of 1480mhz. Considering that GPU Boost 2.0 actually allows these cards to operate at even higher clock speeds than the nominal boost clock.

    We’re looking at actual frequencies of upwards of 1500Mhz on the GeForce equivalent of the P100. What is inevitably going to launch as the next GTX Titan. This means boost clocks of even upwards of 1600Mhz on factory overclocked models, and perhaps 2Ghz+ manual overclocks. This should be extremely exciting news to all GeForce fans.

Nvidia Pascal – 2X Perf/Watt With 16nm FinFET, Stacked Memory ( HBM2 ), NV-Link And Mixed Precision Compute

you understand that the numbers are bad for a chip of that size ? it's a freaking 600mm² chip with barely 20% more compute than a furyX in single precision about the same double precision as a 295x2, on a node twice as small, if the tesla P100 was 300mm² size that would be impressive, but those numbers are bad news for a 16nm 600mm² chip, especialy at that clock 1328mhz with 300TDP, seriously how big a chip should be to have double the compute ?

the expected compute from 16nm 600mm² chip should be around 18/20teraflops single precision, and about 10teraflops double precision, and even more because of the clock speed

Link to post
Share on other sites

19 minutes ago, alamox said:

you understand that the numbers are bad for a chip of that size ? it's a freaking 600mm² chip with barely 20% more compute than a furyX in single precision about the same double precision as a 295x2, on a node twice as small, if the tesla P100 was 300mm² size that would be impressive, but those numbers are bad news for a 16nm 600mm² chip, especialy at that clock 1328mhz with 300TDP, seriously how big a chip should be to have double the compute ?

the expected compute from 16nm 600mm² chip should be around 18/20teraflops single precision, and about 10teraflops double precision, and even more because of the clock speed

A 295x2 is 50% larger at 900mm and consumes twice as much power while providing half the performance. 

 

So I don't really think that pascal's performance is "bad news".

PSU Tier List | CoC

Gaming Build | FreeNAS Server

Spoiler

i5-4690k || Seidon 240m || GTX780 ACX || MSI Z97s SLI Plus || 8GB 2400mhz || 250GB 840 Evo || 1TB WD Blue || H440 (Black/Blue) || Windows 10 Pro || Dell P2414H & BenQ XL2411Z || Ducky Shine Mini || Logitech G502 Proteus Core

Spoiler

FreeNAS 9.3 - Stable || Xeon E3 1230v2 || Supermicro X9SCM-F || 32GB Crucial ECC DDR3 || 3x4TB WD Red (JBOD) || SYBA SI-PEX40064 sata controller || Corsair CX500m || NZXT Source 210.

Link to post
Share on other sites

11 minutes ago, djdwosk97 said:

A 295x2 is 50% larger at 900mm and consumes twice as much power while providing half the performance. 

 

So I don't really think that pascal's performance is "bad news".

Considering it's a 2013 architecture, then yeah kinda. You have to compare to contemporary architectures. Fiji and Vega makes this look less than impressive.

Watching Intel have competition is like watching a headless chicken trying to get out of a mine field

CPU: Intel I7 4790K@4.6 with NZXT X31 AIO; MOTHERBOARD: ASUS Z97 Maximus VII Ranger; RAM: 8 GB Kingston HyperX 1600 DDR3; GFX: ASUS R9 290 4GB; CASE: Lian Li v700wx; STORAGE: Corsair Force 3 120GB SSD; Samsung 850 500GB SSD; Various old Seagates; PSU: Corsair RM650; MONITOR: 2x 20" Dell IPS; KEYBOARD/MOUSE: Logitech K810/ MX Master; OS: Windows 10 Pro

Link to post
Share on other sites

1 minute ago, Notional said:

Considering it's a 2013 architecture, then yeah kinda. You have to compare to contemporary architectures. Fiji and Vega makes this look less than impressive.

I never said I was impressed by pascal. Fiji has 4.3 tflops, is 600mm, and consumes about 300w. So pascal is about 20% faster than Fiji, that's still a decent generational improvement, although I still expect Polaris to pull ahead in compute. 

 

PSU Tier List | CoC

Gaming Build | FreeNAS Server

Spoiler

i5-4690k || Seidon 240m || GTX780 ACX || MSI Z97s SLI Plus || 8GB 2400mhz || 250GB 840 Evo || 1TB WD Blue || H440 (Black/Blue) || Windows 10 Pro || Dell P2414H & BenQ XL2411Z || Ducky Shine Mini || Logitech G502 Proteus Core

Spoiler

FreeNAS 9.3 - Stable || Xeon E3 1230v2 || Supermicro X9SCM-F || 32GB Crucial ECC DDR3 || 3x4TB WD Red (JBOD) || SYBA SI-PEX40064 sata controller || Corsair CX500m || NZXT Source 210.

Link to post
Share on other sites

Just now, djdwosk97 said:

I never said I was impressed by pascal. Fiji has 4.3 tflops, is 600mm, and consumes about 300w. So pascal is about 20% faster than Fiji, that's still a decent generational improvement, although I still expect Polaris to pull ahead in compute. 

 

Never claimed you did?

However considering this chip is larger than Fiji, and using a node almost half as large as Fiji's, including a much higher clockrate, it's really not impressive. The more I think about it, the less impressive Pascal is.

Watching Intel have competition is like watching a headless chicken trying to get out of a mine field

CPU: Intel I7 4790K@4.6 with NZXT X31 AIO; MOTHERBOARD: ASUS Z97 Maximus VII Ranger; RAM: 8 GB Kingston HyperX 1600 DDR3; GFX: ASUS R9 290 4GB; CASE: Lian Li v700wx; STORAGE: Corsair Force 3 120GB SSD; Samsung 850 500GB SSD; Various old Seagates; PSU: Corsair RM650; MONITOR: 2x 20" Dell IPS; KEYBOARD/MOUSE: Logitech K810/ MX Master; OS: Windows 10 Pro

Link to post
Share on other sites

5 minutes ago, Notional said:

Never claimed you did?

However considering this chip is larger than Fiji, and using a node almost half as large as Fiji's, including a much higher clockrate, it's really not impressive. The more I think about it, the less impressive Pascal is.

The chip itself is virtually the same size -- both are about 600mm. And I don't think clock speeds mean all that much as long as power consumption (and heat) and performance are decent. Now the fact that it's clocked so high and on a significantly smaller node and only yields a 20% performance gain doesn't necessarily bode well for gaming performance (assuming gaming performance follows in compute's footsteps -- which isn't necessarily the case).

 

Also, are there other factors to consider aside from raw tflops/power consumption/heat when it comes to compute that could affect performance? 

PSU Tier List | CoC

Gaming Build | FreeNAS Server

Spoiler

i5-4690k || Seidon 240m || GTX780 ACX || MSI Z97s SLI Plus || 8GB 2400mhz || 250GB 840 Evo || 1TB WD Blue || H440 (Black/Blue) || Windows 10 Pro || Dell P2414H & BenQ XL2411Z || Ducky Shine Mini || Logitech G502 Proteus Core

Spoiler

FreeNAS 9.3 - Stable || Xeon E3 1230v2 || Supermicro X9SCM-F || 32GB Crucial ECC DDR3 || 3x4TB WD Red (JBOD) || SYBA SI-PEX40064 sata controller || Corsair CX500m || NZXT Source 210.

Link to post
Share on other sites

3 minutes ago, djdwosk97 said:

The chip itself is virtually the same size -- both are about 600mm. And I don't think clock speeds mean all that much as long as power consumption (and heat) and performance are decent. Now the fact that it's clocked so high and on a significantly smaller node and only yields a 20% performance gain doesn't necessarily bode well for gaming performance (assuming gaming performance follows in compute's footsteps -- which isn't necessarily the case).

 

Also, are there other factors to consider aside from raw tflops/power consumption/heat when it comes to compute that could affect performance? 

In gaming, sure, compute won't be the be all end all. We've seen this with Maxwell contra Fiji in DX11, but DX12 will use async compute more and more, so gaming cards are wise to support it extensively.

 

As for compute, it also depends on how you program for it, and what type of programming you use, etc, etc. Are we talking general compute or async compute for gaming?

 

Point is, that if Pascal is "only" Maxwell with some added compute functionality and a die shrink, then it's not improbably to see AMD with a new architecture, excellent compute capabilities and a die shrink, beating out Pascal.

 

But afaik, we won't see Big Pascal or Vega until December/January, so it's difficult to say anything about the koth cards.

Watching Intel have competition is like watching a headless chicken trying to get out of a mine field

CPU: Intel I7 4790K@4.6 with NZXT X31 AIO; MOTHERBOARD: ASUS Z97 Maximus VII Ranger; RAM: 8 GB Kingston HyperX 1600 DDR3; GFX: ASUS R9 290 4GB; CASE: Lian Li v700wx; STORAGE: Corsair Force 3 120GB SSD; Samsung 850 500GB SSD; Various old Seagates; PSU: Corsair RM650; MONITOR: 2x 20" Dell IPS; KEYBOARD/MOUSE: Logitech K810/ MX Master; OS: Windows 10 Pro

Link to post
Share on other sites

I really hope the only reason Nvidia did not announce any new pascal gaming cards is that they did not want to harm the sales of their 900-series, and they want to wait with the announcement right until they are ready to ship the cards - crossing my fingers for computex. I mean, they probably could have said everything about pascal if they wanted to, they even had two pascal modules ready for the drive px 2 system they showed. 

 Asus ROG Strix GTX 1080 | Intel i5-6600K@4.6GHz | Asus Maxiumus VIII Hero | G-Skill Trident Z 2X8GB DDR4 3200MHz

 Phanteks Enthoo Evolv ATX TG | Samsung 850 Evo 1TB | Fractal Design Kelvin S36 | EVGA SuperNova G2 750W

Acer Predator X34A | Logitech Proteus Spectrum G502 | Corsair K70 Cherry MX Brown

Link to post
Share on other sites

Create an account or sign in to comment

You need to be a member in order to leave a comment

Create an account

Sign up for a new account in our community. It's easy!

Register a new account

Sign in

Already have an account? Sign in here.

Sign In Now


×