CUDA core "strength"
Different architectures run at different speeds. It's the same situation as in CPUs: Zen 3 cores don't perform the same as Zen 1 cores, which don't perform the same as Rocket Lake cores, which don't perform the same as...
Ampere's CUDA cores appear much weaker because Nvidia decided to be deceptive about CUDA core counts, in basically the same way AMD was with its Bulldozer CPUs. Ampere has two CUDA processors per SM (streaming multiprocessor), but only the CUDA cores are duplicated; none of the other supporting infrastructure is. This provides a huge performance uplift in some applications, mostly compute, by nearly doubling the raw floating-point throughput of each SM, but it provides almost no benefit in gaming. With regard to gaming, each SM is a bit faster, but it's slower per CUDA processor because there are two of them now. Of course the marketing team ran with the CUDA core numbers, because those are bigger, despite not actually being comparable to previous generations.
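To put rough numbers on the "faster per SM, slower per core" point, here is a small Python sketch. The speedup figures are hypothetical placeholders picked only to show the arithmetic, not measurements, and `per_core_speed` is a helper name made up for this example.

```python
# Hypothetical illustration only: the speedup values below are made-up
# placeholders, not benchmark results.

def per_core_speed(sm_speedup: float, cores_per_sm_ratio: float) -> float:
    """Relative per-core performance when an SM gets faster but also counts more cores."""
    return sm_speedup / cores_per_sm_ratio

# Assumed: each SM is 1.2x faster in games while advertising 2x the CUDA cores.
gaming = per_core_speed(sm_speedup=1.2, cores_per_sm_ratio=2.0)

# Assumed: a compute workload that nearly doubles per-SM throughput.
compute = per_core_speed(sm_speedup=1.9, cores_per_sm_ratio=2.0)

print(f"gaming:  {gaming:.2f}x per advertised core")
print(f"compute: {compute:.2f}x per advertised core")
```

Under those assumptions a gaming workload looks far worse per advertised core than a compute workload does, even though the SM itself got faster in both cases.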
7 minutes ago, DANK_AS_gay said:
980 Ti = 2816 cores @ 1000 MHz
780 Ti = 2880 cores @ 876 MHz
The 780 Ti is rated for 210 GFLOPS double precision.
The 980 Ti is rated for 176 GFLOPS double precision.
The GTX 580 is good for 790 GFLOPS in double-precision mode, since after the 500 series Nvidia started disabling the DP units in GeForce chips and saving them for the Quadros/Teslas.
Where did you even find this? The 780 Ti and 980 Ti numbers are roughly correct, but the 580 number is 4 times too high; it should be about 197 GFLOPS.
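For anyone who wants to check the arithmetic, here is a rough Python sketch of the usual peak-throughput formula (2 FLOPs per core per clock, scaled by the FP64:FP32 ratio). The 780 Ti and 980 Ti core counts and clocks are the ones quoted above. The GTX 580 core count and shader clock (512 cores at 1544 MHz) and all three FP64 ratios (1/8, 1/24, 1/32) are not from this thread; they are assumptions taken from the commonly published specs, so treat the output as a sanity check rather than an official rating.

```python
# Peak-throughput sanity check for the three cards discussed in the thread.
# Assumptions (not from the thread): GTX 580 = 512 cores @ 1544 MHz shader clock,
# and FP64:FP32 ratios of 1/8 (GTX 580), 1/24 (780 Ti), 1/32 (980 Ti).

def peak_gflops(cores: int, clock_mhz: float, fp64_ratio: float = 1.0) -> float:
    """Peak GFLOPS: 2 FLOPs (one fused multiply-add) per core per clock, times the FP64 ratio."""
    return 2 * cores * clock_mhz / 1000 * fp64_ratio

# name -> (CUDA cores, clock in MHz, assumed FP64:FP32 ratio)
cards = {
    "GTX 580": (512, 1544, 1 / 8),
    "GTX 780 Ti": (2880, 876, 1 / 24),
    "GTX 980 Ti": (2816, 1000, 1 / 32),
}

for name, (cores, clock, ratio) in cards.items():
    fp64 = peak_gflops(cores, clock, ratio)
    print(f"{name}: {fp64:.1f} GFLOPS FP64")

# What the GTX 580 would reach at a 1/2 FP64 ratio, for comparison with the quoted 790.
gtx580_half_rate = peak_gflops(512, 1544, 1 / 2)
print(f"GTX 580 at a 1/2 ratio: {gtx580_half_rate:.1f} GFLOPS FP64")
```

With those assumed ratios the formula lands on roughly the 197, 210 and 176 GFLOPS figures above, and the quoted 790 figure is what the same formula gives for the 580 at a 1/2 ratio instead of 1/8.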
