Nvidia Slide reveals numbers on Single and Double precision for Flagship Pascal GPU

Published by

teaser

A slide from an Nvidia presentation is stirring some things on the web as it seems as some info on the Flagship GPU on the new 16nm Pascal architecture (likely named GP100) would be able to perform numbers that are astonishing.



Slides from "The Future of HPC and The Path to Exascale" shows a roadmap with a DP GFLOPS/W value for Pascal. The presentation's date is between the GTC 2013 roadmap which does not contain Pascal and the GTC 2014 roadmap which does contain Pascal. But thus this presentation is dated (2015) and likely estimated, we do have to say that.

The slides from "Manuel Ujaldón CUDA Fellow @ Nvidia" however show performance numbers for both single and double precision, double-precision floating-point (DPFP) wise Nvidia seems to be reaching (or is aiming for) a 4 TFLOP/s throughput (at least on their HPC parts). That would be 3x over the current 1.3 TFLOP/s on the Tesla K20 which on it's end is based on "Kepler" GK110 silicon. 

  


  

Single-precision then, it would be as high as 12 TFLOP/s. That is four times that a GK110, and roughly double a GM200 (6.4 TFLOPS for the 980 Ti). The slide does reveal one other thing, the GP100 is inidicated to use stacked HBM2 memory as the memory bandwidth is set at 1 TB/s. 

Download the presentation here. Well .. yummie !?

Nvidia Slide reveals numbers on Single and Double precision for Flagship Pascal GPU


Share this content
Twitter Facebook Reddit WhatsApp Email Print