Nvidia Announces PCI-Express version of Tesla V100 accelerator
Nvidia announced a PCI Expres version 'card' of its the Tesla V100 accelerator, to be releases later this year. The unit has 16GB HBM2 memory and the Volta GPU has been fitted with 5120 sharder processors. .
The PCI-E card of the Tesla V100 is intended to be a server part for stuff like deep-learning, research and analysis. The card is clocked slightly slower compared to the original module model and comes with a TDP of 250 watts. Nvidia claims these cards will be releases 'later this year'.
Specifications of the PCIe form factor include:
- 7 teraflops double-precision performance, 14 teraflops single-precision performance and 112 teraflops half-precision performance with NVIDIA GPU BOOST™ technology
- 16GB of CoWoS HBM2 stacked memory, delivering 900GB/sec of memory bandwidth
- Support for PCIe Gen 3 interconnect (up to 32GB/sec bi-directional bandwidth)
- 250 watts of power
Tesla V100 (SXM2) |
Tesla V100 (PCIe) |
Tesla P100 (SXM2) |
Tesla P100 (PCIe) |
|
Architecture | Volta | Volta | Pascal | Pascal |
Gpu | GV100 (815mm2) | GV100 (815mm2) | GP100 (610mm2) | GP100 (610mm2) |
Shader cores | 5120 | 5120 | 3584 | 3584 |
Tensor cores | 640 | 640 | AFTER | AFTER |
Core Speed | ? | ? | 1328MHz | ? |
Boost Clock | 1455MHz | ~ 1370MHz | 1480MHz | 1300MHz |
Memory Speed | 1.75Gbps HBM2 | 1.75Gbps HBM2 | 1.4Gbps HBM2 | 1.4Gbps HBM2 |
Memory | 4096-bit | 4096-bit | 4096-bit | 4096-bit |
memory Bandwidth | 900GB / sec | 900GB / sec | 720GB / sec | 720GB / sec |
Vram | 16GB | 16GB | 16GB | 16GB |
L2 cache | 6MB | 6MB | 4MB | 4MB |
Half Precision | 30 TFLOPS | 28 TFLOPS | 21.2 TFLOPS | 18.7 TFLOPS |
Single Precision | 15 TFLOPS | 14 TFLOPS | 10.6 TFLOPS | 9.3 TFLOPS |
Double Precision | 7.5 TFLOPS (1/2 rate) |
7 TFLOPS (1/2 rate) |
5.3 TFLOPS (half rate) |
4.7 TFLOPS (1/2 rate) |
Tensor Performance (Deep Learning) |
120 TFLOPS | 112 TFLOPS | AFTER | AFTER |
Transistors | 21 billion | 21 billion | 15.3 billion | 15.3 billion |
TDP | 300W | 250W | 300W | 250W |
Form Factor | Mezzanine (SXM2) | PCIe | Mezzanine (SXM2) | PCIe |
Process | TSMC 12nm FFN | TSMC 12nm FFN | TSMC 16nm FinFET | TSMC 16nm FinFET |
One of the biggest innovations of the V100 compared to the P100 are the all new Tensor Cores, the GV100 GPU has 640 them: eight per sm. Nvidia claims huge performance gains for applications that can make use of it. At regular fp32- and fp64 calculations the GV100 is about 1.5 times as fast as the GP100. NVIDIA Tesla V100 GPU accelerators for PCIe-based systems are expected to be available later this year from NVIDIA reseller partner and manufacturers, including Hewlett Packard Enterprise (HPE).
NVIDIA Adds GeForce GTX 1060 to Prepare for Battle bundle - 03/29/2017 08:46 AM
NVIDIA is adding the GeForce GTX 1060 GPU to their Prepare for Battle GeForce GTX bundle, which already includes GeForce GTX 1070, 1080 and 1080Ti GPUs. Gamers can get a free copy of either For Honor ...
Nvidia Announces GeForce GTX 1080 Ti at 699 USD - 03/01/2017 10:42 AM
Nvidia has been hosting a livestream making some announcments, the new high-end GTX 1080 Ti features 3584 CUDA Cores, 224 Texture Units, a 352-bit memory controller and 11 GB of GDDR5X memory. This me...
AMD and NVIDIA AIB GPU Market Share from 2002 to 2016 - 11/24/2016 12:39 PM
An interesting slide has been compiled that shows the varying market share relative to sales for add in board graphics cards sales from both AMD and NVIDIA relative to generational releases....
NVIDIA Adds Telemetry to Latest Drivers - How To Disable It - 11/07/2016 10:00 AM
Over the weekend some news broke that Nvidia has added telemetry logging into it's driver. We know that GeForce Experience already is collecting users, hardware and game data, but new to us is that t...
Nvidia announces GeForce GTX 1050 and 1050 Ti - 10/18/2016 05:28 PM
Nvidia is excited to announce the newest members of the NVIDIA GeForce Family, the GeForce GTX 1050 and GeForce GTX 1050 Ti. The GeForce GTX 1050 delivers great experiences for every gamer on a budget...
Senior Member
Posts: 5838
Joined: 2003-09-15
Nice!
•7 teraflops double-precision performance, 14 teraflops single-precision performance and 112 teraflops half-precision performance with NVIDIA GPU BOOST™ technology
•16GB of CoWoS HBM2 stacked memory, delivering 900GB/sec of memory bandwidth
•Support for PCIe Gen 3 interconnect (up to 32GB/sec bi-directional bandwidth)
•250 watts of power
112 Tflops of 16bit performance? Really Nvidia? That can't be real!?!
Senior Member
Posts: 586
Joined: 2008-06-20
Nvidia continues its mega streak after the beautifully managed Pascal lineup!
Can't wait for Volta GeForce!
Senior Member
Posts: 390
Joined: 2017-06-09

There wont be one. There will be a Quadro V100 like the Quadro P100. It would play games too.
Senior Member
Posts: 3460
Joined: 2011-05-10
112 Tflops of 16bit performance? Really Nvidia? That can't be real!?!
Tensor cores.
There wont be one. There will be a Quadro V100 like the Quadro P100. It would play games too.
Okay.. GV102 then. Think we know what I meant

Senior Member
Posts: 3460
Joined: 2011-05-10
I can't wait for a GeForce variant