NVIDIA talks about Pascal - Will be fast and has 3D Stacked Memory
Over at GTC, NVIDIA CEO Jen-Hsun Huang forcasted that the future 16nm Pascal GPU will be up to ten times faster than Maxwel in terms of deep learning performance. Pascal will offer up to 32GB of 3D-stacked memory, mixed-precision computing support and NVIDIA's NVLink high-speed interconnect when it launches in 2016.
NVIDIA’s Pascal GPU architecture, set to debut next year, will accelerate deep learning applications 10X beyond the speed of its current-generation Maxwell processors. So read that well, not 10x faster, but 10x faster deep learning applications. And hey, then there is the memory, up to 32GB of 3D-stacked memory (HBM) with oh say .. 750GB/s of bandwidth.
NVIDIA CEO and co-founder Jen-Hsun Huang revealed details of Pascal and the company’s updated processor roadmap in front of a crowd of 4,000 during his keynote address at the GPU Technology Conference, in Silicon Valley.
“It will benefit from a billion dollars worth of refinement because of R&D done over the last three years,” he told the audience.
The rise of deep learning – the process by which computers use neural networks to teach themselves – led NVIDIA to evolve the design of Pascal, which was originally announced at last year’s GTC. Pascal GPUs will have three key design features that will result in dramatically faster, more accurate training of richer deep neural networks – the human cortex-like data structures that serve as the foundation of deep learning research. Along with up to 32GB of memory — 2.7X more than the newly launched NVIDIA flagship, the GeForce GTX TITAN X — Pascal will feature mixed-precision computing. It will have 3D memory, resulting in up to 5X improvement in deep learning applications. And it will feature NVLink – NVIDIA’s high-speed interconnect, which links together two or more GPUs — that will lead to a total 10X improvement in deep learning.
Mixed-Precision Computing – for Greater Accuracy
Mixed-precision computing enables Pascal architecture-based GPUs to compute at 16-bit floating point accuracy at twice the rate of 32-bit floating point accuracy. Increased floating point performance particularly benefits classification and convolution – two key activities in deep learning – while achieving needed accuracy.
3D Memory – for Faster Communication Speed and Power Efficiency
Memory bandwidth constraints limit the speed at which data can be delivered to the GPU. The introduction of 3D memory will provide 3X the bandwidth and nearly 3X the frame buffer capacity of Maxwell. This will let developers build even larger neural networks and accelerate the bandwidth-intensive portions of deep learning training. Pascal will have its memory chips stacked on top of each other, and placed adjacent to the GPU, rather than further down the processor boards. This reduces from inches to millimeters the distance that bits need to travel as they traverse from memory to GPU and back. The result is dramatically accelerated communication and improved power efficiency.
NVLink – for Faster Data Movement
The addition of NVLink to Pascal will let data move between GPUs and CPUs five to 12 times faster than they can with today’s current standard, PCI-Express. This is greatly benefits applications, such as deep learning, that have high inter-GPU communication needs.
NVLink allows for double the number of GPUs in a system to work together in deep learning computations. In addition, CPUs and GPUs can connect in new ways to enable more flexibility and energy efficiency in server design compared to PCI-E.
NVIDIA Tegra X1 Mobile Processor Released - 01/05/2015 10:51 AM
Nvidia releases its NVIDIA Tegra X1 mobile processor. The latest mobile processor has a 256-core Maxwell architecture GPU, 8 ARM CPU cores, and can deliver 60 frames per second of 4K video. It packs o...
NVIDIA Tesla K80 dual-GPU Compute Accelerator - 11/17/2014 07:17 PM
The Tesla K80 dual-GPU is the new flagship offering of the Tesla Accelerated Computing Platform, the leading platform for accelerating data analytics and scientific computing. It combines the world's...
NVIDIA To Launch GRID based On-Demand Game Streaming Service - 11/14/2014 09:30 AM
Nvidia will be launching a cloud gaming service called Nvidia Grid alongside the upcoming update. Grid allows owners of either Nvidia Shield device to stream Windows PC games from the cloud, and fro...
NVIDIA to offer Ubisoft Bundle - 11/05/2014 10:43 AM
Nvidia is releasing the “Pick Your Path” bundle, this means that anyone that buys a GTX 980, 970, 780 Ti, 780 or 980M or 970M notebook get their choice of one of three games: A...
NVIDIA Turf Effects GameWorks technology Video - 10/24/2014 10:02 AM
NVIDIA Turf Effects is a new NVIDIA GameWorks technology which empowers users to simulate and render massive grass simulation with physical interaction. Our grass technology provides a fully geometric...
Senior Member
Posts: 110
Joined: 2012-12-24
Does this mean - next gen GPUs will have 12, 16 and 24 GB of vRAM? And the high-end Titan model 32 GB?
Seems to be a big leap for one generation.. I doubt this will happen, sounds more like a marketing trick to me.
Senior Member
Posts: 1786
Joined: 2012-10-07
I wonder if this has any relevance to gaming? The extra memory bandwidth is gonna help, but I don't know if anything else he said is gonna specifically help gaming??
Senior Member
Posts: 3695
Joined: 2009-01-03
Does this mean - next gen GPUs will have 12, 16 and 24 GB of vRAM? And the high-end Titan model 32 GB?
Seems to be a big leap for one generation.. I doubt this will happen, sounds more like a marketing trick to me.
It's expected, the only way VRAM is heading is upwards. 3D stacked allows density increase much faster than previous generations, which required shrinking of manufacturing process.
You have mobile phones with bigger resolutions than your average PC now. Games/technology will be aiming push 4K an above resolution plus VR is starting take off too, so demands on VRAM capacity will be greater. Basically don't buy card now thinking 8GB VRAM will be enough to be future proof for 3-4 years, because it won't be.
Senior Member
Posts: 14639
Joined: 2014-07-21
I don't see anything besides the stacked memory to have much relevance to gaming. I guess that deep learning thing is more directed at a professional, cumputing interested audience.
Senior Member
Posts: 1261
Joined: 2013-02-22
This sounds like its being way over hyped i doubt much is going to change.