Maxwell Graphics Architecture
Technology & Specifications (Reference)
The GeForce GTX 970 and 980 are based on the latest iteration of Maxwell GPU architecture, the cards use revision A1 of GM204, as explained the 20nm node is not yet ready and these products are based on a 28nm fab node. That will make the chips relatively large in size. Maxwell is an advanced design, it has 5.2 Billion transistors tucked away in a S-FCBGA chip. It has a 398 mm² die size. GeForce GTX 970 comes with 1664 CUDA (shader) cores while its bigger brother the GeForce GTX 980 has 2048 shader processors. The change in shader amount is amongst the biggest differentials together with ROP (64!) and TMU count.
- GeForce GTX 970 has 1664 shader processors and 4 GB of gDDR5 graphics memory.
- GeForce GTX 980 has 2048 shader processors and 4 GB of gDDR5 graphics memory.
The product is obviously PCI-Express 3.0 ready, it has a max TDP of around 165 Watts with a typical idle power draw of 10 Watts. That TDP is a maximum overall, and on average your GPU will not consume that amount of power. But let me show you the coolest photo of the GPU die:
NVIDIA GM204 Maxwell architecture GPU - you can see the SMX clusters.road. Also, occasionally, Turbo Boost was used to allow KITT to accelerate to incredible speeds in excess of 200 mph (322 km/h). The boosters could fire forward and backward.
The GM204 is based off the Maxwell architecture, as such you will get the pre-modelled SMX clusters of what is now 128 shader processors per cluster (used to be 192 on Kepler). There are 16 active clusters for the GTX 980, times 128 shader processors which thus offers you 2048 shader processors. The reference GeForce GTX 980 has a core clock frequency of 1126 MHz with a Boost frequency that can run up to 1216 MHz. The GTX 970 on the other hand has 13 of these clusters available x 128 = 1664 shader processors, there the base clock frequency is 1050 MHz with a boost frequency up-to 1178 MHz. As far as the memory specs of the GM204 Maxwell GPU are concerned, these boards will feature a 256-bit memory bus connected to 4 GB of GDDR5 video buffer memory, AKA VRAM AKA your framebuffer AKA graphics memory for the graphics card. On the memory controller side of things you'll see that the reference memory clock (effective data-rate) is now set at 7 GHz / Gbps. The GeForce GTX 900 series is DirectX 11.3 and 12 ready, with Windows 8, 7 and Vista also being compatible to take advantage of DirectCompute, multi-threading, hardware tessellation and the latest shader 5.0 extensions. The latest revision of DX12 is a Windows 8 feature only, yet will bring in significant optimizations. DirectX 12 - Direct 3D 12 (low overhead – cross-platform – ready now).
- Features: Rasterizer Ordered
Typed UAV load
Volume tiles resources
- Low overhead – Reduce CPU overhead – increase scalability across platforms – Superset of DirectX 11 rendering functionality.
- Cross Platform
For your reference here's a quick overview of some past generation high-end GeForce cards. Yes, the Maxwell products might seem slower if you look at the specs, but they are heavily optimized and are running at fairly high clock frequencies.
|GeForce GTX||780||Titan||780 Ti||Titan Black||Titan Z||970||980|
|Stream (Shader) Processors||2304||2688||2880||2880||5760||1664||2048|
|Core Clock (MHz)||863||836||875||889||705||1050||1126|
|Memory Clock (effective MHz)||6000||6000||7000||7000||7000||7000||7000|
With 4 GB per GPU on the GTX 970 and 980 you will have a very nice amount of graphics memory available. The hardware engineers at Nvidia reworked the memory subsystem quite a bit, enabling much higher memory clock frequency speeds compared to previous generation GeForce GPUs. The result is this; memory speeds up-to 7 Gbps. Combined with some clever advancements in color compression Nvidia can claim even more bandwidth as Maxwell cards now use 3rd generation delta color compression. (ex. 7 Gbps *1/75%) = 9.3 Gbps effective bandwidth thanks to enhanced color compression and enhanced caching techniques.
The GALAX GeForce GTX 980 HOF edition graphics card runs at faster clocks; 1304 MHz GPU with a 1418 MHz Turbo Boost and memory at 7.0 GHz (effective)
The Hoff's (Michael Knight) card was called KITT - and has a Knight Industries Two Thousand microprocessor (SKU code AD227529) with "self-aware" cybernetic logic module. (editor: Did it sound like a good idea?). Much like Nvidia offers these days, KITT had a Mobile Unit as well, it was called F.L.A.G. and had an Aerodynamic Sleeping Cab (something Nvidia still has not implemented even in 2014). The reaction time of KITT is one nanosecond, and his "memory" capacity was an amazing 1,000 megabits (125 MegaByte). KITT's future capacity is unlimited. KITT is armored with "Tri-Helical Plasteel 1000 MBS" (Molecular Bonded Shell) back plating whereas the HoF card uses only an Anodized aluminum backplate.
KITT has Turbo Boost, a duo of rocket boosters mounted just behind the front tires that lifted the car, allowing KITT to jump into the air and pass over obstacles. The Galax HOF also has a turbo boost, running at 1418 MHz. KITTs Turbo Boost was used to allow KITT to accelerate to incredible speeds in excess of 200 mph (322 km/h).