The AMD Fiji Pro GPU
AMD Fiji
So, AMD likes to stick to island series GPUs for their product development codenames, for the Fury series the Fiji GPU gets its denomination from there. AMD Fiji is the new GPU used, we are going to talk about the GPU now and through part of the next page we'll talk a little more in-depth about the HBM that sits on that GPU. The Fiji series of GPUs is the next iteration of the GCN (Graphics Core next) architecture. Fiji is based on revision 1.2 GCN, just like Tonga (285 & 380 series). Basically, if we took Tonga and scale that upwards that would be Fiji (but that is an overly generalized explanation).
Fiji is still based on a 28nm fabrication node, 20nm failed and until we see 16 and 14nm viable for big GPUs, the 28nm node really is the only alternative. That was the same dilemma for Nvidia with the GTX 980 Ti of course. Both GPUs as a result are big. For Fiji that Mac Daddy is 5x5cm in size. Bigger means higher voltage usage, and thus more heat to deal with. You can only imagine what a chip like this would be like on 14nm, at half the size. AMD did huge things with Fiji though, they tucked away 8.9 Billion transistors on the GPU and that's the number without the memory. It gives them a staggering 8.6 TFLOPS of performance to work with. That, combined with HBM memory, is to be considered serious stuff. The GPU has a whopping amount of transistors. At 28nm AMD applied the GCN 1.2 architecture, albeit you will learn that there have been a few tweaks at cache levels as the L2 cache is now 2MB for example.
An interesting step from AMD was the move towards a 4096-bit memory bus thanks to HBM. Nobody really expected it as in the past AMD merely placed GDDR on their products. You will notice very similar structures compared to Tonga (Radeon R9 265 and 380). This is GCN 1.2, yet multitudes of it. The Fiji XT GPU as such has 4096 shader processors / and Fiji Pro 3584 shader proceccors, these are clustered up in groups of 64 / 56 shader processors per CU (compute unit).
- 64 units times 64 shaders = 4096.
- 56 units times 64 shaders = 3584.
The 1000 MHz core clock frequency is not a fixed clock on these cards. But in almost all scenarios we have seen it stick precisely at 1000 MHz. But sure, if the card passes its power limitations or overheats then the card would clock up/down in-between these monitored limiters.