AMD Radeon RX Vega 64 8GB review

Graphics cards 1022 Page 5 of 34 Published by


Architecture and Specifications

Product overview

Vega 10 is being fabbed on Global Foundries’ 14nm FinFET-based process technology, which is the densest foundry process available. FinFET transistors are crucial to reducing power consumption and enabling operating voltages that are 150mV lower than the previous generation, thereby cutting active power by 30% from a 1V baseline. For the last six years, graphics processors have relied on 28nm high-k/metal nodes. We have moved towards the last gen 14nm FinFET. The most affordable model will be the Radeon RX Vega 56 which will offer 10.5 TFLOPS of fp32 performance at a price of 399 USD. All three cards have been fitted with 8 GB of HBM2 graphics memory, two stacks thus with a 2048-bit memory bus. The 64 editions will offer 484 GB/s of bandwidth, the 56 shader cluster edition get 410 GB/s of graphics memory bandwidth. So when we break down the products in a more simplistic way we see the following: 

AMD Radeon RX Vega 64 LCS RX Vega 64 LE RX Vega 56 RX 580 RX 570 RX 560
Graphics Core Vega 10 Vega 10 Vega 10 Polaris 20 XTX Polaris 20 Pro Polaris 
Fabrication Process 14 nm FinFET 14 nm FinFET 14 nm FinFET 14 nm FinFET 14 nm FinFET 14 nm FinFET
Single Precision fp32 13.7 TFLOPs 12.66 TFLOPs  10.5 TFLOPs 6.1 TFLOPs 5.1 TFLOPs ~2 TFLOPs
Memory 8GB HBM2 8GB HBM2 8 GB HBM2 4/8 GB GDDR5 4/8 GB GDDR5 2/4 GB GDDR5
Clock Frequency 
Base / Boost
1,406 MHz
1,677 MHz
1,247 MHz
1,546 MHz
1,156 MHz
1,471 MHz
1,257 MHz
1,340 MHz
1,168 MHz
1,244 MHz
1,175 MHz
Compute Units 64 64 56 36 32 14
Shader Processors 4096 4096 3584 2304 2048 1024
Memory Interface 2048-bit 2048-bit 2048-bit 256-bit 256-bit 128-bit
ROPs 64 64 64 32 32 16
TMUs 256 256 224 144 128 56
Memory Spee 945 MHz 945 MHz 800 MHz 8 Gbps 7 Gbps 7 Gbps
Memory Bandwidth 484 GB/s 484 GB/s 410 GB/s 256 GB/s 211 GB/s 112 GB/s
Board power 435 W 295 W 210 W 185 W 150 W 75 W
MSRP 600 USD 499 USD 399 USD $199 (4 GB)
$229 (8 GB)
$149 (4 GB)
$179 (8 GB)
$99 (2 GB)
$119 (4 GB)

Basically, the Vega GPU is based on 64 shader clusters with 64 shader processors each. So that makes a nice 4096 shader processors in total. These shader partitions are tied to 64 ROPs. TMUs wise for a fully enabled unit AMD has always used a 4:1 ratio meaning 64 CUs x 4 = 256 texture units, which is a good number. Memory sits on a 2048-bit wide bus (two memory stacks of 1024-bits each) spread over the 64-bit controllers.  

An bit more architecture

For the Radeon RX Vega 64 - sixty-four shader clusters with 64 shaders per cluster equal to 4096 shader processors that turbo to the 1.545 GHz dynamic clock frequency.


The Vega 10 GPU is based on a whopping 12.5 billion transistors and is fabbed on a 14nm FinFET LPP process. It comes with over 45MB of SRAM cache across the chip and thus holds either an 8 GB (consumer) or 16 GB (Pro) configuration. While we'll get into the details of Vega in our review, the GPU is based on a design with 4 synchronous compute pipes, four next gen geometry engines and for the fully activated SKU, 64 next gen compute units which will get you 4096 shaders processors and 56 x 64 = 3584 shader procs for the 56 model.


The full 64 model would be tied to 64 ROPs and 256 / 224 texture memory units. G33k stuff: L2 cache has been doubled to 4 MB. Render back-ends are now clients of the L2.


To the left, the new Vega 10 GPU with two 4 GB HBM2 stacks - to the right, the older Fiji GPU (Fury) holding four 1 GB stacks

Vega is fitted with HBM2 memory (vertically stacked graphics memory placed on die) as well as using a new IO gateway, the graphics memory cache is a synonym for High Bandwidth Cache. Earlier on, AMD stated that Vega is a GPU with 200 new features. A number of things are key for AMD with this scalable new GPU architecture. Vega 10 is using High Bandwidth Video Memory, VRAM, graphics memory or whatever you like to call it. The specific type used is HBM2 memory. The graphics engineers from AMD claim that HBM2 will offer you 5x the power efficiency compared to any other graphics memory including GDDR5, and yes, that is huge.


Another benefit obviously is capacity. HBM is on-chip vertically stacked (slabs of memory cells placed on top of each other) and you know it, when it comes to caches and memory bigger is simply better (in terms of storage volume). With the second iteration of HBM, HBM2, AMD now has 8x the density per stack with a 50% smaller footprint.

Share this content
Twitter Facebook Reddit WhatsApp Email Print