Guru3D.com
  • HOME
  • NEWS
    • Channels
    • Archive
    • Search
    • Submit
  • DOWNLOADS
    • New Downloads
    • Categories
    • Archive
    • Search
    • Submit
  • GAME REVIEWS
  • ARTICLES
    • Editorials
    • Guru3D VGA Charts
    • Rig of the Month
    • Join ROTM
    • PC Buyers Guide
    • Dated content
    • More Categories
  • HARDWARE REVIEWS
    • Videocards
    • Processors
    • Audio
    • Motherboards
    • Memory and Flash
    • SSD Storage
    • Chassis
    • Power Supply
    • Laptop and Mobile
    • Smartphone
    • Networking
    • Keyboard Mouse
    • Cooling
    • Knowledgebase
    • Search articles
    • More Categories
  • FORUMS
  • SEARCH
    • Search Articles
    • Search News
    • Search Files
  • NEWSLETTER
  • CONTACT

New Reviews
Gigabyte GeForce GTX 650 Ti Boost OC WindForce 2X review
MSI Radeon HD 7790 TurboDuo OC review
Metro Last Light VGA Graphics Benchmark performance test
Noctua NH-U12S and NH-U14S review
ASUS GeForce GTX 670 DirectCU Mini review
OCZ Vertex 3.20 SSD review
Gigabyte Radeon HD 7790 2GB OC review
Cooler Master Eisberg 240L Prestige review
Guru3D and OCZ Contest - PC Power 1200W PSU Giveaway
MSI GeForce GTX 650 Ti BOOST OC review

New Downloads
MSI Afterburner 3.0.0 Beta 10 Download
PhysX System Software 9.13.0325 Download
GPU-Z Download 0.7.1
HWiNFO32 4.18 Download
HWiNFO64 4.18 Download
GeForce 320.14 BETA Driver Download
Nvidia Lifelike Human Face Rendering Tech Demo Download
3DMark Download v1.1.0
XBMC Media Center Download 12.0 2
RTSS Rivatuner Statistics Server Download v5.1.1


New Forum Topics
by: Hilbert Hagedoorn NVIDIA GeForce GTX 780, GTX 770 and GTX 760 Tiby: CeeJay.dk SweetFX Shader Suite release and discussion thread #3by: Ralph Kerbal Space Programby: Agent-A01 Geforce GTX TITAN Owner Clubby: Hilbert Hagedoorn Microsoft Xbox One console shownby: msi-afterburner MSI Afterburner 3.0.0 Beta 10(2013-05-22)by: Crowbar Skyrim graphics mod question...by: Rich_Guy AMD Catalyst 13.5 CAP 1 Releasedby: dk_lightning Mouse/Input Lagby: RedSeptember Call of Juarez - Gunslinger


Online Users
There are currently 1780 user(s) online:
dk_lightning, Google, MSN, Olvik, vidra, Yahoo


Guru3D.com » Review » PowerColor Radeon HD 7950 PCS+ review » Page 5

PowerColor Radeon HD 7950 PCS+ review

Posted by Hilbert Hagedoorn on: 01/30/2012 02:00 PM [ 0 comment(s) ]

Graphics engine architecture
Tweet

 

Graphics engine architecture

So I kept the more complex stuff for last in the technology overview. If this seems a little too techy for you, skip this page please. AMD is moving away from the VLIW5 and VLIW4 architecture we have seen in the last generation of products. If anything, VLIW4 has shown certain inefficiencies in the Radeon HD 6900 series and while VLIW designs are fine for graphics they are not so grand for computing.

The new graphics core architecture is now marketed as GCN, which is short for Graphics Core Next architecture and the architecture building block has changed significantly to remove certain inefficiencies seen in the VLIW architecture.

A GCN in its essence is the basis of a GPU that performs well at both graphical and computing tasks. For the compute side of things the new GCN Compute unit model has been introduced, it is designed for better utilization, high throughput and multi tasking. E.g. performance, performance, performance.

So your basic new Shader cluster is one called a (GCN) Compute Unit:

  • Non-VLIW Design
  • 16 wide SIMD Units
  • 64 KB registers / SIMD Unit

Now if we take 4 of these SIMD Units that will be the basis of one Compute Unit (CU), each SIMD unit is 16 wide, times four per compute unit means that each CU unit has 64 shader processors. The GPU has 28 Compute units meaning 64SIMDs x 28 CUs = 1792 Shader processors (for the R7950).

  • Engine has Dual Geometry engines / Asynchronous Compute engines
  • 8 render backends / 32 color ROPs per clock cycle / 128 Z/Stencil ROPs per clock
  • Engine ties to 768KB R/W L2 cache
  • Tahiti GPU Pro has up-to 28 Compute Units
  • Tahiti GPU XT has up-to 32 Compute Units

The Graphics Core Next Compute Unit (CU) has about the same floating point power per clock as the previous one (i.e. Cayman). It also has the same amount of register space (for the vector units).  Each CU also has it's own registers and local data share.

Again: one compute unit just as a Cayman SIMD is a collection of shader processors, four SIMDs form one compute unit. Caymans (6900) problem was that it was not so efficient with multiple tasks at once.

Cayman had/has 16 4-wide VLIW processing elements for a total of 16x4=64 operations in parallel, while the new architecture has 4 16-wide vector processors, again for a total of 4x16=64 operations per clock. GCN also has a scalar processor that Cayman does not.

The distinction is in its bare essence that GCN does not need instruction level parallelism, each of the four 16-wide SIMD vector units execute a different wavefront being the whole 64-sized wavefront taking four cycles.

Radeon HD 7970

So the theoretical floating point power stays more or less the same per CU, but GCN will be more efficient since it does not require instruction level parallelism (we assume it costs some more area/transistors as well). The outcome? Compiling also becomes much more uncomplicated and that means more efficiency and thus there it is again, better performance.

GCN is all about creating a GPU good for both graphics and computing purposes. Oh and all compute units... combined with the other ASIC components form the GPU.





25 pages « < 4 5 6 7 next »


Guru3D.com » Articles » PowerColor Radeon HD 7950 PCS+ review » Page 5

Related Articles
PowerColor 7790 TurboDuo OC review
We test and review the PowerColor Radeon HD 7790 TurboDuo OC edition incl FCAT Frametimes. The new graphics card is intended to boost a little more performance into entry-level gaming. The PowerColor TurboDuo HD7790 OC clocks at 7.5% overclocking speed on boost engine, packed with dual-fan cooling and S-shape heat pipe direct touch technology.

PowerColor Radeon HD 7950 PCS+ review
PowerColor is the first in our line-up of R7950 reviews with a customized model. It is the PCS version that clocks in at a cool 880 MHz on the graphics core with it's memory clocked default at an effective data rate of 5000 MHZ. Armed witha custom cooler it is silent, and even cooler compared to the reference model.

PowerColor Radeon 6870 PCS+ review
This is the R6870 PCS+ version where PowerColor pre-overclocks the card to 940 MHz (900 reference) and clock the memory at 4400 MHz coming from 4200 MHz. This should give the card a nifty nice boost.

PowerColor Radeon 6850 PCS+ review
PowerColor is as always never late to arrive at the party, they submitted a Radeon HD 6850 for a test here at Guru3D.com and as such we'd be more than happy to bring you a full review on one of their newest products today, the PCS+ version of the Radeon HD 6850 that comes pre-overclocked.

Follow Guru3D on Google+ - Facebook - YouTube - Twitter © 2013