GeForce GTX 590 review
Posted by Hilbert Hagedoorn on: 03/23/2011 02:00 PM [ 0 comment(s) ]
The Product Architecture
It isn't exactly a secret anymore what the GTX 590 exactly is, in very short words it is one graphics card made out of two graphics processors. Internally they are connected through a PCIe connection (on board) and then they operate in SLI mode. To accomplish something really sturdy and powerful, NVIDIA took their fastest GPUs available and without any compromises on the number of shader processors etc inside that GPU they started designing the card.
The end result is a dual-GPU product that runs two ICs called and tagged as the GF110, the very same stuff you find in the GeForce GTX 580, and in it's bare essence the GTX590 is in fact two 580'ies albeit clocked slower.
So that's two GF110 GPUs on a single graphics card connected through a NF200 bridge chip with a total of 1024 Shader processors for both GPUs and 3 GB of GDDR5 memory, all that on a 12-layer 11" PCB. The GPUs each get 10 power phases. Cooling wise NVIDIA extends the vapor-chamber cooling, yet now they make use of a dual-radiator design.
So yeah, it is hard to realize that the 9th of November last year these GPUs where released, this is how fast the product cycle is updated these days.
The new Dual-GPU solution is of course DirectX 11 ready. With Windows 7 and Vista also being DX11 ready all we need are some games to take advantage of DirectCompute, multi-threading, hardware tessellation and new shader 5.0 extensions. DX11 is going to be good and once tessellation kicks into games, better looking.
- GeForce GTX 590 : 1024 SP, 384-bit (2x), 365W TDP
- GeForce GTX 580 : 512 SP, 384-bit, 243W TDP
- GeForce GTX 480 : 480 SP, 384-bit, 250W TDP
- GeForce GTX 470 : 448 SP, 320-bit, 225W TDP
The GPU that empowers it all has small architectural changes compared to the initial GTX 480 FERMI design (which GF110 is based of), some stuff was stripped away and some additional functional units for tessellation, shading and texturing have been added. Make note that the GPU still is big, as the fabrication node is still 40nm. TSMC canceled the 32nm fab node preventing this chip from being smaller.
The two GF110 graphics processors have sixteen shader clusters embedded in them (called SMs) bringing a full 512 shader processors per GPU.
To keep TDP, power consumption and noise levels at really acceptable values, NVIDIA did keep the clock frequency and thus voltage lower than that GTX 580. That's 607 MHz on the graphics processors and 3414 MHz (effective data-rate) on the GDDR5 memory. But that will still chunk out ridiculous performance numbers of course.
|Stream (Shader) Processors||128||240||240 x2||480||512||1024|
|Core Clock (MHz)||675||648||576||700||772||607|
|Shader Clock (MHz)||1675||1476||1242||1400||1544||1215|
|Memory Clock (effective MHz)||2200||2400||2000||3700||4000||3414|
|Memory amount||512 MB||1024 MB||1792 MB||1536||1536||3072|
|Memory Interface||256-bit||512-bit||448-bit x2||384-bit||384-bit||384-bit|
|Two Dual link DVI||Yes||Yes||Yes||Yes||Yes||Yes|
Memory wise NVIDIA has sizable and expensive memory volumes due to their architecture, we pass 1 GB per GPU as standard these days for most of NVIDIA's series 500 graphics cards. Each memory partition utilizes one memory controller on the respective GPU, which will get 256MB of memory tied to it.
- The GTX 590 has six (x2) memory controllers (12x256MB) = 3072 MB of GDDR5 memory
- The GTX 580 has six memory controllers (6x256MB) = 1536 MB of GDDR5 memory
As you can understand, the memory partitions, bus-width and combination of GDDR5 memory (quad data rate) allow the GPU to work with a high framebuffer bandwidth (effective). Let's put most of the data in a chart to get an idea and overview of some specific changes:
|Graphics card||GeForce GTX 470||GeForce GTX 480||GeForce GTX 580||GeForce GTX 590|
|Streaming Multiprocessors (SM)||14||15||16||32|
|Graphics Clock (Core)||607 MHz||700 MHz||772 MHz||607 MHz|
|Shader Processor Clock||1215 MHz||1401 MHz||1544 MHz||1215 MHz|
|Memory Clock / Data rate MHz||837 / 3348||924 / 3696||1000 / 4000||854 / 3414|
|Graphics memory||1280 MB||1536 MB||1536 MB||3072 MB|
|Memory bandwidth||134 GB/s||177 GB/s||192 GB/s||164 GB/s per GPU|
|Power connectors||2x6-pin PEG||1x6-pin PEG, 1x8-pin PEG||1x6-pin PEG, 1x8-pin PEG||2x8-pin PEG|
|Max board power (TDP)||215 Watts||250 Watts||244 Watts||365 Watts|
|Recommended Power supply||550 Watts||600 Watts||600 Watts||700 Watts|
|GPU Thermal Threshold||105 degrees C||105 degrees C||97 degrees C||97 degrees C|
So we talked about the core clocks, specifications and memory partitions. Obviously there's a lot more to talk through. Now, at the end of the pipeline we run into the ROP (Raster Operation) engine and each GF110 GPU has 48 units for features like pixel blending and AA. There's a total of 64 texture filtering units per GPU available for the GeForce GTX 590. The math is simple here, each SM has four texture units tied to it.
- GeForce GTX 580 has (16 SMs X 4 Texture units) = 64 x 2 = 128 TUs
- GeForce GTX 580 has 16 SMs X 4 Texture units = 64 TUs
Though still a 40nm based chip, the GF110 GPU comes with almost 3 billion transistors embedded into it. So that's 6 billion transistors active in your PC -- so if the Sims still doesn't run smooth... I dunno what would do the job ;)
The TDP is roughly 365 Watt, a very respectable number really. TDP = Thermal Design Power. Roughly translated, when you stress everything on the graphics card 100%, your maximum power consumption is the TDP.
The GeForce GTX 590 comes with both two 8-pin power connectors to get enough current and still have some for overclocking.
We test and review the Gigabyte GeForce GTX 780 WindForce 3x OC edition. The graphics card comes witha factory overclock and the new WindForce 3X 2 Slot 450W fan sink with Triangle Cool fans, as they like to cool it. That would be three silent 80mm fans. Overall the card is almost as fast as a GeForce GTX Titan, 100% cool and 100% silent. We test the product with the hottest games like Metro: Last light, Battlefield 3, Sleeping Dogs, Far Cry 3, Medal of Honor Warfighter, Hitman Absolution and many more.
GeForce GTX 780 SLI and Multi monitor review
We review the GeForce GTX 780 SLI and also do a SurroundView session with three monitors. The boards used are reference from NVIDIA. Over the next few pages we'll tell you a bit about multi-GPU gaming, the challenges, the requirements and of course a nice tasty benchmark session with the latest games. We'll have a peek at temperatures and power consumption of the GeForce GTX Titan cards in 2-way SLI mode to monitor it's generated performance.
GeForce GTX 780 review
We test and review the GeForce GTX 780. The GeForce GTX 780 is NVIDIAs all new high-end graphics card based in their Flagship product, the GTX Titan. This means it is based on the GK110 GPU and has an whopping 7.1 Billion transistors. That makes it a nice chunk faster opposed to the GeForce GTX 680 GPU. We test the product with the hottest games like Metro: Last light, Battlefield 3, Sleeping Dogs, Far Cry 3, Medal of Honor Warfighter, Hitman Absolution and many more.
Gigabyte GeForce GTX 650 Ti Boost OC WindForce 2X review
In this article we review the Gigabyte GeForce GTX 650 Ti Boost OC WindForce 2X with that OC for a factory tweak and the Windforce indicating a silent yet powerful two fan cooling solution. The product is customized with a new PCB, cooling and a few tweaks, it has 2GB of memory with both that memory and the core base-clock slightly overclocked. An tasty product at an interesting price in the lower segment of the mainstream market.