Guru3D.com
  • HOME
  • NEWS
    • Channels
    • Archive
    • Search
    • Submit
  • DOWNLOADS
    • New Downloads
    • Categories
    • Archive
    • Search
    • Submit
  • GAME REVIEWS
  • ARTICLES
    • Editorials
    • Guru3D VGA Charts
    • Rig of the Month
    • Join ROTM
    • PC Buyers Guide
    • Dated content
    • More Categories
  • HARDWARE REVIEWS
    • Videocards
    • Processors
    • Audio
    • Motherboards
    • Memory and Flash
    • SSD Storage
    • Chassis
    • Power Supply
    • Laptop and Mobile
    • Smartphone
    • Networking
    • Keyboard Mouse
    • Cooling
    • Knowledgebase
    • Search articles
    • More Categories
  • FORUMS
  • SEARCH
    • Search Articles
    • Search News
    • Search Files
  • NEWSLETTER
  • CONTACT

New Reviews
ASUS Maximus VI Extreme Z87 motherboard review
ASUS GeForce GTX 780 DirectCU II OC review
Fractal Design Arc Midi R2 review
Corsair Vengeance K70 review
MSI GeForce GTX 770 Lightning review
EVGA GeForce GTX 770 SC review
Plextor M5M 256GB mSATA SSD review
AMD A10 6800K review
SanDisk Extreme II 120 - 240 and 480 GB SSD review
ASUS Sabertooth Z87 motherboard review

New Downloads
Media Player Classic Home Cinema v1.6.8 Download
Sandra 2013 SP4 19.50 download
MSI Afterburner 3.0.0 Beta 10 Download
AMD Catalyst 13.6 BETA 2 Download
CPU-Z 1.6.4
AIDA64 Download version 3.00
AMD Catalyst 13.6 BETA Download
PrecisionX Download Version 4.2.0
GeForce 320.18 WHQL Driver Download
AMD Catalyst Application Profile Download 13.5 CAP1


New Forum Topics
by: BlackZero Downsampling with AMD: Guide and Demonstrationby: Taint3dBulge AMD FX 9590 to cost $960by: anticupidon Summer projects?by: Enmity 27" pls, 29" ips or 120hz 1080p?by: PhazeDelta1 Samsung 840 and Pro Firmware Update DXT08B0Q / DXM05B0Qby: Mufflore How about this for a 3D PC projector!by: bishi Geforce GTX 780 Owners Clubby: msi-afterburner MSI Afterburner 3.0.0 Beta 10(2013-05-22)by: J.B.west Help me choose music for my gfs restaurantby: asder00 AMD Catalyst 13.x (13.150.0.0 May 23)


Online Users
There are currently 2115 user(s) online:
Blackops_2, Google, Hyper, MSN, volkov956, Yahoo


Guru3D.com » Review » KFA2 GeForce GTX 660 EX OC review » Page 5

KFA2 GeForce GTX 660 EX OC review - Page 5

Posted by Hilbert Hagedoorn on: 09/14/2012 09:18 AM [ 0 comment(s) ]

Tweet

 

The graphics architecture that is Kepler

As you can understand, the massive memory partitions, bus-width and combination of GDDR5 memory (quad data rate) allow the GPU to work with a very high framebuffer bandwidth (effective). Let's again put most of the data in a chart to get an idea and better overview of changes:

Graphics card GeForce GTX
660
GeForce GTX
660 Ti
GeForce GTX
670
GeForce GTX 680 GeForce GTX 690
Fabrication node 28nm 28nm 28nm 28nm 28nm
Shader processors 960 1344 1344 1536 3072
Streaming Multiprocessors (SM) 5 7 7 8 16
Texture Units 80 112 112 128 128x2
ROP units 24 24 32 32 32x2
Graphics Clock (Core) 980/1033 MHz 915 / 980MHz 915 / 980MHz 1006/1058MHz 915/1019MHz
Shader Processor Clock 980/1033 MHz 915 / 980MHz 915 / 980MHz 1006/1058MHz 915/1019MHz
Memory Clock / Data rate MHz 1502 / 6008 MHz 1502 / 6008 MHz 1502 / 6008 MHz 1502 / 6008 MHz 1502 / 6008 MHz
Graphics memory 2048 MB 2048 MB 2048 MB 2048 MB 4096 MB
Memory interface 192-bit 192-bit 256-bit 256-bit 256-bit
Memory bandwidth 144 GB/s 144 GB/s 192 GB/s 192 GB/s 192 GB/s
Power connectors 1x6-pin PEG 2x6-pin PEG 2x6-pin PEG 2x6-pin PEG 2x8-pin PEG
Max board power (TDP) 140 Watts 150 Watts 170 Watts 170 Watts 300 Watts
Recommended Power supply 450 Watts 450 Watts 500 Watts 550 Watts 750 Watts
GPU Thermal Threshold 98 degrees C 98 degrees C 98 degrees C 98 degrees C 98 degrees C

So we talked about the core clocks, specifications and memory partitions. Obviously there's a lot more to talk through the GPU architecture for example. To understand a graphics processor you simply need to break it down into pieces to better understand it.  

Let's first look at the raw data that most of you can understand and grasp. This bit will be about the Kepler architecture, if you're not interested in g33k talk by all means please browse to the next page.

ASUS GTX 660

So above we see the GK106 block diagram that entails the Kepler architecture. Let's break it down into bits and pieces.

A fully operating GK106 will have:

  • 960 CUDA processors (Shader cores)
  • 192 CUDA core clusters (per SM).
  • 5 geometry units
  • 3 raster Units
  • 80 Texture Units
  • 24 ROP engines
  • 192-bit GDDR5 memory bus
  • DirectX 11.1

Above thus a fully operating GK106 as used on the GTX 660. So the more important thing to focus on are the SM (block of shader processors) clusters (or SMX as NVIDIA likes to call it for the GTX 660, which  has 192 Shader processors. That's radically different from Fermi, the GeForce GTX 580 for example had 32 shader processors per SM cluster. 960 : 192 = 5 Shader clusters (SMs). Let's blow up one such cluster:

GeForce GTX 680

Above the block diagram for a single Shader processor cluster, aka SM or SMX as NVIDIA now calls it. The SMX has quite a bit more bite in terms of shader, texture and geometry processing. 192 CUDA cores, that's six times the number of cores per SM opposed to Fermi. Now, at the end of the pipeline we run into the ROP (Raster Operation) engine and the GTX 660 again has 24 engines for features like pixel blending and AA, the GTX 660 Ti has 24 of these activated.

There's a total of 80 texture filtering units available for the GK106. The math is simple here, each SM has 16 texture units tied to it.

  • GeForce GTX 580 has 16 SMs X 4 Texture units = 64
  • GeForce GTX 660 Ti has 5 SMs X 16 Texture units = 80
  • GeForce GTX 660 Ti has 7 SMs X 16 Texture units = 112
  • GeForce GTX 670 has 7 SMs X 16 Texture units = 112
  • GeForce GTX 680 has 8 SMs X 16 Texture units = 128

Above the GK105 host interface - The Gigathread engine, three GPCs, three memory controllers, the ROP partitions, a 384 KB L2 cache. ROP partitions are nearby to the L2 cache, Each shader cluster then is tied to L1 and a shared L2 cache. Shading performance is going be increased quite bit, geometry performance will get a nice boost as well.





27 pages « < 4 5 6 7 next »



Related Articles
KFA2 GeForce GTX 770 EX OC review
In this review we take the KFA2 GeForce GTX 770 EX OC edition for a benchmarkin' test-drive. This model graphics card comes with a factory overclock and a new design cooler with dual 90mm fans. Overall the card is sitting in-between the GeForce GTX 680 and GeForce GTX 780 , 100% cool and 100% silent. We test the product with the hottest games like Metro: Last light, Battlefield 3, Sleeping Dogs, Far Cry 3, Medal of Honor Warfighter, Hitman Absolution and many more.

KFA2 GeForce GTX 660 EX OC review
We reviwe the GeForce GTX 660 EX OC from KFA2. the product comes factory tweaked for you and has been equipped with a dual-slot dual-fan cooling solution. It doesn't make any noise and it plays games .. hard.

KFA2 GeForce GTX 670 EX OC review
Say hello to our KFA2 GeForce GTX 670 EX OC review, in this article we'll look at a really pleasant offering from KFA2. The product is a factory overclocked, custom cooled, custom designed PCB GeForce GTX 670, yes this is the EX OC edition.

KFA2 GeForce GTX 570 MDT X4 review
We review the KFA2 GeForce GTX 570 MDT X4. The card supports 2 / 3 / 4 monitors in span mode and 2x2 in stack mode over the DVI connectors. KFA2 takes that GTX 570 to the next level though, custom PCB, custom cooling and then a finger licking default overclock at 800 MHz on the graphics core.

Follow Guru3D on Google+ - Facebook - YouTube - Twitter © 2013