Guru3D.com
  • HOME
  • NEWS
    • Channels
    • Archive
    • Search
    • Submit
  • DOWNLOADS
    • New Downloads
    • Categories
    • Archive
    • Search
    • Submit
  • GAME REVIEWS
  • ARTICLES
    • Editorials
    • Guru3D VGA Charts
    • Rig of the Month
    • Join ROTM
    • PC Buyers Guide
    • Dated content
    • More Categories
  • HARDWARE REVIEWS
    • Videocards
    • Processors
    • Audio
    • Motherboards
    • Memory and Flash
    • SSD Storage
    • Chassis
    • Power Supply
    • Laptop and Mobile
    • Smartphone
    • Networking
    • Keyboard Mouse
    • Cooling
    • Knowledgebase
    • Search articles
    • More Categories
  • FORUMS
  • SEARCH
    • Search Articles
    • Search News
    • Search Files
  • NEWSLETTER
  • CONTACT

New Reviews
Gigabyte GeForce GTX 650 Ti Boost OC WindForce 2X review
MSI Radeon HD 7790 TurboDuo OC review
Metro Last Light VGA Graphics Benchmark performance test
Noctua NH-U12S and NH-U14S review
ASUS GeForce GTX 670 DirectCU Mini review
OCZ Vertex 3.20 SSD review
Gigabyte Radeon HD 7790 2GB OC review
Cooler Master Eisberg 240L Prestige review
Guru3D and OCZ Contest - PC Power 1200W PSU Giveaway
MSI GeForce GTX 650 Ti BOOST OC review

New Downloads
MSI Afterburner 3.0.0 Beta 10 Download
PhysX System Software 9.13.0325 Download
GPU-Z Download 0.7.1
HWiNFO32 4.18 Download
HWiNFO64 4.18 Download
GeForce 320.14 BETA Driver Download
Nvidia Lifelike Human Face Rendering Tech Demo Download
3DMark Download v1.1.0
XBMC Media Center Download 12.0 2
RTSS Rivatuner Statistics Server Download v5.1.1


New Forum Topics
by: Hilbert Hagedoorn Microsoft Xbox One console shownby: Stone Gargoyle Call of Duty 2013by: Hilbert Hagedoorn AMD loses 2nd place processor market, now 4thby: Terepin GeForce + Triple Buffering + Windows 8 = impossible?by: Stone Gargoyle Xbox World reveals Next Gen Xbox?by: Hilbert Hagedoorn Download MSI Afterburner 3.0.0 Beta 10by: SlackerITGuy What's the Radeon experience in BF3?by: NiukNiuk Guild Wars 2 Design Manifestoby: sverek 6870 Crossfireby: Von Dach Performance Tweaks


Online Users
There are currently 2562 user(s) online:
Andrea_23, angmar, antonyfrn, Chillin, FatBoyNL, Google, Horus-Anhur, keenan, kosh_neranek, Lane, leopr, Live Search, MSN, n2k3, pbvider, quaker3, spajdrik, Undying, xpeedx, Yahoo, zer0_c0ol


Guru3D.com » Review » GeForce GTX Titan preview » Page 3

GeForce GTX Titan preview

Posted by Hilbert Hagedoorn on: 02/19/2013 02:56 PM [ 218 comment(s) ]

Kepler GK110
Tweet

 

The graphics architecture that is Kepler GK110

As you can understand, the massive memory partitions, bus-width and combination of GDDR5 memory (quad data rate) allow the GPU to work with a very high framebuffer bandwidth (effective). Let's again put most of the data in a chart to get an idea and better overview of changes:

Graphics card GeForce GTX 480 GeForce GTX 580 GeForce GTX 680 GeForce GTX Titan
Fabrication node 40nm 40nm 28nm 28nm
Shader processors 480 512 1536 2688
Streaming Multiprocessors (SMX) 15 16 8 14
Texture Units 60 64 128 224
ROP units 48 48 32 48
Graphics Clock (Core) 700 MHz 772 MHz 1006/1058 MHz 836/876 MHz
Shader Processor Clock 1401 MHz 1544 MHz 1006/1058 MHz 836/876 MHz
Memory Clock / Data rate 924 MHz / 3696 MHz 1000 MHz / 4000 MHz 1502 MHz / 6008 MHz 1502 MHz / 6008 MHz
Graphics memory 1536 MB 1536 MB 2048 MB 6144 MB
Memory interface 384-bit 384-bit 256-bit 384-bit
Memory bandwidth 177 GB/s 192 GB/s 192 GB/s 288 GB/s
Power connectors 1x6-pin PEG, 1x8-pin PEG 1x6-pin PEG, 1x8-pin PEG  2x6-pin PEG 1x6-pin PEG, 1x8-pin PEG
Max board power (TDP) 250 Watts 244 Watts 170 Watts 250 Watts
Recommended Power supply 600 Watts 600 Watts 550 Watts 600 Watts
GPU Thermal Threshold 105 degrees C 97 degrees C 98 degrees C 95 degrees C

So we talked about the core clocks, specifications and memory partitions. Obviously there's a lot more to talk through. We feel that to be able to understand a graphics processor you simply need to break it down into small pieces to better understand it. Let's first look at the raw data that most of you can understand and grasp. This bit will be about the Kepler GK110 architecture, if you're not interested in geek talk by all means please browse to the next page.

Right, so have a close look at the GK110 die as shown above. You'll notice the five green clusters. These are the polymorph GPC engines, each containing 3 SMX clusters, 5x3 = 15 SMX clusters in total. You'll spot six 64-bit memory interfaces, bringing in a 384-bit path towards the graphics memory. That's instant extra memory bandwidth by the way, combined with a 6 Gbps clock the cards can reach 288 GB/sec.
 

 


So, above we see the GK110 block diagram that entails Kepler architecture. Let's break it down into bits and pieces. The GK110 will have:

  • 2880 or optional 2688 CUDA processors (Shader cores)
  • 192 CUDA cores per cluster (SMX) 
The more important things to focus on are the SM (block of shader processors) clusters (or SMX as NVIDIA likes to call it for the GTX 600 series) which have 192 Shader processors. 
 
 
SMX: 192 single‐precision CUDA cores, 64 double‐precision units, 32 special function units (SFU), and 32 load/store units.
 

When we zoom in ever further at one SMX cluster (192 shader processors) we see a change from the GK104 (GTX 680) as there are 64 double-precision math units.
 
See, the GeForce GTX 680 SMX had 192 single-precision (SP) floating point CUDA Cores, and 8 double-precision (DP) CUDA cores. As a result, DP operations per clock ran at effectively 1/24 the SP rate. For GTX TITAN it includes a full 64 DP CUDA Cores per SMX (compared to 192 SP CUDA Cores), or 1/3rd the number of DP cores to SP for substantially more double-precision horsepower. 

So based on a full 15 SMX 2880 shader cores chip the GK110 has 960 DP units linked to its total of 2,880 CUDA cores,that would be 896 DP units on today's tested GTX Titan with 14 activated SMXes.

Double precision wise, to unlock full performance you must open the NVIDIA Control Panel, navigate to “Manage 3D Settings”. In the Global Settings box you will find an option titled “CUDA – Double Precision”, but... GeForce GTX Titan runs at reduced clock speeds when full double-precision is enabled. Still a great option if you are working on CUDA applications.

The SMX has quite a bit more bite in terms of shader, texture and geometry processing. 192 CUDA cores, that's six times the number of cores per SM opposed to Fermi. In the pipeline we run into the ROP (Raster Operation) engine and the GK110 has 48 engines for features like pixel blending and AA.

The GK110 has 64KB of L1 cache for each SMX plus a special 48KB texture unit memory that can be utilized as a read-only cache. L2 cache wise things remain the same across the SMX units compared to the GK104, 1.5MB. The GPU’s Texture units are a valuable resource for compute programs with a need to sample or filter image data. The texture throughput in Kepler is significantly increased compared to Fermi – each SMX unit contains 16 texture filtering units.

  • GeForce GTX 580 has 16 SMX x 4 Texture units = 64
  • GeForce GTX 680 has 8 SMX x 16 Texture units = 128
  • GeForce GTX Titan has 14 SMX x 16 Texture units = 224

So there's a total 15 SMX x 16 TU = 240 texture filtering units available for the GK110 silicon itself (if all SMXes were enabled). Still with me?





11 pages « 2 3 4 5 next »


Guru3D.com » Articles » GeForce GTX Titan preview » Page 3

Related Articles
Gigabyte GeForce GTX 650 Ti Boost OC WindForce 2X review
In this article we review the Gigabyte GeForce GTX 650 Ti Boost OC WindForce 2X with that OC for a factory tweak and the Windforce indicating a silent yet powerful two fan cooling solution. The product is customized with a new PCB, cooling and a few tweaks, it has 2GB of memory with both that memory and the core base-clock slightly overclocked. An tasty product at an interesting price in the lower segment of the mainstream market.

ASUS GeForce GTX 670 DirectCU Mini review
In this article we review the ASUS GeForce GTX 670 DirectCU Mini edition, a compact performance graphics card designed primarily for small form factor PCs with mini ITX motherboards. The dual-slot card measures just 17cm and features the NVIDIA GTX 670 GPU. ASUS has re-engineered the DirectCU cooler to fit small form factor cases. While shorter, it introduces a copper vapor chamber placed directly on top of the GPU for faster heat spreading and dispersal with 20% lower temperatures than reference GTX 670.

MSI GeForce GTX 650 Ti BOOST OC review
In this article we review the MSI GeForce GTX 650 Ti BOOST OC edition review with that OC for a factory tweak. The product is customized with a new PCB, cooling and a few tweaks, it has 2GB of memory with both that memory and the core base-clock slightly overclocked. Overall an interesting product at an interesting price in the lower segment of the mainstream market.

EVGA GeForce GTX 650 Ti Boost SC edition review
In this article we review the EVGA GeForce GTX 650 Ti Boost SC edition review with that SC for superclocked. The product is fairly reference looking but does come with EVGA's own styled cooler, it has 2GB of memory with both that memory and the core baseclock slightly overclocked quite significant.

Follow Guru3D on Google+ - Facebook - YouTube - Twitter © 2013