Guru3D.com
  • HOME
  • NEWS
    • Channels
    • Archive
  • DOWNLOADS
    • New Downloads
    • Categories
    • Archive
  • GAME REVIEWS
  • ARTICLES
    • Rig of the Month
    • Join ROTM
    • PC Buyers Guide
    • Guru3D VGA Charts
    • Editorials
    • Dated content
  • HARDWARE REVIEWS
    • Videocards
    • Processors
    • Audio
    • Motherboards
    • Memory and Flash
    • SSD Storage
    • Chassis
    • Media Players
    • Power Supply
    • Laptop and Mobile
    • Smartphone
    • Networking
    • Keyboard Mouse
    • Cooling
    • Search articles
    • Knowledgebase
    • More Categories
  • FORUMS
  • NEWSLETTER
  • CONTACT

New Reviews
Fractal Design Pop Air RGB Black TG review
Palit GeForce GTX 1630 4GB Dual review
FSP Dagger Pro (850W PSU) review
Razer Leviathan V2 gaming soundbar review
Guru3D NVMe Thermal Test - the heatsink vs. performance
EnGenius ECW220S 2x2 Cloud Access Point review
Alphacool Eisbaer Aurora HPE 360 LCS cooler review
Noctua NH-D12L CPU Cooler Review
Silicon Power XPOWER XS70 1TB NVMe SSD Review
Hyte Y60 chassis review

New Downloads
AMD Radeon Software Adrenalin 22.6.1 WHQL driver download
GeForce 516.59 WHQL driver download
Media Player Classic - Home Cinema v1.9.22 Download
AMD Chipset Drivers Download v4.06.10.651
CrystalDiskInfo 8.17 Download
AMD Radeon Software Adrenalin 22.6.1 Windows 7 driver download
ReShade download v5.2.2
HWiNFO Download v7.26
7-Zip v22.00 Download
GeForce 516.40 WHQL driver download


New Forum Topics
AMD Radeon Software Adrenalin 22.6.1 - Driver download and discussion NVIDIA GeForce 516.59 WHQL driver download & Discussion AMD Radeon Software - UWP Extreme 4-Way Sli Tuning According to Asus and Gigabyte, motherboard sales will fall by 25% this year. Tensor Core equivalent Likely to Get Embedded in AMD rDNA3 Windows Defender can Significantly Impact Intel CPU Performance? [3rd-Party Driver] Amernime Zone Radeon Insight 22.5.1 WHQL Driver Pack (Released) FSR Thread NVIDIA seems to halt producing the 12 GB RTX 3080




Guru3D.com » Review » Palit GeForce GTX 760 JetStream review » Page 2

Palit GeForce GTX 760 JetStream review - The Technology and Specs

by Hilbert Hagedoorn on: 06/25/2013 02:48 PM [ 3] 0 comment(s)

Tweet

 

Technology and Specs

So then, it's time to talk business. The GeForce GTX 680 GeForce or GTX 670 GeForce GTX 760 being reviewed today is based on Kepler GPU architecture, which we all are familiar with by now. The GeForce GTX 760 is based on the 28nm GK104 GPU, the same as the GTX 670 and 680 uses, yet with a few shader clusters disabled. Still, the 10" long GeForce GTX 760 boasts a good 1152 CUDA (shader processors) cores. The product is obviously PCI-Express 3.0 ready and has a TDP of give or take a typical draw of 150~160W. But let me first show you the actual GK104 die:



NVIDIA GK104 Kepler architecture GPU used in the Geforce GTX 760 and 770
 

As far as the memory specs of the GK104 Kepler GPU are concerned, the boards will feature a 256-bit memory bus connected to 2 GB or alternatively 4 GB of GDDR5 video buffer memory. On the memory controller side of things you'll see very significant improvements as the reference memory clock is now set at 6 GHz / Gbps. This boils down to a memory bandwidth of 192GB/s on that 256-bit memory bus. Both the GPU core and the shader processor domain are clocked at 1:1, meaning both the core and shader domain clock in at a 1046 MHz base clock. With this release, NVIDIA now has the first series 700 cards on its way. The new graphics adapters are of course DirectX 11.1 ready. With Windows 8, 7 and Vista also being DX11.1 ready all we need are some games to take advantage of DirectCompute, multi-threading, hardware tessellation and the latest shader 5.0 extensions. For Kepler, NVIDIA kept their memory controllers GDDR5 compatible. Memory wise NVIDIA has nice large memory volumes due to their architecture, we pass 2 GB as standard these days.

 

The graphics architecture that is Kepler

As you can understand, the massive memory partitions, bus-width and combination of GDDR5 memory (quad data rate) allow the GPU to work with a very high framebuffer bandwidth (effective). Let's again put most of the data in a chart to get an idea and better overview of changes:

Graphics card GeForce
GTX 680
GeForce
GTX 760
GeForce
GTX 770
GeForce
GTX 780
GeForce
GTX Titan
Fabrication node 28nm 28nm 28nm 28nm 28nm
Shader processors 1536 1152 1536 2304 2688
Streaming Multiprocessors (SMX) 8 6 8 12 14
Texture Units 128 96 128 192 224
ROP units 32 32 32 48 48
Graphics Clock (Core) 1006 MHz 980 MHz 1046 863 836
Boost Processor Clock 1058 MHz 1033 MHz 1085 MHz 900 MHz 876 MHz
Memory Clock / Data rate  MHz / 6008 MHz 1502 MHz / 6008 MHz 1750 MHz / 7000 MHz 1502 MHz / 6008 MHz 1502 MHz / 6008 MHz
Graphics memory 2048 MB 2048 MB 2048 MB 3072 MB 6144 MB
Memory interface 256-bit 256-bit 256-bit 384-bit 384-bit
Memory bandwidth 192 GB/s 192 GB/s 224 GB/s 288 GB/s 288 GB/s
Power connectors 2x6-pin PEG  2x6-pin PEG 1x6-pin PEG, 1x8-pin PEG 1x6-pin PEG, 1x8-pin PEG 1x6-pin PEG, 1x8-pin PEG
Max board power (TDP) 170 Watts 170 Watts 230 Watts 250 Watts 250 Watts
Recommended Power supply 550 Watts 550 Watts 600 Watts 600 Watts 600 Watts
GPU Thermal Threshold 98 degrees C 98 degrees C 95 degrees C 95 degrees C 95 degrees C

So we talked about the core clocks, specifications and memory partitions. Obviously there's a lot more to talk through. To understand a graphics processor you simply need to break it down into smaller pieces to better understand it. Let's first look at the raw data that most of you can understand and grasp. This bit will be about the Kepler architecture, if you're not interested in g33k talk by all means please browse to the next page.
 

GeForce GTX 680

 

So above we see the GK104 block diagram that entails the Kepler architecture. Let's break it down into bits and pieces. The reference GK104 will have:

  • 1536 CUDA processors (Shader cores)
  • 192 CUDA core clusters (SM/SMX).
  • 8 geometry units
  • 4 raster Units
  • 128 Texture Units
  • 32 ROP engines
  • 256-bit GDDR5 memory bus
  • DirectX 11.1

The more important thing to focus on are the SM (block of shader processors) clusters (or SMX as NVIDIA likes to call it for the GTX 680/770, which  has 192 Shader processors. That's radically different from Fermi, the GeForce GTX 580 for example had 32 shader processors per SM cluster. 1536 : 192 = 8 Shader clusters (SMs). Let's blow up one such cluster:

 GeForce GTX 680

Above the block diagram for a single Shader processor cluster, aka SM or SMX as NVIDIA now calls it. The new SMX has quite a bit more bite in terms of shader, texture and geometry processing. 192 CUDA cores, that's six times the number of cores per SM opposed to Fermi. Now, at the end of the pipeline we run into the ROP (Raster Operation) engine and the GTX 680 again has 32 engines for features like pixel blending and AA. There's a total of 128 texture filtering units available for the GeForce GTX 680. The math is simple here, each SM has 16 texture units tied to it.

  • GeForce GTX 580 has 16 SMs X 4 Texture units = 64
  • GeForce GTX 680 & 770 have 8 SMs X 16 Texture units = 128
  • GeForce GTX 760 have 6 SMs X 16 Texture units = 96

Above the GK104 host interface - The Gigathread engine, four GPCs, four memory controllers, the ROP partitions, a 768 KB L2 cache. Each GPC has eight polymorph engines - ROP partitions are nearby to the L2 cache, each shader cluster is then tied to L1 and a shared L2 cache. Shading performance is going be increased quite bit, geometry performance will get a nice boost as well. NVIDIA is using 64KB Shared Memory/L1 per SMX – please note that they have a 16/48 – 48/16 ratio here for graphics/compute, as before with Fermi. For L2, 128KB per 64-bit memory controller. So that adds up to 512KB L2. In regards to architectural changes, on top of the pipeline NVIDIA has now added new Polymorph 2.0 (world space processing) engines and raster (screen space processing) engines; they act like a mini CPU really.




25 pages 1 2 3 4 next »



Related Articles
Palit GeForce GTX 1630 4GB Dual review
NVIDIA has released a budget series graphics card, don't expect flying framerates, but a fun little card for entry-level gaming. Meet the GeForce GTX 1630 4GB from Palit, in a DUAL fan version....

Palit GeForce RTX 3090 Ti GameRock OC review
Palit offers the GeForce RTX 3090 GameRock OC edition graphics card, which we review today. The 'Ti' edition of the GeForce RTX 3090 has been in the works for quite some time. The flagship series h...

Palit GeForce RTX 3050 DUAL OC review
We test the NVIDIA GeForce RTX 3050, a new high-end graphics card. In specific the Palit Dual OC model has 8GB of memory 2560 Shader processors and a factory boost speed of 1822 MHz (1770 MHz referenc...

Palit GeForce RTX 3070 Ti GamingPRO review
Palit joins the review queue with a product aimed to stick at or even below the FE editions MSRP. The reference clocked Gaming PRO graphics card has a beefy cooler, but other than that was design to r...

© 2022