Guru3D.com
  • HOME
  • NEWS
    • Channels
    • Archive
  • DOWNLOADS
    • New Downloads
    • Categories
    • Archive
  • GAME REVIEWS
  • ARTICLES
    • Rig of the Month
    • Join ROTM
    • PC Buyers Guide
    • Guru3D VGA Charts
    • Editorials
    • Dated content
  • HARDWARE REVIEWS
    • Videocards
    • Processors
    • Audio
    • Motherboards
    • Memory and Flash
    • SSD Storage
    • Chassis
    • Media Players
    • Power Supply
    • Laptop and Mobile
    • Smartphone
    • Networking
    • Keyboard Mouse
    • Cooling
    • Search articles
    • Knowledgebase
    • More Categories
  • FORUMS
  • NEWSLETTER
  • CONTACT

New Reviews
Fractal Design Pop Air RGB Black TG review
Palit GeForce GTX 1630 4GB Dual review
FSP Dagger Pro (850W PSU) review
Razer Leviathan V2 gaming soundbar review
Guru3D NVMe Thermal Test - the heatsink vs. performance
EnGenius ECW220S 2x2 Cloud Access Point review
Alphacool Eisbaer Aurora HPE 360 LCS cooler review
Noctua NH-D12L CPU Cooler Review
Silicon Power XPOWER XS70 1TB NVMe SSD Review
Hyte Y60 chassis review

New Downloads
Prime95 download version 30.9 build 1
Intel ARC graphics Driver Download Version: 30.0.101.1743
AMD Radeon Software Adrenalin 22.6.1 WHQL driver download
GeForce 516.59 WHQL driver download
Media Player Classic - Home Cinema v1.9.22 Download
AMD Chipset Drivers Download v4.06.10.651
CrystalDiskInfo 8.17 Download
AMD Radeon Software Adrenalin 22.6.1 Windows 7 driver download
ReShade download v5.2.2
HWiNFO Download v7.26


New Forum Topics
Ubisoft is cutting off online gameplay for 15 games, players will no longer have access to purchased DLC WD Gold 1TB in RAID does speed transfer decrease ? Can you measure if a CPU was used before? Rtx 3080 core clock 0 mhz... GPU core clock fluctuates to 0 MHz from 210 on idle In collaboration with Alphacool, ELSA releases RTX 3090 Liquid Cooled HyperX Releases Impact SO-DIMMs and FURY SSD 516.59 - Clean Version AMD Might Release and Add Ryzen 5 5600X3D, Ryzen 9 5900X3D (X3D) procs HighPoint Releases 8-port USB 3.2 Gen.2 x 2 expansion card (20 Gbps)




Guru3D.com » Review » GeForce RTX 2080 Ti Founders review » Page 3

GeForce RTX 2080 Ti Founders review - Three primary architectural blocks

by Hilbert Hagedoorn on: 09/19/2018 03:00 PM [ 5] 279 comment(s)

Tweet

Three primary architectural blocks

To understand the Turing architecture a little better, we need to look at the GPU as something holding three architectural blocks. The traditional shader engine you know from last-gen GPUs like Pascal, the Tensor cores you know from Volta and the new RT cores for the hardware acceleration of DXR (DirectX Raytracing). The symbiosis of the three types of processors form what is called the Turing architecture.

 

 

Shader processors

We'll go a little more in-depth on the shader engine on the next pages, but the first and foremost important block is the traditional shader engine that we all know of. A full Turing TU102 graphics processor houses 4608 shader processors. For the GeForce RTX 2080 Ti that is 4352 cores activated, for the RTX 2080 that is 2944 cores. We will not be surprised to see a GeForce RTX Titan down the line of course, with the full 4608 shader processors activated.

 

 

A fully enabled Turing processor will have 96 ROPs, 288 Texture units and offers a 384-bit memory bus. The engine has been overhauled and offers a new superscalar architecture offering concurrent FP & INT, execution datapaths, Enhanced L1 caches. Caches wise there is a dual L1 cache and shared cache L2 6MB. The L1 bandwidth has been doubled compared to Pascal making it more efficient and thus faster. NVIDIA mentions that the shading performance for Turing can be up to 50% faster compared to Pascal.

 

 

Tensor processors

You've seen these introduced in the Volta architecture. Tensor cores are optimized processing blocks that serve a very specific function, they are exceptionally good with anything AI and Deep learning. And on the biggest Turing processor, NVIDIA adds 576 Tensor cores. What nobody really expected though was that these would be enabled for the consumer products. So why would NVIDIA inject tensor cores in a consumer GPU? Actually, it is quite simple, Tensor cores can solve substandard problems and challenges that normally take ages to math. You could use the Tensor cores to have enemy troops in a game driven by far more intelligent AI, imagine clever NPCs learning what you are doing and react to that. As an example, TU102 GPU contains 576 Tensor cores: eight per SM and two per each processing block within an SM. Each Tensor Core can perform up to 64 floating point fused multiply-add (FMA) operations per clock using FP16 inputs. Eight Tensor Cores in an SM perform a total of 512 FP16 multiply and accumulate operations per clock, or 1024 total FP operations per clock. The new INT8 precision mode works at double this rate, or 2048 integer operations per clock.

 

 

You could also use Tensor cores for things like anti-cheat detection, or material enhancement. However, I'll keep things simple and relative towards gaming though as NVIDIA added a function called DLSS. And that is the primary use and intention for the Tensor cores at the time of writing. We'll talk about its biggest primary (initial) feature 'Deep Learning Super-Sampling' on the following pages though.

 

Raytracing (RT) processors

It has been a dream to bring ray tracing to gaming for NVIDIA and as they mentioned, developers also have that same wish. Roughly ten years ago NVIDIA started first working on solving the Ray Tracing challenge. As you guys know, Microsoft announced DXR a while ago, short for DirectX Raytracing. Over the DXR API developers can utilize advanced functions that are built with actual raytracing. Nvidia's new video cards can accelerate that API with the RT cores and its RTX technology. DXR is a step forward towards broader use of ray tracing and in the end, better, more real in-game image quality. Building the capability into DirectX will enable a wider swath of developers to experiment with a technology previously the purview of high-end content creation applications. Companies including Epic, Remedy, and Electronic Arts have already begun experimenting with adding real-time raytracing capabilities to their game engines and with the launch of Windows 10 Fall Edition (2018), DXR will be included into the distribution. 

So what is raytracing really? Well, with raytracing, you basically are mimicking the behavior, looks, and feel of a real-life environment in a computer generated 3D scene. Wood looks like wood, however, the leaking resin will shine and refract it's environment and lighting accurately. Fur on animals looks like actual fur, and things like glass and waves of water get refracted as glass based on the surroundings and lights/rays.

Can true 100% Ray tracing be applied in games? Short answer, no, partially. As you have just read and hopefully remember, Microsoft has released an extension towards DirectX; DirectX Raytracing (DXR). Nvidia, on their end, announced RTX (the accumulation of hardware and software Raytracing). The hardware and software combined that can make use of the DirectX Raytracing API. NVIDIA has dedicated hardware inbuilt into their GPUs to accelerate certain Ray tracing features. In the first wave of launches, you are going to see a dozen or so games adding support for the new RTX enabled technology. Truth has to be said, in games, you can really notice the difference. How much of a performance effect RTX will have on games, is something we'll need to look deep into as well. But that is going to take time as not even Microsoft has released Windows 10 with the DX-R API.

So while rasterization has been the default renderer for a long time now, we can now add to that the ability to trace a ray using the same algorithm for complex and deep soft shadows, reflections and refractions. Combining rasterization and ray tracing was the way for NVIDIA to move forward where they are today. The raster and compute phases are what NVIDIA has been working on for the last decade.  Game developers have a whole new suite of techniques available to create better-looking graphics. On a more personal note though. We need to really call this Hybrid raytracing as you combine the best of both worlds.




40 pages « 2 3 4 5 next »



Related Articles
Palit GeForce GTX 1630 4GB Dual review
NVIDIA has released a budget series graphics card, don't expect flying framerates, but a fun little card for entry-level gaming. Meet the GeForce GTX 1630 4GB from Palit, in a DUAL fan version....

ASUS GeForce RTX 3080 Noctua OC review
Enjoy the silence, since who doesn't remember that tune from the 1980s? Join us as we analyze the all new GeForce RTX 3080 Noctua OC model. You can dispute its appearance and style, but the card perf...

Gigabyte GeForce RTX 3090 Ti Gaming OC review
Gigabyte has released their GeForce RTX 3090 'Ti' Gaming OC. The new flagship was fitted with faster memory, a boost frequency of 1905 MHz, more shaders, and a TGP passing 450 Watts. This review ben...

ASUS GeForce RTX 3090 Ti TUF Gaming review
It's been boiling for a while, a GeForce RTX 3090 'Ti'. The 3090 flagship series now has quicker memory, more shaders, and a TGP of 450-500 Watts. In this review, we benchmark the GeForce RTX 309...

© 2022