Guru3D.com

Nvidia Launches Pascal based Tesla P4 and P40 accelerators

by Hilbert Hagedoorn on: 09/13/2016 04:00 PM

NVIDIA today unveiled the latest additions to its Pascal architecture-based deep learning platform, with new NVIDIA Tesla P4 and P40 GPU accelerators and new software that deliver massive leaps in efficiency and speed to accelerate inferencing production workloads for artificial intelligence services.

Modern AI services such as voice-activated assistance, email spam filters, and movie and product recommendation engines are rapidly growing in complexity, requiring up to 10x more compute compared to neural networks from a year ago. Current CPU-based technology isn't capable of delivering the real-time responsiveness required for modern AI services, leading to a poor user experience.

The Tesla P4 and P40 are specifically designed for inferencing, which uses trained deep neural networks to recognize speech, images or text in response to queries from users and devices. Based on the Pascal architecture, these GPUs feature specialized inference instructions based on 8-bit (INT8) operations, delivering 45x faster response than CPUs [1] and a 4x improvement over GPU solutions launched less than a year ago. [2]

The Tesla P4 delivers the highest energy efficiency for data centers. It fits in any server with its small form-factor and low-power design, which starts at 50 watts, helping make it 40x more energy efficient than CPUs for inferencing in production workloads. [3] A single server with a single Tesla P4 replaces 13 CPU-only servers for video inferencing workloads, [4] delivering over 8x savings in total cost of ownership, including server and power costs.

The Tesla P40 delivers maximum throughput for deep learning workloads. With 47 tera-operations per second (TOPS) of inference performance with INT8 instructions, a server with eight Tesla P40 accelerators can replace the performance of more than 140 CPU servers. [5] At approximately $5,000 per CPU server, this results in savings of more than $650,000 in server acquisition cost.
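
The acquisition-cost figure can be checked with simple arithmetic. The sketch below uses only the numbers quoted above (140 CPU servers at roughly $5,000 each, and quoted savings of $650,000). The price of the eight-GPU server is not given in the source, so the code only reports the gap between the gross CPU server cost and the quoted savings, without assuming what that gap represents.

```python
# Figures quoted in the article; both are stated as approximate or lower bounds.
CPU_SERVERS_REPLACED = 140     # "more than 140 CPU servers"
COST_PER_CPU_SERVER = 5_000    # "approximately $5,000 per CPU server"
QUOTED_SAVINGS = 650_000       # "more than $650,000"


def gross_cpu_cost(servers: int, unit_cost: int) -> int:
    """Total acquisition cost of the CPU-only servers being replaced."""
    return servers * unit_cost


gross = gross_cpu_cost(CPU_SERVERS_REPLACED, COST_PER_CPU_SERVER)

# The source does not explain the difference between the gross cost and
# the quoted savings, so it is only reported here, not interpreted.
gap = gross - QUOTED_SAVINGS

print(f"Gross CPU server cost: ${gross:,}")
print(f"Gap to quoted savings: ${gap:,}")
```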

"With the Tesla P100 and now Tesla P4 and P40, NVIDIA offers the only end-to-end deep learning platform for the data center, unlocking the enormous power of AI for a broad range of industries," said Ian Buck, general manager of accelerated computing at NVIDIA. "They slash training time from days to hours. They enable insight to be extracted instantly. And they produce real-time responses for consumers from AI-powered services."

Software Tools for Faster Inferencing 
Complementing the Tesla P4 and P40 are two software innovations to accelerate AI inferencing: NVIDIA TensorRT and the NVIDIA DeepStream SDK.

TensorRT is a library created for optimizing deep learning models for production deployment that delivers instant responsiveness for the most complex networks. It maximizes throughput and efficiency of deep learning applications by taking trained neural nets -- defined with 32-bit or 16-bit operations -- and optimizing them for reduced precision INT8 operations.
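
To make the idea of reduced-precision INT8 operations concrete, here is a minimal sketch of symmetric linear quantization, a generic textbook technique. It is not TensorRT's API or its actual calibration method, which the article does not describe; the function names and sample weights are hypothetical.

```python
def quantize_int8(values):
    """Map a non-empty list of floats to integers in [-127, 127].

    Uses one shared scale factor chosen so that the largest
    magnitude in the input maps to 127.
    """
    max_abs = max(abs(v) for v in values)
    scale = max_abs / 127 if max_abs > 0 else 1.0
    quantized = [max(-127, min(127, round(v / scale))) for v in values]
    return quantized, scale


def dequantize_int8(quantized, scale):
    """Recover approximate float values from the INT8 representation."""
    return [q * scale for q in quantized]


# Hypothetical 32-bit weights from a trained network.
weights = [0.82, -1.27, 0.05, 0.0, 0.4]

q, scale = quantize_int8(weights)
restored = dequantize_int8(q, scale)
max_error = max(abs(w - r) for w, r in zip(weights, restored))

print("INT8 values:", q)
print("Scale:", scale)
print("Largest round-trip error:", max_error)
```

The round-trip error of each value is bounded by about half the scale factor, which is the precision given up in exchange for cheaper 8-bit arithmetic.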

NVIDIA DeepStream SDK taps into the power of a Pascal server to simultaneously decode and analyze up to 93 HD video streams in real time compared with seven streams with dual CPUs. [6] This addresses one of the grand challenges of AI: understanding video content at-scale for applications such as self-driving cars, interactive robots, filtering and ad placement. Integrating deep learning into video applications allows companies to offer smart, innovative video services that were previously impossible to deliver.
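
As a quick sanity check on the stream counts quoted above, the ratio can be computed directly. The result is close to the 13-server replacement figure given for the Tesla P4 earlier in the article, although the source does not say whether both numbers come from the same benchmark.

```python
# Stream counts quoted in the article.
PASCAL_SERVER_STREAMS = 93   # HD streams decoded and analyzed on a Pascal server
DUAL_CPU_STREAMS = 7         # HD streams handled by dual CPUs

stream_ratio = PASCAL_SERVER_STREAMS / DUAL_CPU_STREAMS

print(f"Pascal server handles {stream_ratio:.1f}x as many streams")
```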

Leap Forward for Customers
NVIDIA customers are delivering increasingly innovative AI services that require the highest compute performance.

"Delivering simple and responsive experiences to each of our users is very important to us," said Greg Diamos, senior researcher at Baidu. "We have deployed NVIDIA GPUs in production to provide AI-powered services such as our Deep Speech 2 system and the use of GPUs enables a level of responsiveness that would not be possible on un-accelerated servers. Pascal with its INT8 capabilities will provide an even bigger leap forward and we look forward to delivering even better experiences to our users."

Specifications
Specifications of the Tesla P4 and P40 GPUs include:

Specification                              Tesla P4               Tesla P40
Single Precision TFLOPS*                   5.5                    12
INT8 TOPS* (Tera-Operations Per Second)    22                     47
CUDA Cores                                 2,560                  3,840
GPU GDDR5 Memory                           8 GB                   24 GB
Memory Bandwidth                           192 GB/s               346 GB/s
Power                                      50 Watt (or higher)    250 Watt

* With boost clock on
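
The specification figures can be combined into two derived numbers: the INT8-to-single-precision throughput ratio and INT8 throughput per watt. The sketch below is only arithmetic on the listed values. It takes the single-precision row as teraFLOPS and uses the listed power ratings, which for the Tesla P4 is the 50 watt starting configuration, so the per-watt figure is a best case rather than a measured result.

```python
# Values copied from the specification list above.
SPECS = {
    "Tesla P4": {"fp32_tflops": 5.5, "int8_tops": 22, "watts": 50},
    "Tesla P40": {"fp32_tflops": 12, "int8_tops": 47, "watts": 250},
}


def derived_metrics(spec):
    """Return the INT8/FP32 throughput ratio and INT8 TOPS per watt."""
    return {
        "int8_vs_fp32": spec["int8_tops"] / spec["fp32_tflops"],
        "int8_tops_per_watt": spec["int8_tops"] / spec["watts"],
    }


metrics = {name: derived_metrics(spec) for name, spec in SPECS.items()}

for name, m in metrics.items():
    print(
        f"{name}: {m['int8_vs_fp32']:.2f}x INT8 vs FP32, "
        f"{m['int8_tops_per_watt']:.3f} INT8 TOPS per watt"
    )
```

On these numbers, both cards show roughly a 4x throughput gain from INT8 over single precision, and the P4 delivers more than twice the INT8 throughput per watt of the P40.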

Availability
The NVIDIA Tesla P4 and P40 are planned to be available in November and October, respectively, in qualified servers offered by ODM, OEM and channel partners.
