News

Nvidia Launches CUDA 9 at GTC 2017 in China

Nvidia’s CUDA 9 is now available.

Setting a new milestone in the HPC/ AI industries, NVIDIA has recently announced the availability of CUDA 9. The news was shared at GTC 2017 in China, and the release will likely spearhead support for new architectures. It’s worth noting that CUDA 9 has been available in release candidate form for a while now. However, this is the first time that we’ve been able to see the GA mark of the new tooling. Apart from new architectures, libraries optimized for brand new applications might also become available soon. You must be keen to find out about CUDA 9’s main features, though.

CUDA 9 features.

According to the NVIDIA developer site, the main highlights of the new platform are the following:

  • Speed up high-performance computing (HPC) and deep learning apps with new GEMM kernels in cuBLAS.
  • Execute image and signal processing apps faster with performance optimizations across multiple GPU configurations in cuFFT and NVIDIA Performance Primitives.
  • Solve linear and graph analytics problems common in HPC with new algorithms in cuSOLVER and nvGRAPH.
  • Express rich parallel algorithms with threads from sub-tiles to warps, blocks, and grids.
  • Manage and reuse threads efficiently within an application with new API and function primitives.
  • Optimize and pre-fetch memory access by identifying source code causing page faults in unified memory.
  • Inspect unified memory performance bottlenecks with new event filters based on virtual address, migration reason and page fault access type.

Moreover, several Volta and NVLink support items are also included:

  • Replace warp-synchronous programming with robust programming model on Kepler architecture and above.
  • Execute AI applications faster with Tensor Cores performing 5X faster than Pascal GPUs.
  • Scale multi-GPU applications with next-generation NVLink delivering 2X throughput of prior generation.
  • Increase GPU utilization with Volta Multi-Process Service (MPS).
  • Profile PCIe usage by analyzing bandwidth of memory transfers, latency, and comparison with NVLink.

Are you looking forward to Nvidia’s upcoming technologies?

Cernescu Andrei

Candrei is a writer for eTeknix who loves the latest technology news and gaming.

Disqus Comments Loading...

Recent Posts

Electronic Arts Titles Played for Over 11 Billion Hours in 2024

Electronic Arts (EA) announced today that its games were played for over 11 billion hours…

1 day ago

Just 15% of Steam Gaming Time in 2024 Was Spent on New Releases

Steam's annual end-of-year recap, Steam Replay, provides fascinating insights into gamer habits by comparing individual…

1 day ago

STALKER 2 Gets Massive 110GB Patch With 1800+ Fixes

GSC GameWorld released a major title update for STALKER 2 this seeking, bringing the game…

2 days ago

Intel Unveils Core 200H Processors Based on the Previous Raptor Lake Refresh

Without any formal announcement, Intel appears to have revealed its new Core 200H series processors…

3 days ago

Ubisoft Reportedly Developing a New Quadruple A Game

Ubisoft is not having the best of times, but despite recent flops, the company still…

3 days ago

STALKER 2: Heart of Chornobyl Update 1.1 Fixes 1,800 Issues and Revamps A-Life 2.0

If you haven’t started playing STALKER 2: Heart of Chornobyl yet, now might be the…

3 days ago