GPU Architecture
|
NVIDIA Turing
|
NVIDIA Turing Tensor Cores
|
320
|
NVIDIA CUDA® Cores
|
2,560
|
Single-Precision
|
8.1 TFLOPS
|
Mixed-Precision (FP16/FP32)
|
65 TFLOPS
|
INT8
|
130 TOPS
|
INT4
|
260 TOPS
|
GPU Memory
|
16 GB GDDR6
300 GB/sec
|
ECC
|
Yes
|
Interconnect Bandwidth
|
32 GB/sec
|
System Interface
|
x16 PCIe Gen3
|
Form Factor
|
Low-Profile PCIe
|
Thermal Solution
|
Passive
|
Compute APIs
|
CUDA, NVIDIA TensorRT™,ONNX
|