GPU Architecture NVIDIA Turing
NVIDIA Turing Tensor Cores 320
NVIDIA CUDA® Cores 2,560
Single-Precision 8.1 TFLOPS
Mixed-Precision (FP16/FP32) 65 TFLOPS
INT8 130 TOPS
INT4 260 TOPS
GPU Memory 16 GB GDDR6 300 GB/s
ECC Yes
Interconnect Bandwidth 32 GB/sec
System Interface x16 PCIe Gen3
Form Factor Low-Profile PCIe
Thermal Solution Passive
Compute APIs CUDA, NVIDIA TensorRT™, ONNX