NVIDIA L40S Specifications
| L40S | A100 80GB SXM | |
|---|---|---|
| Best For | Universal GPU for Gen AI | Highest Perf Multi-Node AI |
| GPU Architecture | NVIDIA Ada Lovelace | NVIDIA Ampere |
| FP64 | N/A | 9.7 TFLOPS |
| FP32 | 91.6 TFLOPS | 19.5 TFLOPS |
| RT Core | 212 TFLOPS | N/A |
| TF32 Tensor Core | 366 TFLOPS | 312 TFLOPS |
| FP16/BF16 Tensor Core | 733 TFLOPS | 624 TFLOPS |
| FP8 Tensor Core | 1466 TFLOPS | N/A |
| INT8 Tensor Core | 1466 TOPS | 1248 TFLOPS |
| GPU Memory | 48 GB GDDR6 | 80 GB HBM2e |
| GPU Memory Bandwidth | 864 GB/s | 2039 GB/s |
| L2 Cache | 96 MB | 40 MB |
| Media Engines | 3 NVENC(+AV1) 3 NVDEC 4 NVJPEG |
0 NVENC 5 NVDEC 5 NVJPEG |
| Power | Up to 350 W | Up to 400 W |
| Form Factor | 2-slot FHFL | 8-way HGX |
| Interconnect | PCle Gen4 x 16: 64 GB/s | PCle Gen4 x 16: 64 GB/s |

