Developer Daily

Top Stories

Hit Score 95
GitHub

DeepGEMM: High-Performance FP8 GPU Kernels

DeepSeek releases efficient CUDA kernels for FP8 GEMM with fine-grained scaling, drastically optimizing AI training and inference on NVIDIA hardware.

CUDAAIPerformance

More Updates