uvos
CUDA/HIP: fix ssm_scan on devices where warp size is not 32 (llama/14196)
adf6b4b