fix CUDA launch bounds usage
fix #191 lauch bounds must be placed before the return type but after the template paramater
Please register or sign in to comment
fix #191 lauch bounds must be placed before the return type but after the template paramater