NVIDIA interview question

Implement a convolution kernel. Include padding, and write the exact code.
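
One possible answer, as a minimal sketch rather than a definitive solution. The question does not specify the dimensionality, the padding mode, the stride, or the language, so everything here is an assumption: 2D single-channel input in row-major order, symmetric zero padding, stride 1, and the deep-learning convention of not flipping the filter (strictly a cross-correlation). The names `conv2d_at` and `conv2d` are hypothetical. It is written as plain C++ so it can be checked on a CPU; `conv2d_at` is the work one thread would do in a one-thread-per-output-pixel GPU kernel, with the two outer loops replaced by the thread's block and thread indices.

```cpp
#include <cstddef>
#include <vector>

// Hypothetical helper: computes the output element at (oy, ox).
// Assumes row-major storage, stride 1, zero padding of `pad` on every side,
// and no filter flip (cross-correlation, as most DL frameworks define it).
float conv2d_at(const std::vector<float>& in, int H, int W,
                const std::vector<float>& k, int KH, int KW,
                int pad, int oy, int ox) {
    float acc = 0.0f;
    for (int ky = 0; ky < KH; ++ky) {
        for (int kx = 0; kx < KW; ++kx) {
            // Map the output coordinate back into the unpadded input.
            const int iy = oy + ky - pad;
            const int ix = ox + kx - pad;
            // Taps that fall in the padded border contribute zero, so skip them.
            if (iy >= 0 && iy < H && ix >= 0 && ix < W) {
                acc += in[static_cast<std::size_t>(iy) * W + ix] *
                       k[static_cast<std::size_t>(ky) * KW + kx];
            }
        }
    }
    return acc;
}

// Hypothetical driver: loops over every output element. On a GPU these two
// loops would become the launch grid, with a bounds check on (oy, ox).
std::vector<float> conv2d(const std::vector<float>& in, int H, int W,
                          const std::vector<float>& k, int KH, int KW,
                          int pad) {
    // Output size for stride 1: H + 2*pad - KH + 1 (same for width).
    const int OH = H + 2 * pad - KH + 1;
    const int OW = W + 2 * pad - KW + 1;
    if (OH <= 0 || OW <= 0) {
        return {};  // Filter is larger than the padded input.
    }
    std::vector<float> out(static_cast<std::size_t>(OH) * OW);
    for (int oy = 0; oy < OH; ++oy) {
        for (int ox = 0; ox < OW; ++ox) {
            out[static_cast<std::size_t>(oy) * OW + ox] =
                conv2d_at(in, H, W, k, KH, KW, pad, oy, ox);
        }
    }
    return out;
}
```

With a 3x3 filter, `pad = 1` keeps the output the same size as the input, and `pad = 0` gives the "valid" output only. The boundary check inside the inner loop is what implements the padding: no padded copy of the input is ever allocated. For a true mathematical convolution, index the filter as `k[(KH - 1 - ky) * KW + (KW - 1 - kx)]` instead.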