tensor-convolution-kernel-on-GPU Tensor kernel computation optimizations on GPU for different shape of input data