There seems to be a tensor shape mismatch issue in the MultiScale Vision Transformer (MViT). Does anyone know how to resolve it?
https://github.com/facebookresearch/mvit/issues/22