Can anyone help me understand the implementation of nn.Bilinear
As per the documentation, this function implements y = x1T * A * x2
taking x1 = (100,20)
, x2 = (100,30')
, assuming output_features = 50
. The matrix A
has dimensions of [50,20,30]
.
I am finding it difficult how these matrices are multiplied to get the output = [100,50]
Based on the size of x1,x2 and A
matrix, the multiplication seems incompatible as per y = x1T * A * x2 . What am I missing here?