Thanks for your impressive work! After reading your code and paper, I have a question about the fusion design. SENet implements channel-wise self-attention with global pooling followed by two convolutions that produce a C×1 channel descriptor. In your paper, you instead create two matrices to bring the dimension back to C×1. Why? Could two single convolutions be used to change the dimension to C×1 instead? That looks more efficient.
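To make the comparison concrete, here is a minimal sketch of the SENet-style channel gate referred to above, written in plain Python so it runs without any framework. It is only an illustration under assumptions: the function names (`global_avg_pool`, `linear`, `se_gate`, `apply_gate`), the reduction ratio `r`, and the random weights are all hypothetical, and it does not reproduce the fusion module from the paper. It relies on the fact that a 1×1 convolution applied to a 1×1 pooled map is the same as a matrix-vector product.

```python
import math
import random

def global_avg_pool(x):
    """Squeeze: x is a feature map [C][H][W]; returns one mean per channel."""
    return [sum(sum(row) for row in ch) / (len(ch) * len(ch[0])) for ch in x]

def linear(w, v):
    """w: [out][in], v: [in] -> [out]. Equivalent to a 1x1 conv on a 1x1 map."""
    return [sum(wi * vi for wi, vi in zip(row, v)) for row in w]

def se_gate(x, w1, w2):
    """Excitation: C -> C // r (ReLU) -> C (sigmoid), giving a C x 1 descriptor."""
    z = global_avg_pool(x)
    h = [max(0.0, a) for a in linear(w1, z)]
    return [1.0 / (1.0 + math.exp(-a)) for a in linear(w2, h)]

def apply_gate(x, s):
    """Rescale every channel of x by its gate value."""
    return [[[v * g for v in row] for row in ch] for ch, g in zip(x, s)]

# Hypothetical sizes and random weights, for illustration only.
random.seed(0)
C, H, W, r = 4, 2, 2, 2
x = [[[random.random() for _ in range(W)] for _ in range(H)] for _ in range(C)]
w1 = [[random.uniform(-1, 1) for _ in range(C)] for _ in range(C // r)]
w2 = [[random.uniform(-1, 1) for _ in range(C // r)] for _ in range(C)]

s = se_gate(x, w1, w2)   # C gate values, each in (0, 1)
y = apply_gate(x, s)     # same shape as x
```

With these two small maps the gate costs roughly 2·C·(C / r) multiplications per sample, which is the cost I am comparing against the two-matrix design.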