-
Notifications
You must be signed in to change notification settings - Fork 11
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
embedding group #16
Comments
During training, TorchEasyRec's During inference, TorchEasyRec's |
for the training, how the fusion helps? more specifically, at low level, where the benefit comes from? e.g., does the fusion helps on the better utilization on the communication in the forward pass by better batching the communication data? does the fusion helps on reduce the number of cuda kernel launching on pooling in forward? please help provide details or point to the detailed documentations/papers. |
the fusion helps on reduce the number of embedding lookup cuda kernel launching and better utilization on the communication. |
Can you explain how the embedding group can contribute to the better performance of embedding layer?
The text was updated successfully, but these errors were encountered: