Skip to content
This repository has been archived by the owner on Nov 17, 2023. It is now read-only.

Question about implementation of gradient compression strategies #18447

Answered by szhengac
shuo-ouyang asked this question in General
Discussion options

You must be logged in to vote

Recently, we provide a gradient compression api based on BytePS. See bytedance/byteps#225 and https://github.com/bytedance/byteps/blob/master/docs/gradient-compression.md. Currently, it supports 1-bit, top-k, random-k, and dithering. The example of training on ImageNet with mxnet is provided in https://github.com/bytedance/byteps/blob/master/example/mxnet/train_gluon_imagenet_byteps_gc.py

Replies: 2 comments 1 reply

Comment options

You must be logged in to vote
0 replies
Comment options

You must be logged in to vote
1 reply
@shuo-ouyang
Comment options

Answer selected by szha
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
2 participants
Converted from issue

This discussion was converted from issue #18447 on September 05, 2020 19:33.