-
Notifications
You must be signed in to change notification settings - Fork 5.6k
2018 05 30
Tao Luo edited this page Dec 9, 2019
·
1 revision
- Automatical Differentiation with the Tape: https://github.com/PaddlePaddle/Paddle/pull/11019
- Fix build related documents after scripts clean up
- Set up CI for paddlepaddle/PARL repo ( Travis and Teamcity work now)
- Fix dependencies for test_paddle_inference_api_impl
- Fix teamcity ccache
- inference engine:
- refine fluid deploy on teamcity: https://github.com/PaddlePaddle/Paddle/pull/10904
- obtain the correct git commit id in teamcity with @yanxu https://github.com/PaddlePaddle/Paddle/pull/10991
- fix compiler error when do not have TensorRT library https://github.com/PaddlePaddle/Paddle/pull/11051
- MKLDNN:
- request for the static library of MKLML with intel @huying.
- help OCR to fix (doing): when use fluid version with MKLML shared library, other CPU jobs (with MKL static libary) on the same machine core, other GPU jobs become slower.
- code review:
- mkldnn
- [merge] Blas optimized elementwise_add forward and backward passes (10% speedup on elemtwise_add op of OCR CRNN_CTC model): https://github.com/PaddlePaddle/Paddle/pull/10913
- [merge] Top K algorithm parallel version: https://github.com/PaddlePaddle/Paddle/pull/10941
- [merge] Withdraw MKLDNN Mul operator: https://github.com/PaddlePaddle/Paddle/pull/10703#pullrequestreview-124659139
- https://github.com/PaddlePaddle/Paddle/pull/10934
- https://github.com/PaddlePaddle/Paddle/pull/10988
- https://github.com/PaddlePaddle/Paddle/pull/10841
- mkldnn
- Move sync_mode device ctx from grpc server
-
Test ReacordIO Reader with Resnet50(P40):
- 1 GPU:139 imgs/sec 8 GPU:896 imgs/sec
- speed-up ratio: 6.44
-
Training Data Preprocessor:
-
A preprocessing op(random crop op):
- Slice op.
- Add shape op to get the shape of a variable.
- Make bilinear_interp_op support attrs from the input.
- Add python wrapper for gather op.
- Discussion of polygon NMS op :
- Review:
- aws integration with CE https://github.com/PaddlePaddle/paddle-ce-latest-kpis/pull/27
- now it supports multi cluster config to work with KPI matrix introduced by #10944
- cluster spec chunk execution to make the most use of aws quota
- aws_runner now works with /benchmark/fluid
- going to integrate /benchmark/fluid to CE
- memory optimize
- [WIP]a MxNet like test of memory optimize
- accelerate the reduce op compile
- reduce the Se-ResNeXt-152 ops
- fix build scripts
- [WIP] Reading face detection caffe code
- issue
- PR
- NMT:
- Fix and enhance beam_search_op and beam_search_decode_op (WIP).
- Continue the work on beamsearch decoder in Transformer (WIP).
- Validate the beam-search related fix by comparing with original python beam searh
- Incremental Learning Support
- update fluid Train API param_path to checkpoint_config
- checkpoint feature on Paddle Cloud
- Face Detection:
- Fixed-point quantization:
- Develop a fake dequantized op for fixed-point quantization training framework.
- Update design doc of fixed-point quantization.
- Others:
- Fix fill_constant_batch_size_like_op when input is LoDTensor.
- Check label range in cross entropy calculation.
- No Attention Model
- Training: Cannot converge on wmt14 or wmt16 dataset
- Encoder is correct now
- Decoder has some unnatural behaviors
- Inference:
- Added Accumulated Prob
- Issue caused by beam search op, fixed by Sheng Guo
- Training: Cannot converge on wmt14 or wmt16 dataset
- Attention Model
- Training: Cannot overfit small dataset (1~4 lines of data)
- performance
- merge overlap rpc send and backward ops, performance improve 20%, https://github.com/PaddlePaddle/Paddle/pull/10550
- stablity
- discuess dist train KIPs on CE, https://github.com/PaddlePaddle/Paddle/issues/10944
- bug fix, https://github.com/PaddlePaddle/Paddle/pull/10937
- bug fix, https://github.com/PaddlePaddle/Paddle/pull/10932
- feature
- support RecordIO in distributed training, https://github.com/PaddlePaddle/Paddle/pull/11066
- PR review
- finish resnet50 performance comparations with TF
- transpiler cleanup, put weight decay and clipping on pserver:
- Discussion with @weibao about rpc interface, and a prototype:
- Look into TF codes to see how it deals with performance issue
- Testing LARS on imgnet
- Analysis of the reasons for the low acceleration ratio of PE 8 cards on Resnext152 with @yuyang
- PR
- Review
-
distributed trianing
- distribute lookup table support async training https://github.com/PaddlePaddle/Paddle/pull/10973
- add
auto_grown_mutex
for selected_rows https://github.com/PaddlePaddle/Paddle/pull/11012 - run dist lookup table on dlnel cluster[Done]
-
AbacusToPaddle
- build paddle CPP lib on centos4.3/6.3 kernel 2.x/3.x http://agroup.baidu.com/abacus2paddle/md/article/933770
- make the plan http://agroup.baidu.com/abacus2paddle/md/article/936930
- build Abacus with paddle static lib.
- fix protobuf problem(Abacus: protobuf 2.4.1 Paddle: protobuf 3.1.0)
- fix eigen build problem.(both use eigen)
-
fengchao reinforcement learning with Paddle
- operators alignment
- make plan
-
code clean
- fix inference build on mac https://github.com/PaddlePaddle/Paddle/pull/11007
-
Hsigmoid
-
Polish and add fluid_guide to fluid/get_started:
-
[WIP]paddlepaddle jupyter notebook raises error
- Inference API implementation
- https://github.com/PaddlePaddle/Paddle/pull/11036
- https://github.com/PaddlePaddle/Paddle/pull/11009
- https://github.com/PaddlePaddle/Paddle/pull/10983
- profiler API and dist benchmark fix
- https://github.com/PaddlePaddle/Paddle/pull/11036
-
CE frameworks
- issue
- pr
- https://github.com/PaddlePaddle/continuous_evaluation/pull/59
- https://github.com/PaddlePaddle/continuous_evaluation/pull/60
- https://github.com/PaddlePaddle/continuous_evaluation/pull/55
- https://github.com/PaddlePaddle/continuous_evaluation/pull/58
- https://github.com/PaddlePaddle/continuous_evaluation/pull/54
- wiki
-
benchmark
- environment built for Random model testing (Monitoring illegal access and convergence of model)
- NMT
- running transformer with mutil-trainers and mutil-psservers on one machine
- try to running transformer on paddlecloud
- AbacusToPaddle
- familiar with look-up table code
- Add create LoDTensor from list option and simplify recommender book example:
- Modify machine translation example using new LoDTensor API
- Modify RNN encoder decoder example using new LoDTensor API
- Fix float16 demo location issue
- Review
-
PR:
- Fix order of attributes in label_semantic_roles example: https://github.com/PaddlePaddle/Paddle/pull/10993
- consistent attribute naming in new API: https://github.com/PaddlePaddle/Paddle/pull/10947
- train() function API in fit_a_line example: https://github.com/PaddlePaddle/Paddle/pull/11020
-
Review:
- new LoDTensor API: #10886, #10888, #10889, #10946, #11018, #11021
- Translation with new API: https://github.com/PaddlePaddle/Paddle/pull/10895
- Re-write book chapter with new API: https://github.com/PaddlePaddle/book/pull/524
- Finished re-writing book tests to use new Fluid API
- Re-writing PaddlePaddle book, demo code and Jupyter notebook based on new Fluid API
- Reviewed PRs and created issues:
Working with Daming, Nicky and Sid on the book documentation. Currently all changes will go to https://github.com/PaddlePaddle/book/tree/high-level-api-branch first to ensure stability.
[WIP] working on Recognize Digit example.
-
PR:
- Update sentiment analysis text: https://github.com/PaddlePaddle/PaddlePaddle.org/pull/482
-
Issues:
- Unable to import paddle fluid in latest docker: https://github.com/PaddlePaddle/Paddle/issues/11059
PR reviews:
- https://github.com/PaddlePaddle/Paddle/pull/11056#event-1654718201
- https://github.com/PaddlePaddle/book/pull/524#pullrequestreview-124636834
- Rewrite Book chapter 5 Recommendation System documentation https://github.com/PaddlePaddle/book/pull/526
- Simply Machine translation function name https://github.com/PaddlePaddle/Paddle/pull/11056
- Discussed with Daming, Jeff, Siddarth for book documentation plan
- Follow up with Longfei for Machine Translation infer issue
- Review:
- Initial version of menu editor, and several updates and simplification to new documentation building system https://github.com/PaddlePaddle/PaddlePaddle.org/pull/481