-
Notifications
You must be signed in to change notification settings - Fork 5.6k
2018 05 23
- Multi-GPUs stable enhancement
- CI Speed up
- Several enhancements:
- Add SERIAL keyword to cmake
cc_test
andnv_test
- BUGFIX https://github.com/PaddlePaddle/Paddle/pull/10709
- switch scope for each unit test
- Add SERIAL keyword to cmake
- mkldnn:
- refine mkldnn cmake with official commit id: https://github.com/PaddlePaddle/Paddle/pull/10736
- inference engine:
- add version and cmakecache in inference_lib:https://github.com/PaddlePaddle/Paddle/pull/10807
- change CMAKE_INSTALL_PREFIX in inference_lib_dist to FLUID_INSTALL_DIR: https://github.com/PaddlePaddle/Paddle/pull/10827
- auto build and deploy fluid.tgz on TeamCity with @yanxu
- code review:
- mkldnn:
- add mkldnn to paddle lib: https://github.com/PaddlePaddle/Paddle/pull/10705
- Reuse of pooling mkldnn primitives:https://github.com/PaddlePaddle/Paddle/pull/10747
- Update activations for MKL-DNN: https://github.com/PaddlePaddle/Paddle/pull/10597
- Add cpp trainer lib and demo: https://github.com/PaddlePaddle/Paddle/pull/10681
- mkldnn:
-
Fixed roi_pool op bug
-
Convert Caffe pretrained vgg model to Paddle model
-
Expose bilinear_interp and prior_box op into Python API
-
CustomReader for data preprocessing:
-
New Fluid API (trainer.test()):
-
A few fixes on misuse of
TensorCopy
:
- Cleanup grpc code and add brpc support: https://github.com/PaddlePaddle/Paddle/issues/10804
-
fluid high level api
- add trainer.stop and fix a bug for train_by_parallel_executor https://github.com/PaddlePaddle/Paddle/pull/10762
-
distributed trianing
- complete distributed lookup table https://github.com/PaddlePaddle/Paddle/issues/10868
- some code review
- distributed training document http://agroup.baidu.com/paddlepaddle/md/article/928534
-
Fluid C++ trainer lib and demo(support Abacus). https://github.com/PaddlePaddle/Paddle/issues/10574
- Add cpp trainer lib and demo https://github.com/PaddlePaddle/Paddle/pull/10681
- support user defined download url for mklml and boost https://github.com/PaddlePaddle/Paddle/pull/10819
-
Bug fix:
-
Looking into the inference result of simple rnn beam search
- Might caused by encoder: encoder encodes everything indifferently
-
Attention model:
- Unable to converge
- aws tool ready with CE
- parallelly creating local and cluster training process to calculate the speed-up rate
- fixed paddle in cuda9 cudnn 7 production docker env
- issue:
- PR
- Set CI for paddlepaddle/PARL repo
- Travis CI is done
- [WIP] teamcity
- Fix build error
- Fix production image generation
- Fix manylinux build failure (fluid_inference_lib)
- [WIP] Modify documents after cleaning all the old shell scripts
- Fix concurrent build in the same machine
- Fix manylinux “cpu_avx_openblas” failure
- benchmark scripts polish https://github.com/PaddlePaddle/Paddle/pull/10707
- benchmark resnet 50 and compare with TF, all upgrade to use CUDA 9
- add Pinned memory to send: https://github.com/PaddlePaddle/Paddle/pull/10821
- discussions to use brpc, and review: https://github.com/PaddlePaddle/Paddle/pull/10805
- Some fixes:
- Face Detection:
- Finish PyramidBox model.
- Enhance detection_map_op and more check in prior_box API.
- Review:
- prior_box API: https://github.com/PaddlePaddle/Paddle/pull/10773
- group de-conv: https://github.com/PaddlePaddle/Paddle/pull/10846
- Load pretrained VGG model for face detection. https://github.com/PaddlePaddle/models/pull/925
- Add reader and train logic for PyramidBox. https://github.com/PaddlePaddle/models/pull/927
- Fixed-point quantization:
- [Need to review] Design doc of fixed-point quantization. https://github.com/PaddlePaddle/Paddle/pull/10553
- Survey TensorRT int8.
- Others:
- small fix: https://github.com/PaddlePaddle/models/pull/924
- code review:
- mean IoU: https://github.com/PaddlePaddle/Paddle/pull/10519#pullrequestreview-120617897
- dice loss: https://github.com/PaddlePaddle/Paddle/pull/10717#pullrequestreview-121699219
- Enhance reduce op: https://github.com/PaddlePaddle/Paddle/pull/10708
- quard_transform_op: https://github.com/PaddlePaddle/Paddle/pull/10802#pullrequestreview-122026772
- NMT:
- Fix ElementwiseOpInferVarType in elementwise_op.
- Add beamsearch decoder using while_op in Transformer (WIP).
- Make Transformer adapt to the latest api of ParallelExecutor.
- memory optimize
- code clean up in op_without_kernel
- uint8 support in memory optimize
- change input size accelerate ci
- big data high precision benchmarkSuite
- reivews
- distributed benchmark scripts
- disable test_network_with_dtype tests
-
Add quad transform op
-
Fix evaluator of OCR-CTC mode in parallel mode
-
Debug backward of OCR attention model
- [WIP] GAN high level API: https://github.com/PaddlePaddle/Paddle/pull/10885
- [Survey] AutoDiff: https://github.com/tonyyang-svail/notes/issues/1
-
CI environment on Baidu paddle standard environment
-
Baidu Paddle development and test standard environment
- will release a stable version next Wednesday (paddle-fluid:cuda8.0_cudnn5_v2)
- entrance
-
CE optimization and CE model PR monitoring
Documents:
-
Add pip install description
-
polish and add Inference doc for fluid
-
Add new fluid api documents
-
Add api docs for RMSProp and modelAverage
Hsigmoid[WIP]:
- make forward right
- Fix some calculation error in backward.
- overlap send op and backward ops, https://github.com/PaddlePaddle/Paddle/pull/10550
- bug fix
- Checkpoint:
- [merged] https://github.com/PaddlePaddle/Paddle/pull/10532
- [New Fluid API] https://github.com/PaddlePaddle/Paddle/issues/10828
- Incremental Learning Support:
- [WIP] https://github.com/PaddlePaddle/Paddle/pull/10878
-
Inference
-
CE
-
Reviews https://github.com/PaddlePaddle/continuous_evaluation/pull/48/files#r189496284 https://github.com/PaddlePaddle/continuous_evaluation/pull/48#pullrequestreview-121695955 https://github.com/PaddlePaddle/Paddle/pull/10807#pullrequestreview-121773023 https://github.com/PaddlePaddle/continuous_evaluation/pull/49#pullrequestreview-122022168 https://github.com/Superjomn/paddle-ce-latest-kpis/pull/32#pullrequestreview-122491411
- PR
- Add timer
- Refine Pe scheduling strategy
- Fix DataTransFunc
- Clean parallel_exe
- Review
- Add
op_role
into OpDesc.
- Add
- PR
- [Merged] Enable MKLDNN inference test https://github.com/PaddlePaddle/Paddle/pull/10701
- Code Review
- [Merged] Reuse of pooling mkldnn primitives https://github.com/PaddlePaddle/Paddle/pull/10747
- [Merged] Update activations for MKL-DNN https://github.com/PaddlePaddle/Paddle/pull/10597
- Withdraw MKLDNN Mul operator https://github.com/PaddlePaddle/Paddle/pull/10703
- issue
- Refinement for SequencePadOp
https://github.com/PaddlePaddle/Paddle/pull/10533 - Code Review
https://github.com/PaddlePaddle/Paddle/pull/10797
https://github.com/PaddlePaddle/Paddle/pull/10704
https://github.com/PaddlePaddle/Paddle/pull/10814
- PR
- distributed training,2 trainer 1 p server vgg improve performance 11%
- https://github.com/PaddlePaddle/Paddle/pull/10822
- Debug inference library no speed up when using multi-thread
- https://github.com/PaddlePaddle/Paddle/pull/10896
- Fix a compile error
- https://github.com/PaddlePaddle/Paddle/pull/10775
ONNX convertor:
- Merge the Inception_V4 config
- Add compare ops & several relu ops
Face detection:
- Add groups for conv transpose ops
DeepASR:
- [WIP] clean up code & polish document
Code Review:
- [Reviewing] Add an interface to set the number of threads for math function, and set the default value to 1 for inference
- Write benchmark C++ codes for Fluid inference
- Summarize some of my recent work
- Build paddle from source following the instruction
- Make the pserver receive interrupt signal
- Simplified Fluid API: finished all the 8 chapter demos with Siddharth, Nicky and Jeff
- PRs:
- Issues and Reviews:
- Review Translation of Text Classification:
- PaddlePaddle.org and VisualDL:
- anticipated in PPO bug fixing and new PPO design
- https://github.com/PaddlePaddle/VisualDL/pull/454
- Add lod_tensor.py for ease of creating lod tensor in book examples
- Modify label_semantic_roles book example using new LoDTensor API
- Modify understand_sentiment book example using new LoDTensor API
- Modify word2vec book example using new LoDTensor API
- Review:
- New Fluid API (PRs)
- Sentiment analysis with stacked lstm: https://github.com/PaddlePaddle/Paddle/pull/10812
- Sentiment analysis with conv: https://github.com/PaddlePaddle/Paddle/pull/10847
- Sentiment analysis with dynamic rnn: https://github.com/PaddlePaddle/Paddle/pull/10847
- Label semantic roles: https://github.com/PaddlePaddle/Paddle/pull/10757
- New API reviews:
- https://github.com/PaddlePaddle/Paddle/pull/10535
- https://github.com/PaddlePaddle/Paddle/pull/10762
- https://github.com/PaddlePaddle/Paddle/pull/10741
- https://github.com/PaddlePaddle/Paddle/pull/10779
- https://github.com/PaddlePaddle/Paddle/pull/10738
- https://github.com/PaddlePaddle/Paddle/pull/10539
- https://github.com/PaddlePaddle/Paddle/pull/10895
- https://github.com/PaddlePaddle/Paddle/pull/10886
- https://github.com/PaddlePaddle/Paddle/pull/10888
Working with Varun on the PaddlePaddle.org architecture. Working with Daming, Nick and Sid on the testings.
-
PRs
- Codebase cleanup, remove old files: https://github.com/PaddlePaddle/VisualDL/pull/458
- Fix failing build: https://github.com/PaddlePaddle/VisualDL/pull/456
- Fix broken links: https://github.com/PaddlePaddle/VisualDL/pull/455
- Resolve the incorrect category issue on documentation: https://github.com/PaddlePaddle/PaddlePaddle.org/pull/480
-
Reviews
- Worked on brand development for Paddle with Beijing
- Worked with Synopsis on state of ONNX support in Paddle
- Finished documentation on Overview, Running a program (https://github.com/PaddlePaddle/Paddle/pull/10758)
- Began refactor of documentation contribution process for PaddlePaddle.org (https://github.com/PaddlePaddle/PaddlePaddle.org/pull/481)
- Added favicon for VisualDL (https://github.com/PaddlePaddle/VisualDL/pull/454)
- Investigated benchmarking with MLPerf spec for Paddle
- Simplify Machine Translation API https://github.com/PaddlePaddle/Paddle/pull/10895
- Work with Daming, Siddarth, Jeff to resolve simplify book demo issue
- Reviews: