[Fix] fix initialization of ref_llm for full param dpo training with zero-3 #1705
