Skip to content

Commit

Permalink
Merge pull request #73 from RWKV/rwkv-x-eagle-notebooks
Browse files Browse the repository at this point in the history
Rwkv x eagle notebooks
  • Loading branch information
PicoCreator committed Feb 2, 2024
2 parents a7b090d + 786889d commit 00274ed
Show file tree
Hide file tree
Showing 6 changed files with 75 additions and 106 deletions.
4 changes: 2 additions & 2 deletions notebook/finetune-example/Eagle-x-ALMA-prompt-completion.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -74,8 +74,8 @@ model:
load_model: ../model/L6-D512-neox-init.pth

# Starting and ending learning rate
lr_init: 5e-5
lr_final: 5e-5
lr_init: 1e-5
lr_final: 1e-5

# Training context length, note that the dataset can be
# larger then the context size, in which the trainer
Expand Down
4 changes: 2 additions & 2 deletions notebook/finetune-example/Eagle-x-capybara-chat.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -79,8 +79,8 @@ model:
load_model: ../model/L6-D512-neox-init.pth

# Starting and ending learning rate
lr_init: 5e-5
lr_final: 5e-5
lr_init: 1e-5
lr_final: 1e-5

# Training context length, note that the dataset can be
# larger then the context size, in which the trainer
Expand Down
4 changes: 2 additions & 2 deletions notebook/finetune-example/Eagle-x-openhermes1-instruct.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -74,8 +74,8 @@ model:
load_model: ../model/L6-D512-neox-init.pth

# Starting and ending learning rate
lr_init: 5e-5
lr_final: 5e-5
lr_init: 1e-5
lr_final: 1e-5

# Training context length, note that the dataset can be
# larger then the context size, in which the trainer
Expand Down
4 changes: 2 additions & 2 deletions notebook/finetune-example/Eagle-x-textbooks.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -79,8 +79,8 @@ model:
load_model: ../model/L6-D512-neox-init.pth

# Starting and ending learning rate
lr_init: 5e-5
lr_final: 5e-5
lr_init: 1e-5
lr_final: 1e-5

# Training context length, note that the dataset can be
# larger then the context size, in which the trainer
Expand Down
Loading

0 comments on commit 00274ed

Please sign in to comment.