Replace `num_beams` with `generations_per_sample` #971

eitanturok · 2024-02-12T23:27:24Z

In composer's build_icl_dataloader function (here), we specify the number of generations per sample with the variable generations_per_sample.

But when we call build_icl_dataloader in llm-foundry here, we specified the number of generations per sample with the variable num_beams. This was confusing because

we use different variable names
the variable num_beams is also used as a parameter for beam search in inference.

To resolve this, I replaced all instances of num_beams with generations_per_sample in llm-foundry. If generations_per_sample is not specified in the config and num_beams is specified, we will use the value set by num_beams and raise a warning that num_beams is depreciated.

Cheers!

llmfoundry/utils/builders.py

eitanturok · 2024-02-14T02:23:18Z

Overall, these changes are so that we use the same variable names in both llmfoundry and composer.

Right now, the PR Make CodeEval respect device_eval_batch_size uses the variable generations_per_sample to specify the number of generations per sample and so we change num_beams, which is how we specify the number of generations per sample in llmfoundry, to generations_per_sample.

josejg · 2024-02-14T23:27:26Z

I believe this PR is redundant with the accompanying foundry PR of the composer you linked: #956

I didn't raise a DeprecationWarning because I do think specifying num_beams was a bug and misnomer, and thus should not be supported any longer as it's a footgun.

dakinggg · 2024-02-15T19:47:00Z

I'm going to close this PR, it is being covered in #956 . Let me know if you disagree.

eitanturok · 2024-02-16T15:09:01Z

@josejg -- that makes sense, specifying num_beams is a bug and should not have a DeprecationWarning.

@dakinggg -- no worries, these changes can be implemented in that other PR.

replace num_beams with generations_per_sample

090355c

eitanturok requested a review from dakinggg February 12, 2024 23:27

eitanturok self-assigned this Feb 12, 2024

eitanturok requested a review from maxisawesome February 12, 2024 23:27

maxisawesome reviewed Feb 12, 2024

View reviewed changes

llmfoundry/utils/builders.py Outdated Show resolved Hide resolved

maxisawesome reviewed Feb 12, 2024

View reviewed changes

llmfoundry/utils/builders.py Outdated Show resolved Hide resolved

eitanturok added 4 commits February 12, 2024 19:11

replaced icl with icl_cfg; made better error message

3aa6190

remove x = ....

66786ca

change num_beams to generations_per_sample

06e7745

add missing comma

b56b2f5

eitanturok requested a review from bmosaicml February 13, 2024 03:41

eitanturok and others added 2 commits February 13, 2024 20:39

Merge branch 'mosaicml:main' into main

e4b732e

update warning message

fd4afbf

dakinggg closed this Feb 15, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Replace `num_beams` with `generations_per_sample` #971

Replace `num_beams` with `generations_per_sample` #971

eitanturok commented Feb 12, 2024

eitanturok commented Feb 14, 2024

josejg commented Feb 14, 2024

dakinggg commented Feb 15, 2024

eitanturok commented Feb 16, 2024

Replace num_beams with generations_per_sample #971

Replace num_beams with generations_per_sample #971

Conversation

eitanturok commented Feb 12, 2024

eitanturok commented Feb 14, 2024

josejg commented Feb 14, 2024

dakinggg commented Feb 15, 2024

eitanturok commented Feb 16, 2024

Replace `num_beams` with `generations_per_sample` #971

Replace `num_beams` with `generations_per_sample` #971