Add TemplateLM boilerplate LM class #1279

Merged 54 commits on Feb 22, 2024

Commits on Jan 13, 2024

  1. b4fcc09
  2. linter

    anjor committed Jan 13, 2024
    ea44741

Commits on Jan 31, 2024

  1. 4e6a870
  2. Make parallelize=True vs. accelerate launch distinction clearer in docs (EleutherAI#1261)
    
    * Make parallelize=True distinction clearer in documentation.
    
    * run linter
    haileyschoelkopf authored and anjor committed Jan 31, 2024
    d444e9a
  3. Allow parameter edits for registered tasks when listed in a benchmark (EleutherAI#1273)
    
    * benchmark yamls allow minor edits of already registered tasks
    
    * add documentation
    
    * removed print
    lintangsutawika authored and anjor committed Jan 31, 2024
    d41a351
  4. Fix data-parallel evaluation with quantized models (EleutherAI#1270)

    * add WIP device_map overrides
    
    * update handling outside of accelerate launcher
    
    * change .to(device) log to debug level
    
    * run linter
    haileyschoelkopf authored and anjor committed Jan 31, 2024
    db3ee51
  5. Rework documentation for explaining local dataset (EleutherAI#1284)

    * rework documentation for explaining local dataset
    
    * fix typo
    
    * Update new_task_guide.md
    lintangsutawika authored and anjor committed Jan 31, 2024
    1c07f70
  6. Re-add citation

    It looks like Google Scholar has [already noticed](https://scholar.google.com/scholar?hl=en&as_sdt=0%2C9&authuser=2&q=%22A+framework+for+few-shot+language+model+evaluation%2C+12+2023%22&btnG=) the updated citation block so let's add it back in.
    StellaAthena authored and anjor committed Jan 31, 2024
    b716761
  7. Update CITATION.bib (EleutherAI#1285)

    Bumping CITATION.bib to match re-adding the citation in readme. 
    
    cc @StellaAthena
    haileyschoelkopf authored and anjor committed Jan 31, 2024
    370cbbe
  8. 4702624
  9. Update README.md with custom integration doc (EleutherAI#1298)

    * Update README.md
    
    * punctuation
    
    ---------
    
    Co-authored-by: Hailey Schoelkopf
    2 people authored and anjor committed Jan 31, 2024
    0013399
  10. Update nq_open.yaml (EleutherAI#1305)

    * Update nq_open.yaml
    
    change regex
    
    * Bump NQ version
    
    ---------
    
    Co-authored-by: Hailey Schoelkopf
    2 people authored and anjor committed Jan 31, 2024
    8783281
  11. 5762058
  12. 55e51ec
  13. 5fb93fc
  14. 7724bf1
  15. Fix group register (EleutherAI#1315)

    * tuple should be considered as well
    
    * set option to keep callable as callable
    lintangsutawika authored and anjor committed Jan 31, 2024
    3688b1f
  16. Update task_guide.md (EleutherAI#1316)

    djstrong authored and anjor committed Jan 31, 2024
    b6051f9
  17. d0de14e
  18. e7daca5
  19. Fix Issue regarding stderr (EleutherAI#1327)

    * add fix for deciding if stderr is N/A or not
    
    * process N/A
    lintangsutawika authored and anjor committed Jan 31, 2024
    e8bc89d
  20. Add local-completions support using OpenAI interface (EleutherAI#1277)

    * Add `local-completions` support using OpenAI interface
    
    * Refactor oa_completion
    
    * Address tokenizer comments and change request chunks to batch size
    
    * Add warning message for tiktoken backend
    
    * fix formatting
    
    * fix whitespace
    
    * Update README.md
    
    ---------
    
    Co-authored-by: Hailey Schoelkopf
    2 people authored and anjor committed Jan 31, 2024
    fd94748
  21. ea12d33
  22. 413f183
  23. 9703c8a
  24. Update migrated HF dataset paths (EleutherAI#1332)

    * Update arc_easy.yaml
    
    * Update flan_cot.yaml
    
    * update HF dataset path
    
    * Update freeform.yaml
    
    * Update flan_cot.yaml
    
    ---------
    
    Co-authored-by: Lintang Sutawika
    2 people authored and anjor committed Jan 31, 2024
    0ffc6b6
  25. Don't use get_task_dict() in task registration / initialization (EleutherAI#1331)
    
    * don't use get_task_dict() as a helper, it will download the dataset!
    
    * pre-commit
    
    * Update README.md
    
    ---------
    
    Co-authored-by: lintangsutawika
    2 people authored and anjor committed Jan 31, 2024
    fa05528
  26. manage default (greedy) gen_kwargs in vllm (EleutherAI#1341)

    * manage default (greedy) gen_kwargs in vllm better
    
    * mirror HF `do_sample`
    
    * just need to set temp=0 for greedy
    baberabb authored and anjor committed Jan 31, 2024
    4a2c48a
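The greedy-default handling described in the commit above ("mirror HF `do_sample`", "just need to set temp=0 for greedy") can be sketched roughly as follows. This is a minimal illustration on plain dictionaries, not the project's actual code: the function name `normalize_gen_kwargs` is hypothetical, and the assumption is that greedy decoding is requested by forcing `temperature` to 0 whenever `do_sample` is explicitly false or no temperature was given.

```python
def normalize_gen_kwargs(gen_kwargs):
    """Hypothetical helper: map HF-style gen kwargs to greedy-by-default kwargs."""
    kwargs = dict(gen_kwargs)  # copy so the caller's dict is left untouched
    # `do_sample` is an HF-style flag; it is removed and expressed via temperature.
    do_sample = kwargs.pop("do_sample", None)
    if do_sample is False or "temperature" not in kwargs:
        # Assumption: temperature 0 is how greedy decoding is requested.
        kwargs["temperature"] = 0.0
    return kwargs
```

With this sketch, an empty dict and `{"do_sample": False, "temperature": 0.8}` both end up with `temperature` set to 0.0, while `{"do_sample": True, "temperature": 0.8}` keeps its temperature.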
  27. 08af37f
  28. 279c5b5
  29. Filter docs not offset by doc_id (EleutherAI#1349)

    * get `doc` from instance
    
    * accelerate bugfix: get ground doc from instance
    
    * convert filter to `process_result`
    
    * get docs from instances in `FilterEnsemble`
    
    * rename
    
    * nit
    
    * better looping
    
    * fix typehint
    baberabb authored and anjor committed Jan 31, 2024
    7cf3083
  30. Add FAQ on lm_eval.tasks.initialize_tasks() to README (EleutherAI#1330)
    
    * Update README.md
    
    * [!Tip]
    haileyschoelkopf authored and anjor committed Jan 31, 2024
    5f09e98
  31. e29ed4e
  32. Add causalLM OpenVino models (EleutherAI#1290)

    * added intel optimum
    
    * added intel optimum in readme
    
    * modified intel optimum
    
    * modified intel optimum
    
    * modified intel optimum
    
    * modified install optimum
    
    * modified path of IR file
    
    * added openvino_device
    
    * added openvino_device2
    
    * changed optimum-causal to openvino-causal
    
    * Update README.md
    
    * Update README.md
    
    * remove `lm_eval.base` import
    
    * update openvino-causal -> openvino ; pass device through super().__init__()
    
    * Update README.md
    
    * Add optimum to tests dependencies
    
    * apply pre-commit
    
    * fix so tests pass
    
    ---------
    
    Co-authored-by: Hailey Schoelkopf
    Co-authored-by: haileyschoelkopf
    3 people authored and anjor committed Jan 31, 2024
    ca8a014
  33. Apply some best practices and guideline recommendations to code (EleutherAI#1363)
    
    * raise Exception, not a string
    
    Additional info https://peps.python.org/pep-0352/#exception-hierarchy-changes
    https://docs.python.org/3.8/tutorial/errors.html#raising-exceptions
    
    * Apply PEP8 recommendation to prefer isinstance
    
    "Object type comparisons should always use isinstance() instead of comparing types directly"
    https://peps.python.org/pep-0008/
    
    * Remove dangerous default mutable values in arguments
    
    https://pylint.readthedocs.io/en/stable/user_guide/messages/warning/dangerous-default-value.html
    
    * Format logging messages with fstring (not with format)
    
    Additional info
    https://pylint.readthedocs.io/en/stable/user_guide/messages/warning/logging-format-interpolation.html
    There are also discussions about the speed of formatting while logging or some unintended code executions
    pylint-dev/pylint#2395
    https://stackoverflow.com/a/54368109
    but at least one format (fstring one) will be used throughout the project
    
    * Specify utf-8 encoding for `open` explicitly
    
    If not specified, the default encoding may differ across environments, OSes, and Python versions. See
    https://peps.python.org/pep-0597/
    https://docs.python.org/3.11/library/locale.html#locale.getencoding
    https://docs.python.org/3.10/library/os.html#utf8-mode
    https://pylint.readthedocs.io/en/stable/user_guide/messages/warning/unspecified-encoding.html
    
    Helps also if some code from English language tasks is taken as inspiration for tasks in non-English languages.
    
    * Use inline-ignoring comments to pass pre-commit instead of identity process
    
    https://flake8.pycqa.org/en/3.0.1/user/ignoring-errors.html#in-line-ignoring-errors
    https://www.flake8rules.com/rules/F841.html
    
    flake8 comments are supported by ruff: https://docs.astral.sh/ruff/linter/#error-suppression
    LSinev authored and anjor committed Jan 31, 2024
    5f77a8f
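Several of the practices listed in the commit above (raising exception instances rather than strings, preferring `isinstance`, avoiding mutable default arguments, and passing an explicit `encoding` to `open`) can be illustrated with a small sketch. All function names here are hypothetical and chosen only for illustration; none of this is taken from the repository.

```python
import os
import tempfile


def append_item(item, bucket=None):
    # A None default avoids sharing one mutable list across calls.
    if bucket is None:
        bucket = []
    bucket.append(item)
    return bucket


def describe(value):
    # isinstance() also matches subclasses, unlike `type(value) == int`.
    # bool is checked first because bool is a subclass of int.
    if isinstance(value, bool):
        return "bool"
    if isinstance(value, int):
        return "int"
    # Raise an exception instance, never a bare string.
    raise TypeError(f"unsupported type: {type(value).__name__}")


def roundtrip(text):
    # An explicit encoding keeps behaviour the same across platforms.
    with tempfile.TemporaryDirectory() as tmp:
        path = os.path.join(tmp, "sample.txt")
        with open(path, "w", encoding="utf-8") as f:
            f.write(text)
        with open(path, encoding="utf-8") as f:
            return f.read()
```

Calling `append_item` twice without a `bucket` argument returns two independent lists, which is the behaviour the mutable-default warning is meant to protect.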
  34. c986b5f
  35. delay filter init; remove *args (EleutherAI#1369)

    * delay filter init; remove `*args`
    
    * bugfix
    
    * optimize
    
    * type hint
    baberabb authored and anjor committed Jan 31, 2024
    87ea8d3
  36. Fix unintuitive --gen_kwargs behavior (EleutherAI#1329)

    * don't override do_sample if no value for it is passed
    
    * Update gen_kwargs override condition
    
    * Update huggingface.py
    
    * Update huggingface.py
    
    * run linters
    
    * silence an erroneous warning
    haileyschoelkopf authored and anjor committed Jan 31, 2024
    6f4e5df
  37. Publish to pypi (EleutherAI#1194)

    * publish to pypi
    
    * lint
    
    * Update publish.yml
    
    * minor
    anjor committed Jan 31, 2024
    5ff7c41
  38. Make dependencies compatible with PyPI (EleutherAI#1378)

    * make deps not point to github urls
    
    * formatting
    
    * try making PyPI only run on tag pushes
    haileyschoelkopf authored and anjor committed Jan 31, 2024
    68a193b
  39. Add support for RWKV models with World tokenizer (EleutherAI#1374)

    * Add support for RWKV models with World tokenizer
    
    The RWKV line of models with the World tokenizer does not allow the padding token to be configured; its value is preset to 0.
    
    However, this fails all the "if set" checks and would cause the tokenizer to crash.
    
    A tokenizer class name check was added, in addition to a model type check, because some RWKV models use the neox tokenizer.
    
    * Update huggingface.py
    
    Genericized so that this supports any RWKVWorld tokenizer, and added a fall-back for if the HF implementation name changes.
    
    * Comply with formatting guidelines
    
    * fix format
    
    ---------
    
    Co-authored-by: Stella Biderman
    Co-authored-by: Hailey Schoelkopf
    3 people authored and anjor committed Jan 31, 2024
    492191d
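The padding-token problem described in the RWKV commit above is consistent with a common Python pitfall: a value of 0 is falsy, so a truthiness-based "if set" check treats it as missing. The sketch below shows that pitfall in isolation. It is one plausible reading of the commit message, not the actual fix; `FakeTokenizer`, `pad_id_truthy`, and `pad_id_explicit` are hypothetical names and not part of any real library.

```python
class FakeTokenizer:
    """Hypothetical stand-in for a tokenizer; not a real Hugging Face class."""

    def __init__(self, pad_token_id=None, eos_token_id=None):
        self.pad_token_id = pad_token_id
        self.eos_token_id = eos_token_id


def pad_id_truthy(tok):
    # Truthiness check: a preset id of 0 is falsy, so it is wrongly skipped.
    if tok.pad_token_id:
        return tok.pad_token_id
    return tok.eos_token_id


def pad_id_explicit(tok):
    # Explicit None check: 0 is accepted as a configured value.
    if tok.pad_token_id is not None:
        return tok.pad_token_id
    return tok.eos_token_id
```

For a tokenizer whose padding id is preset to 0, the truthiness version falls through to the fallback id, while the explicit version returns 0.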
  40. add bypass metric (EleutherAI#1156)

    * add bypass metric
    
    * fixed `bypass` metric.
    
    * add task attributes if predict_only
    
    * add `predict_only` checks
    
    * add docs
    
    * added `overide_metric`, `override_config` to `Task`
    
    * nits
    
    * nit
    
    * changed --predict_only to generations; nits
    
    * nits
    
    * nits
    
    * change gen_kwargs warning
    
    * add note about `--predict_only` in README.md
    
    * added `predict_only`
    
    * move table to bottom
    
    * nit
    
    * change null aggregation to bypass (conflict)
    
    * bugfix; default `temp=0.0`
    
    * typo
    baberabb authored and anjor committed Jan 31, 2024
    477058a
  41. 8d974bf
  42. 3b07548
  43. lint

    anjor committed Jan 31, 2024
    b9436a9

Commits on Feb 21, 2024

  1. code review

    anjor committed Feb 21, 2024
    907968c
  2. bb5481a
  3. neuron optimum

    anjor committed Feb 21, 2024
    129a2ee

Commits on Feb 22, 2024

  1. a97260e
  2. 63564e7
  3. fix linter

    haileyschoelkopf committed Feb 22, 2024
    acff950
  4. fix format

    haileyschoelkopf committed Feb 22, 2024
    5c17420
  5. fix format

    haileyschoelkopf committed Feb 22, 2024
    b481947
  6. fix format

    haileyschoelkopf committed Feb 22, 2024
    63d58f7