Neurips client #1693
Conversation
```python
cache_config: CacheConfig,
base_url: str = "http://localhost",
port: int = 8080,
timeout: int = 10,
```
Is this timeout a constraint for the challenge?
@drisspg can we increase the default timeout? 10 seconds is too small for the toy submission using a 7B LLaMA 2 on a 4090 for the GSM scenario. I believe that is within the specs of the competition you're running? :)
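To make the suggestion concrete, here is a minimal sketch of a more generous, configurable timeout; the class shape and the `/process` endpoint are illustrative assumptions, not the PR's exact code:

```python
# Sketch only: a client whose request timeout is a constructor argument
# with a default large enough for slow local generation.
import requests


class HTTPClient:
    def __init__(self, base_url: str = "http://localhost", port: int = 8080,
                 timeout: int = 300):
        # 300s leaves headroom for a 7B model generating long outputs on a 4090.
        self.endpoint = f"{base_url}:{port}"
        self.timeout = timeout

    def generate(self, payload: dict) -> dict:
        response = requests.post(f"{self.endpoint}/process", json=payload,
                                 timeout=self.timeout)
        response.raise_for_status()
        return response.json()
```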
Hey @yifanmai, do you think it would be worthwhile to merge something like this into the main repo? I can rename it to an HTTP client and rebase. cc @msaroufim
I think this PR can be merged soon.
After merging this, I'd want to move to something like what #1761 is doing (see the PR description for an example), where a user would specify model_deployments.yaml and then run helm-run --model-deployments-paths model_deployments.yaml ... This would ensure that each submitter's model is named something unique when you combine the benchmark_output files from all submitters, rather than having everything be named neurips/local.

I can send you a follow-up PR early next week that does this. I think the things that need to happen are:

- Support instantiating clients from model_deployments.yaml that don't need an API key (like this one).
- Support configurable window services / tokenizers.
Example model_deployments.yaml:

```yaml
model_deployments:
  - name: efficiencychallenge/model-1
    model_name: efficiencychallenge/model-1
    tokenizer_name: "huggingface/gpt2"
    max_sequence_length: 2048
    client_spec:
      class_name: "helm.proxy.clients.http_client.HTTPClient"
      args:
        url: "http://localhost:8000/"
        do_cache: false
```
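Not part of this PR, but to make the client_spec idea concrete: a minimal sketch of instantiating such an entry via dynamic import. create_client is a hypothetical helper, not HELM's actual loader, and the usage example assumes HELM is installed and that the client accepts those arguments:

```python
# Hypothetical loader for a client_spec entry; HELM's real mechanism may differ.
import importlib


def create_client(class_name: str, args: dict):
    # Split "package.module.ClassName" into the module path and the class name.
    module_name, cls_name = class_name.rsplit(".", 1)
    cls = getattr(importlib.import_module(module_name), cls_name)
    return cls(**args)


client = create_client(
    "helm.proxy.clients.http_client.HTTPClient",
    {"url": "http://localhost:8000/", "do_cache": False},
)
```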
```python
from .tokenizer_service import TokenizerService


class NeuripsWindowService(LocalWindowService):
```
If we're keeping this, then rename the class (see the comments on HTTPClient later).
Thinking about this a bit more: if all of the models listed in the rules have a Hugging Face tokenizer, then we can just use HuggingFaceWindowService for all of them, and not need a new window service.
In fact, we can even delete the tokenize() method on the client later, because we just do tokenization using the local Hugging Face tokenizer.
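As an illustration of that local-tokenization path (gpt2 is just an example checkpoint; any Hugging Face tokenizer works the same way):

```python
# Tokenize locally with a Hugging Face tokenizer instead of calling a
# server-side tokenize endpoint.
from transformers import AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("gpt2")
token_ids = tokenizer.encode("Hello, world!")
tokens = tokenizer.convert_ids_to_tokens(token_ids)
print(tokens, token_ids)
```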
Do you think, though, that since this could be more generic than just the competition, it could still be useful to have an HTTPModelWindowService?
Maybe... this is basically a generic LocalWindowService subclass and doesn't do anything HTTP-specific. I think we can keep this in this PR, and when we get window service configuration in, this class will probably disappear in that refactor.
```python
@property
def end_of_text_token(self) -> str:
    return "<|endoftext|>"
```
Do the special tokens need to be user-configurable?
Yes, I would imagine so; I'm not sure what the best way of specifying that would be.
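One way this could look is to pass the special tokens in at construction time rather than hard-coding them as properties. A minimal sketch, assuming LocalWindowService from HELM's window services module and an illustrative constructor signature:

```python
# Sketch: special tokens become constructor arguments instead of hard-coded
# property return values. The import path and signature are assumptions.
from helm.benchmark.window_services.local_window_service import LocalWindowService


class ConfigurableTokenWindowService(LocalWindowService):
    def __init__(self, service, end_of_text_token: str = "<|endoftext|>",
                 prefix_token: str = "<|endoftext|>"):
        super().__init__(service)
        self._end_of_text_token = end_of_text_token
        self._prefix_token = prefix_token

    @property
    def end_of_text_token(self) -> str:
        return self._end_of_text_token

    @property
    def prefix_token(self) -> str:
        return self._prefix_token
```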
One idea is to introduce tokenizers.yaml:

```yaml
tokenizers:
  - name: "efficiencychallenge/tokenizer-1"
    client_spec:
      class_name: "helm.proxy.clients.http_client.HTTPClient"
      args:
        url: "http://localhost:8000/"
    prefix_token: "<|endoftext|>"
    end_of_text_token: "<|endoftext|>"
```
model_deployments.yaml:

```yaml
model_deployments:
  - name: efficiencychallenge/model-1
    model_name: efficiencychallenge/model-1
    tokenizer_name: "efficiencychallenge/tokenizer-1"
    max_sequence_length: 2048
    client_spec:
      class_name: "helm.proxy.clients.http_client.HTTPClient"
      args:
        url: "http://localhost:8000/"
        do_cache: false
```
cc @percyliang any thoughts on configurable tokenizers?
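For what it's worth, parsing such a file is straightforward. A minimal sketch with PyYAML, where TokenizerConfig is a hypothetical dataclass rather than an existing HELM type:

```python
# Sketch of loading tokenizers.yaml into plain config objects.
from dataclasses import dataclass
from typing import Any, Dict, List

import yaml


@dataclass
class TokenizerConfig:
    name: str
    class_name: str
    args: Dict[str, Any]
    prefix_token: str
    end_of_text_token: str


def load_tokenizer_configs(path: str) -> List[TokenizerConfig]:
    with open(path) as f:
        raw = yaml.safe_load(f)
    return [
        TokenizerConfig(
            name=entry["name"],
            class_name=entry["client_spec"]["class_name"],
            args=entry["client_spec"].get("args", {}),
            prefix_token=entry["prefix_token"],
            end_of_text_token=entry["end_of_text_token"],
        )
        for entry in raw["tokenizers"]
    ]
```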
I like configurable tokenizers.
Cool, I'll draft a PR for that.
In the meantime, I don't think we need to block this PR; we can keep these changes and clean things up later when we have configurable tokenizers.
Force-pushed from 1f51e71 to 02e5583.
Could you mark this as "Ready for review" (by clicking the button) when it's ready for me to take another look? Thanks!
@yifanmai Cool, I think I addressed everything but the YAML changes, which I do think would make this service much more generic and a lot more applicable to different situations.
Looks good, thanks!
Sorry, I just caught a few minor issues.
Force-pushed from 0596e32 to 12dcabb.
To test against a local server exposed on port 8080:
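A quick smoke test might look like the following; the /process endpoint and payload shape are assumptions for illustration, not the server's actual spec:

```python
# Smoke test against a local model server on port 8080.
import requests

response = requests.post(
    "http://localhost:8080/process",
    json={"prompt": "The capital of France is", "max_new_tokens": 5},
    timeout=30,
)
response.raise_for_status()
print(response.json())
```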