-
Notifications
You must be signed in to change notification settings - Fork 18
[Task Submission] Natural Language Codesearch Classification (nl_codesearch_clf
)
#16
Conversation
nl_codesearch_clf
)nl_codesearch
)
nl_codesearch
)nl_codesearch_clf
)
test_task failed due to non existent id |
Hi, that sounds like something that can be fixed on our side! Could you open an issue for that, please? :-) |
nl_codesearch_clf
)nl_codesearch
)
Thanks! i was also considering submitting two different branches, but if it can be fixed on your side even better :) |
nl_codesearch
)nl_codesearch_clf
)
Hi, the problem was that the automated CI was trying to read the task_id (which can be the parent task, but, your initial title contained You are submitting two parent tasks |
nl_codesearch_clf
)nl_codesearch_clf
)
Thanks! I removed the mrr part from this clf PR, and created a seperate one for mrr |
Hello! We are getting quite close to the deadline (September 1, 11:59PM anywhere on earth), so if your PR needs any final changes, please make them now, Good luck finalising your PR and paper, feel free to tag us if you have questions. |
Hi, Sorry there were some style issues introduced in the last commit at submission and forgot the rerun the make commands. |
@drndr We're in the process of merging the tasks into the repo. Could you please include a single file usage_example.py of each task where you use each task. It seems your tasks supports both finetuning/in-context-learning. It'd be nice to have examples of both usages. (Preferably a pretrained huggingface model). Please also include requirements-usage-example.txt for the python dependencies needed to be installed for running the example. |
|
||
# TRAIN_FILE = "./codesearchnet_adv/train_adv_clf.jsonl" | ||
|
||
TRAIN_FILE = NlCodesearchClfCodesearchnetAdv( |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This looks great. Just a minor comment. Could you replace the task loading using the official method? for here and the rest of this file?
from genbench import load_task
task = load_task("nl_codesearch_clf:codesearchnet_adv")
Manually merged in |
[Natural Language Codesearch Classification]
The task consists of 8 subtasks and measures cross-lingual and domain generalization, and robustness to covariate shift
Includes a binary classification evaluation:
Given a natural language query, determine if a given code snippet is matches the natural language description or not.
Authors
[email protected]
[email protected]
[email protected]
[email protected]
[email protected]
[email protected]
Implementation
For the binary classification setup the config files were used with task_type "multiple_choice"
The task.py script includes a custom get_dataset_raw method, where the negativ samples are created.
Usage
For binary classification the default "multiple_choice" usage.
Checklist:
genbench-cli test-task
tool.