Skip to content
This repository has been archived by the owner on Jul 23, 2024. It is now read-only.

[Task Submission] Hate Speech Detection (latent_feature_splits) #37

Merged
merged 7 commits into from
Dec 31, 2023

Conversation

MaikeZuefle
Copy link
Contributor

Latent Feature-based Data Splits

This project aims to go beyond the random train-test split by developing a more challenging data-splitting process
to better evaluate generalisation performance.
We rely on a models internal representations to create a data split, creating the split by clustering the internal representations and assigning clusters to either the train or the test set.
Hate Speech is used as a testing ground for developing the splitting method.

Authors

Checklist:

  • I and my co-authors agree that, if this PR is merged, the code will be available under the same license as the genbench_cbt repository.
  • Prior to submitting, I have ran the GenBench CBT test suite using the genbench-cli test-task tool.
  • I have read the description of what should be in the doc.md of my task, and have added the required arguments.
  • I have submitted or will submit an accompanying paper to the GenBench workshop.

@kazemnejad kazemnejad changed the title Latent Feature Splits [Task Submission] Hate Speech Detection (latent_feature_split) Nov 16, 2023
@vernadankers vernadankers deleted the latent_feature_splits branch November 20, 2023 17:26
@vernadankers vernadankers restored the latent_feature_splits branch November 20, 2023 17:29
@vernadankers vernadankers reopened this Nov 20, 2023
@vernadankers vernadankers changed the title [Task Submission] Hate Speech Detection (latent_feature_split) [Task Submission] Hate Speech Detection (latent_feature_splits) Nov 20, 2023
@kazemnejad kazemnejad marked this pull request as ready for review December 31, 2023 02:00
@kazemnejad kazemnejad merged commit 657d531 into GenBench:main Dec 31, 2023
3 checks passed
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants