add FewCLUE 9 datasets (PaddlePaddle#457)
tianxin authored May 28, 2021
1 parent 7955b93 commit eb94c26
Showing 1 changed file with 9 additions and 0 deletions.
9 changes: 9 additions & 0 deletions docs/data_prepare/dataset_list.md
@@ -36,6 +36,15 @@ PaddleNLP provides APIs for quickly loading the following datasets; in practice, depending on
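| [CMNLI](https://github.com/CLUEbenchmark/CLUE) | Chinese natural language inference task: determine whether sentence1 and sentence2 are related by entailment, contradiction, or neutral. Three-way sentence-pair classification. | `paddlenlp.datasets.load_dataset('clue', 'cmnli')`|
| [CLUEWSC2020](https://github.com/CLUEbenchmark/CLUE) | Chinese version of the WSC (Winograd Schema Challenge): pronoun disambiguation, binary classification. | `paddlenlp.datasets.load_dataset('clue', 'cluewsc2020')`|
| [CSL](https://github.com/P01son6415/CSL) | Paper keyword recognition: determine whether all of the given keywords are genuine keywords of the paper. Binary classification. | `paddlenlp.datasets.load_dataset('clue', 'csl')`|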
| [EPRSTMT](https://github.com/CLUEbenchmark/FewCLUE/tree/main/datasets) | E-commerce product review sentiment analysis dataset from the FewCLUE benchmark. Binary classification of Positive vs. Negative sentiment. | `paddlenlp.datasets.load_dataset('fewclue', 'eprstmt')`|
| [CSLDCP](https://github.com/CLUEbenchmark/FewCLUE/tree/main/datasets) | Chinese scientific literature discipline classification dataset from the FewCLUE benchmark: predict the category of a paper from its Chinese abstract. 67 classes. | `paddlenlp.datasets.load_dataset('fewclue', 'csldcp')`|
| [TNEWSF](https://github.com/CLUEbenchmark/FewCLUE/tree/main/datasets) | Toutiao Chinese news (short text) classification from the FewCLUE benchmark. 15 classes. | `paddlenlp.datasets.load_dataset('fewclue', 'tnews')`|
| [IFLYTEK](https://github.com/CLUEbenchmark/FewCLUE/tree/main/datasets) | Long-text classification task from the FewCLUE benchmark. 119 classes. | `paddlenlp.datasets.load_dataset('fewclue', 'iflytek')`|
| [OCNLIF](https://github.com/CLUEbenchmark/FewCLUE/tree/main/datasets) | Chinese natural language inference dataset from the FewCLUE benchmark. Three-way sentence-pair classification. | `paddlenlp.datasets.load_dataset('fewclue', 'ocnli')`|
| [BUSTM](https://github.com/CLUEbenchmark/FewCLUE/tree/main/datasets) | Short-text semantic matching dataset for dialogue from the FewCLUE benchmark. Binary classification. | `paddlenlp.datasets.load_dataset('fewclue', 'bustm')`|
| [CHIDF](https://github.com/CLUEbenchmark/FewCLUE/tree/main/datasets) | Idiom reading comprehension (cloze) task from the FewCLUE benchmark: given the text, pick the correct idiom from 7 candidates. | `paddlenlp.datasets.load_dataset('fewclue', 'chid')`|
| [CSLF](https://github.com/CLUEbenchmark/FewCLUE/tree/main/datasets) | Paper keyword recognition from the FewCLUE benchmark: determine whether all of the given keywords are genuine keywords of the paper. Binary classification. | `paddlenlp.datasets.load_dataset('fewclue', 'csl')`|
| [CLUEWSCF](https://github.com/CLUEbenchmark/FewCLUE/tree/main/datasets) | Chinese version of the WSC (Winograd Schema Challenge) from the FewCLUE benchmark: pronoun disambiguation, binary classification. | `paddlenlp.datasets.load_dataset('fewclue', 'cluewsc')`|
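
The nine FewCLUE rows all use the same call shape, `load_dataset('fewclue', <name>)`, and several row labels differ from the name actually passed (for example `TNEWSF` loads `'tnews'`). A minimal sketch of that mapping follows. The names `FEWCLUE_CONFIGS`, `fewclue_call_args`, and `load_fewclue` are hypothetical helpers introduced here for illustration and are not part of PaddleNLP; the only library call used is `paddlenlp.datasets.load_dataset` with the two positional arguments shown in the table.

```python
# Row labels from the table mapped to the config name passed as the second
# argument of load_dataset('fewclue', ...). Hypothetical helper structure.
FEWCLUE_CONFIGS = {
    "EPRSTMT": "eprstmt",
    "CSLDCP": "csldcp",
    "TNEWSF": "tnews",
    "IFLYTEK": "iflytek",
    "OCNLIF": "ocnli",
    "BUSTM": "bustm",
    "CHIDF": "chid",
    "CSLF": "csl",
    "CLUEWSCF": "cluewsc",
}


def fewclue_call_args(label):
    """Return the (path, name) pair for a table row label (hypothetical helper)."""
    return ("fewclue", FEWCLUE_CONFIGS[label.upper()])


def load_fewclue(label):
    """Load one FewCLUE dataset by its table label.

    Assumes paddlenlp is installed; the first call may download data.
    """
    # Imported lazily so the mapping above is usable without paddlenlp.
    from paddlenlp.datasets import load_dataset

    path, name = fewclue_call_args(label)
    return load_dataset(path, name)
```

Calling `load_fewclue("BUSTM")` would then be equivalent to the `load_dataset('fewclue', 'bustm')` call listed in the table, assuming the dataset can be fetched in your environment.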


