some questions about own dataset #9

Open
check-777 opened this issue Jul 9, 2021 · 2 comments

Comments

@check-777

check-777 commented Jul 9, 2021

Thanks for your nice work! If I want to use another dataset from "mulan", such as emotion, how can I convert it to the format used in the paper? At the top of preprocess.py I saw "../data/reuters/train_inputs.txt -train_tgt ../data/reuters/train_labels.txt", but I don't know what train_inputs.txt and train_labels.txt look like. Can you give an example of each file? Thanks!

@untiltheday-lin

I have the same problem. Have you solved it?

@jacklanchantin
Collaborator

I don't have access to the raw data anymore, as my home directory was removed from the university servers. I believe preprocess.py expects a train_inputs.txt file where each line is one sample, and a train_labels.txt file where each line contains the sequence of labels for the sample on the same line.

Alternatively, you can write your own preprocessing script that produces .pt PyTorch object files matching the ones currently used.
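
For anyone landing here, a minimal sketch of the line-aligned layout described above might look like the following. It is not taken from the repository: the helper names `write_parallel_files` and `read_parallel_files` are hypothetical, and it assumes that tokens and labels are separated by single spaces, which should be checked against how preprocess.py actually splits each line.

```python
from pathlib import Path


def write_parallel_files(samples, inputs_path, labels_path):
    """Write line-aligned input and label files.

    `samples` is an iterable of (tokens, labels) pairs, each a list of strings.
    Line i of the inputs file holds the tokens of sample i, and line i of the
    labels file holds its labels. Space separation is an ASSUMPTION here.
    """
    inputs_path, labels_path = Path(inputs_path), Path(labels_path)
    with inputs_path.open("w", encoding="utf-8") as f_in, \
            labels_path.open("w", encoding="utf-8") as f_lab:
        for tokens, labels in samples:
            f_in.write(" ".join(tokens) + "\n")
            f_lab.write(" ".join(labels) + "\n")


def read_parallel_files(inputs_path, labels_path):
    """Read the two files back and check that they have the same number of lines."""
    input_lines = Path(inputs_path).read_text(encoding="utf-8").splitlines()
    label_lines = Path(labels_path).read_text(encoding="utf-8").splitlines()
    if len(input_lines) != len(label_lines):
        raise ValueError(
            f"line count mismatch: {len(input_lines)} inputs vs {len(label_lines)} labels"
        )
    return [(i.split(), l.split()) for i, l in zip(input_lines, label_lines)]
```

Calling `write_parallel_files([(["oil", "prices", "rose"], ["crude", "trade"])], "train_inputs.txt", "train_labels.txt")` would produce one line in each file: `oil prices rose` in the inputs file and `crude trade` in the labels file. Note that this layout is for text samples; a numeric-feature dataset would first need to be mapped to some token representation, which this sketch does not attempt.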
