Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

question about dataset? #3

Open
zhhhzhang opened this issue Nov 9, 2020 · 2 comments
Open

question about dataset? #3

zhhhzhang opened this issue Nov 9, 2020 · 2 comments

Comments

@zhhhzhang
Copy link

There are some files which are missed in the data.
image
Could you please provide these files so that I can follow your step?
Thanks very much.

@suhara
Copy link
Contributor

suhara commented Dec 6, 2020

Sorry for the late response. Please use the latest version of download.sh to download the data and check if the files are still missing. I double-checked that it contained the three missing files that you mentioned. Thank you!

$ ./download.sh

...

$ ls -lha data/yelp-default
total 2.0G
drwxrwxr-x 2 suhara suhara 4.0K Oct  6 17:53 .
drwxrwxr-x 3 suhara suhara 4.0K Oct  6 17:55 ..
-rw-r--r-- 1 suhara suhara  45M May  6  2020 dev.csv
-rw------- 1 suhara suhara 609K Nov 27  2019 summaries_0-200_cleaned_fixed_business_ids.csv
-rw-r--r-- 1 suhara suhara  46M May  6  2020 test.csv
-rw-r--r-- 1 suhara suhara 765K May  6  2020 test_gold_8_15_all_all_300_8.csv
-rw-r--r-- 1 suhara suhara 832K Oct  5 23:06 test_gold.csv
-rw-r--r-- 1 suhara suhara 363M May  6  2020 train.csv
-rw-rw-r-- 1 suhara suhara 1.6G Nov 28  2019 yelp.jsonl

@KongKong390
Copy link

Dear author, I would like to inquire about the three files generated during the aggregation process. Could you please explain the purpose and content of these files? I would greatly appreciate your assistance. Thank you!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants