Skip to content
This repository has been archived by the owner on Jul 15, 2023. It is now read-only.

Ensure all dataset splits have same columns #15

Open
jonathancstroud opened this issue Sep 25, 2018 · 0 comments
Open

Ensure all dataset splits have same columns #15

jonathancstroud opened this issue Sep 25, 2018 · 0 comments
Labels
help wanted Extra attention is needed

Comments

@jonathancstroud
Copy link
Contributor

As pointed out in #14, the JSON columns are sparse, which causes many columns to be missing when preprocessing only a subset of rows. We should ensure that all splits (train, val, test, debug) have the same columns after preprocessing.

Related to #12 and #2.

@jonathancstroud jonathancstroud added the help wanted Extra attention is needed label Sep 25, 2018
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
help wanted Extra attention is needed
Projects
None yet
Development

No branches or pull requests

1 participant