Question on Data Splitting #43

Open
chao1224 opened this issue Nov 1, 2021 · 2 comments


chao1224 commented Nov 1, 2021

Hi there, I have some questions about the data splitting on LBA.

I'm not sure whether these three index .txt files are already generated by the earlier steps (https://github.com/drorlab/atom3d/blob/master/examples/lba/dataset/prepare_lmdb.py#L156-L161).


Follow-up:
I checked the code base again and found the identity split function, but I haven't found a script that runs it. So the remaining question is how to set the blast_db variable.

@awfderry
Collaborator

The blast_db argument is used to specify a pre-computed BLAST database for your sequences. If you don't have one precomputed, the default behavior of identity_split should compute it for you and save it in the current directory as "blast_db". Hope this helps!
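
To make the described default concrete, here is a minimal sketch of the "use the pre-computed database if given, otherwise build one named blast_db in the current directory" pattern. This is not atom3d's actual code: `resolve_blast_db` and `write_fasta` are hypothetical names invented for illustration, and `write_fasta` only writes a FASTA file as a stand-in for a real database build step.

```python
from pathlib import Path
from typing import Callable, Optional, Sequence


def write_fasta(sequences: Sequence[str], target: Path) -> None:
    """Hypothetical stand-in for a real BLAST database build step."""
    with open(target, "w") as handle:
        for i, seq in enumerate(sequences):
            handle.write(f">seq{i}\n{seq}\n")


def resolve_blast_db(
    sequences: Sequence[str],
    blast_db: Optional[str] = None,
    build_db: Callable[[Sequence[str], Path], None] = write_fasta,
) -> Path:
    """Return the database path, building one only when none is supplied.

    Illustrative only; mirrors the behavior described in the comment above,
    not the library's implementation.
    """
    if blast_db is not None:
        # A pre-computed database was passed in: use it as-is, build nothing.
        return Path(blast_db)
    # No database given: build one and save it in the current directory
    # under the default name "blast_db".
    target = Path.cwd() / "blast_db"
    build_db(sequences, target)
    return target
```

Under these assumptions, leaving blast_db unset needs no extra configuration; passing a path only matters when a database already exists and should be reused.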

@chao1224
Author

chao1224 commented Feb 6, 2022

Thank you!

Just want to double-check.
According to these lines, do we need any extra configuration for blast_db?


2 participants