Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

smoove paste error #231

Open
ygwang1 opened this issue Oct 6, 2023 · 5 comments
Open

smoove paste error #231

ygwang1 opened this issue Oct 6, 2023 · 5 comments

Comments

@ygwang1
Copy link

ygwang1 commented Oct 6, 2023

I'd like to know if anyone has encountered such a problem when using the smoove paste command? (after population calling,about 4000 samples)I allocate the same resources, but it seems to be successful sometimes and error sometimes. I don't know what the problem is, the smoove or my server. Looking forward to your reply, thank you!
example 1 #error
[smoove] 2023/10/06 01:07:29 starting with version 0.2.8
[smoove] 2023/10/06 01:07:29 squaring 3647 files to DUP.smoove.square.vcf.gz
[smoove] 2023/10/06 01:13:31 all files had 23207 variants
2023/10/06 04:22:15 signal: killed
example 2 #worked successfully
[smoove] 2023/10/06 04:22:16 starting with version 0.2.8
[smoove] 2023/10/06 04:22:16 squaring 3154 files to DUP.smoove.square.vcf.gz
[smoove] 2023/10/06 04:27:22 all files had 23207 variants
[smoove] 2023/10/06 07:00:49 wrote squared file to DUP.smoove.square.vcf.gz
example 3 #another error #actually the index file has no error and No such errors were reported in other previous runs.
[smoove] 2023/10/06 01:07:09 starting with version 0.2.8
[smoove] 2023/10/06 01:07:09 squaring 3647 files to INV.smoove.square.vcf.gz
[smoove] 2023/10/06 01:10:39 Failed to open /INVs/XH4042/XH4042-smoove.genotyped.vcf.gz: could not load index
2023/10/06 01:10:42 exit status 255
[smoove] 2023/10/06 01:10:42 starting with version 0.2.8
[smoove] 2023/10/06 01:10:42 squaring 3154 files to INV.smoove.square.vcf.gz
[smoove] 2023/10/06 01:14:21 Failed to open /INVs/CN5780/CN5780-smoove.genotyped.vcf.gz: could not load index
2023/10/06 01:14:24 exit status 255

@brentp
Copy link
Owner

brentp commented Oct 6, 2023

Hi, smoove square is simply calling bcftools merge. That command suffers with large sample sizes such as yours. You might need to write your own (simply concat the columns).

@ygwang1
Copy link
Author

ygwang1 commented Oct 6, 2023

By the way, I got another problem when genotyping. The command I run is:
smoove genotype -p 1 --name C001 --outdir /DELs/C001 --fasta /genome_hg38/hg38.fa --duphold --vcf samples_merged_DEL.vcf C001.sort.mkdup.realign.cram
And for several sample, I got an incompelete log file(which I also don't know the problem is caused by the smoove or my server.) but the genotype results seems normal. So I don't know if I should pay attention to this issue.
The compelete log file:
[smoove] 2023/09/22 09:58:15 starting with version 0.2.8
[smoove] 2023/09/22 09:58:15 writing sorted, indexed file to XH1000-smoove.genotyped.vcf.gz
[smoove] 2023/09/22 09:58:15 > gsort version 0.1.4
[smoove] 2023/09/22 11:01:16 [smoove] 2023/09/22 11:01:16 starting with version 0.2.8
[smoove] 2023/09/22 11:01:16 [smoove] 2023/09/22 11:01:16 running duphold on 1 files in 56 processes
[smoove] 2023/09/22 11:06:43 [smoove] 2023/09/22 11:06:43 [duphold] finished
[smoove] 2023/09/22 11:06:47 [smoove] 2023/09/22 11:06:47 finished duphold
The incompelete log file such as :
[smoove] 2023/09/22 09:15:29 [smoove] 2023/09/22 09:15:29 starting with version 0.2.8
[smoove] 2023/09/22 09:15:29 [smoove] 2023/09/22 09:15:29 running duphold on 1 files in 40 processes
[smoove] 2023/09/22 09:24:31 [smoove] 2023/09/22 09:24:31 [duphold] finished
[smoove] 2023/09/22 09:24:37 [smoove] 2023/09/22 09:24:37 finished duphold
[smoove] 2023/09/22 09:24:42 wrote sorted, indexed file to XH1000-smoove.genotyped.vcf.gz

@ygwang1
Copy link
Author

ygwang1 commented Oct 6, 2023

Hi, smoove square is simply calling bcftools merge. That command suffers with large sample sizes such as yours. You might need to write your own (simply concat the columns).

Got it, thanks very much for your reply! I will try it using bcftools concat directly.

@brentp
Copy link
Owner

brentp commented Oct 6, 2023

I'm not sure why the logs wouldn't complete. But you should get a warning if the file is incomplete. And if it has an index then it must be fine.

@ygwang1
Copy link
Author

ygwang1 commented Oct 6, 2023

Hi,
My genotype vcf files and index files seem fine. Thanks again for your reply!!!
Best regards,
Yige Wang

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants