[notbug] unexpected termination on fasta.gz #27

Open
korziner opened this issue Apr 8, 2018 · 2 comments

Comments

korziner commented Apr 8, 2018

Running bioawk directly on the gzip'd file terminates unexpectedly and prints:

2 �
9590 >

The command was:

~/miniconda3/bin/bioawk '/^>/ {count[substr($0,1,1)]++}END{for(j in count) print count[j],j}' SILVA_128_SSURef_tax_silva_full_align_trunc.fasta.gz

Reference runs that decompress with zcat first and pipe into bioawk or mawk (doing the same count) are OK:

time zcat *fasta.gz|./bioawk '/^>/ {count[substr($0,1,1)]++}END{for(j in count) print count[j],j}'
1922213 >

real	47m44.564s
user	46m22.997s
sys	2m48.813s

time zcat *fasta.gz|mawk '/^>/ {count[substr($0,1,1)]++}END{for(j in count) print count[j],j}'
1922213 >

real	10m47.346s
user	11m31.835s
sys	1m44.985s

Is bioawk similar to zgrep, which can search both compressed and uncompressed files, or does -c fastx have to be given explicitly?

lh3 (Owner) commented Apr 8, 2018

By default, bioawk doesn't decompress gzip'd files. It only works with gzip'd files when you use "-c".
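For the zgrep-like behaviour asked about above, one option is to decompress outside of awk. The following is only a minimal sketch, not part of bioawk: `count_fasta_headers` is a hypothetical helper name, and it assumes GNU gzip, where `-d -c -f` together copy input that is not gzip'd through unchanged. Plain `awk` stands in for bioawk or mawk here.

```shell
# Hypothetical helper (not part of bioawk): count FASTA header lines in a
# file that may or may not be gzip'd.
# Assumption: GNU gzip, where "-dcf" decompresses gzip data and passes
# any other input through unchanged (the same way "zcat -f" behaves).
count_fasta_headers() {
    gzip -dcf < "$1" | awk '/^>/ { n++ } END { print n + 0 }'
}
```

It would be called as `count_fasta_headers some_file.fasta.gz` or `count_fasta_headers some_file.fasta`. Because the file is decompressed before awk sees it, the same one-liner gives the same count either way.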

korziner (Author) commented Apr 8, 2018

OK, notbug, just misuse

@korziner korziner changed the title Unexpected termination on fasta.gz [notbug] unexpected termination on fasta.gz Apr 8, 2018
ctSkennerton pushed a commit to ctSkennerton/bioawk that referenced this issue May 22, 2019