Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

vsearch tool detailed option in command line ? #516

Closed
hafizmtalha opened this issue Apr 10, 2023 · 5 comments
Closed

vsearch tool detailed option in command line ? #516

hafizmtalha opened this issue Apr 10, 2023 · 5 comments

Comments

@hafizmtalha
Copy link

Hi, I am just wondering if I am using vsearch tool for examples 'vsearch --allpairs_global'. How can I check all the parameters of this tool and short description of each parameter in terminal ? sometimes its become so tedious to go back to the manual every time.

@frederic-mahe frederic-mahe changed the title vsearch tool detailed option in comand line ? vsearch tool detailed option in command line ? Apr 11, 2023
@frederic-mahe
Copy link
Collaborator

related to issue #340

@frederic-mahe
Copy link
Collaborator

One possibility is to run:

vsearch -h

It returns a list of commands, and for each command (or family of commands) a list of available options with a brief description:

...

Shuffling and sorting
  --shuffle FILENAME          shuffle order of sequences in FASTA file randomly
  --sortbylength FILENAME     sort sequences by length in given FASTA file
  --sortbysize FILENAME       abundance sort sequences in given FASTA file
 Parameters
  --maxsize INT               maximum abundance for sortbysize
  --minsize INT               minimum abundance for sortbysize
  --randseed INT              seed for PRNG, zero to use random data source (0)
  --sizein                    propagate abundance annotation from input
 Output
  --output FILENAME           output to specified FASTA file
  --relabel STRING            relabel sequences with this prefix string
  --relabel_keep              keep the old label after the new when relabelling
  --relabel_md5               relabel with md5 digest of normalized sequence
  --relabel_self              relabel with the sequence itself as label
  --relabel_sha1              relabel with sha1 digest of normalized sequence
  --sizeout                   include abundance information when relabelling
  --topn INT                  output just first n sequences
  --xsize                     strip abundance information in output

...

@hafizmtalha
Copy link
Author

One possibility is to run:

vsearch -h

It returns a list of commands, and for each command (or family of commands) a list of available options with a brief description:

...

Shuffling and sorting
  --shuffle FILENAME          shuffle order of sequences in FASTA file randomly
  --sortbylength FILENAME     sort sequences by length in given FASTA file
  --sortbysize FILENAME       abundance sort sequences in given FASTA file
 Parameters
  --maxsize INT               maximum abundance for sortbysize
  --minsize INT               minimum abundance for sortbysize
  --randseed INT              seed for PRNG, zero to use random data source (0)
  --sizein                    propagate abundance annotation from input
 Output
  --output FILENAME           output to specified FASTA file
  --relabel STRING            relabel sequences with this prefix string
  --relabel_keep              keep the old label after the new when relabelling
  --relabel_md5               relabel with md5 digest of normalized sequence
  --relabel_self              relabel with the sequence itself as label
  --relabel_sha1              relabel with sha1 digest of normalized sequence
  --sizeout                   include abundance information when relabelling
  --topn INT                  output just first n sequences
  --xsize                     strip abundance information in output

...

this help doesn't give you the options for subtools. like if I am using "--makeudb_usearch" it won't give me the sub options which are specific to --makeudb_usearch only

@frederic-mahe
Copy link
Collaborator

vsearch -h

produces a long text, with the options specific to the --makeudb_usearch command (and sibling commands) at the end:

...
UDB files
  --makeudb_usearch FILENAME  make UDB file from given FASTA file
  --udb2fasta FILENAME        output FASTA file from given UDB file
  --udbinfo FILENAME          show information about UDB file
  --udbstats FILENAME         report statistics about indexed words in UDB file
 Parameters
  --dbmask none|dust|soft     mask db with dust, soft or no method (dust)
  --hardmask                  mask by replacing with N instead of lower case
  --wordlength INT            length of words for database index 3-15 (8)
 Output
  --output FILENAME           UDB or FASTA output file

In addition to these options, some general options are also applicable:

  • bzip2 and gzip decompression,
  • log file,
  • minseqlength,
  • no progress,
  • quiet,
  • threads

@frederic-mahe
Copy link
Collaborator

also related to issue #417

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

2 participants