Skip to content

Frequently Asked Questions

Hannes Hauswedell edited this page Sep 2, 2015 · 3 revisions

What kind of data can I use Lambda with?

Lambda is optimized for searches in protein space, so whenever your reads / query sequences represent proteins (transcriptome, exome, rna data ...) and/or your database is a protein database (uniprot, ncbi nr ...) Lambda will perform very well.

Does that mean I cannot use it to replace BlastN oder Megablast?

Lambda has a BlastN mode that shares many features of the protein searches (BlastP, BlastX, TBlastN, TBlastX), but that does not benefit from some optimizations that are specific for these modes. Our tests show that Lambda is still a big improvement over BlastN while being in a comparable sensitivity range below e-values of 0.1. Since less time has been spent on tuning the BlastN default parameters we recommend you try it on one of your data sets and compare it to BlastN before you use it with the same confidence. And we are interested in hearing your feedback on this mode and whether we should invest more resources into it!

What kind of speed-ups can I expect over NCBI Blast?

For protein modes we have measured speed-ups of 150-300x over NCBI Blast. Different options are available to increase speed further (at the cost of system memory or sensitivity of the results); the recommended fast mode has speed-ups over 2000x. The speed-up also depends on the data and is higher if the dataset is bigger.

What kind of sensitivity can I expect compared NCBI Blast?

For short reads sensitivity was measured to be over 96%. For reads of length over 900 the sensitivity dropped to a little over 80% in the default mode -- but this is still better than with other Blast-competitors. There are different parameters to increase sensitivity at the cost of speed.

How much memory does Lambda require?

In its default mode Lambda requires approximately the following amount of RAM

size(queryFile) + 2 * size(dbFile)

The indexer has additional memory requirements, please see lambda_indexer --help. Future releases will contain algorithms with much lower requirements and higher speed.