-
Notifications
You must be signed in to change notification settings - Fork 1
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Merge pull request #9 from ShawHahnLab/release-0.1.1
Version 0.1.1
- Loading branch information
Showing
37 changed files
with
674 additions
and
66 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1,5 +1,30 @@ | ||
## Changelog | ||
# Changelog | ||
|
||
# 0.1.0 - 2021-11-19 | ||
## 0.1.1 - 2021-12-07 | ||
|
||
### Changed | ||
|
||
* Show a warning for zero file matches in list and show commands ([#8]) | ||
|
||
### Fixed | ||
|
||
* IgBLAST output is now shown even if it crashes ([#12]) | ||
* Header-only CSV/TSV files no longer crash the show command ([#8]) | ||
* summarize command now works with multiple references ([#7]) | ||
* vdj-match and summarize commands now work as intended via igblast, and all | ||
have basic automated tests ([#5]) | ||
* igblastn arguments can now be given with a two-dash prefix to ensure they | ||
aren't interpreted as igseq arguments ([#4]) | ||
* Duplicate FASTA paths found in vdj-gather will no longer result in | ||
duplicated output sequences ([#2]) | ||
|
||
[#12]: https://github.com/ShawHahnLab/igseq/pull/12 | ||
[#8]: https://github.com/ShawHahnLab/igseq/pull/8 | ||
[#7]: https://github.com/ShawHahnLab/igseq/pull/7 | ||
[#5]: https://github.com/ShawHahnLab/igseq/pull/5 | ||
[#4]: https://github.com/ShawHahnLab/igseq/pull/4 | ||
[#2]: https://github.com/ShawHahnLab/igseq/pull/2 | ||
|
||
## 0.1.0 - 2021-11-19 | ||
|
||
First beta release |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,32 @@ | ||
#!/usr/bin/env bash | ||
|
||
[ -v EXAMPLES ] || EXAMPLES=$(python -c 'import igseq.util; print(igseq.util.DATA)')/examples | ||
|
||
# An arbitrary antibody sequence pulled from one of our datasets that looks | ||
# complete and in-frame | ||
QUERY=$EXAMPLES/inputs/igblast/query.fasta | ||
|
||
# using the built-in Rhesus germline reference from IMGT and using the default | ||
# text output | ||
igseq igblast -r rhesus/imgt -Q $QUERY | ||
|
||
# Trying with AIRR output, saved to a file | ||
igseq igblast -r rhesus/imgt -Q $QUERY -outfmt 19 -out igblast.tsv | ||
|
||
# With the AIRR output, attributes are stored in columns with one row for each | ||
# query. For example, the V call and % identity: | ||
cut -f 10,62 igblast.tsv | ||
|
||
# or we can include all rhesus references, but igseq will append suffixes to | ||
# the names for any overlapping loci/segments from the reference FASTA files. | ||
# (IgBLAST can't handle repeated sequence IDs for the V/D/J databases.) | ||
# Here we have two equivalent possible V genes, one from IMGT and one from | ||
# Bernat et al. 2021 (https://doi.org/10.1016/j.immuni.2020.12.018) | ||
igseq igblast -r rhesus -Q $QUERY -outfmt 19 | cut -f 10,62 | ||
|
||
# The AIRR format is one row per query sequence, but there's also IgBLAST's own | ||
# tabular output format that can give more information on separate lines, like | ||
# additional gene assignments. | ||
# The -num_alignments_V argument clashes with iseq's -n, so we need to use -- | ||
# to clarify. igseq will remove the extra - when calling igblastn. | ||
igseq igblast -r rhesus -Q $QUERY -outfmt 7 --num_alignments_V 5 |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,8 @@ | ||
>query | ||
CAGCTGCAGCTGCAGGAGTCGGGCCCAGGACTGGTGAAGCCTTCGGAGACCCTGTCCCTC | ||
ACCTGCGCTGTCTCTGGTGGCTCCATCAGCAGTAACTACTGGAGCTGGATCCGCCAGCCC | ||
CCAGGGAAGGGACTGGAGTGGATTGGACGTATCTCTGGTAGTGGTGGGAGCCCCGACTAC | ||
AACCCCTCCCTCAAGAGTCGAGTCACCATTTCAACAGACACGTCCAAGAGCCAGTTCTCC | ||
CTGAAGCTGAGCTCTGTGACCGCCGCGGACACGGCCGTGTATTACTGTGCGAGAGATACT | ||
TACAGTAACATCCCACCCAACTTTGACTACTGGGGCCAGGGAGTCCTGGTCACCGTCTCC | ||
TCAG |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,2 @@ | ||
sequence_id sequence locus stop_codon vj_in_frame v_frameshift productive rev_comp complete_vdj v_call d_call j_call sequence_alignment germline_alignment sequence_alignment_aa germline_alignment_aa v_alignment_start v_alignment_end d_alignment_start d_alignment_end j_alignment_start j_alignment_end v_sequence_alignment v_sequence_alignment_aa v_germline_alignment v_germline_alignment_aa d_sequence_alignment d_sequence_alignment_aa d_germline_alignment d_germline_alignment_aa j_sequence_alignment j_sequence_alignment_aa j_germline_alignment j_germline_alignment_aa fwr1 fwr1_aa cdr1 cdr1_aa fwr2 fwr2_aa cdr2 cdr2_aa fwr3 fwr3_aa fwr4 fwr4_aa cdr3 cdr3_aa junction junction_length junction_aa junction_aa_length v_score d_score j_score v_cigar d_cigar j_cigar v_support d_support j_support v_identity d_identity j_identity v_sequence_start v_sequence_end v_germline_start v_germline_end d_sequence_start d_sequence_end d_germline_start d_germline_end j_sequence_start j_sequence_end j_germline_start j_germline_end fwr1_start fwr1_end cdr1_start cdr1_end fwr2_start fwr2_end cdr2_start cdr2_end fwr3_start fwr3_end fwr4_start fwr4_end cdr3_start cdr3_end np1 np1_length np2 np2_length | ||
query CAGCTGCAGCTGCAGGAGTCGGGCCCAGGACTGGTGAAGCCTTCGGAGACCCTGTCCCTCACCTGCGCTGTCTCTGGTGGCTCCATCAGCAGTAACTACTGGAGCTGGATCCGCCAGCCCCCAGGGAAGGGACTGGAGTGGATTGGACGTATCTCTGGTAGTGGTGGGAGCCCCGACTACAACCCCTCCCTCAAGAGTCGAGTCACCATTTCAACAGACACGTCCAAGAGCCAGTTCTCCCTGAAGCTGAGCTCTGTGACCGCCGCGGACACGGCCGTGTATTACTGTGCGAGAGATACTTACAGTAACATCCCACCCAACTTTGACTACTGGGGCCAGGGAGTCCTGGTCACCGTCTCCTCAG IGH F T F T F T IGHV4-173*01 IGHD4-23*01,IGHD4-41*01 IGHJ4*01 CAGCTGCAGCTGCAGGAGTCGGGCCCAGGACTGGTGAAGCCTTCGGAGACCCTGTCCCTCACCTGCGCTGTCTCTGGTGGCTCCATCAGCAGTAACTACTGGAGCTGGATCCGCCAGCCCCCAGGGAAGGGACTGGAGTGGATTGGACGTATCTCTGGTAGTGGTGGGAGCCCCGACTACAACCCCTCCCTCAAGAGTCGAGTCACCATTTCAACAGACACGTCCAAGAGCCAGTTCTCCCTGAAGCTGAGCTCTGTGACCGCCGCGGACACGGCCGTGTATTACTGTGCGAGAGATACTTACAGTAACATCCCACCCAACTTTGACTACTGGGGCCAGGGAGTCCTGGTCACCGTCTCCTCAG CAGCTGCAGCTGCAGGAGTCGGGCCCAGGACTGGTGAAGCCTTCGGAGACCCTGTCCCTCACCTGCGCTGTCTCTGGTGGCTCCATCAGCAGTAACTACTGGAGCTGGATCCGCCAGCCCCCAGGGAAGGGACTGGAGTGGATTGGACGTATCTCTGGTAGTGGTGGGAGCACCGACTACAACCCCTCCCTCAAGAGTCGAGTCACCATTTCAACAGACACGTCCAAGAACCAGTTCTCCCTGAAGCTGAGCTCTGTGACCGCCGCGGACACGGCCGTGTATTACTGTGCGAGAGANNNNTACAGTAACNNNNNNNNNNACTTTGACTACTGGGGCCAGGGAGTCCTGGTCACCGTCTCCTCAG QLQLQESGPGLVKPSETLSLTCAVSGGSISSNYWSWIRQPPGKGLEWIGRISGSGGSPDYNPSLKSRVTISTDTSKSQFSLKLSSVTAADTAVYYCARDTYSNIPPNFDYWGQGVLVTVSS QLQLQESGPGLVKPSETLSLTCAVSGGSISSNYWSWIRQPPGKGLEWIGRISGSGGSTDYNPSLKSRVTISTDTSKNQFSLKLSSVTAADTAVYYCARXXYSNXXXXFDYWGQGVLVTVSS 1 296 301 309 320 364 CAGCTGCAGCTGCAGGAGTCGGGCCCAGGACTGGTGAAGCCTTCGGAGACCCTGTCCCTCACCTGCGCTGTCTCTGGTGGCTCCATCAGCAGTAACTACTGGAGCTGGATCCGCCAGCCCCCAGGGAAGGGACTGGAGTGGATTGGACGTATCTCTGGTAGTGGTGGGAGCCCCGACTACAACCCCTCCCTCAAGAGTCGAGTCACCATTTCAACAGACACGTCCAAGAGCCAGTTCTCCCTGAAGCTGAGCTCTGTGACCGCCGCGGACACGGCCGTGTATTACTGTGCGAGAGA QLQLQESGPGLVKPSETLSLTCAVSGGSISSNYWSWIRQPPGKGLEWIGRISGSGGSPDYNPSLKSRVTISTDTSKSQFSLKLSSVTAADTAVYYCAR CAGCTGCAGCTGCAGGAGTCGGGCCCAGGACTGGTGAAGCCTTCGGAGACCCTGTCCCTCACCTGCGCTGTCTCTGGTGGCTCCATCAGCAGTAACTACTGGAGCTGGATCCGCCAGCCCCCAGGGAAGGGACTGGAGTGGATTGGACGTATCTCTGGTAGTGGTGGGAGCACCGACTACAACCCCTCCCTCAAGAGTCGAGTCACCATTTCAACAGACACGTCCAAGAACCAGTTCTCCCTGAAGCTGAGCTCTGTGACCGCCGCGGACACGGCCGTGTATTACTGTGCGAGAGA QLQLQESGPGLVKPSETLSLTCAVSGGSISSNYWSWIRQPPGKGLEWIGRISGSGGSTDYNPSLKSRVTISTDTSKNQFSLKLSSVTAADTAVYYCAR TACAGTAAC YSN TACAGTAAC YSN ACTTTGACTACTGGGGCCAGGGAGTCCTGGTCACCGTCTCCTCAG FDYWGQGVLVTVSS ACTTTGACTACTGGGGCCAGGGAGTCCTGGTCACCGTCTCCTCAG FDYWGQGVLVTVSS CAGCTGCAGCTGCAGGAGTCGGGCCCAGGACTGGTGAAGCCTTCGGAGACCCTGTCCCTCACCTGCGCTGTCTCT QLQLQESGPGLVKPSETLSLTCAVS GGTGGCTCCATCAGCAGTAACTAC GGSISSNY TGGAGCTGGATCCGCCAGCCCCCAGGGAAGGGACTGGAGTGGATTGGACGT WSWIRQPPGKGLEWIGR ATCTCTGGTAGTGGTGGGAGCCCC ISGSGGSP GACTACAACCCCTCCCTCAAGAGTCGAGTCACCATTTCAACAGACACGTCCAAGAGCCAGTTCTCCCTGAAGCTGAGCTCTGTGACCGCCGCGGACACGGCCGTGTATTACTGT DYNPSLKSRVTISTDTSKSQFSLKLSSVTAADTAVYYC TGGGGCCAGGGAGTCCTGGTCACCGTCTCCTCA WGQGVLVTVSS GCGAGAGATACTTACAGTAACATCCCACCCAACTTTGACTAC ARDTYSNIPPNFDY TGTGCGAGAGATACTTACAGTAACATCCCACCCAACTTTGACTACTGG 48 CARDTYSNIPPNFDYW 16 456.805 17.992 87.208 296M68S 300S4N9M55S3N 319S3N45M 1.262e-130 7.987e-01 1.596e-21 99.324 100.000 100.000 1 296 1 296 301 309 5 13 320 364 4 48 1 75 76 99 100 150 151 174 175 288 331 363 289 330 TACT 4 ATCCCACCCA 10 |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.