Parsing the barcode sequences in CRISPR-Case9 system using Next-generation sequencing
- Python => 3.6
- pandas
- numpy
-R1
R1 fastq sequence file (gizped or uncompressed)-R2
R2 fastq sequence file (gizped or uncompressed)-vector5
vector sequence adjacent to 5' barcode (default='TATAAGCGAAAGAAGCATCAGATGGGCAAACAAAGCACCAGTGGTCTAGTGGTAGAATAGTACCCTGCCACGGTACAGACCCGGGTTCGATTCCCGGCTGGTGCA')-vector3
vector sequence adjacent to 3' barcode (default='TAAAATAAGGCTAGTCCGTTATCAACTTGAAAAAGTGGCACCGAGTCGGTGCTTTTTTGTTTTAGAGCTAGAAATAGCAAGTTAAAATAAGGCTAGTCCGTTTTTAGCGCGTGCATGCCTGCAGGTCCACAAATTCGGGTC')-o
out put file
python barcode_parser.py -h
usage: barcode_parser.py [-h] [-R1 R1] [-R2 R2] [-verctor5 VERCTOR5]
[-verctor3 VERCTOR3] [-o O]
optional arguments:
-h, --help show this help message and exit
-R1 R1 R1 sequence file
-R2 R2 R2 sequence file
-vector5 VERCTOR5 verctor sequence adjacent to 5' barcode
-verctor3 VERCTOR3 verctor sequence adjacent to 3' barcode
-o O out put file
barcodeID
unique barcode idbarcodesequence
barcode sequencebarcodeCount
the number of barcode sequence been sequencedsgRNAsequence
sgRNA sequencesgRNACount
the number of sgRNA sequence been sequenced
#example
python barcode_parser.py -R1 testData/test_R1.fastq -R2 testData/test_R2.fastq -vector5 GCGAAAGAAGCATCAGATGGGCAAACAAAGCACCAGTGGTCTAGTGGTAGAATAGTACCCTGCCACGGTACAGACCCGGGTTCGATTCCCGGCTGGTGCA -vector3 GTTTTAGAGCTAGAAATAGCAAGTTAAAATAAGGCTAGTCCGTTATCAACTTGAAAAAGTGGCACCGAGTCGGTGCTTTTTTGTTTTAGAGCTAGAAATAGCAAGTTAAAATAAGGCTAGTCCGTTTTTAGCGCGTGCATGCCTGCAGGTCCACAAATTCGGGTC -o testData/sgRNA_count.txt