The analysis pipeline and downstream analysis in jingwen/TIFseq2 are used for publication TIF-Seq2 disentangles overlapping isoforms in complex human transcriptomes. The scripts for TIF-Seq2 data pre-processing and alignment in other TIF-Seq2 related publication are in PelechanoLab/TIFseq2.
prep_sampleSheet.awk <index_input.txt> > <sample_sheet>
bcl2fastq -R <input_dir> -o <fastq_dir> --sample-sheet <sample_sheet> --no-lane-splitting --barcode-mismatches demultiplex_stats.awk <fastq_dir>/Stats/DemultiplexingStats.xml > <fastq_dir>/demultiplex_stat.txt
preprocess.sh -I <fastq_dir> -O <output_dir> -j <thread_number> -A <polyA_length>
STAR_align.sh -R <STAR_index_dir> -A <splicing_junction_gtf> -I <output_dir>/cutPolyA -O <STAR_output_dir> -p <thread_number> -j <max_intron_size> -m 0
python boundary.py <input_bam>
Rscript clean_As.R <3end_ctss_path>
Rscript cluster_end.R <TIF_5end_path> <TIF_3end_path> <3'T-fill_3end_path>
Rscript form_TIF.R