Database Setup

Loci table (tab-delimited); columns need:
- Taxon_ID
- Taxon_Name
- Subtype*
- Locus_Start
- Locus_End
- Operon_Start*
- Operon_End*
- Array_Start*
- Array_End*
- Array_status**
- Operon_status***
- Genbank_file
- Array_File*
- Fasta_File*
- Author
- File_Creation_Date*
"*" blank values allowed

"**" Possible values: "present", "absent"

"***" Possible values: "intact", "absent", "broken", "shuffled"

"broken" = some genes missing "shuffled" = gene order
Array table files (tab-delimited; copy and paste from CRISPRFinder); columns needed:

Genbank files for each organism of interest
Fasta files of each genome (only needed genome sequence information is not included in the genbank files).

The directory name for this example: './CLdb/' The example loci table: 'loci.txt'

$ mkdir CLdb
$ cd CLdb
$ mkdir genbank

$ mkdir array

Provide feedback