File menu

In this menu the user has two choices, create a new project or open an existing one.

New project

By clicking "New Project" in the File menu, a window will appear that has to be filled with a name of the user choice (do not use spaces). In our example, we named our project "nelongata0"

file menu
write a name project

Following this, a directory will be created to store the data. This directory is composed of five folders: Blast, Fasta, Filtered, Graphs and Results in the path /caprib/project/nelongata0/  

folders project
blast & filter

A new window will appear to start blast and to filter the blast reports. Of note, all created files (fasta, blast and filtered files) will be saved in their respective folders.

Open Project

This option is used when the user has other analysis to do on an existing project.

Then, the user can choose the existing project.

open an existing project

Blast & Filter

To start the blast, the user will need input files: the protein fasta file and the feature file for the reference organism. Neisseria elongata is used in our example. The files are provided for training purposes in the tutorial folder named respectively:

-Neisseriaelongata_protein.faa

-Neisseriaelongata_feature_table.txt

For other reference organisms, these files (.faa and _feature_table.txt) are generally available on the NCBI FTP: (ftp://ftp.ncbi.nlm.nih.gov/genomes/refseq/bacteria/). The third output file needed is the gDNA sequence (.dna) as a fasta file for the other organism that will be compared (herein Neisseria meningitidis) with the reference. This file can also be found in the NCBI FTP. The user can choose the expect value cutoff (for explanation on this value see: https://blast.ncbi.nlm.nih.gov/Blast.cgi?CMD=Web&PAGE_TYPE=BlastDocs&DOC_TYPE=FAQ#expect). It is expressed in e^(value entered by the user). By clicking "Blast" the tblastn analyses will start.
The report of these analyses is stored in the Blast folder.  

Blast
blast
Filter
filter

The user must do this operation for all genomes that will be compared with the reference. At this point, the user can filter the blast results (---vs---.txt in the “blast” folder) separately to exclude non-conserved proteins if needed. The % of identity score threshold is chosen by the user. A threshold of 0% will include all proteins. If needed, the filtering can be redone later using another threshold. Again, this operation needs to be done for all genomes. The output file will be stored in the "Filtered" folder.