Sequence Retrieval by Keywords

Enter a keyword to search within the lettuce or sunflower contig description:
Assembly          BLAST vs. NCBInr    BLAST vs. Arabidopsis
LACT_5CDS.CSA1    NONE                cds_LACT_5CDS_CSA1_blast_refgen
HELI_7CDS.CSA1    NONE                cds_HELI_7CDS_CSA1_blast_refgen
AST_11CDS.CSA1    NONE                cds_AST_11CDS_CSA1_blast_refgen

Search Pattern:
Cutoff value:
norm = normalized expectation = -log10(exp)/100
for example: exp = 1e-35 gives norm = 0.35

Short Description:

The table you select contains the results of a BLASTX search of the
selected assembly against the NCBI non-redundant protein database.
Each table entry represents one BLAST hit, so a contig has as many
records as the BLAST search produced hits for it.

Blast Search Parser matches the given keyword against the words in each BLAST hit description line.
If the description line contains the keyword, the hit's normalized expectation value is compared with
the cutoff value. For each contig, Blast Search Parser extracts the hit with the largest normalized
expectation value above the cutoff value.

Search instructions:

How the search is performed:

The keyword is searched for in the description line of each hit in the BLAST report. If the keyword is
found, the normalized expectation value of that hit is checked. For each contig, the hit with the largest
normalized expectation value above the cutoff value is extracted.
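
The procedure described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the code behind this page: the Hit record, the function name best_keyword_hits, the case-insensitive substring match, and the sample records are all hypothetical. The comparison uses "greater than or equal to the cutoff", following the definition of n under "Table fields".

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Hit:
    """One BLAST hit record (hypothetical layout, for illustration only)."""
    contig_id: str
    subject_id: str
    description: str
    norm_exp: float  # normalized expectation: -log10(E-value) / 100


def best_keyword_hits(hits, keyword, cutoff):
    """Map each contig ID to its strongest keyword-matching hit at or above the cutoff."""
    needle = keyword.lower()  # assumption: matching ignores case
    best = {}
    for hit in hits:
        if needle not in hit.description.lower():
            continue  # keyword not present in the description line
        if hit.norm_exp < cutoff:
            continue  # hit is weaker than the cutoff
        current = best.get(hit.contig_id)
        if current is None or hit.norm_exp > current.norm_exp:
            best[hit.contig_id] = hit
    return best


# Made-up records, not real search results.
hits = [
    Hit("contig_A", "subj_1", "putative chalcone synthase", 0.35),
    Hit("contig_A", "subj_2", "chalcone synthase-like protein", 0.80),
    Hit("contig_A", "subj_3", "hypothetical protein", 0.90),
    Hit("contig_B", "subj_4", "chalcone synthase", 0.05),
]
result = best_keyword_hits(hits, "synthase", 0.10)
```

With these sample records, contig_A is reported through its hit subj_2, and contig_B is dropped because its only matching hit falls below the cutoff.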

How to understand the output:

The unique contigs found by the search are listed in a table.

Table fields

The first column is the contig ID.

The second column is a fraction n / N.

             n - number of hits with the same contig ID that have a normalized
                 expectation value greater than or equal to the cutoff norm value
                 and that contain the search keyword in the hit description
                 field.
             N - number of all BLAST hits with the same contig ID.
        

The third column is the description line of the BLAST hit in which the keyword was found.

The fourth column is the normalized expectation value of that hit.

The fifth column is the Subject_ID.

The sixth column is the maximum expectation value among all the hits having the same contig ID.
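
As a rough illustration of how the six columns relate to the underlying hit records, the following Python sketch builds one output row per contig. It is an assumption-laden example rather than the actual implementation: the Hit record, the function build_rows, the case-insensitive match, and the sample records are hypothetical, and the sixth column is assumed to be on the same normalized scale as the fourth.

```python
from collections import defaultdict
from typing import NamedTuple


class Hit(NamedTuple):
    """One BLAST hit record (hypothetical layout, for illustration only)."""
    contig_id: str
    subject_id: str
    description: str
    norm_exp: float  # normalized expectation: -log10(E-value) / 100


def build_rows(hits, keyword, cutoff):
    """Return one six-column row per contig that has a qualifying hit."""
    needle = keyword.lower()  # assumption: matching ignores case
    by_contig = defaultdict(list)
    for hit in hits:
        by_contig[hit.contig_id].append(hit)

    rows = []
    for contig_id, group in by_contig.items():
        # Hits counted in n: keyword present and value at or above the cutoff.
        matching = [
            h for h in group
            if needle in h.description.lower() and h.norm_exp >= cutoff
        ]
        if not matching:
            continue
        best = max(matching, key=lambda h: h.norm_exp)
        rows.append((
            contig_id,                            # column 1: contig ID
            f"{len(matching)} / {len(group)}",    # column 2: n / N
            best.description,                     # column 3: description line
            best.norm_exp,                        # column 4: value of that hit
            best.subject_id,                      # column 5: Subject_ID
            max(h.norm_exp for h in group),       # column 6: maximum over all hits
        ))
    return rows


# Made-up records, not real search results.
hits = [
    Hit("contig_A", "subj_1", "putative chalcone synthase", 0.35),
    Hit("contig_A", "subj_2", "chalcone synthase-like protein", 0.80),
    Hit("contig_A", "subj_3", "hypothetical protein", 0.90),
    Hit("contig_B", "subj_4", "chalcone synthase", 0.05),
]
rows = build_rows(hits, "synthase", 0.10)
```

In this example contig_A gets the fraction 2 / 3: two of its three hits contain the keyword and meet the cutoff, while the sixth column (0.90) comes from a hit that does not contain the keyword.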

Default parameters:

normalized expectation = -log10(exp)/100
for example: exp = 1e-35 gives norm exp = 0.35
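
The conversion can be checked with a short Python sketch. The function name normalized_expectation is hypothetical, and the base-10 logarithm is inferred from the example (1e-35 maps to 0.35). How this page treats an expectation value of exactly 0 is not stated, so the sketch simply rejects it.

```python
import math


def normalized_expectation(e_value: float) -> float:
    """Return -log10(e_value) / 100 for a positive BLAST expectation value."""
    if e_value <= 0.0:
        # Assumption: zero or negative input is treated as an error here.
        raise ValueError("expectation value must be positive")
    return -math.log10(e_value) / 100.0


norm = normalized_expectation(1e-35)
```

Smaller expectation values (stronger hits) give larger normalized values, which is why the search keeps hits above the cutoff.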