Sequence Retrieval by Keywords

Enter a keyword to search within the lettuce or sunflower contig description:
AssemblyBLAST vs. NCBInr BLAST vs. Arabidopsis
L2_CDS_Contig#c_asmbl_L2_contig_blast_ncbi c_asmbl_L2_contig_blast_refgen
CLS_S3_Contig#c_asmbl_CLS_S3_contig_blast_ncbi c_asmbl_CLS_S3_contig_blast_refgen
CLX_S3_Contig#NONE c_asmbl_CLX_S3_contig_blast_refgen
CHA_M2_Contig#NONE c_asmbl_CHA_M2_contig_blast_refgen
CTO_M1_Contig#NONE c_asmbl_CTO_M1_contig_blast_refgen

Search Pattern:
Cutoff value:
norm = normalized expectation = -log(exp)/100
for example: 1e-35 = 0.35 (norm exp)

Short Description:

The table of your choice (Lettuce or Sunflower) contains data generated by
BLASTX search of the Lettuce or Sunflower assembly versus the non-redundant
protein NCBI database. One table entry represents a BLAST hit. The table contains multiple
records for each contig depending on number of hits to each contig generated by BLAST search.

Blast Search Parser matches a given a keyword with the words in BLAST hit description line.
If the description line contains a searchable word, a cutoff value is compared. Blast Search Parser
will extract the contig with the largest expectation value above the cut off value.

Search instructions:

How the search is performed:

The keyword is searched in the description line of BLAST report. If the keyword is found, the normalized
expectation value of that sequence is checked. The sequence with the largest normalized expectation value
above the cutoff value is extracted.

How to understand the output:

Unique contigs resulted in this search are put into a table.

Table fields

The first column is the contig ID.

The second column is a fraction n / N.

             n - number of sequences with the same contig ID that have normalized
                 expectation values bigger than or equal to the cutoff norm value
                 and that have the searchable keyword in the contig description
                 field.
             N - number of all BLAST hits with the same contig ID.
        

The third column is the description line of Blast hit where keyword was found.

The fourth column is the normalized expectation value of a sequence.

The fifth column is the Subject_ID.

The sixth column is the maximum expectation value among all the hits having the same contig ID.

Default parameters:

normalized expectation = -log(exp)/100
for example: 1e-35 = 0.35 (norm exp)