BLAST (Basic Local Alignment Search Tool) is a powerful and widely used bioinformatics program that allows users to compare a nucleotide (DNA/RNA) or protein sequence—the query—against databases of known sequences[1][2][3]. The primary purpose is to identify regions of local similarity, which can provide insight into gene or protein function, structure, and evolutionary relationships[1][3][4].
For example, if you sequence a novel gene or protein and want to learn more about its possible function or origin, BLAST helps by:
There are several BLAST programs, each suited to different types of queries and databases. Here’s a summary of the main BLAST types[3][6][7]:
Tool | Query | Database | Purpose |
---|---|---|---|
blastn | Nucleotide | Nucleotide | Finds similar DNA/RNA sequences |
blastp | Protein | Protein | Compares protein sequences |
blastx | Nucleotide (translated) | Protein | Translates DNA sequence (6 frames) and searches protein matches |
tblastn | Protein | Nucleotide (translated) | Compares protein against a translated DNA database |
tblastx | Nucleotide (translated) | Nucleotide (translated) | Compares protein-coding potential of novel DNA |
Use Cases:
Website: NCBI BLAST web tool[8]
Use by selecting the appropriate program (e.g., blastn
, blastp
) and submitting a FASTA sequence. You can compare against databases like nr
, refseq
, or swissprot
.
Important output metrics:
Applications: Identifying gene/protein functions, finding homologs, and exploring evolutionary relationships[1][8][3][1][8][3].
Suitable for large-scale, high-throughput or automated sequence analyses. BLAST+ provides a suite of command-line tools that can be run locally. This is essential for:
Installers and databases can be downloaded from NCBI, and tools are supported across different platforms. BLAST+ offers flexibility, database management, and can run without the restrictions of the web version[9][10][11][9][10][11].
Website: EBI BLAST
The EMBL-EBI also provides web interfaces to BLAST and related tools, supporting nucleotide and protein alignments with direct integration to many European sequence databases[12][13][12][13].
When you run BLAST, the output is typically broken down into:
Key Metrics: