References: Sequence Alignment and Homology
-
Sequence Alignment - Wikipedia - Comprehensive overview of pairwise and multiple sequence alignment methods, scoring systems, gap penalties, and the biological significance of sequence conservation and homology.
-
BLAST (Biotechnology) - Wikipedia - Explains the Basic Local Alignment Search Tool algorithm, including word-seeding heuristics, E-value statistics, database searching strategies, and the various BLAST program variants.
-
Smith-Waterman Algorithm - Wikipedia - Detailed description of the dynamic programming algorithm for local sequence alignment, including the scoring matrix, traceback procedure, and comparison with Needleman-Wunsch global alignment.
-
Biological Sequence Analysis - Richard Durbin - Cambridge University Press - Foundational textbook covering pairwise alignment, substitution matrices, hidden Markov models, and profile HMMs with rigorous probabilistic treatment essential for understanding homology search.
-
Introduction to Bioinformatics (5th Edition) - Arthur Lesk - Oxford University Press - Accessible coverage of sequence alignment algorithms, scoring matrices like BLOSUM and PAM, statistical significance of alignments, and practical BLAST usage for homology detection.
-
NCBI BLAST Help - NCBI - Official BLAST documentation explaining program selection, parameter tuning, output interpretation, E-value meaning, and best practices for sequence similarity searching.
-
EMBOSS Pairwise Alignment Tools - EMBL-EBI - Web-based tools for Needleman-Wunsch and Smith-Waterman alignments with interactive parameter control, demonstrating the difference between global and local alignment approaches.
-
PFAM Database - EMBL-EBI InterPro - Protein family database built on profile HMMs, demonstrating how sequence alignment and homology detection are applied to classify proteins into evolutionary families and domains.
-
Biopython Pairwise Alignment Tutorial - Biopython - Tutorial for performing sequence alignments in Python using Biopython's pairwise alignment module, covering substitution matrices, gap penalties, and alignment scoring.
-
NCBI Substitution Matrices - NCBI - Repository of BLOSUM and PAM substitution matrices used in sequence alignment, essential for understanding how amino acid similarity scores drive homology detection algorithms.