Professional Documents
Culture Documents
AND
DATABASES
BY
C.GAYATHRI
(I M.Sc.BIOINFORMATICS)
Needleman Wunsch
Algorithm
Smith Waterman
Algorithm
Published in 1970 by SAUL NEEDLEMAN
and CHRISTIAN WUNSCH
General algorithm for sequence
comparison
Commonly used in bioinformatics to align
protein or nucleotide sequences
Example of dynamic programming, and was
the first application of dynamic
programming to biological sequence
comparison.
Scores for aligned characters are
specified by a SIMILARITY MATRIX. Here,
S(i, j) is the similarity of characters i and
j. It uses a LINEAR GAP PENALTY, here
called d.
Maximizes a similarity score, to give
‘MAXIMUM MATCH’
Maximum match = largest number of
residues of one sequence that can be
matched with another allowing for all
possible deletions
Finds the best GLOBAL alignment of any
two sequences
N-W involves an iterative matrix method of
calculation
All possible pairs of residues (bases or amino
acids) - one from each sequence - are
represented in a 2-dimensional array
All possible alignments (comparisons) are
represented by pathways through this array
Three main steps
GCCUCG
GCCAUUG
Needleman-Wunsch Smith-Waterman
1. Global alignments 1. Local alignments
2. Requires alignment score for 2. Residue alignment score may
a pair of residues to be >=0 be positive or negative
3. No gap penalty required 3. Requires a gap penalty to
work effectively
4. Score cannot decrease 4. Score can increase, decrease
between two cells of a or stay level between two
pathway cells of a pathway
5. Trace back is mostly from 5. Trace back is from the cell
the last cell that has the that has the highest score
highest score
CONCLUSION