Professional Documents
Culture Documents
and insertions.
time present 3
National Taiwan University
Department of Computer Science
and Information Engineering
Haplotype
A set of closely linked SNPs located on one
chromosome.
SNP 1
SNP 2
SNP 3
GATATTCGTACGGA-T
Haplotypes
GATGTTCGTACTGAAT
GATATTCGTACGGA-T AG- 2/6
GATATTCGTACGGAAT GTA 3/6
GATGTTCGTACTGAAT AGA 1/6
GATGTTCGTACTGAAT
DNA
Sequences
4
National Taiwan University
Department of Computer Science
and Information Engineering
5
National Taiwan University
Department of Computer Science
and Information Engineering
Linkage Disequilibrium
A B
A b
Consider only a B
two SNPs a b
There are 4 possible haplotypes
SNP 1
B b Total
Total PB Pb 1.0 6
National Taiwan University
Department of Computer Science
and Information Engineering
Linkage Equilibrium
PAB = PAPB
PAb = PAPb = PA(1-PB)
PaB = PaPB = (1-PA) PB
Pab = PaPb = (1-PA) (1-PB) SNP 1
B b Total
A PAB PaB PA
SNP 2
a PaB Pab Pa
Total PB Pb 1.0
7
National Taiwan University
Department of Computer Science
and Information Engineering
Linkage Disequilibrium
PAB ≠ PAPB
PAb ≠ PAPb = PA(1-PB)
PaB ≠ PaPB = (1-PA) PB
Pab ≠ PaPb = (1-PA) (1-PB)
SNP 1
B b Total
A PAB PaB PA
SNP 2
a PaB Pab Pa
Total PB Pb 1.0
8
National Taiwan University
Department of Computer Science
and Information Engineering
An Example of Linkage
Disequilibrium
Before mutation After mutation
-- A -- -- -- G -- -- -- -- A -- -- -- G -- -- --
-- C -- -- -- G -- -- -- -- C -- -- -- G -- -- --
PA=1/2 PG=1 -- C -- -- -- C -- -- --
PC=1/2 PA=1/3 PG=2/3
PC=2/3 PC=1/3
An Example of Linkage
Equilibrium
Before recombination After recombination
-- A -- -- -- G -- -- -- -- A -- -- -- G -- -- --
-- C -- -- -- G -- -- -- -- C -- -- -- G -- -- --
-- C -- -- -- C -- -- -- -- C -- -- -- C -- -- --
-- A -- -- -- C -- -- --
After recombination, PA=1/2 PG=1/2
PAG = PAPG = 1/4, PC=1/2 PC=1/2
PCG = PCPG = 1/4,
PCC = PCPC = 1/4, and
PAC = PAPC = 1/4.
Thus, these two SNPs are linkage equilibrium. 10
National Taiwan University
Department of Computer Science
and Information Engineering
D Coefficient
We can measure the non-randomness of two loci by
means of a deviation, D, defined as follows:
D = PAB – PAPB or PABPab – PAbPaB
PAB = PAPB + D
PAb = PA(1-PB) - D
PaB = (1-PA) PB - D
11
National Taiwan University
Department of Computer Science
and Information Engineering
Standardization of D Coefficient
D coefficient can be standardized in many ways.
D’ = D/Dmax, where Dmax stands for the absolute maximal
possible value of D.
D
min( P P , P P ) , if D 0; D D
D' A B a b
D
, if D 0.
min( PA Pb , Pa PB )
PA PB D PAB 0 D PA PB -PAPB 0 PaPB
Pa Pb D Pab 0 D Pa Pb
PA Pb D PAb 0 D PA Pb
Pa PB D PaB 0 D Pa PB 12
National Taiwan University
Department of Computer Science
and Information Engineering
Interpretation of D’
D’ is constrained between -1 and +1.
D’ = 1 (perfect positive LD between SNP alleles)
D’ = 0 (linkage equilibrium between SNP alleles)
D’ = -1 (perfect negative LD between SNP alleles)
D’ = 0.87 (strong positive LD between SNP alleles)
D’ = 0.12 (weak positive LD between SNP alleles)
Other measures of D coefficient:
2
r2 or Δ2: D
2
PA (1 PA ) PB (1 PB )
Chi-square Test.
P value.
13
National Taiwan University
Department of Computer Science
and Information Engineering
14
National Taiwan University
Department of Computer Science
and Information Engineering
Chromosome 15
National Taiwan University
Department of Computer Science
and Information Engineering
G A
Diploid
T C
16
SNP1 SNP2
National Taiwan University
Department of Computer Science
and Information Engineering
1|2 1|2
17
1|2 3|2
National Taiwan University
Department of Computer Science
and Information Engineering
Thoughts
How to modify the tag SNP selection algorithm to
process genotype data.
The naïve approach is inferring haplotype data by
existing algorithms and finding tag SNPs.
Is it possible to determine tag SNPs directly from
genotype data?
Assume 0: homozygous wild type, 1: homozygous
mutant, 2: heterozyhous. P P P P
1 2 3 4
S1 1 1 0 0
S2 1 0 1 0
S3 1 2 0 1
S4 1 2 0 1 19
National Taiwan University
Department of Computer Science
and Information Engineering
20
National Taiwan University
Department of Computer Science
and Information Engineering
21