3.4 Multi Point Analysis
3.4.3 Multi point calculation using F ourier T ransforms
between marker 1 and 2. In the gure
v 2 = w 3
is updated according to allstatesof
v 1
.The idea behind multi point analysis is that we exploit thefact that few
crossoversaremost likelyto occurbetween two markers, sincethe
recombina-tionfractionbetweentwoadjacentmarkersisalwayslessthan
1 2
. Forinstance,if we have observed the exact state of
v 1
asv 3
, that isP ( v 1 = v 3 |G 1 ) = 1
,thentheprobabilityofall inheritancevectorsat
v 2
close24tov 3
getincreasedprobability,whereasinheritancevectorsfar 25
from
v 3
getdecreasedprobability. More precisely, the contribution from any statev i
ofv 1
is given by theprobability oftheinheritance vector at marker1conditioned on thegenotype
informationat thatlocus,
P (v i |G 1 )
,timesthetransitionprobabilityP (w 3 | v i )
.The transition probability is given by the Hamming distance between the
two inheritance vectors which indicates the number of odd crossovers that
have occurred between two markers on a chromosome. The probability of
d
crossoversoccurringbetweentwo markerswithaninheritance vectoroflength
n
isθ d 1 · (1 − θ 1 ) n − d
.We thensum over thecontribution to
w 3
of every vectorv i
, whichcorre-spondsto marginalizing out
v 1
. The sum inthen multiplied with thecondi-tionalprobabilityof
w 3
givenG 2
. ThisproductisproportionaltoP (w 3 |G 1 , G 2 )
.We compute this product for every inheritance vector at marker
M 2
togetthe full probability distribution,
P ( v 2 |G 1 , G 2 )
26. Theupdated probability distribution at marker 2 can then be used to calculate the left-conditionedprobability at marker 3 and so forth. The right-conditioned probability is
calculatedinat similar fashion.
Using this procedure we can compute
P( v i |G all )
for anyv i
. Thesevaluescanthenbeusedinthescoringfunctions,i.e. LODscore,todeterminelinkage
between markers and traits.
where
|v|
isthe numberof inheritance vectors. A convolution hasto be done twice for each marker: Once for calculating the left conditioned probabilityandoncefor the right conditioned probability.
In[Gud00 ]Gudbjartssonshowshowthe algorithmsforFastFourier
Trans-forms can berewritten for better utilisation of CPU cache and registers with
speeds-upranging fromafactor of1.9 to5.3dependingon thedataanalysed,
[Gud00 ,page 32-34]. The speed-ups areobtained througha reordering of the
computationssothat they take advantage ofthe cache memory,and through
unrollingloopswhich reduces the amount of book-keeping.
For more on Fourier Transforms within the eld of linkage analysis, see
[KL98 ]and [Gud00 ].
3.4.4 Founder Reduction
One way of reducing the number of computations for the set of inheritance
vectorsfor a pedigree isbyapplying thesocalled founderreduction, [Gud00,
page34]. Theintuitiveideaisthatconsistentlychangingtheallelepointingto
afounder, frompaternal to maternaland viceversa for all children, yields an
newinheritance vector withthesame probability astheoriginal vector. This
isbecause we cannotdistinguish between maternaland thepaternal alleles of
founderssince the phaseis unknown.
Noticethatitisnotareductioninthenumberofinheritancevectors,rather
anexploitation of symmetriesinthe probabilitydistribution.
For the
{ p, m }
describing paternal and maternal inheritance we dene in-versionsuchthat:p = m
andm = p.
Thefounderreductionappliesfor anyfounder,butfor convenience we just
state the reduction for one male founder. The reduction for all founders is
equivalent. Stated formally:
Theorem 6 (Founder Reduction) Let
h ∈ F
be a male founder in somepedigree
P = h F, N, f ather, mother i
. LetC
denote the set of children ofh
,thatis
C = { n ∈ N | f ather(n) = h }
. Letv
andw
be two inheritance vectors, suchthat for anyn ∈ N
:w(n, p) =
( v(n, p) : n ∈ C v(n, p) :
otherwise;w(n, m) = v(n, m).
Givensomegenotypeinformation
G = hL, astatesi
onthepedigreethefollowingalwaysholds:
P(v |G ) = P(w |G ).
(3-21)Intuitively,thedenitionof
v
andw
statesthatifachildofh
hasinheritedh
's paternal allele according tov
, then that child has inheritedh
's maternalallelein
w
,and viceversa.Proof: Prior to proving for multi point linkage analysiswe prove thatit
holds for single point linkage analysis. The denition of
v
andw
implies thefollowing relationship between
F v
andF w
:F w (n, $) =
(h, m) : F v (n, $) = (h, p) (h, p) : F v (n, $) = (h, m) F v (n, $) :
otherwise,
where
$ ∈ { p, m }
. Theorem1 statesthat:ϕ v G def = ^
n ∈L
(f F v ( n,p ) = a 1 ∧ f F v ( n,m ) = a 2 ) ∨ (f F v ( n,p ) = a 2 ∧ f F v ( n,m ) = a 1 ) ,
where
a 1 , a 2 ∈ astates(n)
. By denitionϕ w G
is equal toϕ v G
with regard to allsub-expressions,exceptthatanysub-expression:
(f h,$ = a 1 ∧ f n,$ 0 = a 2 ) ∨ (f h,$ = a 2 ∧ f n,$ 0 = a 1 ),
in
ϕ v G
lookslike:(f h,$ = a 1 ∧ f n,$ 0 = a 2 ) ∨ (f h,$ = a 2 ∧ f n,$ 0 = a 1 ),
in
ϕ w G
. Thismeans that any constraints on founder allele(h, $)
inϕ v G
is alsoaconstraint on
(h, $)
inϕ w G
.Thisimpliesthatforanyfounderalleleassignment,
Z M
,whichsatisesϕ v G
that assigns the allelic state
a
to(h, $)
, there exists a similar founder alleleassignment with equal probability,
Z M 0
, which satisesϕ w G
that assigns theallelicstate
a
to(h, $)
. Inother words:∀ Z M ∈ [[ϕ v G ]] ∃ Z M 0 ∈ [[ϕ w G ]] ∀ n ∈ F. Z M 0 (n, $) = (
Z M (n, $) : n = h Z M (n, $) :
otherwise,fora xed
h
. Theoppositeisalso truesincewecan applythesame argumentusing
v
asw
andw
asv
. Furthermore,P (Z M ) = P (Z M 0 )
since they assignexactlythe same set of allelic states to founder alleles, which in turnimplies
that:
X
Z M ∈[[ ϕ v G ]]
P(Z M ) = X
Z M 0 ∈[[ ϕ w G ]]
P (Z M 0 )
andfrom (3-5)we deduce:
P(v |G ) = P(w |G ).
case. To do this we use the fact that founder reduction holds in the single
point case, thatis weusethat(3-22)holds. We considertwo markers
M 1
andM 2
withcomputed single point probabilities. We prove that transferring the probabilities fromM 1
toM 2
preserves the equalitybetween the probabilities of the two inheritance vectorsv 2
andw 2
, wherev 2
andw 2
are inheritance vectors for markerM 2
, andv 2
andw 2
are dened atv
andw
, respectively, in Theorem 6. Hence, we want to show that two inheritance vectors in thesameequivalence class,due tothefounderreductionat marker
M 2
,remaininthe same equivalence class after we have propagated the evidence (genotype
information)observed at marker
M 1
.Assumethatavector,
v 1
,issomeinheritancevectorformarkerM 1
,andw 1
isdened intermsof
v 1
asdescribedin Theorem6. Observe thatP (v 1 |G 1 ) = P(w 1 |G 1 )
, andP (v 2 |G 2 ) = P (w 2 |G 2 )
due to (3-22), whereG 1
andG 2
are thegenotype information for marker
M 1
andM 2
,respectively. We want to show that:P (v 2 |G 1 , G 2 ) = P (w 2 |G 1 , G 2 )
(3-23)Lettherecombinationfractionbetween
M 1
andM 2
beθ 1
. Thencontribu-tion in terms of probability from
v 1
tov 2
is given byθ 1 d v (1 − θ 1 ) n − d v
, whered v = Ham(v 1 , v 2 )
is the Hamming distance between thevectors andn
is thelengthofaninheritance vector forthe pedigree. To showthat(3-23) holdswe
needtoshowthatthecontributionof
v 1
andw 1
totheprobabilityofv 2
,isex-actlythesameasthecontributionof
v 1
andw 1
tow 2
. TodothisweprovethatHam(v 1 , v 2 ) = Ham(w 1 , w 2 )
and thatHam(v 1 , w 2 ) = Ham(w 1 , v 2 )
,becausethis implies that
v 2
andw 2
are updated with exactly the same probabilities given the observed genotype forM 1
. We only prove this forHam(v 1 , v 2 ) = Ham(w 1 , w 2 )
sincethe oppositeis similar.The value
w 1
andv 1
are identical for all(n, $)
, except inthe case where$ = p
andf ather(n) = h
, in which case the value diers. Since the sameappliesfor
v 2
andw 2
weneedonlyconcentrateonHammingdistancesbetween theinheritance vectorsforchildrenofh
. Forany(n, $)
ifv 2 (n, $) 6 = v 1 (n, $)
then,bythe denitionof
w 2
andw 1
itfollowsthatw 2 (n, $) 6 = w 1 (n, $)
. Thesame argument applies for parts of the inheritance vectors where
v 2 (n, $) = v 1 (n, $)
. Inthiscase, bydenitionw 2 (n, $) = w 1 (n, $)
,hencetheHammingdistancebetween
v 1
andv 2
isequalto theHammingdistancebetweenw 1
andw 2
.The same can be shown for
Ham(v 1 , w 2 ) = Ham(w 1 , v 2 )
with the samearguments. For this reason, in the multi point step the founder reduction
holds.
♦
In[Gud00 ]Gudbjartssondenesfoundercouplereduction asanadditionalway
of reducing the number of computations performed during single and multi
point linkage analysis. Intuitively, founder reduction states that we cannot
distinguish between the maternal and paternal alleles of founders, that is,
we can switch between their two alleles. Founder couple reduction states,
intuitively,thatwecannotdistinguishthemaleandfemalefounderofafounder
couple. Afounder couple istwo founderswho share ospringinthepedigree.
Westate this formallyinthefollowing theorem:
Theorem 7 (Founder Couple Reduction) Let
h, h 0 ∈ F
be the male andfemalefounder,respectively, of a founder couple insome pedigree
P = hF, N, f ather, motheri
. LetC
denote theset of childrenofh
andh 0
, thatis
C = { n ∈ N | f ather(n) = h
andmother(n) = h 0 }
. Letv
andw
be twoinheritance vectors, suchthat for any
n ∈ N
:w(n, $) =
v(n, $) : n ∈ C
v(n, $) : f ather(n) ∈ C
ormother(n) ∈ C v(n, $) :
otherwise.
Given some genotype information
G = hL, astatesi
on the pedigree whereh, h 0 ∈ L /
and that neitherh
norh 0
has any children inV
not inC
, thefollowingalways holds:
P(v|G ) = P(w|G ).
(3-24)We do not prove this, but refer the keen reader to [Gud00 , page 35] for
moreinformation on foundercouple reduction.