Directory UMM :Data Elmu:jurnal:M:Mathematical Biosciences:Vol166.Issue2.Aug2000:
Mathematical Biosciences 166 (2000) 1±21
www.elsevier.com/locate/mbs
The eective number of a population that varies cyclically in
size. I. Discrete generations
Yufeng Wang a,1, Edward Pollak b,*
a
b
Department of Zoology and Genetics, Iowa State University, Ames, IA 50011-1210, USA
Statistical Lab, Department of Statistics, 111 Snedecor Hall, Iowa State University, Ames, IA 50011-1210, USA
Received 28 September 1999; received in revised form 9 May 2000; accepted 16 May 2000
Abstract
We consider a dioecious population having numbers of males and females that vary over time in cycles of
length k. It is shown that if k is small in comparison with the numbers of males and females in any generation of the cycle, the eective population number (or size), Ne , is approximately equal to the harmonic
mean of the eective population sizes during any given cycle. This result holds whether the locus under
consideration is autosomal or sex-linked and whether inbreeding eective population numbers or variance
eective population numbers are involved in the calculation of Ne . If, however, only two successive generations in the cycle are considered and the population changes in size between these generations, the inbreeding eective population number, NeI , diers from the variance eective population number, NeV . The
mutation eective population number turns out to be the same as the number derived using calculations
involving probabilities of identity by descent. It is also shown that, at least in one special case, the eigenvalue eective population number is the same as NeV . Ó 2000 Published by Elsevier Science Inc. All
rights reserved.
Keywords: Eective population size; Cyclic variation
1. Introduction
Random genetic drift is an important in¯uence on the genetic variability of a ®nite population
and a numerical measure of its in¯uence is the eective population size. Wright [1±3] presented the
®rst example for a dioecious population and a general expression for monoecious populations that
*
1
Corresponding author. Tel.: +1-515 294 7765; fax: +1-515 294 4040.
E-mail addresses: ywang@iastate.edu (Y. Wang), pllk@iastate.edu (E. Pollak).
Tel.: +1-515 294 9053.
0025-5564/00/$ - see front matter Ó 2000 Published by Elsevier Science Inc. All rights reserved.
PII: S 0 0 2 5 - 5 5 6 4 ( 0 0 ) 0 0 0 2 7 - 4
2
Y. Wang, E. Pollak / Mathematical Biosciences 166 (2000) 1±21
do not change in size. Later, Crow [4] showed, for a monoecious population, that the eective
population size measuring the increase in the probability that two copies of a gene are identical by
descent diers from the eective population size involved in the variance of allele frequency
changes, except if the size of the population remains constant. Other early work on this subject is
discussed in the text by Crow and Kimura [5].
Recently, Caballero [6] and Nagylaki [7] derived general expressions for the eective population
sizes of random mating dioecious populations, whether the locus under consideration is autosomal or sex-linked. For an autosomal locus, the results are consistent with those obtained by
Crow and Denniston [8]. Caballero [6] and Nagylaki [7] assume, however, that the numbers of
males and females do not change between generations. However, wild populations can ¯uctuate
considerably in size from year to year. A simple way to describe such populations, which may not
be too unrealistic, is to assume that these changes occur in repeated cycles. Wright [2,3] showed
that, for monoecious populations in which individuals have approximate Poisson distributions of
numbers of successful gametes in ospring, the eective population size is approximately equal to
the harmonic mean of the populations sizes in a cycle, provided the length of a cycle is short in
comparison to those sizes. Pollak [9] has shown that this result also holds generally for ospring
distributions with ®nite variances. However, to the best our knowledge, no such results have yet
been presented for dioecious populations. In the next two sections we shall derive eective population sizes for autosomal and sex-linked loci, by generalizing the reasoning used by Caballero [6]
and Nagylaki [7], whereby recurrence equations are obtained for probabilities of identity by
descent. Next, there will be alternative derivations for autosomal and sex-linked loci, which rely
on obtaining the variance of the change in the frequency of an allele in a generation. Here the
reasoning will be a discrete generation version of the reasoning used by Hill [10] and Pollak [11],
respectively, for an autosomal and a sex-linked locus. This alternative approach leads to the same
expressions as before for eective population sizes if the population is followed through an entire
cycle, although the individual terms in the resulting harmonic mean are not quite the same as the
corresponding terms that arise from the identity-by-descent approach. Mutation eective population numbers are also derived, as are eigenvalue eective population numbers in one special case. 2
2. Autosomal loci
We consider an autosomal locus in a population whose size undergoes repeated cycles of length
k. Let us suppose that at times t and t 1 the population is respectively in generations i and i 1
of a cycle. Then, at time t 1,
Fi1;t1 the inbreeding coefficient of a random individual
and
fvw;i1;t1 the coefficient of coancestry of a random pair of separate individuals of sexes
v and w;
2
Journal paper No. J-18628 of the Iowa Agriculture and Home Economics Experiment Station, Ames, IA, Project
No. 3201, and supported by Hatch Act and State of Iowa Funds.
Y. Wang, E. Pollak / Mathematical Biosciences 166 (2000) 1±21
3
where v and w can be replaced by either of the symbols m and f, denoting, respectively, male and
female. Let
i
P two copies of a gene in random separate offspring of sexes
Pu;vw
v and w were from the same parent of sex u generation i of a cycle:
Then
Fi1;t1 fmf ;i;t
1
and
fvw;i1;t1
1
i 1
i
i 1
i
P
1 Fi;t 1 ÿ Pm;vw fmm;i;t Pf ;vw 1 Fi;t 1 ÿ Pf ;vw fff ;i;t 2fmf ;i;t ; 2
4 m;vw 2
2
because half of the copies of a gene in an individual come from a male and half come from a
female. These equations can be simpli®ed if we set
Ui;t 1 ÿ Fi;t ;
dvw;it 1 ÿ fvw;i;t :
Then (1) and (2) reduce, respectively, to
Ui1;t1 dmf ;i;t
3
and
h
i
o
i
1 nh i
i
i
i
Pm;vw Pf ;vw Ui;t 21 ÿ Pm;vw
dmm;i;t 2 1 ÿ Pf ;vw dff ;i;t 4dmf ;i;t :
8
Eqs. (3) and (4) can be rewritten in matrix notation as
dvw;i1;t1
4
di1;t1 Ai di;t ;
5
where
i 8
6 P i P i
i
4
2 1 ÿ Pm;mm
6 m;mm
f ;mm
16
h
i
Ai 6
i
i
i
4
86
6 Pm;mf Pf ;mf 2 1 ÿ Pm;mf
4
h
i
i
i
i
Pm;ff Pf ;ff 2 1 ÿ Pm;ff
4
2
3
2
0
0 0 1 0
i
i
6
6 0 1 1 1 7 1 6 Pm;mm Pf ;mm
6
4
2
47
6
7 6 i
i
4 0 14 12 14 5 8 6
4 Pm;mf Pf ;mf
i
i
0 14 12 14
P
P
2
0
0h
m;ff
A0 Di :
Thus for all r
dir;tr Airÿ1 Airÿ2 Ai di;t
f ;ff
0h
i
3
i
2 1 ÿ Pf ;mm 7
7
h
i7
7
i
2 1 ÿ Pf ;mf 7
7
h
i 5
i
2 1 ÿ Pf ;ff
0
0
0
i
ÿ2Pm;mm
0
i
ÿ2Pf ;mm 7
7
7
i 7
ÿ2Pf ;mf 5
i
0
i
0
ÿ2Pm;mf
ÿ2Pm;ff
3
i
ÿ2Pf ;ff
4
Y. Wang, E. Pollak / Mathematical Biosciences 166 (2000) 1±21
i
We shall show later that if, at each stage i in the cycle, the numbers Nm i and Nf of males and
i
in the matrix Di are small. It can then be proved by
females are large, all the elements Pu;vw
induction that
Airÿ1 Airÿ2 Ai A0 Dirÿ1 A0 Dirÿ2 A0 Di Ar0
rÿ1
X
Dij Aj0 ; r P 1;
Arÿ1ÿj
0
j0
i
i
if r maxNm i ; Nf . Thus, if k maxNm i ; Nf ,
"
#
kÿ1
X
dik;tk Ak0
A0kÿ1ÿj Dij Aj0 di;t M Di di;t ;
6
j0
where M Ak0 and the elements of
Di
kÿ1
X
Dij Aj0
Akÿ1ÿj
0
j0
are all small.
Now
2
0
60
6
Aro 6
40
1
4
1
4
1
4
1
4
1
2
1
2
1
2
1
2
13
4
17
47
17
5
4
1
4
0
2 3
1
6 1 7 1 1 1
6 7
6 7 0
;
415
4 2 4
r P 2;
1
and it can also be shown that the eigenvalues of Ar0 are k1 1; k2 k3 k4 0. The left and right
eigenvectors p0 and v that correspond to k1 1; and satisfy the normalization conditions
p0 v p0 1 1 are
1 1 1
0
p 0
;
4 2 4
and
0
v 1 1 1 1 1 :
Since all the elements of Di are small, the dominant eigenvalue of M Di is
q 1 d;
where d is small. By a standard result from perturbation theory, as discussed, for example, by
Franklin [12, Section 6.12],
Y. Wang, E. Pollak / Mathematical Biosciences 166 (2000) 1±21
5
d p0 Di v
2 3
1
!6 7
X
kÿ1
617
1 1 1
7
Dj 6
0
617
4 2 4
5
4
j0
1
h
i 30
j
j
j
j
j
kÿ1
j
2 3
P
P
2 P
P
P
P
f ;mm
m;mf
f ;mf
m;ff
f ;ff
m;mm
j0
7 1
76 7
i
Pkÿ1 h j
7 617
j
j
2Pm;mf Pm;ff
ÿ2 j0 Pm;mm
76 7
76 7
7 415
7
0
h
i
5
Pkÿ1 j
j
j
1
ÿ2 j0 Pf ;mm 2Pf ;mf 2Pf ;ff
2P
6
6
1 6
6
6
32 6
6
4
ÿ
kÿ1 nh
i h
io
1 X
j
j
j
j
j
j
2Pm;mf 2Pm;ff Pf ;mm 2Pf ;mf Pf ;ff :
Pm;mm
32 j0
7
Thus, after many cycles, the probabilities of non-identity of pairs of copies of a gene shrink at a
steady rate that is approximately equal to 1 d per cycle, where d, the right-hand side of (7) is not
dependent on i. This rate may then be set equal to 1 ÿ 1= 2Ne k , so that
kÿ1 nh
io
i h
1
1 X
j
j
j
j
j
j
Pm;mm
2Pm;mf Pm;ff Pf ;mm 2Pf ;mf Pf ;ff :
Ne 16k j0
8
j
to be the number of successful gametes contributed by a parent of sex u in
We now de®ne Guv
generation j of a cycle to an ospring of sex v. Since we are considering neutral alleles and there
are Nu j parents of sex u and Nv j1 ospring of sex v,
j Nv j1
j :
E Guv
Nu
Since there is a random mating, any pair of gametes in ospring of sexes v and w coming from
parents of sex u is just as probable as any other pair, even if both gametes come from a single
individual. Thus,
j
Pu;vv
Nu j
ÿ 1
j ÿ j
Guv ÿ 1
E Guv
j1
ÿ 1
n
j o
ÿ j ÿ j 2
ÿ E Guv
E Guv
Var Guv
Nu j
j1
j1
Nv Nv
Nu j
j1
Nv
Nv
1
j1
Nv
ÿ1
ÿ j Nv j1
Var Guv j ÿ 1 ;
j1
Nv
Nu
9
6
Y. Wang, E. Pollak / Mathematical Biosciences 166 (2000) 1±21
and
Nu j
j
Pu;mf
j1
Nm
h
i
j j
G
E
G
um uf
j1
Nf
Nu j
1
j
j
;
G
Cov
G
j :
uf
um
j1 j1
Nm Nf
Nu
10
j
We assume that Nm j and Nf are all large so that the elements of Di are indeed small for all i. If,
therefore, (9) and (10) are substituted in (8), we obtain
8
!
"
>
kÿ1
ÿ j
1
1 X< Nm j
Nm j1
j
j
Cov
G
;
G
2 Var Gmm 2
mf
mm
j1
Ne
16k j0 >
Nf
: Nm j1
3
!2
j1 2
j1
j1
Nm
Nm
Nm
j
5
ÿ2
Var Gmf 4
j1
j
j
Nf
Nm
Nm
"
!
j
j1
Nf
N
f
j
j
j
Cov
G
;
G
2 Var Gff 2
ff
fm
j1
j1
Nm
Nf
!2
!2
!3 9
j1
j1
j1
=
Nf
Nf
Nf
j
5 :
11
4
Var
G
ÿ
2
fm
j1
j
j
;
Nm
N
N
f
f
3. Sex-linked loci
i
i
The notation remains the same as in the previous section, but now Pm;mm
Pm;mf 0 because
males get their copies of a gene only from their mothers. Females get half their copies of a gene
from parents of each sex. Therefore,
Fi1;t1 fmf ;i;t ;
12
fmm;i1;t1
i
Pf ;mm 12 1
fmf ;i1;t1
i
1
fPf ;mf 12 1
2
fff ;i1;t1
i
1
fPm;ff
4
Fi;t 1 ÿ
i
Pf ;mm fff ;it ;
Fi;t 1 ÿ
1 ÿ
i
Pm;ff fmm;i;t
i
Pf ;mf fff ;i;t
i
Pf ;ff 12 1
13
fmf ;i;t g;
Fi;t 1 ÿ
14
i
Pf ;ff fff ;i;t
2fmf ;i;t g:
15
As before, the recurrence equations can be simpli®ed if we set
Ui;t 1 ÿ Fi;t ;
dvw;i;t 1 ÿ fvw;it :
Eqs. (12)±(15) are then replaced by
di1;t1 Ai di;t ;
16
Y. Wang, E. Pollak / Mathematical Biosciences 166 (2000) 1±21
7
where
Ui;t1
2
3
7
6d
6 mm;i1;t1 7
di1;t1 6
7;
4 dmf ;i1;t1 5
dff ;i1;t1
0
2
6 4P i
f ;mm
16
Ai 6
i
6
8 4 2Pf ;mf
0
8
0
0
0
i
81 ÿ Pf ;mm 7
7
7
i
41 ÿ Pf ;mf 7
5
0
0
60
6
6
40
2
0
4
i
i
Pm;ff
i
Pf ;ff
3
4 21 ÿ Pf ;ff
21 ÿ
3
2
3
0
0
0 0
0 1 0
i
6 4P i
0
0 ÿ8Pf ;mm 7
f ;mm
7
0 0 17
7 16
7
6
6 i
i 7
1
17
0 ÿ4Pf ;mf 5
0 2 2 5 8 4 2Pf ;mf 0
1
4
1
2
1
4
i
Pf ;ff
i
ÿ2Pm;ff
i
0 ÿ2Pf ;ff
A0 Di :
It follows from (16) that
"
#
kÿ1
X
j
kÿ1ÿj
k
dik;tk A0
Dij A0 di;t M Di di;t ;
A0
17
j0
because, as in the previous section, it will be shown that all of the elements of Di are small. The
characteristic equation corresponding to A0 is
ÿk
0
1
1
1
k k ÿ 1k2 1k ÿ 1 0
ÿk
jA0 ÿ kIj ÿk 0
2
2
4
8
1
1
1
ÿ
k
4
2
4
and has the roots k1 1; k2 ÿ12; k3 14; k4 0. The eigenvectors of Ar0 associated with any power
r of Ar and the dominant eigenvalue 1 are, then,
1 4 4
0
p 0
9 9 9
and
v 1 1 1 1 10 :
It follows from perturbation theory that if all the elements of Di are small, the dominant eigenvalue of M Di is q 1 d, where because of periodicity
!
iÿ1
kÿ1
kÿ1
kÿ1
X
X
X
X
d p0 Di v p0 Dij v p0
Dj
Dj v p0 Dj v:
j0
ji
j0
j0
8
Y. Wang, E. Pollak / Mathematical Biosciences 166 (2000) 1±21
This is the same for all i. Thus
2
kÿ1 6
6
1 4 4 X
6
d p Di v 0
9 9 9 j0 6
4
0
0
1 j
ÿ 2 Pf ;mm
3
j
ÿ 14 Pf ;mf
j
j
ÿ 18 Pf ;ff 2Pm;ff
7
7
7
7
5
kÿ1
1 X
k
j
j
j
j
ÿ
:
P
2Pf ;mf Pf ;ff 2Pm;ff ÿ
18 j0 f ;mm
2Ne
18
As (9) and (10) still hold for a sex-linked locus, all the elements of Di are small. Their substitution
in (18) leads to
8
!
"
>
j
j1
kÿ1
Nf
1
1 X< Nf
j
j
j
Cov Gff ; Gfm
2 Var Gff 2
j1
Ne
9k j0 >
Nm
: N j1
f
!2
!2
!#
j1
j1
j1
Nf
Nf
Nf
j
Var Gfm 4
ÿ3
j1
j
j
Nm
Nf
Nf
39
2
!2
j1 2 j1 >
=
j
j1
N
Nm
Nm
Nm
j
5
m 2 42
2
ÿ
Var G
:
19
mf
j1
j
j
>
j1
N
N
N
;
m
m
f
Nm
4. Earlier results that are related to (11) and (19)
j
j
Let us assume that Pu;vw
Pu;vw , Nu j Nu and Guv
Guv for all j. There is then no cycle and k is
replaced by 1 in (11) and (19). Expressions (11) and (19) then, respectively, reduce to
(
"
#
2
1
1
1
Nm
Nm
Var Gmm 2
Var Gmf 2
Cov Gmm ; Gmf
Nf
Nf
Ne 16 Nm
"
#)
2
1
Nf
Nf
Var Gff 2
Var Gfm 2
20
Cov Gff ; Gfm
Nf
Nm
Nm
and
#
"
2
1
Nf
Nf
Cov Gff ; Gfm
Var Gfm 1
Var Gff 2
Nf
Nm
Nm
"
#)
2
1
Nm
Var Gmf 1 :
2
Nf
Nm
1
1
Ne 9
(
21
Expressions (20) and (21) are respectively the discrete generation versions of results derived by
Hill [10,13] and Pollak [11,14].
Y. Wang, E. Pollak / Mathematical Biosciences 166 (2000) 1±21
9
Another special case of (11) is where there are independent Poisson distributions of male and
female ospring of an individual in each generation. Then
j
Var Guv
Nv j1
j
Nu
and
j
j
; Guf 0:
Cov Gum
Hence (11) reduces to
"
#
kÿ1
1
1 X
1
1
4
2
1
1
4
2
ÿ
ÿ
Ne 16k j0 Nm j1 Nf j1 Nm j Nm j1 Nf j1 Nm j1 Nf j Nf j1
"
#
kÿ1
1 X
1
1
;
4k j0 Nm j Nf j
so that Ne is the harmonic mean of the k eective sizes in a cycle, each of which is of the form
derived by Wright [1].
Analogously, a special case of (19) is one for which there are independent Poisson distributions
of male and female ospring of a female and the number of daughters produced by a male has a
Poisson distribution. Then
#
#
"
"
kÿ1
kÿ1
1
1 X
1
1
4
3
2
2
1
1 X
4
2
;
ÿ
ÿ
Ne 9k j0 Nf j1 Nm j1 Nf j Nf j1 Nf j1 Nm j Nm j1
9k j0 Nf j Nm j
so that Ne is the harmonic mean of the k eective population sizes in a cycle, each of which is of
the form derived by Wright [15].
5. Another characterization of Ne
In this section, we will express the results given by (11) and (19) in terms of variance eective
numbers within a cycle. The reasoning will be based on that used by Hill [10]. Thus, we take
account of three sources of variability when gametes are transmitted from one generation to the
next: between numbers of gametes produced by dierent individuals, between genotypes of individuals, and among alleles carried by gametes when parents are heterozygous.
We consider a population in which parents and ospring are, respectively, in phases j and j 1
j
be the number of successful gametes contributed by the rth parent of sex u to
of a cycle. Let Guvr
ospring of sex v, and
Xur j the frequency of allelle B1 in the rth parent of sex u;
j
duvr` the difference between Xur j and the frequency of B1 in the
`th gamete contributed to an offspring of sex v by the rth parent of sex u:
So if the population has alleles B1 and B2 , Xur i will respectively be equal to 1, 1/2 and 0 in indi j
viduals of genotypes B1 B1 , B1 B2 and B2 B2 . In addition duvr` will be 0 if the parent of sex u is a
10
Y. Wang, E. Pollak / Mathematical Biosciences 166 (2000) 1±21
homozygote or a male with an X-linked locus and 1=2 with equal probabilities if it is a heterozygote.
If there is an autosomal locus the male and female ospring, respectively, originate from 2Nm j1
j1
successful gametes. These two sets of gametes are drawn from a population whose
and 2Nf
frequency of B1 is p. The frequency p0 of B1 among the ospring is the unweighted average of the
frequencies among males and females. Thus
8 2
2
39
3
j
j
j
j
Nf
Gfmr
>
>
Gmmr
Nm
<
=
X
X
X
X
1
6 j j
j 5
j 7
j
j
0
4
d
X
X
G
G
d
p
4
5
fmr fr
mmr`
fmr`
mmr mr
j1
>
4Nm >
: r1
;
`1
r1
`1
9
8 2
3
3
2
j
j
j
j
Nf
Gffr
Gmfr
>
>
N
=
<
m
X
1
6 j j X j 7 X6 j j X j 7
22
dffr` 5 :
dmfr` 5
j1
4Gffr Xfr
4Gmfr Xmr
>
>
4Nf
;
: r1
r1
`1
`1
j
When Nm j and Nf are large, the covariance between Guvr and Gu0 v0 r0 is approximately 0 if r 6 r0 , and
j
E Guvr
Nv j1
j
Nu
23
:
j
As we are considering neutral alleles the distribution of G j
uvr is the same for all r, so that Guvr and
j
j
j
j
Xur j are independent, as are Guvr
and duvr` . Also, if Nm j and Nf are large, the variance of both Xmr
j
and Xfr is
Var Xur j 12 p 1 ÿ p;
24
because the frequencies of B1 B1 ; B1 B2 ; and B2 B2 in both sexes are approximately p2 ; 2p 1 ÿ p and
1 ÿ p2 . In addition
j
j
Var duvr` E duvr` 2 14 P B1 B2 12 p 1 ÿ p:
25
After some algebra, it follows from Eqs. (22)±(25) that
2
Var p0 j p E p08
ÿ p j p
2
!
>
<
j
j1
p 1 ÿ p
Nm
N
j
j
j
m
; Gmf
Cov Gmm
2 4Var Gmm 2
j1
>
2
Nf
: 16 Nm j1
3
2
!2
j
j1
j1
N
Nm
N
f
j
j
Var Gmf 2 m j 5
2 4Var Gff
j1
j1
Nf
Nm
16 Nf
39
!
!2
j1
j1
j1 >
=
Nf
Nf
Nf
j
j
j
5
;
G
2
Var G
Cov G
2
fm
ff
fm
j1
j1
j
>
Nm
Nm
Nf
;
j
p 1 ÿ p
j
2NeV
;
26
where NeV is the variance eective population number when parents are in generation j of a cycle.
11
Y. Wang, E. Pollak / Mathematical Biosciences 166 (2000) 1±21
j
Since the changes in allele frequencies in dierent generations are independent, Nm j and Nf are
large for all j and k min Nm j ; Nm j , the variance
of the total change in allele frequency
P
j
throughout a cycle is approximately equal to kj1 p 1 ÿ p= 2NeV . Hence, we can de®ne the
eective population size to be
k
1
1X
1
j
NeV
k j1 NeV
8
2
!
kÿ1 >
< N j
j1
1 X
N
j
j
j
m
m
; Gmf
Cov Gmm
2 4Var Gmm 2
j1
16k j0 >
j1
N
: Nm
f
3
!2
Nm j1
2
j
Var Gmf 5 j1
j1
Nm
Nf
2
!
j
j1
Nf
Nf
j
j
j
Cov Gff ; Gfm
2 4Var Gff 2
j1
j1
Nm
Nf
9
3
!2
>
j1
Nf
2 =
j 5
Var G
:
27
fm
j1
j1 >
Nm
Nf
;
Note that the individual expressions being summed in (11) dier, with respect to terms not involving variances and covariances, from corresponding terms in (27). However, the sum of such
expressions in (11) is
8
9
2
"
!2
!3 >
#
>
j
j1
j1
2
kÿ1
<
=
X
Nf
Nf
Nf
Nm j
Nm j1
Nm j1
4
5
4
4
ÿ
2
ÿ
2
2
2
j
j
j
j
>
>
j1
j1
Nm
Nm
Nf
Nf
;
j0 : Nm
Nf
"
"
#
#
kÿ1
kÿ1
X
X
4
2
4
2
2
2
ÿ j1 j ÿ j1
j1 ;
j
j1
Nm
Nf
Nf
Nf
j0 Nm
j0 Nm
so that (11) and (27) give identical expressions for the eective population number.
j1
female ospring
Now let us suppose that there is a sex-linked locus. In this case the Nf
j1
originate from 2Nf
successful gametes, but the Nm j1 males are derived from only Nm j1 gametes, all of which were contributed by their mothers. Another new feature is that the frequency p0
among the ospring is a weighted average, with weights 1/3 and 2/3 for males and females. Thus
3
2
j
j
Gfmr
Nf
1 X6 j j X j 7
dfmr` 5
p0
4Gfmr Xfr
j1
3Nm
`1
r1
8
39
2
j
j
j
G
N
>
N
ffr
f
=
m
X6 j j X j 7>
2
1
3 2Nf j1 >
;
: r1
`1
r1
12
Y. Wang, E. Pollak / Mathematical Biosciences 166 (2000) 1±21
Eqs. (24) and (25) still hold if u f , but now
j
Var Xmr
p 1 ÿ p;
29
because the frequencies of B1 Y and B2 Y are p and 1 ÿ p.
j
Eq. (23) also remains valid except for the fact that E Gmmr
0. It therefore follows from
Eqs. (23)±(25) and (29) that
Var p0 jp E p0 ÿ p2 jp
8
2
!
>
j
j1
Nf
N
p 1 ÿ p <
f
j
j
j
Cov Gff ; Gfm
2 4Var Gff 2
j1
>
2
j1
N
:9 N
m
f
3
39
2
!2
!2
>
j1
j1
Nf
Nf
Nm j
Nm j1
Nm j1 5=
j
j
5
4
Var Gfm j
Var Gmf j
2 2
j1
j1
>
j1
Nm
Nf
Nf
Nm
;
9 Nm
p 1 ÿ p
:
2NeV
30
Hence
8
2
!
j
j1
kÿ1 >
<
X
Nf
Nf
1
1
j
j
j
Cov Gff ; Gfm
2 4Var Gff 2
j1
>
NeV 9k j0 :
j1
Nm
Nf
9
2
3
!2
>
1
Nm j 4 Nm j1
1 =
j 5
j1
Var Gmf j1 :
2 2
j1
j1
N
Nf
Nm >
;
f
Nm
j1
Nf
j1
Nm
!2
3
j
Var Gfm 5
31
The terms in (19) that do not involve variances and covariances add to
#
#
"
"
kÿ1
kÿ1
X
X
4
3
2
1
1
1
ÿ j1 j ÿ j1
j1
j
j1
Nf
Nm
Nm
Nm
j0 Nf
j0 Nf
so that, while individual summands in (19) and (31) dier, Ne , given by (19), and NeV , given by
(31), are the same.
6. Inbreeding and variance eective population numbers
j
are small and set duv;0;0 1 for all u and v, Eqs. (4)
If we assume that all the probabilities Pu;vw
imply that
dmm;i1;t1 dff ;i1;t1 dmf ;i1;t1 14dmm;it 2dmf ;i;t dff ;i;t
for all t. Thus, if we rede®ne the time scale so that t 0 and t 2 respectively indicate times at
which grandparents and grandchildren live, then
Y. Wang, E. Pollak / Mathematical Biosciences 166 (2000) 1±21
i
1 h 0
1 h 0
0
0
U2;2 dmf ;11 Pm;mf Pf ;mf U0;0 1 ÿ Pm;mf Pf ;mf
8
4
i
i
1 h 0
1 h 0
0
0
Pm;mf Pf ;mf U0;0 1 ÿ Pm;mf Pf ;mf U1;1 :
8
4
i
13
dmf ;00
32
This equation is a generalization of the usual recurrence equation connecting three successive
values of the panmictic index when there are independent Poisson distributions of numbers of
0
male and female ospring of an individual. It thus makes sense to replace 14 Nm 0 14 Nf in that
special case by
i
1
1 h 0
0
Pm;mf Pf ;mf ;
33
NeI 4
which is the probability that both gametes uniting to produce an individual in generation 2 came
from the same grandparent in generation 0. If we simplify our notation, by setting
and
Nv 1
luv
E G 0
uv
1
Nu
0
0
Cov Gum
; Guf rum;uf ;
it follows, when (10) is substituted in (33), that
#
"
1
1 rmm;mf lmm lmf rfm;ff lfm lff
;
0
0
NeI 4
Nm lmm lmf
Nf lfm lff
34
as found by Crow and Denniston [8].
In the particular use in which rum;uf 0, (34) reduces to
1
1
1
0 ;
0
NeI 4Nm
4Nf
which would hold, for example, if each individual had independent Poisson distributions of
numbers of male and female ospring. But then (27) implies, for k 1, that
"
#
1
1 1
1
:
NeV 4 Nm 1 Nf 1
In general, (34) seems to be dierent from the special case of (11) that results when k is set equal
to 1. But the two approaches that are used to calculate NeI can be reconciled as follows. For
simplicity, we set
ÿ
r2uv
Var G 0
uv
and
0
0
r2u Var Gum
Guf :
14
Y. Wang, E. Pollak / Mathematical Biosciences 166 (2000) 1±21
0
0
Next, we note that, given Gum
Guf Gu g; G 0
um has a binomial distribution with mean
1
1
1
gNm 1 = Nm 1 Nf and variance gNm 1 Nf = Nm 1 Nf 2 . Hence
1
r2um
r2uf
EVar Gum jGu VarE Gum jGu h
1
Nm 1 Nf
1
1
Nm Nf
"
#2
lum luf
lum
r2u ;
lum luf
lum luf
"
#2
lum luf
luf
r2u ;
lum luf
lum luf
Nm 1 Nf
i2
0
Nu
"
Nm 1
1
Nm
1
Nf
#2
r2u
and
rum;uf ECov Gum ; Guf j Gu CovE Gum j Gu ; E Guf j Gu
1
1
1
Nm 1 Nf
Nm 1 Nf
Nm 1 Nf
lum luf
lum luf
2
ÿh
r2 :
i2
h
i2 ru ÿ
2 u
0
l
l
1
1
1
1
l
l
Nu
um
uf
um
uf
Nm Nf
Nm Nf
Thus
# "
#2 "
#
2 "
l
l
l
l
l
l
um
uf
um
uf
um
uf
r2uf ÿ
r2um ÿ
lum luf
luf
lum luf
#
"
2
lum luf
lum luf
:
r2um;uf
lum luf
lum luf
lum luf
r2u
lum
35
It, therefore, follows from Eqs. (34) and (35) that
(
"
"
#
#)
1
1
1
1
1
1
1
1
0
r2 1 ÿ
r2 1 ÿ
2 f
NeI 4 Nm 0 lmm lmf 2 m
lmm lmf
lfm lff
lfm lff
Nf
(
"
#
r2mf
lmf
1
1 r2mm
rmm;mf
lmm
2
2
ÿ
4
ÿ
ÿ
16 Nm 0 l2mm
lmm lmf l2mf lmm lmm lmf lmf lmm lmf lmm lmf
1
0
Nf
"
r2fm
r2ff
lff
lfm
rfm;ff
2
2
ÿ
ÿ
ÿ
4
2
2
lfm
lfm lff lff lfm lfm lff lff lfm lff lfm lff
#)
:
Hence
(
"
#
r2mf
r2mm
rmm;mf
4
2
2
2
0 ÿ 1
0 l2
lmm lmf lmf
Nm
Nm
Nm
mm
)
"
#
r2ff
1 r2fm
rfm;ff
4
2
0
2
0 ÿ 1 ;
2
2
lfm lff lff
lfm
Nf
Nf
Nf
1
1
NeI 16
1
36
Y. Wang, E. Pollak / Mathematical Biosciences 166 (2000) 1±21
15
because
1
0
Nu
"
luf
lum
2
lum lum luf luf lum luf lum luf
1
0
Nu
"
luf lum
lum luf
#
1
0
Nu
"
1
1
lum luf
#
#
1
1
Nm
1
1
Nf
:
Expression (36) is consistent with (11).
If a locus is sex-linked and we set k 1 and j 0
"
!
#
1
1
1
1
1
1
1
ÿ
4
ÿ 1
:
ÿ 1 2
0
0
NeI NeV 9
Nf
Nf
Nm
Nm
Suppose, in particular, that there are independent Poisson distributions of male and female ospring of a female, and the number of daughters produced by a male has a Poisson distribution.
Hence, by the discussion in Section 4,
#
"
1
1 4
2
NeI 9 Nf 0 Nm 0
and
#
"
1
1 4
2
:
NeV 9 Nf 1 Nm 1
7. Mutation eective population numbers
i
Let Pu;vw
have the same meaning as in Sections 2 and 3. But we now assume that the in®nite
alleles model holds, so that genes mutate at a rate u, in such a way that each mutant is to an
entirely novel allelic type. Suppose that a population is in generation i 1 of a cycle at time t 1.
Then, at this time, Gi1;t1 and gvw;i1;t1 respectively denote the probabilities that two copies of a
gene in one individual, and in two random separate individuals of sexes v and w, are identical in
state. Then, if the locus under consideration is autosomal, Eqs. (1) and (2) are replaced by
Gi1;t1 1 ÿ u2 gmf ;it ;
37
and
gvw;i1;t1
i
1 ÿ u2 nh i
i
i
gmm;i;t
Pm;vw Pf ;vw 1 Gi;t 21 ÿ Pm;vw
8
o
i
4gmf ;i;t 21 ÿ Pf ;vw gff ;i;t :
38
If we now set Hi1;t1 1 ÿ Gi1;t1 and hvw;i1;t1 1 ÿ gvw;i1;t1 , (37) and (38) imply that
Hi1;t1 1 ÿ 1 ÿ u2 1 ÿ u2 hmf ;i;t ;
39
16
Y. Wang, E. Pollak / Mathematical Biosciences 166 (2000) 1±21
and
2
i
i
h
1 ÿ u nh i
i
i
hmm;i;t 4hmf ;i;t
Pm;vw Pf ;vw Hi;t 2 1 ÿ Pm;vw
8
h
o
i
i
2 1 ÿ Pf ;vw hff ;i;t :
hvw;i1;t1 1 ÿ 1 ÿ u2
40
Then if hi;t Hi;t ; hmm;i;t ; hmf ;i;t ; hff ;i;t 0 , Eqs. (39) and (40) can be written in matrix notation as
hi1;t1 1 ÿ 1 ÿ u2 1 1 ÿ u2 Ai hi;t ;
41
0
where Ai is the same matrix as in Eq. (5) and 1 1 1 1 1 . We assume that u is very small. Hence,
it follows from (6) and (41) and the fact that the sum of the elements of each row of any power of
A0 is 1, that
hik;tk I 1 ÿ u2 Aikÿ1 1 ÿ u2 kÿ1 Aikÿ1 Ai1 1 ÿ 1 ÿ u2 1
2k
1 ÿ u Aikÿ1 Ai hi;t 2ku1 1 ÿ 2ku M Di hi;t :
i
Thus, if ck maxNm i ; Nf ; uÿ1 ,
hick;tck
4Ne u 1
2cku1 1 ÿ 2ckuq 1p hi;t 2cku1 1 ÿ ck
2Ne
c
0
1p0 hi;t ;
42
where 1 and p0 are the eigenvectors associated with the dominant eigenvalue 1 of
Ar0 ; q 1 ÿ 1= 2Ne k , and Ne is given by (8) and (11). The stationary value of hi;t is hi , where, by
(42),
4Ne u 1 0
0
hi ck 2u ÿ
p hi p hi 1:
2Ne
Hence, all the elements of hi are equal to Hi and, because p0 1 1,
4Ne u
Hi H
; i 1; 2; . . . ; k:
4Ne u 1
43
The deviations from equilibrium are ei;t hi;t ÿ hi and, by (42),
4Ne u 1
1p0 ei;t ;
eick;tck 1 ÿ ck
2Ne
which shows that the equilibrium given by (43) is stable.
If the locus under consideration is sex-linked, it can be shown that
hi1;t1 1 ÿ 1 ÿ u2 1 1 ÿ u2 Ai hi;t ;
44
where Ai is now the same matrix as in (18). It then follows from the same sequence of steps as
those leading from (41) to (43) that
4Ne u
1;
45
h i ! Hi 1
4Ne u 1
where Ne is given by (19). Therefore, whether the locus is autosomal or sex-linked,
Gi;t ! Gi
1
;
4Ne u 1
t ! 1:
Y. Wang, E. Pollak / Mathematical Biosciences 166 (2000) 1±21
17
Thus, Ne , the mutation eective population number, as de®ned by Ewens [16], is the same as the
inbreeding eective population number.
8. Eigenvalue eective population numbers
i
In what follows, let Guvr
have the same meaning as in Section 5. Suppose that this random
variable has a binomial distribution with Nv j1 trials and a probability 1=Nu j of success. In ad j
j
dition, we assume that Gumr
and Gufr are independently distributed if there is an autosomal locus
or when there is a sex-linked locus and u f . Then
Nv j1
1
j
Var Guvr j 1 ÿ j :
Nu
Nu
If a locus is autosomal (27) reduces to
"
#
kÿ1
1
1 X
1
1
;
46
NeV 4k j0 Nm j1 Nf j1
j1
if Nm j1 and Nf
are large for all j 1. If a locus is sex-linked G j
mmr 0 and we assume that
j1
Nf
1
j
Var Gmfr j 1 ÿ j ;
Nm
Nm
Eq. (31) implies that
"
#
kÿ1
1
1 X
4
2
;
47
NeV 9k j0 Nf j1 Nf j1
if the numbers of males and females are always large.
Under the assumptions we have made about population sizes the covariance between the
outputs of successful gametes from two separate individuals of the same sex is approximately 0. It
is then also consistent with the special cases in the foregoing paragraph to consider the distribution of copies of B1 passed on to adults of the next generation by parents of sex u to be binomial. Let Yu t 1 be the number of copies B1 among fertilized eggs that lead to adults of sex v
in generation t 1, and Xuv t be the number of copies of this allele passed on to these fertilized
eggs by parents of sex u. Then, if there is an autosomal locus
Yv t 1 Xmv t Xfv t;
48
where Xmv t and Xfv t have independent binomial distributions. If Nu and Nu0 respectively denote
the number of adults of sex u in generation t and t 1
0 Yu t
:
49
Xuv t Bin Nv ;
2Nu
It follows from (48) and (49) that
Yv t 1
Ym t Yf t
Yu t; u m; f
E
;
0
2Nv
2Nm
2Nf
50
18
Y. Wang, E. Pollak / Mathematical Biosciences 166 (2000) 1±21
#
2
Yv t 1
E
Yu t; u m; f
2Nv0
"
(
2
2
2 #)
1
Yf t
Ym t
Yf t
0
0
0 Ym t
Nv
Nv Nv ÿ 1
2Nv0
2Nm
2Nf
2Nm
2Nf
"
1 Ym t Yf t
;
2 2Nm 2Nf
51
#
Ym t 1 Yf t 1
E
Yu t; u m; f
2Nm0
2Nf0
(
2
2 )
1
Ym t
Ym t
Yf t
Yf t
2
:
4
2Nm
2Nm
2Nf
2Nf
"
52
If, therefore, we average the right-hand sides of Eqs. (50)±(52) over the joint distributions of Ym t
and Yf t and set mv t EYv t= 2Nv t and muv t EYu tYv t= 4Nu Nv ; we obtain the system of equations
mt1 T tmt ;
53
where
mt1 mm t 1; mf t 1; mmm t 1; mmf t 1; mff t 10 ;
mt mm t; mf t; mmm t; mmf t; mff t0 ;
and
2
1
6 21
6 2
6
6 1
6 4Nm0
T t 6
6
6 0
4
1
4Nf0
1
2
1
2
1
4Nm0
0
1
4Nf0
1
4Nm0
0
0
0
0
Nm0 ÿ 1
1
4
1
0
N
f
4Nf0
ÿ 1
1
2
1
2
1
2
0
3
7
7
7
7
T11 t
1
0
N
ÿ
1
7
0
m
4Nm
7
T21 t
7
1
7
4
5
1
Nf0 ÿ 1
4N 0
0
0
T22 t
:
f
The characteristic equation of T t can be shown to be
(
"
!#
1 1
1
1
2
2
ÿ
jT t ÿ kIj k k ÿ 1 k ÿ k 1 ÿ
4 Nm0 Nf0
8
1
1
Nm0 Nf0
!)
0:
This equation has a single root equal to 1 and two others equal to 0. Among the remaining roots
the one of largest absolute value is
8
2
!
!2 31=2 9
0
0
0
0
=
<
Nm Nf
Nm Nf
1
1
1
1
5
:
41
k t 1
1ÿ
1ÿ
;
4Nm0 Nf0
4Nm0 Nf0
2:
8 Nm0 Nf0
If the numbers of males and females do not change between generations this expression reduces to
1 ÿ 1= 2NeE , where NeE is the eigenvalue eective population number obtained by Ewens [17,18].
Y. Wang, E. Pollak / Mathematical Biosciences 166 (2000) 1±21
19
If we now follow the population over a whole cycle, (53) leads to
mtk T t k ÿ 1T t k ÿ 2 T tmt :
Thus, if there are s cycles
(
"
#s )
k
Y
k j
m0 ;
msk A B
s ! 1;
54
j1
where A and B are matrices that depend on the phase of the cycle at time 0. Note that, by (46),
"
#
k
k
Y
1X
1
1
k
j 1 ÿ
k j 1 ÿ
j
8 j1 Nm
2NeV
Nf
j1
so that at the level of approximations used in this paper, the recurrence equations on the ®rst and
second moments lead to the variance eective population number.
We claim that, at least for the model discussed in this section, NeV is the same as NeE , the eigenvalue eective size of Ewens [17,18], which is the largest non-unit eigenvalue of the matrix of
transition probabilities of numbers of copies of B, in males and females when the same phase is
considered in two successive cycles. If second moments are traced through s cycles, and s is large,
they each satisfy
"
#
ks
1 Ym 0 Yf 0
1
0 1 ÿ
muv sk j AC;
55
muv sk
2 Nm 0
2NeE
Nf
where muv sk j AC is the expected value of the uvth moment, given the asymptotic conditional
distribution of allele frequency pairs in males and females when B1 has been neither ®xed nor lost.
Since (54) and (55) are of the same form, NeE is at least approximately equal to NeV .
If the locus under consideration is sex-linked (48) still holds if v f , but
0 Ym t
Xmf t Bin Nf ;
Nm
and
0 Yf t
Ym t 1 Xfm t Bin Nm ;
:
2Nf
Again, we assume that any pair of the random variables Xuv t are independent. It can then be
shown that
mt1 T tmt ;
where mm t Ym t=Nm ; mm t 1 Ym t 1=Nm0 and
3
2
0
1
0
0
0
7
6 1
1
0
0
0
7
6 2
2
7
6
1
1
6 0
0
0
1 ÿ Nm0 7
Nm0
T t 6
7:
7
6
1
1
7
6 0
0
0
2
2
5
4
1
1
1
1
1
1
1
1 ÿ N 0 2 4 1 ÿ N 0
4N 0
4N 0
4
f
f
f
f
20
Y. Wang, E. Pollak / Mathematical Biosciences 166 (2000) 1±21
The characteristic equation corresponding to T t is
!
!
(
1
1
3
1
3
1
1
1
3
2
2
ÿ
ÿ
k
k ÿ kÿ
ÿk k
ÿ
2
2
4 4Nf0
8 4Nm0 8Nf0 4Nm0 Nf0
!)
1
1
1
1
1
1
2
1ÿ 0 ÿ 0 0 0
k ÿ kÿ
ÿ
f k 0:
8
Nm Nf Nm Nf
2
2
The ®rst factor has roots 1 and ÿ12, whereas
!
1
1
;
f 1 ÿ
8Nm0 4Nf0
1
1
1
1
1
f ÿ
f 0 < 0; f
;
0;
0
2
8
2
8Nf 4Nm
and
9
1 1
;
f 0 1 ÿ O
8
Nm0 Nf0
!
:
Hence, by using Newton's method and the intermediate value theorem, we ®nd that the largest
root of f 0 0 is approximately
!
1
2
1ÿ
9Nm0 9Nf0
and the other two roots are between 0 and 12 in magnitude, with one being negative and the other
positive. Thus, if we trace the population through s cycles we ®nd, as for an autosomal locus, that
!
k
1
1 X
2
4
1
j
:
j
NeE 9k y1 Nm
NeV
Nf
We have not yet been able to generalize the foregoing reasoning to arbitrary ospring distributions, but conjecture that then also NeE NeV . Another reason to suspect that this is the case is
that Eq. (31), with k 1, was ®rst derived by Pollak [19] by a computation of the approximate
rate at which the probabilities in the asymptotic conditional distribution decrease. This method
also led to Eq. (27) when k 1. For either an autosomal or a sex-linked locus the branching
process approximation used by Pollak [19] involved looking forward in time from parents to
ospring, so that a variance eective population number was obtained.
9. Discussion
In this paper expressions have been derived for the eective population number with cyclic
variation in numbers of males and females, for either an autosomal or a sex-linked locus. In either
case, it is approximately equal to the harmonic mean of eective population numbers in a cycle,
Y. Wang, E. Pollak / Mathematical Biosciences 166 (2000) 1±21
21
and is the same regardless of whether the harmonic mean is calculated from inbreeding eective
numbers or variance eective numbers. We obtain the same result by calculating the mutation
eective size, and, as far as we can tell, the eigenvalue eective size. Of course, as shown in Section
6, the inbreeding eective number, NeI , and the variance eective number, NeV , are not the same if
only two successive generations are considered and the population changes in the size. As far as
we know, such expressions have not previously been given when there is a sex-linked locus.
In our calculations, we have not restricted ourselves to assuming Poisson distributions of ospring. Thus complete account has been taken of both ¯uctuations in population size and variability in family sizes. Frankham [20] reviewed 192 published estimates from 102 species of the
ratio of Ne to the actual population size, N, and concluded that the ®rst and second most important variables explaining variation among these estimates were, respectively, ¯uctuation in
population size and variance in family size. These variables both act to reduce Ne =N to the low
values found by him in wildlife populations.
References
[1] S. Wright, Evolution in Mendelian populations, Genetics 16 (1931) 97.
[2] S. Wright, Size of population and breeding structure in relation to evalution, Science 87 (1938) 430.
[3] S. Wright, Statistical genetics in relation to evolution, in: Actualities scienti®ques et industrielles, vol. 802, Exposes
de Biometrie et de la statistique biologique, XIII, Hermann, Paris, 1939, p. 5.
[4] J.F. Crow, Breeding structure of populations. II. Eective population number, in: O. Kempthorne, T.A. Bancroft,
J.W. Gowen, J.L. Lush (Eds.), Statistics and Mathematics in Biology, lowa State College Press, Ames, 1954, p. 543.
[5] J.F. Crow, M. Kimura, An Introduction to Population Genetics Theory, Harper and Row, New York, 1970.
[6] A. Caballero, On the eective size of populations with separate sexes with particular reference to sex-linked genes,
Genetics 139 (1995) 1007.
[7] T. Nagylaki, The inbreeding eective population number in dioecious populations, Genetics 139 (1995) 473.
[8] J.F. Crow, C. Denniston, Inbreeding and variance eective numbers, Evolution 42 (1998) 482.
[9] E. Pollak, Fixation probabilities when the population size undergoes cyclic ¯uctuations, Theoret. Popul. Biol.
(2000) 51.
[10] W.G. Hill, A note on eective population size with overlapping generations, Genetics 92 (1979) 317.
[11] E. Pollak, The eective size of an age-structured population with a sex-linked locus, Math. Biosci. 101 (1990) 121.
[12] J.N. Franklin, Matrix Theory, Prentice-Hall, Englewood Clis, NJ, 1968.
[13] W.G. Hill, Eective size of populations with overlapping generations, Theoret. Popul. Biol. 3 (1972) 278.
[14] E. Pollak, Eective population numbers and mean times to extinction in dioecious populations with overlapping
generations, Math. Biosci. 52 (1980) 1.
[15] S. Wright, Inbreeding and homozygosis, Proc. Nat. Acad. Sci. 19 (1933) 411.
[16] W.J. Ewens, The eective population sizes in the presence of catatstrophes, in: M. Feldman (Ed.), Mathematical
Evolutionary Theory, Princeton University, Princeton, 1989, p. 9.
[17] W.J. Ewens, Mathematical Population Genetics, Springer, Berlin, 1979.
[18] W.J. Ewens, On the concept of the eect population size, Theoret. Popul. Biol. 21 (1982) 373.
[19] E. Pollak, Eective population numbers and mean times to extinction in dioecious populations with overlapping
generations, Math. Biosci. 52 (1980) 1.
[20] R. Frankham, Eective population size/adult population size ratios in wildlife: a review, Genet. Res. Comb. 66
(1995) 95.
www.elsevier.com/locate/mbs
The eective number of a population that varies cyclically in
size. I. Discrete generations
Yufeng Wang a,1, Edward Pollak b,*
a
b
Department of Zoology and Genetics, Iowa State University, Ames, IA 50011-1210, USA
Statistical Lab, Department of Statistics, 111 Snedecor Hall, Iowa State University, Ames, IA 50011-1210, USA
Received 28 September 1999; received in revised form 9 May 2000; accepted 16 May 2000
Abstract
We consider a dioecious population having numbers of males and females that vary over time in cycles of
length k. It is shown that if k is small in comparison with the numbers of males and females in any generation of the cycle, the eective population number (or size), Ne , is approximately equal to the harmonic
mean of the eective population sizes during any given cycle. This result holds whether the locus under
consideration is autosomal or sex-linked and whether inbreeding eective population numbers or variance
eective population numbers are involved in the calculation of Ne . If, however, only two successive generations in the cycle are considered and the population changes in size between these generations, the inbreeding eective population number, NeI , diers from the variance eective population number, NeV . The
mutation eective population number turns out to be the same as the number derived using calculations
involving probabilities of identity by descent. It is also shown that, at least in one special case, the eigenvalue eective population number is the same as NeV . Ó 2000 Published by Elsevier Science Inc. All
rights reserved.
Keywords: Eective population size; Cyclic variation
1. Introduction
Random genetic drift is an important in¯uence on the genetic variability of a ®nite population
and a numerical measure of its in¯uence is the eective population size. Wright [1±3] presented the
®rst example for a dioecious population and a general expression for monoecious populations that
*
1
Corresponding author. Tel.: +1-515 294 7765; fax: +1-515 294 4040.
E-mail addresses: ywang@iastate.edu (Y. Wang), pllk@iastate.edu (E. Pollak).
Tel.: +1-515 294 9053.
0025-5564/00/$ - see front matter Ó 2000 Published by Elsevier Science Inc. All rights reserved.
PII: S 0 0 2 5 - 5 5 6 4 ( 0 0 ) 0 0 0 2 7 - 4
2
Y. Wang, E. Pollak / Mathematical Biosciences 166 (2000) 1±21
do not change in size. Later, Crow [4] showed, for a monoecious population, that the eective
population size measuring the increase in the probability that two copies of a gene are identical by
descent diers from the eective population size involved in the variance of allele frequency
changes, except if the size of the population remains constant. Other early work on this subject is
discussed in the text by Crow and Kimura [5].
Recently, Caballero [6] and Nagylaki [7] derived general expressions for the eective population
sizes of random mating dioecious populations, whether the locus under consideration is autosomal or sex-linked. For an autosomal locus, the results are consistent with those obtained by
Crow and Denniston [8]. Caballero [6] and Nagylaki [7] assume, however, that the numbers of
males and females do not change between generations. However, wild populations can ¯uctuate
considerably in size from year to year. A simple way to describe such populations, which may not
be too unrealistic, is to assume that these changes occur in repeated cycles. Wright [2,3] showed
that, for monoecious populations in which individuals have approximate Poisson distributions of
numbers of successful gametes in ospring, the eective population size is approximately equal to
the harmonic mean of the populations sizes in a cycle, provided the length of a cycle is short in
comparison to those sizes. Pollak [9] has shown that this result also holds generally for ospring
distributions with ®nite variances. However, to the best our knowledge, no such results have yet
been presented for dioecious populations. In the next two sections we shall derive eective population sizes for autosomal and sex-linked loci, by generalizing the reasoning used by Caballero [6]
and Nagylaki [7], whereby recurrence equations are obtained for probabilities of identity by
descent. Next, there will be alternative derivations for autosomal and sex-linked loci, which rely
on obtaining the variance of the change in the frequency of an allele in a generation. Here the
reasoning will be a discrete generation version of the reasoning used by Hill [10] and Pollak [11],
respectively, for an autosomal and a sex-linked locus. This alternative approach leads to the same
expressions as before for eective population sizes if the population is followed through an entire
cycle, although the individual terms in the resulting harmonic mean are not quite the same as the
corresponding terms that arise from the identity-by-descent approach. Mutation eective population numbers are also derived, as are eigenvalue eective population numbers in one special case. 2
2. Autosomal loci
We consider an autosomal locus in a population whose size undergoes repeated cycles of length
k. Let us suppose that at times t and t 1 the population is respectively in generations i and i 1
of a cycle. Then, at time t 1,
Fi1;t1 the inbreeding coefficient of a random individual
and
fvw;i1;t1 the coefficient of coancestry of a random pair of separate individuals of sexes
v and w;
2
Journal paper No. J-18628 of the Iowa Agriculture and Home Economics Experiment Station, Ames, IA, Project
No. 3201, and supported by Hatch Act and State of Iowa Funds.
Y. Wang, E. Pollak / Mathematical Biosciences 166 (2000) 1±21
3
where v and w can be replaced by either of the symbols m and f, denoting, respectively, male and
female. Let
i
P two copies of a gene in random separate offspring of sexes
Pu;vw
v and w were from the same parent of sex u generation i of a cycle:
Then
Fi1;t1 fmf ;i;t
1
and
fvw;i1;t1
1
i 1
i
i 1
i
P
1 Fi;t 1 ÿ Pm;vw fmm;i;t Pf ;vw 1 Fi;t 1 ÿ Pf ;vw fff ;i;t 2fmf ;i;t ; 2
4 m;vw 2
2
because half of the copies of a gene in an individual come from a male and half come from a
female. These equations can be simpli®ed if we set
Ui;t 1 ÿ Fi;t ;
dvw;it 1 ÿ fvw;i;t :
Then (1) and (2) reduce, respectively, to
Ui1;t1 dmf ;i;t
3
and
h
i
o
i
1 nh i
i
i
i
Pm;vw Pf ;vw Ui;t 21 ÿ Pm;vw
dmm;i;t 2 1 ÿ Pf ;vw dff ;i;t 4dmf ;i;t :
8
Eqs. (3) and (4) can be rewritten in matrix notation as
dvw;i1;t1
4
di1;t1 Ai di;t ;
5
where
i 8
6 P i P i
i
4
2 1 ÿ Pm;mm
6 m;mm
f ;mm
16
h
i
Ai 6
i
i
i
4
86
6 Pm;mf Pf ;mf 2 1 ÿ Pm;mf
4
h
i
i
i
i
Pm;ff Pf ;ff 2 1 ÿ Pm;ff
4
2
3
2
0
0 0 1 0
i
i
6
6 0 1 1 1 7 1 6 Pm;mm Pf ;mm
6
4
2
47
6
7 6 i
i
4 0 14 12 14 5 8 6
4 Pm;mf Pf ;mf
i
i
0 14 12 14
P
P
2
0
0h
m;ff
A0 Di :
Thus for all r
dir;tr Airÿ1 Airÿ2 Ai di;t
f ;ff
0h
i
3
i
2 1 ÿ Pf ;mm 7
7
h
i7
7
i
2 1 ÿ Pf ;mf 7
7
h
i 5
i
2 1 ÿ Pf ;ff
0
0
0
i
ÿ2Pm;mm
0
i
ÿ2Pf ;mm 7
7
7
i 7
ÿ2Pf ;mf 5
i
0
i
0
ÿ2Pm;mf
ÿ2Pm;ff
3
i
ÿ2Pf ;ff
4
Y. Wang, E. Pollak / Mathematical Biosciences 166 (2000) 1±21
i
We shall show later that if, at each stage i in the cycle, the numbers Nm i and Nf of males and
i
in the matrix Di are small. It can then be proved by
females are large, all the elements Pu;vw
induction that
Airÿ1 Airÿ2 Ai A0 Dirÿ1 A0 Dirÿ2 A0 Di Ar0
rÿ1
X
Dij Aj0 ; r P 1;
Arÿ1ÿj
0
j0
i
i
if r maxNm i ; Nf . Thus, if k maxNm i ; Nf ,
"
#
kÿ1
X
dik;tk Ak0
A0kÿ1ÿj Dij Aj0 di;t M Di di;t ;
6
j0
where M Ak0 and the elements of
Di
kÿ1
X
Dij Aj0
Akÿ1ÿj
0
j0
are all small.
Now
2
0
60
6
Aro 6
40
1
4
1
4
1
4
1
4
1
2
1
2
1
2
1
2
13
4
17
47
17
5
4
1
4
0
2 3
1
6 1 7 1 1 1
6 7
6 7 0
;
415
4 2 4
r P 2;
1
and it can also be shown that the eigenvalues of Ar0 are k1 1; k2 k3 k4 0. The left and right
eigenvectors p0 and v that correspond to k1 1; and satisfy the normalization conditions
p0 v p0 1 1 are
1 1 1
0
p 0
;
4 2 4
and
0
v 1 1 1 1 1 :
Since all the elements of Di are small, the dominant eigenvalue of M Di is
q 1 d;
where d is small. By a standard result from perturbation theory, as discussed, for example, by
Franklin [12, Section 6.12],
Y. Wang, E. Pollak / Mathematical Biosciences 166 (2000) 1±21
5
d p0 Di v
2 3
1
!6 7
X
kÿ1
617
1 1 1
7
Dj 6
0
617
4 2 4
5
4
j0
1
h
i 30
j
j
j
j
j
kÿ1
j
2 3
P
P
2 P
P
P
P
f ;mm
m;mf
f ;mf
m;ff
f ;ff
m;mm
j0
7 1
76 7
i
Pkÿ1 h j
7 617
j
j
2Pm;mf Pm;ff
ÿ2 j0 Pm;mm
76 7
76 7
7 415
7
0
h
i
5
Pkÿ1 j
j
j
1
ÿ2 j0 Pf ;mm 2Pf ;mf 2Pf ;ff
2P
6
6
1 6
6
6
32 6
6
4
ÿ
kÿ1 nh
i h
io
1 X
j
j
j
j
j
j
2Pm;mf 2Pm;ff Pf ;mm 2Pf ;mf Pf ;ff :
Pm;mm
32 j0
7
Thus, after many cycles, the probabilities of non-identity of pairs of copies of a gene shrink at a
steady rate that is approximately equal to 1 d per cycle, where d, the right-hand side of (7) is not
dependent on i. This rate may then be set equal to 1 ÿ 1= 2Ne k , so that
kÿ1 nh
io
i h
1
1 X
j
j
j
j
j
j
Pm;mm
2Pm;mf Pm;ff Pf ;mm 2Pf ;mf Pf ;ff :
Ne 16k j0
8
j
to be the number of successful gametes contributed by a parent of sex u in
We now de®ne Guv
generation j of a cycle to an ospring of sex v. Since we are considering neutral alleles and there
are Nu j parents of sex u and Nv j1 ospring of sex v,
j Nv j1
j :
E Guv
Nu
Since there is a random mating, any pair of gametes in ospring of sexes v and w coming from
parents of sex u is just as probable as any other pair, even if both gametes come from a single
individual. Thus,
j
Pu;vv
Nu j
ÿ 1
j ÿ j
Guv ÿ 1
E Guv
j1
ÿ 1
n
j o
ÿ j ÿ j 2
ÿ E Guv
E Guv
Var Guv
Nu j
j1
j1
Nv Nv
Nu j
j1
Nv
Nv
1
j1
Nv
ÿ1
ÿ j Nv j1
Var Guv j ÿ 1 ;
j1
Nv
Nu
9
6
Y. Wang, E. Pollak / Mathematical Biosciences 166 (2000) 1±21
and
Nu j
j
Pu;mf
j1
Nm
h
i
j j
G
E
G
um uf
j1
Nf
Nu j
1
j
j
;
G
Cov
G
j :
uf
um
j1 j1
Nm Nf
Nu
10
j
We assume that Nm j and Nf are all large so that the elements of Di are indeed small for all i. If,
therefore, (9) and (10) are substituted in (8), we obtain
8
!
"
>
kÿ1
ÿ j
1
1 X< Nm j
Nm j1
j
j
Cov
G
;
G
2 Var Gmm 2
mf
mm
j1
Ne
16k j0 >
Nf
: Nm j1
3
!2
j1 2
j1
j1
Nm
Nm
Nm
j
5
ÿ2
Var Gmf 4
j1
j
j
Nf
Nm
Nm
"
!
j
j1
Nf
N
f
j
j
j
Cov
G
;
G
2 Var Gff 2
ff
fm
j1
j1
Nm
Nf
!2
!2
!3 9
j1
j1
j1
=
Nf
Nf
Nf
j
5 :
11
4
Var
G
ÿ
2
fm
j1
j
j
;
Nm
N
N
f
f
3. Sex-linked loci
i
i
The notation remains the same as in the previous section, but now Pm;mm
Pm;mf 0 because
males get their copies of a gene only from their mothers. Females get half their copies of a gene
from parents of each sex. Therefore,
Fi1;t1 fmf ;i;t ;
12
fmm;i1;t1
i
Pf ;mm 12 1
fmf ;i1;t1
i
1
fPf ;mf 12 1
2
fff ;i1;t1
i
1
fPm;ff
4
Fi;t 1 ÿ
i
Pf ;mm fff ;it ;
Fi;t 1 ÿ
1 ÿ
i
Pm;ff fmm;i;t
i
Pf ;mf fff ;i;t
i
Pf ;ff 12 1
13
fmf ;i;t g;
Fi;t 1 ÿ
14
i
Pf ;ff fff ;i;t
2fmf ;i;t g:
15
As before, the recurrence equations can be simpli®ed if we set
Ui;t 1 ÿ Fi;t ;
dvw;i;t 1 ÿ fvw;it :
Eqs. (12)±(15) are then replaced by
di1;t1 Ai di;t ;
16
Y. Wang, E. Pollak / Mathematical Biosciences 166 (2000) 1±21
7
where
Ui;t1
2
3
7
6d
6 mm;i1;t1 7
di1;t1 6
7;
4 dmf ;i1;t1 5
dff ;i1;t1
0
2
6 4P i
f ;mm
16
Ai 6
i
6
8 4 2Pf ;mf
0
8
0
0
0
i
81 ÿ Pf ;mm 7
7
7
i
41 ÿ Pf ;mf 7
5
0
0
60
6
6
40
2
0
4
i
i
Pm;ff
i
Pf ;ff
3
4 21 ÿ Pf ;ff
21 ÿ
3
2
3
0
0
0 0
0 1 0
i
6 4P i
0
0 ÿ8Pf ;mm 7
f ;mm
7
0 0 17
7 16
7
6
6 i
i 7
1
17
0 ÿ4Pf ;mf 5
0 2 2 5 8 4 2Pf ;mf 0
1
4
1
2
1
4
i
Pf ;ff
i
ÿ2Pm;ff
i
0 ÿ2Pf ;ff
A0 Di :
It follows from (16) that
"
#
kÿ1
X
j
kÿ1ÿj
k
dik;tk A0
Dij A0 di;t M Di di;t ;
A0
17
j0
because, as in the previous section, it will be shown that all of the elements of Di are small. The
characteristic equation corresponding to A0 is
ÿk
0
1
1
1
k k ÿ 1k2 1k ÿ 1 0
ÿk
jA0 ÿ kIj ÿk 0
2
2
4
8
1
1
1
ÿ
k
4
2
4
and has the roots k1 1; k2 ÿ12; k3 14; k4 0. The eigenvectors of Ar0 associated with any power
r of Ar and the dominant eigenvalue 1 are, then,
1 4 4
0
p 0
9 9 9
and
v 1 1 1 1 10 :
It follows from perturbation theory that if all the elements of Di are small, the dominant eigenvalue of M Di is q 1 d, where because of periodicity
!
iÿ1
kÿ1
kÿ1
kÿ1
X
X
X
X
d p0 Di v p0 Dij v p0
Dj
Dj v p0 Dj v:
j0
ji
j0
j0
8
Y. Wang, E. Pollak / Mathematical Biosciences 166 (2000) 1±21
This is the same for all i. Thus
2
kÿ1 6
6
1 4 4 X
6
d p Di v 0
9 9 9 j0 6
4
0
0
1 j
ÿ 2 Pf ;mm
3
j
ÿ 14 Pf ;mf
j
j
ÿ 18 Pf ;ff 2Pm;ff
7
7
7
7
5
kÿ1
1 X
k
j
j
j
j
ÿ
:
P
2Pf ;mf Pf ;ff 2Pm;ff ÿ
18 j0 f ;mm
2Ne
18
As (9) and (10) still hold for a sex-linked locus, all the elements of Di are small. Their substitution
in (18) leads to
8
!
"
>
j
j1
kÿ1
Nf
1
1 X< Nf
j
j
j
Cov Gff ; Gfm
2 Var Gff 2
j1
Ne
9k j0 >
Nm
: N j1
f
!2
!2
!#
j1
j1
j1
Nf
Nf
Nf
j
Var Gfm 4
ÿ3
j1
j
j
Nm
Nf
Nf
39
2
!2
j1 2 j1 >
=
j
j1
N
Nm
Nm
Nm
j
5
m 2 42
2
ÿ
Var G
:
19
mf
j1
j
j
>
j1
N
N
N
;
m
m
f
Nm
4. Earlier results that are related to (11) and (19)
j
j
Let us assume that Pu;vw
Pu;vw , Nu j Nu and Guv
Guv for all j. There is then no cycle and k is
replaced by 1 in (11) and (19). Expressions (11) and (19) then, respectively, reduce to
(
"
#
2
1
1
1
Nm
Nm
Var Gmm 2
Var Gmf 2
Cov Gmm ; Gmf
Nf
Nf
Ne 16 Nm
"
#)
2
1
Nf
Nf
Var Gff 2
Var Gfm 2
20
Cov Gff ; Gfm
Nf
Nm
Nm
and
#
"
2
1
Nf
Nf
Cov Gff ; Gfm
Var Gfm 1
Var Gff 2
Nf
Nm
Nm
"
#)
2
1
Nm
Var Gmf 1 :
2
Nf
Nm
1
1
Ne 9
(
21
Expressions (20) and (21) are respectively the discrete generation versions of results derived by
Hill [10,13] and Pollak [11,14].
Y. Wang, E. Pollak / Mathematical Biosciences 166 (2000) 1±21
9
Another special case of (11) is where there are independent Poisson distributions of male and
female ospring of an individual in each generation. Then
j
Var Guv
Nv j1
j
Nu
and
j
j
; Guf 0:
Cov Gum
Hence (11) reduces to
"
#
kÿ1
1
1 X
1
1
4
2
1
1
4
2
ÿ
ÿ
Ne 16k j0 Nm j1 Nf j1 Nm j Nm j1 Nf j1 Nm j1 Nf j Nf j1
"
#
kÿ1
1 X
1
1
;
4k j0 Nm j Nf j
so that Ne is the harmonic mean of the k eective sizes in a cycle, each of which is of the form
derived by Wright [1].
Analogously, a special case of (19) is one for which there are independent Poisson distributions
of male and female ospring of a female and the number of daughters produced by a male has a
Poisson distribution. Then
#
#
"
"
kÿ1
kÿ1
1
1 X
1
1
4
3
2
2
1
1 X
4
2
;
ÿ
ÿ
Ne 9k j0 Nf j1 Nm j1 Nf j Nf j1 Nf j1 Nm j Nm j1
9k j0 Nf j Nm j
so that Ne is the harmonic mean of the k eective population sizes in a cycle, each of which is of
the form derived by Wright [15].
5. Another characterization of Ne
In this section, we will express the results given by (11) and (19) in terms of variance eective
numbers within a cycle. The reasoning will be based on that used by Hill [10]. Thus, we take
account of three sources of variability when gametes are transmitted from one generation to the
next: between numbers of gametes produced by dierent individuals, between genotypes of individuals, and among alleles carried by gametes when parents are heterozygous.
We consider a population in which parents and ospring are, respectively, in phases j and j 1
j
be the number of successful gametes contributed by the rth parent of sex u to
of a cycle. Let Guvr
ospring of sex v, and
Xur j the frequency of allelle B1 in the rth parent of sex u;
j
duvr` the difference between Xur j and the frequency of B1 in the
`th gamete contributed to an offspring of sex v by the rth parent of sex u:
So if the population has alleles B1 and B2 , Xur i will respectively be equal to 1, 1/2 and 0 in indi j
viduals of genotypes B1 B1 , B1 B2 and B2 B2 . In addition duvr` will be 0 if the parent of sex u is a
10
Y. Wang, E. Pollak / Mathematical Biosciences 166 (2000) 1±21
homozygote or a male with an X-linked locus and 1=2 with equal probabilities if it is a heterozygote.
If there is an autosomal locus the male and female ospring, respectively, originate from 2Nm j1
j1
successful gametes. These two sets of gametes are drawn from a population whose
and 2Nf
frequency of B1 is p. The frequency p0 of B1 among the ospring is the unweighted average of the
frequencies among males and females. Thus
8 2
2
39
3
j
j
j
j
Nf
Gfmr
>
>
Gmmr
Nm
<
=
X
X
X
X
1
6 j j
j 5
j 7
j
j
0
4
d
X
X
G
G
d
p
4
5
fmr fr
mmr`
fmr`
mmr mr
j1
>
4Nm >
: r1
;
`1
r1
`1
9
8 2
3
3
2
j
j
j
j
Nf
Gffr
Gmfr
>
>
N
=
<
m
X
1
6 j j X j 7 X6 j j X j 7
22
dffr` 5 :
dmfr` 5
j1
4Gffr Xfr
4Gmfr Xmr
>
>
4Nf
;
: r1
r1
`1
`1
j
When Nm j and Nf are large, the covariance between Guvr and Gu0 v0 r0 is approximately 0 if r 6 r0 , and
j
E Guvr
Nv j1
j
Nu
23
:
j
As we are considering neutral alleles the distribution of G j
uvr is the same for all r, so that Guvr and
j
j
j
j
Xur j are independent, as are Guvr
and duvr` . Also, if Nm j and Nf are large, the variance of both Xmr
j
and Xfr is
Var Xur j 12 p 1 ÿ p;
24
because the frequencies of B1 B1 ; B1 B2 ; and B2 B2 in both sexes are approximately p2 ; 2p 1 ÿ p and
1 ÿ p2 . In addition
j
j
Var duvr` E duvr` 2 14 P B1 B2 12 p 1 ÿ p:
25
After some algebra, it follows from Eqs. (22)±(25) that
2
Var p0 j p E p08
ÿ p j p
2
!
>
<
j
j1
p 1 ÿ p
Nm
N
j
j
j
m
; Gmf
Cov Gmm
2 4Var Gmm 2
j1
>
2
Nf
: 16 Nm j1
3
2
!2
j
j1
j1
N
Nm
N
f
j
j
Var Gmf 2 m j 5
2 4Var Gff
j1
j1
Nf
Nm
16 Nf
39
!
!2
j1
j1
j1 >
=
Nf
Nf
Nf
j
j
j
5
;
G
2
Var G
Cov G
2
fm
ff
fm
j1
j1
j
>
Nm
Nm
Nf
;
j
p 1 ÿ p
j
2NeV
;
26
where NeV is the variance eective population number when parents are in generation j of a cycle.
11
Y. Wang, E. Pollak / Mathematical Biosciences 166 (2000) 1±21
j
Since the changes in allele frequencies in dierent generations are independent, Nm j and Nf are
large for all j and k min Nm j ; Nm j , the variance
of the total change in allele frequency
P
j
throughout a cycle is approximately equal to kj1 p 1 ÿ p= 2NeV . Hence, we can de®ne the
eective population size to be
k
1
1X
1
j
NeV
k j1 NeV
8
2
!
kÿ1 >
< N j
j1
1 X
N
j
j
j
m
m
; Gmf
Cov Gmm
2 4Var Gmm 2
j1
16k j0 >
j1
N
: Nm
f
3
!2
Nm j1
2
j
Var Gmf 5 j1
j1
Nm
Nf
2
!
j
j1
Nf
Nf
j
j
j
Cov Gff ; Gfm
2 4Var Gff 2
j1
j1
Nm
Nf
9
3
!2
>
j1
Nf
2 =
j 5
Var G
:
27
fm
j1
j1 >
Nm
Nf
;
Note that the individual expressions being summed in (11) dier, with respect to terms not involving variances and covariances, from corresponding terms in (27). However, the sum of such
expressions in (11) is
8
9
2
"
!2
!3 >
#
>
j
j1
j1
2
kÿ1
<
=
X
Nf
Nf
Nf
Nm j
Nm j1
Nm j1
4
5
4
4
ÿ
2
ÿ
2
2
2
j
j
j
j
>
>
j1
j1
Nm
Nm
Nf
Nf
;
j0 : Nm
Nf
"
"
#
#
kÿ1
kÿ1
X
X
4
2
4
2
2
2
ÿ j1 j ÿ j1
j1 ;
j
j1
Nm
Nf
Nf
Nf
j0 Nm
j0 Nm
so that (11) and (27) give identical expressions for the eective population number.
j1
female ospring
Now let us suppose that there is a sex-linked locus. In this case the Nf
j1
originate from 2Nf
successful gametes, but the Nm j1 males are derived from only Nm j1 gametes, all of which were contributed by their mothers. Another new feature is that the frequency p0
among the ospring is a weighted average, with weights 1/3 and 2/3 for males and females. Thus
3
2
j
j
Gfmr
Nf
1 X6 j j X j 7
dfmr` 5
p0
4Gfmr Xfr
j1
3Nm
`1
r1
8
39
2
j
j
j
G
N
>
N
ffr
f
=
m
X6 j j X j 7>
2
1
3 2Nf j1 >
;
: r1
`1
r1
12
Y. Wang, E. Pollak / Mathematical Biosciences 166 (2000) 1±21
Eqs. (24) and (25) still hold if u f , but now
j
Var Xmr
p 1 ÿ p;
29
because the frequencies of B1 Y and B2 Y are p and 1 ÿ p.
j
Eq. (23) also remains valid except for the fact that E Gmmr
0. It therefore follows from
Eqs. (23)±(25) and (29) that
Var p0 jp E p0 ÿ p2 jp
8
2
!
>
j
j1
Nf
N
p 1 ÿ p <
f
j
j
j
Cov Gff ; Gfm
2 4Var Gff 2
j1
>
2
j1
N
:9 N
m
f
3
39
2
!2
!2
>
j1
j1
Nf
Nf
Nm j
Nm j1
Nm j1 5=
j
j
5
4
Var Gfm j
Var Gmf j
2 2
j1
j1
>
j1
Nm
Nf
Nf
Nm
;
9 Nm
p 1 ÿ p
:
2NeV
30
Hence
8
2
!
j
j1
kÿ1 >
<
X
Nf
Nf
1
1
j
j
j
Cov Gff ; Gfm
2 4Var Gff 2
j1
>
NeV 9k j0 :
j1
Nm
Nf
9
2
3
!2
>
1
Nm j 4 Nm j1
1 =
j 5
j1
Var Gmf j1 :
2 2
j1
j1
N
Nf
Nm >
;
f
Nm
j1
Nf
j1
Nm
!2
3
j
Var Gfm 5
31
The terms in (19) that do not involve variances and covariances add to
#
#
"
"
kÿ1
kÿ1
X
X
4
3
2
1
1
1
ÿ j1 j ÿ j1
j1
j
j1
Nf
Nm
Nm
Nm
j0 Nf
j0 Nf
so that, while individual summands in (19) and (31) dier, Ne , given by (19), and NeV , given by
(31), are the same.
6. Inbreeding and variance eective population numbers
j
are small and set duv;0;0 1 for all u and v, Eqs. (4)
If we assume that all the probabilities Pu;vw
imply that
dmm;i1;t1 dff ;i1;t1 dmf ;i1;t1 14dmm;it 2dmf ;i;t dff ;i;t
for all t. Thus, if we rede®ne the time scale so that t 0 and t 2 respectively indicate times at
which grandparents and grandchildren live, then
Y. Wang, E. Pollak / Mathematical Biosciences 166 (2000) 1±21
i
1 h 0
1 h 0
0
0
U2;2 dmf ;11 Pm;mf Pf ;mf U0;0 1 ÿ Pm;mf Pf ;mf
8
4
i
i
1 h 0
1 h 0
0
0
Pm;mf Pf ;mf U0;0 1 ÿ Pm;mf Pf ;mf U1;1 :
8
4
i
13
dmf ;00
32
This equation is a generalization of the usual recurrence equation connecting three successive
values of the panmictic index when there are independent Poisson distributions of numbers of
0
male and female ospring of an individual. It thus makes sense to replace 14 Nm 0 14 Nf in that
special case by
i
1
1 h 0
0
Pm;mf Pf ;mf ;
33
NeI 4
which is the probability that both gametes uniting to produce an individual in generation 2 came
from the same grandparent in generation 0. If we simplify our notation, by setting
and
Nv 1
luv
E G 0
uv
1
Nu
0
0
Cov Gum
; Guf rum;uf ;
it follows, when (10) is substituted in (33), that
#
"
1
1 rmm;mf lmm lmf rfm;ff lfm lff
;
0
0
NeI 4
Nm lmm lmf
Nf lfm lff
34
as found by Crow and Denniston [8].
In the particular use in which rum;uf 0, (34) reduces to
1
1
1
0 ;
0
NeI 4Nm
4Nf
which would hold, for example, if each individual had independent Poisson distributions of
numbers of male and female ospring. But then (27) implies, for k 1, that
"
#
1
1 1
1
:
NeV 4 Nm 1 Nf 1
In general, (34) seems to be dierent from the special case of (11) that results when k is set equal
to 1. But the two approaches that are used to calculate NeI can be reconciled as follows. For
simplicity, we set
ÿ
r2uv
Var G 0
uv
and
0
0
r2u Var Gum
Guf :
14
Y. Wang, E. Pollak / Mathematical Biosciences 166 (2000) 1±21
0
0
Next, we note that, given Gum
Guf Gu g; G 0
um has a binomial distribution with mean
1
1
1
gNm 1 = Nm 1 Nf and variance gNm 1 Nf = Nm 1 Nf 2 . Hence
1
r2um
r2uf
EVar Gum jGu VarE Gum jGu h
1
Nm 1 Nf
1
1
Nm Nf
"
#2
lum luf
lum
r2u ;
lum luf
lum luf
"
#2
lum luf
luf
r2u ;
lum luf
lum luf
Nm 1 Nf
i2
0
Nu
"
Nm 1
1
Nm
1
Nf
#2
r2u
and
rum;uf ECov Gum ; Guf j Gu CovE Gum j Gu ; E Guf j Gu
1
1
1
Nm 1 Nf
Nm 1 Nf
Nm 1 Nf
lum luf
lum luf
2
ÿh
r2 :
i2
h
i2 ru ÿ
2 u
0
l
l
1
1
1
1
l
l
Nu
um
uf
um
uf
Nm Nf
Nm Nf
Thus
# "
#2 "
#
2 "
l
l
l
l
l
l
um
uf
um
uf
um
uf
r2uf ÿ
r2um ÿ
lum luf
luf
lum luf
#
"
2
lum luf
lum luf
:
r2um;uf
lum luf
lum luf
lum luf
r2u
lum
35
It, therefore, follows from Eqs. (34) and (35) that
(
"
"
#
#)
1
1
1
1
1
1
1
1
0
r2 1 ÿ
r2 1 ÿ
2 f
NeI 4 Nm 0 lmm lmf 2 m
lmm lmf
lfm lff
lfm lff
Nf
(
"
#
r2mf
lmf
1
1 r2mm
rmm;mf
lmm
2
2
ÿ
4
ÿ
ÿ
16 Nm 0 l2mm
lmm lmf l2mf lmm lmm lmf lmf lmm lmf lmm lmf
1
0
Nf
"
r2fm
r2ff
lff
lfm
rfm;ff
2
2
ÿ
ÿ
ÿ
4
2
2
lfm
lfm lff lff lfm lfm lff lff lfm lff lfm lff
#)
:
Hence
(
"
#
r2mf
r2mm
rmm;mf
4
2
2
2
0 ÿ 1
0 l2
lmm lmf lmf
Nm
Nm
Nm
mm
)
"
#
r2ff
1 r2fm
rfm;ff
4
2
0
2
0 ÿ 1 ;
2
2
lfm lff lff
lfm
Nf
Nf
Nf
1
1
NeI 16
1
36
Y. Wang, E. Pollak / Mathematical Biosciences 166 (2000) 1±21
15
because
1
0
Nu
"
luf
lum
2
lum lum luf luf lum luf lum luf
1
0
Nu
"
luf lum
lum luf
#
1
0
Nu
"
1
1
lum luf
#
#
1
1
Nm
1
1
Nf
:
Expression (36) is consistent with (11).
If a locus is sex-linked and we set k 1 and j 0
"
!
#
1
1
1
1
1
1
1
ÿ
4
ÿ 1
:
ÿ 1 2
0
0
NeI NeV 9
Nf
Nf
Nm
Nm
Suppose, in particular, that there are independent Poisson distributions of male and female ospring of a female, and the number of daughters produced by a male has a Poisson distribution.
Hence, by the discussion in Section 4,
#
"
1
1 4
2
NeI 9 Nf 0 Nm 0
and
#
"
1
1 4
2
:
NeV 9 Nf 1 Nm 1
7. Mutation eective population numbers
i
Let Pu;vw
have the same meaning as in Sections 2 and 3. But we now assume that the in®nite
alleles model holds, so that genes mutate at a rate u, in such a way that each mutant is to an
entirely novel allelic type. Suppose that a population is in generation i 1 of a cycle at time t 1.
Then, at this time, Gi1;t1 and gvw;i1;t1 respectively denote the probabilities that two copies of a
gene in one individual, and in two random separate individuals of sexes v and w, are identical in
state. Then, if the locus under consideration is autosomal, Eqs. (1) and (2) are replaced by
Gi1;t1 1 ÿ u2 gmf ;it ;
37
and
gvw;i1;t1
i
1 ÿ u2 nh i
i
i
gmm;i;t
Pm;vw Pf ;vw 1 Gi;t 21 ÿ Pm;vw
8
o
i
4gmf ;i;t 21 ÿ Pf ;vw gff ;i;t :
38
If we now set Hi1;t1 1 ÿ Gi1;t1 and hvw;i1;t1 1 ÿ gvw;i1;t1 , (37) and (38) imply that
Hi1;t1 1 ÿ 1 ÿ u2 1 ÿ u2 hmf ;i;t ;
39
16
Y. Wang, E. Pollak / Mathematical Biosciences 166 (2000) 1±21
and
2
i
i
h
1 ÿ u nh i
i
i
hmm;i;t 4hmf ;i;t
Pm;vw Pf ;vw Hi;t 2 1 ÿ Pm;vw
8
h
o
i
i
2 1 ÿ Pf ;vw hff ;i;t :
hvw;i1;t1 1 ÿ 1 ÿ u2
40
Then if hi;t Hi;t ; hmm;i;t ; hmf ;i;t ; hff ;i;t 0 , Eqs. (39) and (40) can be written in matrix notation as
hi1;t1 1 ÿ 1 ÿ u2 1 1 ÿ u2 Ai hi;t ;
41
0
where Ai is the same matrix as in Eq. (5) and 1 1 1 1 1 . We assume that u is very small. Hence,
it follows from (6) and (41) and the fact that the sum of the elements of each row of any power of
A0 is 1, that
hik;tk I 1 ÿ u2 Aikÿ1 1 ÿ u2 kÿ1 Aikÿ1 Ai1 1 ÿ 1 ÿ u2 1
2k
1 ÿ u Aikÿ1 Ai hi;t 2ku1 1 ÿ 2ku M Di hi;t :
i
Thus, if ck maxNm i ; Nf ; uÿ1 ,
hick;tck
4Ne u 1
2cku1 1 ÿ 2ckuq 1p hi;t 2cku1 1 ÿ ck
2Ne
c
0
1p0 hi;t ;
42
where 1 and p0 are the eigenvectors associated with the dominant eigenvalue 1 of
Ar0 ; q 1 ÿ 1= 2Ne k , and Ne is given by (8) and (11). The stationary value of hi;t is hi , where, by
(42),
4Ne u 1 0
0
hi ck 2u ÿ
p hi p hi 1:
2Ne
Hence, all the elements of hi are equal to Hi and, because p0 1 1,
4Ne u
Hi H
; i 1; 2; . . . ; k:
4Ne u 1
43
The deviations from equilibrium are ei;t hi;t ÿ hi and, by (42),
4Ne u 1
1p0 ei;t ;
eick;tck 1 ÿ ck
2Ne
which shows that the equilibrium given by (43) is stable.
If the locus under consideration is sex-linked, it can be shown that
hi1;t1 1 ÿ 1 ÿ u2 1 1 ÿ u2 Ai hi;t ;
44
where Ai is now the same matrix as in (18). It then follows from the same sequence of steps as
those leading from (41) to (43) that
4Ne u
1;
45
h i ! Hi 1
4Ne u 1
where Ne is given by (19). Therefore, whether the locus is autosomal or sex-linked,
Gi;t ! Gi
1
;
4Ne u 1
t ! 1:
Y. Wang, E. Pollak / Mathematical Biosciences 166 (2000) 1±21
17
Thus, Ne , the mutation eective population number, as de®ned by Ewens [16], is the same as the
inbreeding eective population number.
8. Eigenvalue eective population numbers
i
In what follows, let Guvr
have the same meaning as in Section 5. Suppose that this random
variable has a binomial distribution with Nv j1 trials and a probability 1=Nu j of success. In ad j
j
dition, we assume that Gumr
and Gufr are independently distributed if there is an autosomal locus
or when there is a sex-linked locus and u f . Then
Nv j1
1
j
Var Guvr j 1 ÿ j :
Nu
Nu
If a locus is autosomal (27) reduces to
"
#
kÿ1
1
1 X
1
1
;
46
NeV 4k j0 Nm j1 Nf j1
j1
if Nm j1 and Nf
are large for all j 1. If a locus is sex-linked G j
mmr 0 and we assume that
j1
Nf
1
j
Var Gmfr j 1 ÿ j ;
Nm
Nm
Eq. (31) implies that
"
#
kÿ1
1
1 X
4
2
;
47
NeV 9k j0 Nf j1 Nf j1
if the numbers of males and females are always large.
Under the assumptions we have made about population sizes the covariance between the
outputs of successful gametes from two separate individuals of the same sex is approximately 0. It
is then also consistent with the special cases in the foregoing paragraph to consider the distribution of copies of B1 passed on to adults of the next generation by parents of sex u to be binomial. Let Yu t 1 be the number of copies B1 among fertilized eggs that lead to adults of sex v
in generation t 1, and Xuv t be the number of copies of this allele passed on to these fertilized
eggs by parents of sex u. Then, if there is an autosomal locus
Yv t 1 Xmv t Xfv t;
48
where Xmv t and Xfv t have independent binomial distributions. If Nu and Nu0 respectively denote
the number of adults of sex u in generation t and t 1
0 Yu t
:
49
Xuv t Bin Nv ;
2Nu
It follows from (48) and (49) that
Yv t 1
Ym t Yf t
Yu t; u m; f
E
;
0
2Nv
2Nm
2Nf
50
18
Y. Wang, E. Pollak / Mathematical Biosciences 166 (2000) 1±21
#
2
Yv t 1
E
Yu t; u m; f
2Nv0
"
(
2
2
2 #)
1
Yf t
Ym t
Yf t
0
0
0 Ym t
Nv
Nv Nv ÿ 1
2Nv0
2Nm
2Nf
2Nm
2Nf
"
1 Ym t Yf t
;
2 2Nm 2Nf
51
#
Ym t 1 Yf t 1
E
Yu t; u m; f
2Nm0
2Nf0
(
2
2 )
1
Ym t
Ym t
Yf t
Yf t
2
:
4
2Nm
2Nm
2Nf
2Nf
"
52
If, therefore, we average the right-hand sides of Eqs. (50)±(52) over the joint distributions of Ym t
and Yf t and set mv t EYv t= 2Nv t and muv t EYu tYv t= 4Nu Nv ; we obtain the system of equations
mt1 T tmt ;
53
where
mt1 mm t 1; mf t 1; mmm t 1; mmf t 1; mff t 10 ;
mt mm t; mf t; mmm t; mmf t; mff t0 ;
and
2
1
6 21
6 2
6
6 1
6 4Nm0
T t 6
6
6 0
4
1
4Nf0
1
2
1
2
1
4Nm0
0
1
4Nf0
1
4Nm0
0
0
0
0
Nm0 ÿ 1
1
4
1
0
N
f
4Nf0
ÿ 1
1
2
1
2
1
2
0
3
7
7
7
7
T11 t
1
0
N
ÿ
1
7
0
m
4Nm
7
T21 t
7
1
7
4
5
1
Nf0 ÿ 1
4N 0
0
0
T22 t
:
f
The characteristic equation of T t can be shown to be
(
"
!#
1 1
1
1
2
2
ÿ
jT t ÿ kIj k k ÿ 1 k ÿ k 1 ÿ
4 Nm0 Nf0
8
1
1
Nm0 Nf0
!)
0:
This equation has a single root equal to 1 and two others equal to 0. Among the remaining roots
the one of largest absolute value is
8
2
!
!2 31=2 9
0
0
0
0
=
<
Nm Nf
Nm Nf
1
1
1
1
5
:
41
k t 1
1ÿ
1ÿ
;
4Nm0 Nf0
4Nm0 Nf0
2:
8 Nm0 Nf0
If the numbers of males and females do not change between generations this expression reduces to
1 ÿ 1= 2NeE , where NeE is the eigenvalue eective population number obtained by Ewens [17,18].
Y. Wang, E. Pollak / Mathematical Biosciences 166 (2000) 1±21
19
If we now follow the population over a whole cycle, (53) leads to
mtk T t k ÿ 1T t k ÿ 2 T tmt :
Thus, if there are s cycles
(
"
#s )
k
Y
k j
m0 ;
msk A B
s ! 1;
54
j1
where A and B are matrices that depend on the phase of the cycle at time 0. Note that, by (46),
"
#
k
k
Y
1X
1
1
k
j 1 ÿ
k j 1 ÿ
j
8 j1 Nm
2NeV
Nf
j1
so that at the level of approximations used in this paper, the recurrence equations on the ®rst and
second moments lead to the variance eective population number.
We claim that, at least for the model discussed in this section, NeV is the same as NeE , the eigenvalue eective size of Ewens [17,18], which is the largest non-unit eigenvalue of the matrix of
transition probabilities of numbers of copies of B, in males and females when the same phase is
considered in two successive cycles. If second moments are traced through s cycles, and s is large,
they each satisfy
"
#
ks
1 Ym 0 Yf 0
1
0 1 ÿ
muv sk j AC;
55
muv sk
2 Nm 0
2NeE
Nf
where muv sk j AC is the expected value of the uvth moment, given the asymptotic conditional
distribution of allele frequency pairs in males and females when B1 has been neither ®xed nor lost.
Since (54) and (55) are of the same form, NeE is at least approximately equal to NeV .
If the locus under consideration is sex-linked (48) still holds if v f , but
0 Ym t
Xmf t Bin Nf ;
Nm
and
0 Yf t
Ym t 1 Xfm t Bin Nm ;
:
2Nf
Again, we assume that any pair of the random variables Xuv t are independent. It can then be
shown that
mt1 T tmt ;
where mm t Ym t=Nm ; mm t 1 Ym t 1=Nm0 and
3
2
0
1
0
0
0
7
6 1
1
0
0
0
7
6 2
2
7
6
1
1
6 0
0
0
1 ÿ Nm0 7
Nm0
T t 6
7:
7
6
1
1
7
6 0
0
0
2
2
5
4
1
1
1
1
1
1
1
1 ÿ N 0 2 4 1 ÿ N 0
4N 0
4N 0
4
f
f
f
f
20
Y. Wang, E. Pollak / Mathematical Biosciences 166 (2000) 1±21
The characteristic equation corresponding to T t is
!
!
(
1
1
3
1
3
1
1
1
3
2
2
ÿ
ÿ
k
k ÿ kÿ
ÿk k
ÿ
2
2
4 4Nf0
8 4Nm0 8Nf0 4Nm0 Nf0
!)
1
1
1
1
1
1
2
1ÿ 0 ÿ 0 0 0
k ÿ kÿ
ÿ
f k 0:
8
Nm Nf Nm Nf
2
2
The ®rst factor has roots 1 and ÿ12, whereas
!
1
1
;
f 1 ÿ
8Nm0 4Nf0
1
1
1
1
1
f ÿ
f 0 < 0; f
;
0;
0
2
8
2
8Nf 4Nm
and
9
1 1
;
f 0 1 ÿ O
8
Nm0 Nf0
!
:
Hence, by using Newton's method and the intermediate value theorem, we ®nd that the largest
root of f 0 0 is approximately
!
1
2
1ÿ
9Nm0 9Nf0
and the other two roots are between 0 and 12 in magnitude, with one being negative and the other
positive. Thus, if we trace the population through s cycles we ®nd, as for an autosomal locus, that
!
k
1
1 X
2
4
1
j
:
j
NeE 9k y1 Nm
NeV
Nf
We have not yet been able to generalize the foregoing reasoning to arbitrary ospring distributions, but conjecture that then also NeE NeV . Another reason to suspect that this is the case is
that Eq. (31), with k 1, was ®rst derived by Pollak [19] by a computation of the approximate
rate at which the probabilities in the asymptotic conditional distribution decrease. This method
also led to Eq. (27) when k 1. For either an autosomal or a sex-linked locus the branching
process approximation used by Pollak [19] involved looking forward in time from parents to
ospring, so that a variance eective population number was obtained.
9. Discussion
In this paper expressions have been derived for the eective population number with cyclic
variation in numbers of males and females, for either an autosomal or a sex-linked locus. In either
case, it is approximately equal to the harmonic mean of eective population numbers in a cycle,
Y. Wang, E. Pollak / Mathematical Biosciences 166 (2000) 1±21
21
and is the same regardless of whether the harmonic mean is calculated from inbreeding eective
numbers or variance eective numbers. We obtain the same result by calculating the mutation
eective size, and, as far as we can tell, the eigenvalue eective size. Of course, as shown in Section
6, the inbreeding eective number, NeI , and the variance eective number, NeV , are not the same if
only two successive generations are considered and the population changes in the size. As far as
we know, such expressions have not previously been given when there is a sex-linked locus.
In our calculations, we have not restricted ourselves to assuming Poisson distributions of ospring. Thus complete account has been taken of both ¯uctuations in population size and variability in family sizes. Frankham [20] reviewed 192 published estimates from 102 species of the
ratio of Ne to the actual population size, N, and concluded that the ®rst and second most important variables explaining variation among these estimates were, respectively, ¯uctuation in
population size and variance in family size. These variables both act to reduce Ne =N to the low
values found by him in wildlife populations.
References
[1] S. Wright, Evolution in Mendelian populations, Genetics 16 (1931) 97.
[2] S. Wright, Size of population and breeding structure in relation to evalution, Science 87 (1938) 430.
[3] S. Wright, Statistical genetics in relation to evolution, in: Actualities scienti®ques et industrielles, vol. 802, Exposes
de Biometrie et de la statistique biologique, XIII, Hermann, Paris, 1939, p. 5.
[4] J.F. Crow, Breeding structure of populations. II. Eective population number, in: O. Kempthorne, T.A. Bancroft,
J.W. Gowen, J.L. Lush (Eds.), Statistics and Mathematics in Biology, lowa State College Press, Ames, 1954, p. 543.
[5] J.F. Crow, M. Kimura, An Introduction to Population Genetics Theory, Harper and Row, New York, 1970.
[6] A. Caballero, On the eective size of populations with separate sexes with particular reference to sex-linked genes,
Genetics 139 (1995) 1007.
[7] T. Nagylaki, The inbreeding eective population number in dioecious populations, Genetics 139 (1995) 473.
[8] J.F. Crow, C. Denniston, Inbreeding and variance eective numbers, Evolution 42 (1998) 482.
[9] E. Pollak, Fixation probabilities when the population size undergoes cyclic ¯uctuations, Theoret. Popul. Biol.
(2000) 51.
[10] W.G. Hill, A note on eective population size with overlapping generations, Genetics 92 (1979) 317.
[11] E. Pollak, The eective size of an age-structured population with a sex-linked locus, Math. Biosci. 101 (1990) 121.
[12] J.N. Franklin, Matrix Theory, Prentice-Hall, Englewood Clis, NJ, 1968.
[13] W.G. Hill, Eective size of populations with overlapping generations, Theoret. Popul. Biol. 3 (1972) 278.
[14] E. Pollak, Eective population numbers and mean times to extinction in dioecious populations with overlapping
generations, Math. Biosci. 52 (1980) 1.
[15] S. Wright, Inbreeding and homozygosis, Proc. Nat. Acad. Sci. 19 (1933) 411.
[16] W.J. Ewens, The eective population sizes in the presence of catatstrophes, in: M. Feldman (Ed.), Mathematical
Evolutionary Theory, Princeton University, Princeton, 1989, p. 9.
[17] W.J. Ewens, Mathematical Population Genetics, Springer, Berlin, 1979.
[18] W.J. Ewens, On the concept of the eect population size, Theoret. Popul. Biol. 21 (1982) 373.
[19] E. Pollak, Eective population numbers and mean times to extinction in dioecious populations with overlapping
generations, Math. Biosci. 52 (1980) 1.
[20] R. Frankham, Eective population size/adult population size ratios in wildlife: a review, Genet. Res. Comb. 66
(1995) 95.