Electronic Journal of Probability, Vol. 14 (2009), Paper no. 43, pages 1222–1267.
Journal URL
http://www.math.washington.edu/~ejpecp/
Random networks with sublinear preferential attachment:
Degree evolutions
Steffen Dereich∗
Institut für Mathematik, MA 7-5, Fakultät II, Technische Universität Berlin, Straße des 17. Juni 136, 10623 Berlin, Germany
Peter Mörters†
Department of Mathematical Sciences, University of Bath, Claverton Down, Bath BA2 7AY, United Kingdom
Abstract
We define a dynamic model of random networks, where new vertices are connected to old ones with a probability proportional to a sublinear function of their degree. We first give a strong limit law for the empirical degree distribution, and then have a closer look at the temporal evolution of the degrees of individual vertices, which we describe in terms of large and moderate deviation principles. Using these results, we expose an interesting phase transition: in cases of strong preference of large degrees, eventually a single vertex emerges forever as vertex of maximal degree, whereas in cases of weak preference, the vertex of maximal degree is changing infinitely often. Loosely speaking, the transition between the two phases occurs in the case when a new edge is attached to an existing vertex with a probability proportional to the root of its current degree.
Key words: Barabási-Albert model, sublinear preferential attachment, dynamic random graphs, maximal degree, degree distribution, large deviation principle, moderate deviation principle.
AMS 2000 Subject Classification: Primary 05C80; Secondary: 60C05, 90B15.
Submitted to EJP on July 30, 2008, final version accepted May 6, 2009.
∗Supported by a grant from Deutsche Forschungsgemeinschaft (DFG).
1 Introduction
1.1 Motivation
Dynamic random graph models, in which new vertices prefer to be attached to vertices with higher degree in the existing graph, have proved to be immensely popular in the scientific literature recently. The two main reasons for this popularity are, on the one hand, that these models can be easily defined and modified, and can therefore be calibrated to serve as models for social networks, collaboration and interaction graphs, or the web graph. On the other hand, if the attachment probability is approximately proportional to the degree of a vertex, the dynamics of the model can offer a credible explanation for the occurrence of power law degree distributions in large networks. The philosophy behind these preferential attachment models is that growing networks are built by adding nodes successively. Whenever a new node is added it is linked by edges to one or more existing nodes with a probability proportional to a function f of their degree. This function f, called attachment rule, or sometimes weight function, determines the qualitative features of the dynamic network.
The heuristic characterisation does not amount to a full definition of the model, and some clarifications have to be made, but it is generally believed that none of these crucially influence the long time behaviour of the model.
It is easy to see that in the general framework there are three main regimes:
• the linear regime, where f(k) ≍ k;
• the superlinear regime, where f(k) ≫ k;
• the sublinear regime, where f(k) ≪ k.
The linear regime has received most attention, and a major case has been introduced in the much-cited paper Barabási and Albert [1999]. There is by now a substantial body of rigorous mathematical work on this case. In particular, it is shown in Bollobás et al. [2001], Móri [2002] that the empirical degree distribution follows an asymptotic power law and in Móri [2005] that the maximal degree of the network is growing polynomially of the same order as the degree of the first node.
In the superlinear regime the behaviour is more extreme. In Oliveira and Spencer [2005] it is shown that a dominant vertex emerges, which attracts a positive proportion of all future edges. Asymptotically, after n steps, this vertex has degree of order n, while the degrees of all other vertices are bounded. In the most extreme cases eventually all vertices attach to the dominant vertex. In the linear and sublinear regimes Rudas et al. [2007] find almost sure convergence of the empirical degree distributions. In the linear regime the limiting distribution obeys a power law, whereas in the sublinear regime the limiting distributions are stretched exponential distributions. Apart from this, there has not been much research so far in the sublinear regime, which is the main concern of the present article, though we include the linear regime in most of our results.
Specifically, we discuss a preferential attachment model where new nodes connect to a random number of old nodes, which in fact is quite desirable from the modelling point of view. More precisely, the node added in the nth step is connected independently to any old one with probability f(k)/n, where k is the (in-)degree of the old node. We first determine the asymptotic degree distribution, see Theorem 1.1, and find a result which is in line with that of Rudas et al. [2007]. The result implies in particular that, if f(k) = (k+1)^α for 0 ≤ α < 1, then the asymptotic degree distribution (µ_k) satisfies
$\log \mu_k \sim -\tfrac{1}{1-\alpha}\, k^{1-\alpha},$
showing that power law behaviour is limited to the linear regime. Under the assumption that the strength of the attachment preference is sufficiently weak, we give very fine results about the probability that the degree of a fixed vertex follows a given increasing function, see Theorem 1.13 and Theorem 1.15. These large and moderate deviation results, besides being of independent interest, play an important role in the proof of our main result. This result describes an interesting dichotomy about the behaviour of the vertex of maximal degree, see Theorem 1.5:
• The strong preference case: If $\sum_n 1/f(n)^2 < \infty$, then there exists a single dominant vertex, called persistent hub, which has maximal degree for all but finitely many times. However, only in the linear regime does the number of new vertices connecting to the dominant vertex grow polynomially in time.
• The weak preference case: If $\sum_n 1/f(n)^2 = \infty$, then there is almost surely no persistent hub. In particular, the index, or time of birth, of the current vertex of maximal degree is a function of time diverging to infinity in probability. In Theorem 1.8 we provide asymptotic results for the index and degree of this vertex, as time goes to infinity.
A rigorous definition of the model is given in Section 1.2, and precise statements of the principal results follow in Section 1.3. In Section 1.4 we state fine results on the evolution of degrees, which are useful in the proofs of our main results, but also of independent interest. These include laws of large numbers, a central limit theorem and large deviation principles for the degree evolutions. At the end of that section, we also give a short overview of the remaining parts of this paper.
1.2 Definition of the model
We now explain how precisely we define our preferential attachment model given a monotonically increasing attachment rule f : N ∪ {0} → (0,∞) with f(n) ≤ n+1 for all n ∈ N ∪ {0}. At time n = 1 the network consists of a single vertex (labeled 1) without edges, and for each n ∈ N the graph evolves in the time step n → n+1 according to the following rule:
• add a new vertex (labeled n+1), and
• insert, for each old vertex m, a directed edge n+1 → m with probability
$\frac{f(\text{indegree of } m \text{ at time } n)}{n}.$
The new edges are inserted independently for each old vertex. Note that the assumptions imposed on f guarantee that in each evolution step the probability for adding an edge is smaller than or equal to 1. Formally we are dealing with a directed network, but indeed, by construction, all edges point from the younger to the older vertex, so that the directions can trivially be recreated from the undirected (labeled) graph.
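This construction is straightforward to simulate. The following Python sketch is our own illustration (it is not part of the paper); the attachment rule f(k) = (k+1)^0.4 and the number of steps are arbitrary example choices, and only indegrees are stored since the outdegree of a vertex is fixed at its birth.

```python
import random

def grow_network(f, n_steps, rng=random.Random(0)):
    """Grow the preferential attachment network for n_steps vertices.

    f is a nondecreasing attachment rule on {0,1,2,...} with f(k) <= k+1,
    which guarantees that f(indegree)/n is a probability in every step.
    Returns the list of indegrees; indegree[m-1] belongs to vertex m.
    """
    indegree = [0]                       # time n = 1: a single vertex, no edges
    for n in range(1, n_steps):
        for m in range(n):               # vertex n+1 connects to each old vertex independently
            if rng.random() < f(indegree[m]) / n:
                indegree[m] += 1
        indegree.append(0)               # the new vertex starts with indegree 0
    return indegree

degs = grow_network(lambda k: (k + 1) ** 0.4, 2000)
print("maximal indegree:", max(degs), "attained by vertex", degs.index(max(degs)) + 1)
```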
There is one notable change to the recipe given in Krapivsky and Redner [2001]: we do not add one edge in every step but a random number, a property which is actually desirable in most applications. Given the graph after attachment of the nth vertex, the expected number of edges added in the next step is
$\frac{1}{n} \sum_{m=1}^{n} f\big(\text{indegree of } m \text{ at time } n\big).$
This quantity converges, as n → ∞, almost surely to a deterministic limit λ, see Theorem 1.1. Moreover, the law of the number of edges added is asymptotically Poissonian with parameter λ. Observe that the outdegree of every vertex remains unchanged after the step in which the vertex was created. Hence our principal interest when studying the asymptotic evolution of degree distributions is in the indegrees.
1.3 Presentation of the main results
We denote by Z[m,n], for m, n ∈ N, m ≤ n, the indegree of the mth vertex after the insertion of the nth vertex, and by X_k(n) the proportion of nodes of indegree k ∈ N ∪ {0} at time n, that is
$X_k(n) = \frac{1}{n} \sum_{i=1}^{n} \mathbb 1_{\{Z[i,n]=k\}}.$
Denote µ_k(n) = E X_k(n), X(n) = (X_k(n) : k ∈ N ∪ {0}), and µ(n) = (µ_k(n) : k ∈ N ∪ {0}).
Theorem 1.1 (Asymptotic empirical degree distribution).
(a) Let
$\mu_k = \frac{1}{1+f(k)} \prod_{l=0}^{k-1} \frac{f(l)}{1+f(l)} \qquad \text{for } k \in N \cup \{0\},$
which is a sequence of probability weights. Then, almost surely,
$\lim_{n\to\infty} X(n) = \mu$
in total variation norm.
(b) If f satisfies f(k) ≤ ηk + 1 for some η ∈ (0,1), then the conditional distribution of the outdegree of the (n+1)st incoming node (given the graph at time n) converges almost surely in the total variation norm to the Poisson distribution with parameter λ := ⟨µ, f⟩.
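Both the limiting weights of part (a) and the Poisson parameter λ = ⟨µ, f⟩ of part (b) are easy to tabulate. The following minimal sketch is ours; the attachment rule f(k) = (k+1)^{1/2} and the truncation level are illustrative assumptions, not taken from the paper.

```python
def limit_degree_distribution(f, kmax):
    """mu_k = 1/(1+f(k)) * prod_{l<k} f(l)/(1+f(l)), for k = 0,...,kmax."""
    mu, prod = [], 1.0
    for k in range(kmax + 1):
        mu.append(prod / (1.0 + f(k)))
        prod *= f(k) / (1.0 + f(k))
    return mu

f = lambda k: (k + 1) ** 0.5                       # example sublinear rule
mu = limit_degree_distribution(f, 200)
lam = sum(m * f(k) for k, m in enumerate(mu))      # lambda = <mu, f>
print("total mass:", sum(mu), " Poisson parameter lambda:", lam)
```

With the stretched exponential tails of Example 1.4 below, a truncation at a couple of hundred states already captures essentially all of the mass.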
Remark 1.2. In the model introduced in Krapivsky and Redner [2001] and studied by Rudas et al. [2007], every new vertex is connected to exactly one existing vertex. Every vertex is chosen with a probability proportional to a function w : N ∪ {0} → [0,∞) of its indegree. The asymptotic indegree distribution they obtain coincides with ours if f is chosen as a constant multiple of w. This is strong evidence that these models show the same qualitative behaviour, and that our further results hold mutatis mutandis for preferential attachment models in which new vertices connect to a fixed number of old ones.
Example 1.3. Suppose f(k) = γk + β for fixed γ, β ∈ (0,1] and for all k ∈ N ∪ {0}. Then the asymptotic empirical distribution can be expressed in terms of the Γ-function,
$\mu_k = \frac{1}{\gamma}\; \frac{\Gamma\big(k+\frac{\beta}{\gamma}\big)\, \Gamma\big(\frac{\beta+1}{\gamma}\big)}{\Gamma\big(k+1+\frac{\beta+1}{\gamma}\big)\, \Gamma\big(\frac{\beta}{\gamma}\big)}.$
By Stirling's formula, Γ(t+a)/Γ(t) ∼ t^a as t tends to infinity. Hence,
$\mu_k \sim \frac{\Gamma\big(\frac{\beta+1}{\gamma}\big)}{\gamma\, \Gamma\big(\frac{\beta}{\gamma}\big)}\; k^{-(1+\frac1\gamma)}, \qquad \text{as } k \to \infty.$
This is in line with the linear case in the classical model, where new vertices connect to a fixed number m of old ones chosen with a probability proportional to their degree plus a constant a > −m. Independently of the chosen variant of the model, there are analogues of Theorem 1.1 with degree sequences (µ_k) of order k^{−(3+a/m)}, see for instance Móri [2002] and Bollobás et al. [2001] for the case a = 0 and Hofstad [2009] for the general case. The tail behaviour of our and the classical models coincide if γ = 1/(2+a/m).
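As a sanity check, the product formula of Theorem 1.1 and the Γ-function expression above can be compared numerically. The parameter values in the sketch below are our own arbitrary choices.

```python
from math import lgamma, exp

gamma_, beta = 0.7, 0.4                            # example values in (0, 1]
f = lambda l: gamma_ * l + beta

def mu_product(k):
    """mu_k via the product formula of Theorem 1.1."""
    p = 1.0 / (1.0 + f(k))
    for l in range(k):
        p *= f(l) / (1.0 + f(l))
    return p

def mu_gamma(k):
    """mu_k via the Gamma-function expression of Example 1.3."""
    return (1.0 / gamma_) * exp(lgamma(k + beta / gamma_) + lgamma((beta + 1) / gamma_)
                                - lgamma(k + 1 + (beta + 1) / gamma_) - lgamma(beta / gamma_))

for k in (0, 1, 5, 20):
    print(k, mu_product(k), mu_gamma(k))           # the two columns agree
```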
Example 1.4. Suppose f(k) ∼ γk^α, for 0 < α < 1 and γ > 0; then a straightforward analysis yields that
$\log \mu_k \sim -\sum_{l=1}^{k+1} \log\big(1+(\gamma l^{\alpha})^{-1}\big) \sim -\frac{1}{\gamma}\,\frac{1}{1-\alpha}\, k^{1-\alpha}.$
Hence the asymptotic degree distribution has stretched exponential tails.
Our main result describes the behaviour of the vertex of maximal degree, and reveals an interesting dichotomy between weak and strong forms of preferential attachment.
Theorem 1.5 (Vertex of maximal degree). Suppose f is concave. Then we have the following dichotomy:
Strong preference. If
$\sum_{k=0}^{\infty} \frac{1}{f(k)^2} < \infty,$
then with probability one there exists a persistent hub, i.e. there is a single vertex which has maximal indegree for all but finitely many times.
Weak preference. If
$\sum_{k=0}^{\infty} \frac{1}{f(k)^2} = \infty,$
then with probability one there exists no persistent hub and the time of birth, or index, of the current hub tends to infinity in probability.
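The dichotomy can be observed, at least qualitatively, in simulations. The sketch below (ours, not from the paper) counts how often the identity of the current maximal-indegree vertex changes in a single run; the rules f(k) = (k+1)^0.8 (so that the above sum converges) and f(k) = (k+1)^0.3 (so that it diverges) are illustrative choices, and the run length is far too short to exhibit the limit behaviour, so the output should only be read as a tendency.

```python
import random

def leader_changes(alpha, n_steps, seed=1):
    """Count strict changes of the maximal-indegree vertex along one run."""
    rng = random.Random(seed)
    f = lambda k: (k + 1) ** alpha
    indegree = [0]
    leader, changes = 0, 0
    for n in range(1, n_steps):
        for m in range(n):
            if rng.random() < f(indegree[m]) / n:
                indegree[m] += 1
        indegree.append(0)
        best = max(range(len(indegree)), key=indegree.__getitem__)
        if best != leader and indegree[best] > indegree[leader]:
            leader, changes = best, changes + 1
    return changes

print("strong preference (alpha = 0.8):", leader_changes(0.8, 2000))
print("weak preference   (alpha = 0.3):", leader_changes(0.3, 2000))
```

In typical runs the counter is noticeably larger in the weak preference case.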
Remark 1.6. Without the assumption of concavity of f, the assertion remains true in the weak preference regime. In the strong preference regime our results still imply that, almost surely, the number of vertices which at some time have maximal indegree is finite.
Remark 1.7. In the weak preference case the information about the order of the vertices is asymptotically lost: as a consequence of the proof of Theorem 1.5, we obtain that, for any two vertices n < n′,
$\lim_{m\to\infty} P\big(Z[n,m] > Z[n',m]\big) = \tfrac12.$
Conversely, in the strong preference case, the information about the order is not lost completely and one has
$\lim_{m\to\infty} P\big(Z[n,m] > Z[n',m]\big) > \tfrac12.$
Our next aim is to determine the typical age and indegree evolution of the hub in the weak preference case. For this purpose we make further assumptions on the attachment rule f. We now assume that
• f is regularly varying with index 0 ≤ α < 1/2,
• for some η < 1, we have f(j) ≤ η(j+1) for all j ∈ N ∪ {0}. (1)
We define two increasing ‘scaling’ functions,
$\Psi(n) := \sum_{m=1}^{n-1} \frac{1}{m} \sim \log n \qquad \text{for } n \in N, \qquad (2)$
and
$\Phi(n) := \sum_{k=0}^{n-1} \frac{1}{f(k)} \qquad \text{for } n \in N \cup \{0\}, \qquad (3)$
and we extend the definition of Φ to the positive real line by linearly interpolating between integer points.
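Both scaling functions, and the inverse of Φ appearing in Theorem 1.8 below, are elementary to tabulate. The sketch below is ours; the rule f(k) = (k+1)^0.4 is an assumed example.

```python
import bisect, math

alpha = 0.4
f = lambda k: (k + 1) ** alpha

def Psi(n):                     # Psi(n) = sum_{m=1}^{n-1} 1/m  ~  log n
    return sum(1.0 / m for m in range(1, n))

def Phi_table(N):               # Phi(n) = sum_{k=0}^{n-1} 1/f(k), for n = 0,...,N
    table, s = [0.0], 0.0
    for k in range(N):
        s += 1.0 / f(k)
        table.append(s)
    return table

def Phi_inverse(y, table):
    """Inverse of the piecewise linear interpolation of Phi."""
    j = bisect.bisect_right(table, y) - 1
    return j + (y - table[j]) / (table[j + 1] - table[j])

table = Phi_table(10000)
n = 100000
print(Psi(n), math.log(n))                  # Psi(n) is close to log n
print(Phi_inverse(Psi(n), table))           # typical indegree scale at time n
```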
Theorem 1.8 (Limit law for age and degree of the vertex of maximal degree). Suppose f satisfies (1). Let m*_n be the index of the hub at time n, and Z_n^max the maximal indegree at time n ∈ N. Then there exists a slowly varying function $\bar\ell$ such that, in probability,
$\log m^*_n \sim \frac12\,\frac{1-\alpha}{1-2\alpha}\, \frac{(\log n)^{\frac{1-2\alpha}{1-\alpha}}}{\bar\ell(\log n)}$
and
$Z_n^{\max} - \Phi^{-1}(\Psi(n)) \sim \frac12\,\frac{1-\alpha}{1-2\alpha}\,\log n.$
Remark 1.9. A slightly more general (and more technical) version of this result will be stated as Proposition 1.18. The rôle of the scaling functions and the definition of the slowly varying function $\bar\ell$ will be made explicit in Section 1.4.
The results presented in the next section will shed further light on the evolution of the degree of a fixed vertex, and unlock the deeper reason behind the dichotomy described in Theorem 1.5. These results will also provide the set-up for the proof of Theorems 1.5 and 1.8.
1.4 Fine results for degree evolutions
In order to analyse the network further, we scale the time as well as the way of counting the indegree. Recall the definitions (2) and (3). To the original time n ∈ N we associate an artificial time Ψ(n) and to the original degree j ∈ N ∪ {0} we associate the artificial degree Φ(j). An easy law of large numbers illustrates the role of these scalings.
Proposition 1.10 (Law of large numbers). For any fixed vertex labeled m ∈ N, we have that
$\lim_{n\to\infty} \frac{\Phi(Z[m,n])}{\Psi(n)} = 1 \qquad \text{almost surely.}$
Remark 1.11. Since Ψ(n) ∼ log n, we conclude that for any m ∈ N, almost surely,
$\Phi(Z[m,n]) \sim \log n \qquad \text{as } n\to\infty.$
In particular, we get for an attachment rule f with f(n) ∼ γn and γ ∈ (0,1] that Φ(n) ∼ $\tfrac1\gamma\log n$, which implies that
$\log Z[m,n] \sim \gamma\,\log n, \qquad \text{almost surely.}$
In order to find the same behaviour in the classical linear preferential attachment model, one again has to choose the parameter as a = m(1/γ − 2) in the classical model, cf. Example 1.3.
Similarly, an attachment rule with f(n) ∼ γn^α for α < 1 and γ > 0 leads to
$Z[m,n] \sim \big(\gamma(1-\alpha)\log n\big)^{\frac{1}{1-\alpha}} \qquad \text{almost surely.}$
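Since a new vertex connects to each old vertex independently, the indegree of one fixed vertex is a Markov chain that can be simulated without building the whole graph: at step n it increases by one with probability f(k)/n. This gives a cheap Monte Carlo check of Proposition 1.10; the sketch and the parameter choices below are ours.

```python
import math, random

alpha = 0.4
f = lambda k: (k + 1) ** alpha

def degree_of_first_vertex(f, n_final, rng=random.Random(2)):
    """Indegree Z[1, n_final] of vertex 1, simulated in isolation."""
    k = 0
    for n in range(1, n_final):
        if rng.random() < f(k) / n:
            k += 1
    return k

n = 10 ** 6
Z = degree_of_first_vertex(f, n)
Phi_Z = sum(1.0 / f(j) for j in range(Z))     # artificial degree Phi(Z[1, n])
print(Z, Phi_Z / math.log(n))                 # ratio roughly 1 (convergence is slow)
```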
We denote by T := {Ψ(n) : n ∈ N} the set of artificial times, and by S := {Φ(j) : j ∈ N ∪ {0}} the set of artificial degrees. From now on, we refer by time to the artificial time, and by (in-)degree to the artificial degree. Further, we introduce a new real-valued process (Z[s,t])_{s∈T, t≥0} via
$Z[s,t] := \Phi(Z[m,n]) \qquad \text{if } s=\Psi(m),\ t=\Psi(n) \text{ and } m\le n,$
and extend the definition to arbitrary t by letting Z[s,t] := Z[s, s ∨ max(T ∩ [0,t])]. For notational convenience we extend the definition of f to [0,∞) by setting f(u) := f(⌊u⌋) for all u ∈ [0,∞), so that
$\Phi(u) = \int_0^u \frac{1}{f(v)}\, dv.$
We denote by L[0,∞) the space of càdlàg functions x : [0,∞) → R endowed with the topology of uniform convergence on compact subsets of [0,∞).
[Figure 1 (two panels): indegree evolutions of two nodes, left panel linear attachment, right panel weak attachment; axes: artificial time vs. artificial indegree.]
Figure 1: The degree evolution of a vertex in the artificial scaling: In the strong preference case, on the left, the distance of degrees is converging to a positive constant; in the weak preference case, on the right, fluctuations are bigger and, as time goes to infinity, the probability that the younger vertex has bigger degree converges to 1/2.
Proposition 1.12 (Central limit theorem). In the case of weak preference, for all s ∈ T,
$\Big( \frac{Z[s,s+\varphi^*_{\kappa t}]-\varphi^*_{\kappa t}}{\sqrt{\kappa}} : t\ge0 \Big) \Rightarrow (W_t : t\ge0),$
in distribution on L[0,∞), where (W_t : t ≥ 0) is a standard Brownian motion and (φ*_t)_{t≥0} is the inverse of (φ_t)_{t≥0} given by
$\varphi_t = \int_0^{\Phi^{-1}(t)} \frac{1}{f(u)^2}\, du.$
We now briefly describe the background behind the findings above. In the artificial scaling, an indegree evolution is the sum of a linear drift and a martingale, and the perturbations induced by the martingale are of lower order than the drift term. Essentially, we have two cases: either the martingale converges almost surely, the strong preference regime, or the martingale diverges, the weak preference regime. The crucial quantity which separates both regimes is $\sum_{k=0}^{\infty} f(k)^{-2}$, with convergence leading to strong preference. Its appearance can be explained as follows: the expected artificial time a vertex spends having natural indegree k is 1/f(k). Moreover, the quadratic variation grows throughout that time approximately linearly with speed 1/f(k). Hence the quadratic variation of the martingale over the infinite time horizon behaves like the infinite sum above. In the weak preference regime, the quadratic variation process of the martingale converges to the function (φ_t) when scaled appropriately, explaining the central limit theorem. Moreover, the difference of two distinct indegree evolutions satisfies a central limit theorem as well, and it thus will be positive and negative for arbitrarily large times. In particular, this means that hubs cannot be persistent. In the case of strong preference, the quadratic variation is uniformly bounded and the martingales converge with probability one. Hence, in the artificial scaling, the relative distance of two indegree evolutions freezes for large times. As we will see, in the long run, late vertices have no chance of becoming a hub, since the probability of this happening decays too fast.
Investigations so far were centred around typical vertices in the network. Large deviation principles, as provided below, are the main tool to analyse exceptional vertices in the random network. Throughout we use the large-deviation terminology of Dembo and Zeitouni [1998] and, from this point on, the focus is on the weak preference case.
We set $\bar f := f\circ\Phi^{-1}$, and recall from Lemma A.1 in the appendix that we can represent $\bar f$ as
$\bar f(u) = u^{\frac{\alpha}{1-\alpha}}\,\bar\ell(u) \qquad \text{for } u>0,$
where $\bar\ell$ is a slowly varying function. This is the slowly varying function appearing in Theorem 1.8.
We denote by I[0,∞) the space of nondecreasing functions x : [0,∞) → R with x(0) = 0, endowed with the topology of uniform convergence on compact subintervals of [0,∞).
Theorem 1.13 (Large deviation principles). Under assumption (1), for every s ∈ T, the family of functions
$\Big( \tfrac{1}{\kappa}\, Z[s,s+\kappa t] : t\ge0 \Big)_{\kappa>0}$
satisfies large deviation principles on the space I[0,∞),
• with speed $\big(\kappa^{\frac{1}{1-\alpha}}\,\bar\ell(\kappa)\big)$ and good rate function
$J(x) = \begin{cases} \int_0^\infty x_t^{\frac{\alpha}{1-\alpha}}\,\big[1-\dot x_t+\dot x_t\log\dot x_t\big]\, dt & \text{if } x \text{ is absolutely continuous,}\\ \infty & \text{otherwise;}\end{cases}$
• and with speed (κ) and good rate function
$K(x) = \begin{cases} a\,f(0) & \text{if } x_t=(t-a)^+ \text{ for some } a\ge0,\\ \infty & \text{otherwise.}\end{cases}$
Remark 1.14. The large deviation principle states, in particular, that the most likely deviation from the growth behaviour in the law of large numbers is having zero indegree for a long time and after that time typical behaviour kicking in. Indeed, it is elementary to see that a delay time of aκ has a probability of $e^{-a\kappa f(0)+o(\kappa)}$, as κ ↑ ∞.
More important for our purpose is a moderate deviation principle, which describes deviations on a finer scale. Similarly as before, we denote by L(0,∞) the space of càdlàg functions x : (0,∞) → R endowed with the topology of uniform convergence on compact subsets of (0,∞), and always use the convention $x_0 := \liminf_{t\downarrow0} x_t$.
Theorem 1.15 (Moderate deviation principle). Suppose (1) and that (a_κ) is regularly varying, so that the limit
$c := \lim_{\kappa\uparrow\infty} a_\kappa\, \kappa^{\frac{2\alpha-1}{1-\alpha}}\, \bar\ell(\kappa) \in [0,\infty)$
exists. If $\kappa^{\frac{1-2\alpha}{2-2\alpha}}\,\bar\ell(\kappa)^{-\frac12} \ll a_\kappa \ll \kappa$, then, for any s ∈ T, the family of functions
$\Big( \frac{Z[s,s+\kappa t]-\kappa t}{a_\kappa} : t\ge0 \Big)$
satisfies a large deviation principle on L(0,∞) with speed $\big(a_\kappa^2\,\kappa^{\frac{2\alpha-1}{1-\alpha}}\,\bar\ell(\kappa)\big)$ and good rate function
$I(x) = \begin{cases} \frac12\int_0^\infty (\dot x_t)^2\, t^{\frac{\alpha}{1-\alpha}}\, dt - \frac1c\, f(0)\, x_0 & \text{if } x \text{ is absolutely continuous and } x_0\le0,\\ \infty & \text{otherwise,}\end{cases}$
where we use the convention 1/0 = ∞.
Remark 1.16. If c = ∞ there is still a moderate deviation principle on the space of functions x : (0,∞) → R with the topology of pointwise convergence. However, the rate function I, which has the same form as above with 1/∞ interpreted as zero, fails to be a good rate function.
Let us heuristically derive the moderate deviation principle from the large deviation principle. Let (y_t)_{t≥0} denote an absolutely continuous path with y_0 ≤ 0. We are interested in the probability
$P\Big( \frac{Z[s,s+\kappa t]-\kappa t}{a_\kappa} \approx y_t \Big) = P\Big( \frac{Z[s,s+\kappa t]}{\kappa} \approx t+\frac{a_\kappa}{\kappa}\, y_t \Big).$
Now note that x ↦ 1 − x + x log x attains its minimal value in one, and the corresponding second order differential is one. Consequently, using the large deviation principle together with Taylor's formula we get
$\log P\Big( \frac{Z[s,s+\kappa t]-\kappa t}{a_\kappa} \approx y_t \Big) \sim -\frac12\,\kappa^{\frac{2\alpha-1}{1-\alpha}}\,\bar\ell(\kappa)\, a_\kappa^2 \int_0^\infty t^{\frac{\alpha}{1-\alpha}}\, \dot y_t^2\, dt \;-\; a_\kappa f(0)\,|y_0|.$
Here, the second term comes from the second large deviation principle. If c is zero, then the second term is of higher order and a path (y_t)_{t≥0} has to start in 0 in order to have finite rate. If c ∈ (0,∞), then both terms are of the same order. In particular, there are paths with finite rate that do not start in zero. The case c = ∞ is excluded in the moderate deviation principle and it will not be considered in this article. As the heuristic computations indicate, in that case the second term vanishes, which means that the starting value has no influence on the rate as long as it is negative. Hence, one can either prove an analogue of the second large deviation principle, or one can consider a scaling where the first term gives the main contribution and the starting value has no influence on the rate as long as it is negative. In the latter case one obtains a rate function which is no longer good.
Remark 1.17. Under assumption (1) the central limit theorem of Proposition 1.12 can be stated as a complement to the moderate deviation principle: for $a_\kappa \sim \kappa^{\frac{1-2\alpha}{2-2\alpha}}\,\bar\ell(\kappa)^{-\frac12}$, we have
$\Big( \frac{Z[s,s+\kappa t]-\kappa t}{a_\kappa} : t\ge0 \Big) \Rightarrow \Big( \sqrt{\tfrac{1-\alpha}{1-2\alpha}}\; W_{t^{\frac{1-2\alpha}{1-\alpha}}} : t\ge0 \Big).$
See Section 2.1 for details.
We now state the refined version of Theorem 1.8 in the artificial scaling. It is straightforward to derive Theorem 1.8 from Proposition 1.18. The result relies on the moderate deviation principle above.
Proposition 1.18 (Limit law for age and degree of the vertex of maximal degree). Suppose f satisfies assumption (1) and recall the definition of $\bar\ell$ from the paragraph preceding Theorem 1.13. Defining s*_t to be the index of the hub at time t, one has, in probability,
$s^*_t \sim Z[s^*_t,t]-t \sim \frac12\,\frac{1-\alpha}{1-2\alpha}\,\frac{t^{\frac{1-2\alpha}{1-\alpha}}}{\bar\ell(t)} = \frac12\,\frac{1-\alpha}{1-2\alpha}\,\frac{t}{\bar f(t)}.$
Moreover, in probability on L(0,∞),
$\lim_{t\to\infty}\Big( \frac{Z[s^*_t,s^*_t+tu]-tu}{t^{\frac{1-2\alpha}{1-\alpha}}\,\bar\ell(t)^{-1}} : u\ge0 \Big) = \Big( \frac{1-\alpha}{1-2\alpha}\,\big(u^{\frac{1-2\alpha}{1-\alpha}}\wedge1\big) : u\ge0 \Big).$
The remainder of this paper is devoted to the proofs of the results of this and the preceding section. Rather than proving the results in the order in which they are stated, we proceed by the techniques used. Section 2 is devoted to martingale techniques, which in particular prove the law of large numbers, Proposition 1.10, and the central limit theorem, Proposition 1.12. We also prove absolute continuity of the law of the martingale limit, which is crucial in the proof of Theorem 1.5. Section 3 uses Markov chain techniques and provides the proof of Theorem 1.1. In Section 4 we collect the large deviation techniques, proving Theorem 1.13 and Theorem 1.15. Section 5 combines the various techniques to prove our main result, Theorem 1.5, along with Proposition 1.18. An appendix collects auxiliary statements from the theory of regular variation and some useful concentration inequalities.
2 Martingale techniques
In this section we show that in the artificial scaling, the indegree evolution of a vertex can be written as a martingale plus a linear drift term. As explained before, this martingale and its quadratic variation play a vital role in our understanding of the network.
2.1 Martingale convergence
Lemma 2.1. Fix s ∈ T and represent Z[s,·] as
$Z[s,t] = t-s+M_t.$
Then (M_t)_{t∈T, t≥s} is a martingale. Moreover, the martingale converges almost surely if and only if
$\int_0^\infty \frac{1}{f(u)^2}\, du < \infty, \qquad (4)$
and otherwise it satisfies the following functional central limit theorem: Let
$\varphi_t := \int_0^{\Phi^{-1}(t)} \frac{1}{f(v)^2}\, dv = \int_0^t \frac{1}{\bar f(v)}\, dv$
and denote by φ* : [0,∞) → [0,∞) the inverse of (φ_t); then the martingales
$M^\kappa := \Big( \frac{1}{\sqrt{\kappa}}\, M_{s+\varphi^*_{\kappa t}} \Big)_{t\ge0} \qquad \text{for } \kappa>0$
converge in distribution to standard Brownian motion as κ tends to infinity. In any case the processes $(\tfrac1\kappa Z[s,s+\kappa t])_{t\ge0}$ converge, as κ ↑ ∞, almost surely, in L[0,∞) to the identity.
Proof. For t = Ψ(n) ∈ T we denote by ∆t the distance between t and its right neighbour in T, i.e.
$\Delta t = \frac1n = \frac{1}{\Psi^{-1}(t)}. \qquad (5)$
One has
$E\big[Z[s,t+\Delta t]-Z[s,t]\,\big|\,\mathcal G_n\big] = E\big[\Phi\circ Z[i,n+1]-\Phi\circ Z[i,n]\,\big|\,\mathcal G_n\big] = \frac{f(Z[i,n])}{n}\times\frac{1}{f(Z[i,n])} = \frac1n.$
Moreover, recalling the definition of the bracket ⟨M⟩, e.g. from 12.12 in Williams [1991], we have
$\langle M\rangle_{t+\Delta t}-\langle M\rangle_t = \big(1-\bar f(Z[s,t])\,\Delta t\big)\,\frac{1}{\bar f(Z[s,t])}\,\Delta t \;\le\; \frac{1}{\bar f(Z[s,t])}\,\Delta t. \qquad (6)$
Observe that by Doob's L²-inequality (see, e.g., 14.11 in Williams [1991]) and the uniform boundedness of $\bar f(\cdot)^{-1}$ one has
$a_i := \frac{E\big[\sup_{s\le t\le s+2^{i+1}}|M_t|^2\big]}{2^{i}\,(\log 2^{i})^2} \le \frac{4\,E|M_{\max(T\cap[0,s+2^{i+1}])}|^2}{2^{i}\,(\log 2^{i})^2} = \frac{4\,E\langle M\rangle_{\max(T\cap[0,s+2^{i+1}])}}{2^{i}\,(\log 2^{i})^2} \le C\,\frac{1}{i^2},$
where C > 0 is a constant only depending on f(0). Moreover, by Chebyshev's inequality, one has
$P\Big( \sup_{t\ge s+2^i} \frac{|M_t|}{\sqrt{t-s}\,\log(t-s)} \ge 1 \Big) \le \sum_{k=i}^\infty P\Big( \sup_{s+2^k\le t\le s+2^{k+1}} \frac{M_t^2}{(t-s)\log^2(t-s)} \ge 1 \Big) \le \sum_{k=i}^\infty E\Big[ \sup_{s+2^k\le t\le s+2^{k+1}} \frac{M_t^2}{(t-s)\log^2(t-s)} \Big] \le \sum_{k=i}^\infty a_k.$
As $\sum a_k < \infty$, letting i tend to infinity, we conclude that almost surely
$\limsup_{t\to\infty} \frac{|M_t|}{\sqrt{t-s}\,\log(t-s)} \le 1. \qquad (7)$
In particular, we obtain almost sure convergence of $(\tfrac1\kappa Z[s,s+\kappa t])_{t\ge0}$ to the identity. As a consequence of (6), for any ε > 0, there exists a random almost surely finite constant η = η(ω,ε) such that, for all t ≥ s,
$\langle M\rangle_t \le \int_0^{t-s} \frac{1}{f(\Phi^{-1}((1-\epsilon)u))}\, du + \eta.$
Note that Φ : [0,∞) → [0,∞) is bijective and substituting (1−ε)u by Φ(v) leads to
$\langle M\rangle_t \le \frac{1}{1-\epsilon} \int_0^{\Phi^{-1}((1-\epsilon)(t-s))} \frac{1}{f(v)^2}\, dv + \eta \;\le\; \frac{1}{1-\epsilon} \int_0^{\Phi^{-1}(t-s)} \frac{1}{f(v)^2}\, dv + \eta.$
Thus, condition (4) implies convergence of the martingale (M_t).
We now assume that (φ_t)_{t≥0} converges to infinity. Since ε > 0 was arbitrary the above estimate implies that
$\limsup_{t\to\infty} \frac{\langle M\rangle_t}{\varphi_{t-s}} \le 1, \qquad \text{almost surely.}$
To conclude the converse estimate note that $\sum_{t\in T}(\Delta t)^2 < \infty$, so that we get with (6) and (7) that
$\langle M\rangle_t \ge \int_0^{t-s} \frac{1}{f(\Phi^{-1}((1+\epsilon)u))}\, du - \eta \;\ge\; \frac{1}{1+\epsilon} \int_0^{\Phi^{-1}(t-s)} \frac{1}{f(v)^2}\, dv - \eta,$
for an appropriate finite random variable η. Therefore,
$\lim_{t\to\infty} \frac{\langle M\rangle_t}{\varphi_{t-s}} = 1 \qquad \text{almost surely.} \qquad (8)$
The jumps of M^κ are uniformly bounded by a deterministic value that tends to zero as κ tends to ∞. By a functional central limit theorem for martingales (see, e.g., Theorem 3.11 in Jacod and Shiryaev [2003]), the central limit theorem follows once we establish that, for any t ≥ 0,
$\lim_{\kappa\to\infty} \langle M^\kappa\rangle_t = t \qquad \text{in probability,}$
which is an immediate consequence of (8).
Proof of Remark 1.17. We suppose that f is regularly varying with index α < 1/2. By the central limit theorem the processes
$(Y^\kappa_t : t\ge0) := \Big( \frac{Z[s,s+\varphi^*_{t\varphi_\kappa}]-\varphi^*_{t\varphi_\kappa}}{\sqrt{\varphi_\kappa}} : t\ge0 \Big) \qquad \text{for } \kappa>0$
converge in distribution to the Wiener process (W_t) as κ tends to infinity. For each κ > 0 we consider the time change $(\tau^\kappa_t)_{t\ge0} := (\varphi_{\kappa t}/\varphi_\kappa)$. Using that φ is regularly varying with parameter $\frac{1-2\alpha}{1-\alpha}$ we find uniform convergence on compacts:
$(\tau^\kappa_t) \to (t^{\frac{1-2\alpha}{1-\alpha}}) =: (\tau_t) \qquad \text{as } \kappa\to\infty.$
Therefore,
$\Big( \frac{Z[s,s+\kappa t]-\kappa t}{\sqrt{\varphi_\kappa}} : t\ge0 \Big) = (Y^\kappa_{\tau^\kappa_t} : t\ge0) \Rightarrow (W_{\tau_t} : t\ge0),$
and, as shown in Lemma A.1, $\varphi_\kappa \sim \frac{1-\alpha}{1-2\alpha}\,\kappa^{\frac{1-2\alpha}{1-\alpha}}/\bar\ell(\kappa)$.
2.2 Absolute continuity of the law of M∞
In the sequel, we consider the martingale (M_t)_{t≥s, t∈T} given by Z[s,t] − (t−s) for a fixed s ∈ T in the case of strong preference. We denote by M∞ the limit of the martingale.
Proposition 2.2. If f is concave, then the distribution of M∞ is absolutely continuous with respect to Lebesgue measure.
Proof. For ease of notation, we denote Y_t = Z[s,t], for t ∈ T, t ≥ s. Moreover, we fix c > 0 and let A_t denote the event that Y_u ∈ [u−c, u+c] for all u ∈ [s,t] ∩ T. Now observe that for two neighbours v_- and v in S
$P\big(\{Y_{t+\Delta t}=v\}\cap A_t\big) = \big(1-\bar f(v)\,\Delta t\big)\,P\big(\{Y_t=v\}\cap A_t\big) + \bar f(v_-)\,\Delta t\,P\big(\{Y_t=v_-\}\cap A_t\big). \qquad (9)$
Again we use the notation $\Delta t = \frac{1}{\Psi^{-1}(t)}$. Moreover, we denote $\Delta\bar f(v) = \bar f(v)-\bar f(v_-)$. In the first step of the proof we derive an upper bound for
$h(t) = \max_{v\in S} P\big(\{Y_t=v\}\cap A_t\big)$
for t ∈ T, t ≥ s. With (9) we conclude that
$P\big(\{Y_{t+\Delta t}=v\}\cap A_t\big) \le \big(1-\Delta\bar f(v)\,\Delta t\big)\, h(t).$
For w ≥ 0 we denote ς(w) = max S ∩ [0,w]. Due to the concavity of f, we get that
$h(t+\Delta t) \le \big(1-\Delta\bar f(\varsigma(t+c+1))\,\Delta t\big)\, h(t).$
Consequently,
$h(t) \le \prod_{u\in[s,t)\cap T} \big(1-\Delta\bar f(\varsigma(u+c+1))\,\Delta u\big),$
and using that log(1+x) ≤ x we obtain
$h(t) \le \exp\Big( -\sum_{u\in[s,t)\cap T} \Delta\bar f(\varsigma(u+c+1))\,\Delta u \Big). \qquad (10)$
We continue with estimating the sum Σ in the latter exponential:
$\Sigma = \sum_{u\in[s,t)\cap T} \Delta\bar f(\varsigma(u+c+1))\,\Delta u \;\ge\; \int_s^t \Delta\bar f(\varsigma(u+c+1))\, du.$
Next, we denote by f^lin the continuous piecewise linear interpolation of f|_{N∪{0}}. Analogously, we set $\Phi^{\mathrm{lin}}(v) = \int_0^v \frac{1}{f^{\mathrm{lin}}(u)}\,du$ and $\bar f^{\mathrm{lin}}(v) = f^{\mathrm{lin}}\circ(\Phi^{\mathrm{lin}})^{-1}(v)$. Using again the concavity of f we conclude that
$\int_s^t \Delta\bar f(\varsigma(u+c+1))\,du \ge \int_s^t (f^{\mathrm{lin}})'(\Phi^{-1}(u+c+1))\,du,$
and that, since $(f^{\mathrm{lin}})'$ is nonincreasing and $\Phi^{-1}\le(\Phi^{\mathrm{lin}})^{-1}$, one has $(f^{\mathrm{lin}})'\circ\Phi^{-1} \ge (f^{\mathrm{lin}})'\circ(\Phi^{\mathrm{lin}})^{-1}$.
Hence,
$\Sigma \ge \int_s^t (f^{\mathrm{lin}})'(\Phi^{-1}(u+c+1))\,du \ge \int_s^t (f^{\mathrm{lin}})'\circ(\Phi^{\mathrm{lin}})^{-1}(u+c+1)\,du.$
For Lebesgue almost all arguments one has
$(\bar f^{\mathrm{lin}})' = \big(f^{\mathrm{lin}}\circ(\Phi^{\mathrm{lin}})^{-1}\big)' = (f^{\mathrm{lin}})'\circ(\Phi^{\mathrm{lin}})^{-1}\cdot\big((\Phi^{\mathrm{lin}})^{-1}\big)' = (f^{\mathrm{lin}})'\circ(\Phi^{\mathrm{lin}})^{-1}\cdot f^{\mathrm{lin}}\circ(\Phi^{\mathrm{lin}})^{-1},$
so that
$(f^{\mathrm{lin}})'\circ(\Phi^{\mathrm{lin}})^{-1} = \frac{(\bar f^{\mathrm{lin}})'}{\bar f^{\mathrm{lin}}} = (\log\bar f^{\mathrm{lin}})'.$
Consequently,
$\Sigma \ge \log\bar f^{\mathrm{lin}}(t+c+1) - \log\bar f^{\mathrm{lin}}(s+c+1).$
Using that $f^{\mathrm{lin}}\ge f$ and $(\Phi^{\mathrm{lin}})^{-1}\ge\Phi^{-1}$ we finally get that
$\Sigma \ge \log\bar f(t+c+1) - \log c^*,$
where c* is a positive constant not depending on t. Plugging this estimate into (10) we get
$h(t) \le \frac{c^*}{\bar f(t+c+1)}.$
Fix now an interval I ⊂ R of finite length and note that
$P\big(\{M_t\in I\}\cap A_t\big) = P\big(\{Y_t\in t-s+I\}\cap A_t\big) \le \#\big[(t-s+I)\cap S\cap A_t\big]\cdot h(t).$
Now (t−s+I) ∩ S ∩ A_t is a subset of [t−c, t+c] and the minimal distance of two distinct elements is bigger than $\frac{1}{\bar f(t+c)}$. Therefore, $\#[(t-s+I)\cap S\cap A_t] \le |I|\,\bar f(t+c)+1$, and
$P\big(\{M_t\in I\}\cap A_t\big) \le c^*|I| + \frac{c^*}{\bar f(t+c)}.$
Moreover, for any open and thus immediately also for any arbitrary interval I one has
$P\big(\{M_\infty\in I\}\cap A_\infty\big) \le \liminf_{t\to\infty} P\big(\{M_t\in I\}\cap A_t\big) \le c^*|I|,$
where $A_\infty = \bigcap_{t\in[s,\infty)\cap T} A_t$. Consequently, the Borel measure µ_c on R given by $\mu_c(E) = E[\mathbb 1_{A_\infty}\mathbb 1_E(M_\infty)]$ is absolutely continuous with respect to Lebesgue measure. The distribution µ of M∞, i.e. $\mu(E) = E[\mathbb 1_E(M_\infty)]$, can be written as a monotone limit of the absolutely continuous measures µ_c, as c ↑ ∞, and it is therefore also absolutely continuous.
3 The empirical indegree distribution
In this section we prove Theorem 1.1. For k ∈ N ∪ {0} and n ∈ N let µ_k(n) = E[X_k(n)] and µ(n) = (µ_k(n))_{k∈N∪{0}}. We first prove part (a), i.e. that X(n) converges almost surely to µ, as n → ∞, in the total variation norm. To do this we associate the sequence (µ(n))_{n∈N} with a time inhomogeneous Markov chain on N ∪ {0}, which has µ as invariant measure. Then a coupling argument proves convergence of µ(n) to µ, as n → ∞. Finally, the almost sure convergence of X(n) is verified by a mass concentration argument based on the Chernoff inequality.
We then prove part (b), i.e. that the outdegree of new vertices is asymptotically Poisson with intensity ⟨µ,f⟩. Here we derive and analyze a difference equation for the total sum of indegrees in the linear model, see (14). The general result is then obtained by a stochastic domination argument.
3.1 Proof of Theorem 1.1 (a)
We start by deriving a recursive representation for µ(n). For k ∈ N ∪ {0},
$E[X_k(n+1)\,|\,X(n)] = \frac{1}{n+1}\Big( \sum_{i=1}^{n} E\big[ -\mathbb 1_{\{Z[i,n]=k<Z[i,n+1]\}} + \mathbb 1_{\{Z[i,n]<k=Z[i,n+1]\}} \,\big|\, X(n)\big] + n X_k(n) + \mathbb 1_{\{k=0\}} \Big)$
$= X_k(n) + \frac{1}{n+1}\Big[ -n X_k(n)\,\frac{f(k)}{n} + n X_{k-1}(n)\,\frac{f(k-1)}{n} - X_k(n) + \mathbb 1_{\{k=0\}} \Big].$
Thus the linearity and the tower property of conditional expectation gives
$\mu_k(n+1) = \mu_k(n) + \frac{1}{n+1}\big( f(k-1)\mu_{k-1}(n) - (1+f(k))\mu_k(n) + \mathbb 1_{\{k=0\}} \big).$
Now defining Q ∈ R^{N×N} as
$Q = \begin{pmatrix} -f(0) & f(0) & & & \\ 1 & -(f(1)+1) & f(1) & & \\ 1 & & -(f(2)+1) & f(2) & \\ \vdots & & & \ddots & \ddots \end{pmatrix} \qquad (11)$
(the entries in the first column are 1 from the second row onwards) and conceiving µ(n) as a row vector we can rewrite the recursive equation as
$\mu(n+1) = \mu(n)\Big( I + \frac{1}{n+1}\,Q \Big),$
where I = (δ_{i,j})_{i,j∈N} denotes the unit matrix. Next we show that µ is a probability distribution with µQ = 0. By induction, we get that
$1 - \sum_{l=0}^{k}\mu_l = \prod_{l=0}^{k} \frac{f(l)}{1+f(l)} \qquad \text{for any } k\in N\cup\{0\}.$
Since $\sum_{l=0}^{\infty} 1/f(l) \ge \sum_{l=0}^{\infty} 1/(l+1) = \infty$ it follows that µ is a probability measure on the set N ∪ {0}. Moreover, it is straightforward to verify that
$f(0)\mu_0 = 1-\mu_0 = \sum_{l=1}^{\infty}\mu_l$
and that for all k ∈ N
$f(k-1)\mu_{k-1} = (1+f(k))\mu_k,$
hence µQ = 0.
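The recursion µ(n+1) = µ(n)(I + Q/(n+1)) and the invariance µQ = 0 are easy to verify numerically on a truncated state space. The sketch below is ours; the rule f(k) = (k+1)^{1/2}, the truncation level and the number of iterations are illustrative assumptions (the truncation causes a tiny mass leak through the highest state).

```python
def f(k):
    return (k + 1) ** 0.5

K = 80                                   # truncation of the state space
mu = [1.0] + [0.0] * K                   # mu(1): the single initial vertex has indegree 0

for n in range(1, 100000):
    new = [0.0] * (K + 1)
    for k in range(K + 1):
        inflow = f(k - 1) * mu[k - 1] if k > 0 else 1.0     # the 1l{k=0} term
        new[k] = mu[k] + (inflow - (1.0 + f(k)) * mu[k]) / (n + 1)
    mu = new

# compare with the explicit limit distribution from Theorem 1.1
prod, err = 1.0, 0.0
for k in range(K + 1):
    mu_k = prod / (1.0 + f(k))
    prod *= f(k) / (1.0 + f(k))
    err = max(err, abs(mu[k] - mu_k))
print("maximal deviation from the limit distribution:", err)
```

The deviation keeps shrinking (slowly) as the number of iterations grows, in line with the convergence µ(n) → µ established in this subsection.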
Now we use the matrices $P^{(n)} := I + \frac{1}{n+1}Q$ to define an inhomogeneous Markov process. The entries of each row of P^{(n)} sum up to 1, but (as long as f is not bounded) each P^{(n)} contains negative entries. Nonetheless one can use the P^{(n)} as time inhomogeneous Markov kernels as long as at the starting time m ∈ N the starting state l ∈ N ∪ {0} satisfies l ≤ m − 1.
We denote for any admissible pair l, m by $(Y^{l,m}_n)_{n\ge m}$ a Markov chain starting at time m in state l having transition kernels $(P^{(n)})_{n\ge m}$. Due to the recursive equation we now have
$\mu_k(n) = P(Y^{0,1}_n = k).$
Next, fix k ∈ N ∪ {0}, let m > k be arbitrary, and denote by ν the restriction of µ to the set {m, m+1, ...}. Since µ is invariant under each P^{(n)} we get
$\mu_k = (\mu P^{(m)}\cdots P^{(n)})_k = \sum_{l=0}^{m-1} \mu_l\, P(Y^{l,m}_n = k) + (\nu P^{(m)}\cdots P^{(n)})_k.$
Note that in the n-th step of the Markov chain, the probability to jump to state zero is $\frac{1}{n+1}$ for all original states in {1, ..., n−1} and bigger than $\frac{1}{n+1}$ for the original state 0. Thus one can couple the Markov chains $(Y^{l,m}_n)$ and $(Y^{0,1}_n)$ in such a way that
$P\big(Y^{l,m}_{n+1} = Y^{0,1}_{n+1} = 0 \,\big|\, Y^{l,m}_n \ne Y^{0,1}_n\big) = \frac{1}{n+1},$
and that once the processes meet at one site they stay together. Then
$P(Y^{l,m}_n = Y^{0,1}_n) \ge 1 - \prod_{i=m}^{n-1} \frac{i}{i+1} \longrightarrow 1.$
Thus $(\nu P^{(m)}\cdots P^{(n)})_k \in [0,\mu([m,\infty))]$ implies that
$\limsup_{n\to\infty}\Big| \mu_k - P(Y^{0,1}_n=k)\sum_{l=0}^{m-1}\mu_l \Big| \le \mu([m,\infty)).$
As m → ∞ we thus get that
$\lim_{n\to\infty} \mu_k(n) = \mu_k.$
In the next step we show that the sequence of the empirical indegree distributions (X(n))_{n∈N} converges almost surely to µ. Note that n X_k(n) is a sum of n independent Bernoulli random variables. Thus Chernoff's inequality (Chernoff [1981]) implies that for any t > 0
$P\big( X_k(n) \le E[X_k(n)] - t \big) \le e^{-n t^2/(2E[X_k(n)])} = e^{-n t^2/(2\mu_k(n))}.$
Since
$\sum_{n=1}^{\infty} e^{-n t^2/(2\mu_k(n))} < \infty,$
the Borel-Cantelli lemma implies that almost surely $\liminf_{n\to\infty} X_k(n) \ge \mu_k$ for all k ∈ N ∪ {0}. If A ⊂ N ∪ {0} we thus have by Fatou's lemma
$\liminf_{n\to\infty} \sum_{k\in A} X_k(n) \ge \sum_{k\in A} \liminf_{n\to\infty} X_k(n) \ge \mu(A).$
Noting that µ is a probability measure and passing to the complementary events, we also get
$\limsup_{n\to\infty} \sum_{k\in A} X_k(n) \le \mu(A).$
Hence, given ε > 0, we can pick N ∈ N so large that µ((N,∞)) < ε, and obtain for the total variation norm
$\limsup_{n\uparrow\infty} \frac12 \sum_{k=0}^{\infty} \big|X_k(n)-\mu_k\big| \le \limsup_{n\uparrow\infty} \frac12 \sum_{k=0}^{N} \big|X_k(n)-\mu_k\big| + \frac12 \lim_{n\uparrow\infty} \sum_{k=N+1}^{\infty} X_k(n) + \frac12\,\mu((N,\infty)) \le \epsilon.$
This establishes almost sure convergence of (X(n)) to µ in the total variation norm.
3.2 Proof of Theorem 1.1 (b)
We now show that the conditional law of the outdegree of a new node converges almost surely in the weak topology to a Poisson distribution. In the first step we will prove that, for η ∈ (0,1) and the affine linear attachment rule f(k) = ηk + 1, one has almost sure convergence of $Y_n := \frac1n \sum_{m=1}^n Z[m,n] = \langle X(n),\mathrm{id}\rangle$ to y := 1/(1−η). First observe that
$Y_{n+1} = \frac{1}{n+1}\Big( nY_n + \sum_{m=1}^n \Delta Z[m,n] \Big) = Y_n + \frac{1}{n+1}\Big( -Y_n + \sum_{m=1}^n \Delta Z[m,n] \Big),$
where ∆Z[m,n] := Z[m,n+1] − Z[m,n]. Given the past F_n of the network formation, each ∆Z[m,n] is independent Bernoulli distributed with success probability $\frac1n(\eta Z[m,n]+1)$. Consequently,
$E[Y_{n+1}\,|\,\mathcal F_n] = Y_n + \frac{1}{n+1}\Big( -Y_n + \sum_{m=1}^n \frac1n(\eta Z[m,n]+1) \Big) = Y_n + \frac{1}{n+1}\big( -(1-\eta)Y_n + 1 \big),$
and
$\langle Y\rangle_{n+1} - \langle Y\rangle_n \le \Big(\frac{1}{n+1}\Big)^2 \sum_{m=1}^n \frac1n(\eta Z[m,n]+1) = \frac{1}{(n+1)^2}\,\big[\eta Y_n + 1\big]. \qquad (12)$
Now note that due to Theorem 1.5 (which can be used here, as it will be proved independently of this section) there is a single node that has maximal indegree for all but finitely many times. Let m* denote the random node with this property. With Remark 1.11 we conclude that almost surely
$\log Z[m^*,n] \sim \eta\,\log n. \qquad (13)$
Since for sufficiently large n
$Y_n = \frac1n\sum_{m=1}^n Z[m,n] \le Z[m^*,n],$
we conclude from (12) and (13) that the quadratic variation $\langle Y\rangle_n$ stays almost surely bounded and therefore converges.
Next, represent the increment Y_{n+1} − Y_n as
$Y_{n+1} - Y_n = \frac{1}{n+1}\big( -(1-\eta)Y_n + 1 \big) + \Delta M_{n+1}, \qquad (14)$
where ∆M_{n+1} denotes a martingale difference. We shall denote by (M_n)_{n∈N} the corresponding martingale, that is $M_n = \sum_{m=2}^n \Delta M_m$. Since ⟨Y⟩ is convergent, the martingale (M_n) converges almost surely. Next, we represent (14) in terms of $\bar Y_n = Y_n - y$ as the following inhomogeneous linear difference equation of first order:
$\bar Y_{n+1} = \Big( 1 - \frac{1-\eta}{n+1} \Big)\bar Y_n + \Delta M_{n+1}.$
The corresponding starting value is $\bar Y_1 = Y_1 - y = -y$, and we can represent its solution as
$\bar Y_n = -y\, h^n_1 + \sum_{m=2}^n \Delta M_m\, h^n_m$
for
$h^n_m := \begin{cases} 0 & \text{if } n<m,\\ \prod_{l=m+1}^{n} \big(1-\frac{1-\eta}{l}\big) & \text{if } n\ge m.\end{cases}$
Setting $\Delta h^n_m = h^n_m - h^n_{m-1}$ we conclude with an integration by parts argument that
$\sum_{m=2}^n \Delta M_m\, h^n_m = \sum_{m=2}^n \Delta M_m\Big( 1 - \sum_{k=m+1}^n \Delta h^n_k \Big) = M_n - \sum_{m=2}^n\sum_{k=m+1}^n \Delta M_m\,\Delta h^n_k = M_n - \sum_{k=3}^n\sum_{m=2}^{k-1} \Delta M_m\,\Delta h^n_k = M_n - \sum_{k=3}^n M_{k-1}\,\Delta h^n_k. \qquad (15)$
Note that $h^n_m$ and $\Delta h^n_m$ tend to 0 as n tends to infinity, so that $\sum_{k=m+1}^n \Delta h^n_k = 1-h^n_m$ tends to 1. With $M_\infty := \lim_{n\to\infty} M_n$ and $\epsilon_m = \sup_{n\ge m}|M_n - M_\infty|$ we derive for m ≤ n
$\Big| M_n - \sum_{k=3}^m M_{k-1}\Delta h^n_k - \sum_{k=m+1}^n M_{k-1}\Delta h^n_k \Big| \le \underbrace{|M_n - M_\infty|}_{\to 0} + \underbrace{\sum_{k=3}^m |M_{k-1}|\,\Delta h^n_k}_{\to 0} + \underbrace{\Big| \sum_{k=m+1}^n (M_\infty - M_{k-1})\,\Delta h^n_k \Big|}_{\le\, \epsilon_m} + \underbrace{\Big( 1-\sum_{k=m+1}^n \Delta h^n_k \Big)|M_\infty|}_{\to 0}.$
Since $\lim_{m\to\infty}\epsilon_m = 0$, almost surely, we thus conclude with (15) that $\sum_{m=2}^n \Delta M_m h^n_m$ tends to 0.
Consequently, $\lim_{n\to\infty} Y_n = y$, almost surely. Next, we show that also ⟨µ, id⟩ = y. Recall that µ is the unique invariant distribution satisfying µQ = 0 (see (11) for the definition of Q). This implies that for any k ∈ N
$f(k-1)\mu_{k-1} - (f(k)+1)\mu_k = 0, \qquad \text{or equivalently,} \qquad \mu_k = f(k-1)\mu_{k-1} - f(k)\mu_k.$
Thus
$\langle\mu,\mathrm{id}\rangle = \sum_{k=1}^\infty k\,\mu_k = \sum_{k=1}^\infty k\,\big( f(k-1)\mu_{k-1} - f(k)\mu_k \big).$
One cannot split the sum into two sums since the individual sums are not summable. However, noticing that the individual term $f(k)\mu_k\,k \approx k^2\mu_k$ tends to 0, we can rearrange the summands to obtain
$\langle\mu,\mathrm{id}\rangle = f(0)\mu_0 + \sum_{k=1}^\infty f(k)\mu_k = \langle\mu,f\rangle = \eta\langle\mu,\mathrm{id}\rangle + 1.$
This implies that ⟨µ, id⟩ = y and that for any m ∈ N
$\langle X(n),\mathbb 1_{[m,\infty)}\cdot\mathrm{id}\rangle = \langle X(n),\mathrm{id}\rangle - \langle X(n),\mathbb 1_{[0,m)}\cdot\mathrm{id}\rangle \to \langle\mu,\mathbb 1_{[m,\infty)}\cdot\mathrm{id}\rangle, \qquad \text{almost surely.}$
Now, we switch to general attachment rules. We denote by f an arbitrary attachment rule that is dominated by an affine attachment rule f_a. The corresponding degree evolutions will be denoted by (Z[m,n]) and (Z_a[m,n]), respectively. Moreover, we denote by µ and µ_a the limit distributions of the empirical indegree distributions. Since by assumption f ≤ f_a, one can couple both degree evolutions such that Z[m,n] ≤ Z_a[m,n] for all n ≥ m ≥ 0. Now
$\langle X(n),f\rangle \le \langle X(n),\mathbb 1_{[0,m)}\cdot f\rangle + \langle X_a(n),\mathbb 1_{[m,\infty)}\cdot f_a\rangle,$
so that almost surely
$\limsup_{n\to\infty} \langle X(n),f\rangle \le \langle\mu,\mathbb 1_{[0,m)}\cdot f\rangle + \langle\mu_a,\mathbb 1_{[m,\infty)}\cdot f_a\rangle.$
Since m can be chosen arbitrarily large we conclude that
$\limsup_{n\to\infty} \langle X(n),f\rangle \le \langle\mu,f\rangle.$
The converse estimate is an immediate consequence of Fatou's lemma. Hence,
$\lim_{n\to\infty} E\Big[ \sum_{m=1}^n \Delta Z[m,n] \,\Big|\, \mathcal F_n \Big] = \langle\mu,f\rangle.$
Since, conditional on F_n, $\sum_{m=1}^n \Delta Z[m,n]$ is a sum of independent Bernoulli variables with success probabilities tending uniformly to 0, we finally get that $\mathcal L\big(\sum_{m=1}^n \Delta Z[m,n]\,\big|\,\mathcal F_n\big)$ converges in the weak topology to a Poisson distribution with parameter ⟨µ,f⟩.
4 Large deviations
In this section we derive tools to analyse rare events in the random network. We provide large and moderate deviation principles for the temporal development of the indegree of a given vertex. This will allow us to describe the indegree evolution of the node with maximal indegree in the case of weak preferential attachment. The large and moderate deviation principles are based on an exponential approximation to the indegree evolution processes, which we first discuss.
4.1 Exponentially good approximation
In order to analyze the large deviations of the process Z[s,·] (or Z[m,·]) we use an approximating process. We first do this on the level of occupation measures. For s ∈ T and 0 ≤ u < v we define
$T_s[u,v) = \sup\{\, t'-t : Z[s,t]\ge u,\ Z[s,t']<v,\ t,t'\in T \,\}$
to be the time the process Z[s,·] spends in the interval [u,v). Similarly, we denote by T_s[u] the time spent in u. Moreover, we denote by (T[u])_{u∈S} a family of independent random variables with each entry T[u] being $\mathrm{Exp}(\bar f(u))$-distributed, and denote
$T[u,v) := \sum_{\substack{w\in S\\ u\le w<v}} T[w] \qquad \text{for all } 0\le u\le v.$
The following lemma shows that T[u,v) is a good approximation to T_s[u,v) in many cases.
Lemma 4.1. Fix η_1 ∈ (0,1), let s ∈ T and denote by τ the entry time into u of the process Z[s,·]. One can couple (T_s[u])_{u∈S} and (T[u])_{u∈S} such that, almost surely,
$\mathbb 1_{\{\bar f(u)\Delta\tau\le\eta_1\}}\,\big| T_s[u]-T[u] \big| \le \big(1\vee\eta_2\bar f(u)\big)\,\Delta\tau,$
where η_2 is a constant only depending on η_1.
Proof. We fix t ∈ T with $\bar f(u)\Delta t \le \eta_1$. Note that it suffices to find an appropriate coupling conditional on the event {τ = t}. Let U be a uniform random variable and let F and $\bar F$ denote the (conditional) distribution functions of T[u] and T_s[u], respectively. We couple T[u] and T_s[u] by setting $T[u] = F^{-1}(U)$ and $T_s[u] = \bar F^{-1}(U)$, where $\bar F^{-1}$ denotes the right continuous inverse of $\bar F$. The variables T[u] and T_s[u] satisfy the assertion of the lemma if and only if
$F\big( v-(1\vee\eta_2\bar f(u))\Delta t \big) \le \bar F(v) \le F\big( v+(1\vee\eta_2\bar f(u))\Delta t \big) \qquad \text{for all } v\ge0. \qquad (16)$
We compute
$1-\bar F(v) = \prod_{\substack{w\in T\\ t\le w,\ w+\Delta w\le t+v}} \big( 1-\bar f(u)\Delta w \big) = \exp\Big( \sum_{\substack{w\in T\\ t\le w,\ w+\Delta w\le t+v}} \log\big(1-\bar f(u)\Delta w\big) \Big).$
Next observe that, from a Taylor expansion, for a suitably large η_2 > 0, we have
$-\bar f(u)\Delta w - \eta_2\bar f(u)^2[\Delta w]^2 \le \log\big(1-\bar f(u)\Delta w\big) \le -\bar f(u)\Delta w,$
so that
$1-\bar F(v) \le \exp\Big( -\bar f(u) \sum_{\substack{w\in T\\ t\le w,\ w+\Delta w\le t+v}} \Delta w \Big) \le \exp\big( -\bar f(u)(v-\Delta t) \big) = 1-F(v-\Delta t).$
This proves the left inequality in (16). It remains to prove the right inequality. Note that
$1-\bar F(v) \ge \exp\Big( -\sum_{\substack{w\in T\\ t\le w,\ w+\Delta w\le t+v}} \big( \bar f(u)\Delta w + \eta_2\bar f(u)^2[\Delta w]^2 \big) \Big)$
and
$\sum_{\substack{w\in T\\ t\le w,\ w+\Delta w\le t+v}} [\Delta w]^2 \le \sum_{m=[\Delta t]^{-1}}^{\infty} \frac{1}{m^2} \le \frac{1}{[\Delta t]^{-1}-1} \le \Delta t.$
Consequently, $1-\bar F(v) \ge \exp\{ -\bar f(u)(v+\eta_2\bar f(u)\Delta t) \} = 1-F(v+\eta_2\bar f(u)\Delta t)$.
As a direct consequence of this lemma we obtain an exponential approximation.
Lemma 4.2. Suppose that, for some η < 1, we have f(j) ≤ η(j+1) for all j ∈ N ∪ {0}. If
$\sum_{j=0}^{\infty} \frac{f(j)^2}{(j+1)^2} < \infty,$
then for each s ∈ T one can couple T_s with T such that, for all λ ≥ 0,
$P\Big( \sup_{u\in S}\big| T_s[0,u)-T[0,u) \big| \ge \lambda+\sqrt{2K} \Big) \le 4e^{-\frac{\lambda^2}{2K}},$
where K > 0 is a finite constant only depending on f.
Proof. Fix s ∈ T and denote by τ_u the first entry time of Z[s,·] into the state u ∈ S. We couple the random variables T[u] and T_s[u] as in the previous lemma and let, for v ∈ S,
$M_v = \sum_{\substack{u\in S\\ u<v}} \big( T_s[u]-T[u] \big) = T_s[0,v)-T[0,v).$
Then (M_v)_{v∈S} is a martingale. Moreover, for each v = Φ(j) ∈ S one has τ_v ≥ Ψ(j+1), so that ∆τ_v ≤ 1/(j+1). Consequently, using the assumption of the lemma one gets that
$\Delta\tau_v\,\bar f(v) \le \frac{1}{j+1}\, f(j) =: c_v \le \eta < 1.$
Thus by Lemma 4.1 there exists a constant η′ < ∞ depending only on f(0) and η such that the increments of the martingale (M_v) are bounded by
$|T_s[v]-T[v]| \le \eta' c_v.$
By assumption we have $K := \sum_{v\in S} c_v^2 < \infty$ and we conclude with Lemma A.4 that for λ ≥ 0,
$P\Big( \sup_{u\in S}\big| T_s[0,u)-T[0,u) \big| \ge \lambda+\sqrt{2K} \Big) \le 4e^{-\frac{\lambda^2}{2K}}.$
We define (Z_t)_{t≥0} to be the S-valued process given by
$Z_t := \max\{\, v\in S : T[0,v)\le t \,\}, \qquad (17)$
and start by observing its connection to the indegree evolution. The following corollary holds in full generality, in particular without assumption (1).
and analogously that
$\log P\big( T_{s_{\min}}[0,\kappa+a_\kappa\zeta)+s_{\min}\le\kappa \big) \sim -a_\kappa\,\frac12\,\frac{1-2\alpha}{1-\alpha}\,(u+\zeta)^2.$
Next we prove that $P\big(\max_{s\in I_\kappa} Z[s,\kappa] < \kappa+a_\kappa\zeta\big)$ tends to 0 when $\zeta < -v+\sqrt{\tfrac{2-2\alpha}{1-2\alpha}\,v}$.
If ζ < −u, then the statement is trivial since by the moderate deviation principle $P(Z[s_{\min},\kappa] < \kappa+a_\kappa\zeta)$ tends to zero. Thus we can assume that ζ ≥ −u. By (28) one has
$P\Big( \max_{s\in I_\kappa} Z[s,\kappa] < \kappa+a_\kappa\zeta \Big) \le \exp\Big( \#I_\kappa\,\log\big( 1-P(T_{s_{\max}}[0,\kappa+a_\kappa\zeta)+s_{\max}\le\kappa) \big) \Big),$
and it suffices to show that the term in the exponential tends to −∞ in order to prove the assertion. The term satisfies
$\#I_\kappa\,\log\big(1-P(T_{s_{\max}}[0,\kappa+a_\kappa\zeta)+s_{\max}\le\kappa)\big) \sim -\#I_\kappa\, P(T_{s_{\max}}[0,\kappa+a_\kappa\zeta)+s_{\max}\le\kappa) = -\exp\big\{ a_\kappa c_\kappa \big\},$
where $c_\kappa := \frac{1}{a_\kappa}\log\#I_\kappa + \frac{1}{a_\kappa}\log P(T_{s_{\max}}[0,\kappa+a_\kappa\zeta)+s_{\max}\le\kappa)$. Since $\frac{1}{a_\kappa}\log\#I_\kappa$ converges to v, we conclude with (29) that
$\lim_{\kappa\to\infty} c_\kappa = v - \frac{1-2\alpha}{2-2\alpha}\,(v+\zeta)^2.$
Now elementary calculus implies that the limit is bigger than 0 by choice of ζ. This implies the first part of the assertion.
It remains to prove that $P\big(\max_{s\in I_\kappa} Z[s,\kappa] < \kappa+a_\kappa\zeta\big)$ tends to 1 for $\zeta > -u+\sqrt{\tfrac{2-2\alpha}{1-2\alpha}\,v}$. Now
$P\Big( \max_{s\in I_\kappa} Z[s,\kappa] < \kappa+a_\kappa\zeta \Big) \ge \exp\Big\{ \#I_\kappa\,\log\big( 1-P(T_{s_{\min}}[0,\kappa+a_\kappa\zeta)+s_{\min}\le\kappa) \big) \Big\}$
and it suffices to show that the expression in the exponential tends to 0. As above we conclude that
$\#I_\kappa\,\log\big(1-P(T_{s_{\min}}[0,\kappa+a_\kappa\zeta)+s_{\min}\le\kappa)\big) \sim -\exp\big\{ a_\kappa c_\kappa \big\},$
where $c_\kappa := \frac{1}{a_\kappa}\log\#I_\kappa + \frac{1}{a_\kappa}\log P(T_{s_{\min}}[0,\kappa+a_\kappa\zeta)+s_{\min}\le\kappa)$. We find convergence
$\lim_{\kappa\to\infty} c_\kappa = v - \frac{1-2\alpha}{2-2\alpha}\,(u+\zeta)^2$
and (as elementary calculus shows) the limit is negative by choice of ζ.
and (as elementary calculus shows) the limit is negative by choice ofζ. Fors∈Tandκ >0 we denote by ¯Z(s,κ)= (Z¯(s,κ)
t )t¾0 the random evolution given by ¯
Z(s,κ)
t =
Z[s,s+κt]−κt
(2)
Moreover, we let
z= (zt)t¾0=11−α −2α t
1−2α
1−α ∧1
t¾0.
Proof of Proposition 1.18. 1st part: By Lemma 5.4 the maximal indegree is related to the unimodal function h defined by
$h(u) = -u+\sqrt{\frac{2-2\alpha}{1-2\alpha}\,u}, \qquad \text{for } u\ge0.$
h attains its unique maximum in $u_{\max} = \frac12\,\frac{1-\alpha}{1-2\alpha}$ and $h(u_{\max}) = u_{\max}$. We fix $c > 4\,\frac{1-\alpha}{1-2\alpha}$, let $\zeta = \max[h(u_{\max})-h(u_{\max}\pm\epsilon)]$ and decompose the set $[0,u_{\max}-\epsilon)\cup[u_{\max}+\epsilon,c)$ into finitely many disjoint intervals $[u_i,v_i)_{i\in J}$, with mesh smaller than ζ/3. Then for the hub $s^*_\kappa$ at time κ > 0 one has
$P\big( s^*_\kappa \in a_\kappa[u_{\max}-\epsilon,u_{\max}+\epsilon) \big) \ge P\Big( \max_{s\in a_\kappa[u_{\max}-\epsilon,u_{\max})\cap T} Z[s,\kappa] \ge \kappa+a_\kappa(h(u_{\max})-\zeta/3) \Big) \times \prod_{i\in J} P\Big( \max_{s\in a_\kappa[u_i,v_i)\cap T} Z[s,\kappa] \le \kappa+a_\kappa(h(v_i)+\zeta/2) \Big) \times P\Big( \max_{s\in[c\,a_\kappa,\infty)\cap T} Z[s,\kappa] \le \kappa \Big). \qquad (30)$
By Lemma 5.4 the terms in the first two factors on the right converge to 1. Moreover, by Proposition 5.1 the third term converges to 1, if for all sufficiently large κ and $\kappa_+ = \min[\kappa,\infty)\cap S$, one has $4\varphi_{\kappa_+} \le c\,a_\kappa$. This is indeed the case, since one has $\kappa_+ \le \kappa+f(0)^{-1}$ so that by Lemma A.1, $4\varphi_{\kappa_+} \sim 4\,\frac{1-\alpha}{1-2\alpha}\,a_\kappa$. The statement on the size of the maximal indegree is now an immediate consequence of Lemma 5.4.
2nd part: We now prove that (an appropriately scaled version of) the evolution of a hub typically lies in an open neighbourhood around z.
Let U denote an open set in L(0,∞) that includes z and denote by U^c its complement in L(0,∞). Furthermore, we set
$A_\epsilon = \Big\{ x\in L(0,\infty) : \max_{t\in[\frac12,1]} x_t \ge 2(u_{\max}-\epsilon) \Big\}$
for ε ≥ 0. We start by showing that z is the unique minimizer of I on the set A_0. Indeed, applying the inverse Hölder inequality gives, for x ∈ A_0 with finite rate I(x),
$I(x) \ge \frac12\int_0^1 \dot x_t^2\, t^{\frac{\alpha}{1-\alpha}}\, dt \ge \frac12\Big( \int_0^1 |\dot x_t|\, dt \Big)^2\Big( \int_0^1 t^{-\frac{\alpha}{1-\alpha}}\, dt \Big)^{-1} \ge \frac12\,\frac{1-\alpha}{1-2\alpha} = u_{\max} = I(z).$
Moreover, one of the three inequalities is a strict inequality when x ≠ z. Recall that, by Lemma 4.15, I has compact level sets. We first assume that one of the entries in U^c ∩ A_0 has finite rate I. Since U^c ∩ A_0 is closed, we conclude that I attains its infimum on U^c ∩ A_0. Therefore,
$I(U^c\cap A_0) := \inf\{ I(x) : x\in U^c\cap A_0 \} > I(z) = u_{\max}.$
Conversely, using again compactness of the level sets, gives
$\lim_{\epsilon\downarrow0} I(U^c\cap A_\epsilon) \ge I(U^c\cap A_0).$
Therefore, there exists ε > 0 such that I(U^c ∩ A_ε) > I(z). Certainly, this is also true if U^c contains no element of finite rate.
From the moderate deviation principle, Theorem 1.15, together with the uniformity in s, see Proposition 4.4, we infer that
$\limsup_{\kappa\to\infty}\, \frac{1}{a_\kappa}\, \max_{s\in T}\, \log P\big( \bar Z^{(s,\kappa)} \in U^c\cap A_\epsilon \big) \le -I(U^c\cap A_\epsilon) < -I(z). \qquad (31)$
It remains to show that $P(\bar Z^{s^*_\kappa,\kappa}\in U^c)$ converges to zero. For ε > 0 and sufficiently large κ,
$P\big( \bar Z^{s^*_\kappa,\kappa}\in U^c \big) \le P\big( s^*_\kappa\notin a_\kappa[u_{\max}-\epsilon,u_{\max}+\epsilon] \big) + P\Big( \max_{t\in[\frac12,1]} \bar Z^{s^*_\kappa,\kappa}_t \le 2(u_{\max}-\epsilon),\ s^*_\kappa\in a_\kappa[u_{\max}-\epsilon,u_{\max}+\epsilon] \Big) + P\Big( \bar Z^{s^*_\kappa,\kappa}\in U^c,\ \max_{t\in[\frac12,1]} \bar Z^{s^*_\kappa,\kappa}_t \ge 2(u_{\max}-\epsilon),\ s^*_\kappa\in a_\kappa[u_{\max}-\epsilon,u_{\max}+\epsilon] \Big).$
By the first part of the proof the first and second term in the last equation tend to 0 for any ε > 0. The last term can be estimated as follows:
$P\Big( \bar Z^{s^*_\kappa,\kappa}\in U^c,\ \max_{t\in[\frac12,1]} \bar Z^{s^*_\kappa,\kappa}_t \ge 2(u_{\max}-\epsilon),\ s^*_\kappa\in a_\kappa[u_{\max}-\epsilon,u_{\max}+\epsilon] \Big) \le \sum_{s\in T\cap a_\kappa[u_{\max}-\epsilon,u_{\max}+\epsilon]} P\big( \bar Z^{(s,\kappa)}\in U^c\cap A_\epsilon \big). \qquad (32)$
Moreover, $\log\#\big( T\cap a_\kappa[u_{\max}-\epsilon,u_{\max}+\epsilon] \big) \sim a_\kappa(u_{\max}+\epsilon)$. Since, for sufficiently small ε > 0, we have $I(U^c\cap A_\epsilon) > u_{\max}+\epsilon$, we infer from (31) that the sum in (32) goes to zero.
A Appendix
A.1 Regularly varying attachment rules
In the following we assume that f : [0,∞) → (0,∞) is a regularly varying attachment rule with index α < 1, and represent f as f(u) = u^α ℓ(u), for u > 0, with a slowly varying function ℓ.
Lemma A.1. 1. One has
$\Phi(u) \sim \frac{1}{1-\alpha}\,\frac{u^{1-\alpha}}{\ell(u)}$
as u tends to infinity, and $\bar f$ admits the representation
$\bar f(u) = f\circ\Phi^{-1}(u) = u^{\frac{\alpha}{1-\alpha}}\,\bar\ell(u), \qquad \text{for } u>0,$
where $\bar\ell$ is again a slowly varying function.
2. If additionally α < 1/2, then
$\varphi_u = \int_0^u \frac{1}{\bar f(v)}\,dv \sim \frac{1-\alpha}{1-2\alpha}\,\frac{u^{\frac{1-2\alpha}{1-\alpha}}}{\bar\ell(u)}.$
Proof. The results follow from the theory of regular variation, and we briefly quote the relevant results taken from Bingham et al. [1987]. The asymptotic formula for Φ is an immediate consequence of Karamata's theorem, Theorem 1.5.11. Moreover, by Theorem 1.5.12, the inverse of Φ is regularly varying with index (1−α)^{-1} so that, by Proposition 1.5.7, the composition $\bar f = f\circ\Phi^{-1}$ is regularly varying with index $\frac{\alpha}{1-\alpha}$. The asymptotic statement about φ follows again by Karamata's theorem.
Remark A.2. In the particular case where f(u) ∼ cu^α, we obtain
$\Phi(u) \sim \frac{1}{c(1-\alpha)}\,u^{1-\alpha}, \qquad \Phi^{-1}(u) \sim \big(c(1-\alpha)u\big)^{\frac{1}{1-\alpha}}$
and
$\bar f(u) \sim c^{\frac{1}{1-\alpha}}\,\big((1-\alpha)u\big)^{\frac{\alpha}{1-\alpha}}.$
A.2 Two concentration inequalities for martingales
Lemma A.3. Let (M_n)_{n∈N∪{0}} be a martingale for its canonical filtration (F_n)_{n∈N∪{0}} with M_0 = 0. We assume that there are deterministic σ_n ∈ R and M < ∞ such that almost surely
• var(M_n | F_{n-1}) ≤ σ_n², and
• M_n − M_{n-1} ≤ M.
Then, for any λ > 0 and m ∈ N,
$P\Big( \sup_{n\le m} M_n \ge \lambda+\sqrt{2\textstyle\sum_{n=1}^m\sigma_n^2} \Big) \le 2\exp\Big( -\frac{\lambda^2}{2\big(\sum_{n=1}^m\sigma_n^2+M\lambda/3\big)} \Big).$
Proof. Let τ denote the first time n ∈ N for which $M_n \ge \lambda+\sqrt{2\sum_{n=1}^m\sigma_n^2}$. Then
$P(M_m\ge\lambda) \ge \sum_{n=1}^m P(\tau=n)\, P\Big( M_m-M_n \ge -\sqrt{2\textstyle\sum_{i=1}^m\sigma_i^2}\;\Big|\;\tau=n \Big).$
Next, observe that $\mathrm{var}(M_m-M_n\,|\,\tau=n) \le \sum_{i=n+1}^m\sigma_i^2$, so that by Chebyshev's inequality
$P\Big( M_m-M_n \ge -\sqrt{2\textstyle\sum_{i=1}^m\sigma_i^2}\;\Big|\;\tau=n \Big) \ge 1/2.$
On the other hand, a concentration inequality of Azuma type gives
$P(M_m\ge\lambda) \le \exp\Big( -\frac{\lambda^2}{2\big(\sum_{n=1}^m\sigma_n^2+M\lambda/3\big)} \Big)$
(see for instance Chung and Lu [2006], Theorem 2.21). Combining these estimates immediately proves the assertion of the lemma.
Similarly, one can use the classical Azuma-Hoeffding inequality to prove the following concentration inequality.
Lemma A.4. Let (M_n)_{n∈N∪{0}} be a martingale such that almost surely |M_n − M_{n-1}| ≤ c_n for a given sequence (c_n)_{n∈N}. Then for any λ > 0 and m ∈ N
$P\Big( \sup_{n\le m} |M_n-M_0| \ge \lambda+\sqrt{2\textstyle\sum_{n=1}^m c_n^2} \Big) \le 4\exp\Big( -\frac{\lambda^2}{2\sum_{n=1}^m c_n^2} \Big).$
Acknowledgments: We would like to thank Remco van der Hofstad for his encouragement and directing our attention to the reference Rudas et al. [2007], and two referees for helping us improve the readability of the paper.
References
Barabási, A.-L. and R. Albert (1999). Emergence of scaling in random networks. Science 286(5439), 509–512. MR2091634
Bingham, N. H., C. M. Goldie, and J. L. Teugels (1987). Regular variation, Volume 27 of Encyclopedia of Mathematics and its Applications. Cambridge: Cambridge University Press. MR0898871
Bollobás, B., O. Riordan, J. Spencer, and G. Tusnády (2001). The degree sequence of a scale-free random graph process. Random Structures Algorithms 18, 279–290. MR1824277
Chernoff, H. (1981). A note on an inequality involving the normal distribution. Ann. Probab. 9(3), 533–535. MR0614640
Chung, F. and L. Lu (2006). Complex graphs and networks, Volume 107 of CBMS Regional Conference Series in Mathematics. Published for the Conference Board of the Mathematical Sciences, Washington, DC. MR2248695
Dembo, A. and O. Zeitouni (1998). Large deviations techniques and applications (Second ed.), Volume 38 of Applications of Mathematics. New York: Springer-Verlag. MR1619036
Hofstad, R. v. d. (2009). Random graphs and complex networks. Eindhoven. Lecture Notes.
Jacod, J. and A. N. Shiryaev (2003). Limit theorems for stochastic processes (Second ed.), Volume 288 of Grundlehren der Mathematischen Wissenschaften. Berlin: Springer-Verlag. MR1943877
Krapivsky, P. and S. Redner (2001). Organization of growing random networks. Phys. Rev. E 63, Paper 066123.
Móri, T. F. (2002). On random trees. Studia Sci. Math. Hungar. 39, 143–155. MR1909153
Móri, T. F. (2005). The maximum degree of the Barabási-Albert random tree. Combin. Probab. Comput. 14, 339–348. MR2138118
Oliveira, R. and J. Spencer (2005). Connectivity transitions in networks with super-linear preferential attachment. Internet Math. 2, 121–163. MR2193157
Rudas, A., B. Tóth, and B. Valkó (2007). Random trees and general branching processes. Random Structures and Algorithms 31, 186–202. MR2343718
Williams, D. (1991). Probability with martingales. Cambridge Mathematical Textbooks. Cambridge: Cambridge University Press. MR1155402