getdoc0c02. 292KB Jun 04 2011 12:04:04 AM

(1)

El e c t ro n ic

Jo ur

n a l o

f P

r o

b a b i_{l i t y} Vol. 15 (2010), Paper no. 30, pages 962–988.

Journal URL

http://www.math.washington.edu/~ejpecp/

Stein’s method for dependent random variables occurring

in Statistical Mechanics

∗

Peter Eichelsbacher

†

_{Matthias Löwe}

‡

Abstract

We develop Stein’s method for exchangeable pairs for a rich class of distributional approxima-tions including the Gaussian distribuapproxima-tions as well as the non-Gaussian limit distribuapproxima-tions with density proportional to exp(₋µ_|x_|2k/(2k)!). As a consequence we obtain convergence rates in limit theorems of partial sums Sn for certain sequences of dependent, identically distributed random variables which arise naturally in statistical mechanics, in particular in the context of the Curie-Weiss models. Our results include a Berry-Esseen rate in the Central Limit Theorem for the total magnetization in the classical Curie-Weiss model, for high temperatures as well as at the critical temperatureβ_c =1, where the Central Limit Theorem fails. Moreover, we ana-lyze Berry-Esseen bounds as the temperature 1/β_nconverges to one and obtain a threshold for the speed of this convergence. Single spin distributions satisfying the Griffiths-Hurst-Sherman (GHS) inequality like models of liquid helium or continuous Curie-Weiss models are considered.

Key words:Berry-Esseen bound, Stein’s method, exchangeable pairs, Curie-Weiss models, criti-cal temperature, GHS-inequality.

AMS 2000 Subject Classification:Primary 60F05; Secondary: 82B20, 82B26. Submitted to EJP on March 7„ final version accepted May 20, 2010.

∗_{Both authors have been supported by Deutsche Forschungsgemeinschaft via SFB}_/_{TR 12. This research was done}

partly at the Mathematisches Forschungsinstitut Oberwolfach during a stay within the Research in Pairs Programme.

†_{Ruhr-Universität Bochum, Fakultät für Mathematik, NA3}_/_{68, D-44780 Bochum, Germany,}

[email protected]

‡_{Universität Münster, Fachbereich Mathematik und Informatik, Institut für Mathematische Statistik, Einsteinstr. 62,}

(2)

1 Introduction

Stein’s method is a powerful tool to prove distributional approximation. One of its advantages is that is often automatically provides also a rate of convergence and may be applied rather effectively also to classes of random variables that are stochastically dependent. Such classes of random variables are in natural way provided by spin system in statistical mechanics. The easiest of such models are mean-field models. Among them the Curie–Weiss model is well known for exhibiting a number of properties of real substances, such as spontaneous magnetization or metastability. The aim of this paper is to develop Stein’s method for exchangeable pairs (see[20]) for a rich class of distributional approximations and thereby prove Berry-Esseen bounds for the sums of dependent random variables occurring in statistical mechanics under the name Curie-Weiss models. For an overview of results on the Curie–Weiss models and related models, see[9],[11],[13].

For a fixed positive integer d and a finite subset Λ ofZd, a ferromagnetic crystal is described by random variables X_iΛ which represent the spins of the atom at sites i ∈Λ, whereΛ describes the macroscopic shape of the crystal. InCurie–Weissmodels, the joint distribution at fixed temperature

T>0 of the spin random variables is given by

PΛ,β((xi)):=PΛ,β (XiΛ)i∈Λ= (xi)i∈Λ

:= 1 Z_Λ(β)exp

_β

2_|Λ_| X i∈Λ

x_i2 Y i∈Λ

d̺(x_i). (1.1)

Here β := T−1 is the inverse temperature and Z_Λ(β) is a normalizing constant, that turns P_Λ_,_β

into a probability measure, known as the partition function and_|Λ_| denotes the cardinality of Λ. Moreover̺ is the distribution of a single spin in the limit β →0. We define SΛ =

P i∈ΛX

i , the total magnetizationinsideΛ. We take without loss of generality d = 1 andΛ =_{1, . . . ,n_}, where

nis a positive integer. We writen, X_i(n), P_n_,_β andS_n, respectively, instead of_|Λ_|, XΛ_i , P_Λ_,_β, and S_Λ, respectively. In the case whereβ is fixed we may even sometimes simply write Pn. In theclassical Curie–Weiss model, spins are distributed in_{−1,+1_}according to̺= 1₂(δ₋1+δ1). The measures P_n_,_β is completely determined by the value of the total magnetization. It is therefore called anorder parameterand its behaviour will be studied in this paper.

The following is known about the fluctuation behaviour ofSn under Pn. In the classical model (̺

is the symmetric Bernoulli measure), for 0< β <1, in[18]and[11]the Central Limit Theorem is proved:

Pn i=1Xi p

n →N(0,σ

2₍_β₎₎ _{in distribution with respect to the Curie–Weiss finite volume Gibbs}

states with σ2(β) = (1₋β)−1. Since forβ = 1 the variance σ2(β) diverges, the Central Limit Theorem fails at the critical point. In[18]and[11]it is proved that forβ=1 there exists a random variableX with probability density proportional to exp(₋₁₂1 x4)such that

Pn i=1Xi

n3/4 →X as n→ ∞in

distribution with respect to the finite-volume Gibbs states.

Stein introduced in [20] the exchangeable pair approach. Given a random variable W, Stein’s method is based on the construction of another variable W′ (some coupling) such that the pair

(W,W′)is exchangeable, i.e. their joint distribution is symmetric. The approach essentially uses the elementary fact that if (W,W′) is an exchangeable pair, thenE_g₍_W_,_W′_{) =}_{0 for all antisymmetric}

measurable functions g(x,y) such that the expectation exists. A theorem of Stein ([20, Theorem 1, Lecture III]) shows that a measure of proximity ofW to normality may be provided in terms of the exchangeable pair, requiringW′₋W to be sufficiently small. He assumed the linear regression

(3)

property

E(W′|W) = (1₋λ)W

for some 0< λ <1. Stein proved that for any uniformly Lipschitz functionh, |E_h(W)₋E_h(Z)_{| ≤}

δkh′_kwithZ denoting a standard normally distributed random variable and

δ=4E

1−

1 2λE (W

′₋_W₎2 |W

2λE|W−W

′_|3_. _(1.2)

Stein’s approach has been successfully applied in many models, see e.g.[20]or[21]and references therein. In[16], the range of application was extended by replacing the linear regression property by a weaker condition.

For a motivation of our paper we consider the construction of an exchangeable pair(W,W′)in the classical Curie-Weiss model forW = (1/pn)Pn_i₌₁Xi, proving an approximate regression property.

We produce a spin collectionX′= (X′_i)_i≥₁ via aGibbs samplingprocedure: select a coordinate, say

i, at random and replace X_i by X′_i drawn from the conditional distribution of the i’th coordinate given(Xj)j6=i, independently formXi. LetIbe a random variable taking values 1, 2, . . . ,nwith equal

probability, and independent of all other random variables. Consider

W′:=W₋_pXI n+

X_I′ p

p n

X j6=I

X_j+ X ′ I

p n.

Hence(W,W′)is an exchangeable pair andW ₋W′= XI−XI′

p_n . For_F :=σ(X₁, . . . ,X_n)we obtain

E_[_W₋_W′_|F_{] =}_p1

n n X

i=1

E_[_X_i₋_X′

i|F] =

nW−

p n

n n X i=1

E_[_X′

i|F].

The conditional distribution at siteiis given by

P_n x_i_|(x_j)_j₆₌_i= exp xiβmi(x)

exp βmi(x)

+exp −βmi(x)

, with m_i(x):= 1 n

X j6=i

x_j, i=1, . . . ,n.

It follows that

E[X_i′|F] =E[Xi|(Xj)j6=i] =tanh(βmi(X)).

The Taylor-expansion tanh(x) =x+_O(x3)leads to

E_[_W₋_W′_|_W_{] =} 1−β

n W+R= λ

σ2W+R (1.3)

withλ:= 1 n, σ

2 _:_{= (}₁

−β)−1 andE_|_R_|₌_O 1

n3/2

. With (1.3) we are able to apply Theorem 1.2 in

[16]: for any uniformly Lipschitz functionh,_|E_h₍_W₎₋E_h₍_Z₎_{| ≤}_δ′_k_h′_k_with

δ′=4E

1−

1 2λE (W

′₋_W₎2 |W

2λE|W−W

′_|3₊₁₉

ER2

λ . (1.4)

(4)

In the critical caseβ = 1 of the classical Curie-Weiss model the Taylor expansion tanh(x) = x ₋ x3/3+_O(x5), (1.3) would lead to

E_[_W₋_W′_|_W_{] =}W

3 1

n2 +˜R

for some ˜R. The prefactorλ:= 1

n2 would give growing bounds. In other words, the criticality of the

temperature value 1/βc=1 can also be recognized by Stein’s method. We already know that at the

critical value, the sum of the spin-variables has to be rescaled. Let us now defineW := 1 n3/4

Pn i=1Xi.

Constructing the exchangeable pair(W,W′)in the same manner as before we will obtain

E[W−W′|W] = 1 n3/2

3 +R(W) =:−λψ(W) +R(W) (1.5) with λ = 1

n3/2 and a remainder R(W) presented later. Considering the density p(x) = C exp(₋x4/12), we have p_p′₍(_xx₎) = ψ(x). This is the starting point for developing Stein’s method for limiting distributions with a regular Lebesgue-density p(_·) and an exchangeable pair (W,W′)

which satisfies the condition

E_[_W₋_W′_|_W_{] =}₋_λψ₍_W_{) +}_R₍_W_{) =}₋_λp

′₍_W₎

p(W) +R(W)

with 0< λ <1.

In Section 2 we develop Stein’s method for exchangeable pairs for a rich class of other distributional approximations than normal approximation. Moreover, we prove certain refinements of Stein’s method for exchangeable pairs in the case of normal approximation. In Section 3 we apply our general approach to consider Berry-Esseen bounds for the rescaled total magnatization for general single spin distributions̺in (1.1) satisfying the GHS-inequality. This inequality ensures the appli-cation of correlation-inequalities due to Lebowitz for bounding the variances and other low order moments, which appear in Stein’s method. Rates of convergence in limit theorems for partial sums in the context of the Curie-Weiss model will be proven (Theorems 3.1, 3.2 and 3.3). Moreover, we prove a Berry-Essen rate in the Central Limit Theorem for the total magentization in the classi-cal Curie-Weiss model, for high tempertaures (Theorem 3.7) as well as at the criticlassi-cal temperature

β=1 (Theorem 3.8). We analyze Berry-Esseen bounds as the temperature 1/β_n converges to one and obtain a threshold for the speed of this convergence. Section 4 contains a collection of exam-ples including the Curie-Weiss model with three states, modeling liquid helium, and a continuous Curie-Weiss model, where the single spin distribution̺is a uniform distribution.

During the preparation of our manusscript we became aware of a preprint of S. Chatterjee ans Q.-M. Shao about Stein’s method with applications to the Curie-Weiss model[4]. As far as we understand, there the authors give an alternative proof of Theorem 3.7 and 3.8.

2 The exchangeable pair approach for distributional approximations

We will begin with modifying Stein’s method by replacing the linear regression property by

(5)

whereψ(x)depends on a continuous distribution under consideration. Let us mention that this is not the first paper to study other distributional approximations via Stein’s method. For a rather large class of continuous distributions, the Stein characterization was introduced in[21], following[20, Chapter 6]. In[21], the method of exchangeable pairs was introduced for this class of distribution and used in a simulation context. Recently, the exchangeable pair approach was introduced for exponential approximation in[3, Lemma 2.1].

Given two random variablesX and Y defined on a common probability space, we denote the Kol-mogorov distance of the distributions ofX andY by

d_K(X,Y):=sup

z∈R

|P(X≤z)₋P(Y ≤z)_|.

Motivated by the classical Curie-Weiss model at the critical temperature, we will develop Stein’s method with the help of exchangeable pairs as follows. Let I = (a,b) be a real interval, where

−∞ ≤a < b_{≤ ∞}. A function is calledregularif f is finite on I and, at any interior point of I, f

possesses a right-hand limit and a left-hand limit. Further, f possesses a right-hand limit f(a+)at the point a and a left-hand limit f(b−) at the point b. Let us assume, that the regular density p

satisfies the following condition:

Assumption (D)Letpbe a regular, strictly positive density on an intervalI= [a,b]. Supposephas a derivative p′that is regular on I, has only countably many sign changes, and is continuous at the sign changes. Suppose moreover thatR

Ip(x)|log(p(x))|d x<∞and thatψ(x):= p′(x)

p(x) is regular.

In[21]it is proved, that a random variableZ is distributed according to the densitypif and only if

E _f′(Z) +ψ(Z)f(Z)= f(b−)p(b−)₋f(a+)p(a+)for a suitably chosen class_F of functions f. The corresponding Stein identity is

f′(x) +ψ(x)f(x) =h(x)₋P(h), (2.6) wherehis a measurable function for whichR

I|h(x)|p(x)d x<∞,P(x):= Rx

−∞p(y)d yandP(h):= R

Ih(y)p(y)d y. The solution f := fh of this differential equation is given by f(x) =

a h(y)−Ph)p(y)d y

p(x) . (2.7)

For the functionh(x):=1_{x≤z}(x) let f_z be the corresponding solution of (2.6). We will make the following assumptions:

Assumption (B1)Let p be a density fulfilling Assumption (D). We assume that for any absolutely continuous functionh, the solution fhof (2.6) satisfies

kf_h_{k ≤}c_1kh′_k, _kf_h′_{k ≤}c_2kh′_k and _kf_h′′(x)_{k ≤}c_3kh′_k, wherec₁,c₂andc₃are constants.

Assumption (B2)Let pbe a density fulfilling Assumption (D) We assume that the solution f_z of

(6)

satisfies

|fz(x)| ≤d1, |fz′(x)| ≤d2 and |fz′(x)− fz′(y)| ≤d3

and

|(ψ(x)f_z(x))′_|=(p ′₍_x₎ p(x) fz(x))

′_≤_d

4 (2.9)

for all real x and y, whered1,d2,d3andd4are constants.

Remark 2.1. In the case of the normal approximation, ψ(x) = ₋x. Assumption (B2) includes a bound for (x f_z(x))′ for the solution f_z of the classical Stein equation. It is easy to observe that

|(x f_z(x))′_{| ≤}2 by direct calculation (see[5, Proof of Lemma 6.5]). However, in the normal approx-imation case, using this bound leads to a worse Berry-Esseen constant. We will be able to improve the Berry-Esseen constant applyingd2 = d3 =1 andd1 =

2π/4 (see Theorem 2.6 and Theorem 2.4).

We will see, that all limit laws in our class of Curie-Weiss models satisfy Condition (2.9):

Lemma 2.2. The densities f_k_,_µ_,_β in(3.35)and(3.36)and the densities in Theorem 3.1 and Theorem 3.2 satisfy Assumptions (D), (B1) and (B2).

Proof. We defer the proof to the appendix, since they only involve careful analysis.

Remark 2.3. We restrict ourselves to solutions of the Stein equation characterizing distributions with probability densitiespof the form b_kexp(₋a_kx2k). Along the lines of the proof of Lemma 2.2, one would also be able to derive good bounds (in the sense that Assumption (B1) and (B2) are fulfilled) even for measures with a probability density of the form

p(x) =bkexp −akV(x)

, (2.10)

whereV is even, twice continuously differentiable, unbounded above at infinity,V′₆=0 andV′and 1/V′are increasing on[0,_∞). Moreover one has to assume that V′′(x)

|V′₍_x₎_| can be bounded by a constant

forx ≥d with somed ∈R₊_{. We sketch the proof in the appendix. It is remarkable, that this class}

of measures is a subclass of measures which are GHS, see Section 3. A measure with density p in (2.10) is usually called a Gibbs measure. Stein’s method for discrete Gibbs measures is developed in[7]. Our remark might be of use for a potential application of Stein’s method to Gibbs measures with continuous spins.

The following result is a refinement of Stein’s result[20]for exchangeable pairs.

Theorem 2.4. Let p be a density fulfilling Assumption (D). Let (W,W′) be an exchangeable pair of real-valued random variables such that

E_[_W′_|_W_{] =}_W₊_λψ₍_W₎₋_R₍_W₎ _(2.11)

for some random variable R=R(W),0< λ <1andψ=p′/p. Then

(7)

Moreover it holds for a random variable Z distributed according to p:(1): Under Assumption (B1), for any uniformly Lipschitz function h, we obtain|E_h(W)₋E_h(Z)_{| ≤}δkh′kwith

δ:=c₂E

1−

2λE (W −W

′₎2 |W

4λE|W−W

′_|3₊ c1 λ

E₍_R2₎_. (2): Under Assumption (B2), we obtain for any A>0

d_K(W,Z) _≤ d₂ È

1₋ 1

2λE[(W′−W) 2_|_W_]

+ d₁+3

pE(R2)

+ 1

λ d4A3

+3A

2 E(|ψ(W)|) +

2λE (W−W

′₎2₁

{|W−W′_|≥A}. (2.13)

With (2.12) we obtainE

1−₂1_λE[(W−W′)2_|W]

=1+E[Wψ(W)]₋E(W R)

λ . Therefore the bounds

in Theorem 2.4 are only useful, if −E[Wψ(W)] is close to 1 and E(W R_λ ) is small. Alternatively, bounds can be obtained by comparing with a modified distribution that involvesE_[_W_ψ₍_W_)]_{. Let}

p_W be a probability density such that a random variableZ is distributed according top_W if and only ifE E[Wψ(W)]f′(Z) +ψ(Z)f(Z)=0 for a suitably chosen class of functions.

Theorem 2.5. Let p be a density fulfilling Assumption (D). Let (W,W′) be an exchangeable pair of random variables such that(2.11)holds. If Z_W is a random variable distributed according to p_W, we obtain under (B1), for any uniformly Lipschitz function h that|E_h(W)₋E_h(ZW)| ≤δ′kh′kwith

δ′:= c2

2λ Var E[(W −W

′₎2

|W]1/2+ c3

4λE|W−W

′_|3₊ c1+c2

E(W2)

E₍_R2₎_.

Under Assumption (B2) we obtain for any A>0

d_K(W,Z_W) _≤ d2

2λ Var E[(W−W

′₎2

|W]1/2+ d₁+d₂pE₍_W2_{) +}3

pE(R2)

+ 1

λ d₄A3

+3A

2 E(|ψ(W)|) +

d₃

2λE (W−W

′₎2₁

{|W−W′_|≥_A_}. (2.14) Proof of Theorem 2.4. The proof is an adaption of the results in[20]. For any function f such that

E(ψ(W)f(W))exists we obtain

0 = E (W −W′)(f(W′) + f(W))

= E ₍_W ₋_W′₎₍_f₍_W′₎₋ _f₍_W₎₎₋₂_λE₍_ψ₍_W₎_f₍_W_{)) +}₂E₍_f₍_W₎_R₍_W₎₎_, _(2.15)

which is equivalent to

E(ψ(W)f(W)) =₋ 1

2λE (W−W

′₎₍_f₍_W₎₋_f₍_W′₎₎₊ 1

λE(f(W)R(W)). (2.16) Proof of (1):Let f = fhbe the solution of the Stein equation (2.6), and define

(8)

By (2.16), following the calculations on page 21 in[5], we obtain

|E_h₍_W₎₋E_h₍_Z₎_| ₌ _|E _f′₍_W_{) +}_ψ₍_W₎_f₍_W₎_|

= E f′(W)

1₋ 1

2λ(W−W

′₎2

+ 1

2λE

(f′(W)₋f′(W+t))Kb(t)d t

+ 1

λE(f(W)R(W))

. UsingR

R|t|Kb(t)d t=

2E|W−W

′_|3_{, the bounds in Assumption (B1) give:}

|E_h(W)₋E_h(Z)_{| ≤ k}h′k

c2E

1−

2λE (W−W

′₎2 |W

c₃

4λE|W−W

′_|3₊ c1 λ

E(R2)

. (2.17)

Proof of (2):Now let f = f_z be the solution of the Stein equation (2.8). Using (2.16), we obtain

P(W ≤z)₋P(z) = E(f′(W) +ψ(W)f(W))

= E

f′(W) 1₋ 1

2λ(W −W

′₎2

+ 1

2λE(2f(W)R) − 1

2λE

(W₋W′) f(W)₋ f(W′)₋(W ₋W′)f′(W) = T1+T2+T3.

Now the bounds in Assumption (B2) give

|T1| ≤d₂ È

1− 1 λE[(W

′₋_W₎2_|_W_]

and |T2| ≤ d1 λ

E(R2₎_.

BoundingT₃we apply the concentration technique, see[17]:

(₋2λ)T3 = E

(W −W′)1_{|W_−W′_|_>_A}

Z 0

−(W−W′₎

(f′(W+t)₋f′(W))d t

+ E

(W −W′)1_{|_W₋_W′_|≤_A_}

Z 0

−(W−W′₎

(f′(W+t)₋f′(W))d t

. (2.18)

The modulus of the first term can be bounded by d₃E ₍_W ₋_W′₎2₁

{|W−W′_|_>_A_}. Using the Stein

identity (2.8), the second summand can be represented as

(W₋W′)1_{|_W₋_W′_|≤_A_}

Z 0

−(W−W′)

−ψ(W+t)f(W+t) +ψ(W)f(W)d t

(W−W′)1_{|W−W′_|≤A}

Z 0

−(W−W′₎

(1_{W+t≤z}−1{W≤z})d t

=:U1+U2.

Withg(x):= (ψ(x)f(x))′we obtain₋ψ(W+t)f(W+t) +ψ(W)f(W) =₋R₀tg(W+s)ds. Since

|g(x)_{| ≤}d4we obtain|U1| ≤ A

(9)

The termU₂ can be bounded byE ₍_W₋_W′₎2_I

{0≤(W−W′₎_≤_A_}1_{_z_≤_W_≤_z₊_A_}. Under the assumptions of

our Theorem we proceed as in[17]and obtain the following concentration inequality:

E (W −W′)2I_{0≤(W−W′₎_≤a}1_{z≤W_≤z₊_A}≤3A(λE(|ψ(W)|) +E(|R|)). (2.19)

To see this, we apply the estimate E ₍_W ₋_W′₎2_I₀

≤(W−W′₎_≤_A1_z_≤_W_≤_z₊_A ≤ E (W −W′)(f(W)− f(W′)), hence U2 can be bounded by E (W − W′)(f(W)− f(W′))

= 2E _f(W)R(W) ₋

2λE _ψ(W)f(W), where we applied (2.16), and wheref is defined byf(x):=₋1.5Aforx ≤z−A,

f(x):=1.5Aforx_≥z+2Aandf(x):= x₋z₋A/2 in between. ThusU₂_≤3A E₍_|_R_|₎₊_λE₍_|_ψ₍_W₎_|₎_.

Similarly we obtainU₂_{≥ −}3A E(_|R_|) +λE(_|ψ(W)_|).

Proof of Theorem 2.5. The main observation is the following identity:

E ₋E_[_W_ψ₍_W_)]_f′₍_W_{) +}_ψ₍_W₎_f₍_W₎ ₌ E

f′(W)

_E_[(_W

−W′)2_]₋₂_E_[_{W R}_]

2λ

+E _ψ₍_W₎_f₍_W₎

f′(W)

_E_[(_W

−W′)2]₋E_[(_W₋_W′₎2_|_W_]

2λ

+ 1

E[f(W)R]₋E[E(W R)f′(W)]

+T3

with T3 defined as in the proof of Theorem 2.4. We apply the Cauchy-Schwarz inequality to get

EE_[(_W₋_W′₎2_]₋E_[(_W₋_W′₎2_|_W_]_≤ _Var E_[(_W₋_W′₎2_|_W_]1/2_{. The proof follows the lines of}

the proof of Theorem 2.4.

In the special case of normal approximation, the following Theorem improves Theorem 1.2 in[16]:

Theorem 2.6. Let(W,W′)be an exchangeable pair of real-valued random variables such that E(W′_|W) = (1₋λ)W+R

for some random variable R=R(W)and with0< λ <1. Assume thatE(W2)_≤1. Let Z be a random variable with standard normal distribution. Then for any A>0,

dK(W,Z) ≤

1− 1

2λE[(W

′₋_W₎2_|_W_]

+ p

2π

4 +1.5A

p_E₍_R2₎

λ (2.20)

+0.41A

λ +1.5A+

2λE (W−W

′₎2₁

{|W−W′_|≥_A_}. (2.21) Remark 2.7. When|W−W′|is bounded, our estimate improves (1.10) in[16, Theorem 1.2]with respect to the Berry-Esseen constants.

Proof. Let f = f_z denote the solution of the Stein equation

f_z′(x)₋x fz(x) =1{x≤z}(x)−Φ(z). (2.22)

We obtain

P(W_≤z)₋Φ(z) = E

f′(W) 1₋ 1

2λ(W−W

′₎2

− 1

2λE(2f(W)R) − 1

2λE

(W−W′) f(W)₋f(W′)₋(W−W′)f′(W)

(10)

Using_|f′(x)_{| ≤}1 for all realx (see[5, Lemma 2.2]), we obtain the bound_|T_{1| ≤} E ₁₋ 1

2λE[(W′−

W)2_|W]21/2. Using 0 < f(x) _≤ p2π/4 (see [5, Lemma 2.2]), we have _|T_{2| ≤} p

2π

4λ E(|R|) ≤

2π

4λ

E₍_R2₎_{. Bounding} _T₃ _{we apply (2.18) for} ₍₋₂_λ₎_T₃_{. The modulus of the first term can be}

bounded byE (W−W′)21_{|_W₋_W′_|_>_A_}using|f′(x)−f′(y)| ≤1 for all real x andy (see[5, Lemma

2.2]). Using the Stein identity (2.22), the second summand can be represented as

(W₋W′)1_{|_W₋_W′_|≤_A_}

Z 0

−(W−W′₎

(W+t)f(W+t)₋W f(W)d t

(W₋W′)1_{|W_−W′_|≤A}

Z 0

−(W−W′)

(1_{W+t≤z}−1{W≤z})d t

=:U₁+U₂.

Next observe that|U1| ≤0.82A3, see[17]: By the mean value theorem one gets

(W+t)f(W+t)₋W f(W) =W(f(W+t)₋f(W))+t f(W+t) =W Z 1

f′(W+ut)t du+t f(W+t).

Hence _|(W + t)f(W + t) ₋ W f(W)_{| ≤ |}W| |t| + _|t|p2π/4 = _|t|(p2π/4 + _|W|). Using

E_|_W_{| ≤} pE(W2) _≤ 1 gives the bound. The term U2 can be bounded by E (W − W′)2I_{₀_≤₍_W₋_W′₎_≤_A_}1_{_z_≤_W_≤_z₊_A_} and E (W −W′)2I_{₀_≤₍_W₋_W′₎_≤_a_}1_{_z_≤_W_≤_z₊_A_} ≤ 3A(λ+E(R)), see

[17]. Similarly, we obtain U2≥ −3A(λ+E(R)).

Remark 2.8. In Theorem 2.6, we assumedE₍_W2₎_≤_{1. Alternatively, let us assume that}E₍_W2₎ _is

finite. Then the proof of Theorem 2.6 shows, that the third and the fourth summand of the bound (2.20) change to A_λ3 p2π

16 + p

E(W2₎

+1.5AE(_|W|).

In the following corollary, we discuss the Kolmogorov-distance of the distribution of a random vari-ableW to a random variable distributed according toN(0,σ2).

Corollary 2.9. Letσ2_>₀_and₍_W_,_W′₎_{be an exchangeable pair of real-valued random variables such} that

E(W′_|W) = 1₋ λ

σ2

W+R (2.24)

for some random variable R= R(W) and with0< λ <1. Assume thatE₍_W2₎ _{is finite. Let Z}_σ _{be a}

random variable distributed according to N(0,σ2). If |W−W′| ≤A for a constant A, we obtain the bound

d_K(W,Zσ) ≤

1₋ 1 2λE[(W

′₋_W₎2_|_W_]

_σp₂_π

4 +1.5A

p_E₍_R2₎ λ

+ A

3 λ

p₂

πσ2

16 +

E(W2)

+1.5ApE(W2). (2.25)

Proof. Let us denote by fσ:= fσ,zthe solution of the Stein equation f_σ′_,_z(x)₋ x

(11)

withFσ(z):= p₂1_πσ

−∞exp − y2

2σ2

d y. It is easy to see that the identity fσ,z(x) =σfz _σx

, where

f_z is the solution of the corresponding Stein equation of the standard normal distribution, holds true. Using[5, Lemma 2.2]we obtain 0< f_σ(x)< σ

2π

4 , |fσ′(x)| ≤1, and|fσ′(x)−fσ′(y)| ≤1.

With (2.24) we arrive at P(W ≤z)₋Fσ(z) = T1+T2+T3 with Ti’s defined in (2.23). Using the

bounds of fσand fσ′, the bound ofT1is the same as in the proof of Theorem 2.6, whereas the bound

ofT₂ changes to_|T_{2| ≤}σ

2π

4λ

E₍_R2₎_{. Since we consider the case}_|_W₋_W′_{| ≤}_A_{, we have to bound}

T3=−

1 2λE

(W−W′)1_{|W−W′_|≤_A_}

Z 0

−(W−W′₎

(f′(W+t)₋f′(W))d t

Along the lines of the proof of Theorem 2.6, we obtain_|T_{3| ≤} A_λ3 p

E(W2₎

4 +

σp2π

+1.5A pE₍_W2₎₊

E(R2₎

. Hence the corollary is proved. With (2.24) we obtainE₍_W₋_W′₎2₌ 2λ

σ2E(W

2₎

−2E₍_{W R}₎_{. Therefore} E

1₋ 1 2λE[(W

′₋_W₎2 |W]

=1₋E(W

2₎ σ2 +

E(W R)

λ , (2.27)

so that the bound in Corollary 2.9 is only useful whenE(W2)is close toσ2_(and_E₍_{W R}₎_/λ_{is small).}

An alternative bound can be obtained comparing with aN(0,E₍_W2₎₎_{-distribution.}

Corollary 2.10. In the situation of Corollary 2.9, let Z_W denote the N(0,E(W2)) distribution. We obtain

d_K(W,Z_W) _≤ σ

2λ Var E[(W

′₋_W₎2

|W]1/2+σ2

p_E₍_W2₎p₂_π

4 +1.5A

p_E₍_R2₎ λ

+σ2A 3 λ

p_E₍_W2₎p₂_π

16 +

E₍_W2₎

+σ21.5ApE₍_W2_{) +}_σ2

E₍_W2₎pE₍_R2₎

λ . (2.28) Proof. With (2.27) we getE(W2) =σ2 ₂1_λ(E(W−W′)2+2E(W R)). With the definition of T2and T3 as in (2.23) we obtain

E E₍_W2₎_f′₍_W₎₋_{W f}₍_W₎ ₌ _σ2E

_E₍_W

−W′)2+2E(W R)

2λ f

′₍_W₎

−E₍_{W f}₍_W₎₎

=σ2E

f′(W)

_E₍_W

−W′)2₋E_[(_W₋_W′₎2_|_W_]

2λ

+σ2(T₂+T₃) +σ2E(W R)

λ . (2.29)

Remark that nowσ2 in (2.24) is a parameter of the exchangeable-pair identity and no longer the parameter of the limiting distribution. We apply (2.26) and exchange every σ2 in (2.26) with

E(W2). Applying Cauchy-Schwarz to the first summand and bounding the other terms as in the proof of Corollary 2.9 leads to the result.

(12)

3 Berry-Esseen bounds for Curie-Weiss models

We assume that̺in (1.1) is in the class_B of non-degenerate symmetric Borel probability measures onR_{which satisfy}

exp

_{b x}2

d̺(x)<∞ for all b>0. (3.30) For technical reasons we introduce a model with non-negative external magnetic field, where the strength may even depend on the site:

P_n_,_β_,_h

1,...,hn(x) =

Zn,β,h1,...,hn

exp β 2nS

n+β n X i=1

h_ix_id̺⊗n(x), x = (x_i). (3.31)

In the general case (1.1), we will see (analogously to the results in[11; 13]) that the asymptotic behaviour ofSn depends crucially on the extremal points of a function G (which is a transform of

the rate function in a corresponding large deviation principle): defineφ̺(s):=log

exp(s x)d̺(x)

and

G̺(β,s):=

βs2

2 −φ̺(βs). (3.32) We shall dropβ in the notation forG whenever there is no danger of confusion, similarly we will suppress ̺ in the notation for φ and G. For any measure̺ ∈ B, G was proved to have global minima, which can be only finite in number, see[11, Lemma 3.1]. DefineC =C̺to be the discrete,

non–empty set of minima (local or global) of G. If α ∈ C, then there exists a positive integer

k:=k(α)and a positive real numberµ:=µ(α)such that

G(s) =G(α) +µ(α)(s−α)

(2k)! +O((s−α)

2k+1₎ _as _s

→α. (3.33) The numbers k and µ are called the type and strength, respectively, of the extremal point α. Moreover, we define the maximal type k∗ of G by the formula k∗ =

max_{k(α);αis a global minimum ofG_}. Note that theµ(α)can be calculated explicitly: one gets

µ(α) =β−β2φ′′(β α) ifk=1 while µ(α) =₋β2kφ(2k)(β α) if k≥2 (3.34) (see[13]). An interesting point is, that the global minima ofGof maximal type correspond to stable states, meaning that multiple minima represent a mixed phase and a unique global minimum a pure phase. For details see the discussions in[13]. In general, given̺∈ B, letαbe one of the global minima of maximal typekand strengthµofG̺. Then

Sn−nα

n1−1/2k →Xk,µ,β in distribution, whereXk,µ,β

is a random variable with probability density f_k_,µ,β, defined by

f_1,_µ_,_β(x) = _p 1

2πσ2

exp ₋x2/2σ2 (3.35)

and fork_≥2

f_k_,µ,β(x) =

exp ₋µx2k/(2k)!

(13)

Here, σ2 = _µ1 ₋_β1 so that forµ =µ(α)as in (3.34), σ2 = ([φ′′(βα)]−1₋β)−1 (see [11], [13]). Moderate deviation principles have been investigated in[6].

In[10]and[13], a class of measures̺is described with a behaviour similar to that of the classical Curie–Weiss model. Assume that̺is any symmetric measure that satisfies the GHS-inequality,

ds3φ̺(s)≤0 for alls≥0, (3.37)

(see also [12; 14]). One can show that in this case G has the following properties: There exists a valueβc, the inverse critical temperature, and G has a unique global minimum at the origin for

0< β ≤ βc and exactly two global minima, of equal type, for β > βc. For βc the unique global

minimum is of type k _≥ 2 whereas for β ∈(0,βc) the unique global minimum is of type 1. At

βc the fluctuations ofSn live on a larger scale thanpn. This critical temperature can be explicitly

computed asβ_c=1/φ′′(0) =1/Var̺(X1). By rescaling theXi we may thus assume thatβc=1.

By GHS we denote the set of measures̺∈ B such that the GHS-inequality (3.37) is valid. We will be able to obtain Berry-Esséen-type results for̺-a.s. bounded single-spin variablesX_i:

Theorem 3.1. Given ̺∈ B in GHS, let α be the global minimum of type k and strength µ of G̺.

Assume that the single-spin random variables X_i are bounded̺-a.s. In the case k=1we obtain d_K _pSn

n,ZW

≤C n−1/2, (3.38)

where Z_W denotes a random variable distributed according to he normal distribution with mean zero and varianceE(W2)and C is an absolute constant depending on0< β <1. For k≥2we obtain

d_K Sn−nα n1−1/2k,ZW,k

≤C_kn−1/k, (3.39)

where Z_W_,_kdenotes a random variable distributed according to the density bf_W_,_k defined by bf_W_,_k(x):= Cexp ₋ x2k

2kE(W2k₎

with C−1=Rexp ₋ x2k

2kE(W2k₎

d x and W := Sn−nα

n1−1/2k and Ckis an absolute constant.

Theorem 3.2. Let̺ ∈ B satisfy the GHS-inequality and assume that βc = 1. Let α be the global minimum of type k with k≥ 2and strength µk of G̺ and let the single-spin variable Xi be bounded. Let0< βn<∞depend on n in such a way thatβn→1monotonically as n→ ∞. Then the following assertions hold: (1): Ifβn−1=

n1−1k

for someγ6=0, we have

sup

z∈R

_S n−nα n1−1/2k ≤z

−FW,k,γ(z)

≤Ckn−1/k (3.40) with

FW,k,γ(z):=

Z Z z

−∞

exp

−c_W−1

µk (2k)!x

−γ

d x.

where Z:=R_Rexp ₋c_W−1 µk

(2k)!x 2k

− γ₂x2d x, with W := Sn−nα

n1−1/2k, cW :=

µk

(2k)!E(W 2k₎

−γE₍_W2₎_and

C_kis an absolute constant.

(2): If |βn−1| ≪ n−(1−1/k), Sn−nα

n1−1/2k converges in distribution to FbW,k, defined as in Theorem 3.1. Moreover, if|βn−1|=O(n−1),(3.39)holds true.

(14)

(3): If_|βn−1| ≫n−(1−1/k), the Kolmogorov distance of the distribution of W := q

1−β_n

n Pn

i=1Xi and the normal distribution N(0,E₍_W2₎₎_{converges to zero. Moreover, if}_|_β_n₋₁_{| ≫}_n−(1/2−1/2k)_{, we obtain}

sup

z∈R

p₍₁₋_β n)Sn

n ≤z

−Φ_W(z)

≤C n−1/2 with an absolute constant C.

For arbitrary̺ ∈GHS we are able to prove good bounds with respect to smooth test functions h. For technical reasons, we consider a modified model. Let

P_n_,_β_,_h(x) = 1 b Z_n_,β,h

exp

_β n

1≤i<j≤n

x_ix_j+βh n X i=1

x_i

d̺⊗n(x), x = (x_i).

Theorem 3.3. Given the Curie-Weiss model bP_n_,β and̺∈ B in GHS, letαbe the global minimum of

type k and strengthµof G_̺. In the case k= 1, for any uniformly Lipschitz function h we obtain for W =Sn/pn that

E _h₍_W₎₋_Φ_W₍_h₎_{≤ k}_h′_k_C max

E_|_X_1|3_,E_|_X_1|′ 3

n .

Here C is a constant depending on0< β <1andΦ_W(h):=R

Rh(z)ΦW(dz). The random variable X

′ i is drawn from the conditional distribution of the i’th coordinate X_i given(X_j)_j6₌_i (this choice will be explained in Section 3). For k≥2we obtain for any uniformly Lipschitz function h and for W := Sn−nα

n1−1/2k

E _h(W)₋FbW,k(h) _{≤ k}_h′_k

n1/k+

C₂ max E_|_X₁_|3_,E_|_X′

1| 3 n1−1/2k

Here C1,C2 are constants, and bFW,k(h):= R

Rh(z)FbW,k(dz).

Remark 3.4. Note that for ̺ ∈ GHS, φ̺(s) ≤ ₂1σ2̺s2 for all real s, where σ2̺ =

2_̺₍_{d x}₎_.

These measures are calledsub-Gaussian. Very important for our proofs of Berry-Esseen bounds will be the following correlation-inequality due to Lebowitz [15]: If E denotes the expectation with respect to the measure Pn,β,h1,...,hn in (3.31), one observes easily that for any ̺ ∈ B and sites i,j,k,l _{∈ {}1, . . . ,n_}the following identity holds:

∂3 ∂hi∂hj∂hk

E₍_X_l₎

allhi=0

(3.41)

= E(XiXjXkXl)−E(XiXj)E(XkXl)−E(XiXk)E(XjXl)−E(XiXl)E(XjXk).

Lebowitz[15]proved that if̺∈GHS, (3.41) is non-positive (see[9, V.13.7.(b)]). We will see in the proofs of Theorem 3.1 and Theorem 3.2 Lebowitz’ inequality helps to bound the variances. In the situation of Theorem 3.1 and Theorem 3.2 we can bound higher order moments as follows:

Lemma 3.5. Given̺∈ B, letαbe one of the global minima of maximal type k for k≥1and strength µof G̺. For W := Sn−n

n1−1/2k we obtain thatE|W| l

(15)

We prepare for the proof of Lemma 3.5. It considers a well known transformation – sometimes called theHubbard–Stratonovich transformation– of our measure of interest.

Lemma 3.6. Let m _∈ R _and ₀ _{< γ <} ₁ _{be real numbers. Consider the measure Q}_n_,_β _:₌ _P_n_◦

_S n−nm

nγ ₋1

∗ N(0,_β_n12γ−1)whereN(0,

βn2γ−1) denotes a Gaussian random variable with mean zero and variance _β 1

n2γ−1. Then for all n≥1the measure Qn,β is absolutely continuous with density

exp₋nG( s n1−γ +m)

Rexp

−nG( s n1−γ +m)

ds, (3.42)

where G is defined in equation(3.32).

As shown in [11], Lemma 3.1, our condition (3.30) ensures that R

Rexp

−nG s n1−γ+m

ds is finite, such that the above density is well defined. The proof of the lemma can be found at many places, e.g. in[11], Lemma 3.3.

Proof of Lemma 3.5. We apply the Hubbard-Stratonovich transformation with γ= 1₋1/2k. It is clear that this does not change the finiteness of any of the moments of W. Using the Taylor ex-pansion (3.33) ofG, we see that the density ofQn,β with respect to Lebesgue measure is given by

Const. exp(₋x2k)(up to negligible terms, see e.g.[11],[6]). A measure with this density, of course, has moments of any finite order.

Since the symmetric Bernoulli law is GHS, Theorems 3.1 and 3.2 include Berry-Esseen type results for the classical model. However the limiting laws depend on moments ofW. Approximations with fixed limiting laws can be obtained in the classical case, since we can apply Corollary 2.9 and part (2) of Theorem 2.4:

Theorem 3.7 (classical Curie-Weiss model, Berry-Esseen bounds outside the critical temperature).

Let̺= 1

2δ−1+ 1

2δ1 and0< β <1. We have

sup

z∈R

S_n/pn_≤z

−Φ_β(z)

≤C n−1/2, (3.43) whereΦ_βdenotes the distribution function of the normal distribution with expectation zero and variance (1₋β)−1, and C is an absolute constant, depending onβ, only.

Theorem 3.8 (classical Curie-Weiss model, Berry-Esseen bounds at the critical temperature). Let ̺= 1₂δ₋1+1₂δ1 andβ=1. We have

sup

z∈R

S_n/n3/4≤z

−F(z)

≤C n−1/2, (3.44) where F(z):= _Z1Rz

−∞exp(−x

4_/₁₂₎_{d x, Z}_:₌R

Rexp(−x

4_/₁₂₎_{d x and C is an absolute constant.}

Berry-Esséen bounds for size-dependent temperatures for̺= 1₂δ₋1+1₂δ1 and 0< βn<∞

depend-ing on nin such a way that βn →1 monotonically as n→ ∞ can be formulated similarly to the

(16)

Remark 3.9. In [1], Barbour obtained distributional limit theorems, together with rates of con-vergence, for the equilibrium distributions of a variety of one-dimensional Markov population pro-cesses. In section 3 he mentioned, that his results can be interpreted in the framework of[11]. As far as we understand, his result (3.9) can be interpreted as the statement (3.44), but with the rate

n−1/4.

Proof of Theorem 3.1. Given̺which satisfies the GHS-inequality and letαbe the global minimum of typekand strengthµ(α)ofG_̺. In casek=1 it is known that the random variable _pSn

n converges

in distribution to a normal distribution N(0,σ2) withσ2 =µ(α)−1₋β−1 = (σ_̺−2−β)−1, see for example[9, V.13.15]. Hence in this case we will apply Corollary 2.10 (to obtain better constants for our Berry-Esséen bound in comparison to Theorem 2.5). Considerk≥1. We just treat the case

α=0 and denoteµ=µ(0). The more general case can be done analogously. Fork=1, we consider

ψ(x) =₋_σx2 withσ

2₌_µ−1

−β−1. For anyk_≥2 we considerψ(x) =₋₍₂ µ k−1)!x

2k−1_{. We define}

W:=Wk,n:=

n1−1/(2k) n X i=1

andW′, constructed as in the Introduction, such thatW₋W′= XI−X′I

n1−1/(2k). We obtain

E[W−W′|F] =1 nW−

n1−1/(2k)

n n X

i=1

E(X_i′|F).

Lemma 3.10. In the situation of Theorem 3.1, if X1is̺-a.s. bounded, we obtain

E₍_X′

i|F) = mi(X)−

βG

′

̺(β,mi(X))

1+_O(1/n) with m_i(X):= 1

n P

j6=iXj=m(X)− Xi

Proof. We compute the conditional densitygβ(x1|(Xi)i≥2)ofX1=x1given(Xi)i≥2under the

Curie-Weiss measure:

gβ(x1|(Xi)i≥2) =

eβ/2n(

i≥2x1Xi+

i6=j≥2XiXj+x21) R

eβ/2n(

i≥2x1Xi+

i6=j≥2XiXj+x 2 1)_̺₍_{d x}

= e

β/2n(P_i_≥₂x1Xi+x21) R

eβ/2n(

i≥2x1Xi+x21)̺(d x

Hence we can computeE[X₁′|F]as

E_[_X_1|F′ _{] =}

x1eβ/2n( P

i≥2x1Xi+x21)̺(_{d x}

eβ/2n(Pi≥2x1Xi+x21)̺(d x

Now, if|X1| ≤c̺-a.s

E_[_X′

1|F]≤

x1eβ/2n( P

i≥2x1Xi)_̺₍_{d x}

1)eβc

2_/₂_n R

eβ/2n(

i≥2x1Xi+x12)̺(_{d x}

1)e−βc

2_/₂_n

and

E_[_X_1|F′ _]_≥

x1eβ/2n( P

i≥2x1Xi)_̺₍_{d x}

1)e−βc

2_/₂_n R

eβ/2n(Pi≥2x1Xi+x21)̺(d x

1)eβc

(1)

type k_≥ 2. Thisβc is easily computed to be one. Indeed, µ1 =β−β2φ′′(0) =β−β2E̺(X12) =

β(1₋β), since̺ is centered and has variance one. Thus µ1 vanishes atβ =βc =1. Eventually forβ >1 there are again two minima which are solutions of

3β

tanh(p3βx) =βx +

x. Now again by Theorems 3.1 and 3.2 forβ <1 the rescaled magnetizationSn/pnobeys a Central Limit Theorem and the limiting variance is(1₋β)−1. Indeed, since E_̺₍_X2

1) = 1, µ1 = β−β2 and σ2 = ₁₋1_β. For β = βc = 1 the rescaled magnetizationSn/n7/8 converges in distribution to X which has the density f4,6/5,1. Indeed µ2 is computed to be −E̺(X14) +3E̺(X12) = −

9 5 +3 =

5. The rate is C

n1/4. If βn converges monotonically to 1 faster than n−3/4 then Sn

n7/8 converges in distribution to

F₄, whereas ifβn converges monotonically to 1 slower than n−3/4 then p

1−βnSn

n satisfies a Central Limit Theorem. Eventually, if_|1₋βn|=γn−3/4, Sn

n7/8 converges in distribution to the mixed density exp

−c_W−1

6 5

x8 8! −γ

x2 2

withc_W = ₆₍5_8!₎E₍_W8₎₋_γE₍_W2₎_{. Here the Berry-Esseen rate is} C n1/4.

5 Appendix

Proof of Lemma 2.2. Consider a probability density of the form p(x):=pk(x):= bkexp −akx2k

(5.49) withb_k=R_Rexp ₋a_kx2kd x. Clearly psatisfies Assumption (D). First we prove that the solutions fz of the Stein equation, which characterizes the distribution with respect to the density (5.49), satisfies Assumption (B2). Let f_z be the solution of f_z′(x) +ψ(x)f_z(x) =1_{_x_≤_z_}(x)₋P(z). Here

ψ(x) =₋2k a_kx2k−1. We have f_z(x) =

(1−P(z))P(x)exp(akx2k)b−_k1 forx ≤z, P(z) (1₋P(x))exp(akx2k)b−_k1 forx ≥z

(5.50)

withP(z):=Rz

−∞p(x)d x. Note that fz(x) = f−z(−x), so we need only to consider the casez≥0.

Forx >0 we obtain

1−P(x)_≤ bk

2k a_kx2k−1exp −akx

2k_, _(5.51)

whereas forx <0 we have

P(x)_≤ bk

2k a_k|x_|2k−1 exp −akx

2k_. _(5.52)

By partial integration we have

Z ∞

(2k₋1)

2k a_k t

−2k _exp

−akt2k

=₋ 1

2k a_kt2k−1exp −akt 2k

∞

x −

Z ∞

exp −akt2k

d t. Hence for anyx >0

b_k

2k a_kx2k+2k−1

(2)

With (5.51) we get forx >0 d

d x

exp a_kx2k

Z ∞

exp −a_kt2kd t

=₋1+2k a_kx2k−1exp a_kx2k

Z ∞

exp −a_kt2kd t<0. So exp a_kx2k R∞

x exp −akt

2k_{d t}_{attains its maximum at} _x₌_{0 and therefore} exp akx2k

Z ∞

exp −akt2k

d t≤ 1 2. Summarizing we obtain for x>0

1₋P(x)_≤min

₁

2, bk 2k akx2k−1

exp ₋a_kx2k. (5.54)

With (5.52) we get forx <0 d

d x

exp a_kx2k

Z x

−∞

exp −a_kt2kd t

=1+2k a_kx2k−1exp a_kx2k

Z x

−∞

exp −a_kt2kd t>0. So exp akx2k

−∞exp −akt

2k_{d t}_{attains its maximum at} _x ₌_{0 and therefore} exp a_kx2kb_k

Z x

−∞

exp ₋a_kt2kd t_≤ 1 2. Summarizing we obtain for x<0

P(x)_≤min

₁

bk 2k a_k_|x|2k−1

exp ₋a_kx2k. (5.55)

Applying (5.54) and (5.55) gives 0< f_z(x)_≤ ₂1_b

k for all x. Note that for x <0 we only have to

consider the first case of (5.50), sincez≥0. The constant 1

2bk is not optimal. Following the proof of

Lemma 2.2 in[5]or alternatively of Lemma 2 in [20, Lecture II]would lead to optimal constants. We omit this. It follows from (5.50) that

f_z′(x) =   

(1₋P(z))

1+x2k−12k a_kP(x)exp(a_kx2k)b−_k1

for x_≤z, P(z)

(1₋P(x))2k a_kx2k−1exp(a_kx2k)b−_k1₋1

for x_≥z.

(5.56)

With (5.51) we obtain for 0< x≤zthat f_z′(x)_≤(1−P(z))

z2k−12k a_kP(z)exp(a_kz2k)b−_k1

+1≤2.

The same argument for x ≥ z leads to|f_z′(x)_{| ≤}2. For x < 0 we use the first half of (5.50) and apply (5.52) to obtain_|f_z′(x)_{| ≤}2. Actually this bound will be improved later. Next we calculate the derivative of₋ψ(x)f_z(x):

(₋ψ(x)fz(x))′=

  

(1−P(z))

P(x)eakx2k

2k(2k₋1)a_kx2k−2+ (2k)2a2_kx4k−2

+2ka_kx2k−1b_k

, x _≤z, P(z)

(1₋P(x))eakx2k

2k(2k₋1)a_kx2k−2+ (2k)2a2_kx4k−2

−2ka_kx2k−1b_k

, x _≥z. (5.57)

(3)

With (5.53) we obtain(₋ψ(x)f_z(x))′ _≥0, so₋ψ(x)f_z(x) is an increasing function of x (remark that for x <0 we only have to consider the first half of (5.50)). Moreover with (5.51), (5.52) and (5.53) we obtain that

lim

x→−∞2k akx

2k−1_f

z(x) =P(z)−1 and _xlim

→∞2k akx

2k−1_f

z(x) =P(z). (5.58) Hence we have|2k a_kx2k−1f_z(x)_{| ≤}1 and|2k a_k x2k−1f_z(x)₋u2k−1f_z(u)_{| ≤}1 for any x andu. From (5.51) it follows that f_z′(x) >0 for all x <z and f_z′(x)< 0 for x > z. With Stein’s identity f_z′(x) = ₋ψ(x)fz(x) +1{x≤x}−P(z) and (5.58) we have 0< fz′(x) ≤ −ψ(z)fz(z) +1−P(z) < 1 for x <zand−1<−ψ(z)f_z(z)₋P(z)_≤ f_z′(x)<0 for x >z. Hence, for anyx and y, we obtain

|f_z′(x)_{| ≤}1 and |f_z′(x)₋f_z′(y)_{| ≤}max 1,−ψ(z)f_z(z) +1−P(z)₋(₋ψ(z)f_z(z)₋P(z))=1. Next we bound(₋ψ(x)fz(x))′. We already know that(−ψ(x)fz(x))′ >0. Again we apply (5.51) and (5.52) to see that(₋ψ(x)f_z(x))′_≤ 2k−1

|x| forx ≥z>0 and all x ≤0. For 0< x ≤zthis latter

bound holds, as can be seen by applying this bound (more precisely the bound for(₋ψ(x)fz(x))′ bk

P(z)

for x _≥ z) with ₋x for x to the formula for (ψ(x)f_z(x))′ in x _≤ z. For some constant c we can bound(ψ(x)fz(x))′bycfor all|x| ≥ 2k_c−1. Moreover, on[−2k_c−1,2k_c−1]the continuous function

(₋ψ(x)f_z(x))′is bounded by some constantd, hence we have proved_|−(ψ(x)f_z(x))′_{| ≤}max(c,d). The problem of finding the optimal constant, depending onk, is omitted. Summarizing, Assumption (B2) is fulfilled forpwithd2=d3=1 and some constantsd1andd4.

Next we consider an absolutely continuous functionh:R _→R. Let fhbe the solution of the Stein equation (2.6), that is

f_h(x) = 1

p(x) Z x

−∞

(h(t)₋Ph)p(t)d t=₋ 1

p(x)

Z _∞

(h(t)₋Ph)p(t)d t.

We adapt the proof of[5, Lemma 2.3]:Without loss of generality we assume thath(0) =0 and put e₀:=sup_x_|h(x)₋Ph_|ande₁:=sup_x_|h′(x)_|. From the definition of f_hit follows that_|f_h(x)_{| ≤}e₀ 1

2bk.

An alternative bound isc1e1with some constantc1 depending onE|Z|, where Zdenotes a random variable distributed according top. With (2.6) and (5.53), for x _≥0,

|f_h′(x)_{| ≤ |}h(x)₋Ph_{| −}ψ(x)eakx2k

Z ∞

|h(t)₋Ph_|e−akt2k_{d t}_≤_2e

An alternative bound isc2e1with some constantc2depending on the(2k−2)’th moment ofp. This would mean to Stein’s identity (2.6) to obtainf_h′(x) =₋eakx2kR∞

x (h′(t)−ψ′(t)f(t))e−

akt2k_{d t. The}

details are omitted. To bound the second derivative f_h′′, we differentiate (2.6) and have f_h′′(x) = ψ2(x)₋ψ′(x)f_h(x)₋ψ(x) h(x)₋Ph+h′(x).

Similarly to[5, (8.8), (8.9)]we obtain h(x)₋Ph= Rx −∞h

′₍_t₎_P₍_t₎_{d t}₋R∞

x h

′₍_t₎₍₁₋_P₍_t₎₎_{d t. It}

follows that f_h(x) =₋ 1

eakx2k₍₁₋_P₍_x₎₎

Z x

−∞

h′(t)P(t)d t₋ 1 bk

eakx2k_P₍_x₎

Z _∞

(4)

Now we apply the fact that the quantity in (5.57) is non-negative to obtain |f_h′′(x)_{| ≤ |}h′(x)_|+ ψ2(x)₋ψ′(x)f_h(x)₋ψ(x) h(x)₋Ph

≤ |h′(x)_|+

−ψ(x)₋ 1

b_k ψ 2₍_x₎

−ψ′(x)eakx2k₍₁₋_P₍_x₎₎

Z x

−∞

h′(t)P(t)d t

ψ(x)₋ 1

ψ2(x)₋ψ′(x)eakx2k_P₍_x₎

Z ∞

h′(t)(1₋P(t))d t

≤ |h′(x)_|+e1

ψ(x) + 1

b_k ψ 2₍_x₎

−ψ′(x)eakx2k₍₁₋_P₍_x₎₎

Z x

−∞

P(t)d t

+e₁

−ψ(x) + 1

ψ2(x)₋ψ′(x)eakx2k_P₍_x₎

Z ∞

(1₋P(t))d t. Moreover we know, that the quantity in (5.57) can be bounded by 2k−1

|x| , hence

|f_h′′(x)_{| ≤}e₁+e₁2bk(2k−1) |x_|

Z x

−∞

P(t)d t+

Z ∞

(1₋P(t))d t

. Now we bound

Z x

−∞

P(t)d t+

Z ∞

(1₋P(t))d t=x P(x)₋x(1₋P(x)) +2

Z ∞

t p(t)d t≤2_|x|+2E_|_Z_|, whereZ is distributed according top. Summarizing we have_|f_h′′(x)_{| ≤}c₃sup_x_|h′(x)_|for some con-stantc3, using the fact that fhand therefore f_h′andf_h′′are continuous. Hence fhsatisfies Assumption (B1).

Sketch of the proof of Remark 2.3. Now letp(x) =bkexp −akV(x)

andV satisfies the assumptions listed in Remark 2.3. To proof that f_z(with respect top) satisfies Assumption (B2), we adapt (5.53) as well as (5.54) and (5.55), using the assumptions onV. We obtain forx>0

b_k

_V′₍_x₎

V′′₍_x_{) +}_a

kV′(x)2

exp ₋a_kV(x)_≤1₋P(x). and forx >0

1₋P(x)_≤min

₁

2, bk akV′(x)

exp ₋a_kV(x)

and forx <0

P(x)_≤min

₁

2, bk ak|V′(x)|

exp ₋a_kV(x). Estimating(₋ψ(x)f_z(x))′ gives(₋ψ(x)f_z(x))′_≤const. V′′(x)

|V′₍_x₎_|. By our assumptions onV, the right

hand side can be bounded for x _≥d withd _∈R₊_{and since}_ψ(_x₎_f_z₍_x₎_{is continuous, it is bounded} everywhere.

(5)

References

[1] A. D. Barbour,Equilibrium distributions for Markov population processes, Adv. in Appl. Probab.

12(1980), no. 3, 591–614. MRMR578839

[2] M. Blume, V. J. Emery, and R. B. Griffiths,Ising model for theλtransition and phase separation in H e3–H e4mixtures, Phys. Rev. A4(1971), 1071–1077.

[3] S. Chatterjee, J. Fulman, and A. Röllin,Exponential approximation by Stein’s method and spec-tral graph theory, preprint, 2009.

[4] S. Chatterjee and Q.-M. Shao,Stein’s method of exchangeable pairs with application to the Curie-Weiss model, preprint, 2009.

[5] L. H. Y. Chen and Q.-M. Shao, Stein’s method for normal approximation, An introduction to Stein’s method, Lect. Notes Ser. Inst. Math. Sci. Natl. Univ. Singap., vol. 4, Singapore Univ. Press, Singapore, 2005, pp. 1–59. MR2235448

[6] P. Eichelsbacher and M. Löwe, Moderate deviations for the overlap parameter in the Hopfield model, Probab. Theory and Related Fields130(2004), no. 4, 441–472.

[7] P. Eichelsbacher and G. Reinert,Stein’s method for discrete Gibbs measures, Ann. Appl. Probab.

18(2008), no. 4, 1588–1618. MR2434182

[8] R. S. Ellis, Concavity of magnetization for a class of even ferromagnets, Bull. Amer. Math. Soc.

81(1975), no. 5, 925–929. MR0376052

[9] R. S. Ellis, Entropy, Large Deviations, and Statistical Mechanics, Springer-Verlag, New York, 1985.

[10] R. S. Ellis, J. L. Monroe, and C. M. Newman, Theghs and other correlation inequalities for a class of even ferromagnets, Comm. Math. Phys.46(1976), no. 2, 167–182. MR0395659

[11] R. S. Ellis and C. M. Newman,Limit theorems for sums of dependent random variables occurring in statistical mechanics, Z. Wahrsch. Verw. Gebiete44(1978), no. 2, 117–139.

[12] , Necessary and sufficient conditions for the GHS inequality with applications to analysis and probability, Trans. Amer. Math. Soc.237(1978), 83–99.

[13] R. S. Ellis, C.M. Newman, and J. S. Rosen,Limit theorems for sums of dependent random vari-ables occurring in statistical mechanics, II., Z. Wahrsch. Verw. Gebiete51 (1980), no. 2, 153– 169.

[14] R. B. Griffiths, C. A. Hurst, and S. Sherman,Concavity of magnetization of an Ising ferromagnet in a positive external field, J. Mathematical Phys.11(1970), 790–795.

[15] J.L. Lebowitz,GHS and other inequalities, Comm. Math. Phys.35(1974), 87–92. MR0339738

[16] Y. Rinott and V. Rotar,On coupling constructions and rates in the CLT for dependent summands with applications to the antivoter model and weighted U-statistics, Ann. Appl. Probab.7(1997), no. 4, 1080–1105. MR1484798

(6)

[17] Q.-M. Shao and Z.-G. Su,The Berry-Esseen bound for character ratios, Proc. Amer. Math. Soc.

134(2006), no. 7, 2153–2159 (electronic). MR2215787

[18] B. Simon and R. B. Griffiths, The (φ4)₂ field theory as a classical Ising model, Comm. Math. Phys.33(1973), 145–164. MR0428998

[19] ,The(φ4)₂ field theory as a classical Ising model, Comm. Math. Phys.33(1973), 145– 164. MR0428998

[20] C. Stein, Approximate computation of expectations, Institute of Mathematical Statistics Lec-ture Notes—Monograph Series, 7, Institute of Mathematical Statistics, Hayward, CA, 1986. MR882007

[21] C. Stein, P. Diaconis, S. Holmes, and G. Reinert, Use of exchangeable pairs in the analysis of simulations, Stein’s method: expository lectures and applications, IMS Lecture Notes Monogr. Ser., vol. 46, Inst. Math. Statist., Beachwood, OH, 2004, pp. 1–26. MR2118600

[22] C. J. Thompson, Mathematical statistical mechanics, The Macmillan Co., New York, 1972, A Series of Books in Applied Mathematics.

getdoc0c02. 292KB Jun 04 2011 12:04:04 AM

Stein’s method for dependent random variables occurring

in Statistical Mechanics

∗

Peter Eichelsbacher

_{Matthias Löwe}

1

Introduction

2

The exchangeable pair approach for distributional approximations

3

Berry-Esseen bounds for Curie-Weiss models

5

Appendix

References

Parts

Dokumen yang terkait

AN ALIS IS YU RID IS PUT USAN BE B AS DAL AM P E RKAR A TIND AK P IDA NA P E NY E RTA AN M E L AK U K A N P R AK T IK K E DO K T E RA N YA NG M E N G A K IB ATK AN M ATINYA P AS IE N ( PUT USA N N O MOR: 9 0/PID.B /2011/ PN.MD O)

ANALISIS FAKTOR YANGMEMPENGARUHI FERTILITAS PASANGAN USIA SUBUR DI DESA SEMBORO KECAMATAN SEMBORO KABUPATEN JEMBER TAHUN 2011

EFEKTIVITAS PENDIDIKAN KESEHATAN TENTANG PERTOLONGAN PERTAMA PADA KECELAKAAN (P3K) TERHADAP SIKAP MASYARAKAT DALAM PENANGANAN KORBAN KECELAKAAN LALU LINTAS (Studi Di Wilayah RT 05 RW 04 Kelurahan Sukun Kota Malang)

FAKTOR Â– FAKTOR YANG MEMPENGARUHI PENYERAPAN TENAGA KERJA INDUSTRI PENGOLAHAN BESAR DAN MENENGAH PADA TINGKAT KABUPATEN / KOTA DI JAWA TIMUR TAHUN 2006 - 2011

A DISCOURSE ANALYSIS ON “SPA: REGAIN BALANCE OF YOUR INNER AND OUTER BEAUTY” IN THE JAKARTA POST ON 4 MARCH 2011

Pengaruh kualitas aktiva produktif dan non performing financing terhadap return on asset perbankan syariah (Studi Pada 3 Bank Umum Syariah Tahun 2011 – 2014)

Pengaruh pemahaman fiqh muamalat mahasiswa terhadap keputusan membeli produk fashion palsu (study pada mahasiswa angkatan 2011 & 2012 prodi muamalat fakultas syariah dan hukum UIN Syarif Hidayatullah Jakarta)

Pendidikan Agama Islam Untuk Kelas 3 SD Kelas 3 Suyanto Suyoto 2011

ANALISIS NOTA KESEPAHAMAN ANTARA BANK INDONESIA, POLRI, DAN KEJAKSAAN REPUBLIK INDONESIA TAHUN 2011 SEBAGAI MEKANISME PERCEPATAN PENANGANAN TINDAK PIDANA PERBANKAN KHUSUSNYA BANK INDONESIA SEBAGAI PIHAK PELAPOR

KOORDINASI OTORITAS JASA KEUANGAN (OJK) DENGAN LEMBAGA PENJAMIN SIMPANAN (LPS) DAN BANK INDONESIA (BI) DALAM UPAYA PENANGANAN BANK BERMASALAH BERDASARKAN UNDANG-UNDANG RI NOMOR 21 TAHUN 2011 TENTANG OTORITAS JASA KEUANGAN

Dukungan

Links

getdoc0c02. 292KB Jun 04 2011 12:04:04 AM

Stein’s method for dependent random variables occurring

in Statistical Mechanics

∗

Peter Eichelsbacher

Matthias Löwe

1

Introduction

2

The exchangeable pair approach for distributional approximations

3

Berry-Esseen bounds for Curie-Weiss models

5

Appendix

References

Parts

Dokumen yang terkait

AN ALIS IS YU RID IS PUT USAN BE B AS DAL AM P E RKAR A TIND AK P IDA NA P E NY E RTA AN M E L AK U K A N P R AK T IK K E DO K T E RA N YA NG M E N G A K IB ATK AN M ATINYA P AS IE N ( PUT USA N N O MOR: 9 0/PID.B /2011/ PN.MD O)

ANALISIS FAKTOR YANGMEMPENGARUHI FERTILITAS PASANGAN USIA SUBUR DI DESA SEMBORO KECAMATAN SEMBORO KABUPATEN JEMBER TAHUN 2011

EFEKTIVITAS PENDIDIKAN KESEHATAN TENTANG PERTOLONGAN PERTAMA PADA KECELAKAAN (P3K) TERHADAP SIKAP MASYARAKAT DALAM PENANGANAN KORBAN KECELAKAAN LALU LINTAS (Studi Di Wilayah RT 05 RW 04 Kelurahan Sukun Kota Malang)

FAKTOR Â– FAKTOR YANG MEMPENGARUHI PENYERAPAN TENAGA KERJA INDUSTRI PENGOLAHAN BESAR DAN MENENGAH PADA TINGKAT KABUPATEN / KOTA DI JAWA TIMUR TAHUN 2006 - 2011

A DISCOURSE ANALYSIS ON “SPA: REGAIN BALANCE OF YOUR INNER AND OUTER BEAUTY” IN THE JAKARTA POST ON 4 MARCH 2011

Pengaruh kualitas aktiva produktif dan non performing financing terhadap return on asset perbankan syariah (Studi Pada 3 Bank Umum Syariah Tahun 2011 – 2014)

Pengaruh pemahaman fiqh muamalat mahasiswa terhadap keputusan membeli produk fashion palsu (study pada mahasiswa angkatan 2011 & 2012 prodi muamalat fakultas syariah dan hukum UIN Syarif Hidayatullah Jakarta)

Pendidikan Agama Islam Untuk Kelas 3 SD Kelas 3 Suyanto Suyoto 2011

ANALISIS NOTA KESEPAHAMAN ANTARA BANK INDONESIA, POLRI, DAN KEJAKSAAN REPUBLIK INDONESIA TAHUN 2011 SEBAGAI MEKANISME PERCEPATAN PENANGANAN TINDAK PIDANA PERBANKAN KHUSUSNYA BANK INDONESIA SEBAGAI PIHAK PELAPOR

KOORDINASI OTORITAS JASA KEUANGAN (OJK) DENGAN LEMBAGA PENJAMIN SIMPANAN (LPS) DAN BANK INDONESIA (BI) DALAM UPAYA PENANGANAN BANK BERMASALAH BERDASARKAN UNDANG-UNDANG RI NOMOR 21 TAHUN 2011 TENTANG OTORITAS JASA KEUANGAN

Dokumen yang Anda mencari sudah siap untuk unduhkan

_{Matthias Löwe}