Electronic Journal of Probability, Vol. 15 (2010), Paper no. 55, pages 1703–1742.
Journal URL: http://www.math.washington.edu/~ejpecp/
Stein’s method and stochastic analysis of Rademacher functionals

Ivan Nourdin∗, Université Paris VI
Giovanni Peccati†, Université Paris Ouest
Gesine Reinert‡, Oxford University
Abstract
We compute explicit bounds in the Gaussian approximation of functionals of infinite Rademacher sequences. Our tools involve Stein’s method, as well as the use of appropriate discrete Malliavin operators. As the bounds are given in terms of Malliavin operators, no coupling construction is required. When the functional depends only on the first d coordinates of the Rademacher sequence, a simple sufficient condition for convergence to a normal distribution is derived. For finite quadratic forms, we obtain necessary and sufficient conditions.
Although our approach does not require the classical use of exchangeable pairs, when the functional depends only on the first d coordinates of the Rademacher sequence we employ chaos expansion in order to construct an explicit exchangeable pair vector; the elements of the vector relate to the summands in the chaos decomposition and satisfy a linearity condition for the conditional expectation.
Among several examples, such as random variables which depend on infinitely many Rademacher variables, we provide three main applications: (i) to CLTs for multilinear forms belonging to a fixed chaos, (ii) to the Gaussian approximation of weighted infinite 2-runs, and (iii) to the computation of explicit bounds in CLTs for multiple integrals over sparse sets. This last application provides an alternate proof (and several refinements) of a recent result by Blei and Janson.
∗ Laboratoire de Probabilités et Modèles Aléatoires, Université Pierre et Marie Curie (Paris VI), Boîte courrier 188, 4 place Jussieu, 75252 Paris Cedex 05, France. Email: [email protected]
† Equipe Modal’X, Université Paris Ouest – Nanterre la Défense, 200 Avenue de la République, 92000 Nanterre, and LSTA, Université Paris VI, France. Email: [email protected]
‡ Department of Statistics, University of Oxford, 1 South Parks Road, Oxford OX1 3TG, UK. Email:
Key words: Central Limit Theorems; Discrete Malliavin operators; Normal approximation; Rademacher sequences; Sparse sets; Stein’s method; Walsh chaos.
AMS 2000 Subject Classification: Primary 60F05; 60F99; 60G50; 60H07.
Submitted to EJP on July 6, 2009, final version accepted September 11, 2010.
1 Introduction
The connection between Stein’s method (see e.g. [8, 36, 43, 44]) and the integration by parts formulae of stochastic analysis has been the object of a recent and quite fruitful study. The papers [25, 26, 27] deal with Stein’s method and Malliavin calculus (see e.g. [29]) in a Gaussian setting; in [28] one can find extensions involving density estimates and concentration inequalities; the paper [31] is devoted to explicit bounds, obtained by combining Malliavin calculus and Stein’s method in the framework of Poisson measures. Note that all these references contain examples and applications that were previously outside the scope of Stein’s method, as they involve functionals of infinite-dimensional random fields for which the usual Stein-type techniques (such as exchangeable pairs, zero-bias transforms or size-bias couplings), which involve picking a random index from a finite set, seem inappropriate.
The aim of this paper is to push this line of research one step further, by combining Stein’s method with an appropriate discrete version of Malliavin calculus, in order to study the normal approximation of functionals of an infinite Rademacher sequence. By this expression we simply mean a sequence X = {X_n : n ≥ 1} of i.i.d. random variables such that P(X_1 = 1) = P(X_1 = −1) = 1/2.
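The basic object of the paper can be sketched in a few lines of Python; the helper name `rademacher` is ours, not the paper’s, and the sample size is arbitrary:

```python
import random

def rademacher(n, rng):
    """Draw X_1, ..., X_n i.i.d. with P(X_k = 1) = P(X_k = -1) = 1/2."""
    return [rng.choice((-1, 1)) for _ in range(n)]

rng = random.Random(0)                         # fixed seed, for reproducibility
xs = rademacher(10_000, rng)
emp_mean = sum(xs) / len(xs)                   # ≈ E[X_1] = 0
emp_var = sum(x * x for x in xs) / len(xs)     # X_k² ≡ 1, so this is exactly 1
```

The identity X_k² = 1 (visible in `emp_var`) is used repeatedly below, e.g. in the product formula of Proposition 2.9.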
A full treatment of the discrete version of Malliavin calculus adopted in this paper can be found in [34] or [35]; Section 2.5 below provides a short introduction to the main objects and results. The main features of our approach are the following:
(i) We will be able to deal directly with the normal approximation of random variables possibly depending on the whole infinite sequence X. In particular, our results apply to random elements not necessarily having the form of partial sums.
(ii) We will obtain bounds expressed in terms of the kernels appearing in the chaotic decomposition of a given square-integrable random variable. Note that every square-integrable functional of X admits a chaotic decomposition into an orthogonal sum of multiple integrals.

(iii) We employ the chaotic expansion in order to construct an explicit exchangeable pair vector for any random variable which depends on a finite set of Rademacher variables. In particular, this exchangeable pair satisfies a linearity condition in conditional expectation exactly, which allows the normal approximation results from Reinert and Röllin [37] to be applied.
(iv) In the particular case of random variables belonging to a fixed chaos, that is, having the form of a (possibly infinite) series of multilinear forms of a fixed order, we will express our bounds in terms of norms of contraction operators (see Section 2.2). These objects also play a central role in the normal and Gamma approximations of functionals of Gaussian fields (see [25, 26, 27, 30]) and Poisson measures (see [31, 32, 33]). In this paper, we shall consider kernels defined on discrete sets, and the corresponding contraction norms are integrals with respect to appropriate tensor products of the counting measure.
The use of contraction operators in the normal approximation of Rademacher functionals seems to us a truly promising direction for further research. One striking application of these techniques is given in Section 6, where we deduce explicit bounds (as well as a new proof) for a combinatorial central limit theorem (CLT) recently proved by Blei and Janson in [5], involving sequences of multiple integrals over finite “sparse” sets. In Blei and Janson’s original proof, the required sparseness condition emerges as an efficient way of re-expressing moment computations related to martingale CLTs. In contrast, our approach does not involve martingales or moments, and (quite surprisingly) the correct sparseness condition stems naturally from the definition of contraction. This moreover yields an explicit upper bound on an adequate distance between probability measures. Another related work is the paper by Mossel et al. [24], where the authors prove a general invariance principle, allowing one to compare the distribution of polynomial forms constructed from different collections of i.i.d. random variables. In particular, in [24] a crucial role is played by kernels “with low influences”: we will point out in Section 4 that influence indices are an important component of our bounds for the normal approximation of elements of a fixed chaos. Note that influence indices already appear (under a different name) in the papers by Rotar’ [40, 41]. In Section 6.3, we also show that the combination of the findings of the present paper with those of [24] yields an elegant answer to a question left open by Blei and Janson in [5].
A further important point is that our results do not hinge at all on the fact that N is an ordered set. This is one of the advantages both of Stein’s method and of the Malliavin-type, filtration-free stochastic analysis. It follows that our findings can be directly applied to i.i.d. Rademacher families indexed by an arbitrary discrete set.
The reader is referred to Chatterjee [6] for another Stein-type approach to the normal approximation of functionals of finite i.i.d. vectors. Some connections between our findings and the results proved in [6] (in the special case of quadratic functionals) are pointed out in Section 4.2. Indeed, there is a long history of normal approximations for finite vectors; notably, Hoeffding decompositions, also in conjunction with truncations such as Hajek’s projection, can yield optimal Berry–Esséen bounds; see [3] and [18], as well as [46], for some seminal papers. A connection between jackknife and Hoeffding decompositions is explored for example in [13]. The connection between chaos decompositions and Hoeffding decompositions is detailed in Subsection 2.4. In contrast, our approach allows us to treat functionals of infinite sequences directly.
The paper is organized as follows. Section 2 contains results on multiple integrals, Stein’s method and discrete Malliavin calculus (in particular, a new chain rule is proved). Section 3 provides general bounds for the normal approximation of Rademacher functionals; it gives first examples and the exchangeable pair construction. In Section 4 we are interested in random variables belonging to a fixed chaos. Section 5 focuses on sums of single and double integrals and related examples. Section 6 is devoted to further applications and refinements, mainly involving integrals over sparse sets. Technical proofs are deferred to the Appendix.
2 Framework and main tools

2.1 The setup
As before, we shall denote by X = {X_n : n ≥ 1} a standard Rademacher sequence. This means that X is a collection of i.i.d. random variables, defined on a probability space (Ω, F, P) and such that P(X_1 = 1) = P(X_1 = −1) = 1/2. To simplify the subsequent discussion, for the rest of the paper we shall suppose that Ω = {−1, 1}^N and P = [½{δ_{−1} + δ_1}]^{⊗N}. Also, X will be defined as the canonical process, that is: for every ω = (ω_1, ω_2, ...) ∈ Ω, X_n(ω) = ω_n. Starting from X, for every N ≥ 1 one can build a random signed measure µ_{(X,N)} on {1, ..., N}, defined for every A ⊂ {1, ..., N} as follows:

µ_{(X,N)}(A) = Σ_{j∈A} X_j.
It is clear that σ{X} = σ{µ_{(X,N)} : N ≥ 1}. We shall sometimes omit the index N and write µ_{(X,N)} = µ_X, whenever the domain of integration is clear from the context.
We define the set D = {(i_1, i_2) ∈ N² : i_1 = i_2} to be the diagonal of N². For n ≥ 2, we put

∆_n = {(i_1, ..., i_n) ∈ N^n : the i_j’s are all different},   (2.1)

and, for N, n ≥ 2,

∆_n^N = ∆_n ∩ {1, ..., N}^n   (2.2)

(so that ∆_n^N = ∅ if N < n). The random signed measures µ_{(X,N)} satisfy that, for every A, B ⊂ {1, ..., N},

µ_{(X,N)}^{⊗2}([A × B] ∩ D) = Σ_j X_j² 1_{j∈A} 1_{j∈B} = κ(A ∩ B) = ♯{j : j ∈ A ∩ B},   (2.3)

where κ is the counting measure on N. The application A ↦ µ_{(X,N)}^{⊗2}([A × A] ∩ D) is called the diagonal measure associated with µ_{(X,N)}. The above discussion simply says that, for every N ≥ 1, the diagonal measure associated with µ_{(X,N)} is the restriction on {1, ..., N} of the counting measure κ. Note that, for finite sets A, B, one also has

E[µ_X(A) µ_X(B)] = κ(A ∩ B).
Remark 2.1. In the terminology e.g. of [33] and [45], the collection of random variables {µ_{(X,N)}(A) : A ⊂ {1, ..., N}, N ≥ 1} defines an independently scattered random measure (also called a completely random measure) on N, with control measure equal to κ. In particular, if A_1, ..., A_k is a collection of mutually disjoint finite subsets of N, then the random variables µ_X(A_1), ..., µ_X(A_k) are mutually independent.
2.2 The star notation
For n ≥ 1, we denote by ℓ²(N)^{⊗n} the class of the kernels (= functions) on N^n that are square integrable with respect to κ^{⊗n}; ℓ²(N)^{◦n} is the subset of ℓ²(N)^{⊗n} composed of symmetric kernels; ℓ²_0(N)^{⊗n} is the subset of ℓ²(N)^{⊗n} composed of kernels vanishing on diagonals, that is, vanishing on the complement of ∆_n; ℓ²_0(N)^{◦n} is the subset of ℓ²_0(N)^{⊗n} composed of symmetric kernels.
For every n, m ≥ 1, every r = 0, ..., n∧m, every l = 0, ..., r, and every f ∈ ℓ²_0(N)^{◦n} and g ∈ ℓ²_0(N)^{◦m}, we denote by f ⋆_r^l g the function of n+m−r−l variables obtained as follows: r variables are identified and, among these, l are integrated out with respect to the counting measure κ. Explicitly,

f ⋆_r^l g(i_1, ..., i_{n+m−r−l}) = Σ_{a_1,...,a_l} f(i_1, ..., i_{n−r}; i_{n−r+1}, ..., i_{n−l}; a_1, ..., a_l) × g(i_{n−l+1}, ..., i_{n+m−r−l}; i_{n−r+1}, ..., i_{n−l}; a_1, ..., a_l);

note that, since f and g vanish on diagonals by assumption, the sum actually runs over all vectors (a_1, ..., a_l) ∈ ∆_l. For instance,

f ⋆_0^0 g(i_1, ..., i_{n+m}) = f ⊗ g(i_1, ..., i_{n+m}) = f(i_1, ..., i_n) g(i_{n+1}, ..., i_{n+m})

and, for r ∈ {1, ..., n∧m},

f ⋆_r^r g(i_1, ..., i_{n+m−2r}) = Σ_{a_1,...,a_r} f(i_1, ..., i_{n−r}; a_1, ..., a_r) g(i_{n−r+1}, ..., i_{n+m−2r}; a_1, ..., a_r).

In particular, if r = m = n,

f ⋆_m^m g = Σ_{a_1,...,a_m} f(a_1, ..., a_m) g(a_1, ..., a_m) = ⟨f, g⟩_{ℓ²(N)^{⊗m}}.
Example 2.2. If n = m = 2, one has

f ⋆_0^0 g(i_1, i_2, i_3, i_4) = f(i_1, i_2) g(i_3, i_4);   f ⋆_1^0 g(i_1, i_2, i_3) = f(i_1, i_2) g(i_3, i_2);
f ⋆_1^1 g(i_1, i_2) = Σ_a f(i_1, a) g(i_2, a);   f ⋆_2^0 g(i_1, i_2) = f(i_1, i_2) g(i_1, i_2);
f ⋆_2^1 g(i) = Σ_a f(i, a) g(i, a);   f ⋆_2^2 g = Σ_{a_1,a_2} f(a_1, a_2) g(a_1, a_2).
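For finitely supported kernels the contractions of Example 2.2 can be computed directly. The helper `star_rr` below is our own sketch (it handles the case l = r only, with kernel values chosen arbitrarily); the last computation also illustrates Remark 2.3 below, since f ⋆_1^1 f does not vanish on the diagonal:

```python
import itertools

def star_rr(f, g, n, m, r, support):
    """Contraction f ⋆_r^r g of kernels on N^n and N^m, stored as dicts keyed
    by sorted index tuples (missing keys mean 0): identify r variables of f
    and g and sum them out over `support`."""
    out = {}
    for free in itertools.product(support, repeat=n + m - 2 * r):
        i_f, i_g = free[:n - r], free[n - r:]
        out[free] = sum(
            f.get(tuple(sorted(i_f + a)), 0.0) * g.get(tuple(sorted(i_g + a)), 0.0)
            for a in itertools.product(support, repeat=r)
        )
    return out

# a symmetric kernel on N², vanishing on the diagonal (values are arbitrary)
S = (1, 2, 3)
f = {(1, 2): 1.0, (1, 3): 2.0, (2, 3): 0.5}

c22 = star_rr(f, f, 2, 2, 2, S)   # f ⋆_2^2 f = <f, f> = Σ f(a1, a2)² = 10.5
c11 = star_rr(f, f, 2, 2, 1, S)   # f ⋆_1^1 f(i1, i2) = Σ_a f(i1, a) f(i2, a)
# c11[(1, 1)] = Σ_a f(1, a)² = 5.0 ≠ 0: the contraction leaves the class ℓ²_0
```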
Remark 2.3. In general, a kernel of the type f ⋆_r^l g may be neither symmetric nor vanishing on diagonal sets.
The next two lemmas collect some useful facts for subsequent sections. Their proofs are postponed to the appendix.
Lemma 2.4. Let f ∈ ℓ²_0(N)^{◦n}, g ∈ ℓ²_0(N)^{◦m} (n, m ≥ 1) and 0 ≤ l ≤ r ≤ n∧m. Then:

1. f ⋆_r^l g ∈ ℓ²(N)^{⊗(n+m−r−l)} and ‖f ⋆_r^l g‖_{ℓ²(N)^{⊗(n+m−r−l)}} ≤ ‖f‖_{ℓ²(N)^{⊗n}} ‖g‖_{ℓ²(N)^{⊗m}};

2. if n ≥ 2, then

max_{j∈N} ( Σ_{(b_1,...,b_{n−1})∈N^{n−1}} f²(j, b_1, ..., b_{n−1}) )² ≤ ‖f ⋆_n^{n−1} f‖²_{ℓ²(N)} ≤ ‖f‖²_{ℓ²(N)^{⊗n}} × max_{j∈N} Σ_{(b_1,...,b_{n−1})∈N^{n−1}} f²(j, b_1, ..., b_{n−1})

and, if l = 1, ..., n,

‖f ⋆_l^{l−1} g‖²_{ℓ²(N)^{⊗(n+m−2l+1)}} = Σ_{j=1}^∞ ‖f(j, ·) ⋆_{l−1}^{l−1} g(j, ·)‖²_{ℓ²(N)^{⊗(n+m−2l)}} ≤ ‖f ⋆_n^{n−1} f‖_{ℓ²(N)} × ‖g‖²_{ℓ²(N)^{⊗m}}.   (2.4)

3. ‖f ⋆_1^0 f‖_{ℓ²(N)^{⊗(2n−1)}} = ‖f ⋆_n^{n−1} f‖_{ℓ²(N)} and, for every l = 2, ..., n (n ≥ 2),

‖f ⋆_l^{l−1} f‖_{ℓ²(N)^{⊗(2n−2l+1)}} ≤ ‖(f ⋆_{l−1}^{l−1} f) × 1_{∆^c_{2(n−l+1)}}‖_{ℓ²(N)^{⊗(2n−2l+2)}} ≤ ‖f ⋆_{l−1}^{l−1} f‖_{ℓ²(N)^{⊗(2n−2l+2)}}.

Here, ∆^c_q stands for the complement of the set ∆_q; that is, ∆^c_q is the collection of all vectors (i_1, ..., i_q) such that i_k = i_l for at least one pair (k, l) with k ≠ l.
Remark 2.5. 1. According e.g. to [24], the quantity

Σ_{(b_1,...,b_{n−1})∈N^{n−1}} f²(j, b_1, ..., b_{n−1})

is called the influence of the j-th coordinate on f.

2. When specializing Lemma 2.4 to the case n = 2, one gets

max_{j∈N} ( Σ_{i∈N} f²(i, j) )² ≤ ‖f ⋆_2^1 f‖²_{ℓ²(N)}   (2.5)
= ‖(f ⋆_1^1 f) × 1_{∆^c_2}‖²_{ℓ²(N)^{⊗2}} ≤ ‖f ⋆_1^1 f‖²_{ℓ²(N)^{⊗2}} = Trace{[f]⁴}.

Here, [f] denotes the infinite array {f(i, j) : i, j ∈ N} and [f]⁴ stands for the fourth power of [f] (regarded as the kernel of a Hilbert–Schmidt operator).
Lemma 2.6. Fix n, m ≥ 1, and let f_k ∈ ℓ²_0(N)^{◦n} and g_k ∈ ℓ²_0(N)^{◦m}, k ≥ 1, be such that f_k → f and g_k → g, as k → ∞, in ℓ²_0(N)^{◦n} and ℓ²_0(N)^{◦m}, respectively. Then, for every r = 0, ..., n∧m, f_k ⋆_r^r g_k → f ⋆_r^r g in ℓ²(N)^{⊗(n+m−2r)} as k → ∞.
2.3 Multiple integrals, chaos and product formulae
Following e.g. [35], for every q ≥ 1 and every f ∈ ℓ²_0(N)^{◦q} we denote by J_q(f) the multiple integral (of order q) of f with respect to X. Explicitly,

J_q(f) = Σ_{(i_1,...,i_q)∈N^q} f(i_1, ..., i_q) X_{i_1} · · · X_{i_q} = Σ_{(i_1,...,i_q)∈∆_q} f(i_1, ..., i_q) X_{i_1} · · · X_{i_q}   (2.6)
= q! Σ_{i_1<...<i_q} f(i_1, ..., i_q) X_{i_1} · · · X_{i_q},

where the possibly infinite sum converges in L²(Ω). Note that, if f has support contained in some finite set {1, ..., N}^q, then J_q(f) is a true integral with respect to the product signed measure µ_{(X,N)}^{⊗q}, that is,

J_q(f) = ∫_{{1,...,N}^q} f dµ_{(X,N)}^{⊗q} = ∫_{∆_q^N} f dµ_{(X,N)}^{⊗q}.

One customarily sets ℓ²(N)^{◦0} = R, and J_0(c) = c for all c ∈ R. It is known (see again [35]) that, if f ∈ ℓ²_0(N)^{◦q} and g ∈ ℓ²_0(N)^{◦p}, then one has the isometric relation

E[J_q(f) J_p(g)] = 1_{q=p} q! ⟨f, g⟩_{ℓ²(N)^{⊗q}}.   (2.7)

In particular,

E[J_q(f)²] = q! ‖f‖²_{ℓ²(N)^{⊗q}}.   (2.8)

The collection of all random variables of the type J_q(f), where f ∈ ℓ²_0(N)^{◦q}, is called the q-th chaos associated with X.
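For kernels supported on a finite set {1, ..., N}^q, both J_q(f) and the isometry (2.7)–(2.8) can be checked by exact enumeration over {−1, +1}^N. The following sketch uses arbitrary kernel values and 0-based indices:

```python
import itertools
from math import factorial, prod

def J(f, omega):
    """Multiple integral J_q(f)(ω) = Σ_i f(i) ω_{i_1}···ω_{i_q}, for a kernel f
    given as {index tuple: value} (0-based indices, vanishing on diagonals)."""
    return sum(v * prod(omega[i] for i in idx) for idx, v in f.items())

def expectation(rv, N):
    """Exact E[rv(X)] over the uniform law on {-1, +1}^N."""
    return sum(rv(w) for w in itertools.product((-1, 1), repeat=N)) / 2 ** N

# a symmetric order-2 kernel vanishing on the diagonal (values are arbitrary)
f2 = {(0, 1): 1.0, (1, 0): 1.0, (0, 2): -0.5, (2, 0): -0.5}
sq_norm = sum(v * v for v in f2.values())          # ‖f‖²_{ℓ²(N)⊗2} = 2.5

# isometry (2.8): E[J_2(f)²] = 2! ‖f‖² = 5
lhs = expectation(lambda w: J(f2, w) ** 2, 3)
assert abs(lhs - factorial(2) * sq_norm) < 1e-12

# orthogonality (2.7): chaoses of different orders are uncorrelated
g1 = {(0,): 2.0, (2,): 3.0}                        # an order-1 kernel
assert abs(expectation(lambda w: J(f2, w) * J(g1, w), 3)) < 1e-12
```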
Remark 2.7. Popular names for random variables of the type J_q(f) are Walsh chaos (see e.g. [22, Ch. IV]) and Rademacher chaos (see e.g. [5, 12, 21]). In Meyer’s monograph [23] the collection {J_q(f) : f ∈ ℓ²_0(N)^{◦q}, q ≥ 0} is called the toy Fock space associated with X.
Recall (see [35]) that one has the following chaotic decomposition: for every F ∈ L²(σ{X}) (that is, for every square integrable functional of the sequence X) there exists a unique sequence of kernels f_n ∈ ℓ²_0(N)^{◦n}, n ≥ 1, such that

F = E(F) + Σ_{n≥1} J_n(f_n) = E(F) + Σ_{n≥1} n! Σ_{i_1<i_2<...<i_n} f_n(i_1, ..., i_n) X_{i_1} · · · X_{i_n},   (2.9)

where the second equality follows from the definition of J_n(f_n), and the series converge in L².
Remark 2.8. Relation (2.9) is equivalent to the statement that the set

{1} ∪ ⋃_{n≥1} {X_{i_1} · · · X_{i_n} : 1 ≤ i_1 < ... < i_n}   (2.10)

is an orthonormal basis of L²(σ{X}). An immediate proof of this fact can be deduced from basic harmonic analysis. Indeed, it is well known that the set Ω = {−1, +1}^N, when endowed with the product structure of coordinate multiplication, is a compact Abelian group (known as the Cantor group), whose unique normalized Haar measure is the law of X. Relation (2.9) then follows from the fact that the dual of Ω consists exactly of the mappings ω ↦ 1 and ω ↦ X_{i_1}(ω) · · · X_{i_n}(ω), i_n > ... > i_1 ≥ 1, n ≥ 1. See e.g. [4, Section VII.2].
Given a kernel f on N^n, we denote by f~ its canonical symmetrization, that is,

f~(i_1, ..., i_n) = (1/n!) Σ_σ f(i_{σ(1)}, ..., i_{σ(n)}),

where σ runs over the n! permutations of {1, ..., n}. The following result is a standard multiplication formula between multiple integrals of possibly different orders. Since we did not find this result in the literature (see, however, [35, formula (15)]), we provide a simple combinatorial proof in Section 6.3.
Proposition 2.9. For every n, m ≥ 1, every f ∈ ℓ²_0(N)^{◦n} and g ∈ ℓ²_0(N)^{◦m}, one has that

J_n(f) J_m(g) = Σ_{r=0}^{n∧m} r! \binom{n}{r} \binom{m}{r} J_{n+m−2r}[ (f ⋆_r^r g)~ 1_{∆_{n+m−2r}} ].   (2.11)
Remark 2.10. Proposition 2.9 yields that the product of two multiple integrals is a linear combination of square-integrable random variables. By induction, this implies in particular that, for every f ∈ ℓ²_0(N)^{◦n} and every k ≥ 1, E|J_n(f)|^k < ∞, that is, the elements belonging to a given chaos have finite moments of all orders. This fact can alternatively be deduced from standard hypercontractive inequalities, see e.g. Theorem 3.2.1 in [12] or [21, Ch. 6]. Note that similar results hold for random variables belonging to a fixed Wiener chaos of a Gaussian field (see e.g. [19, Ch. V]).
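In the simplest case n = m = 1, formula (2.11) reads J_1(f)J_1(g) = J_2((f ⊗ g)~ 1_{∆_2}) + ⟨f, g⟩, and it holds pathwise, as a consequence of X_i² = 1. A small numeric sketch (kernel values arbitrary):

```python
import itertools

# kernels on {0, 1, 2} (values are arbitrary)
f = [1.0, -2.0, 0.5]
g = [0.5, 1.0, 3.0]
N = 3

max_err = 0.0
for w in itertools.product((-1, 1), repeat=N):
    J1f = sum(f[i] * w[i] for i in range(N))
    J1g = sum(g[i] * w[i] for i in range(N))
    # r = 0 term of (2.11): J_2 of the symmetrized tensor product, off the diagonal
    J2 = sum(0.5 * (f[i] * g[j] + f[j] * g[i]) * w[i] * w[j]
             for i in range(N) for j in range(N) if i != j)
    # r = 1 term: 1!·1·1·J_0(f ⋆_1^1 g) = <f, g>_{ℓ²(N)}
    inner = sum(f[i] * g[i] for i in range(N))
    max_err = max(max_err, abs(J1f * J1g - (J2 + inner)))
assert max_err < 1e-12   # (2.11) holds at every ω here, since X_i² = 1
```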
2.4 Finding a chaotic decomposition
The second equality in (2.9) clearly implies that, for every i_1 < ... < i_n,

n! f_n(i_1, ..., i_n) = E(F × X_{i_1} · · · X_{i_n}).
In what follows, we will point out two alternative procedures yielding an explicit expression for the kernels f_n.
(i) Möbius inversion (Hoeffding decompositions). We start with the following observation: if F = F(X_1, ..., X_d) is a random variable that depends uniquely on the first d coordinates of the Rademacher sequence X, then necessarily F admits a finite chaotic decomposition of the form

F = E(F) + Σ_{n=1}^d Σ_{1≤i_1<...<i_n≤d} n! f_n(i_1, ..., i_n) X_{i_1} · · · X_{i_n}.   (2.12)

Note that the sum in the chaotic decomposition (2.12) must stop at d: indeed, the terms of order greater than d are equal to zero since, if not, they would involve e.g. products of the type X_{j_1} · · · X_{j_{d+1}} with all the j_a’s different, which would not be consistent with the fact that F is σ{X_1, ..., X_d}-measurable. Now note that one can rewrite (2.12) as F = E(F) + Σ_{I⊂{1,...,d}} G(I), where, for I := {i_1, ..., i_a},

G(I) = a! f_a(i_1, ..., i_a) X_{i_1} · · · X_{i_a}.

By exploiting independence, one obtains also that, for every J := {j_1, ..., j_n} ⊂ {1, ..., d},

H(J) := E[F − E(F) | X_{j_1}, ..., X_{j_n}] = Σ_{I⊂J} G(I),

thus yielding, by inversion (see e.g. [42, p. 116]), for every n = 1, ..., d,

n! f_n(j_1, ..., j_n) X_{j_1} · · · X_{j_n} = Σ_{{i_1,...,i_a}⊂{j_1,...,j_n}} (−1)^{n−a} E[F − E(F) | X_{i_1}, ..., X_{i_a}].   (2.13)

Formula (2.13) simply means that, for a fixed n, the sum of all random variables of the type n! f_n(j_1, ..., j_n) X_{j_1} · · · X_{j_n} coincides with the n-th term in the Hoeffding–ANOVA decomposition of F (see e.g. [20]). By a density argument (which is left to the reader), representation (2.13) extends indeed to random variables depending on the whole sequence X.
(ii) Indicator expansions. Assume that we can write F = F(X_1, ..., X_d) with F : {−1, +1}^d → R. Taking into account all the possibilities, we have

F = Σ_{(ε_1,...,ε_d)∈{−1,+1}^d} F(ε_1, ..., ε_d) Π_{i=1}^d 1_{X_i=ε_i}.

Now, the crucial point is the identity 1_{X_i=ε_i} = ½(1 + ε_i X_i). Hence

F = 2^{−d} Σ_{(ε_1,...,ε_d)∈{−1,+1}^d} F(ε_1, ..., ε_d) Π_{i=1}^d (1 + ε_i X_i).   (2.14)

But

Π_{i=1}^d (1 + ε_i X_i) = 1 + Σ_{1≤i_1≤d} ε_{i_1} X_{i_1} + Σ_{1≤i_1<i_2≤d} ε_{i_1} ε_{i_2} X_{i_1} X_{i_2} + Σ_{1≤i_1<i_2<i_3≤d} ε_{i_1} ε_{i_2} ε_{i_3} X_{i_1} X_{i_2} X_{i_3} + ... + ε_1 · · · ε_d X_1 · · · X_d;

inserting this in (2.14), one can deduce the chaotic expansion of F.
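For a functional of finitely many coordinates, the coefficients of (2.12) can be computed by the orthogonality relation n! f_n(i_1, ..., i_n) = E(F × X_{i_1} · · · X_{i_n}), and the finite expansion reconstructs F exactly. A sketch with an arbitrarily chosen F of three coordinates (0-based indices):

```python
import itertools
from math import prod

d = 3

def F(x):  # an arbitrary functional of (X_1, X_2, X_3)
    return x[0] * x[1] + max(x[1], x[2]) - 0.5 * x[0]

omegas = list(itertools.product((-1, 1), repeat=d))
E = lambda rv: sum(rv(w) for w in omegas) / 2 ** d

mean = E(F)
# coefficient of X_{i_1}···X_{i_n} in (2.12): n! f_n(i_1,...,i_n) = E[F·X_{i_1}···X_{i_n}]
coeff = {
    idx: E(lambda w: F(w) * prod(w[i] for i in idx))
    for n in range(1, d + 1)
    for idx in itertools.combinations(range(d), n)
}

# the finite chaotic decomposition (2.12) reconstructs F exactly
for w in omegas:
    rec = mean + sum(c * prod(w[i] for i in idx) for idx, c in coeff.items())
    assert abs(rec - F(w)) < 1e-12
```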
2.5 Discrete Malliavin calculus and a new chain rule
We will now define a set of discrete operators which are the analogues of the classic Gaussian-based Malliavin operators (see e.g. [19, 29]). The reader is referred to [34] and [35] for any unexplained notion and/or result.
The operator D, called the gradient operator, transforms random variables into random sequences. Its domain, denoted dom D, is given by the class of random variables F ∈ L²(σ{X}) such that the kernels f_n ∈ ℓ²_0(N)^{◦n} in the chaotic expansion F = E(F) + Σ_{n≥1} J_n(f_n) (see (2.9)) verify the relation

Σ_{n≥1} n n! ‖f_n‖²_{ℓ²(N)^{⊗n}} < ∞.
In particular, if F = F(X_1, ..., X_d) depends uniquely on the first d coordinates of X, then F ∈ dom D. More precisely, D is an operator with values in L²(Ω × N, P ⊗ κ), such that, for every F = E(F) + Σ_{n≥1} J_n(f_n) ∈ dom D,

D_k F = Σ_{n≥1} n J_{n−1}(f_n(·, k)), k ≥ 1,   (2.15)

where the symbol f_n(·, k) indicates that the integration is performed with respect to n−1 variables. According e.g. to [34, 35], the gradient operator admits the following representation. Let ω = (ω_1, ω_2, ...) ∈ Ω, and set

ω_+^k = (ω_1, ω_2, ..., ω_{k−1}, +1, ω_{k+1}, ...) and ω_−^k = (ω_1, ω_2, ..., ω_{k−1}, −1, ω_{k+1}, ...)
to be the sequences obtained by replacing the k-th coordinate of ω, respectively, with +1 and −1. Write F_k^± instead of F(ω_±^k) for simplicity. Then, for every F ∈ dom D,

D_k F(ω) = (1/2)(F_k^+ − F_k^−), k ≥ 1.   (2.16)
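The two descriptions of the gradient agree: for a multiple integral F = J_2(f), (2.15) gives D_k F = 2 J_1(f(·, k)), which can be checked against the flipping representation (2.16). A sketch with an arbitrary symmetric kernel (0-based indices):

```python
import itertools

# symmetric kernel on {0, 1, 2}², vanishing on the diagonal (values arbitrary)
f = {(0, 1): 1.0, (1, 0): 1.0, (1, 2): -0.5, (2, 1): -0.5}
N = 3

def J2(w):
    return sum(v * w[i] * w[j] for (i, j), v in f.items())

def D(k, F, w):
    """Discrete gradient (2.16): D_k F = (F(ω with ω_k = +1) - F(ω with ω_k = -1)) / 2."""
    wp = w[:k] + (1,) + w[k + 1:]
    wm = w[:k] + (-1,) + w[k + 1:]
    return (F(wp) - F(wm)) / 2

for w in itertools.product((-1, 1), repeat=N):
    for k in range(N):
        # (2.15) with q = 2: D_k J_2(f) = 2 J_1(f(·, k)) = 2 Σ_i f(i, k) ω_i
        expected = 2 * sum(f.get((i, k), 0.0) * w[i] for i in range(N))
        assert abs(D(k, J2, w) - expected) < 1e-12
```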
Remark 2.11. 1. It is easily seen that, if the random variable F ∈ L²(σ{X}) is such that the mapping (ω, k) ↦ (1/2)(F_k^+ − F_k^−)(ω) is an element of L²(Ω × N, P ⊗ κ), then necessarily F ∈ dom D.
2. If F = F(X_1, ..., X_d) depends uniquely on the first d coordinates of X, then the randomized derivative ∆_j F(X) of F, defined in [6], relates to D_k F(ω) as follows. Let X′ = (X′_1, ..., X′_d) be an independent copy of (X_1, ..., X_d); then, for j = 1, ..., d,

∆_j F(X_1, ..., X_d) = (F_j^+ − F_j^−)(1_{X_j=1, X′_j=−1} − 1_{X_j=−1, X′_j=1})   (2.17)
= 2 D_j F(X_1, ..., X_d)(1_{X_j=1, X′_j=−1} − 1_{X_j=−1, X′_j=1}).

A key advantage of the operator D_j, compared to ∆_j, is that no coupling construction is required, which makes D_j easily amenable to Malliavin calculus and provides a unified treatment of normal approximations both for functionals acting on sequences of continuous random variables and on sequences of discrete random variables.
We write δ for the adjoint of D, also called the divergence operator. The domain of δ is denoted by dom δ, and is such that dom δ ⊂ L²(Ω × N, P ⊗ κ). Recall that δ is defined via the following integration by parts formula: for every F ∈ dom D and every u ∈ dom δ,

E[F δ(u)] = E[⟨DF, u⟩_{ℓ²(N)}] = ⟨DF, u⟩_{L²(Ω×N, P⊗κ)}.   (2.18)
Now set L²_0(σ{X}) to be the subspace of L²(σ{X}) composed of centered random variables. We write L : L²(σ{X}) → L²_0(σ{X}) for the Ornstein–Uhlenbeck operator, which is defined as follows. The domain dom L of L is composed of random variables F = E(F) + Σ_{n≥1} J_n(f_n) ∈ L²(σ{X}) such that

Σ_{n≥1} n² n! ‖f_n‖²_{ℓ²(N)^{⊗n}} < ∞,

and, for F ∈ dom L,

L F = −Σ_{n≥1} n J_n(f_n).   (2.19)

With respect to [34, 35], note that we choose to add a minus sign on the right-hand side of (2.19), in order to facilitate the connection with the paper [25]. One crucial relation between the operators δ, D and L is that

δD = −L.   (2.20)

The inverse of L, denoted L^{−1}, is defined on F ∈ L²_0(σ{X}), and is given by

L^{−1} F = −Σ_{n≥1} (1/n) J_n(f_n).   (2.21)
Lemma 2.12. Let F ∈ dom D be centered, and f : R → R be such that f(F) ∈ dom D. Then

E[F f(F)] = E[⟨D f(F), −D L^{−1} F⟩_{ℓ²(N)}].

Proof. Using (2.20) and (2.18) consecutively, we can write

E[F f(F)] = E[(L L^{−1} F) f(F)] = −E[(δ D L^{−1} F) f(F)] = E[⟨D f(F), −D L^{−1} F⟩_{ℓ²(N)}].
Finally, we define {P_t : t ≥ 0} = {e^{tL} : t ≥ 0} to be the semigroup associated with L, that is,

P_t F = Σ_{n=0}^∞ e^{−nt} J_n(f_n), t ≥ 0, for F = E(F) + Σ_{n=1}^∞ J_n(f_n) ∈ L²(σ{X}).   (2.22)
Lemma 2.13. Let F ∈ dom D and fix k ∈ N. Then:

1. The random variables D_k F, D_k L^{−1} F, F_k^+ and F_k^− are independent of X_k.

2. It holds that |F_k^+ − F| ≤ 2|D_k F| and |F_k^− − F| ≤ 2|D_k F|, P-almost surely.

3. If F has zero mean, then E‖D L^{−1} F‖²_{ℓ²(N)} ≤ E‖DF‖²_{ℓ²(N)}, with equality if and only if F is an element of the first chaos.
Proof. 1. One only needs to combine the definition of F_k^± with (2.16).

2. Use F_k^± − F = ±(F_k^+ − F_k^−) 1_{X_k=∓1} = ±2 D_k F 1_{X_k=∓1}.

3. Let us consider the chaotic expansion of F:

F = Σ_{n≥1} J_n(f_n).

Then −D_k L^{−1} F = Σ_{n≥1} J_{n−1}(f_n(·, k)) and D_k F = Σ_{n≥1} n J_{n−1}(f_n(·, k)). Therefore, using the isometric relation (2.7),

E‖D L^{−1} F‖²_{ℓ²(N)} = E Σ_{k∈N} ( Σ_{n≥1} J_{n−1}(f_n(·, k)) )² = Σ_{n≥1} (n−1)! ‖f_n‖²_{ℓ²(N)^{⊗n}}
≤ Σ_{n≥1} n² (n−1)! ‖f_n‖²_{ℓ²(N)^{⊗n}} = E Σ_{k∈N} ( Σ_{n≥1} n J_{n−1}(f_n(·, k)) )² = E‖DF‖²_{ℓ²(N)}.

Moreover, the previous computation shows that we have equality if and only if f_n = 0 for all n ≥ 2, that is, if and only if F is an element of the first chaos.
We conclude this section by proving a chain rule involving deterministic functions of random variables in the domain of D. It should be compared with the classic chain rule of the Gaussian-based Malliavin calculus (see e.g. [29, Prop. 1.2.2]).
Proposition 2.14. (Chain Rule). Let F ∈ dom D and f : R → R be thrice differentiable with bounded third derivative. Assume moreover that f(F) ∈ dom D. Then, for any integer k, P-a.s.:

|D_k f(F) − f′(F) D_k F + (1/2)(f″(F_k^+) + f″(F_k^−))(D_k F)² X_k| ≤ (10/3) |f‴|_∞ |D_k F|³.
Proof. By a standard Taylor expansion,

D_k f(F) = (1/2)(f(F_k^+) − f(F_k^−))
= (1/2)(f(F_k^+) − f(F)) − (1/2)(f(F_k^−) − f(F))
= (1/2) f′(F)(F_k^+ − F) + (1/4) f″(F)(F_k^+ − F)² + R_1 − (1/2) f′(F)(F_k^− − F) − (1/4) f″(F)(F_k^− − F)² + R_2
= f′(F) D_k F + (1/8)(f″(F_k^+) + f″(F_k^−))[(F_k^+ − F)² − (F_k^− − F)²] + R_1 + R_2 + R_3
= f′(F) D_k F − (1/2)(f″(F_k^+) + f″(F_k^−))(D_k F)² X_k + R_1 + R_2 + R_3,

where, using Lemma 2.13,

|R_1| ≤ (1/12) |f‴|_∞ |F_k^+ − F|³ ≤ (2/3) |f‴|_∞ |D_k F|³,
|R_2| ≤ (1/12) |f‴|_∞ |F_k^− − F|³ ≤ (2/3) |f‴|_∞ |D_k F|³,
|R_3| ≤ (1/8) |f‴|_∞ (|F_k^+ − F|³ + |F_k^− − F|³) ≤ 2 |f‴|_∞ |D_k F|³.

By putting these three inequalities together, the desired conclusion follows.
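Since the bound of Proposition 2.14 holds at every ω when Ω is finite, it can be checked exhaustively for a concrete choice of f and F; the functional below is arbitrary, and f(x) = x³ has constant third derivative |f‴|_∞ = 6:

```python
import itertools

# f(x) = x³ has f''' ≡ 6; F is an arbitrary functional of three coordinates
f = lambda x: x ** 3
fp = lambda x: 3 * x ** 2
fpp = lambda x: 6 * x
fppp_sup = 6.0

def F(w):
    return w[0] + 0.5 * w[0] * w[1] - 0.3 * w[1] * w[2]

worst = 0.0   # largest violation of the chain-rule bound over all ω and k
for w in itertools.product((-1, 1), repeat=3):
    for k in range(3):
        wp = w[:k] + (1,) + w[k + 1:]
        wm = w[:k] + (-1,) + w[k + 1:]
        Fp, Fm = F(wp), F(wm)
        DkF = (Fp - Fm) / 2                       # gradient (2.16)
        Dkf = (f(Fp) - f(Fm)) / 2                 # D_k f(F)
        lhs = abs(Dkf - fp(F(w)) * DkF + 0.5 * (fpp(Fp) + fpp(Fm)) * DkF ** 2 * w[k])
        worst = max(worst, lhs - (10 / 3) * fppp_sup * abs(DkF) ** 3)
assert worst <= 1e-12
```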
2.6 Stein’s method for normal approximation
Stein’s method is a collection of probabilistic techniques, using differential operators in order to assess quantities of the type
E[h(F)]−E[h(Z)],
where Z and F are generic random variables, and the function h is such that the two expectations are well defined. In the specific case where Z ∼ N(0, 1), with N(0, 1) a standard Gaussian law, one is led to consider the so-called Stein equation associated with h, which is classically given by

h(x) − E[h(Z)] = f′(x) − x f(x), x ∈ R.   (2.23)

A solution to (2.23) is a function f, depending on h, which is Lebesgue a.e.-differentiable, and such that there exists a version of f′ verifying (2.23) for every x ∈ R. The following result collects findings by Stein [43, 44], Barbour [2], Daly [9] and Götze [15]. Precisely, the proof of Point (i) (which is known as Stein’s lemma) involves a standard use of the Fubini theorem (see e.g. [8, Lemma 2.1]). Point (ii) is proved e.g. in [44, Lemma II.3] (for the estimates on the first and second derivatives), in [9, Theorem 1.1] (for the estimate on the third derivative), and in [2] and [15] (for the alternative estimate on the first derivative). From here onwards, denote by C_b^k the set of all real-valued bounded functions with bounded derivatives up to the k-th order.
Lemma 2.15. (i) Let W be a random variable. Then, W has the same law as Z ∼ N(0, 1) if and only if, for every continuous, piecewise continuously differentiable function f such that E|f′(Z)| < ∞,

E[f′(W)] = E[W f(W)].   (2.24)

(ii) If h ∈ C_b², then (2.23) has a solution f_h which is thrice differentiable and such that ‖f_h′‖_∞ ≤ 4‖h‖_∞, ‖f_h″‖_∞ ≤ 2‖h′‖_∞ and ‖f_h‴‖_∞ ≤ 2‖h″‖_∞. We also have ‖f_h′‖_∞ ≤ ‖h″‖_∞.
Now fix a function h ∈ C_b², consider a Gaussian random variable Z ∼ N(0, 1), and let F be a generic square integrable random variable. Integrating both sides of (2.23) with respect to the law of F gives

E[h(F)] − E[h(Z)] = E[f_h′(F) − F f_h(F)]   (2.25)

with f_h the function given in Lemma 2.15. In the following sections we will show that, if F is a functional of the infinite Rademacher sequence X, then the quantity on the RHS of (2.25) can be successfully assessed by means of discrete Malliavin operators.
Remark 2.16. Plainly, if the sequence F_n, n ≥ 1, is such that E[h(F_n)] − E[h(Z)] → 0 for every h ∈ C_b², then F_n → Z in law.
3 General Bounds for Rademacher functionals

3.1 Main bound
The following result combines Proposition 2.14 with Lemma 2.15-(ii) and Relation (2.25) in order to estimate expressions of the type |E[h(F)] − E[h(Z)]|, where F is a square integrable functional of the infinite Rademacher sequence X, and Z ∼ N(0, 1).
Theorem 3.1. Let F ∈ dom D be centered and such that Σ_k E[(D_k F)⁴] < ∞. Consider a function h ∈ C_b² and let Z ∼ N(0, 1). Then

|E[h(F)] − E[h(Z)]| ≤ min(4‖h‖_∞, ‖h″‖_∞) B_1 + ‖h″‖_∞ B_2,   (3.26)

where

B_1 = E|1 − ⟨DF, −D L^{−1} F⟩_{ℓ²(N)}| ≤ √( E[(1 − ⟨DF, −D L^{−1} F⟩_{ℓ²(N)})²] );

B_2 = (20/3) E[⟨|D L^{−1} F|, |DF|³⟩_{ℓ²(N)}].   (3.27)
Proof. Since h ∈ C_b², Equality (2.25) holds. Observe that, since the first two derivatives of f_h are bounded, one has that f_h(F) ∈ dom D (this fact can be proved by using a Taylor expansion, as well as the assumptions on DF and the content of the first point of Remark 2.11). Using Lemma 2.12, one deduces that

E[f_h′(F) − F f_h(F)] = E[f_h′(F) − ⟨D f_h(F), −D L^{−1} F⟩_{ℓ²(N)}].

We now use again the fact that f_h″ is bounded: as an application of the first point of Lemma 2.13, we deduce that

E[D_k L^{−1} F × (f_h″(F_k^+) + f_h″(F_k^−))(D_k F)² X_k] = 0, k ≥ 1;

in particular, the boundedness of f_h″, along with the fact that E|D_k F|⁴ < ∞ and Lemma 2.13-(3), ensure that the previous expectation is well defined. Finally, the desired conclusion follows from the chain rule proved in Proposition 2.14, as well as the bounds on ‖f_h′‖_∞ and ‖f_h‴‖_∞ stated in Lemma 2.15-(ii).
Remark 3.2. 1. The requirement that Σ_k E[(D_k F)⁴] < ∞ is automatically fulfilled whenever F belongs to a finite sum of chaoses. This can be deduced from the hypercontractive inequalities stated e.g. in [21, Ch. 6].

2. Since we are considering random variables possibly depending on the whole sequence X and having an infinite chaotic expansion, the expectation in (3.27) may actually be infinite.

3. Fix q ≥ 1 and let F have the form of a multiple integral of the type F = J_q(f), where f ∈ ℓ²_0(N)^{◦q}. Then, one has that

⟨DF, −D L^{−1} F⟩_{ℓ²(N)} = (1/q) ‖DF‖²_{ℓ²(N)},   (3.28)

⟨|D L^{−1} F|, |DF|³⟩_{ℓ²(N)} = (1/q) ‖DF‖⁴_{ℓ⁴(N)}.   (3.29)

4. Let G be an isonormal Gaussian process (see [19] or [29]) over some separable Hilbert space H, and assume that F ∈ L²(σ{G}) is centered and differentiable in the sense of Malliavin. Then the Malliavin derivative of F, denoted DF, is an H-valued random element, and in [25] the following upper bound between the laws of F and Z ∼ N(0, 1) is established (via Stein’s method):

d_{TV}(F, Z) ≤ 2 E|1 − ⟨DF, −D L^{−1} F⟩_H|,

where L^{−1} is here the inverse of the Ornstein–Uhlenbeck generator associated with G, and d_{TV} is the total variation distance between the laws of F and Z.

5. Let N be a compensated Poisson measure over some measurable space (A, 𝒜), with σ-finite control measure given by µ. Assume that F ∈ L²(σ{N}) is centered and differentiable in the sense of Malliavin. Then the derivative of F is a random process which a.s. belongs to L²(µ). In [31, Section 3] the following upper bound for the Wasserstein distance (see Section 3.4) between the laws of F and Z ∼ N(0, 1) is proved (again by means of Stein’s method):

d_W(F, Z) ≤ E|1 − ⟨DF, −D L^{−1} F⟩_{L²(µ)}| + E ∫_A |D_a F|² × |D_a L^{−1} F| µ(da).

6. In [6], Theorem 2.2, a bound to the normal distribution is given for functionals F which depend strictly on the first n coordinates of X only. The bound is in terms of the operator ∆_j F of F, see (2.17). Using the connection in Remark 2.11, point 2, the term corresponding to B_2 in (3.27) would correspond to a bound on (1/n) Σ_{j=1}^n E|D_j F|. Instead of B_1 in (3.27), [6] gives a bound in terms of the variance of a conditional expectation which depends on an exchangeable pair of sequences. Such variances are typically not easy to bound, except in the case of quadratic forms; see also Section 4.2.
3.2 First examples: Rademacher averages
A (possibly infinite) Rademacher average is just an element of the first chaos associated with X, that is, a random variable of the type

F = Σ_{i=1}^∞ α_i X_i, with α = (α_i) ∈ ℓ²(N).   (3.30)

See e.g. [22, Ch. IV] for general facts concerning random variables of the type (3.30). The next result is a consequence of Theorem 3.1 and of the relations (3.28)–(3.29). It yields a simple and explicit upper bound for the Gaussian approximation of Rademacher averages. We note that this simple example already goes beyond the typical examples in Stein’s method, as it treats a function F which depends on infinitely many coordinates of X.
Corollary 3.3. Let F have the form (3.30), let Z ∼ N(0, 1) and consider h ∈ C_b². Then

|E[h(F)] − E[h(Z)]| ≤ min(4‖h‖_∞, ‖h″‖_∞) |1 − Σ_{i=1}^∞ α_i²| + (20/3) ‖h″‖_∞ Σ_{i=1}^∞ α_i⁴.   (3.31)
The proof of Corollary 3.3 (whose details are left to the reader) is easily deduced from the fact that, for F as in (3.30), relations (3.28)-(3.29) hold with q = 1, and $D_iF = \alpha_i$ for all $i \ge 1$.
As an example, in the special case of Corollary 3.3, if $F_n = n^{-1/2}\sum_{i=1}^{n}X_i$, then it is easy to see that relation (3.31) yields a bound of order $n^{-1}$, implying a faster rate than in the classical Berry-Esséen estimates. This faster rate arises from our use of smooth test functions; a related result is obtained in [17] using a coupling approach.
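The collapse of the bound (3.31) for the normalized sum can be made concrete. The following sketch (illustrative, not from the paper) evaluates the right-hand side of (3.31) for $\alpha_i = n^{-1/2}$, $i \le n$: the first term vanishes since $\sum \alpha_i^2 = 1$, and the bound reduces to $\frac{20}{3}\|h''\|_\infty \sum \alpha_i^4 = \frac{20\|h''\|_\infty}{3n}$, an $O(1/n)$ rate.

```python
# Sketch: right-hand side of (3.31) for F_n = n^{-1/2}(X_1 + ... + X_n),
# i.e. alpha_i = 1/sqrt(n) for i <= n. The first term vanishes because
# sum(alpha_i^2) = 1, leaving (20/3)*||h''||_inf*sum(alpha_i^4) = 20||h''||/(3n).

def rademacher_average_bound(alphas, h_sup, h2_sup):
    """Right-hand side of (3.31) for coefficients alphas and a test
    function h with ||h||_inf = h_sup and ||h''||_inf = h2_sup."""
    s2 = sum(a * a for a in alphas)
    s4 = sum(a ** 4 for a in alphas)
    return min(4 * h_sup, h2_sup) * abs(1 - s2) + (20 / 3) * h2_sup * s4

n = 100
bound = rademacher_average_bound([n ** -0.5] * n, h_sup=1.0, h2_sup=1.0)
print(bound)  # about 20/(3*100), i.e. an O(1/n) rate
```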
Remark 3.4. (A Mehler-type representation) Take an independent copy of X, denoted $X^* = \{X_1^*, X_2^*, \dots\}$, fix $t > 0$, and consider the sequence $X^t = \{X_1^t, X_2^t, \dots\}$ defined as follows: $X^t$ is a sequence of i.i.d. random variables such that, for every $k \ge 1$, $X_k^t = X_k$ with probability $e^{-t}$ and $X_k^t = X_k^*$ with probability $1-e^{-t}$ (we stress that the choice between X and $X^*$ is made separately for every k, so that one can have for instance $X_1^t = X_1$, $X_2^t = X_2^*$, and so on). Then $X^t$ is a Rademacher sequence, and one has the following representation: for every $F \in L^2(\sigma(X))$ and every $\omega = (\omega_1, \omega_2, \dots) \in \Omega = \{-1,+1\}^{\mathbb{N}}$,
$$P_tF(\omega) = E[F(X^t)\,|\,X=\omega], \eqno(3.32)$$
where $P_t$ is the Ornstein-Uhlenbeck semigroup given in (2.22). To prove such a representation of $P_t$, just consider a random variable of the type $X_{j_1}\times\cdots\times X_{j_d}$, and then use a density argument. Now observe that, for a centered $F = \sum_{n\ge 1}J_n(f_n) \in L_0^2(\sigma\{X\})$, Equation (2.21) holds, and consequently
$$-D_kL^{-1}F = \sum_{n\ge 1} J_{n-1}\big(f_n(k,\cdot)\big) = \int_0^{\infty} e^{-t}P_tD_kF\,dt = \int_0^{\infty} e^{-t}E\big[D_kF(X^t)\,\big|\,X\big]\,dt,$$
so that
$$\langle DF,-DL^{-1}F\rangle_{\ell^2(\mathbb{N})} = \int_0^{\infty} e^{-t}\sum_{k\ge 1}D_kF\,E\big[D_kF(X^t)\,\big|\,X\big]\,dt = \int_0^{\infty} e^{-t}\big\langle DF, E[DF(X^t)\,|\,X]\big\rangle_{\ell^2(\mathbb{N})}\,dt.$$
Note that the representation (3.32) of the Ornstein-Uhlenbeck semigroup is not specific to Rademacher sequences, and could e.g. be extended to the case where X is i.i.d. standard Gaussian (see for instance [24, Section 2.2]). This construction would be different from the one leading to the usual Mehler formula (see e.g. [29, Formula (1.67)]). See Chatterjee [7, Lemma 1.1] and Nourdin and Peccati [25, Remark 3.6] for further connections between Mehler formulae and Stein's method in a Gaussian setting.
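The representation (3.32) lends itself to a direct numerical check. The sketch below (illustrative, truncated to two coordinates) verifies it for the second-chaos functional $F = X_1X_2$, for which $P_tF = e^{-2t}X_1X_2$: conditionally on $X = \omega$, the coordinates $X_k^t$ are independent with $P(X_k^t=\omega_k) = e^{-t}+(1-e^{-t})/2$ and $P(X_k^t=-\omega_k)=(1-e^{-t})/2$, so $E[F(X^t)\,|\,X=\omega]$ can be computed by exact enumeration.

```python
import itertools, math

# Illustrative sketch: exact check of the Mehler-type representation (3.32)
# for F = X_1 X_2, where P_t F = e^{-2t} X_1 X_2. Given X = omega, each
# coordinate X_k^t independently keeps the value omega_k with probability
# e^{-t} + (1 - e^{-t})/2 and flips it with probability (1 - e^{-t})/2.

def conditional_mean_F(omega, t, F):
    keep = math.exp(-t) + (1 - math.exp(-t)) / 2
    flip = (1 - math.exp(-t)) / 2
    total = 0.0
    for signs in itertools.product([+1, -1], repeat=len(omega)):
        prob = math.prod(keep if s == +1 else flip for s in signs)
        total += prob * F([s * w for s, w in zip(signs, omega)])
    return total

t, omega = 0.3, (1, -1)
lhs = conditional_mean_F(omega, t, lambda x: x[0] * x[1])
rhs = math.exp(-2 * t) * omega[0] * omega[1]   # P_t F(omega)
print(lhs, rhs)                                 # the two values agree
```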
3.3 A general exchangeable pair construction
Remark 3.4 uses a particular example of an exchangeable pair of infinite Rademacher sequences. We shall now show that the chaos decomposition approach fits in well with the method of exchangeable pairs, and we also provide alternative bounds to [6] in terms of an exchangeable pair. Using a chaos decomposition, instead of a univariate approach such as the one suggested in [6], we employ a multivariate setting which conditions on exchangeable versions of the elements in the chaos decomposition.
Let us assume that $F = F(X_1, \dots, X_d)$ is a random variable that depends uniquely on the first d coordinates $X_d = (X_1, \dots, X_d)$ of the Rademacher sequence X, with finite chaotic decomposition of the form (2.12). Now assume that $E(F) = 0$ and $E(F^2) = 1$, so that
$$F = \sum_{n=1}^{d}\ \sum_{1\le i_1<\dots<i_n\le d} n!\,f_n(i_1,\dots,i_n)\,X_{i_1}\cdots X_{i_n} = \sum_{n=1}^{d} J_n(f_n). \eqno(3.33)$$
A natural exchangeable pair construction is as follows. Pick an index I at random, so that $P(I=i) = \frac{1}{d}$ for $i = 1, \dots, d$, independently of $X_1, \dots, X_d$, and if $I = i$ replace $X_i$ by an independent copy $X_i^*$ in all sums in the decomposition (3.33) which involve $X_i$. Call the resulting expression $F'$. Also denote by $X'_d$ the vector of Rademacher variables with the exchanged component. Then $(F, F')$ forms an exchangeable pair.
In [37] an embedding approach is introduced, which suggests enhancing the pair $(F, F')$ to a pair of vectors $(W, W')$ which then ideally satisfy the linearity condition
$$E(W'-W\,|\,W) = -\Lambda W \eqno(3.34)$$
with Λ being a deterministic matrix. In our example, choosing as embedding vector $W = (J_1(f_1), \dots, J_d(f_d))$, we check that
$$E\big(J_n'(f_n)-J_n(f_n)\,\big|\,W\big) = -\frac{1}{d}\sum_{i=1}^{d}\ \sum_{1\le i_1<\dots<i_n\le d} \mathbf{1}_{\{i_1,\dots,i_n\}}(i)\,n!\,f_n(i_1,\dots,i_n)\,E\big(X_{i_1}\cdots X_{i_n}\,\big|\,W\big) = -\frac{n}{d}\,J_n(f_n).$$
Thus, with $W' = (J_1'(f_1), \dots, J_d'(f_d))$, the condition (3.34) is satisfied, with $\Lambda = (\lambda_{i,j})_{1\le i,j\le d}$ being zero off the diagonal and $\lambda_{n,n} = \frac{n}{d}$ for $n = 1, \dots, d$. Our random variable of interest F is a linear combination of the elements in W, and we obtain the simple expression
$$E(F'-F\,|\,W) = \frac{1}{d}\,LF = -\frac{1}{d}\,\delta DF. \eqno(3.35)$$
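The linearity condition can be verified exactly by enumeration. The sketch below (with hypothetical kernels and $d = 3$; the dictionaries directly store the products $n!\,f_n(i_1,\dots,i_n)$) replaces a uniformly chosen coordinate $X_I$ by an independent Rademacher copy $X_I^*$ and checks, component by component, that $E(J_n'-J_n\,|\,X) = -\frac{n}{d}J_n(X)$ for $n = 1, 2$.

```python
import itertools

# Sketch (hypothetical kernels, d = 3): exact verification of the linearity
# condition (3.34) componentwise; the dictionaries store n! * f_n directly.

d = 3
f1 = {(1,): 0.5, (2,): -0.2, (3,): 0.1}
f2 = {(1, 2): 0.3, (1, 3): -0.4, (2, 3): 0.25}

def J(f, x):
    """Multilinear form sum over i_1 < ... < i_n of n! f_n(i) x_{i_1}...x_{i_n}."""
    total = 0.0
    for idx, coeff in f.items():
        prod = coeff
        for i in idx:
            prod *= x[i - 1]
        total += prod
    return total

for x in itertools.product([+1, -1], repeat=d):
    for f, n in [(f1, 1), (f2, 2)]:
        # average over I uniform on {1,...,d} and X_I^* = +-1, each w.p. 1/2
        diff = sum((J(f, x[:i] + (star,) + x[i + 1:]) - J(f, x)) / (2 * d)
                   for i in range(d) for star in (+1, -1))
        assert abs(diff + (n / d) * J(f, x)) < 1e-12
print("linearity condition E(J_n' - J_n | X) = -(n/d) J_n verified")
```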
This coupling helps to assess the distance to the normal distribution, as follows.
Theorem 3.5. Let $F \in \mathrm{dom}\,D$ be centered and such that $E(F^2) = 1$. Let $h \in C_b^1$, let $Z \sim N(0,1)$, let $L'$ be the Ornstein-Uhlenbeck operator for the exchanged Rademacher sequence $X'_d$, and denote by $(L')^{-1}$ its inverse. Then
$$\big|E[h(F)]-E[h(Z)]\big| \le 4\|h\|_\infty \sqrt{\operatorname{Var}\Big(\frac{d}{2}\,E\big[(F'-F)\big((L')^{-1}F'-L^{-1}F\big)\,\big|\,W\big]\Big)} + \frac{d}{2}\,\|h'\|_\infty\, E\Big[(F'-F)^2\,\big|(L')^{-1}F'-L^{-1}F\big|\Big].$$
Proof. Let $h \in C_b^1$, and let g denote the solution of the Stein equation (2.23) for h. Using antisymmetry, we have that for all smooth g,
$$\begin{aligned} 0 &= E\Big[\big(g(F)+g(F')\big)\big((L')^{-1}F'-L^{-1}F\big)\Big] \\ &= 2E\Big[g(F)\big((L')^{-1}F'-L^{-1}F\big)\Big] + E\Big[\big(g(F')-g(F)\big)\big((L')^{-1}F'-L^{-1}F\big)\Big] \\ &= \frac{2}{d}\,E\big[F\,g(F)\big] + E\Big[\big(g(F')-g(F)\big)\big((L')^{-1}F'-L^{-1}F\big)\Big]; \end{aligned} \eqno(3.36)$$
the last equality coming from (3.35) as well as the definitions of $(L')^{-1}$ and $L^{-1}$. Hence
$$E\big[g'(F)-F\,g(F)\big] = E\Big[g'(F)\Big(1+\frac{d}{2}\,E\big[(F'-F)\big((L')^{-1}F'-L^{-1}F\big)\,\big|\,W\big]\Big)\Big] + R,$$
where by Taylor expansion
$$|R| \le \frac{d}{4}\,\|g''\|_\infty\, E\Big[(F'-F)^2\,\big|(L')^{-1}F'-L^{-1}F\big|\Big].$$
From (3.36) with $g(x) = x$ we obtain
$$E\Big[E\big[(F'-F)\big((L')^{-1}F'-L^{-1}F\big)\,\big|\,W\big]\Big] = E\Big[(F'-F)\big((L')^{-1}F'-L^{-1}F\big)\Big] = -\frac{2}{d}$$
by our assumption that $E[F^2] = 1$. We now only need to apply the Cauchy-Schwarz inequality and use the bounds from Lemma 2.15 to complete the proof.
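The identity $E[(F'-F)((L')^{-1}F'-L^{-1}F)] = -\frac{2}{d}$ used in the proof can be checked by exact enumeration. The sketch below (hypothetical kernels, $d = 2$) takes $F = J_1 + J_2$ normalised so that $EF^2 = 1$, with $L^{-1}$ acting on the chaos decomposition as $L^{-1}J_n = -J_n/n$.

```python
import itertools

# Sketch (hypothetical kernels, d = 2): exact enumeration of the identity
# E[(F' - F)((L')^{-1}F' - L^{-1}F)] = -2/d, with L^{-1} J_n = -J_n / n.

d = 2
a1 = a2 = 0.5                  # J_1 = a1 X_1 + a2 X_2
b = 2 ** -0.5                  # J_2 = b X_1 X_2;  a1^2 + a2^2 + b^2 = 1

def decomposition(x):
    return a1 * x[0] + a2 * x[1], b * x[0] * x[1]

total = 0.0
for x in itertools.product([+1, -1], repeat=d):          # X uniform
    J1, J2 = decomposition(x)
    F, Linv = J1 + J2, -J1 - J2 / 2
    for i in range(d):                                   # index I uniform
        for star in (+1, -1):                            # X_I^* uniform
            xp = list(x); xp[i] = star
            J1p, J2p = decomposition(xp)
            Fp, Linvp = J1p + J2p, -J1p - J2p / 2
            total += (Fp - F) * (Linvp - Linv) / (2 ** d * d * 2)
print(total)   # equals -2/d = -1.0 up to rounding
```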
Equation (3.34) is key to answering the question of how to construct an exchangeable pair in the case of a random variable which admits a finite Rademacher chaos decomposition. Indeed, one can proceed as follows. Firstly, use $W = (J_1(f_1), \dots, J_d(f_d))$, where the components come from the chaos decomposition. Then the results from [37] can be applied to assess the distance of W to a normal distribution with the same covariance matrix. Note that, due to the orthogonality of the integrals, the covariance matrix will be zero off the diagonal.
Finally, note that formally, using (3.35) and that $LF = \lim_{t\to 0}\frac{P_tF-F}{t}$,
$$E\big(F(X'_d)-F(X_d)\,\big|\,W\big) = \frac{1}{d}\lim_{t\to 0}\frac{P_tF(X_d)-F(X_d)}{t} = \frac{1}{d}\lim_{t\to 0}\frac{1}{t}\Big[E\big(F(X^t_d)\,\big|\,X_d\big)-F(X_d)\Big],$$
where the notation is the same as in Remark 3.4. In this sense, our exchangeable pair can be viewed as the limit as $t \to 0$ of the construction in Remark 3.4.
3.4 From twice differentiable functions to the Wasserstein distance
Although many of our results are phrased in terms of smooth test functions, we can also obtain results in the Wasserstein distance by mimicking the smoothing used in [38] for the Kolmogorov distance, but now applying it to Lipschitz functions. Recall that the Wasserstein distance between the laws of Y and Z is given by
$$d_W(Y,Z) = \sup_{h\in\mathrm{Lip}(1)} \big|E[h(Y)]-E[h(Z)]\big|,$$
where Lip(1) is the class of all Lipschitz-continuous functions with Lipschitz constant less than or equal to 1. Rademacher's theorem states that a function which is Lipschitz continuous on the real line is Lebesgue-almost everywhere differentiable. For any $h \in \mathrm{Lip}(1)$ we denote by $h'$ its derivative, which exists almost everywhere.
Corollary 3.6. Let $Z \sim N(0,1)$ and let $F \in \mathrm{dom}\,D$ be centered. Suppose that (3.26) holds for every function $h \in C_b^2$ and that $4(B_1+B_2) \le 5$. Then
$$d_W(F,Z) \le \sqrt{2(B_1+B_2)(5+E|F|)}.$$
Proof. Let $h \in \mathrm{Lip}(1)$ and, for $t > 0$, define
$$h_t(x) = \int_{-\infty}^{\infty} h\big(\sqrt{t}\,y+\sqrt{1-t}\,x\big)\,\varphi(y)\,dy,$$
where φ denotes the standard normal density. Then we may differentiate and integrate by parts, using that $\varphi'(x) = -x\varphi(x)$, to get
$$h_t''(x) = \frac{1-t}{\sqrt{t}}\int_{-\infty}^{\infty} y\,h'\big(\sqrt{t}\,y+\sqrt{1-t}\,x\big)\,\varphi(y)\,dy.$$
Hence for $0 < t < 1$ we may bound
$$\|h_t''\|_\infty \le \frac{1-t}{\sqrt{t}}\,\|h'\|_\infty \int_{-\infty}^{\infty} |y|\,\varphi(y)\,dy \le \frac{1}{\sqrt{t}}. \eqno(3.37)$$
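The integration-by-parts formula for $h_t''$ can be sanity-checked numerically. For the smooth choice $h = \sin$ (an assumption made here only to have a closed form; the proof uses Lipschitz h), one has $h_t(x) = e^{-t/2}\sin(\sqrt{1-t}\,x)$ and hence $h_t''(x) = -(1-t)e^{-t/2}\sin(\sqrt{1-t}\,x)$. The sketch below compares this with the quadrature of the displayed integral and checks the bound (3.37).

```python
import numpy as np

# Illustrative check (not from the paper): for h = sin, compare the
# integration-by-parts formula for h_t'' with its closed form, and
# verify the bound ||h_t''|| <= 1/sqrt(t) of (3.37) at this point.

t, x = 0.25, 0.7
y = np.linspace(-8.0, 8.0, 200001)
dy = y[1] - y[0]
phi = np.exp(-y ** 2 / 2) / np.sqrt(2 * np.pi)   # standard normal density

h_prime = np.cos(np.sqrt(t) * y + np.sqrt(1 - t) * x)
rhs = (1 - t) / np.sqrt(t) * np.sum(y * h_prime * phi) * dy

closed = -(1 - t) * np.exp(-t / 2) * np.sin(np.sqrt(1 - t) * x)
print(rhs, closed)                     # the two values agree
assert abs(rhs) <= 1 / np.sqrt(t)      # consistent with (3.37)
```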
Taylor expansion gives that for $0 < t \le \frac{1}{2}$, so that $\sqrt{t} \le \sqrt{1-t}$,
$$\begin{aligned} \big|E[h(F)]-E[h_t(F)]\big| &\le E\Big|\int \Big\{h\big(\sqrt{t}\,y+\sqrt{1-t}\,F\big)-h\big(\sqrt{1-t}\,F\big)\Big\}\varphi(y)\,dy\Big| + E\big|h\big(\sqrt{1-t}\,F\big)-h(F)\big| \\ &\le \|h'\|_\infty\,\sqrt{t}\int |y|\,\varphi(y)\,dy + \|h'\|_\infty\,\frac{t}{2\sqrt{1-t}}\,E|F| \le \sqrt{t}\Big(1+\frac{1}{2}\,E|F|\Big). \end{aligned}$$
Here we used that $\|h'\|_\infty \le 1$ and that for $0 < \theta < 1$, we have $(\sqrt{1-\theta t})^{-1} < (\sqrt{1-t})^{-1}$. Similarly,
$$\big|E[h(Z)]-E[h_t(Z)]\big| \le \frac{3}{2}\sqrt{t}.$$
Using (3.26) with (3.37) together with the triangle inequality, we have for all $h \in \mathrm{Lip}(1)$
$$\big|E[h(F)]-E[h(Z)]\big| \le \frac{1}{\sqrt{t}}\big(B_1(1)+B_2(1)\big) + \frac{\sqrt{t}}{2}\big(5+E|F|\big).$$
Minimising in t gives that the optimum is achieved for $t = 2(B_1(1)+B_2(1))/(5+E|F|)$; the assumption $4(B_1+B_2) \le 5$ guarantees that $t \le \frac{1}{2}$, and substituting this value of t yields the asserted bound.
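The final optimisation step is elementary and can be illustrated directly. Writing $a = B_1(1)+B_2(1)$ and $b = 5+E|F|$ (the numbers below are hypothetical), the bound reads $\varphi(t) = a/\sqrt{t} + \frac{b}{2}\sqrt{t}$, minimised at $t^* = 2a/b$ with value $\sqrt{2ab}$, the constant appearing in Corollary 3.6.

```python
import math

# Sketch: minimising phi(t) = a/sqrt(t) + (b/2) sqrt(t) over t in (0, 1/2].
# Here a and b stand for the hypothetical values B1(1)+B2(1) and 5+E|F|.

a, b = 0.05, 5.8                      # hypothetical; 4a <= 5 holds
phi_t = lambda t: a / math.sqrt(t) + 0.5 * b * math.sqrt(t)

t_star = 2 * a / b                    # stationary point of phi_t
assert t_star <= 0.5                  # so the smoothing estimates apply
grid_min = min(phi_t(k / 200000) for k in range(1, 100001))
print(phi_t(t_star), math.sqrt(2 * a * b))  # the two values agree
```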
[11] P. de Jong (1990). A central limit theorem for generalized multilinear forms. J. Mult. Anal. 34, 275-289. MR1073110
[12] V.H. de la Peña and E. Giné (1997). Decoupling. Springer-Verlag, Berlin Heidelberg New York. MR1666908
[13] B. Efron and C. Stein (1981). The jackknife estimate of variance. Ann. Statist. 9, 586-596. MR0615434
[14] A.P. Godbole (1992). The exact and asymptotic distribution of overlapping success runs. Comm. Statist. - Theory and Methods 21, 953-967. MR1173302
[15] F. Götze (1991). On the rate of convergence in the multivariate CLT. Ann. Probab. 19, 724-739. MR1106283
[16] F. Götze and A.N. Tikhomirov (2002). Asymptotic distribution of quadratic forms and applications. J. Theoret. Probab. 15(2), 423-475. MR1898815
[17] L. Goldstein and G. Reinert (1997). Stein's method and the zero bias transformation with application to simple random sampling. Ann. Appl. Probab. 7(4), 935-952. MR1484792
[18] J. Hájek (1968). Asymptotic normality of simple linear rank statistics under alternatives. Ann. Math. Statist. 39, 325-346. MR0222988
[19] S. Janson (1997). Gaussian Hilbert Spaces. Cambridge University Press. MR1474726
[20] S. Karlin and Y. Rinott (1982). Applications of ANOVA type decompositions for comparisons of conditional variance statistics including jackknife estimates. Ann. Statist. 10, 485-501. MR0653524
[21] S. Kwapień and W.A. Woyczyński (1992). Random Series and Stochastic Integrals: Single and Multiple. Birkhäuser, Basel. MR1167198
[22] M. Ledoux and M. Talagrand (1991). Probability in Banach Spaces. Springer-Verlag, Berlin Heidelberg New York. MR1102015
[23] P.-A. Meyer (1992). Quantum Probability for Probabilists. LNM 1538. Springer-Verlag, Berlin Heidelberg New York. MR1222649
[24] E. Mossel, R. O'Donnell and K. Oleszkiewicz (2010). Noise stability of functions with low influences: invariance and optimality. Ann. Math. 171, 295-341. MR2630040
[25] I. Nourdin and G. Peccati (2009). Stein's method on Wiener chaos. Probab. Theory Rel. Fields 145(1), 75-118. MR2520122
[26] I. Nourdin and G. Peccati (2010). Stein's method and exact Berry-Esséen bounds for functionals of Gaussian fields. Ann. Probab. 37, 2231-2261. MR2573557
[27] I. Nourdin, G. Peccati and A. Réveillac (2010). Multivariate normal approximation using Stein's method and Malliavin calculus. Ann. Inst. H. Poincaré Probab. Statist. 46, 45-58. MR2641769
[28] I. Nourdin and F.G. Viens (2009). Density estimates and concentration inequalities with Malliavin calculus. Electron. J. Probab. 14, 2287-2309. MR2556018
[29] D. Nualart (2006). The Malliavin Calculus and Related Topics. Springer-Verlag, Berlin, 2nd edition. MR2200233
[30] D. Nualart and G. Peccati (2005). Central limit theorems for sequences of multiple stochastic integrals. Ann. Probab. 33(1), 177-193. MR2118863
[31] G. Peccati, J.-L. Solé, M.S. Taqqu and F. Utzet (2010). Stein's method and normal approximation of Poisson functionals. Ann. Probab. 38, 443-478. MR2642882
[32] G. Peccati and M.S. Taqqu (2008). Central limit theorems for double integrals. Bernoulli 14(3), 791-821. MR2537812
[33] G. Peccati and M.S. Taqqu (2010). Wiener Chaos: Moments, Cumulants and Diagrams. Springer-Verlag, to appear.
[34] N. Privault (2008). Stochastic analysis of Bernoulli processes. Probability Surveys 5, 435-483. MR2476738
[35] N. Privault and W. Schoutens (2002). Discrete chaotic calculus and covariance identities. Stoch. Stoch. Reports 72, 289-315. MR1897919
[36] G. Reinert (2005). Three general approaches to Stein's method. In: An Introduction to Stein's Method, 183-221. Lect. Notes Ser. Inst. Math. Sci. Natl. Univ. Singap. 4, Singapore Univ. Press, Singapore. MR2235451
[37] G. Reinert and A. Röllin (2009). Multivariate normal approximation with Stein's method of exchangeable pairs under a general linearity condition. Ann. Probab. 37, 2150-2173. MR2573554
[38] Y. Rinott and V. Rotar (1996). A multivariate CLT for local dependence with $n^{-1/2}\log n$ rate and applications to multivariate graph related statistics. J. Multivariate Anal. 56, 333-350. MR1379533
[39] Y. Rinott and V. Rotar (1997). On coupling constructions and rates in the CLT for dependent summands with applications to the antivoter model and weighted U-statistics. Ann. Appl. Probab. 7, 1080-1105. MR1484798
[40] V.I. Rotar' (1975). Limit theorems for multilinear forms and quasipolynomial functions. Teor. Verojatnost. i Primenen. 20(3), 527-546. MR0385980
[41] V.I. Rotar' (1979). Limit theorems for polylinear forms. J. Multivariate Anal. 9, 511-530. MR0556909
[42] R.P. Stanley (1997). Enumerative Combinatorics, Vol. 1. Cambridge University Press, Cambridge. MR1442260
[43] C. Stein (1972). A bound for the error in the normal approximation to the distribution of a sum of dependent random variables. In: Proceedings of the Sixth Berkeley Symposium on Mathematical Statistics and Probability, Vol. II: Probability Theory, 583-602. Univ. California Press, Berkeley, CA. MR0402873
[44] C. Stein (1986). Approximate Computation of Expectations. Institute of Mathematical Statistics Lecture Notes - Monograph Series, 7. Institute of Mathematical Statistics, Hayward, CA. MR0882007
[45] D. Surgailis (2003). CLTs for polynomials of linear sequences: diagram formulae with applications. In: Long Range Dependence. Birkhäuser, Basel, 111-128. MR1956046
[46] W.R. van Zwet (1984). A Berry-Esseen bound for symmetric statistics. Z. Wahrscheinlichkeitstheorie verw. Gebiete 66, 425-440. MR0751580
Appendix: Some technical proofs
Proof of Lemma 2.4
In what follows, for $k \ge 1$, we shall write $\mathbf{a}_k$ to indicate the generic vector $\mathbf{a}_k = (a_1, \dots, a_k) \in \mathbb{N}^k$. We start by proving the first point. To do this, just write, using the Cauchy-Schwarz inequality,
$$\begin{aligned} \|f\star_r^l g\|^2_{\ell^2(\mathbb{N})^{\otimes(n+m-r-l)}} &= \sum_{\mathbf{i}_{m+n-r-l}} \Big(f\star_r^l g(\mathbf{i}_{m+n-r-l})\Big)^2 \\ &= \sum_{\mathbf{i}_{n-r}}\sum_{\mathbf{j}_{m-r}}\Big(\sum_{\mathbf{k}_{r-l}}\sum_{\mathbf{a}_l} f(\mathbf{i}_{n-r},\mathbf{k}_{r-l},\mathbf{a}_l)\,g(\mathbf{j}_{m-r},\mathbf{k}_{r-l},\mathbf{a}_l)\Big)^2 \\ &\le \sum_{\mathbf{i}_{n-r}}\sum_{\mathbf{k}_{r-l}}\sum_{\mathbf{a}_l} f^2(\mathbf{i}_{n-r},\mathbf{k}_{r-l},\mathbf{a}_l)\,\sum_{\mathbf{j}_{m-r}}\sum_{\mathbf{b}_l} g^2(\mathbf{j}_{m-r},\mathbf{k}_{r-l},\mathbf{b}_l) \\ &\le \sum_{\mathbf{i}_{n-r}}\sum_{\mathbf{k}_{r-l}}\sum_{\mathbf{a}_l} f^2(\mathbf{i}_{n-r},\mathbf{k}_{r-l},\mathbf{a}_l)\,\sum_{\mathbf{j}_{m-r}}\sum_{\mathbf{l}_{r-l}}\sum_{\mathbf{b}_l} g^2(\mathbf{j}_{m-r},\mathbf{l}_{r-l},\mathbf{b}_l) = \|f\|^2_{\ell^2(\mathbb{N})^{\otimes n}}\,\|g\|^2_{\ell^2(\mathbb{N})^{\otimes m}}. \end{aligned}$$
The first part of the second point is obtained by writing
$$\begin{aligned} \max_j \Big(\sum_{\mathbf{b}_{n-1}} f^2(j,\mathbf{b}_{n-1})\Big)^2 &\le \sum_j \Big(\sum_{\mathbf{b}_{n-1}} f^2(j,\mathbf{b}_{n-1})\Big)^2 = \|f\star_n^{n-1} f\|^2_{\ell^2(\mathbb{N})} \\ &\le \max_j \sum_{\mathbf{b}_{n-1}} f^2(j,\mathbf{b}_{n-1}) \times \sum_{j'}\sum_{\mathbf{b}'_{n-1}} f^2(j',\mathbf{b}'_{n-1}) = \max_j \sum_{\mathbf{b}_{n-1}} f^2(j,\mathbf{b}_{n-1}) \times \|f\|^2_{\ell^2(\mathbb{N})^{\otimes n}}. \end{aligned}$$
For the second part, we have, by applying (in order) the Cauchy-Schwarz inequality, as well as the previous estimate,
$$\begin{aligned} \|f\star_l^{l-1} g\|^2_{\ell^2(\mathbb{N})^{\otimes(n+m-2l+1)}} &= \sum_j\sum_{\mathbf{k}_{n-l}}\sum_{\mathbf{l}_{m-l}}\Big(\sum_{\mathbf{i}_{l-1}} f(j,\mathbf{i}_{l-1},\mathbf{k}_{n-l})\,g(j,\mathbf{i}_{l-1},\mathbf{l}_{m-l})\Big)^2 = \sum_j \|f(j,\cdot)\star_{l-1}^{l-1} g(j,\cdot)\|^2_{\ell^2(\mathbb{N})^{\otimes(n+m-2l)}} \\ &\le \sum_j \sum_{\mathbf{k}_{n-l}}\sum_{\mathbf{i}_{l-1}} f^2(j,\mathbf{i}_{l-1},\mathbf{k}_{n-l}) \sum_{\mathbf{l}_{m-l}}\sum_{\mathbf{j}_{l-1}} g^2(j,\mathbf{j}_{l-1},\mathbf{l}_{m-l}) \\ &\le \max_j \sum_{\mathbf{b}_{n-1}} f^2(j,\mathbf{b}_{n-1}) \times \sum_{\mathbf{i}_m} g^2(\mathbf{i}_m) = \max_j \sum_{\mathbf{b}_{n-1}} f^2(j,\mathbf{b}_{n-1}) \times \|g\|^2_{\ell^2(\mathbb{N})^{\otimes m}} \\ &= \sqrt{\max_j \Big(\sum_{\mathbf{b}_{n-1}} f^2(j,\mathbf{b}_{n-1})\Big)^2} \times \|g\|^2_{\ell^2(\mathbb{N})^{\otimes m}} \\ &\le \sqrt{\sum_j \Big(\sum_{\mathbf{b}_{n-1}} f^2(j,\mathbf{b}_{n-1})\Big)^2} \times \|g\|^2_{\ell^2(\mathbb{N})^{\otimes m}} = \|f\star_n^{n-1} f\|_{\ell^2(\mathbb{N})} \times \|g\|^2_{\ell^2(\mathbb{N})^{\otimes m}}. \end{aligned}$$
Finally, for the third part, just observe that
$$\|f\star_1^0 f\|^2_{\ell^2(\mathbb{N})^{\otimes(2n-1)}} = \sum_j\sum_{\mathbf{i}_{n-1}}\sum_{\mathbf{j}_{n-1}} f^2(j,\mathbf{i}_{n-1})\,f^2(j,\mathbf{j}_{n-1}) = \sum_j \Big(\sum_{\mathbf{i}_{n-1}} f^2(j,\mathbf{i}_{n-1})\Big)^2 = \|f\star_n^{n-1} f\|^2_{\ell^2(\mathbb{N})}$$
and, for $2 \le l \le n$:
$$\begin{aligned} \|f\star_l^{l-1} f\|^2_{\ell^2(\mathbb{N})^{\otimes(2n-2l+1)}} &= \sum_{i=j}\sum_{\mathbf{k}_{n-l}}\sum_{\mathbf{l}_{n-l}}\Big(\sum_{\mathbf{i}_{l-1}} f(i,\mathbf{i}_{l-1},\mathbf{k}_{n-l})\,f(j,\mathbf{i}_{l-1},\mathbf{l}_{n-l})\Big)^2 \\ &\le \sum_{(\mathbf{a}_{n-l+1},\mathbf{b}_{n-l+1})\in\Delta^c_{2n-2l+2}}\Big(\sum_{\mathbf{i}_{l-1}} f(\mathbf{i}_{l-1},\mathbf{a}_{n-l+1})\,f(\mathbf{i}_{l-1},\mathbf{b}_{n-l+1})\Big)^2 \\ &= \|f\star_{l-1}^{l-1} f \times \mathbf{1}_{\Delta^c_{2n-2l+2}}\|^2_{\ell^2(\mathbb{N})^{\otimes(2n-2l+2)}}. \end{aligned}$$
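The contraction norm inequality of the first point can also be checked numerically. The sketch below (with hypothetical, randomly generated kernels, n = m = 2) implements a few contractions $\star_r^l$ via `numpy.einsum`, following the coordinate formula above: r indices of f and g are shared, l of them are summed out, and the remaining $r-l$ are kept equal.

```python
import numpy as np

# Sketch with hypothetical random kernels: numerical check of the bound
# ||f star_r^l g|| <= ||f|| ||g|| (point 1 of Lemma 2.4) for n = m = 2.

rng = np.random.default_rng(0)
N = 6
f = rng.standard_normal((N, N))
g = rng.standard_normal((N, N))
norm = lambda arr: np.sqrt(np.sum(arr ** 2))

c11 = np.einsum('ia,ja->ij', f, g)    # (f star_1^1 g)(i,j) = sum_a f(i,a) g(j,a)
c10 = np.einsum('ik,jk->ijk', f, g)   # (f star_1^0 g)(i,j,k) = f(i,k) g(j,k)
c21 = np.einsum('ka,ka->k', f, g)     # (f star_2^1 g)(k) = sum_a f(k,a) g(k,a)

for c in (c11, c10, c21):
    assert norm(c) <= norm(f) * norm(g) + 1e-12
print("contraction inequality holds for the sampled kernels")
```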
Proof of Lemma 2.6
We have, by bilinearity and using the first point of Lemma 2.4,
$$\begin{aligned} \|f_k\star_r^r g_k - f\star_r^r g\|_{\ell^2(\mathbb{N})^{\otimes(m+n-2r)}} &= \|f_k\star_r^r (g_k-g) + (f_k-f)\star_r^r g\|_{\ell^2(\mathbb{N})^{\otimes(m+n-2r)}} \\ &\le \|f_k\star_r^r (g_k-g)\|_{\ell^2(\mathbb{N})^{\otimes(m+n-2r)}} + \|(f_k-f)\star_r^r g\|_{\ell^2(\mathbb{N})^{\otimes(m+n-2r)}} \\ &\le \|f_k\|_{\ell^2(\mathbb{N})^{\otimes n}}\|g_k-g\|_{\ell^2(\mathbb{N})^{\otimes m}} + \|f_k-f\|_{\ell^2(\mathbb{N})^{\otimes n}}\|g\|_{\ell^2(\mathbb{N})^{\otimes m}} \longrightarrow 0 \quad \text{as } k\to\infty. \end{aligned}$$
Proof of Proposition 2.9
We assume, without loss of generality, that $m \le n$. We first prove (2.11) for functions f and g with support, respectively, in $\Delta_n^N$ and $\Delta_m^N$, for some finite N. One has that
$$J_n(f)\,J_m(g) = \int_{\Delta_n^N\times\Delta_m^N} \big\{f\star_0^0 g\big\}\,d\mu_X^{\otimes(n+m)} = \int_{\Delta_n^N\times\Delta_m^N} f\otimes g\,d\mu_X^{\otimes(n+m)}.$$
For $r = 0, \dots, m$, we denote by $\Pi_r(n,m)$ the set of all partitions π of $\{1,\dots,n+m\}$ composed of:
i) exactly r blocks of the type $\{i_1, i_2\}$, with $1 \le i_1 \le n$ and $n+1 \le i_2 \le n+m$;
ii) exactly $n+m-2r$ singletons.
For instance: an element of $\Pi_2(3,2)$ is the partition
$$\pi = \{\{1,4\},\{2,5\},\{3\}\}; \eqno(6.73)$$
the only element of $\Pi_0(n,m)$ is the partition $\pi = \{\{1\},\{2\},\dots,\{n+m\}\}$ composed of all singletons; an element of $\Pi_3(4,4)$ is the partition
$$\pi = \{\{1,5\},\{2,6\},\{3,7\},\{4\},\{8\}\}. \eqno(6.74)$$
It is easily seen that $\Pi_r(n,m)$ contains exactly $r!\binom{n}{r}\binom{m}{r}$ elements (to specify an element of $\Pi_r(n,m)$, first select r elements of $\{1,\dots,n\}$, then select r elements in $\{n+1,\dots,n+m\}$, then build a bijection between the two selected r-sets). For every $r = 0, \dots, m$ and every $\pi \in \Pi_r(n,m)$, we write $B_{n,m}^N(\pi)$ to denote the subset of $\{1,\dots,N\}^{n+m}$ given by
$$\big\{(i_1,\dots,i_n,i_{n+1},\dots,i_{n+m}) \in \{1,\dots,N\}^{n+m} : i_j = i_k \text{ iff } j \text{ and } k \text{ are in the same block of } \pi\big\}$$
(note the "if and only if" in the definition). For instance, for π as in (6.73), an element of $B_{3,2}^3(\pi)$ is $(1,2,3,1,2)$; for π as in (6.74), an element of $B_{4,4}^5(\pi)$ is $(1,2,3,4,1,2,3,5)$. The following two facts can be easily checked:
A) $\Delta_n^N \times \Delta_m^N = \bigcup_{r=0}^{m}\ \bigcup_{\pi\in\Pi_r(n,m)} B_{n,m}^N(\pi)$, where the unions are disjoint;
B) for every $r = 0, \dots, m$ and every $\pi \in \Pi_r(n,m)$,
$$\int_{B_{n,m}^N(\pi)} \big\{f\star_0^0 g\big\}\,d\mu_X^{\otimes(n+m)} = \int_{\Delta_{n+m-2r}} \big\{f\star_r^r g\big\}\,d\mu_X^{\otimes(n+m-2r)}$$
(note that the last expression does not depend on the partition π, but only on the class $\Pi_r(n,m)$).
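The counting claim $|\Pi_r(n,m)| = r!\binom{n}{r}\binom{m}{r}$ can be confirmed by direct enumeration. The sketch below builds each element of $\Pi_r(n,m)$ exactly as described above: choose r elements on each side, then a bijection between them.

```python
import itertools, math

# Sketch: enumerate Pi_r(n, m) -- r disjoint pairs {i1, i2} with
# i1 in {1,...,n} and i2 in {n+1,...,n+m}, all other points singletons --
# and confirm the count r! * C(n, r) * C(m, r) stated above.

def count_partitions(n, m, r):
    count = 0
    for left in itertools.combinations(range(1, n + 1), r):
        for right in itertools.combinations(range(n + 1, n + m + 1), r):
            for matched in itertools.permutations(right):
                count += 1            # pair left[k] with matched[k]
    return count

for n, m, r in [(3, 2, 2), (4, 4, 3), (5, 3, 0)]:
    assert count_partitions(n, m, r) == math.factorial(r) * math.comb(n, r) * math.comb(m, r)
print("partition counts agree")
```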
It follows that
$$\begin{aligned} J_n(f)\,J_m(g) &= \int_{\Delta_n^N\times\Delta_m^N} \big\{f\star_0^0 g\big\}\,d\mu_X^{\otimes(n+m)} \\ &= \sum_{r=0}^{m}\ \sum_{\pi(r)\in\Pi_r(n,m)} \int_{B_{n,m}^N(\pi(r))} \big\{f\star_0^0 g\big\}\,d\mu_X^{\otimes(n+m)} \\ &= \sum_{r=0}^{m} r!\binom{n}{r}\binom{m}{r} \int_{\Delta_{n+m-2r}} \big\{f\star_r^r g\big\}\,d\mu_X^{\otimes(n+m-2r)} \\ &\overset{(*)}{=} \sum_{r=0}^{m} r!\binom{n}{r}\binom{m}{r} \int_{\Delta_{n+m-2r}} \big\{\widetilde{f\star_r^r g}\big\}\,d\mu_X^{\otimes(n+m-2r)} \\ &= \sum_{r=0}^{m} r!\binom{n}{r}\binom{m}{r}\, J_{n+m-2r}\Big[\widetilde{f\star_r^r g}\;\mathbf{1}_{\Delta_{n+m-2r}}\Big], \end{aligned}$$
which is the desired conclusion. We stress that the equality (∗) has been obtained by using the symmetry of the measure $\mu_X^{\otimes(n+m-2r)}$. The result for general f, g is deduced by an approximation argument and Lemma 2.6. This concludes the proof.
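In its simplest instance, $n = m = 1$, the multiplication formula reduces (since $X_i^2 = 1$) to $J_1(f)J_1(g) = J_2\big(\widetilde{f\otimes g}\,\mathbf{1}_{\Delta_2}\big) + \langle f,g\rangle$. The sketch below (with hypothetical kernels) checks this pointwise over all sign configurations, writing the second-chaos term as the off-diagonal double sum, which equals the symmetrised $J_2$ contribution.

```python
import itertools

# Sketch: pointwise check of the multiplication formula for n = m = 1,
#   J_1(f) J_1(g) = (off-diagonal part of the double sum) + <f, g>,
# over all x in {-1, +1}^N, using X_i^2 = 1. Kernels are hypothetical.

N = 4
f = [0.3, -0.1, 0.7, 0.2]
g = [0.5, 0.4, -0.6, 0.1]

for x in itertools.product([+1, -1], repeat=N):
    J1f = sum(f[i] * x[i] for i in range(N))
    J1g = sum(g[i] * x[i] for i in range(N))
    off_diag = sum(f[i] * g[j] * x[i] * x[j]
                   for i in range(N) for j in range(N) if i != j)
    inner = sum(f[i] * g[i] for i in range(N))
    assert abs(J1f * J1g - (off_diag + inner)) < 1e-12
print("multiplication formula verified for n = m = 1")
```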