On the Dirichlet problem
Let $y_{\epsilon_{n}}$ be the solution of the system $y' = f(y, a_{\epsilon_{n}}(t))$, $t < t_{n}$, $y(t_{n}) = z$, that is, the trajectory moving backward from $z$ using control $a_{\epsilon_{n}}$, and set $x_{n} := y_{\epsilon_{n}}(0)$. In order to prove that $x_{n} \to \hat{x}$ we consider the solution $\tilde{y}_{\epsilon_{n}}$ of $y' = f(y, a_{\epsilon_{n}}(t))$, $t < t_{n}$, $y(t_{n}) = z_{\epsilon_{n}}$, that is, the trajectory moving backward from $z_{\epsilon_{n}}$ and using control $a_{\epsilon_{n}}$. Note that $\tilde{y}_{\epsilon_{n}}(0) = \hat{x}$. By differentiating $|y_{\epsilon_{n}} - \tilde{y}_{\epsilon_{n}}|^{2}$, using (37) and then integrating, we get, for all $t < t_{n}$,
\[
|y_{\epsilon_{n}}(t) - \tilde{y}_{\epsilon_{n}}(t)|^{2} \le |z_{\epsilon_{n}} - z|^{2} + \int_{t}^{t_{n}} 2L\, |y_{\epsilon_{n}}(s) - \tilde{y}_{\epsilon_{n}}(s)|^{2}\, ds .
\]
Then, by Gronwall's lemma, for all $t < t_{n}$,
\[
|y_{\epsilon_{n}}(t) - \tilde{y}_{\epsilon_{n}}(t)| \le |z_{\epsilon_{n}} - z|\, e^{L (t_{n} - t)} ,
\]
which gives, for $t = 0$,
\[
|\hat{x} - x_{n}| \le |z_{\epsilon_{n}} - z|\, e^{L t_{n}} .
\]
By letting $n \to \infty$, we get that $x_{n} \to \hat{x}$. By the definition of minimum time, $T(x_{n}) \le t_{n}$, so letting $n \to \infty$ we obtain $T^{*}(\hat{x}) \le t$, which gives the desired contradiction with (43).

The remaining case is $v^{*}(\hat{x}) = 1$. By (39), $T^{*}_{\epsilon}(\hat{x}) \le K < +\infty$ for all $\epsilon$. By using the previous argument we get (42) with $t < +\infty$ and $T^{*}(\hat{x}) \le t$. This is a contradiction with $T^{*}(\hat{x}) = +\infty$ and completes the proof.
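For the reader's convenience, the integral form of Gronwall's lemma used in the estimate above can be recorded as follows (a standard statement, recalled here as an aside; the backward-time orientation matches the argument):

```latex
% Integral form of Gronwall's lemma on [0, t_n], backward in time:
% if a nonnegative continuous function \varphi satisfies the integral
% bound below, then it satisfies the exponential bound.
\[
\varphi(t) \;\le\; C + \int_{t}^{t_{n}} K\, \varphi(s)\, ds
\quad \text{for all } t < t_{n}
\;\;\Longrightarrow\;\;
\varphi(t) \;\le\; C\, e^{K (t_{n} - t)} .
\]
```

Here it is applied with $\varphi(t)$ the squared distance between the two trajectories, $C = |z_{\epsilon_{n}} - z|^{2}$ and $K = 2L$; taking square roots of the resulting bound yields the exponent $L(t_{n} - t)$.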
3.2. Maximizing the mean escape time of a degenerate diffusion process
In this subsection we study a stochastic control problem having as a special case the problem of maximizing the expected discounted time spent by a controlled diffusion process in a given open set $\Omega \subseteq \mathbb{R}^{N}$. A number of engineering applications of this problem are listed in [19], where, however, a different cost criterion is proposed and a nondegeneracy assumption is made
on the diffusion matrix. We consider a probability space $(\Omega', \mathcal{F}, P)$ with a right-continuous increasing filtration of complete sub-$\sigma$-fields $\{\mathcal{F}_{t}\}$, a Brownian motion $B_{t}$ in $\mathbb{R}^{M}$, $\mathcal{F}_{t}$-adapted, a compact set $A$, and call $\mathcal{A}$ the set of progressively measurable processes $\alpha_{t}$ taking values in $A$. We are given bounded and continuous maps $\sigma$ from $\mathbb{R}^{N} \times A$ into the set of $N \times M$ matrices and $b : \mathbb{R}^{N} \times A \to \mathbb{R}^{N}$ satisfying (14), (15), and consider the controlled stochastic differential equation
\[
(SDE) \qquad d X_{t} = \sigma^{\alpha_{t}}(X_{t})\, d B_{t} - b^{\alpha_{t}}(X_{t})\, dt , \quad t \ge 0 , \qquad X_{0} = x .
\]
For any $\alpha_{.} \in \mathcal{A}$, (SDE) has a pathwise unique solution $X_{t}$ which is $\mathcal{F}_{t}$-progressively measurable and has continuous sample paths. We are given also two bounded and uniformly continuous
M. Bardi – S. Bottacin
maps $f, c : \mathbb{R}^{N} \times A \to \mathbb{R}$, $c^{\alpha}(x) \ge c_{0} > 0$ for all $x, \alpha$, and consider the payoff functional
\[
J(x, \alpha_{.}) := E \left[ \int_{0}^{t_{x}(\alpha_{.})} f^{\alpha_{t}}(X_{t})\, e^{-\int_{0}^{t} c^{\alpha_{s}}(X_{s})\, ds}\, dt \right] ,
\]
where $E$ denotes the expectation and
\[
t_{x}(\alpha_{.}) := \inf\{ t \ge 0 : X_{t} \notin \Omega \} ,
\]
where, as usual, $t_{x}(\alpha_{.}) = +\infty$ if $X_{t} \in \Omega$ for all $t \ge 0$. We want to maximize this payoff, so we consider the value function
\[
v(x) := \sup_{\alpha_{.} \in \mathcal{A}} J(x, \alpha_{.}) .
\]
Note that for $f = c \equiv 1$ the problem becomes the maximization of the mean discounted time $E\left[ 1 - e^{-t_{x}(\alpha_{.})} \right]$ spent by the trajectories of (SDE) in $\Omega$.

The Hamilton-Jacobi-Bellman operator and the Dirichlet problem associated to $v$ by the
Dynamic Programming method are
\[
F(x, u, Du, D^{2}u) := \min_{\alpha \in A} \left\{ -a^{\alpha}_{ij}(x)\, u_{x_{i} x_{j}} + b^{\alpha}(x) \cdot Du + c^{\alpha}(x)\, u - f^{\alpha}(x) \right\} ,
\]
where the matrix $(a_{ij})$ is $\frac{1}{2} \sigma \sigma^{T}$, and
\[
F(x, u, Du, D^{2}u) = 0 \ \text{in } \Omega , \qquad u = 0 \ \text{on } \partial \Omega , \tag{44}
\]
see, for instance, [40, 35, 36, 22, 32] and the references therein. The proof that the value function satisfies the Hamilton-Jacobi-Bellman PDE is based on the Dynamic Programming Principle
\[
v(x) = \sup_{\alpha_{.} \in \mathcal{A}} E \left[ \int_{0}^{\theta \wedge t_{x}} f^{\alpha_{t}}(X_{t})\, e^{-\int_{0}^{t} c^{\alpha_{s}}(X_{s})\, ds}\, dt + v(X_{\theta \wedge t_{x}})\, e^{-\int_{0}^{\theta \wedge t_{x}} c^{\alpha_{s}}(X_{s})\, ds} \right] , \tag{45}
\]
where $t_{x} = t_{x}(\alpha_{.})$, for all $x \in \Omega$ and all $\mathcal{F}_{t}$-measurable stopping times $\theta$. Although the DPP (45) is generally believed to be true under the current assumptions (see, e.g., [35]), we were able to find its proof in the literature only under some additional conditions, such as the convexity of the set
\[
\{ (a^{\alpha}(x), b^{\alpha}(x), f^{\alpha}(x), c^{\alpha}(x)) : \alpha \in A \}
\]
for all $x \in \Omega$, see [20] (this is true, in particular, when relaxed controls are used), or the independence of the variance of the noise from the control [15], i.e., $\sigma^{\alpha}(x) = \sigma(x)$ for all $x$, or the continuity of $v$ [35]. As recalled in Subsection 1.1, a Comparison Principle for (44) can be found in [29]; see also [18] and the references therein.
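The payoff and exit time defined above lend themselves to a direct Monte Carlo sanity check. The sketch below is a toy one-dimensional instance with all model data chosen for illustration (assumptions, not taken from the paper): $\Omega = (-1, 1)$, $A = [-1, 1]$, $\sigma^{a}(x) = a/2$ (degenerate at $a = 0$), $b^{a}(x) = -a$, and $f = c \equiv 1$, so that $J(x, \alpha_{.}) = E[1 - e^{-t_{x}(\alpha_{.})}]$ as noted in the text.

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy 1-D instance of the exit-time payoff (all model choices here are
# illustrative assumptions): Omega = (-1, 1), A = [-1, 1],
# sigma^a(x) = a/2 (degenerate at a = 0), b^a(x) = -a, f = c = 1,
# so that J(x, alpha) = E[1 - exp(-t_x(alpha))].
def payoff(x0, control, T=20.0, dt=1e-2, n_paths=1000):
    """Euler-Maruyama Monte Carlo estimate of J(x, alpha) for feedback a = control(x)."""
    n_steps = int(T / dt)
    X = np.full(n_paths, float(x0))
    alive = np.ones(n_paths, dtype=bool)   # paths with t < t_x (still inside Omega)
    J = np.zeros(n_paths)
    t = 0.0
    for _ in range(n_steps):
        a = control(X)
        J += alive * np.exp(-t) * dt       # f == 1, discount exp(-int c) = exp(-t)
        dB = rng.normal(0.0, np.sqrt(dt), n_paths)
        # (SDE): dX = sigma^a(X) dB - b^a(X) dt, with sigma = a/2 and b = -a
        X = np.where(alive, X + 0.5 * a * dB + a * dt, X)
        alive &= np.abs(X) < 1.0           # t_x = first exit time from Omega
        t += dt
    return J.mean()

# With a == 0 the dynamics are frozen, t_x = +infty, and the estimate should be
# close to int_0^infty e^{-t} dt = 1 (up to time-step bias and horizon truncation).
print(payoff(0.0, lambda x: np.zeros_like(x)))
```

With the degenerate control $a \equiv 0$ the path never leaves $\Omega$ and the discounted payoff approaches $1$; any control that drives the path out of $\Omega$ in finite time yields a strictly smaller value, consistent with the interpretation of $v$ as a maximal mean discounted escape time.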
In order to prove that $v$ is the e-solution of (44), we approximate $\Omega$ with a nested family of open sets $\Omega_{\epsilon}$ with the properties
\[
\Omega_{\epsilon} \subseteq \Omega , \quad \epsilon \in\, ]0, 1] , \qquad \Omega_{\epsilon} \supseteq \Omega_{\delta} \ \text{for } \epsilon < \delta , \qquad \bigcup_{\epsilon} \Omega_{\epsilon} = \Omega . \tag{46}
\]
For each $\epsilon > 0$ we call $v_{\epsilon}$ the value function of the same control problem with $t_{x}$ replaced with
\[
t^{\epsilon}_{x}(\alpha_{.}) := \inf\{ t \ge 0 : X_{t} \notin \Omega_{\epsilon} \}
\]
in the definition of the payoff $J$. In the next theorem we assume that each $v_{\epsilon}$ satisfies the DPP (45) with $t_{x}$ replaced with $t^{\epsilon}_{x}$. Finally, we make the additional assumption
\[
f^{\alpha}(x) \ge 0 \quad \text{for all } x \in \Omega ,\ \alpha \in A , \tag{47}
\]
which ensures that $u \equiv 0$ is a subsolution of (44). The main result of this subsection is the following.
THEOREM 9. Under the previous assumptions the value function $v$ is the e-solution and the minimal supersolution of (44), and
\[
v = \sup_{0 < \epsilon \le 1} v_{\epsilon} = \lim_{\epsilon \searrow 0} v_{\epsilon} .
\]

Proof. Note that $v_{\epsilon}$ is nondecreasing as $\epsilon \searrow 0$, so $\lim_{\epsilon \searrow 0} v_{\epsilon}$ exists and equals the sup. By Theorem 3 with $g \equiv 0$, $u \equiv 0$, there exists the e-solution $H$ of (44). We consider the functions $u_{2\epsilon}$ defined by (20) and claim that
\[
u_{2\epsilon} \le v_{\epsilon *} \le v^{*}_{\epsilon} \le H .
\]
Then
\[
H = \sup_{0 < \epsilon \le 1} v_{\epsilon} , \tag{48}
\]
because $H = \sup_{\epsilon} u_{2\epsilon}$ by Remark 1. We prove the claim in three steps.

Step 1. By standard methods [35, 9], the Dynamic Programming Principle for $v_{\epsilon}$ implies that $v_{\epsilon}$ is a non-continuous viscosity solution of the Hamilton-Jacobi-Bellman equation $F = 0$ in $\Omega_{\epsilon}$, and $v^{*}_{\epsilon}$ is a viscosity subsolution of the boundary condition
\[
u = 0 \ \text{ or } \ F(x, u, Du, D^{2}u) = 0 \quad \text{on } \partial \Omega_{\epsilon} , \tag{49}
\]
as defined in Subsection 1.3.

Step 2. Since $v_{\epsilon *}$ is a supersolution of the PDE $F = 0$ in $\Omega_{\epsilon}$ and $v_{\epsilon *} \ge 0$ on $\partial \Omega_{\epsilon}$, the Comparison Principle implies $v_{\epsilon *} \ge w$ for any subsolution $w$ of (44) such that $w = 0$ on $\partial \Omega_{\epsilon}$. Since $\partial \Omega_{\epsilon} \subseteq \Omega \setminus \Omega_{2\epsilon}$ by (46), we obtain $u_{2\epsilon} \le v_{\epsilon *}$ by the definition (20) of $u_{2\epsilon}$.

Step 3. We claim that $v^{*}_{\epsilon}$ is a subsolution of (44). In fact, we noted before that it is a subsolution of the PDE in $\Omega_{\epsilon}$, and this is true also in $\Omega \setminus \Omega_{\epsilon}$, where $v^{*}_{\epsilon} \equiv 0$, by (47), whereas the boundary condition is trivial. It remains to check the PDE at all points of $\partial \Omega_{\epsilon}$. Given $\hat{x} \in \partial \Omega_{\epsilon}$, we must prove that for all $\phi \in C^{2}$ such that $v^{*}_{\epsilon} - \phi$ attains a local maximum at $\hat{x}$, we have
\[
F(\hat{x}, v^{*}_{\epsilon}(\hat{x}), D\phi(\hat{x}), D^{2}\phi(\hat{x})) \le 0 . \tag{50}
\]

1st Case: $v^{*}_{\epsilon}(\hat{x}) > 0$. Since $v^{*}_{\epsilon}$ satisfies (49), for all $\phi \in C^{2}(\Omega_{\epsilon})$ such that $v^{*}_{\epsilon} - \phi$ attains a local maximum at $\hat{x}$, (50) holds. Then the same inequality holds for all $\phi \in C^{2}$ as well.

2nd Case: $v^{*}_{\epsilon}(\hat{x}) = 0$. Since $v^{*}_{\epsilon} - \phi$ attains a local maximum at $\hat{x}$, for all $x$ near $\hat{x}$ we have
\[
v^{*}_{\epsilon}(x) - v^{*}_{\epsilon}(\hat{x}) \le \phi(x) - \phi(\hat{x}) .
\]
By Taylor's formula for $\phi$ at $\hat{x}$ and the fact that $v^{*}_{\epsilon}(x) \ge 0$, we get
\[
D\phi(\hat{x}) \cdot (x - \hat{x}) \ge o(|x - \hat{x}|) ,
\]
and this implies $D\phi(\hat{x}) = 0$. Then Taylor's formula for $\phi$ gives also
\[
(x - \hat{x}) \cdot D^{2}\phi(\hat{x}) (x - \hat{x}) \ge o(|x - \hat{x}|^{2}) ,
\]
and this implies $D^{2}\phi(\hat{x}) \ge 0$, as it is easy to check. Then
\[
F(\hat{x}, v^{*}_{\epsilon}(\hat{x}), D\phi(\hat{x}), D^{2}\phi(\hat{x})) = F(\hat{x}, 0, 0, D^{2}\phi(\hat{x})) \le 0 ,
\]
because $a^{\alpha} \ge 0$ and $f^{\alpha} \ge 0$ for all $x$ and $\alpha$. This completes the proof that $v^{*}_{\epsilon}$ is a subsolution of (44). Now the Comparison Principle yields $v^{*}_{\epsilon} \le H$, since $H$ is a supersolution of (44).

It remains to prove that $v = \sup_{0 < \epsilon \le 1} v_{\epsilon}$. To this purpose we take a sequence $\epsilon_{n} \searrow 0$ and define
\[
J_{n}(x, \alpha_{.}) := E \left[ \int_{0}^{t^{\epsilon_{n}}_{x}(\alpha_{.})} f^{\alpha_{t}}(X_{t})\, e^{-\int_{0}^{t} c^{\alpha_{s}}(X_{s})\, ds}\, dt \right] .
\]
We claim that
\[
\lim_{n} J_{n}(x, \alpha_{.}) = \sup_{n} J_{n}(x, \alpha_{.}) = J(x, \alpha_{.})
\]
for all $\alpha_{.}$ and $x$. The monotonicity of $t^{\epsilon_{n}}_{x}$ follows from (46), and it implies the monotonicity of $J_{n}$ by (47). Let $\tau := \sup_{n} t^{\epsilon_{n}}_{x}(\alpha_{.}) \le t_{x}(\alpha_{.})$, and note that $t_{x}(\alpha_{.}) = +\infty$ if $\tau = +\infty$. In the case $\tau < +\infty$, $X_{t^{\epsilon_{n}}_{x}} \in \partial \Omega_{\epsilon_{n}}$ implies $X_{\tau} \in \partial \Omega$, so $\tau = t_{x}(\alpha_{.})$ again. This and (47) yield the claim by the Lebesgue monotone convergence theorem. Then
\[
v(x) = \sup_{\alpha_{.}} \sup_{n} J_{n}(x, \alpha_{.}) = \sup_{n} \sup_{\alpha_{.}} J_{n}(x, \alpha_{.}) = \sup_{n} v_{\epsilon_{n}} = \sup_{\epsilon} v_{\epsilon} ,
\]
so (48) gives $v = H$ and completes the proof.
REMARK 7. From Theorem 9 it is easy to get a Verification theorem by taking the supersolutions of (44) as verification functions. We consider a presynthesis $\alpha^{x}$, that is, a map $\alpha^{\cdot} : \Omega \to \mathcal{A}$, and say it is optimal at $x_{o}$ if $J(x_{o}, \alpha^{x_{o}}) = v(x_{o})$. Then Theorem 9 gives immediately the following sufficient condition of optimality: if there exists a verification function $W$ such that $W(x_{o}) \le J(x_{o}, \alpha^{x_{o}})$, then $\alpha^{\cdot}$ is optimal at $x_{o}$; moreover, a characterization of global optimality is the following: $\alpha^{\cdot}$ is optimal in $\Omega$ if and only if $J(\cdot, \alpha^{\cdot})$ is a verification function.
REMARK 8. We can combine Theorem 9 with the results of Subsection 2.2 to approximate the value function $v$ with smooth value functions. Consider a Brownian motion $\tilde{B}_{t}$ in $\mathbb{R}^{N}$, $\mathcal{F}_{t}$-adapted, and replace the stochastic differential equation in (SDE) with
\[
d X_{t} = \sigma^{\alpha_{t}}(X_{t})\, d B_{t} - b^{\alpha_{t}}(X_{t})\, dt + \sqrt{2h}\, d \tilde{B}_{t} , \quad t \ge 0 ,
\]
for $h > 0$. For a family of nested open sets $\Omega_{\epsilon}$ with the properties (46), consider the value function $v^{h}_{\epsilon}$ of the problem of maximizing the payoff functional $J$ with $t_{x}$ replaced with $t^{\epsilon}_{x}$. Assume for simplicity that $a^{\alpha}, b^{\alpha}, c^{\alpha}, f^{\alpha}$ are smooth (otherwise we can approximate them by mollification). Then $v^{h}_{\epsilon}$ is the classical solution of (30), where $F$ is the HJB operator of this subsection and $u \equiv 0$, by the results in [21, 24, 36, 31], and it is possible to synthesize an optimal Markov control policy for the problem with $\epsilon, h > 0$ by standard methods (see, e.g., [22]). By Theorem 6, $v^{h}_{\epsilon}$ converges to $v$ as $\epsilon, h \searrow 0$ with $h$ linked to $\epsilon$.
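The effect of the regularizing term $\sqrt{2h}\, d\tilde{B}_{t}$ can be probed numerically on a toy instance. The sketch below makes strong simplifying assumptions that are not taken from the paper: $\Omega = (-1, 1)$, the control is frozen so that $\sigma$ and $b$ vanish, and $f = c \equiv 1$, leaving only the added noise to drive the state; the Monte Carlo payoff should then increase toward its unregularized value as $h \searrow 0$.

```python
import numpy as np

rng = np.random.default_rng(1)

# Toy check of the vanishing-viscosity regularization (illustrative
# assumptions: Omega = (-1, 1), sigma = b = 0, f = c = 1): the dynamics
# reduce to dX_t = sqrt(2h) dB~_t and the payoff is E[1 - exp(-exit time)].
def payoff_h(x0, h, T=20.0, dt=1e-2, n_paths=4000):
    """Monte Carlo payoff of the regularized dynamics dX = sqrt(2h) dB~."""
    n_steps = int(T / dt)
    X = np.full(n_paths, float(x0))
    alive = np.ones(n_paths, dtype=bool)   # paths still inside Omega
    J = np.zeros(n_paths)
    t = 0.0
    for _ in range(n_steps):
        J += alive * np.exp(-t) * dt       # f == 1, c == 1
        noise = rng.normal(0.0, np.sqrt(dt), n_paths)
        X = np.where(alive, X + np.sqrt(2.0 * h) * noise, X)
        alive &= np.abs(X) < 1.0           # exit time of the regularized path
        t += dt
    return J.mean()

# Smaller h pushes the path out of Omega more slowly, so the discounted
# exit-time payoff grows toward its unregularized (h = 0) value.
for h in (0.1, 0.01, 0.0):
    print(h, payoff_h(0.0, h))
```

In this degenerate instance the unregularized path never exits, so the $h = 0$ payoff is close to $1$ and the regularized payoffs approach it monotonically from below, in the spirit of the convergence $v^{h}_{\epsilon} \to v$ stated above.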
References

[1] BARDI M., BOTTACIN S., Discontinuous solutions of degenerate elliptic boundary value problems, Preprint 22, Dip. di Matematica P. e A., Università di Padova, 1995.
[2] BARDI M., BOTTACIN S., Characteristic and irrelevant boundary points for viscosity solutions of nonlinear degenerate elliptic equations, Preprint 25, Dip. di Matematica P. e A., Università di Padova, 1998.
[3] BARDI M., BOTTACIN S., FALCONE M., Convergence of discrete schemes for discontinuous value functions of pursuit-evasion games, in "New Trends in Dynamic Games and Applications", G. J. Olsder ed., Birkhäuser, Boston 1995, 273–304.
[4] BARDI M., CAPUZZO-DOLCETTA I., Optimal control and viscosity solutions of Hamilton-Jacobi-Bellman equations, Birkhäuser, Boston 1997.
[5] BARDI M., CRANDALL M., EVANS L. C., SONER H. M., SOUGANIDIS P. E., Viscosity solutions and applications, I. Capuzzo Dolcetta and P.-L. Lions eds., Springer Lecture Notes in Mathematics 1660, Berlin 1997.
[6] BARDI M., GOATIN P., ISHII H., A Dirichlet type problem for nonlinear degenerate elliptic equations arising in time-optimal stochastic control, Preprint 1, Dip. di Matematica P. e A., Università di Padova, 1998, to appear in Adv. Math. Sci. Appl.
[7] BARDI M., STAICU V., The Bellman equation for time-optimal control of noncontrollable, nonlinear systems, Acta Appl. Math. 31 (1993), 201–223.
[8] BARLES G., Solutions de viscosité des équations de Hamilton-Jacobi, Springer-Verlag, 1994.
[9] BARLES G., BURDEAU J., The Dirichlet problem for semilinear second-order degenerate elliptic equations and applications to stochastic exit time control problems, Comm. Partial Differential Equations 20 (1995), 129–178.
[10] BARLES G., PERTHAME B., Discontinuous solutions of deterministic optimal stopping time problems, RAIRO Modél. Math. Anal. Numér. 21 (1987), 557–579.
[11] BARRON E. N., JENSEN R., Semicontinuous viscosity solutions of Hamilton-Isaacs equations with convex Hamiltonians, Comm. Partial Differential Equations 15 (1990), 1713–1742.
[12] BERESTYCKI H., NIRENBERG L., VARADHAN S. R. S., The Principal Eigenvalue and Maximum Principle for Second-Order Elliptic Operators in General Domains, Comm. Pure Appl. Math. XLVII (1994), 47–92.
[13] BETTINI P., Problemi di Cauchy per equazioni paraboliche degeneri con dati discontinui e applicazioni al controllo e ai giochi differenziali, Thesis, Università di Padova, 1998.
[14] BLANC A.-PH., Deterministic exit time control problems with discontinuous exit costs, SIAM J. Control Optim. 35 (1997), 399–434.
[15] BORKAR V. S., Optimal control of diffusion processes, Pitman Research Notes in Mathematics Series 203, Longman, Harlow 1989.
[16] BRELOT M., Familles de Perron et problème de Dirichlet, Acta Litt. Sci. Szeged 9 (1939), 133–153.
[17] CAFFARELLI L. A., CABRÉ X., Fully nonlinear elliptic equations, Amer. Math. Soc., Providence, RI, 1995.