
Let

(X_{n})_{n \geq 0}

be a homogeneous ergodic Markov chain,

X_{n} \in R^{d}

with the transition probability kernel for

n

steps:

P_{x}^{(n)} = P^{(n)} (x, d y)

(for brevity

P_{x}^{(1)} : = P_{x}

) and the unique invariant measure

μ

. Let

H

be a measurable function

R^{d} \overset{H}{\to} R^{p}

with

\int_{R^{d}} | H (z) | μ (d z) < \infty

and

\begin{matrix} \int_{R^{d}} H (z) μ (d z) = 0 . \end{matrix}

(1.1)

Set

S_{n}^{α} = \frac{1}{n^{α}} \sum_{i = 1}^{n} H (X_{i - 1}), n \geq 1; (0.5 < α < 1) .

In this paper, we examine the moderate deviation principle (in short:

MDP) for the family

(S_{n}^{α})_{n \geq 1}

when the spectrum of operator

P_{x}

is continuous.

It is well known that for bounded

H

satisfying 1.1 (the (H) condition), the Markov chains most compatible with the MDP are characterized by the eigenvalue gap condition (EG) (see Wu [16] , [17] , Gong and Wu [7] , and citations therein):

unity is an isolated, simple eigenvalue of the transition probability kernel $P_{x}$ and the only eigenvalue of modulus 1.

In the framework of (H)-(EG) conditions, the MDP is valid with the rate of speed

n^{- (2 α - 1)}

and the rate function

I (y), y \in R^{d}

\begin{matrix} I (y) = \begin{cases} \frac{1}{2} ∥ y ∥_{B^{\oplus}}^{2}, & B^{\oplus} B y = y \\ \infty, & \text{otherwise,} \end{cases} \end{matrix}

(1.2)

where

B^{\oplus}

is the pseudoinverse matrix (in Moore-Penrose sense, see e.g.[1] ) for the matrix

\begin{matrix} B = \int_{R^{d}} H (x) H^{*} (x) μ (d x) + \sum_{n \geq 1} \int_{R^{d}} [H (x) (P_{x}^{(n)} H)^{*} + (P_{x}^{(n)} H) H^{*} (x)] μ (d x) \end{matrix}

(1.3)

(henceforth,

^{*} ,

| \cdot |

, and

∥ \cdot ∥_{Q}

are the transposition symbol,

L^{1}

norm and

L^{2}

norm with the kernel

Q

(

∥ x ∥_{Q} = \sqrt{〈 x, Q x 〉}

) respectively).

Thanks to its quadratic rate function, the MDP is an attractive tool for asymptotic analysis in many areas, in the spirit of the thesis (see the example in Section 7 )

“MDP instead of CLT”.

In this paper, we intend to apply the MDP analysis to Markov chain defined by the recurrent equation

X_{n} = f (X_{n - 1}, ξ_{n}), n \geq 1

generated by i.i.d. sequence

(ξ_{n})_{n \geq 1}

of random vectors, where

f

is some vector-valued measurable function. Obviously, the function

f

and the distribution of

ξ_{1}

might be specified in such a way that

P_{x}

satisfies (EG). For instance, if

d = 1

and

X_{n} = f (X_{n - 1}) + ξ_{n},

then for bounded

f

and a Laplacian random variable

ξ_{1}

(EG) holds. However, (EG) fails for many ergodic Markov chains that are useful in applications. For

d = 1

, a typical example is the Gaussian Markov chain defined by a linear recurrent equation driven by an i.i.d. sequence of

(0, 1)

-Gaussian random variables (here

| a | < 1

)

X_{n} = a X_{n - 1} + ξ_{n} .

To clarify this remark, notice that if (EG) holds true, then for any bounded and measurable function

H

satisfying the (H) property, there exist constants

K > 0

and

ϱ \in (0, 1)

such that, for all

n \geq 1,

\begin{matrix} | E_{x} H (X_{n}) | \leq K ϱ^{n} . \end{matrix}

(1.4)

However, the latter fails for

H (x) = sign (x)

satisfying 1.1 . In fact, if 1.4 were correct, then

\sum_{n = 0}^{\infty} | E_{x} H (X_{n}) | \leq \frac{K}{1 - ϱ} .

On the other hand, it is easy to compute that

\sum_{n = 0}^{\infty} | E_{x} H (X_{n}) |

grows in

| x |

on the set

{| x | > 1}

faster than

O (log (| x |)) .
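The growth in | x | can be checked numerically. The following is a minimal sketch, assuming the Gaussian chain X_{n} = a X_{n - 1} + ξ_{n} with X_{0} = x, for which X_{n} is Gaussian with mean a^{n} x and variance (1 - a^{2 n}) / (1 - a^{2}); the function names `mean_sign` and `abs_sum`, the value a = 0.5, and the truncation level are illustrative choices, not part of the paper.

```python
import math


def mean_sign(a: float, x: float, n: int) -> float:
    """E_x sign(X_n) for X_n = a * X_{n-1} + xi_n, xi_n ~ N(0, 1), X_0 = x."""
    if n == 0:
        return math.copysign(1.0, x) if x != 0 else 0.0
    mean = a ** n * x
    var = (1.0 - a ** (2 * n)) / (1.0 - a * a)
    # For Z ~ N(mean, var): E sign(Z) = 2 * Phi(mean / sd) - 1 = erf(mean / sqrt(2 * var)).
    return math.erf(mean / math.sqrt(2.0 * var))


def abs_sum(a: float, x: float, n_terms: int = 400) -> float:
    """Truncated version of sum_{n >= 0} |E_x sign(X_n)|."""
    return sum(abs(mean_sign(a, x, n)) for n in range(n_terms))


if __name__ == "__main__":
    for x in (1.0, 1e3, 1e6):
        print(x, abs_sum(0.5, x))
```

The printed sums keep increasing with | x |, so no bound of the form K / (1 - ϱ), uniform in x, can hold.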

In this paper, we avoid verifying (EG). Although our approach is close to the concepts of “Multiplicative Ergodicity” (see Balaji and Meyn [2] ) and “Geometric Ergodicity” (see Kontoyiannis and Meyn [8] , Meyn and Tweedie [11] , Chen and Guillin [4] ), we do not follow these methodologies explicitly.

Our main tools are the Poisson equation and the Puhalskii theorem from [15] . The Poisson equation allows us to reduce the MDP verification for

(S_{n}^{α})_{n \geq 1}

to that for

(\frac{1}{n^{α}} M_{n})_{n \geq 1}

, where

M_{n}

is a martingale generated by the Markov chain, while the Puhalskii theorem allows us to replace the asymptotic analysis of the Laplace transform of

\frac{1}{n^{α}} M_{n}

by the asymptotic analysis of the so-called stochastic exponential

\begin{matrix} E_{n} (λ) = \prod_{i = 1}^{n} E (exp [〈 λ, \frac{1}{n^{α}} (M_{i} - M_{i - 1}) 〉] | X_{i - 1}), λ \in R^{d} \end{matrix}

(1.5)

being the product of the conditional Laplace transforms for martingale increments.

The effectiveness of the Poisson equation approach (the method of correctors) combined with the stochastic exponential is well known from the proofs of the functional central limit theorem (FCLT) for the family

(S_{n}^{0.5})_{n \geq 1}

(see, e.g.

Papanicolaou, Stroock and Varadhan [12] , Ethier and Kurtz [6] , Bhattacharya [3] , Pardoux and Veretennikov [13] ; related topics can be found in Metivier and Priouret (80's) for stochastic algorithms analysis. The use of the same approach for a continuous time setting can be found e.g. in [9] , [10] ).

2 Formulation of main result

We consider a Markov chain

(X_{n})_{n \geq 0}

,

X_{n} \in R^{d}

, defined by a nonlinear recurrent equation

\begin{matrix} X_{n} = f (X_{n - 1}, ξ_{n}), \end{matrix}

(2.1)

where

f = f (z, v)

is a vector function with entries

f_{1} (z, v), \dots, f_{d} (z, v)

,

z \in R^{d}

,

v \in R^{p}

, and

(ξ_{n})_{n \geq 1}

is an i.i.d. sequence of random vectors of size

p

. We fix the following assumptions.

Assumption 2.1. Entries of

f

are Lipschitz continuous functions in the following sense: for any

v

| f_{i} (z_{1} \dots, z_{j - 1}, z_{j}^{'}, z_{j + 1} \dots, z_{d}, v_{1}, \dots, v_{p}) - f_{i} (z_{1} \dots, z_{j - 1}, z_{j}^{''}, z_{j + 1} \dots, z_{d}, v_{1}, \dots, v_{p}) | \leq ϱ_{i j} | z_{j}^{'} - z_{j}^{''} |, | f (z^{'}, v) - f (z^{''}, v) | \leq ϱ | z^{'} - z^{''} |,

where

\max_{i, j} ϱ_{i j} = ϱ < 1 .

Assumption 2.2. For sufficiently small positive

δ

, Cramer's condition holds:

E e^{δ | ξ_{1} |} < \infty .

Theorem 2.1. Under Assumptions 2.1 and 2.2 , the Markov chain is ergodic with the invariant measure

μ

such that

\int_{R^{d}} | z | μ (d z) < \infty

. For any Lipschitz continuous function

H

, satisfying 1.1 , the family

(S_{n}^{α})_{n \geq 1}

obeys the MDP in the metric space

(R^{d}, r)

(

r

is the Euclidean metric) with the rate of speed

n^{- (2 α - 1)}

and the rate function given in 1.2 .

Remark 1. Notice that:

(i) the assumptions of Theorem 2.1 do not guarantee (EG); (ii) Lipschitz continuous

H

, obeying the linear growth condition, are permissible for the MDP analysis; (iii) a

ξ_{1}

-distribution with a continuous component is not required.

Consider now a linear version of 2.1 :

X_{n} = A X_{n - 1} + ξ_{n},

where

A

is the

d \times d

-matrix with entries

A_{i j}

. Now, Assumption 2.1 reads as:

\max_{i j} | A_{i j} | < 1

. This assumption is too restrictive. We replace it by a more natural one.

Assumption 2.3. The eigenvalues of

A

lie within the unit circle.

Theorem 2.2. Under Assumption 2.3 , the Markov chain is ergodic with the invariant measure

μ

such that

\int_{R^{d}} ∥ z ∥^{2} μ (d z) < \infty

. For any Lipschitz continuous function

H

, satisfying 1.1 , the family

(S_{n}^{α})_{n \geq 1}

obeys the MDP in the metric space

(R^{d}, r)

with the rate of speed

n^{- (2 α - 1)}

and the rate function given in 1.2 .

3 Preliminaries

3.1 (EG)-(H) conditions

To clarify our approach to the MDP analysis, let us first demonstrate its applicability under (EG)-(H) setting.

The (EG) condition provides the geometric ergodicity of

P_{x}^{(n)}

to the invariant measure

μ

uniformly in

x

in the total variation norm: there exist constants

K > 0

and

ϱ \in (0, 1)

such that for any

x \in R^{d}

∥ P_{x}^{(n)} - μ ∥_{t v} \leq K ϱ^{n}, n \geq 1 .

The latter ensures the existence of a bounded function

\begin{matrix} U (x) = H (x) + \sum_{n \geq 1} P_{x}^{(n)} H \end{matrix}

(3.1)

solving the Poisson equation

\begin{matrix} U (x) = H (x) + P_{x} U . \end{matrix}

(3.2)

In view of the Markov property, a sequence

(ζ_{i})_{i \geq 1}

of bounded random vectors with

ζ_{i} : = U (X_{i}) - P_{X_{i - 1}} U

forms a martingale-difference sequence relative to the filtration generated by the Markov chain. Hence,

M_{n} = \sum_{i = 1}^{n} ζ_{i}

is a martingale with bounded increments. With the help of the Poisson equation, we get the following decomposition

\begin{matrix} \frac{1}{n^{α}} \sum_{i = 1}^{n} H (X_{i - 1}) = \underbrace{\frac{1}{n^{α}} [U (x) - U (X_{n})]}_{\text{corrector}} + \frac{1}{n^{α}} M_{n} . \end{matrix}

(3.3)

The boundedness of

U

makes the corrector negligible in the MDP scale, that is, the families

S_{n}^{α}

and

\frac{1}{n^{α}} M_{n}

share the same MDP. In view of that, it suffices to establish the MDP for

(\frac{1}{n^{α}} M_{n})_{n \geq 1} .
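The corrector-plus-martingale decomposition can be verified pathwise on a toy example. The sketch below assumes the scalar linear chain X_{n} = a X_{n - 1} + ξ_{n} with zero mean noise and the unbounded but Lipschitz summand H (x) = x, for which U (x) = x / (1 - a) solves U = H + P_{x} U; this example and the name `check_decomposition` are illustrative and not taken from the paper.

```python
import random


def check_decomposition(a: float, x0: float, n: int, seed: int = 0) -> float:
    """Return |sum_{i=1}^n H(X_{i-1}) - (U(x0) - U(X_n) + M_n)| for H(x) = x."""
    rng = random.Random(seed)

    def U(z: float) -> float:
        # Solves the Poisson equation U = H + P U for H(z) = z (assumed toy case).
        return z / (1.0 - a)

    x = x0
    sum_h = 0.0       # sum of H(X_{i-1})
    martingale = 0.0  # M_n = sum of zeta_i
    for _ in range(n):
        sum_h += x
        x_next = a * x + rng.gauss(0.0, 1.0)
        # zeta_i = U(X_i) - P_{X_{i-1}} U; here P_x U = U(a x) since U is linear and E xi = 0.
        martingale += U(x_next) - U(a * x)
        x = x_next
    corrector = U(x0) - U(x)
    return abs(sum_h - (corrector + martingale))


if __name__ == "__main__":
    print(check_decomposition(0.7, 2.0, 1000))
```

The returned discrepancy is at the level of floating-point rounding, as the identity holds exactly on every path.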

Assume for a moment that

ζ_{i}

's form an i.i.d. sequence of random vectors.

Recall,

E ζ_{1} = 0

and denote

B = E ζ_{1} ζ_{1}^{*}

. Then, the Laplace transform for

\frac{1}{n^{α}} M_{n}

is:

\begin{matrix} E_{n} (λ) = {(E e^{〈 λ, \frac{ζ_{1}}{n^{α}} 〉})}^{n}, λ \in R^{d} . \end{matrix}

(3.4)

Under this setting, it is well known that

\frac{1}{n^{α}} M_{n}

obeys the MDP if

B

is a nonsingular matrix and

\lim_{n \to \infty} n^{2 α - 1} log E_{n} (λ) = \frac{1}{2} 〈 λ, B λ 〉, λ \in R^{d} .
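This limit is easy to observe in the simplest i.i.d. case. The sketch below assumes scalar symmetric increments ζ_{i} = ±1 with probability 1/2 each, so that B = 1 and E exp (λ ζ_{1} / n^{α}) = cosh (λ / n^{α}); the function name `scaled_log_laplace` is hypothetical.

```python
import math


def scaled_log_laplace(lam: float, alpha: float, n: int) -> float:
    """n^(2 alpha - 1) * log E_n(lam) for i.i.d. zeta_i = +-1 with probability 1/2."""
    t = lam / n ** alpha
    # E_n(lam) = cosh(t)^n, hence log E_n(lam) = n * log cosh(t).
    return n ** (2 * alpha - 1) * n * math.log(math.cosh(t))


if __name__ == "__main__":
    for n in (10, 10 ** 3, 10 ** 6):
        # Values approach lam^2 * B / 2 = 2 for lam = 2, B = 1.
        print(n, scaled_log_laplace(2.0, 0.75, n))
```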

We adapt this method of MDP verification to our setting. Instead of

B

, we introduce matrices

B (X_{i - 1}), i \geq 1

with

\begin{matrix} B (x) = P_{x} U U^{*} - P_{x} U {(P_{x} U)}^{*} . \end{matrix}

(3.5)

The homogeneity of Markov chain and the definition of

ζ_{i}

provide a.s. that

E (ζ_{i} ζ_{i}^{*} | X_{i - 1}) = B (X_{i - 1}) .

Instead of the Laplace transform 3.4 , we apply the stochastic exponential 1.5 , expressed via

ζ_{i}

's,

E_{n} (λ) = \prod_{i = 1}^{n} E (e^{〈 λ, \frac{ζ_{i}}{n^{α}} 〉} | X_{i - 1}), λ \in R^{d},

which is not the Laplace transform itself.

The Poisson equation 3.2 and its solution 3.1 allow us to transform 3.5 into

B (x) = H (x) H^{*} (x) + \sum_{n \geq 1} [H (x) {(P_{x}^{(n)} H)}^{*} + (P_{x}^{(n)} H) H^{*} (x)],

that is,

\int_{R^{d}} B (z) μ (d z)

coincides with

B

from 1.3 .

Now we are in a position to formulate the Puhalskii theorem (for more details, see [15] and [?] ). Assume

B

from 1.3 is a nonsingular matrix and, for any

ɛ > 0

and

λ \in R^{d}

,

\begin{matrix} \lim_{n \to \infty} \frac{1}{n^{2 α - 1}} log P (| n^{2 α - 1} log E_{n} (λ) - \frac{1}{2} 〈 λ, B λ 〉 | > ɛ) = - \infty . \end{matrix}

(3.6)

Then, the family

\frac{1}{n^{α}} M_{n}

n \geq 1

possesses the MDP in the metric space

(R^{d}, r)

(

r

is the Euclidean metric) with the rate of speed

n^{- (2 α - 1)}

and rate function

I (y) = \frac{1}{2} ∥ y ∥_{B^{- 1}}^{2} .

Remark 2. The condition 3.6 is verifiable with the help of

\begin{matrix} \lim_{n \to \infty} \frac{1}{n^{2 α - 1}} log P (\frac{1}{n} | \sum_{i = 1}^{n} 〈 λ, [B (X_{i - 1}) - B] λ 〉 | > ɛ) = - \infty , \\ \lim_{n \to \infty} \frac{1}{n^{2 α - 1}} log P (\frac{1}{6 n^{1 + α}} \sum_{i = 1}^{n} E [| ζ_{i} |^{3} e^{n^{- α} | ζ_{i} |} | X_{i - 1}] > ɛ) = - \infty . \end{matrix}

(3.7)

The second condition in 3.7 is implied by the boundedness of

| ζ_{i} |

's. The first part in 3.7 is known as Dembo's conditions, [?] , formulated as follows:

for any

ɛ > 0

λ \in R^{d}

\limsup_{n \to \infty} \frac{1}{n} log P (\frac{1}{n} | \sum_{i = 1}^{n} 〈 λ, [B (X_{i - 1}) - B] λ 〉 | > ɛ) < 0 .

In order to verify the first condition in 3.7 , we apply again the Poisson equation technique. Set

h (x) = 〈 λ, [B (x) - B] λ 〉

and notice that

\int_{R^{d}} h (z) μ (d z) = 0 .

Then, the function

u (x) = h (x) + \sum_{n \geq 1} P_{x}^{(n)} h

is well defined and solves the Poisson equation

u (x) = h (x) + P_{x} u .

Similarly to 3.3 , we have

\frac{1}{n} \sum_{i = 1}^{n} h (X_{i - 1}) = \frac{u (x) - u (X_{n})}{n} + \frac{m_{n}}{n},

where

m_{n} = \sum_{i = 1}^{n} z_{i}

is a martingale with bounded martingale differences

(z_{i})_{i \geq 1}

. Since

u

is bounded, the first condition in 3.7 is reduced to

\begin{matrix} {lim}_{n \to \infty} \frac{1}{n^{2 α - 1}} log P (| m_{n} | > n ɛ) = - \infty \end{matrix}

(3.8)

and 3.8 follows from Theorem A.1 in the Appendix, which states that 3.8 holds for any martingale with bounded increments.

3.1.1 Singular $B$

The conditions from 3.7 remain valid whether

B

is nonsingular or singular. For singular

B

the Puhalskii theorem is no longer valid. With singular

B

, we use the Puhalskii theorem as an auxiliary tool. It is well known that the family

\frac{M_{n}}{n^{α}}

n \geq 1

obeys the MDP with the rate of speed

n^{- (2 α - 1)}

and some rate function, say

I (y)

provided that

\begin{matrix} \lim_{C \to \infty} \limsup_{n \to \infty} \frac{1}{n^{2 α - 1}} log P (∥ \frac{M_{n}}{n^{α}} ∥ > C) = - \infty \\ \limsup_{ɛ \to 0} \limsup_{n \to \infty} \frac{1}{n^{2 α - 1}} log P (∥ \frac{M_{n}}{n^{α}} - y ∥ \leq ɛ) \leq - I (y) \\ \liminf_{ɛ \to 0} \liminf_{n \to \infty} \frac{1}{n^{2 α - 1}} log P (∥ \frac{M_{n}}{n^{α}} - y ∥ \leq ɛ) \geq - I (y) . \end{matrix}

(3.9)

The first condition in 3.9 provides the exponential tightness in the metric

r

, while the other two provide the local MDP. To verify 3.9 , we introduce the “regularized” family

\frac{M_{n}^{β}}{n^{α}}, n \geq 1

with

M_{n}^{β} = M_{n} + \sqrt{β} \sum_{i = 1}^{n} ϑ_{i},

where

β

is a positive parameter and

(ϑ_{i})_{i \geq 1}

is a sequence of zero mean i.i.d.

Gaussian random vectors with

cov (ϑ_{1}, ϑ_{1}) = : I

(

I

is the unit matrix). The Markov chain and

(ϑ_{i})_{i \geq 1}

are assumed to be independent objects.

It is clear that for this setting the matrix

B

is transformed into a positive definite matrix

B_{β} = B + β I

. Now, the Puhalskii theorem is applicable and guarantees the MDP with the same rate of speed and the rate function

I_{β} (y) = \frac{1}{2} ∥ y ∥_{B_{β}^{- 1}}^{2} .

We now use the well-known fact (see, e.g., Puhalskii [14] ) that the MDP implies exponential tightness and the local MDP:

\begin{matrix} \lim_{C \to \infty} \limsup_{n \to \infty} \frac{1}{n^{2 α - 1}} log P (∥ \frac{M_{n}^{β}}{n^{α}} ∥ > C) = - \infty \\ \limsup_{ɛ \to 0} \limsup_{n \to \infty} \frac{1}{n^{2 α - 1}} log P (∥ \frac{M_{n}^{β}}{n^{α}} - y ∥ \leq ɛ) \leq - I_{β} (y) \\ \liminf_{ɛ \to 0} \liminf_{n \to \infty} \frac{1}{n^{2 α - 1}} log P (∥ \frac{M_{n}^{β}}{n^{α}} - y ∥ \leq ɛ) \geq - I_{β} (y) . \end{matrix}

(3.10)

Notice now that 3.9 is implied by 3.10 if

\begin{matrix} \lim_{β \to 0} I_{β} (y) = \begin{cases} \frac{1}{2} ∥ y ∥_{B^{\oplus}}^{2}, & B^{\oplus} B y = y \\ \infty, & \text{otherwise} \end{cases} \end{matrix}

(3.11)

and

\begin{matrix} \lim_{β \to 0} \limsup_{n \to \infty} \frac{1}{n^{2 α - 1}} log P (∥ \frac{\sqrt{β}}{n^{α}} \sum_{i = 1}^{n} ϑ_{i} ∥ > η) = - \infty, \forall η > 0 . \end{matrix}

(3.12)

Let

T

be an orthogonal matrix transforming

B

to a diagonal form:

diag (B) = T^{*} B T .

Then, owing to

2 I_{β} (y) = y^{*} (β I + B)^{- 1} y = y^{*} T (β I + diag (B))^{- 1} T^{*} y,

for

y = B^{\oplus} B y

we have (recall that

B^{\oplus} B B^{\oplus} = B^{\oplus}

, see [1] )

\begin{matrix} 2 I_{β} (y) & = y^{*} B^{\oplus} B T (β I + diag (B))^{- 1} T^{*} y \end{matrix}

\begin{matrix} = y^{*} B^{\oplus} T T^{*} B T (β I + diag (B))^{- 1} T^{*} y \end{matrix}

\begin{matrix} = y^{*} B^{\oplus} T diag (B) (β I + diag (B))^{- 1} T^{*} y \end{matrix}

\begin{matrix} \xrightarrow[β \to 0]{} y^{*} B^{\oplus} T diag (B) (diag (B))^{\oplus} T^{*} y \end{matrix}

\begin{matrix} = y^{*} B^{\oplus} T diag (B) T^{*} T (diag (B))^{\oplus} T^{*} y \end{matrix}

\begin{matrix} = y^{*} B^{\oplus} B B^{\oplus} y = y^{*} B^{\oplus} y = ∥ y ∥_{B^{\oplus}}^{2} = 2 I (y) . \end{matrix}

For

y \neq B^{\oplus} B y

,

\lim_{β \to 0} 2 I_{β} (y) = \infty .

Thus, 3.11 holds true.
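The two regimes in 3.11 can be seen on a small singular example. The sketch below assumes the 2 × 2 matrix B with all entries equal to 1, whose pseudoinverse is B / 4, so that I (y) = 1 / 2 for y = (1, 1) in the range of B, while y = (1, - 1) lies outside it; the matrix and the name `rate_beta` are illustrative choices.

```python
def rate_beta(y, beta):
    """I_beta(y) = 0.5 * y^T (B + beta * I)^{-1} y for the singular B = [[1, 1], [1, 1]]."""
    a, b, c, d = 1.0 + beta, 1.0, 1.0, 1.0 + beta
    det = a * d - b * c
    # Explicit inverse of the 2 x 2 matrix [[a, b], [c, d]].
    inv = ((d / det, -b / det), (-c / det, a / det))
    y1, y2 = y
    quad = y1 * (inv[0][0] * y1 + inv[0][1] * y2) + y2 * (inv[1][0] * y1 + inv[1][1] * y2)
    return 0.5 * quad


if __name__ == "__main__":
    for beta in (1.0, 1e-2, 1e-4, 1e-6):
        # First column tends to 1/2, second column blows up like 1 / beta.
        print(beta, rate_beta((1.0, 1.0), beta), rate_beta((1.0, -1.0), beta))
```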

Since

(ϑ_{i})_{i \geq 1}

is an i.i.d. sequence of random vectors and the entries of

ϑ_{1}

are i.i.d.

(0, 1)

-Gaussian random variables, the verification of 3.12 is reduced to

\begin{matrix} \lim_{β \to 0} \limsup_{n \to \infty} \frac{1}{n^{2 α - 1}} log P (| \sum_{i = 1}^{n} ξ_{i} | > \frac{n^{α} η}{\sqrt{β}}) = - \infty, \end{matrix}

(3.13)

where

(ξ_{i})_{i \geq 1}

is a sequence of i.i.d.

(0, 1)

-Gaussian random variables, and it suffices to consider the case “+” only. By the Chernoff inequality with

λ > 0

, we find that

P (\sum_{i = 1}^{n} ξ_{i} > \frac{n^{α} η}{\sqrt{β}}) \leq exp (- λ \frac{n^{α} η}{\sqrt{β}} + n \frac{λ^{2}}{2})

while the choice of

λ = \frac{n^{α} η}{n \sqrt{β}}

provides

\frac{1}{n^{2 α - 1}} log P (\sum_{i = 1}^{n} ξ_{i} > \frac{n^{α} η}{\sqrt{β}}) \leq - \frac{η^{2}}{2 β} \xrightarrow[β \to 0]{} - \infty .

3.2 Virtual scenario

(EG)-(H) are not assumed; the ergodicity of the Markov chain is checked;

H

is chosen to satisfy 1.1 .

(1) Let 3.1 hold. Hence, the function

U

solves the Poisson equation and the decomposition from 3.3 is valid with

M_{n} = \sum_{i = 1}^{n} ζ_{i}

, where

ζ_{i} = U (X_{i}) - P_{X_{i - 1}} U .

Let

\begin{matrix} E ζ_{i}^{*} ζ_{i} \leq c o n s t . \end{matrix}

\begin{matrix} E [| ζ_{i} |^{3} e^{n^{- α} | ζ_{i} |} | X_{i - 1}] \leq c o n s t . \end{matrix}

(2) With

B (x)

and

B

defined in 3.5 and 1.3 respectively, set

h (x) = 〈 λ, [B (x) - B] λ 〉, λ \in R^{d} .

Let (i)

u (x) = h (x) + \sum_{n \geq 1} P_{x}^{(n)} h

be well defined and (ii) for

z_{i} = u (X_{i}) - P_{X_{i - 1}} u

\begin{matrix} E z_{i}^{2} \leq c o n s t . \end{matrix}

\begin{matrix} E [| z_{i} |^{3} e^{n^{- α} | z_{i} |} | X_{i - 1}] \leq c o n s t . \end{matrix}

(3) For any

ɛ > 0

, let

\begin{matrix} {lim}_{n \to \infty} \frac{1}{n^{2 α - 1}} log P (| U (X_{n}) | > n^{α} ɛ) = - \infty \end{matrix}

\begin{matrix} {lim}_{n \to \infty} \frac{1}{n^{2 α - 1}} log P (| u (X_{n}) | > n^{α} ɛ) = - \infty . \end{matrix}

Notice that (EG)-(H) imply (1)-(3), and even if (EG)-(H) fail, (1)-(3) may still hold. Moreover, (1)-(3) guarantee the validity of all steps of the proof given in Section 3.1 .

Thus, an ergodic Markov chain, possessing (1)-(3), obeys the MDP. The proof of Theorems 2.1 and 2.2 follows this scenario.

4 The proof of Theorem 2.1

4.1 Ergodic property

Lemma 4.1. Under Assumption 2.1 ,

(X_{n})_{n \geq 0}

possesses the unique probability invariant measure

μ

with

\int_{R^{d}} | z | μ (d z) < \infty .

Proof. Let

ν

be a probability measure on

R^{d}

with

\int_{R^{d}} | x | ν (d x) < \infty

and let a random vector

X_{0}

, distributed according to

ν

, be independent of

(ξ_{n})_{n \geq 1}

. We initialize the recursion, given in 2.1 , by

X_{0}

. Let

X_{n}

be generated by 2.1 . Then,

μ^{n} (d z) = \int_{R^{d}} P_{x}^{(n)} (d z) ν (d x)

defines the distribution of

X_{n}

. We show that the family

(μ^{n})_{n \geq 1}

is tight in the Levy-Prohorov metric:

\lim_{k \to \infty} \limsup_{n \to \infty} μ^{n} (| z | > k) = 0 .

By the Chebyshev inequality,

μ^{n} (| z | > k) \leq \frac{E | X_{n} |}{k}

. The tightness follows from

{sup}_{n \geq 1} E | X_{n} | < \infty .

Further, since, by Assumption 2.1 ,

\begin{matrix} | X_{n} | & = | f (0, ξ_{n}) + (f (X_{n - 1}, ξ_{n}) - f (0, ξ_{n})) | \end{matrix}

\begin{matrix} \leq | f (0, ξ_{n}) | + | f (X_{n - 1}, ξ_{n}) - f (0, ξ_{n}) | \end{matrix}

\begin{matrix} \leq | f (0, ξ_{n}) | + ϱ | X_{n - 1} | \end{matrix}

\begin{matrix} \leq | f (0, 0) | + ℓ | ξ_{n} | + ϱ | X_{n - 1} |, \end{matrix}

the sequence

(E | X_{n} |)_{n \geq 1}

solves a recurrent inequality

E | X_{n} | \leq | f (0, 0) | + ℓ E | ξ_{1} | + ϱ E | X_{n - 1} |

subject to

E | X_{0} | = \int_{R^{d}} | x | ν (d x) (< \infty)

. Hence, we find that for any

n \geq 1

E | X_{n} | \leq E | X_{0} | + \frac{| f (0, 0) | + ℓ E | ξ_{1} |}{1 - ϱ} .

Thus, the family

{μ^{n}}

is tight, so that, by the Prohorov theorem,

{μ^{n}}

contains a subsequence

{μ^{n^{'}}}

converging, as

n^{'} ↗ \infty

, in the Levy-Prohorov metric to a limit

μ

being a probability measure on

R^{d}

: for any bounded and continuous function

g

on

R^{d}

,

{lim}_{n^{'} \to \infty} \int_{R^{d}} g (z) μ^{n^{'}} (d z) = \int_{R^{d}} g (z) μ (d z) .

Hence, for

g (z) = L \land | z |

and

L > 0

, we have

\int_{R^{d}} (L \land | z |) μ (d z) = \lim_{n^{'} \to \infty} E (L \land | X_{n^{'}} |) \leq \limsup_{n \to \infty} E | X_{n} | < \infty

and, by the monotone convergence theorem,

\int_{R^{d}} | z | μ (d z) \leq \limsup_{n \to \infty} E | X_{n} | < \infty .

The measure

μ

is now regarded as a candidate for the unique invariant measure.

So, we shall verify

\int_{R^{d}} g (x) μ (d x) = \int_{R^{d}} P_{x} g μ (d x)

for any nonnegative, bounded and continuous function

g

. For notational convenience, write

X_{n}^{x}

and

X_{n}^{ν}

, if

X_{0} = x

and

X_{0}

is distributed according to

ν

. By Assumption 2.1 ,

| X_{n}^{x} - X_{n}^{ν} | \leq ϱ | X_{n - 1}^{x} - X_{n - 1}^{ν} |, n \geq 1,

that is,

| X_{n}^{x} - X_{n}^{ν} |

converges to zero exponentially fast as

n \to \infty

. For any

x \in R^{d}

, the latter provides

{lim}_{n^{'} \to \infty} E g (X_{n^{'}}^{x}) = \int_{R^{d}} g (x) μ (d x) .

Since the Markov chain is homogeneous, we also find that

{lim}_{n^{'} \to \infty} E g (X_{n^{'} + 1}^{x}) = \int_{R^{d}} g (z) μ (d z) .

On the other hand, owing to

E g (X_{n^{'} + 1}^{x}) = E P_{X_{n^{'}}^{x}} g

, the above relation is nothing but

{lim}_{n^{'} \to \infty} E P_{X_{n^{'}}^{x}} g = \int_{R^{d}} g (z) μ (d z) .

Finally, owing to

P_{x} g = E g (f (x, ξ_{1}))

, the function

P_{x} g

of argument

x

is bounded and continuous. Consequently,

{lim}_{n^{'} \to \infty} E P_{X_{n^{'}}^{x}} g = \int_{R^{d}} P_{x} g μ (d x) .

Assume

μ^{'}

is another invariant probability measure,

μ^{'} \neq μ

. Then, taking

X_{0}^{μ}

and

X_{0}^{μ^{'}}

, distributed according to

μ

and

μ^{'}

respectively and independent of

(ξ_{n})_{n \geq 1}

, we get two stationary Markov chains

(X_{n}^{μ})

and

(X_{n}^{μ^{'}})

defined on the same probability space as:

\begin{matrix} X_{n}^{μ} = f (X_{n - 1}^{μ}, ξ_{n}) \end{matrix}

\begin{matrix} X_{n}^{μ^{'}} = f (X_{n - 1}^{μ^{'}}, ξ_{n}) . \end{matrix}

By Assumption 2.1 ,

| X_{n}^{μ} - X_{n}^{μ^{'}} | \leq ϱ | X_{n - 1}^{μ} - X_{n - 1}^{μ^{'}} |

, i.e.

{lim}_{n \to \infty} | X_{n}^{μ} - X_{n}^{μ^{'}} | = 0

. Recall that both processes

X_{n}^{μ}

and

X_{n}^{μ^{'}}

are stationary with the marginal distributions

μ

and

μ^{'}

respectively. Hence, for any bounded and continuous function

g : R^{d} \to R

| \int_{R^{d}} g (x) μ (d x) - \int_{R^{d}} g (x) μ^{'} (d x) | \leq E | g (X_{n}^{μ}) - g (X_{n}^{μ^{'}}) | \xrightarrow[n \to \infty]{} 0,

that is,

μ = μ^{'}

. □
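The coupling argument used twice in this proof (two copies of the chain driven by the same noise contract geometrically) can be illustrated as follows. This is a minimal sketch, assuming d = 1, the map f (z, v) = ϱ tanh (z) + v, which is ϱ-Lipschitz in z, and purely discrete noise ξ_{n} = ±1; these choices and the name `coupled_gaps` are hypothetical and only serve the illustration.

```python
import math
import random


def coupled_gaps(x1: float, x2: float, n_steps: int, rho: float = 0.5, seed: int = 1):
    """Gaps |X_n' - X_n''| for two chains started at x1, x2 and driven by the same noise."""
    rng = random.Random(seed)
    gaps = [abs(x1 - x2)]
    for _ in range(n_steps):
        xi = rng.choice((-1.0, 1.0))  # discrete noise: no continuous component is needed
        x1 = rho * math.tanh(x1) + xi
        x2 = rho * math.tanh(x2) + xi
        gaps.append(abs(x1 - x2))
    return gaps


if __name__ == "__main__":
    for k, gap in enumerate(coupled_gaps(5.0, -3.0, 10)):
        # The gap never exceeds rho^k times the initial gap.
        print(k, gap, 0.5 ** k * 8.0)
```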

4.2 The verification of (1)

Let

K

be the Lipschitz constant for

H

. Then

| H (x) | \leq | H (0) | + K | x |

and

\int_{R^{d}} | H (z) | μ (d z) < \infty .

By 1.1 ,

E H (X_{n}^{μ}) \equiv 0

. Then,

\begin{matrix} | E H (X_{n}^{x}) | & = | E (H (X_{n}^{x}) - H (X_{n}^{μ})) | \end{matrix}

\begin{matrix} \leq K ϱ^{n} E | x - X_{0}^{μ} | \leq K (1 + | x |) ϱ^{n} . \end{matrix}

Therefore,

\sum_{n \geq 1} | E H (X_{n}^{x}) | \leq \frac{K}{1 - ϱ} (1 + | x |)

. Consequently, the function

U (x)

, given in 3.1 , is well defined and solves the Poisson equation.

Recall that

ζ_{i} = U (X_{i}) - P_{X_{i - 1}} U .

Lemma 4.2. The function

U (x)

possesses the following properties: 1)

U (x)

is Lipschitz continuous; 2)

P_{x} (U U^{*}) - P_{x} U (P_{x} U)^{*}

is bounded and Lipschitz continuous; 3) For sufficiently small

δ > 0

and any

i \geq 1

E (| U (X_{i}) - P_{X_{i - 1}} U |^{3} e^{δ | U (X_{i}) - P_{X_{i - 1}} U |} | X_{i - 1}) \leq const.

Proof. 1) Since by Assumption 2.1 ,

| X_{n}^{x^{'}} - X_{n}^{x^{''}} | \leq ϱ | X_{n - 1}^{x^{'}} - X_{n - 1}^{x^{''}} |, | X_{0}^{x^{'}} - X_{0}^{x^{''}} | \leq | x^{'} - x^{''} |,

we have

\begin{matrix} \begin{matrix} | U (x^{'}) - U (x^{''}) | & \leq | H (x^{'}) - H (x^{''}) | + \sum_{n \geq 1} E | H (X_{n}^{x^{'}}) - H (X_{n}^{x^{''}}) | \\ \leq \frac{K}{1 - ϱ} | x^{'} - x^{''} | . \end{matrix} \end{matrix}

(4.1)

2) Recall (see 3.5 )

P_{x} (U U^{*}) - P_{x} U (P_{x} U)^{*} = B (x)

and denote by

B_{p q} (x)

p, q = 1, \dots, d

the entries of matrix

B (x)

. Also, denote by

U_{p} (x)

p = 1, \dots, d

the entries of

U (x)

. Since

B (x)

is a nonnegative definite matrix, it suffices to show that the

B_{p p} (x)

's are bounded functions. Denote by

F (z)

the distribution function of

ξ_{1}

. Taking into account 4.1 and Assumption 2.1 , we get

\begin{matrix} B_{p p} (x) & = E {(U_{p} (f (x, ξ_{1})) - \int_{R^{d}} U_{p} (f (x, z)) d F (z))}^{2} \end{matrix}

\begin{matrix} \leq \frac{(K ℓ)^{2}}{(1 - ϱ)^{2}} E | \int_{R^{d}} | ξ_{1} - z | d F (z) |^{2} \leq 4 \frac{(K ℓ)^{2}}{(1 - ϱ)^{2}} E | ξ_{1} |^{2} < \infty . \end{matrix}

The Lipschitz continuity of

B_{p q} (x)

is proved similarly. Write

B_{p q} (x^{'}) - B_{p q} (x^{''}) = : a b - c d,

where

\begin{matrix} a = E (U_{p} (f (x^{'}, ξ_{1})) - \int_{R^{d}} U_{p} (f (x^{'}, z)) d F (z)) \end{matrix}

\begin{matrix} b = E (U_{q} (f (x^{'}, ξ_{1})) - \int_{R^{d}} U_{q} (f (x^{'}, z)) d F (z)) \end{matrix}

\begin{matrix} c = E (U_{p} (f (x^{''}, ξ_{1})) - \int_{R^{d}} U_{p} (f (x^{''}, z)) d F (z)) \end{matrix}

\begin{matrix} d = E (U_{q} (f (x^{''}, ξ_{1})) - \int_{R^{d}} U_{q} (f (x^{''}, z)) d F (z)) . \end{matrix}

Now, applying

a b - c d = a (b - d) + d (a - c)

and taking into account 4.1 and Assumption 2.1 , we find that

| a |, | d | \leq \frac{2 K ℓ}{1 - ϱ} E | ξ_{1} |

and so

| B_{p q} (x^{'}) - B_{p q} (x^{''}) | \leq \frac{4 K^{2} ℓ ϱ}{(1 - ϱ)^{2}} E | ξ_{1} | | x^{'} - x^{''} | .

3) By 4.1 and Assumption 2.1

| U (X_{i}) - P_{X_{i - 1}} U | \leq \frac{K ℓ}{1 - ϱ} (E | ξ_{1} | + | ξ_{i} |) .

□

4.3 The verification of (2)

Since

B (x)

is bounded and Lipschitz continuous, so is

h (x) = 〈 λ, [B (x) - B] λ 〉 .

Hence, (2) follows from (1).

4.4 The verification of (3)

Since

U

and

u

are Lipschitz continuous, they satisfy the linear growth condition, e.g.,

| U (x) | \leq C (1 + | x |), \exists C > 0 .

So, (3) is reduced to the verification of

\begin{matrix} {lim}_{n \to \infty} \frac{1}{n^{2 α - 1}} log P (| X_{n} | > ɛ n^{α}) = - \infty, ɛ > 0 . \end{matrix}

(4.2)

Due to Assumption 2.1 , we have

\begin{matrix} | X_{n} | & \leq | f (X_{n - 1}, ξ_{n}) | \leq | f (0, ξ_{n}) | + ϱ | X_{n - 1} | \end{matrix}

\begin{matrix} \leq | f (0, 0) | + ϱ | X_{n - 1} | + ℓ | ξ_{n} | . \end{matrix}

Iterating this inequality with

X_{0} = x

we obtain

\begin{matrix} | X_{n} | & \leq ϱ^{n} | x | + | f (0, 0) | \sum_{j = 1}^{n} ϱ^{n - j} + ℓ \sum_{j = 1}^{n} ϱ^{n - j} | ξ_{j} | \end{matrix}

\begin{matrix} \leq | x | + \frac{| f (0, 0) |}{1 - ϱ} + ℓ \sum_{j = 0}^{n - 1} ϱ^{j} | ξ_{n - j} | . \end{matrix}

Hence, 4.2 is reduced to

\begin{matrix} {lim}_{n \to \infty} \frac{1}{n^{2 α - 1}} log P (\sum_{j = 0}^{n - 1} ϱ^{j} | ξ_{n - j} | \geq n^{α} ɛ) = - \infty . \end{matrix}

(4.3)

We verify 4.3 with the help of Chernoff's inequality: with

δ

from Assumption 2.2 and

γ = δ (1 - ϱ) ,

\begin{matrix} P (\sum_{j = 0}^{n - 1} ϱ^{j} | ξ_{n - j} | \geq n^{α} ɛ) \leq e^{- n^{α} γ ɛ} E e^{\sum_{j = 0}^{n - 1} γ ϱ^{j} | ξ_{n - j} |} . \end{matrix}

The i.i.d. property of the

ξ_{j}

's and Jensen's inequality (note that

(1 - ϱ) ϱ^{j} \leq 1

and

\sum_{j = 0}^{n - 1} (1 - ϱ) ϱ^{j} \leq 1

) provide

E e^{\sum_{j = 0}^{n - 1} γ ϱ^{j} | ξ_{n - j} |} = \prod_{j = 0}^{n - 1} E e^{γ ϱ^{j} | ξ_{1} |} \leq \prod_{j = 0}^{n - 1} (E e^{δ | ξ_{1} |})^{(1 - ϱ) ϱ^{j}} \leq E e^{δ | ξ_{1} |} < \infty

and we get

\frac{1}{n^{2 α - 1}} log P (\sum_{j = 0}^{n - 1} ϱ^{j} | ξ_{n - j} | \geq n^{α} ɛ) \leq - n^{1 - α} γ ɛ + \frac{log E e^{δ | ξ_{1} |}}{n^{2 α - 1}} \xrightarrow[n \to \infty]{} - \infty .

5 The proof of Theorem 2.2

The proof of this theorem differs from the proof of Theorem 2.1 only in some details concerning (L.1). So, only these parts of the proof are given below.

5.1 Ergodic property and invariant measure

Introduce

({\tilde{ξ}}_{n})_{n \geq 1}

, an independent copy of

(ξ_{n})_{n \geq 1}

. Owing to

X_{n} = A^{n} x + \sum_{i = 1}^{n} A^{n - i} ξ_{i} = A^{n} x + \sum_{i = 0}^{n - 1} A^{i} ξ_{n - i},

we introduce

\begin{matrix} {\tilde{X}}_{n} = A^{n} x + \sum_{i = 0}^{n - 1} A^{i} {\tilde{ξ}}_{i} \end{matrix}

(5.1)

and notice that the i.i.d. property of

(ξ_{i})_{i \geq 1}

provides

(X_{n})_{n \geq 0} \overset{l a w}{=} ({\tilde{X}}_{n})_{n \geq 0} .

By Assumption 2.3 ,

A^{n} \to 0

as

n \to \infty

exponentially fast. In particular,

\sum_{i = 0}^{\infty} trace (A^{i} cov (ξ_{1}, ξ_{1}) (A^{i})^{*}) < \infty,

so that

{lim}_{n \to \infty} {\tilde{X}}_{n} = \sum_{i = 0}^{\infty} A^{i} {\tilde{ξ}}_{i}

a.s. and in

L^{2}

norm.

Thus, the invariant measure

μ

is generated by the distribution function of

{\tilde{X}}_{\infty}

. In addition,

E ∥ {\tilde{X}}_{\infty} ∥^{2} = \sum_{i = 0}^{\infty} trace (A^{i} cov (ξ_{1}, ξ_{1}) (A^{i})^{*})

, so that

\int_{R^{d}} ∥ z ∥^{2} μ (d z) < \infty .

5.2 The verification of (1) and (2)

Due to

(X_{n}^{x^{'}} - X_{n}^{x^{''}}) = A (X_{n - 1}^{x^{'}} - X_{n - 1}^{x^{''}}),

we have

(X_{n}^{x^{'}} - X_{n}^{x^{''}}) = A^{n} (x^{'} - x^{''})

. Let us transform the matrix

A

into a Jordan form

A = T J T^{- 1}

and notice that

A^{n} = T J^{n} T^{- 1}

. It is well known that the maximal absolute value of the entries of

J^{n}

is at most of order

n^{d - 1} | λ |^{n}

, where

| λ |

is the maximal absolute value among eigenvalues of

A

. By Assumption 2.3 ,

| λ | < 1

. So, there exist

K > 0

and

ϱ < 1

such that

| λ | < ϱ

. Then, the entries

A_{p q}^{n}

of

A^{n}

are estimated as:

| A_{p q}^{n} | \leq K ϱ^{n}

. Hence,

| X_{n}^{x^{'}} - X_{n}^{x^{''}} | \leq K ϱ^{n} | x^{'} - x^{''} |

n \geq 1

, and the verification of (1), (2) is in the framework of Section 3 .

5.3 The verification of (3)

As in Section 3 , the verification of this property is reduced to

\begin{matrix} {lim}_{n \to \infty} \frac{1}{n^{2 α - 1}} log P (| X_{n} | > ɛ n^{α}) = - \infty, ɛ > 0 . \end{matrix}

(5.2)

In 5.2 , we may replace

X_{n}

by its copy

{\tilde{X}}_{n}

defined in 5.1 . Notice also that

| {\tilde{X}}_{n} | \leq | A^{n} x | + \sum_{i = 0}^{\infty} \max_{p q} | A_{p q}^{i} | | {\tilde{ξ}}_{i} | .

As was mentioned above,

| A_{p q}^{i} | \leq K ϱ^{i}

for some

K > 0

and

ϱ \in (0, 1)

. Hence, it suffices to verify

\lim_{n \to \infty} \frac{1}{n^{2 α - 1}} log P (\sum_{i = 0}^{\infty} ϱ^{i} | ξ_{i} | > ɛ n^{α}) = - \infty, ɛ > 0 ,

which is done similarly to the corresponding part of the proof in Section 3 .

6 Exotic example

Let

(X_{n})_{n \geq 0}

X_{n} \in R

and

X_{0} = x

, be a Markov chain defined by the recurrent equation

\begin{matrix} X_{n} = X_{n - 1} - m \frac{X_{n - 1}}{| X_{n - 1} |} + ξ_{n}, \end{matrix}

(6.1)

where

m

is a positive parameter,

(ξ_{n})

is an i.i.d. sequence of zero mean random variables with

E e^{δ | ξ_{1} |} < \infty, for some δ > 0,

and let

\frac{0}{0} = 0

. Although the virtual scenario is not completely verifiable here, we show that for

H (x) = \frac{x}{| x |}

the family

(S_{n}^{α})_{n \geq 1}

possesses the MDP provided that

\begin{matrix} m > \frac{1}{δ} log E e^{δ | ξ_{1} |} . \end{matrix}

(6.2)

Indeed, by 6.1 we have

\frac{1}{n^{α}} \sum_{k = 1}^{n} \frac{X_{k - 1}}{| X_{k - 1} |} = \frac{1}{m} \frac{(x - X_{n})}{n^{α}} + \frac{1}{n^{α}} \sum_{k = 1}^{n} \frac{ξ_{k}}{m} .

The family

{(\frac{1}{n^{α}} \sum_{k = 1}^{n} \frac{ξ_{k}}{m})}_{n \geq 1}

possesses the MDP with the rate of speed

n^{- (2 α - 1)}

and the rate function

I (y) = \frac{m^{2}}{2 E ξ_{1}^{2}} y^{2}

. Then, the family

(S_{n}^{α})_{n \geq 1}

obeys the same MDP provided that

{(\frac{X_{n} - x}{n^{α}})}_{n \geq 1}

is an exponentially negligible family with the rate

n^{- (2 α - 1)}

. This verification is reduced to

\begin{matrix} {lim}_{n \to \infty} \frac{1}{n^{2 α - 1}} log P (| X_{n} | > n^{α} ɛ) = - \infty, ɛ > 0 . \end{matrix}

(6.3)

By the Chernoff inequality

P (| X_{n} | > n^{α} ɛ) \leq e^{- δ n^{α} ɛ} E e^{δ | X_{n} |},

that is, 6.3 holds if

{sup}_{n \geq 1} E e^{δ | X_{n} |} < \infty

for some

δ > 0

. We show that the latter holds true for

δ

involved in 6.2 . A helpful tool for this verification is the inequality

| z - m \frac{z}{| z |} | \leq | | z | - m |

. Write

\begin{matrix} E e^{δ | X_{n} |} & = E e^{δ | X_{n} |} I (| X_{n - 1} | \leq m) + E e^{δ | X_{n} |} I (| X_{n - 1} | > m) \end{matrix}

\begin{matrix} \leq e^{δ m} E e^{δ | ξ_{1} |} + e^{- δ m} E e^{δ | ξ_{1} |} E e^{δ | X_{n - 1} |} . \end{matrix}

Set

ℓ = e^{δ m} E e^{δ | ξ_{1} |}

and

ϱ = e^{- δ m} E e^{δ | ξ_{1} |}

. By 6.2 ,

ϱ < 1

. Hence,

V (x) = e^{δ | x |}

is a Lyapunov function:

P_{x} V \leq ϱ V (x) + ℓ .

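Consequently,

E V (X_{n}) \leq ϱ E V (X_{n - 1}) + ℓ, n \geq 1

and so, by iterating this inequality,

{sup}_{n \geq 1} E V (X_{n}) \leq {sup}_{n \geq 1} (ϱ^{n} V (x) + ℓ \sum_{k = 0}^{n - 1} ϱ^{k}) \leq V (x) + \frac{ℓ}{1 - ϱ} < \infty .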
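As a concrete illustration of condition 6.2 and of the resulting bound, the sketch below takes Laplace noise with scale b (an assumption made only for this example). Then | ξ_{1} | is exponential with mean b, so E e^{δ | ξ_{1} |} = 1 / (1 - δ b) for δ < 1 / b. The function names are hypothetical. The code computes ϱ and ℓ, checks 6.2, and iterates the worst case of the recursion v_{n} = ϱ v_{n-1} + ℓ started from V (x).

```python
import math

def drift_threshold(delta, b):
    """Right-hand side of (6.2) for Laplace(b) noise: (1/delta) * log E exp(delta*|xi_1|)."""
    return -math.log(1.0 - delta * b) / delta

def lyapunov_constants(delta, m, b):
    """Return (rho, ell) for V(x) = exp(delta*|x|) under Laplace(b) noise."""
    if not 0 < delta * b < 1:
        raise ValueError("need 0 < delta < 1/b for a finite exponential moment")
    mgf = 1.0 / (1.0 - delta * b)       # E exp(delta*|xi_1|)
    rho = math.exp(-delta * m) * mgf
    ell = math.exp(delta * m) * mgf
    return rho, ell

delta, b, m, x = 0.5, 1.0, 2.0, 3.0     # illustrative values only
rho, ell = lyapunov_constants(delta, m, b)
bound = math.exp(delta * abs(x)) + ell / (1.0 - rho)

# Worst case of the recursion v_n = rho * v_{n-1} + ell with v_0 = V(x).
v = math.exp(delta * abs(x))
worst = v
for _ in range(200):
    v = rho * v + ell
    worst = max(worst, v)

print(m > drift_threshold(delta, b), rho < 1, worst <= bound)
```

For Laplace noise the threshold -log(1 - δ b) / δ increases with δ and tends to b = E | ξ_{1} | as δ tends to 0, so in this example a smaller δ admits a smaller drift m.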

7 Statistical example

The asymptotic analysis given in this section demonstrates the thesis “MDP instead of CLT”. Let

X_{n} = θ f (X_{n - 1}) + ξ_{n},

where

θ

is a number and

(ξ_{n})_{n \geq 1}

is an i.i.d. sequence of

(0, 1)

-Gaussian random variables. We assume that

| θ | < 1

and

f

is a bounded continuously differentiable function with

| f^{'} (x) | \leq 1

. By Theorem 2.1 ,

(X_{n})

is an ergodic Markov chain and its invariant measure

μ_{θ}

depends on parameter

θ

. Since

ξ_{1}

is a Gaussian random variable,

μ_{θ}

, being a convolution of some measure with a Gaussian one, possesses a density relative to

d z

. Then, assuming

f^{2} (x) > 0

relative to Lebesgue measure, we have

B_{θ} = \int_{R} f^{2} (z) μ_{θ} (d z) > 0 .

Under the above assumptions,

θ_{n} = \frac{\sum_{i = 1}^{n} f (X_{i - 1}) X_{i}}{\sum_{i = 1}^{n} f^{2} (X_{i - 1})}

is a strongly consistent estimate of

θ

by sampling

{X_{1}, \dots, X_{n}}

, that is,

{lim}_{n \to \infty} θ_{n} = θ

a.s. Moreover, its asymptotics in the CLT scale is known:

\sqrt{n} (θ - θ_{n}) \xrightarrow[n \to \infty]{law} (0, \frac{1}{B_{θ}})\text{-Gaussian r.v.}
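The estimator is easy to try out on simulated data. The sketch below assumes f = tanh (bounded, continuously differentiable, with | f' | \leq 1), θ = 0.5 and X_{0} = 0; these choices and the function name are illustrative only. It also checks the algebraic identity θ_{n} - θ = \sum f (X_{i-1}) ξ_{i} / \sum f^{2} (X_{i-1}), which follows by substituting X_{i} = θ f (X_{i-1}) + ξ_{i} into the definition of θ_{n}.

```python
import math
import random

def simulate_and_estimate(theta, n, f=math.tanh, x0=0.0, seed=1):
    """Simulate X_i = theta*f(X_{i-1}) + xi_i and return (theta_n, noise_ratio, B_hat).

    theta_n     = sum f(X_{i-1}) X_i  / sum f(X_{i-1})^2
    noise_ratio = sum f(X_{i-1}) xi_i / sum f(X_{i-1})^2
    B_hat       = (1/n) sum f(X_{i-1})^2, an empirical proxy for B_theta
    """
    rng = random.Random(seed)
    x_prev = x0
    num = den = noise = 0.0
    for _ in range(n):
        xi = rng.gauss(0.0, 1.0)        # (0, 1)-Gaussian innovation
        fx = f(x_prev)
        x_new = theta * fx + xi
        num += fx * x_new
        den += fx * fx
        noise += fx * xi
        x_prev = x_new
    return num / den, noise / den, den / n

theta, n = 0.5, 20_000
theta_n, noise_ratio, b_hat = simulate_and_estimate(theta, n)
print(f"theta_n = {theta_n:.4f}, B_hat = {b_hat:.4f}")
print(f"identity gap = {abs((theta_n - theta) - noise_ratio):.2e}")
```

The identity shows that the estimation error is a martingale sum divided by an ergodic average, which is the structure exploited in the proof of Theorem 7.1.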

Here, we give the asymptotics of

θ_{n}

in the MDP scale: for any

α \in (\frac{1}{2}, 1)

n^{1 - α} (θ - θ_{n}) \xrightarrow[n \to \infty]{MDP} (\frac{1}{n^{2 α - 1}}, \frac{y^{2}}{2 B_{θ}}) .

Theorem 7.1. The family

n^{1 - α} (θ - θ_{n})

obeys the MDP with the rate of speed

\frac{1}{n^{2 α - 1}}

and the rate function

I (y) = \frac{y^{2}}{2 B_{θ}} .

Proof. The use of

n^{1 - α} (θ - θ_{n}) = - \frac{\frac{1}{n^{α}} \sum_{i = 1}^{n} f (X_{i - 1}) ξ_{i}}{\frac{1}{n} \sum_{i = 1}^{n} f^{2} (X_{i - 1})}

and the law of large numbers,

P - {lim}_{n \to \infty} \frac{1}{n} \sum_{i = 1}^{n} f^{2} (X_{i - 1}) = B_{θ}

, gives a hint that the theorem statement is valid provided that (i) for

M_{n} = \sum_{i = 1}^{n} f (X_{i - 1}) ξ_{i}

, the family

{(\frac{1}{n^{α}} M_{n})}_{n \to \infty}

obeys the MDP with the rate of speed

\frac{1}{n^{2 α - 1}}

and the rate function

I (y) = \frac{y^{2}}{2 B_{θ}^{- 1}}

; (ii) for any

ɛ > 0

{lim}_{n \to \infty} \frac{1}{n^{2 α - 1}} log P (| \frac{1}{n} \sum_{i = 1}^{n} [f^{2} (X_{i - 1}) - B_{θ}] | \geq ɛ) = - \infty .

Following 1.5 and taking into account the setting, we notice that

E_{n} (λ) = exp (\sum_{i = 1}^{n} \frac{λ^{2}}{2 n^{2 α}} f^{2} (X_{i - 1}))

is the stochastic exponential related to

{(\frac{1}{n^{α}} M_{n})}_{n \to \infty}

. Consequently, 3.6 is reduced to (ii), that is, only (ii) is left to be verified.
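The form of the stochastic exponential can be traced to the Gaussian moment generating function. The following is a sketch under the assumption that F_{i-1} is generated by X_{0}, ξ_{1}, \dots, ξ_{i-1}, so that ξ_{i} is independent of F_{i-1} and f (X_{i-1}) is F_{i-1}-measurable; the filtration notation is introduced here only for illustration.

```latex
% Assumption: \xi_i is (0,1)-Gaussian and independent of F_{i-1};
% f(X_{i-1}) is F_{i-1}-measurable.
\begin{align*}
E\Bigl(e^{\lambda n^{-\alpha} f(X_{i-1})\,\xi_i}\,\Big|\,F_{i-1}\Bigr)
  &= \exp\Bigl(\frac{\lambda^{2}}{2n^{2\alpha}}\, f^{2}(X_{i-1})\Bigr)
  && \text{since } E e^{a\xi_1} = e^{a^{2}/2},\\
\prod_{i=1}^{n} E\Bigl(e^{\lambda n^{-\alpha} f(X_{i-1})\,\xi_i}\,\Big|\,F_{i-1}\Bigr)
  &= \exp\Bigl(\sum_{i=1}^{n}\frac{\lambda^{2}}{2n^{2\alpha}}\, f^{2}(X_{i-1})\Bigr).
\end{align*}
```

The exponent is λ^{2} n^{1 - 2 α} / 2 times the average \frac{1}{n} \sum f^{2} (X_{i-1}), which is why the behaviour of that average is the only remaining point.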

The verification of (ii) is in the framework of Theorem 2.1 . The function

H (x) = f^{2} (x) - B_{θ}

satisfies the assumptions of Theorem 2.1 . Hence, the family

{(\frac{1}{n^{α}} \sum_{k = 1}^{n} H (X_{k - 1}))}_{n \to \infty}

obeys the MDP with the rate of speed

\frac{1}{n^{2 α - 1}}

and the rate function

J (y) = \begin{cases} \frac{y^{2}}{2} {\hat{B}}_{θ}^{\oplus}, & {\hat{B}}_{θ} > 0, \\ \infty, & {\hat{B}}_{θ} = 0, y \neq 0, \end{cases}

where, in accordance with 1.3 ,

{\hat{B}}_{θ} = \int_{R} H^{2} (x) μ_{θ} (d x) + 2 \sum_{n \geq 1} \int_{R} H (x) P_{x}^{(n)} H μ_{θ} (d x) .

In particular,

\limsup_{n \to \infty} \frac{1}{n^{2 α - 1}} log P (| \frac{1}{n^{α}} \sum_{k = 1}^{n} H (X_{k - 1}) | \geq C ɛ) \leq \begin{cases} - \frac{C^{2} ɛ^{2}}{2 {\hat{B}}_{θ}}, & {\hat{B}}_{θ} > 0, \\ - \infty, & otherwise . \end{cases}

Hence, for any

C > 0

, we find that

\begin{matrix} \limsup_{n \to \infty} \frac{1}{n^{2 α - 1}} log P (| \frac{1}{n} \sum_{k = 1}^{n} H (X_{k - 1}) | \geq ɛ) \end{matrix}

\begin{matrix} = \limsup_{n \to \infty} \frac{1}{n^{2 α - 1}} log P (| \frac{1}{n^{α}} \sum_{k = 1}^{n} H (X_{k - 1}) | \geq n^{1 - α} ɛ) \end{matrix}

\begin{matrix} \leq \limsup_{n \to \infty} \frac{1}{n^{2 α - 1}} log P (| \frac{1}{n^{α}} \sum_{k = 1}^{n} H (X_{k - 1}) | \geq C ɛ) \end{matrix}

\begin{matrix} \leq \begin{cases} - \frac{C^{2} ɛ^{2}}{2 {\hat{B}}_{θ}}, & {\hat{B}}_{θ} > 0, \\ - \infty, & otherwise \end{cases} \xrightarrow[C \to \infty]{} - \infty . \end{matrix}

□

A Exponentially integrable martingale-differences

Let

ζ = (ζ_{n})_{n \geq 1}

be a martingale-difference sequence with respect to some filtration

F = (F_{n})_{n \geq 0}

and

M_{n} = \sum_{i = 1}^{n} ζ_{i}

be the corresponding martingale.

Theorem A.1. Assume that for sufficiently small positive

δ

and any

i \geq 1

\begin{matrix} E (e^{δ | ζ_{i} |} | F_{i - 1}) \leq \text{const} . \end{matrix}

(A.1)

Then for any

α \in (0.5, 1)

{lim}_{n \to \infty} \frac{1}{n^{2 α - 1}} log P (| M_{n} | > n ɛ) = - \infty, ɛ > 0 .

Proof. It suffices to prove

{lim}_{n \to \infty} \frac{1}{n^{2 α - 1}} log P (\pm M_{n} > n ɛ) = - \infty

. We verify only the “+” case (the proof for “-” is similar).

For fixed positive

λ

and sufficiently large

n

, let us introduce the stochastic exponential

E_{n} (λ) = \prod_{i = 1}^{n} E (e^{λ \frac{ζ_{i}}{n}} | F_{i - 1}) .

A direct verification shows that

E exp (\frac{λ M_{n}}{n} - log E_{n} (λ)) = 1 .

This equality implies

\begin{matrix} \begin{matrix} 1 & \geq E I (M_{n} > n ɛ) exp (\frac{λ M_{n}}{n} - log E_{n} (λ)) \\  & \geq E I (M_{n} > n ɛ) exp (λ ɛ - log E_{n} (λ)) . \end{matrix} \end{matrix}

(A.2)

Due to

E (λ \frac{ζ_{i}}{n} | F_{i - 1}) = 0

and A.1 , we find that

\begin{matrix} log E_{n} (λ) & = \sum_{i = 1}^{n} log (1 + E [e^{λ \frac{ζ_{i}}{n}} - 1 - λ \frac{ζ_{i}}{n} | F_{i - 1}]) \end{matrix}

\begin{matrix} \leq \sum_{i = 1}^{n} \{ \frac{λ^{2}}{2 n^{2}} E ((ζ_{i})^{2} | F_{i - 1}) + \frac{λ^{3}}{6 n^{3}} E (| ζ_{i} |^{3} e^{λ \frac{| ζ_{i} |}{n}} | F_{i - 1}) \} \end{matrix}

\begin{matrix} \leq K [\frac{λ^{2}}{2 n} + \frac{λ^{3}}{6 n^{2}}], \end{matrix}

where

K

is some constant. This inequality, being incorporated into A.2 , provides

1 \geq E I (M_{n} > n ɛ) exp (λ ɛ - K [\frac{λ^{2}}{2 n} + \frac{λ^{3}}{6 n^{2}}]) .
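The bound on log E_{n} (λ) used above can be obtained from log (1 + u) \leq u together with the elementary Taylor estimate e^{x} - 1 - x \leq x^{2} / 2 + | x |^{3} e^{| x |} / 6, applied with x = λ ζ_{i} / n. This reading of the step is an assumption about the intended argument; the sketch below only checks the elementary estimate numerically on a grid (the grid range and tolerances are arbitrary choices).

```python
import math

def taylor_sides(x):
    """Return (lhs, rhs) of e^x - 1 - x <= x^2/2 + |x|^3 e^{|x|} / 6."""
    lhs = math.expm1(x) - x                       # e^x - 1 - x, computed stably
    rhs = 0.5 * x * x + abs(x) ** 3 * math.exp(abs(x)) / 6.0
    return lhs, rhs

grid = [k / 100.0 for k in range(-500, 501)]      # x in [-5, 5], step 0.01
violations = []
for x in grid:
    lhs, rhs = taylor_sides(x)
    # Small relative/absolute slack guards against floating-point rounding.
    if lhs > rhs * (1.0 + 1e-12) + 1e-15:
        violations.append(x)

print("violations:", violations)
```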

For

ɛ < 3

, taking

λ = ɛ n K^{- 1}

, we find that

\frac{1}{n^{2 α - 1}} log P (M_{n} > n ɛ) \leq - \frac{ɛ^{2} n^{2 (1 - α)}}{K} (\frac{1}{2} - \frac{ɛ}{6}) \xrightarrow[n \to \infty]{} - \infty .

Thus, the desired statement holds true. □
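The substitution λ = ɛ n K^{- 1} in the last step can be written out as follows. This is a sketch assuming K \geq 1, which can be arranged by enlarging K and which makes the factor ɛ / (6 K) no larger than ɛ / 6.

```latex
% Sketch of the substitution \lambda = \varepsilon n / K, assuming K \ge 1.
\begin{align*}
\lambda\varepsilon - K\Bigl[\frac{\lambda^{2}}{2n} + \frac{\lambda^{3}}{6n^{2}}\Bigr]
  &= \frac{\varepsilon^{2} n}{K} - \frac{\varepsilon^{2} n}{2K} - \frac{\varepsilon^{3} n}{6K^{2}}
   = \frac{\varepsilon^{2} n}{K}\Bigl(\frac{1}{2} - \frac{\varepsilon}{6K}\Bigr)
   \ge \frac{\varepsilon^{2} n}{K}\Bigl(\frac{1}{2} - \frac{\varepsilon}{6}\Bigr),\\
\frac{1}{n^{2\alpha-1}} \log P(M_n > n\varepsilon)
  &\le -\frac{n}{n^{2\alpha-1}}\cdot\frac{\varepsilon^{2}}{K}\Bigl(\frac{1}{2} - \frac{\varepsilon}{6}\Bigr)
   = -\frac{\varepsilon^{2}\, n^{2(1-\alpha)}}{K}\Bigl(\frac{1}{2} - \frac{\varepsilon}{6}\Bigr).
\end{align*}
```

For ɛ < 3 the bracket is positive, and n^{2 (1 - α)} tends to infinity because α < 1.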

References

Albert, A. (1972) Regression and the Moore-Penrose Pseudoinverse. Academic Press, New York and London.
Balaji, S., Meyn, S.P. (2000) Multiplicative ergodicity and large deviations for an irreducible Markov chain. Stochastic Processes and their Applications. 90, pp. 123-144.
Bhattacharya, R.N. (1982) On the functional central limit theorem and the law of the iterated logarithm for Markov processes, Z. Wahrsch. verw. Geb. 60, pp. 185–201.
Chen, Xia, Guillin, A. (2004) The functional moderate deviations for Harris recurrent Markov chains and applications. Annales de l'Institut Henri Poincaré (B) Probabilités et Statistiques. 40, pp. 89-124.
Dembo, A. (1996) Moderate deviations for martingales with bounded jumps, Elect. Comm. in Probab. 1, pp. 11-17.
Ethier, S. N., Kurtz, T. G. (1986), Markov processes. Characterization and convergence, Wiley Series in Probability and Mathematical Statistics, John Wiley & Sons, New York et al.
Gong, F. and Wu, L. (2000) Spectral gap of positive operators and applications, C. R. Acad. Sci., Sér. I, Math. 331(12), pp. 983-988.
Kontoyiannis, I., Meyn, S.P. (2002) Spectral Theory and Limit Theorems for Geometrically Ergodic Markov Processes. Article math.PR/0209200.
Liptser, R.S. and Spokoiny, V. (1999) Moderate deviations type evaluation for integral functionals of diffusion processes, EJP. 4, Paper 17. (http://www.math.washington.edu/~ejpecp/)
Liptser, R., Spokoiny, V. and Veretennikov, A.Yu. (2002) Freidlin-Wentzell type large deviations for smooth processes. Markov Process and Relat. Fields. 8, pp. 611-636.
Meyn, S.P., Tweedie, R.L. (1993) Markov chains and stochastic stability, Springer-Verlag.
Papanicolaou, G.C., Stroock, D.W., Varadhan, S.R.S. (1977) Martingale approach to some limit theorems. in: Conference on Statistical Mechanics, Dynamical Systems and Turbulence, M. Reed ed., Duke Univ. Math. Series, 3.
Pardoux, E., Veretennikov, A.Yu. (2001) On Poisson equation and diffusion approximation, 1. Ann. Prob. 29, no. 3, pp. 1061-1085.
Puhalskii, A.A. (1991) On functional principle of large deviations. New trends in Probability and Statistics, Vilnius, Lithuania, VSP/Mokslas, pp. 198-218.
Puhalskii, A.A. (1994) The method of stochastic exponentials for large deviations. Stochast. Proc. Appl. 54, pp. 45-70.
Wu, L. (1992) Moderate deviations of dependent random variables related to CLT and LIL. Prépublication N. 118, Lab. de Probabilité de l'Université Paris VI.
Wu, L. (1995) Moderate deviations of dependent random variables related to CLT, Annals of Probability. 23, No. 1, pp. 420-445.

Université de Rennes 1, IRISA, Campus de Beaulieu, 35042 Rennes Cedex, France. E-mail address: bernard.delyon@univ-rennes1.fr
University Joseph Fourier of Grenoble, France. E-mail address: juditsky@inrialpes.fr
Department of Electrical Engineering-Systems, Tel Aviv University, 69978 Tel Aviv, Israel. E-mail address: liptser@eng.tau.ac.il

B. Delyon, A. Juditsky, R. Liptser