Almost Sure Invariance Principle for Nonuniformly Hyperbolic Systems. 23 September, 2004. Revised 8 February, 2005.

Statistical limit laws such as the central limit theorem, the law of the iterated logarithm, and their functional versions, are immediate consequences.

1 Introduction

Statistical properties of uniformly expanding maps and uniformly hyperbolic (Axiom A) diffeomorphisms are by now classical. Hölder observations satisfy exponential decay of correlations and the central limit theorem (CLT), see for example Bowen [8] , Ratner [34] , Ruelle [35] , Parry and Pollicott [31] . Furthermore, Denker and Philipp [17] proved an almost sure invariance principle (ASIP) for Hölder observations.

Immediate consequences of the ASIP are the CLT, the law of the iterated logarithm (LIL), and their functional versions, see [32] .

Many proofs of the CLT for dynamical systems directly use the martingale approximation method of Gordin [20] , see [25, 26, 30] . The ASIP can often also be obtained in this way, see [16, 19, 30, 39] , and indeed this method yields a better error estimate in the ASIP than the usual one, see Field et al. [19] . However, it should be emphasised that the martingale approximation of Gordin [20] leads directly only to a reverse martingale increment sequence, and so the ASIP is obtained in backwards time in the first instance. This is not an issue for distributional results such as the CLT, but the ASIP in [16, 19, 30] uses explicitly the fact that the class of systems being studied is closed under time-reversal. To obtain forward martingale approximations, it is necessary to use more sophisticated versions of Gordin's approach [32] .

Recently, there has been an explosion of interest in nonuniformly expanding maps and nonuniformly hyperbolic diffeomorphisms (possibly with singularities). We refer to the articles of Young [40, 41] as well as Aaronson [1] , Baladi [4, 5] , Gouëzel [22] , Viana [38] and references therein. In particular, decay of correlations and the CLT are studied extensively in these references. However, such classes of dynamical systems are intrinsically time-orientation specific, and largely for this reason the ASIP has not previously been proved. Similarly, the LIL was previously unproved for such systems.

In this paper, we establish the ASIP, and hence the (functional) LIL, for nonuniformly expanding/hyperbolic systems. Both discrete time systems and flows are covered by our results.

Remark 1.1 We note that [33] attempted to apply the approach in [19] to nonuniformly expanding systems. However, it appears that the time-orientation issue discussed above was overlooked in [33] , and that this is a gap. Hence it seems necessary to find an alternative approach to the one in [19] , and that is what is done in the current paper.

Precise formulations are given in the body of the paper, but here is an outline of our main result, and the strategy behind its proof, for a nonuniformly expanding map

T : M \to M

where

(M, d)

is a metric space. By standard methods,

T

can be modelled by a discrete-time suspension over a Gibbs-Markov map [1]

f : Y \to Y

with return time function

R : Y \to Z^{+}

. (Roughly speaking, a Gibbs-Markov map is like a uniformly expanding map with possibly countably many inverse branches.) There exists a unique ergodic

T

-invariant probability measure equivalent to Lebesgue, and the following result is formulated with this measure in mind.

Theorem 1.2 Let

T : M \to M

be a nonuniformly expanding map. Assume moreover that

R \in L^{2 + δ} (Y)

. Let

φ : M \to R

be a mean zero Hölder observation. Then

φ

satisfies the ASIP. That is, there exists

ε > 0

, a sequence of random variables

{S_{n}}

and a Brownian motion

W

with variance

σ^{2} \geq 0

such that

{\sum_{j = 0}^{N - 1} φ \circ T^{j}} =_{d} {S_{N}}

, and

S_{N} = W (N) + O (N^{\frac{1}{2} - ε}) as N \to \infty,

almost everywhere.

Using a method due to Hofbauer and Keller [24] which exploits a result of Philipp and Stout [32,Theorem 7.1] , we obtain the ASIP (in the correct time direction but without the improved error term) for

f : Y \to Y

and a class of “weighted Lipschitz” observations. Theorem 1.2 then follows directly by Melbourne and Török [29] . (We note that the method in [29] has independently been used by Gouëzel [21] to obtain a simplified derivation of the CLT and stable laws.) A precise version of Theorem 1.2 is stated and proved in Section 2 (e). The ASIP for nonuniformly expanding maps extends easily to a class of nonuniformly expanding semiflows, see Section 2 (e).

Our results for nonuniformly hyperbolic diffeomorphisms and nonuniformly hyperbolic flows are completely analogous, but the set-up is more technical and we postpone further details until Section 3 .

Planar periodic Lorentz gas

The planar periodic Lorentz gas is a class of examples introduced by Sinaĭ [36] .

See [15] for a survey of results about Lorentz gases. The Lorentz flow is a billiard flow on

T^{2} - Ω

where

Ω

is a disjoint union of convex regions with

C^{3}

boundaries. (The phase-space of the flow is three-dimensional; planar position and direction.) The flow has a natural global cross-section

M = \partial Ω \times [- π / 2, π / 2]

corresponding to collisions and the Poincaré map

T : M \to M

is called the billiard map. Bunimovich, Sinaĭ and Chernov [11] proved the central limit theorem and weak invariance principle for such maps.

Denote the return time function by

h : M \to R^{+}

. The Lorentz flow satisfies the finite horizon condition if

h

is uniformly bounded. The central limit theorem and weak invariance principle were proved in [11] for Lorentz flows satisfying the finite horizon condition.

Theorem 1.3 Suppose that

T_{t}

is a planar periodic Lorentz gas.

(i) The billiard map satisfies the ASIP for Hölder observations.
(ii) If the finite horizon condition holds, then the Lorentz flow satisfies the ASIP for Hölder observations.

In Section 2 , we prove the ASIP for nonuniformly expanding maps and semiflows.

In Section 3 , we prove the analogous results for systems that are nonuniformly hyperbolic in the sense of Young [40] . In Section 4 , we list numerous examples in the literature for which our results apply. In particular, we prove Theorem 1.3 . The results of [32, 29] required in this paper are reproduced as appendices.

2 Nonuniformly expanding systems

In this section, we prove the ASIP for nonuniformly expanding systems. The first step is to prove the ASIP for Gibbs-Markov maps. Such maps are reviewed in Subsection (a) and a class of “weighted Lipschitz” observations is introduced in Subsection (b). The ASIP for Gibbs-Markov maps is proved in Subsection (c) using an approach of Hofbauer and Keller [24] . In Subsection (d), we obtain the ASIP for Young towers [41] as an application of [29] . In Subsection (e), we prove the ASIP for nonuniformly expanding maps and semiflows.

(a) Gibbs-Markov maps

Let

(Λ, m)

be a Lebesgue space with a countable measurable partition

α

. Without loss, we suppose that all partition elements

a \in α

have

m (a) > 0

. Recall that a measure-preserving transformation

f : Λ \to Λ

is a Markov map if

f (a)

is a union of elements of

α

and

f |_{a}

is injective for all

a \in α

. Define

α^{'}

to be the coarsest partition of

Λ

such that

f a

is a union of atoms in

α^{'}

for all

a \in α

. (So

α^{'}

is a coarser partition than

α

.) If

a_{0}, \dots, a_{n - 1} \in α

, we define the

n

-cylinder

[a_{0}, \dots, a_{n - 1}] = \cap_{i = 0}^{n - 1} f^{- i} a_{i}

. It is assumed that

f

and

α

separate points in

Λ

(if

x, y \in Λ

and

x \neq y

, then for

n

large enough there exist distinct

n

-cylinders that contain

x

and

y

).

Let

0 < β < 1

. We define a metric

d_{β}

on

Λ

by

d_{β} (x, y) = β^{s (x, y)}

where

s (x, y)

is the greatest integer

n \geq 0

such that

x, y

lie in the same

n

-cylinder. Define

g = J f^{- 1} = \frac{d m}{d (m \circ f)}

and set

g_{k} = g g \circ f \dots g \circ f^{k - 1}
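
The separation time and the metric defined above can be sketched in a few lines of Python. This is only an illustration under an assumption that is not part of the text: a point of the space is represented by a finite truncation of its itinerary, that is, by the tuple of partition elements visited by its first few iterates, so that two points lie in the same n-cylinder exactly when their first n symbols agree. The names `separation_time` and `d_beta` are hypothetical.

```python
from itertools import takewhile


def separation_time(x, y):
    """Length of the common prefix of two itineraries, i.e. the greatest n
    such that x and y lie in the same n-cylinder (assumed representation)."""
    return sum(1 for _ in takewhile(lambda pair: pair[0] == pair[1], zip(x, y)))


def d_beta(x, y, beta=0.5):
    """Symbolic metric beta ** s(x, y); identical words are at distance 0."""
    if x == y:
        return 0.0
    return beta ** separation_time(x, y)


x = (0, 1, 1, 0, 1)
y = (0, 1, 0, 0, 1)
z = (0, 1, 1, 1, 1)

print(separation_time(x, y), d_beta(x, y))
print(separation_time(x, z), d_beta(x, z))
# The metric is an ultrametric: d(x, y) <= max(d(x, z), d(z, y)).
print(d_beta(x, y) <= max(d_beta(x, z), d_beta(z, y)))
```

In this representation every n-cylinder has diameter at most β^n, which is the fact used repeatedly in the estimates below.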

The map

f : Λ \to Λ

is a Gibbs-Markov map if it satisfies the additional properties:

(i) Big images property: There exists $c > 0$ such that $m (f a) \geq c$ for all $a \in α$ .
(ii) Distortion: $\log g |_{a}$ is Lipschitz with respect to $d_{β}$ for all $a \in α^{'}$ .

It follows from assumptions (i) and (ii) that there exists a constant

D \geq 1

such that for all

x, y

lying in a common

k

-cylinder

[a_{0}, \dots, a_{k - 1}]

| \frac{g_{k} (x)}{g_{k} (y)} - 1 | \leq D d_{β} (f^{k} x, f^{k} y) \quad \text{and} \quad D^{- 1} \leq \frac{m [a_{0}, \dots, a_{k - 1}]}{g_{k} (x)} \leq D . \qquad (2.1)

(b) Weighted Lipschitz observations

Let

p \geq 1

. We fix a sequence of weights

R (a) > 0

satisfying

| R |_{p} = (\sum_{a \in α} m (a) R (a)^{p})^{1 / p} < \infty

Given

v : Λ \to R

continuous, we set

v_{a} = v |_{a}

and define

| v |_{β}

to be the Lipschitz constant of

v

with respect to the metric

d_{β}

. Let

∥ v ∥_{\infty} = \sup_{a \in α} | v_{a} |_{\infty} / R (a), \quad ∥ v ∥_{β} = \sup_{a \in α} | v_{a} |_{β} / R (a) .

Let

ℬ

be the space of weighted Lipschitz functions with

∥ v ∥ = ∥ v ∥_{\infty} + ∥ v ∥_{β} < \infty

. Note in particular that

R \in ℬ

and

∥ R ∥ = 1

. We have the embeddings

\mathrm{Lip} \subset ℬ \subset L^{p} \subset L^{1},

where

\mathrm{Lip}

is the space of (globally) Lipschitz functions.

The transfer (Perron-Frobenius) operator

P : L^{1} \to L^{1}

maps

v \in L^{1}

to

P v

where

\int_{Λ} P v w d m = \int_{Λ} v w \circ f d m

for all

w \in L^{\infty}

, and is given by

(P v) (x) = \sum_{f y = x} g (y) v (y)

. Note that

| P |_{1} = 1
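
As a concrete illustration of the two descriptions of the transfer operator (the duality relation and the sum over preimages), here is a minimal sketch for the doubling map on the unit interval with Lebesgue measure, where the weight g is identically 1/2. The doubling map, the test functions and the helper names are assumptions chosen for the example and do not come from the text. Exact rational arithmetic is used so that the two midpoint sums can be compared without rounding error.

```python
from fractions import Fraction


def doubling(x):
    """Doubling map f(x) = 2x mod 1 on [0, 1)."""
    return (2 * x) % 1


def transfer(v):
    """Transfer operator of the doubling map for Lebesgue measure:
    (P v)(x) = sum over preimages y of x of g(y) v(y), with g = 1/2."""
    half = Fraction(1, 2)
    return lambda x: half * (v(x / 2) + v((x + 1) / 2))


def midpoint_sum(func, cells):
    """Midpoint Riemann sum of func over [0, 1) using `cells` equal cells."""
    return sum(func(Fraction(2 * i + 1, 2 * cells)) for i in range(cells)) / cells


v = lambda x: x * x
w = lambda x: 3 * x + 1
M = 64

# Discrete analogue of: integral of (P v) w  ==  integral of v (w o f).
lhs = midpoint_sum(lambda x: transfer(v)(x) * w(x), M // 2)
rhs = midpoint_sum(lambda x: v(x) * w(doubling(x)), M)
print(lhs == rhs)
```

The grid sizes are chosen so that the doubling map sends the fine midpoint grid exactly two-to-one onto the coarse one, which is why the two sums agree exactly rather than approximately.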

Proposition 2.1 Let

a \in α

be an

n

-cylinder and suppose that

v : a \to R

is Lipschitz. Then

| v |_{\infty} \leq \frac{1}{m (a)} \int_{a} | v | d m + β^{n} | v |_{β}

Proof For

x \in a

| v (x) | \leq \frac{1}{m (a)} \int_{a} | v | d m + | v (x) - \frac{1}{m (a)} \int_{a} v d m | \leq \frac{1}{m (a)} \int_{a} | v | d m + | v |_{β} diam (a) .

The result follows since

diam (a) = β^{n}

Lemma 2.2 The transfer operator

P

restricts to an operator

P : ℬ \to ℬ

and there exists a constant

C \geq 1

such that

∥ P^{n} v ∥ \leq C (| v |_{1} + β^{n} ∥ v ∥_{β}),

for all

v \in ℬ

and

n \geq 1

. Moreover,

P (ℬ) \subset \mathrm{Lip}

Proof We prove the estimate on

∥ P^{n} v ∥

. The remaining statements of the lemma are evident from the proof.

Note that

(P^{n} v) (x) = \sum_{f^{n} y = x} g_{n} (y) v (y)

. Since the

n

-cylinders

[a_{0}, \dots, a_{n - 1}]

form a partition and each

n

-cylinder contains precisely one preimage

y_{a}

, we have

\begin{aligned}
| (P^{n} v) (x) | & \leq \sum_{a = [a_{0}, \dots, a_{n - 1}]} g_{n} (y_{a}) | v (y_{a}) | \leq D \sum_{a = [a_{0}, \dots, a_{n - 1}]} m (a) | v_{a} |_{\infty} \\
& \leq D \sum_{a = [a_{0}, \dots, a_{n - 1}]} [\int_{a} | v | d m + m (a) β^{n} | v_{a} |_{β}] \\
& \leq D \sum_{a = [a_{0}, \dots, a_{n - 1}]} [\int_{a} | v | d m + β^{n} m (a) R (a_{0}) ∥ v ∥_{β}]
\end{aligned}

where we have used Proposition 2.1 and estimate (2.1) . Hence

| P^{n} v |_{\infty} \leq D [| v |_{1} + β^{n} | R |_{1} ∥ v ∥_{β}]

. Similarly,

\begin{aligned}
| (P^{n} v) (x) - (P^{n} v) (x^{'}) | & \leq \sum_{a = [a_{0}, \dots, a_{n - 1}]} | g_{n} (y_{a}) - g_{n} (y_{a}^{'}) | | v (y_{a}) | \\
& \quad + \sum_{a = [a_{0}, \dots, a_{n - 1}]} | g_{n} (y_{a}^{'}) | | v (y_{a}) - v (y_{a}^{'}) | .
\end{aligned}

Each term in the first summation can be estimated by

\begin{aligned}
D | g_{n} (y_{a}^{'}) | d_{β} (f^{n} y_{a}, f^{n} y_{a}^{'}) | v_{a} |_{\infty} & \leq D^{2} m (a) d_{β} (x, x^{'}) [\frac{1}{m (a)} \int_{a} | v_{a} | d m + β^{n} | v_{a} |_{β}] \\
& \leq D^{2} [\int_{a} | v | d m + β^{n} m (a) R (a_{0}) ∥ v ∥_{β}] d_{β} (x, x^{'}),
\end{aligned}

so the first summation is bounded by

D^{2} [| v |_{1} + β^{n} | R |_{1} ∥ v ∥_{β}] d_{β} (x, x^{'})

. Each term in the second summation can be estimated by

D m (a) | v_{a} |_{β} d_{β} (y_{a}, y_{a}^{'}) \leq D m (a) R (a_{0}) ∥ v ∥_{β} β^{n} d_{β} (x, x^{'}),

so the second summation is bounded by

β^{n} D | R |_{1} ∥ v ∥_{β} d_{β} (x, x^{'})

. The result follows.

We have the following standard consequences of Lemma 2.2 .

Corollary 2.3 Let

p, q \geq 1

with

\frac{1}{p} + \frac{1}{q} = 1

. Assume that

f : Λ \to Λ

is mixing and that

R \in L^{p}

. Then there exist constants

C \geq 1

and

τ \in (0, 1)

such that

(a) $∥ P^{n} v - \int_{Λ} v d m ∥ \leq C τ^{n} ∥ v ∥$ for all $v \in ℬ$ and $n \geq 1$ .
(b) $| \int_{Λ} v (w \circ f^{n}) d m - \int_{Λ} v d m \int_{Λ} w d m | \leq C τ^{n} ∥ v ∥ | w |_{q}$ for all $v \in ℬ$ , $w \in L^{q}$ , $n \geq 1$ .
(c) If $R \in L^{2}$ , then for any $v \in ℬ$ with $\int_{Λ} v d m = 0$ , the series $σ^{2} = \int_{Λ} v^{2} d m + 2 \sum_{k = 1}^{\infty} \int_{Λ} v (v \circ f^{k}) d m,$ is absolutely convergent, and $\int_{Λ} v_{N}^{2} d m = σ^{2} N + O (1)$ as $N \to \infty$ , where $v_{N} = \sum_{j = 0}^{N - 1} v \circ f^{j}$ . Moreover, $σ = 0$ if and only if there exists a Lipschitz function $w : Λ \to R$ such that $v = w \circ f - w$ .
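
Part (c) can be checked by hand in a simple case. The sketch below is an illustration under assumptions that are not in the text: the map is the doubling map with Lebesgue measure and the observation is v(x) = x - 1/2. For this map the transfer operator sends an affine function a x + b to (a/2) x + (a/4 + b), so the correlations can be computed exactly with rational arithmetic, and one finds σ² = 1/4. The function names are hypothetical.

```python
from fractions import Fraction


def transfer_affine(a, b):
    """Doubling-map transfer operator applied to v(x) = a*x + b.
    Returns the coefficients of the affine function P v."""
    return a / 2, a / 4 + b


def correlation(k):
    """Exact integral of v * (v o f^k) for v(x) = x - 1/2,
    computed by duality as the integral of (P^k v) * v."""
    a, b = Fraction(1), Fraction(-1, 2)
    for _ in range(k):
        a, b = transfer_affine(a, b)
    # The integral over [0, 1] of (a*x + b) * (x - 1/2) equals a / 12.
    return a / 12


def second_moment(N):
    """Integral of v_N^2, expanded as N*c_0 + 2*sum_{k=1}^{N-1} (N - k)*c_k."""
    return N * correlation(0) + 2 * sum((N - k) * correlation(k) for k in range(1, N))


# c_k = 2^{-k} / 12, so sigma^2 = c_0 + 2 * sum_{k>=1} c_k = 1/12 + 2/12.
sigma2 = Fraction(1, 4)

for N in (1, 10, 40):
    # The gap N*sigma^2 - (integral of v_N^2) stays bounded, as in part (c).
    print(N, float(N * sigma2 - second_moment(N)))
```

In this example the gap equals (1 - 2^{-N})/3 exactly, so it is bounded and converges to 1/3, consistent with the O(1) term in part (c).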

Proof Most of this result is completely standard, but we include the details for completeness. By an Arzela-Ascoli argument, the unit ball in

ℬ

is compact in

L^{1}

. This combined with Lemma 2.2 implies, by Hennion [23] , that the essential spectral radius of

P : ℬ \to ℬ

is bounded above by

β < 1

. There is a simple eigenvalue at

1

with eigenspace consisting of constant functions, but the mixing assumption guarantees that there are no further eigenvalues on the unit circle. Now choose

τ \in (β, 1)

such that all eigenvalues of

P

other than

1

lie strictly inside the disk of radius

τ

. Part (a) follows for such a choice of

τ

To prove part (b), compute that

\begin{aligned}
| \int_{Λ} v (w \circ f^{n}) d m - \int_{Λ} v d m \int_{Λ} w d m | & = | \int_{Λ} (P^{n} v - \int_{Λ} v) w d m | \leq | P^{n} v - \int_{Λ} v |_{p} | w |_{q} \\
& \leq ∥ P^{n} v - \int_{Λ} v ∥ | w |_{q} \leq C τ^{n} ∥ v ∥ | w |_{q} .
\end{aligned}

It follows from (b) that

| \int_{Λ} v (v \circ f^{k}) d m | \leq C τ^{k} ∥ v ∥ | v |_{2}

and so the series for

σ^{2}

converges absolutely. Moreover

\begin{aligned}
\int_{Λ} v_{N}^{2} d m & = N \int_{Λ} v^{2} d m + 2 \sum_{0 \leq i < j \leq N - 1} \int_{Λ} v (v \circ f^{j - i}) d m \\
& = N \int_{Λ} v^{2} d m + 2 \sum_{k = 1}^{N} (N - k) \int_{Λ} v (v \circ f^{k}) d m \\
& = N σ^{2} - 2 \sum_{k = 1}^{N} k \int_{Λ} v (v \circ f^{k}) d m - 2 \sum_{k = N + 1}^{\infty} N \int_{Λ} v (v \circ f^{k}) d m \\
& = N σ^{2} + O (1),
\end{aligned}

proving (c).

The criterion for

σ = 0

follows as in [19, 28] . If

v = w \circ f - w

, then

v_{N} = w \circ f^{N} - w

so it is clear that

σ = 0

. To prove the converse, define

w = \sum_{j = 1}^{\infty} P^{j} v

. This series converges in

ℬ

by (a) and is Lipschitz by Lemma 2.2 . Write

v = \hat{v} + w \circ f - w

Then it is easily seen that

\hat{v}

has the same variance as

v

and that

P \hat{v} = 0

. Hence

σ^{2} = \int_{Λ} {\hat{v}}^{2} d m

, so if

σ = 0

, then

\hat{v} = 0

and

v = w \circ f - w

(c) ASIP for Gibbs-Markov maps

Let

α_{0}^{k - 1}

denote the partition into length

k

cylinders

a = [a_{0}, \dots, a_{k - 1}]

Lemma 2.4 Assume that

f : Λ \to Λ

is mixing and that

R \in L^{2 + δ}

for some

δ > 0

. Let

v \in ℬ

with

\int_{Λ} v d m = 0

. Then

(a) $\sum_{a \in α_{0}^{k - 1}} \int_{a} | v - \frac{1}{m (a)} \int_{a} v d m |^{2 + δ} d m \leq {(∥ v ∥_{β} | R |_{2 + δ} β^{k})}^{2 + δ}$ .
(b) $| m (a \cap f^{- (N + k)} (b)) - m (a) m (b) | \leq C τ^{N} m (a) m (b)^{1 / 2}$ for all $a \in α_{0}^{k - 1}$ and all measurable sets $b$ .

Proof Note that

| v - \frac{1}{m (a)} \int_{a} v d m | \leq | v_{a} |_{β} diam (a) \leq ∥ v ∥_{β} R (a_{0}) β^{k}

. Part (a) follows immediately.

We argue as in Aaronson & Denker [2] to establish (b). Let

v_{a, k} = P^{k} χ_{a}

. By definition,

v_{a, k} (x) = \sum_{f^{k} y = x} g_{k} (y) χ_{a} (y) = g_{k} (y_{a})

where

y_{a}

is the unique point in

a

such that

f^{k} y_{a} = x

. Hence by 2.1 ,

| v_{a, k} (x) - v_{a, k} (x^{'}) | \leq D | g_{k} (y_{a}) | d_{β} (x, x^{'}) \leq D^{2} m (a) d_{β} (x, x^{'}) .

It follows that

∥ v_{a, k} ∥ \leq E m (a)

where

E = D^{2} + D

Using this estimate and Corollary 2.3 (b), we compute that

\begin{aligned}
| m (a \cap f^{- (N + k)} b) - m (a) m (b) | & = | \int P^{k} χ_{a} χ_{b} \circ f^{N} - \int P^{k} χ_{a} \int χ_{b} | \\
& = | \int v_{a, k} χ_{b} \circ f^{N} - \int v_{a, k} \int χ_{b} | \leq C τ^{N} ∥ v_{a, k} ∥ | χ_{b} |_{2} \leq C E τ^{N} m (a) m (b)^{1 / 2},
\end{aligned}

as required.

Corollary 2.5 Let

f : Λ \to Λ

be an ergodic Gibbs-Markov map. Define the Banach space

ℬ

corresponding to weights

R \in L^{2 + δ}

for some

δ > 0

Suppose that

v \in ℬ

and

\int_{Λ} v d m = 0

. Define

σ^{2}

as in Corollary 2.3 and assume that

σ^{2} > 0

. Then

v_{N} = \sum_{j = 0}^{N - 1} v \circ f^{j}

satisfies the ASIP.

Proof We verify the hypotheses of Philipp & Stout [32,Theorem 7.1] . For convenience, we have translated this theorem into dynamical systems terminology in the appendix, see Theorem A.1 . Condition (i) of Theorem A.1 is automatic since

ℬ \subset L^{2 + δ}

and condition (ii) follows from Corollary 2.3 (c). Conditions (iii) and (iv) follow from parts (a) and (b) of Lemma 2.4 .

(d) ASIP for tower maps

Suppose that

(Λ, m)

is a probability space and that

f : Λ \to Λ

is a measure-preserving transformation. Let

R : Λ \to Z^{+}

be a measurable function (called a return time function) with

R \in L^{1} (Λ)

. Define the suspension

Δ = {(x, ℓ) \in Λ \times N : 0 \leq ℓ \leq R (x)} / \sim,

where

(x, R (x)) \sim (f (x), 0)

. Define

F : Δ \to Δ

by

F (x, ℓ) = (x, ℓ + 1)

computed subject to identifications. Note in particular that

F (x, R (x) - 1) = (f (x), 0)

. An

F

-invariant probability measure on

Δ

is given by

m^{R} = m \times l / \bar{R}

where

\bar{R} = \int_{Λ} R d m

and

l

is counting measure on

N
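
The suspension construction can be made concrete with a small sketch. The base map, the return times and the name `make_tower_map` below are hypothetical choices for illustration only. Points of the suspension are stored as pairs (x, level) with 0 <= level < R(x), which is the same space once (x, R(x)) has been identified with (f(x), 0).

```python
def make_tower_map(f, R):
    """Return the suspension map F acting on pairs (x, level), 0 <= level < R(x)."""
    def F(point):
        x, level = point
        if level + 1 < R(x):
            return (x, level + 1)   # climb one level of the tower
        return (f(x), 0)            # top level: return to the base via f
    return F


# Hypothetical base: the cyclic shift on {0, 1, 2} with return times 1, 2, 3.
f = lambda x: (x + 1) % 3
R = lambda x: x + 1
F = make_tower_map(f, R)

# Follow the point (2, 0) for R(2) steps; it should land on (f(2), 0).
point = (2, 0)
orbit = [point]
for _ in range(R(2)):
    point = F(point)
    orbit.append(point)
print(orbit)

# Since f is a bijection here, F permutes the six points of the tower,
# so normalised counting measure on the tower is F-invariant.
states = [(x, level) for x in range(3) for level in range(R(x))]
print(sorted(F(s) for s in states) == sorted(states))
```

The second check is the finite analogue of the invariance of the measure m^R: each point of the tower has exactly one preimage of the same weight.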

Let

{Δ_{j, 0}}

be a countable measurable partition of

Λ

such that

f

and

{Δ_{j, 0}}

separate points in

Λ

, and for each

j

,

R_{j} = R |_{Δ_{j, 0}}

is constant and

f : Δ_{j, 0} \to Λ

is a measurable isomorphism. For each

j

and

0 \leq ℓ < R_{j}

, let

Δ_{j, ℓ} = Δ_{j, 0} \times {ℓ}

. This defines a partition

{Δ_{j, ℓ}}

of

Δ

A separation time function

s : Δ \times Δ \to N

is defined as follows: If

x, y

lie in distinct partition elements, then

s (x, y) = 0

. If

x, y \in Δ_{j, 0}

for some

j

, then

s (x, y)

is the greatest integer

n \geq 0

such that

f^{k} x

and

f^{k} y

lie in the same partition element of

Λ

for

k = 0, \dots, n

. If

x, y \in Δ_{j, ℓ}

, then write

x = F^{ℓ} x_{0}

,

y = F^{ℓ} y_{0}

where

x_{0}, y_{0} \in Δ_{j, 0}

and define

s (x, y) = s (x_{0}, y_{0})

. For

θ \in (0, 1)

, we define a metric

d_{θ}

on

Δ

by setting

d_{θ} (x, y) = θ^{s (x, y)}

Definition 2.6 The suspension

F : Δ \to Δ

is called a Young tower if

f : Λ \to Λ

is a Gibbs-Markov map with respect to the partition

α = {Δ_{j, 0}}

Remark 2.7 The big images condition for

f

to be a Gibbs-Markov map is automatically satisfied in the strong sense that

f (a) = Λ

for each

a \in α

. Hence,

F : Δ \to Δ

is a Young tower provided the distortion condition holds: there exist constants

θ \in (0, 1)

and

C \geq 1

such that for each

j

the Jacobian

g_{j} = J f |_{Δ_{j, 0}} : Δ_{j, 0} \to Λ

satisfies

| log g_{j} (x) - log g_{j} (y) | \leq C d_{θ} (x, y)

for all

x, y \in Δ_{j, 0}

Theorem 2.8 Let

F : Δ \to Δ

be a Young tower defined as a suspension over

f : Λ \to Λ

with return time function

R

. Assume that

R \in L^{2 + δ} (Λ)

. Let

φ : Δ \to R

be a mean zero observation and assume that

φ

is Lipschitz with respect to

d_{θ}

. Then

φ_{N} = \sum_{j = 0}^{N - 1} φ \circ F^{j}

satisfies the ASIP.

Proof Define a mean zero observation

Φ : Λ \to R

by setting

Φ (x) = \sum_{j = 0}^{R (x) - 1} φ (x, j)

Since

φ

is Lipschitz, it is immediate that

Φ

lies in the space

ℬ

of weighted Lipschitz observations. Since

R \in L^{2 + δ} (Λ)

, it follows from Corollary 2.5 that

Φ_{N} = \sum_{j = 0}^{N - 1} Φ \circ f^{j}

satisfies the ASIP on

Λ

Note that

R - \bar{R}

also satisfies the hypotheses of Corollary 2.5 , and so the ASIP, and hence the LIL, applies. Therefore, it is certainly the case that

\sum_{j = 0}^{N - 1} R \circ f^{j} = N \bar{R} + o (N^{1 - δ})

almost everywhere. The result follows from [29,Theorem 4.2] , see Corollary B.2 .
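
The role of the induced observation in the proof can be seen in a toy computation: a Birkhoff sum along the tower that ends exactly at a return to the base equals a Birkhoff sum of the induced observation along the base map. The sketch below uses a hypothetical three-point base, return times and observation, none of which come from the text.

```python
def induced_observable(phi, R):
    """Induced observation: Phi(x) = sum_{j=0}^{R(x)-1} phi(x, j)."""
    return lambda x: sum(phi(x, j) for j in range(R(x)))


def tower_birkhoff_sum(phi, f, R, x, steps):
    """Sum of phi along `steps` iterates of the tower map, starting at (x, 0)."""
    total, level = 0, 0
    for _ in range(steps):
        total += phi(x, level)
        level += 1
        if level == R(x):           # top of the tower: return to the base
            x, level = f(x), 0
    return total


# Hypothetical data: cyclic base map on {0, 1, 2}, return times 1, 2, 3.
f = lambda x: (x + 1) % 3
R = lambda x: x + 1
phi = lambda x, level: x - level

Phi = induced_observable(phi, R)

# Starting from (0, 0), three returns to the base take R(0)+R(1)+R(2) = 6 steps.
tower_sum = tower_birkhoff_sum(phi, f, R, 0, 6)
base_sum = Phi(0) + Phi(1) + Phi(2)
print(tower_sum, base_sum)
```

Between returns the two sums differ by at most a partial sum over one column of the tower, which is the discrepancy that the growth condition on the return times is used to control.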

(e) ASIP for nonuniformly expanding systems

Let

(M, d)

be a locally compact separable bounded metric space with Borel probability measure

η

and let

T : M \to M

be a nonsingular transformation for which

η

is ergodic. Let

Y \subset M

be a measurable subset with

η (Y) > 0

. We suppose that there is an at most countable measurable partition

{Y_{j}}

with

η (Y_{j}) > 0

, and that there exist integers

R_{j} \geq 1

, and constants

λ > 1

;

C, D > 0

and

γ \in (0, 1)

such that for all

j

(1) $T^{R_{j}} : Y_{j} \to Y$ is a (measure-theoretic) bijection.
(2) $d (T^{R_{j}} x, T^{R_{j}} y) \geq λ d (x, y)$ for all $x, y \in Y_{j}$ .
(3) $d (T^{k} x, T^{k} y) \leq C d (T^{R_{j}} x, T^{R_{j}} y)$ for all $x, y \in Y_{j}$ , $k < R_{j}$ .
(4) $g_{j} = \frac{d (η |_{Y_{j}} \circ (T^{R_{j}})^{- 1})}{d η |_{Y}}$ satisfies $| log g_{j} (x) - log g_{j} (y) | \leq D d (x, y)^{γ}$ for almost all $x, y \in Y$ .
(5) $\sum_{j} R_{j} η (Y_{j}) < \infty$ .

We say that a dynamical system

T

satisfying (1)–(5) is nonuniformly expanding.

Define the return time function

R : Y \to Z^{+}

by

R |_{Y_{j}} \equiv R_{j}

. Condition (5) says that

\int_{Y} R d η < \infty

. The map

f : Y \to Y

given by

f (y) = T^{R (y)} (y)

is the corresponding induced map. It can be shown (see Young [41,Theorem 1] ) that there is a unique invariant probability measure

m

on

M

that is equivalent to

η
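
To make the return time function and the induced map tangible, here is a small numerical sketch. It uses an intermittent interval map of Liverani-Saussol-Vaienti type with Y = [1/2, 1] as an assumed example; this map, the parameter value and the function names are illustrative choices and are not taken from the text above. The computation is in floating point, so it is only meant to show how R(y) and f(y) = T^{R(y)}(y) are obtained, not to verify conditions (1)-(5).

```python
def lsv_map(x, alpha=0.5):
    """Intermittent map: x * (1 + (2x)^alpha) on [0, 1/2), 2x - 1 on [1/2, 1]."""
    if x < 0.5:
        return x * (1.0 + (2.0 * x) ** alpha)
    return 2.0 * x - 1.0


def first_return(y, alpha=0.5, max_iter=10_000):
    """First return time R(y) to Y = [1/2, 1] and the image f(y) = T^{R(y)}(y)."""
    x = lsv_map(y, alpha)
    steps = 1
    while x < 0.5:
        if steps >= max_iter:
            raise RuntimeError("no return within max_iter steps")
        x = lsv_map(x, alpha)
        steps += 1
    return steps, x


for y in (0.9, 0.6, 0.52):
    R_y, f_y = first_return(y)
    print(y, R_y, f_y)
```

Points of Y close to 1/2 are mapped close to the neutral fixed point at 0 and take many iterates to come back, which is the mechanism that produces unbounded return times in examples of this kind.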

We can now state and prove a precise version of Theorem 1.2 .

Theorem 2.9 Let

T : M \to M

be a nonuniformly expanding map satisfying (1)–(5) above. Assume moreover that the return time function

R

lies in

L^{2 + δ} (Y)

. Let

φ : M \to R

be a mean zero Hölder observation. Then

φ

satisfies the ASIP.

Proof Let

Δ = {(y, ℓ) : y \in Y, ℓ = 0, \dots, R (y) - 1}

, so

Δ

is the disjoint union of

R_{j}

copies of each

Y_{j}

. Define a measure

μ

on

Δ

by setting

μ |_{Y_{j} \times {ℓ}} = m |_{Y_{j}} / \bar{R}

Define

F : Δ \to Δ

by setting

F (y, ℓ) = (y, ℓ + 1)

for

0 \leq ℓ < R (y) - 1

and

F (y, R (y) - 1) = (f y, 0)

. Define the separation time

s : Δ \times Δ \to N

as in the previous section.

By shrinking

γ

if necessary, we may suppose that

φ

is

γ

-Hölder for the same

γ

that appears in condition (4). Define the metric

d_{θ}

on

Δ

with

θ = 1 / λ^{γ}

. It follows from condition (2) that

d (x, y) \leq diam (Y) / λ^{s (x, y)}

for all

x, y \in Y

. Hence

f

and

{Y_{j}}

separate points in

Y

and the required distortion condition on

g_{j}

is immediate, so

Δ

is a Young tower with

Λ = Y

and

Δ_{j, 0} = Y_{j}

. If

x, y

lie in the same partition element of

Δ_{ℓ}

, then write

x = F^{ℓ} x_{0}

,

y = F^{ℓ} y_{0}

and note that

d (f x_{0}, f y_{0}) \leq diam (Y) / λ^{s (x, y)}

. By condition (3),

d (x, y) \leq C d (f x_{0}, f y_{0}) \leq C diam (Y) / λ^{s (x, y)} = C diam (Y) [d_{θ} (x, y)]^{1 / γ} .

Hence, there is a constant

C^{'} \geq 1

such that

d (x, y) \leq C^{'} [d_{θ} (x, y)]^{1 / γ}

for all

x, y \in Δ

Define the projection

π : Δ \to M

by

π (y, ℓ) = T^{ℓ} y

. Then

π

is a measure-preserving semiconjugacy and it follows as above that

d (π (x), π (y))^{γ} \leq C^{''} d_{θ} (x, y)

, for all

x, y \in Δ

In particular, since

φ : (M, d) \to R

is

γ

-Hölder, it follows that

φ \circ π : (Δ, d_{θ}) \to R

is Lipschitz. By Theorem 2.8 , the ASIP holds for

φ \circ π

on

Δ

. Since

π

is a measure-preserving semiconjugacy, the ASIP holds for

φ

on

M

Remark 2.10 As already pointed out in [21] , the CLT for nonuniformly expanding maps holds under slightly weaker hypotheses using [29,Theorem 1.1] . Instead of requiring that

R \in L^{2 + δ}

, it suffices that

R \in L^{2}

Remark 2.11 The ASIP is said to be degenerate if

σ^{2} = 0

. It follows from previous work in connection with the CLT [40, 41] that the ASIPs obtained in this paper are degenerate if and only if

φ = ψ \circ T - ψ

where

ψ \in L^{2} (M)

Moreover, by a Livšic regularity result of Bruin et al. [9] , such an

L^{2}

function

ψ

has a version that is Hölder on

\cup_{j = 0}^{ℓ} T^{j} Y

for each fixed

ℓ

. (It is easy to construct examples where the ASIP is degenerate but

ψ

does not have a version that is continuous on the whole of

M

.) In particular, if

T

has a periodic point

x \in Y

of period

k

and

\sum_{j = 0}^{k - 1} φ (T^{j} x) \neq 0

, then the ASIP is nondegenerate.

Nonuniformly expanding semiflows

We continue to assume that

T : M \to M

is a nonuniformly expanding map satisfying conditions (1)–(5). Suppose that

h : M \to R^{+}

lies in

L^{1} (M)

. Regarding

h

as a roof function, we form the suspension

M^{h} = {(x, u) \in M \times [0, \infty) : 0 \leq u \leq h (x)} / \sim

where

(x, h (x)) \sim (T x, 0)

. The suspension semiflow

T_{t} : M^{h} \to M^{h}

is given by

T_{t} (x, u) = (x, u + t)

computed modulo identifications. We call

T_{t} : M^{h} \to M^{h}

a nonuniformly expanding semiflow. We say that an observation

ψ : M^{h} \to R

is Hölder if

ψ

is bounded and

{sup}_{(x, u) \neq (y, u)} | ψ (x, u) - ψ (y, u) | / d (x, y) < \infty
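
The phrase "computed modulo identifications" in the definition of the suspension semiflow can be unpacked with a short sketch. The base map, the roof function and the name `suspension_flow` are hypothetical choices for illustration; exact fractions are used so that the checks do not depend on rounding, and the roof is assumed to be bounded below so that the loop terminates.

```python
from fractions import Fraction


def suspension_flow(T, h, x, u, t):
    """Flow the point (x, u) forward by time t >= 0 in the suspension over T
    with roof function h. Heights are kept in [0, h(x)); reaching the roof
    at (x, h(x)) moves the point to (T(x), 0)."""
    u += t
    while u >= h(x):
        u -= h(x)      # use up the remaining height above x ...
        x = T(x)       # ... then restart at the bottom above T(x)
    return x, u


T = lambda x: (2 * x) % 1      # hypothetical base map (doubling map)
h = lambda x: 1 + x            # hypothetical roof function, bounded below by 1

start = Fraction(1, 3)

# Flowing for exactly the roof height lands on (T(x), 0).
print(suspension_flow(T, h, start, Fraction(0), h(start)))

# Semigroup property: flowing by 1 and then by 3/2 equals flowing by 5/2.
mid = suspension_flow(T, h, start, Fraction(0), Fraction(1))
print(suspension_flow(T, h, *mid, Fraction(3, 2)))
print(suspension_flow(T, h, start, Fraction(0), Fraction(5, 2)))
```

The same bookkeeping underlies the reduction in the proof of the corollary that follows: time spent along the flow is the sum of the roof function along the base orbit, plus a remainder bounded by one roof height.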

Corollary 2.12 Let

T_{t} : M^{h} \to M^{h}

be a nonuniformly expanding semiflow.

Assume moreover that the return time function

R

lies in

L^{2 + δ} (Y)

and that the roof function

h : M \to R^{+}

is Hölder. Let

ψ : M^{h} \to R

be a mean zero Hölder observation. Then

ψ

satisfies the ASIP. That is, there exists

ε > 0

, a family of random variables

{S_{t}}

and a Brownian motion

W

with variance

σ^{2} \geq 0

such that

{\int_{0}^{t} ψ \circ T_{s} d s} =_{d} {S_{t}}

, and

S_{t} = W (t) + O (t^{\frac{1}{2} - ε})

as

t \to \infty

, almost everywhere.

Proof According to [29,Theorem 4.2] (Theorem B.1 ), it suffices that (i)

h \in L^{2 + δ} (Y)

, (ii)

φ (x) = \int_{0}^{h (x)} ψ (x, u) d u

satisfies the ASIP on

Y

, and (iii)

h

satisfies the ASIP on

Y

. Hence, the result is immediate from Theorem 2.9 .

Remark 2.13 We have not striven for greatest generality in the statements of Theorem 2.9 and Corollary 2.12 . However, it is clear from the proof that in Theorem 2.9 we can relax the assumption that

φ

is Hölder. It is sufficient that

φ

is such that

Φ (x) = \sum_{ℓ = 0}^{R (x) - 1} φ (T^{ℓ} x)

lies in the space of weighted Lipschitz observations in Subsection (b) for an appropriate choice of weight function. Taking the weight function to be the return time function, it suffices that

φ

is Hölder on

T^{ℓ} Y_{j}

for all

j \geq 1

,

0 \leq ℓ < R (j) - 1

, with

L^{\infty}

norm and Hölder constant independent of

j, ℓ

Similarly, the hypotheses that

ψ

and

h

are Hölder can be weakened in Corollary 2.12 . For example, provided

ψ

is Hölder, it suffices that

h

is Hölder on

T^{ℓ} Y_{j}

for all

j \geq 1

,

0 \leq ℓ < R (j) - 1

, with

L^{\infty}

norm and Hölder constant independent of

j, ℓ

3 Nonuniformly hyperbolic systems

In this section, we show how to prove the ASIP for Lipschitz observations of a dynamical system that is nonuniformly hyperbolic in the sense of Young [40] . Instead of using the original set up, we make four assumptions (A1)–(A4) that are distilled from those in [40] . In doing so, we bypass the differential structure, and certain conclusions in [40] become assumptions here, particularly (A4) below.

Let

T : M \to M

be a diffeomorphism (possibly with singularities) defined on a Riemannian manifold

(M, d)

. We assume from the start that

T

preserves a “nice” probability measure

m

(one of the conclusions in Young [40] is that

m

is an SRB measure). Assumption (A4) contains the properties of

m

that we require for the ASIP. We fix a subset

Λ \subset M

and a family of subsets of

M

that we call “stable disks”

{W^{s}}

that are disjoint and cover

Λ

. If

x

lies in a stable disk, we label the disk

W^{s} (x)

(A1) There is a partition ${Λ_{j}}$ of $Λ$ and integers $R_{j} \geq 1$ such that for all $x \in Λ_{j}$ we have $T^{R_{j}} (W^{s} (x)) \subset W^{s} (T^{R_{j}} x)$ .

Define the return time function

R : Λ \to Z^{+}

by

R |_{Λ_{j}} = R_{j}

and the induced map

f : Λ \to Λ

by

f (x) = T^{R (x)} (x)

. Form the discrete suspension map

F : Δ \to Δ

where

F (x, ℓ) = (x, ℓ + 1)

for

ℓ < R (x) - 1

and

F (x, R (x) - 1) = (f x, 0)

. We define a separation time

s : Λ \times Λ \to N

by defining

s (x, x^{'})

to be the greatest integer

n \geq 0

such that

f^{k} x, f^{k} x^{'}

lie in the same partition element of

Λ

for

k = 0, \dots, n

. (If

x, x^{'}

do not lie in the same partition element, then we take

s (x, x^{'}) = 0

.) For general points

p = (x, ℓ), p^{'} = (x^{'}, ℓ^{'}) \in Δ

, define

s (p, p^{'}) = s (x, x^{'})

if

ℓ = ℓ^{'}

and

s (p, p^{'}) = 0

otherwise.

This defines a separation time

s : Δ \times Δ \to N

. We have the projection

π : Δ \to M

given by

π (x, ℓ) = T^{ℓ} x

and satisfying

π F = T π

(A2) There is a distinguished subset or “unstable leaf” $W^{u} \subset Λ$ such that each stable disk intersects $W^{u}$ in precisely one point, and there exist constants $C \geq 1$ , $α \in (0, 1)$ such that
- (i) $d (T^{n} x, T^{n} y) \leq C α^{n}$ , for all $y \in W^{s} (x)$ , all $n \geq 0$ , and
- (ii) $d (T^{n} x, T^{n} y) \leq C α^{s (x, y)}$ for all $x, y \in W^{u}$ and all $0 \leq n < R$ .

Remark 3.1 We note that Young [40] uses a separation time

s_{0}

defined in terms of the underlying diffeomorphism

T : M \to M

whereas our separation time

s

is defined in terms of the induced map

f : Λ \to Λ

. In particular, [40, conditions (iii) and (iv), p. 589] guarantee that

s_{0} \geq s

and moreover that

s_{0} - (R - 1) \geq s

. Hence [40,assumption (P4)(a)] (

d (T^{n} x, T^{n} y) \leq C α^{s_{0} (x, y) - n}

for

0 \leq n < s_{0} (x, y)

) implies our assumption (A2)(ii).

There is also a separation time in [40] that is denoted

s

. This is different from our separation time and plays no role in this paper.

Let

\bar{Λ} = Λ / \sim

where

x \sim x^{'}

if

x \in W^{s} (x^{'})

. Similarly, define the partition

{{\bar{Λ}}_{j}}

of

\bar{Λ}

. We obtain a well-defined return time function

R : \bar{Λ} \to Z^{+}

and induced map

f : \bar{Λ} \to \bar{Λ}

. Let

F : \bar{Δ} \to \bar{Δ}

denote the corresponding suspension map. We note that this can be viewed as the quotient of

F : Δ \to Δ

where

(x, ℓ)

is identified with

(x^{'}, ℓ^{'})

if

ℓ = ℓ^{'}

and

x^{'} \in W^{s} (x)

. Let

\bar{π} : Δ \to \bar{Δ}

denote the natural projection.

The separation time on

Δ

drops down to a separation time on

\bar{Δ}

(and agrees with the natural separation time defined using

f : \bar{Λ} \to \bar{Λ}

and the partition

{{\bar{Λ}}_{j}}

).

(A3) The map $f : \bar{Λ} \to \bar{Λ}$ and partition ${{\bar{Λ}}_{j}}$ separate points in $\bar{Λ}$ .

It follows that

d_{θ} (p, q) = θ^{s (p, q)}

defines a metric on

\bar{Δ}

for each

θ \in (0, 1)

(A4) There exist $F$ -invariant probability measures $\tilde{m}$ on $Δ$ and $\bar{m}$ on $\bar{Δ}$ such that
- (i) $π : Δ \to M$ and $\bar{π} : Δ \to \bar{Δ}$ are measure-preserving ( $π$ takes $\tilde{m}$ to $m$ and $\bar{π}$ takes $\tilde{m}$ to $\bar{m}$ ); and
- (ii) $F : \bar{Δ} \to \bar{Δ}$ is a Young tower (in the sense of section 2 (d)).

We say that an observation $ψ : Δ \to \mathbb{R}$ depends only on future coordinates if $ψ (p) = ψ (q)$ whenever $p \sim q$, where $\sim$ is the equivalence relation on $Δ$ arising from quotienting along stable disks. Such an observation drops down to an observation $ψ : \bar{Δ} \to \mathbb{R}$. The following result shows that any Hölder observation on $M$ is related to a Lipschitz observation on $\bar{Δ}$ (cf. [37, 8]).

Lemma 3.2 Suppose that $φ : M \to \mathbb{R}$ is $γ$-Hölder with respect to the metric $d$. Then there exist functions $ψ, χ : Δ \to \mathbb{R}$ such that

(i) $φ \circ π = ψ + χ - χ \circ F$,
(ii) $χ$ is bounded,
(iii) $ψ$ depends only on future coordinates,
(iv) $ψ : \bar{Δ} \to \mathbb{R}$ is Lipschitz with respect to the metric $d_{θ}$, for $θ = α^{γ / 2}$.

Proof Given $p = (x, ℓ) \in Δ$, define $\hat{p} = (\hat{x}, ℓ)$ where $\hat{x}$ is the unique point in $W^{s} (x) \cap W^{u}$ (see (A2)). Define

$$
χ (p) = \sum_{j = 0}^{\infty} \bigl[ φ (π F^{j} p) - φ (π F^{j} \hat{p}) \bigr] .
$$

Note that $π F^{j} p = T^{j} π p = T^{j + ℓ} x$ and similarly $π F^{j} \hat{p} = T^{j + ℓ} \hat{x}$. Since $x$ and $\hat{x}$ lie in the same stable disk $W^{s}$, it follows from (A2)(i) that

$$
\begin{aligned}
| χ (p) | & \leq \sum_{j = 0}^{\infty} | φ (π F^{j} p) - φ (π F^{j} \hat{p}) | \leq | φ |_{γ} \sum_{j = 0}^{\infty} d (T^{j + ℓ} x, T^{j + ℓ} \hat{x})^{γ} \\
& \leq | φ |_{γ} C^{γ} \sum_{j = 0}^{\infty} α^{j γ} = | φ |_{γ} C^{γ} (1 - α^{γ})^{- 1} .
\end{aligned}
$$

Define $ψ = φ \circ π - χ + χ \circ F$. Then

$$
ψ (p) = \sum_{j = 0}^{\infty} φ (π F^{j} \hat{p}) - φ (π F^{j} \hat{F p})
$$

depends only upon future coordinates. It remains to check that $ψ$ is Lipschitz with respect to the metric $d_{θ}$. In fact, we prove that $ψ$ is Lipschitz with respect to $d_{θ^{1 / 2}}$ where $θ = α^{γ}$.
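The formula for $ψ$ can be checked by a short telescoping computation. The following is a sketch rather than part of the original argument: it uses only the definitions of $χ$ and $ψ$ together with the identity $π F^{j} (F p) = π F^{j + 1} p$, and it combines the two absolutely convergent series for $χ (p)$ and $χ (F p)$ term by term.

$$
\begin{aligned}
ψ (p) & = φ (π p) - \sum_{j = 0}^{\infty} \bigl[ φ (π F^{j} p) - φ (π F^{j} \hat{p}) \bigr] + \sum_{j = 0}^{\infty} \bigl[ φ (π F^{j + 1} p) - φ (π F^{j} \hat{F p}) \bigr] \\
& = φ (π \hat{p}) + \sum_{j = 1}^{\infty} \bigl[ φ (π F^{j} \hat{p}) - φ (π F^{j - 1} \hat{F p}) \bigr] .
\end{aligned}
$$

In the second line the $j = 0$ term of the first series cancels $φ (π p)$ and leaves $φ (π \hat{p})$, while the terms $φ (π F^{j} p)$ with $j \geq 1$ cancel in pairs. The right-hand side involves $p$ only through $\hat{p}$ and $\hat{F p}$, and this pairing of $F^{j} \hat{p}$ with $F^{j - 1} \hat{F p}$ is the one that appears in the tail terms of (3.1).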

For any $N \geq 1$ and $p, q \in Δ$,

$$
\begin{aligned}
| ψ (p) - ψ (q) | \leq {} & \sum_{j = 0}^{N} | φ (π F^{j} \hat{p}) - φ (π F^{j} \hat{q}) | + \sum_{j = 0}^{N - 1} | φ (π F^{j} \hat{F p}) - φ (π F^{j} \hat{F q}) | \\
& + \sum_{j = N + 1}^{\infty} | φ (π F^{j} \hat{p}) - φ (π F^{j - 1} \hat{F p}) | + \sum_{j = N + 1}^{\infty} | φ (π F^{j} \hat{q}) - φ (π F^{j - 1} \hat{F q}) | .
\end{aligned}
\tag{3.1}
$$

Suppose that $d_{θ} (p, q) = d_{θ} (\hat{p}, \hat{q}) \approx θ^{2 N}$. We show that each of these four terms is bounded by $θ^{N} \approx d_{θ^{1 / 2}} (p, q)$ up to a constant.

Starting with the third term in (3.1), we note that $F \hat{p} = \hat{F p}$ unless $p = (x, R (x) - 1)$, in which case $F \hat{p} = (f \hat{x}, 0)$ and $\hat{F p} = (\hat{f x}, 0)$. Then $π F^{j} \hat{p} = T^{j - 1} (f \hat{x})$ and $π F^{j - 1} \hat{F p} = T^{j - 1} (\hat{f x})$. Since $f \hat{x}$ and $\hat{f x}$ lie in the same stable disk $W^{s}$, we have

$$
| φ (π F^{j} \hat{p}) - φ (π F^{j - 1} \hat{F p}) | \leq | φ |_{γ} C^{γ} α^{(j - 1) γ}
$$

so that

$$
\sum_{j = N + 1}^{\infty} | φ (π F^{j} \hat{p}) - φ (π F^{j - 1} \hat{F p}) | \leq C' θ^{N}
$$

as required. Similarly for the fourth term in (3.1).
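For the record, the constant $C'$ in the tail estimate can be made explicit by summing the geometric series. A sketch of the computation, with $θ = α^{γ}$ as above:

$$
\sum_{j = N + 1}^{\infty} | φ |_{γ} C^{γ} α^{(j - 1) γ} = | φ |_{γ} C^{γ} \sum_{i = N}^{\infty} θ^{i} = \frac{| φ |_{γ} C^{γ}}{1 - θ} \, θ^{N} ,
$$

so one admissible choice is $C' = | φ |_{γ} C^{γ} (1 - θ)^{- 1}$.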

Next, we consider the first term in (3.1). By assumption, $s (p, q) \approx 2 N$ so separation does not take place during the calculation. Write $p = (x, ℓ)$ and $q = (y, ℓ)$. Then $π F^{j} \hat{p} = T^{j + ℓ} \hat{x} = T^{L} f^{J} \hat{x}$ where $J \leq j$ and $L < R (f^{J} \hat{x})$. Similarly, $π F^{j} \hat{q} = T^{L} f^{J} \hat{y}$. Hence by (A2)(ii),

$$
\begin{aligned}
| φ (π F^{j} \hat{p}) - φ (π F^{j} \hat{q}) | & \leq | φ |_{γ} d (T^{L} f^{J} \hat{x}, T^{L} f^{J} \hat{y})^{γ} \leq | φ |_{γ} C^{γ} α^{s (f^{J} \hat{x}, f^{J} \hat{y}) γ} \\
& = | φ |_{γ} C^{γ} α^{[s (\hat{x}, \hat{y}) - J] γ} \leq | φ |_{γ} C^{γ} α^{[s (\hat{x}, \hat{y}) - j] γ} \approx | φ |_{γ} C^{γ} θ^{2 N - j} ,
\end{aligned}
$$

so that

$$
\sum_{j = 0}^{N} | φ (π F^{j} \hat{p}) - φ (π F^{j} \hat{q}) | \leq C' θ^{N}
$$

as required. Similarly for the second term in (3.1).
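The last summation is again a geometric series. As a sketch, reading $\approx$ as equality up to a constant and substituting $k = N - j$:

$$
\sum_{j = 0}^{N} θ^{2 N - j} = θ^{N} \sum_{k = 0}^{N} θ^{k} \leq \frac{θ^{N}}{1 - θ} ,
$$

which gives the stated bound with $C'$ a constant multiple of $| φ |_{γ} C^{γ} (1 - θ)^{- 1}$.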

Remark 3.3 Although Lemma 3.2 is modelled on the treatments in [8, 31], we have not defined a metric on $Δ$ and hence the usual regularity statement about $χ$ is missing.

Theorem 3.4 Suppose that $T : M \to M$ satisfies (A1)–(A4) and assume that $R \in L^{2 + δ} (Λ)$ for some $δ > 0$. Let $φ : M \to \mathbb{R}$ be a mean zero Hölder observation. Then $φ$ satisfies the ASIP.

Proof Since $π : Δ \to M$ is measure preserving, it suffices to prove the ASIP for the lift $\tilde{φ} = φ \circ π : Δ \to \mathbb{R}$. By Lemma 3.2, there exists $ψ : Δ \to \mathbb{R}$ depending only on future coordinates such that $\tilde{φ}_{N} - ψ_{N}$ is uniformly bounded, and it suffices to prove the ASIP for $ψ$. Since the projection $\bar{π} : Δ \to \bar{Δ}$ is measure preserving, it suffices to prove the ASIP for $ψ$ at the level of $\bar{Δ}$. Finally, Lemma 3.2 guarantees that $ψ : \bar{Δ} \to \mathbb{R}$ is Lipschitz with respect to $d_{θ}$, so it suffices to prove the ASIP for Lipschitz observations on $\bar{Δ}$, which is a Young tower by (A4)(ii). Now apply Theorem 2.8.

Nonuniformly hyperbolic flows

Given an $L^{1}$ roof function $h : M \to \mathbb{R}^{+}$, we define a suspension flow $T_{t} : M^{h} \to M^{h}$ in the same way that we defined the semiflow in Section 2 (e). If $T : M \to M$ satisfies (A1)–(A4), we say that $T_{t} : M^{h} \to M^{h}$ is a nonuniformly hyperbolic flow.

Corollary 3.5 Let $T_{t} : M^{h} \to M^{h}$ be a nonuniformly hyperbolic flow. Assume moreover that the return time function $R$ lies in $L^{2 + δ} (Y)$ and that the roof function $h : M \to \mathbb{R}^{+}$ is Hölder. Let $ψ : M^{h} \to \mathbb{R}$ be a mean zero Hölder observation. Then $ψ$ satisfies the ASIP.

Proof This follows immediately from Theorem 3.4, applying Theorem B.1.

Remark 3.6 The weakened hypotheses mentioned in Remark 2.13 apply equally in the nonuniformly hyperbolic setting.

4 Applications

In this section, we indicate a wide range of applications to which the results in this paper apply.

We begin with nonuniformly expanding systems that can be modelled by a Young tower as in Section 2. In the literature it is standard to speak of return time asymptotics in the form $m \{y \in Y : R (y) \geq n\} = O (n^{- γ})$. (Recall from Section 2 that $Y$ is the subset used for inducing, equivalently the base of the Young tower.)

Proposition 4.1 If $m \{R \geq n\} = O (n^{- γ})$ for some $γ > 2$, then $R \in L^{2 + δ} (Y)$ for $δ \in (0, γ - 2)$.

Proof This is immediate from the inequality

$$
E [R^{2 + δ}] \leq \sum_{n = 0}^{\infty} m \{R^{2 + δ} \geq n\} = \sum_{n = 0}^{\infty} m \{R \geq n^{\frac{1}{2 + δ}}\} .
$$
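To spell out why the right-hand side is finite, here is a sketch of the remaining step. Write $K$ for the implied constant in the hypothesis $m \{R \geq n\} = O (n^{- γ})$ (the symbol $K$ is introduced only for this computation). The $n = 0$ term is at most $1$, and for $n \geq 1$,

$$
m \{R \geq n^{\frac{1}{2 + δ}}\} \leq K n^{- \frac{γ}{2 + δ}} , \qquad \sum_{n = 1}^{\infty} n^{- \frac{γ}{2 + δ}} < \infty \iff \frac{γ}{2 + δ} > 1 \iff δ < γ - 2 .
$$

This is exactly the range $δ \in (0, γ - 2)$ in the statement, and it is nonempty because $γ > 2$.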

Many maps satisfy the condition in Proposition 4.1:

(i) the Alves-Viana map [3] $T : S^{1} \times I \to S^{1} \times I$,

$$
T (ω, x) = (16 ω, a - x^{2} + ε \sin (2 π ω))
$$

when $0$ is preperiodic for the map $x \mapsto a - x^{2}$ and $ε$ is small enough.

(ii) the Liverani-Saussol-Vaienti (Pomeau-Manneville) maps [27] $T : [0, 1] \to [0, 1]$,

$$
T x = \begin{cases} x (1 + 2^{α} x^{α}) & 0 \leq x < \frac{1}{2} \\ 2 x - 1 & \frac{1}{2} \leq x < 1 \end{cases}
$$

for $0 < α < \frac{1}{2}$.

(iii) certain classes of multimodal maps, Bruin et al. [10].

(iv) a class of expanding circle maps $T : S^{1} \to S^{1}$ of degree $d > 1$ with a neutral fixed point, Young [41, Section 6]: $T$ is $C^{1}$ on $S^{1}$ and $C^{2}$ on $S^{1} - \{0\}$, $T' > 1$ on $S^{1} - \{0\}$, $T (0) = 0$, $T' (0) = 1$, and for $x \neq 0$, $- x T'' (x) \simeq | x |^{α}$ for $0 < α < \frac{1}{2}$.
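As a concrete companion to example (ii), the following is a minimal sketch of the Liverani-Saussol-Vaienti map and of a Birkhoff sum $φ_{N} = \sum_{j = 0}^{N - 1} φ \circ T^{j}$ along a single orbit. The function names `lsv_map` and `birkhoff_sum` are hypothetical, and the endpoint convention $T (1) = 1$ is an assumption added so that rounding to $1.0$ does not leave the domain. Floating-point orbits are only a rough proxy for true orbits, so this illustrates the definitions and is not a numerical verification of the ASIP.

```python
def lsv_map(x, alpha):
    """Liverani-Saussol-Vaienti map on [0, 1] with parameter alpha.

    Left branch:  x * (1 + 2**alpha * x**alpha)  for 0 <= x < 1/2
    Right branch: 2*x - 1                         for 1/2 <= x <= 1
    (including x = 1 in the right branch is an assumption of this sketch).
    """
    if not 0.0 <= x <= 1.0:
        raise ValueError("x must lie in [0, 1]")
    if x < 0.5:
        return x * (1.0 + (2.0 * x) ** alpha)
    return 2.0 * x - 1.0


def birkhoff_sum(phi, x, alpha, n):
    """Return phi(x) + phi(T x) + ... + phi(T^(n-1) x) for the LSV map T."""
    total = 0.0
    for _ in range(n):
        total += phi(x)
        x = lsv_map(x, alpha)
    return total
```

For example, `birkhoff_sum(lambda x: x - 0.5, 0.3, 0.25, 1000)` accumulates the observation $x - \frac{1}{2}$ along the first 1000 points of the orbit of $0.3$ for $α = 0.25$; note that this observation is not centred with respect to the invariant measure, so it would need to be recentred before comparing with the mean zero setting of the text.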

Applying Theorem 1.2, we obtain the ASIP for Hölder observations for the systems in (i)–(iv) above. For example, in (iii) and (iv) we obtain the ASIP under the same conditions for which [10] and [41] obtain the CLT.

Next, we recall examples of nonuniformly hyperbolic systems that have been modelled by towers. Consider the following classes of $C^{1 + ε}$ diffeomorphisms treated in Young [40] (see also Baladi [4, §4.3]):

(v) Lozi maps and certain piecewise hyperbolic maps [40, 13].

(vi) a class of Hénon maps [6, 7] .

(vii) some partially hyperbolic diffeomorphisms with a mostly contracting direction [12, 18] .

In these examples, the return time asymptotics are exponential so certainly $R \in L^{2 + δ}$. By Theorem 3.4, we obtain the ASIP for Hölder observations for the systems in (v)–(vii) above.

Billiard maps and Lorentz flows

Finally, we consider the application to the planar periodic Lorentz gas discussed in the introduction. Under the finite horizon condition, Young [40] demonstrated that the billiard map (which is the Poincaré map for the flow) is nonuniformly hyperbolic with exponential return time asymptotics. As a result, Young established exponential decay of correlations for such billiard maps, resolving a long-standing (and controversial) open question. Chernov [14] extended Young's method to obtain the same result for infinite horizons.

For our purposes, the weaker conclusion that $R \in L^{2 + δ}$ is again sufficient. Hence, by the results in [14, 40], the first statement of Theorem 1.3 is an immediate consequence of Theorem 3.4.

For the flow itself, the finite horizon condition is crucial since even the CLT is unlikely in the infinite horizon case. Assuming finite horizons, the roof function $h$ is uniformly bounded and piecewise Hölder. Since $h$ is not uniformly Hölder, Corollary 3.5 does not apply directly, but the result is easily modified as in Remarks 2.13 and 3.6 to include such roof functions. Hence, we obtain the second statement of Theorem 1.3.

A ASIP for functions of mixing sequences

Here is a special case of Philipp & Stout [32, Theorem 7.1] adapted to dynamical systems terminology. The notation is as in Section 2 (c).

Theorem A.1 (Philipp & Stout) Assume that there exist $δ \in (0, 2]$, $σ^{2} > 0$ and $C > 0$ such that for all $k, N \geq 1$,

(i) $v \in L^{2 + δ} (Λ)$ and $\int_{Λ} v \, d m = 0$,
(ii) $\int_{Λ} v_{N}^{2} \, d m = σ^{2} N + O (N^{1 - δ / 30})$,
(iii) $\sum_{a \in α_{0}^{k - 1}} \int_{a} | v - \frac{1}{m (a)} \int_{a} v \, d m |^{2 + δ} \, d m \leq C k^{- (2 + 7 / δ) (2 + δ)}$,
(iv) $| m (a \cap f^{- (N + k)} (b)) - m (a) m (b) | \leq C N^{- 168 (1 + 2 / δ)}$ for all $a \in α_{0}^{k - 1}$ and all measurable sets $b$.

Then $v_{N} = W (N) + O (N^{1 / 2 - δ / 600})$.

B ASIP for suspensions

Suppose that $(Λ, m)$ is a probability space and that $f : Λ \to Λ$ is a measure-preserving transformation. Let $h : Λ \to \mathbb{R}^{+}$ be a roof function and suppose that $f_{t} : Λ^{h} \to Λ^{h}$ is the corresponding suspension (semi)flow as in Section 2 (e). The following result is a special case of [30, Theorem 4.2].

Theorem B.1 (Melbourne & Török) Let $δ > 0$. Suppose that $h \in L^{2 + δ} (Λ)$ and that $\sum_{j = 0}^{N - 1} h \circ f^{j} = N \bar{h} + o (N^{1 - δ})$ as $N \to \infty$ almost everywhere.

Suppose that $ψ : Λ^{h} \to \mathbb{R}$ lies in $L^{\infty} (Λ^{h})$ and has mean zero. Define $φ : Λ \to \mathbb{R}$ by $φ (x) = \int_{0}^{h (x)} ψ (f_{t} x) \, d t$. If $φ$ satisfies the ASIP on $Λ$ with variance $σ_{1}^{2}$, then $ψ$ satisfies the ASIP on $Λ^{h}$ with variance $σ^{2} = σ_{1}^{2} / \bar{h}$.

Theorem B.1 is easily modified for discrete suspensions. Let $R : Λ \to \mathbb{Z}^{+}$ be an $L^{1}$ return time function and form the discrete suspension map $F : Δ \to Δ$ as in Section 2 (d).

Corollary B.2 Let $δ > 0$. Suppose that $R \in L^{2 + δ} (Λ)$ and that $\sum_{j = 0}^{N - 1} R \circ f^{j} = N \bar{R} + o (N^{1 - δ})$ as $N \to \infty$ almost everywhere.

Suppose that $φ : Δ \to \mathbb{R}$ lies in $L^{\infty} (Δ)$ and has mean zero. Define $Φ : Λ \to \mathbb{R}$ by $Φ (x) = \sum_{j = 0}^{R (x) - 1} φ (f^{j} x)$. If $Φ$ satisfies the ASIP on $Λ$ with variance $σ_{1}^{2}$, then $φ$ satisfies the ASIP on $Δ$ with variance $σ^{2} = σ_{1}^{2} / \bar{R}$.
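The passage from the tower observation to the induced observation, and the rescaling of the variance, can be sketched in a few lines. This is an illustration under stated assumptions, not the authors' construction: the summand in the definition of $Φ$ is read here as the value of $φ$ at level $j$ of the tower above $x$, that is $φ (x, j)$, which is an interpretation made for this example, and the names `induced_observable` and `tower_variance` are hypothetical.

```python
def induced_observable(phi, R):
    """Build Phi(x) = sum_{j=0}^{R(x)-1} phi(x, j) from a tower observable.

    phi: function of (x, level) on the tower (interpretation assumed here)
    R:   return time function, returning a positive integer for each x
    """
    def Phi(x):
        return sum(phi(x, level) for level in range(R(x)))
    return Phi


def tower_variance(sigma1_sq, R_bar):
    """Variance on the tower: sigma^2 = sigma_1^2 / R_bar."""
    if R_bar <= 0:
        raise ValueError("mean return time must be positive")
    return sigma1_sq / R_bar
```

For instance, with constant return time `R = lambda x: 3` and `phi = lambda x, level: level`, the induced observable takes the value $0 + 1 + 2 = 3$ at every base point.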

Acknowledgements

This research was supported in part by EPSRC Grant GR/S11862/01. IM is greatly indebted to UH for the use of e-mail, given that pine is currently not supported on the University of Surrey network.

References

J. Aaronson. An Introduction to Infinite Ergodic Theory. Math. Surveys and Monographs 50, Amer. Math. Soc., 1997.
J. Aaronson and M. Denker. Local limit theorems for partial sums of stationary sequences generated by Gibbs-Markov maps. Stoch. Dyn. 1 (2001) 193–237.
J. Alves, S. Luzzatto and V. Pinheiro. Markov structures and decay of correlations for non-uniformly expanding dynamical systems. Ann. Inst. H. Poincaré, Anal. Non Linéaire. To appear.
V. Baladi. Positive Transfer Operators and Decay of Correlations. Advanced Series in Nonlinear Dynamics 16, World Scientific, Singapore, 2000.
V. Baladi. Decay of correlations. Smooth Ergodicity Theory and its Applications (A. Katok et al., ed.), Proc. Symp. Pure Math. 69, Amer. Math. Soc., 2001, pp. 297–325.
M. Benedicks and L.-S. Young. Absolutely continuous invariant measures and random perturbations for certain one-dimensional maps. Ergod. Th. & Dynam. Sys. 12 (1992) 13–37.
M. Benedicks and L.-S. Young. Sinai-Bowen-Ruelle measures for certain Hénon maps. Invent. Math. 112 (1993) 541–576.
R. Bowen. Equilibrium States and the Ergodic Theory of Anosov Diffeomorphisms. Lecture Notes in Math. 470, Springer, Berlin, 1975.
H. Bruin, M. Holland and M. Nicol. Livsic regularity for Markov systems. Preprint.
H. Bruin, S. Luzzatto and S. van Strien. Decay of correlations in one-dimensional dynamics. Ann. Sci. École Norm. Sup. 36 (2003) 621–646.
L. A. Bunimovich, Y. G. Sinaĭ and N. I. Chernov. Statistical properties of two-dimensional hyperbolic billiards. Uspekhi Mat. Nauk 46 (1991) 43–92.
A. Castro. Backward inducing and exponential decay of correlations for partially hyperbolic attractors with mostly contracting direction. Ph. D. Thesis, IMPA (1998).
N. Chernov. Statistical properties of piecewise smooth hyperbolic systems in high dimensions. Discrete Contin. Dynam. Systems 5 (1999) 425–448.
N. Chernov. Decay of correlations and dispersing billiards. J. Statist. Phys. 94 (1999) 513–556.
N. Chernov and L. S. Young. Decay of correlations for Lorentz gases and hard balls. Hard ball systems and the Lorentz gas. Encyclopaedia Math. Sci. 101, Springer, Berlin, 2000, pp. 89–120.
J.-P. Conze and S. Le Borgne. Méthode de martingales et flot géodésique sur une surface de courbure constante négative. Ergod. Th. & Dynam. Sys. 21 (2001) 421–441.
M. Denker and W. Philipp. Approximation by Brownian motion for Gibbs measures and flows under a function. Ergod. Th. & Dynam. Sys. 4 (1984) 541–552.
D. Dolgopyat. On dynamics of mostly contracting diffeomorphisms. Commun. Math. Phys. 213 (2000) 181–201.
M. J. Field, I. Melbourne and A. Török. Decay of correlations, central limit theorems and approximation by Brownian motion for compact Lie group extensions. Ergod. Th. & Dynam. Sys. 23 (2003) 87–110.
M. I. Gordin. The central limit theorem for stationary processes. Soviet Math. Dokl. 10 (1969) 1174–1176.
S. Gouëzel. Statistical properties of a skew product with a curve of neutral points. Preprint, 2004.
S. Gouëzel. Vitesse de décorrélation et théorèmes limites pour les applications non uniformément dilatantes. Ph. D. Thesis, Ecole Normale Supérieure, 2004.
H. Hennion. Sur un théorème spectral et son application aux noyaux lipchitziens. Proc. Amer. Math. Soc. 118 (1993) 627–634.
F. Hofbauer and G. Keller. Ergodic properties of invariant measures for piecewise monotonic transformations. Math. Z. 180 (1982) 119–140.
G. Keller. Un théorème de la limite centrale pour une classe de transformations monotones par morceaux. C. R. Acad. Sci. Paris 291 (1980) 155–158.
C. Liverani. Central limit theorem for deterministic systems. International Conference on Dynamical Systems (F. Ledrappier, J. Lewowicz and S. Newhouse, eds.), Pitman Research Notes in Math. 362, Longman Group Ltd, Harlow, 1996, pp. 56–75.
C. Liverani, B. Saussol and S. Vaienti. A probabilistic approach to intermittency. Ergodic Theory and Dynamical Systems 19 (1999) 671–685.
I. Melbourne and M. Nicol. Statistical properties of endomorphisms and compact group extensions. J. London Math. Soc. 70 (2004) 427–446.
I. Melbourne and A. Török. Statistical limit theorems for suspension flows. Israel J. Math. 144 (2004) 191–210.
I. Melbourne and A. Török. Central limit theorems and invariance principles for time-one maps of hyperbolic flows. Commun. Math. Phys. 229 (2002) 57–71.
W. Parry and M. Pollicott. Zeta Functions and the Periodic Orbit Structure of Hyperbolic Dynamics. Astérisque 187-188, Société Mathématique de France, Montrouge, 1990.
W. Philipp and W. F. Stout. Almost Sure Invariance Principles for Partial Sums of Weakly Dependent Random Variables. Memoirs of the Amer. Math. Soc. 161, Amer. Math. Soc., Providence, RI, 1975.
M. Pollicott and R. Sharp. Invariance principles for interval maps with an indifferent fixed point. Commun. Math. Phys. 229 (2002) 337–346.
M. Ratner. The central limit theorem for geodesic flows on $n$ -dimensional manifolds of negative curvature. Israel J. Math. 16 (1973) 181–197.
D. Ruelle. Thermodynamic Formalism. Encyclopedia of Math. and its Applications 5, Addison Wesley, Massachusetts, 1978.
Y. G. Sinaĭ. Dynamical systems with elastic reflections. Ergodic properties of dispersing billiards. Uspehi Mat. Nauk 25 (1970) 141–192.
Y. G. Sinaĭ. Gibbs measures in ergodic theory. Russ. Math. Surv. 27 (1972) 21–70.
M. Viana. Stochastic dynamics of deterministic systems. Col. Bras. de Matemática, 1997.
C. P. Walkden. Invariance principles for iterated maps that contract on average. Preprint, 2003.
L.-S. Young. Statistical properties of dynamical systems with some hyperbolicity. Ann. of Math. 147 (1998) 585–650.
L.-S. Young. Recurrence times and rates of mixing. Israel J. Math. 110 (1999) 153–188.

Almost Sure Invariance Principle for Nonuniformly Hyperbolic Systems

Ian Melbourne Department of Maths and Stats University of Surrey Guildford GU2 7XH, UK

Matthew Nicol Department of Maths University of Houston Houston TX 77204-3008, USA

23 September, 2004. Revised 8 February, 2005.