Almost Sure Invariance Principle for Nonuniformly Hyperbolic Systems

Ian Melbourne Department of Maths and Stats University of Surrey Guildford GU2 7XH, UK

Matthew Nicol Department of Maths University of Houston Houston TX 77204-3008, USA

23 September, 2004. Revised 8 February, 2005.

Abstract
We prove an almost sure invariance principle that is valid for general classes of nonuniformly expanding and nonuniformly hyperbolic dynamical systems. Discrete time systems and flows are covered by this result. In particular, the result applies to the planar periodic Lorentz flow with finite horizon.
Statistical limit laws such as the central limit theorem, the law of the iterated logarithm, and their functional versions, are immediate consequences.

1 Introduction

Statistical properties of uniformly expanding maps and uniformly hyperbolic (Axiom A) diffeomorphisms are by now classical. Hölder observations satisfy exponential decay of correlations and the central limit theorem (CLT), see for example Bowen [8], Ratner [34], Ruelle [35], Parry and Pollicott [31]. Furthermore, Denker and Philipp [17] proved an almost sure invariance principle (ASIP) for Hölder observations.
Immediate consequences of the ASIP are the CLT, the law of the iterated logarithm (LIL), and their functional versions, see [32].
Many proofs of the CLT for dynamical systems use directly the martingale approximation method of Gordin [20], see [25, 26, 30]. The ASIP can often also be obtained in this way, see [16, 19, 30, 39], and indeed this method yields a better error estimate in the ASIP than the usual one, see Field et al. [19]. However, it should be emphasised that the martingale approximation of Gordin [20] leads directly only to a reverse martingale increment sequence and so the ASIP is obtained in backwards time in the first instance. This is not an issue for distributional results such as the CLT, but the ASIP in [16, 19, 30] uses explicitly the fact that the class of systems being studied is closed under time-reversal. To obtain forward martingale approximations, it is necessary to use more sophisticated versions of Gordin's approach [32].
Recently, there has been an explosion of interest in nonuniformly expanding maps and nonuniformly hyperbolic diffeomorphisms (possibly with singularities). We refer to the articles of Young [40, 41] as well as Aaronson [1], Baladi [4, 5], Gouëzel [22], Viana [38] and references therein. In particular, decay of correlations and the CLT are studied extensively in these references. However, such classes of dynamical systems are intrinsically time-orientation specific, and largely for this reason the ASIP has not previously been proved. Similarly, the LIL was previously unproved for such systems.
In this paper, we establish the ASIP, and hence the (functional) LIL, for nonuniformly expanding/hyperbolic systems. Both discrete time systems and flows are covered by our results.
Remark 1.1 We note that [33] attempted to apply the approach in [19] to nonuniformly expanding systems. However, it appears that the time-orientation issue discussed above was overlooked in [33], and that this is a gap. Hence it seems necessary to find an alternative approach to the one in [19], and that is what is done in the current paper.
Precise formulations are given in the body of the paper, but here is an outline of our main result, and the strategy behind its proof, for a nonuniformly expanding map $T : M \to M$ where $(M, d)$ is a metric space. By standard methods, $T$ can be modelled by a discrete-time suspension over a Gibbs-Markov map [1] $f : Y \to Y$ with return time function $R : Y \to \mathbb{Z}^+$. (Roughly speaking, a Gibbs-Markov map is like a uniformly expanding map with possibly countably many inverse branches.) There exists a unique ergodic $T$-invariant probability measure equivalent to Lebesgue, and the following result is formulated with this measure in mind.
Theorem 1.2 Let $T : M \to M$ be a nonuniformly expanding map. Assume moreover that $R \in L^{2+\delta}(Y)$. Let $\phi : M \to \mathbb{R}$ be a mean zero Hölder observation. Then $\phi$ satisfies the ASIP. That is, there exists $\epsilon > 0$, a sequence of random variables $\{S_N\}$ and a Brownian motion $W$ with variance $\sigma^2 \ge 0$ such that $\{\sum_{j=0}^{N-1} \phi \circ T^j\} =_d \{S_N\}$, and
$$S_N = W(N) + O(N^{\frac12 - \epsilon}) \quad \text{as } N \to \infty,$$
almost everywhere.
Using a method due to Hofbauer and Keller [24] which exploits a result of Philipp and Stout [32, Theorem 7.1], we obtain the ASIP (in the correct time direction but without the improved error term) for $f : Y \to Y$ and a class of “weighted Lipschitz” observations. Theorem 1.2 then follows directly by Melbourne and Török [29]. (We note that the method in [29] has independently been used by Gouëzel [21] to obtain a simplified derivation of the CLT and stable laws.) A precise version of Theorem 1.2 is stated and proved in Section 2(e). The ASIP for nonuniformly expanding maps extends easily to a class of nonuniformly expanding semiflows, see Section 2(e).
Our results for nonuniformly hyperbolic diffeomorphisms and nonuniformly hyperbolic flows are completely analogous, but the set-up is more technical and we postpone further details until Section 3.

Planar periodic Lorentz gas

The planar periodic Lorentz gas is a class of examples introduced by Sinaĭ [36].
See [15] for a survey of results about Lorentz gases. The Lorentz flow is a billiard flow on $\mathbb{T}^2 \setminus \Omega$ where $\Omega$ is a disjoint union of convex regions with $C^3$ boundaries. (The phase-space of the flow is three-dimensional; planar position and direction.) The flow has a natural global cross-section $M = \partial\Omega \times [-\pi/2, \pi/2]$ corresponding to collisions and the Poincaré map $T : M \to M$ is called the billiard map. Bunimovich, Sinaĭ and Chernov [11] proved the central limit theorem and weak invariance principle for such maps.
Denote the return time function by $h : M \to \mathbb{R}^+$. The Lorentz flow satisfies the finite horizon condition if $h$ is uniformly bounded. The central limit theorem and weak invariance principle were proved in [11] for Lorentz flows satisfying the finite horizon condition.
Theorem 1.3 Suppose that $T_t$ is a planar periodic Lorentz gas.
  • (i) The billiard map satisfies the ASIP for Hölder observations.
  • (ii) If the finite horizon condition holds, then the Lorentz flow satisfies the ASIP for Hölder observations.
In Section 2, we prove the ASIP for nonuniformly expanding maps and semiflows.
In Section 3, we prove the analogous results for systems that are nonuniformly hyperbolic in the sense of Young [40]. In Section 4, we list numerous examples in the literature for which our results apply. In particular, we prove Theorem 1.3. The results of [32, 29] required in this paper are reproduced as appendices.

2 Nonuniformly expanding systems

In this section, we prove the ASIP for nonuniformly expanding systems. The first step is to prove the ASIP for Gibbs-Markov maps. Such maps are reviewed in Subsection (a) and a class of “weighted Lipschitz” observations is introduced in Subsection (b). The ASIP for Gibbs-Markov maps is proved in Subsection (c) using an approach of Hofbauer and Keller [24]. In Subsection (d), we obtain the ASIP for Young towers [41] as an application of [29]. In Subsection (e), we prove the ASIP for nonuniformly expanding maps and semiflows.

(a) Gibbs-Markov maps

Let $(\Lambda, m)$ be a Lebesgue space with a countable measurable partition $\alpha$. Without loss, we suppose that all partition elements $a \in \alpha$ have $m(a) > 0$. Recall that a measure-preserving transformation $f : \Lambda \to \Lambda$ is a Markov map if $f(a)$ is a union of elements of $\alpha$ and $f|_a$ is injective for all $a \in \alpha$. Define $\alpha'$ to be the coarsest partition of $\Lambda$ such that $fa$ is a union of atoms in $\alpha'$ for all $a \in \alpha$. (So $\alpha'$ is a coarser partition than $\alpha$.) If $a_0, \dots, a_{n-1} \in \alpha$, we define the $n$-cylinder $[a_0, \dots, a_{n-1}] = \bigcap_{i=0}^{n-1} f^{-i} a_i$. It is assumed that $f$ and $\alpha$ separate points in $\Lambda$ (if $x, y \in \Lambda$ and $x \ne y$, then for $n$ large enough there exist distinct $n$-cylinders that contain $x$ and $y$).
Let $0 < \beta < 1$. We define a metric $d_\beta$ on $\Lambda$ by $d_\beta(x, y) = \beta^{s(x,y)}$ where $s(x, y)$ is the greatest integer $n \ge 0$ such that $x, y$ lie in the same $n$-cylinder. Define $g = Jf^{-1} = \frac{dm}{d(m \circ f)}$ and set $g_k = g \cdot g \circ f \cdots g \circ f^{k-1}$.
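As an illustration of the separation time and the metric $d_\beta$, here is a minimal sketch in Python. It assumes (this encoding is not part of the paper) that a point is represented by a finite initial segment of its itinerary, that is, the labels $a_0, a_1, \dots$ of the partition elements visited under iteration of $f$. The function names are hypothetical.

```python
def separation_time(x, y):
    """Number of leading labels shared by two itineraries.

    x and y are sequences (a_0, a_1, ...) of partition labels. Two points lie
    in the same n-cylinder exactly when their first n labels agree, so the
    length of the common prefix is the separation time s(x, y).
    """
    n = 0
    for a, b in zip(x, y):
        if a != b:
            break
        n += 1
    return n


def d_beta(x, y, beta=0.5):
    """Symbolic metric d_beta(x, y) = beta ** s(x, y)."""
    return beta ** separation_time(x, y)


# Three truncated itineraries over a two-element partition {0, 1}.
x = (0, 1, 1, 0, 1)
y = (0, 1, 0, 0, 1)
z = (0, 1, 1, 1, 1)
print(separation_time(x, y), d_beta(x, y))  # 2 0.25
```

Because the itineraries are truncated, two identical tuples receive a small positive distance rather than zero; for genuinely distinct points the separation assumption guarantees that the common prefix is finite. The sketch also makes visible that $d_\beta$ is an ultrametric: $d_\beta(x, z) \le \max\{d_\beta(x, y), d_\beta(y, z)\}$.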
The map $f : \Lambda \to \Lambda$ is a Gibbs-Markov map if it satisfies the additional properties:
  • (i) Big images property: There exists $c > 0$ such that $m(fa) \ge c$ for all $a \in \alpha$.
  • (ii) Distortion: $\log g|_a$ is Lipschitz with respect to $d_\beta$ for all $a \in \alpha$.
It follows from assumptions (i) and (ii) that there exists a constant $D \ge 1$ such that for all $x, y$ lying in a common $k$-cylinder $[a_0, \dots, a_{k-1}]$,
$$\left| \frac{g_k(x)}{g_k(y)} - 1 \right| \le D\, d_\beta(f^k x, f^k y) \quad \text{and} \quad D^{-1} \le \frac{m[a_0, \dots, a_{k-1}]}{g_k(x)} \le D. \tag{2.1}$$

(b) Weighted Lipschitz observations

Let $p \ge 1$. We fix a sequence of weights $R(a) > 0$ satisfying $|R|_p = \left( \sum_{a \in \alpha} m(a) R(a)^p \right)^{1/p} < \infty$.
Given $v : \Lambda \to \mathbb{R}$ continuous, we set $v_a = v|_a$ and define $|v|_\beta$ to be the Lipschitz constant of $v$ with respect to the metric $d_\beta$. Let
$$\|v\|_\infty = \sup_{a \in \alpha} |v_a|_\infty / R(a), \qquad \|v\|_\beta = \sup_{a \in \alpha} |v_a|_\beta / R(a).$$
Let $\mathcal{B}$ consist of the space of weighted Lipschitz functions with $\|v\| = \|v\|_\infty + \|v\|_\beta < \infty$. Note in particular that $R \in \mathcal{B}$ and $\|R\| = 1$. We have the embeddings $\mathrm{Lip} \subset \mathcal{B} \subset L^p \subset L^1$, where $\mathrm{Lip}$ is the space of (globally) Lipschitz functions.
The transfer (Perron-Frobenius) operator $P : L^1 \to L^1$ maps $v \in L^1$ to $Pv$ where $\int_\Lambda Pv\, w\, dm = \int_\Lambda v\, w \circ f\, dm$ for all $w \in L^\infty$, and is given by $(Pv)(x) = \sum_{fy = x} g(y) v(y)$. Note that $|P|_1 = 1$.
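The duality relation that defines the transfer operator can be checked numerically on a simple example. The sketch below is an illustration under assumptions that are not in the paper: it uses the doubling map $f(x) = 2x \bmod 1$ on $[0,1)$ with Lebesgue measure, for which $g \equiv 1/2$ and each point has two preimages, and it approximates integrals with a midpoint rule. The test functions `v` and `w` are arbitrary choices.

```python
import math


def f(x):
    """Doubling map on [0, 1); it preserves Lebesgue measure."""
    return (2.0 * x) % 1.0


def transfer(v):
    """Transfer operator for the doubling map.

    (Pv)(x) sums g(y) v(y) over the two preimages y = x/2 and y = (x+1)/2,
    with weight g = 1/2.
    """
    return lambda x: 0.5 * (v(0.5 * x) + v(0.5 * (x + 1.0)))


def integrate(u, n=20000):
    """Midpoint rule on [0, 1] with n cells (n even, so 1/2 is a cell boundary)."""
    return sum(u((i + 0.5) / n) for i in range(n)) / n


v = lambda x: x * x
w = lambda x: math.cos(2.0 * math.pi * x) + x
Pv = transfer(v)

lhs = integrate(lambda x: Pv(x) * w(x))      # integral of (Pv) * w
rhs = integrate(lambda x: v(x) * w(f(x)))    # integral of v * (w o f)
print(abs(lhs - rhs) < 1e-6)
```

The two integrals agree up to quadrature error, and applying `transfer` to the constant function 1 returns 1, reflecting invariance of the measure.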
Proposition 2.1 Let $a$ be an $n$-cylinder and suppose that $v : a \to \mathbb{R}$ is Lipschitz. Then $|v|_\infty \le \frac{1}{m(a)} \int_a |v|\, dm + \beta^n |v|_\beta$.
Proof For $x \in a$,
$$|v(x)| \le \frac{1}{m(a)} \int_a |v|\, dm + \left| v(x) - \frac{1}{m(a)} \int_a v\, dm \right| \le \frac{1}{m(a)} \int_a |v|\, dm + |v|_\beta \operatorname{diam}(a).$$
The result follows since $\operatorname{diam}(a) = \beta^n$.
Lemma 2.2 The transfer operator $P$ restricts to an operator $P : \mathcal{B} \to \mathcal{B}$ on the space $\mathcal{B}$ of weighted Lipschitz functions, and there exists a constant $C \ge 1$ such that
$$\|P^n v\| \le C \left( |v|_1 + \beta^n \|v\|_\beta \right),$$
for all $v \in \mathcal{B}$ and $n \ge 1$. Moreover, $P(\mathcal{B}) \subset \mathrm{Lip}$.
Proof We prove the estimate on $\|P^n v\|$. The remaining statements of the lemma are evident from the proof.
Note that $(P^n v)(x) = \sum_{f^n y = x} g_n(y) v(y)$. Since the $n$-cylinders $[a_0, \dots, a_{n-1}]$ form a partition and each $n$-cylinder contains precisely one preimage $y_a$, we have
$$\begin{aligned}
|(P^n v)(x)| &\le \sum_{a = [a_0, \dots, a_{n-1}]} g_n(y_a) |v(y_a)| \le D \sum_{a = [a_0, \dots, a_{n-1}]} m(a) |v_a|_\infty \\
&\le D \sum_{a = [a_0, \dots, a_{n-1}]} \left[ \int_a |v|\, dm + m(a) \beta^n |v_a|_\beta \right] \\
&\le D \sum_{a = [a_0, \dots, a_{n-1}]} \left[ \int_a |v|\, dm + \beta^n m(a) R(a_0) \|v\|_\beta \right]
\end{aligned}$$
where we have used Proposition 2.1 and estimate (2.1). Hence $|P^n v|_\infty \le D \left[ |v|_1 + \beta^n |R|_1 \|v\|_\beta \right]$. Similarly,
$$\begin{aligned}
|(P^n v)(x) - (P^n v)(x')| &\le \sum_{a = [a_0, \dots, a_{n-1}]} |g_n(y_a) - g_n(y_a')|\, |v(y_a)| \\
&\quad + \sum_{a = [a_0, \dots, a_{n-1}]} |g_n(y_a')|\, |v(y_a) - v(y_a')|.
\end{aligned}$$
Each term in the first summation can be estimated by
$$\begin{aligned}
D |g_n(y_a')|\, d_\beta(f^n y_a, f^n y_a')\, |v_a|_\infty &\le D^2 m(a)\, d_\beta(x, x') \left[ \frac{1}{m(a)} \int_a |v_a|\, dm + \beta^n |v_a|_\beta \right] \\
&\le D^2 \left[ \int_a |v|\, dm + \beta^n m(a) R(a_0) \|v\|_\beta \right] d_\beta(x, x'),
\end{aligned}$$
so the first summation is bounded by $D^2 \left[ |v|_1 + \beta^n |R|_1 \|v\|_\beta \right] d_\beta(x, x')$. Each term in the second summation can be estimated by
$$D\, m(a)\, |v_a|_\beta\, d_\beta(y_a, y_a') \le D\, m(a) R(a_0) \|v\|_\beta\, \beta^n d_\beta(x, x'),$$
so the second summation is bounded by $\beta^n D |R|_1 \|v\|_\beta\, d_\beta(x, x')$. The result follows.
We have the following standard consequences of Lemma  2.2 .
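Corollary 2.3 Let $p, q \ge 1$ with $\frac1p + \frac1q = 1$. Assume that $f : \Lambda \to \Lambda$ is mixing and that $R \in L^p$. Let $\mathcal{B}$ denote the space of weighted Lipschitz functions from Subsection (b). Then there exist constants $C \ge 1$ and $\tau \in (0, 1)$ such that
  • (a) $\|P^n v - \int_\Lambda v\, dm\| \le C \tau^n \|v\|$ for all $v \in \mathcal{B}$ and $n \ge 1$.
  • (b) $\left| \int_\Lambda v\, (w \circ f^n)\, dm - \int_\Lambda v\, dm \int_\Lambda w\, dm \right| \le C \tau^n \|v\|\, |w|_q$ for all $v \in \mathcal{B}$, $w \in L^q$, $n \ge 1$.
  • (c) If $R \in L^2$, then for any $v \in \mathcal{B}$ with $\int_\Lambda v\, dm = 0$, the series $\sigma^2 = \int_\Lambda v^2\, dm + 2 \sum_{k=1}^\infty \int_\Lambda v\, (v \circ f^k)\, dm$ is absolutely convergent, and $\int_\Lambda v_N^2\, dm = \sigma^2 N + O(1)$ as $N \to \infty$, where $v_N = \sum_{j=0}^{N-1} v \circ f^j$. Moreover, $\sigma = 0$ if and only if there exists a Lipschitz function $w : \Lambda \to \mathbb{R}$ such that $v = w \circ f - w$.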
Proof Most of this result is completely standard, but we include the details for completeness. By an Arzela-Ascoli argument, the unit ball in the weighted Lipschitz space $\mathcal{B}$ is compact in $L^1$.
This combined with Lemma 2.2 implies, by Hennion [23], that the essential spectral radius of $P : \mathcal{B} \to \mathcal{B}$ is bounded above by $\beta < 1$. There is a simple eigenvalue at $1$ with eigenspace consisting of constant functions, but the mixing assumption guarantees that there are no further eigenvalues on the unit circle. Now choose $\tau \in (\beta, 1)$ such that all eigenvalues of $P$ other than $1$ lie strictly inside the disk of radius $\tau$. Part (a) follows for such a choice of $\tau$.
To prove part (b), compute that
$$\begin{aligned}
\left| \int_\Lambda v\, (w \circ f^n)\, dm - \int_\Lambda v\, dm \int_\Lambda w\, dm \right| &= \left| \int_\Lambda \left( P^n v - \int_\Lambda v \right) w\, dm \right| \le \left| P^n v - \int_\Lambda v \right|_p |w|_q \\
&\le \left\| P^n v - \int_\Lambda v \right\| |w|_q \le C \tau^n \|v\|\, |w|_q.
\end{aligned}$$
It follows from (b) that $\left| \int_\Lambda v\, (v \circ f^k)\, dm \right| \le C \tau^k \|v\|\, |v|_2$ and so the series for $\sigma^2$ converges absolutely. Moreover
$$\begin{aligned}
\int_\Lambda v_N^2\, dm &= N \int_\Lambda v^2\, dm + 2 \sum_{0 \le i < j \le N-1} \int_\Lambda v\, (v \circ f^{j-i})\, dm \\
&= N \int_\Lambda v^2\, dm + 2 \sum_{k=1}^{N} (N - k) \int_\Lambda v\, (v \circ f^k)\, dm \\
&= N \sigma^2 - 2 \sum_{k=1}^{N} k \int_\Lambda v\, (v \circ f^k)\, dm - 2 \sum_{k=N+1}^{\infty} N \int_\Lambda v\, (v \circ f^k)\, dm \\
&= N \sigma^2 + O(1),
\end{aligned}$$
proving (c).
The criterion for $\sigma = 0$ follows as in [19, 28]. If $v = w \circ f - w$, then $v_N = w \circ f^N - w$ so it is clear that $\sigma = 0$. To prove the converse, define $w = \sum_{j=1}^\infty P^j v$. This series converges in $\mathcal{B}$ by (a) and is Lipschitz by Lemma 2.2. Write $v = \hat v + w \circ f - w$.
Then it is easily seen that $\hat v$ has the same variance as $v$ and that $P \hat v = 0$. Hence $\sigma^2 = \int_\Lambda \hat v^2\, dm$, so if $\sigma = 0$, then $\hat v = 0$ and $v = w \circ f - w$.
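The variance series in Corollary 2.3(c) can be evaluated numerically in a simple case. The following sketch is an illustration only, under assumptions that are not in the paper: it takes the doubling map $f(x) = 2x \bmod 1$ with Lebesgue measure and the mean zero observation $v(x) = x - \tfrac12$. For this example a direct computation gives $\int v\,(v \circ f^k)\,dm = 2^{-k}/12$, so that $\sigma^2 = \tfrac{1}{12}(1 + 2\sum_{k \ge 1} 2^{-k}) = \tfrac14$. The function names are hypothetical.

```python
def correlation(k, n=4096):
    """Midpoint-rule approximation of the integral of v * (v o f^k) over [0, 1],
    for the doubling map f and v(x) = x - 1/2.

    n is a power of two so that the grid cells align with the branches of f^k.
    """
    total = 0.0
    for i in range(n):
        x = (i + 0.5) / n
        fkx = (2 ** k * x) % 1.0          # f^k(x) for the doubling map
        total += (x - 0.5) * (fkx - 0.5)
    return total / n


def green_kubo(num_terms=10):
    """Truncated series sigma^2 = c_0 + 2 * sum_{k=1}^{num_terms} c_k."""
    return correlation(0) + 2.0 * sum(correlation(k) for k in range(1, num_terms + 1))


sigma2 = green_kubo()
print(sigma2)   # close to 1/4, up to truncation and quadrature error
```

The geometric decay of `correlation(k)` in `k` is the concrete form of the bound $C\tau^k$ used in the proof above.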

(c) ASIP for Gibbs-Markov maps

Let $\alpha_0^{k-1}$ denote the partition into length $k$ cylinders $a = [a_0, \dots, a_{k-1}]$.
Lemma 2.4 Assume that $f : \Lambda \to \Lambda$ is mixing and that $R \in L^{2+\delta}$ for some $\delta > 0$. Let $v \in \mathcal{B}$ (the weighted Lipschitz space of Subsection (b)) with $\int_\Lambda v\, dm = 0$. Then
  • (a) $\sum_{a \in \alpha_0^{k-1}} \int_a \left| v - \frac{1}{m(a)} \int_a v\, dm \right|^{2+\delta} dm \le \left( \|v\|_\beta\, |R|_{2+\delta}\, \beta^k \right)^{2+\delta}$.
  • (b) $\left| m(a \cap f^{-(N+k)}(b)) - m(a) m(b) \right| \le C \tau^N m(a)\, m(b)^{1/2}$ for all $a \in \alpha_0^{k-1}$ and all measurable sets $b$.
Proof Note that $\left| v - \frac{1}{m(a)} \int_a v\, dm \right| \le |v_a|_\beta \operatorname{diam}(a) \le \|v\|_\beta R(a_0) \beta^k$. Part (a) follows immediately.
We argue as in Aaronson & Denker [2] to establish (b). Let $v_{a,k} = P^k \chi_a$. By definition, $v_{a,k}(x) = \sum_{f^k y = x} g_k(y) \chi_a(y) = g_k(y_a)$ where $y_a$ is the unique point in $a$ such that $f^k y_a = x$. Hence by (2.1),
$$|v_{a,k}(x) - v_{a,k}(x')| \le D |g_k(y_a)|\, d_\beta(x, x') \le D^2 m(a)\, d_\beta(x, x').$$
It follows that $\|v_{a,k}\| \le E\, m(a)$ where $E = D^2 + D$.
Using this estimate and Corollary 2.3(b), we compute that
$$\begin{aligned}
\left| m(a \cap f^{-(N+k)} b) - m(a) m(b) \right| &= \left| \int P^k \chi_a\; \chi_b \circ f^N - \int P^k \chi_a \int \chi_b \right| \\
&= \left| \int v_{a,k}\; \chi_b \circ f^N - \int v_{a,k} \int \chi_b \right| \le C \tau^N \|v_{a,k}\|\, |\chi_b|_2 \le C E \tau^N m(a)\, m(b)^{1/2},
\end{aligned}$$
as required.
Corollary 2.5 Let $f : \Lambda \to \Lambda$ be an ergodic Gibbs-Markov map. Define the Banach space $\mathcal{B}$ corresponding to weights $R \in L^{2+\delta}$ for some $\delta > 0$.
Suppose that $v \in \mathcal{B}$ and $\int_\Lambda v\, dm = 0$. Define $\sigma^2$ as in Corollary 2.3 and assume that $\sigma^2 > 0$. Then $v_N = \sum_{j=0}^{N-1} v \circ f^j$ satisfies the ASIP.
Proof We verify the hypotheses of Philipp & Stout [32, Theorem 7.1]. For convenience, we have translated this theorem into dynamical systems terminology in the appendix, see Theorem A.1. Condition (i) of Theorem A.1 is automatic since $\mathcal{B} \subset L^{2+\delta}$ and condition (ii) follows from Corollary 2.3(c). Conditions (iii) and (iv) follow from parts (a) and (b) of Lemma 2.4.

(d) ASIP for tower maps

Suppose that $(\Lambda, m)$ is a probability space and that $f : \Lambda \to \Lambda$ is a measure-preserving transformation. Let $R : \Lambda \to \mathbb{Z}^+$ be a measurable function (called a return time function) with $R \in L^1(\Lambda)$. Define the suspension
$$\Delta = \{ (x, \ell) \in \Lambda \times \mathbb{N} : 0 \le \ell \le R(x) \} / \sim,$$
where $(x, R(x)) \sim (f(x), 0)$. Define $F : \Delta \to \Delta$ by $F(x, \ell) = (x, \ell + 1)$ computed subject to identifications. Note in particular that $F^{R(x)}(x, 0) = (f(x), 0)$. An $F$-invariant probability measure on $\Delta$ is given by $m_R = m \times l / \bar R$ where $\bar R = \int_\Lambda R\, dm$ and $l$ is counting measure on $\mathbb{N}$.
Let $\{\Delta_{j,0}\}$ be a countable measurable partition of $\Lambda$ such that $f$ and $\{\Delta_{j,0}\}$ separate points in $\Lambda$, and for each $j$, $R_j = R|_{\Delta_{j,0}}$ is constant and $f : \Delta_{j,0} \to \Lambda$ is a measurable isomorphism. For each $j$ and $0 \le \ell < R_j$, let $\Delta_{j,\ell} = \Delta_{j,0} \times \{\ell\}$. This defines a partition $\{\Delta_{j,\ell}\}$ of $\Delta$.
A separation time function $s : \Delta \times \Delta \to \mathbb{N}$ is defined as follows: If $x, y$ lie in distinct partition elements, then $s(x, y) = 0$. If $x, y \in \Delta_{j,0}$ for some $j$, then $s(x, y)$ is the greatest integer $n \ge 0$ such that $f^k x$ and $f^k y$ lie in the same partition element of $\Lambda$ for $k = 0, \dots, n$. If $x, y \in \Delta_{j,\ell}$, then write $x = F^\ell x_0$, $y = F^\ell y_0$ where $x_0, y_0 \in \Delta_{j,0}$ and define $s(x, y) = s(x_0, y_0)$. For $\theta \in (0, 1)$, we define a metric $d_\theta$ on $\Delta$ by setting $d_\theta(x, y) = \theta^{s(x,y)}$.
Definition 2.6 The suspension $F : \Delta \to \Delta$ is called a Young tower if $f : \Lambda \to \Lambda$ is a Gibbs-Markov map with respect to the partition $\alpha = \{\Delta_{j,0}\}$.
Remark 2.7 The big images condition for $f$ to be a Gibbs-Markov map is automatically satisfied in the strong sense that $f(a) = \Lambda$ for each $a \in \alpha$. Hence, $F : \Delta \to \Delta$ is a Young tower provided the distortion condition holds: there exist constants $\theta \in (0, 1)$ and $C \ge 1$ such that for each $j$ the Jacobian $g_j = Jf|_{\Delta_{j,0}}$ of the map $f|_{\Delta_{j,0}} : \Delta_{j,0} \to \Lambda$ satisfies $|\log g_j(x) - \log g_j(y)| \le C d_\theta(x, y)$ for all $x, y \in \Delta_{j,0}$.
Theorem 2.8 Let $F : \Delta \to \Delta$ be a Young tower defined as a suspension over $f : \Lambda \to \Lambda$ with return time function $R$. Assume that $R \in L^{2+\delta}(\Lambda)$. Let $\phi : \Delta \to \mathbb{R}$ be a mean zero observation and assume that $\phi$ is Lipschitz with respect to $d_\theta$. Then $\phi_N = \sum_{j=0}^{N-1} \phi \circ F^j$ satisfies the ASIP.
Proof Define a mean zero observation $\Phi : \Lambda \to \mathbb{R}$ by setting $\Phi(x) = \sum_{j=0}^{R(x)-1} \phi(x, j)$.
Since $\phi$ is Lipschitz, it is immediate that $\Phi$ lies in the space $\mathcal{B}$ of weighted Lipschitz observations. Since $R \in L^{2+\delta}(\Lambda)$, it follows from Corollary 2.5 that $\Phi_N = \sum_{j=0}^{N-1} \Phi \circ f^j$ satisfies the ASIP on $\Lambda$.
Note that $R - \bar R$ also satisfies the hypotheses of Corollary 2.5, and so the ASIP, and hence the LIL, applies. Therefore, it is certainly the case that $\sum_{j=0}^{N-1} R \circ f^j = N \bar R + o(N^{1-\delta})$ almost everywhere. The result follows from [29, Theorem 4.2], see Corollary B.2.
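The passage from the tower to the base rests on an exact bookkeeping identity: summing the induced observation $\Phi$ along $N$ steps of $f$ gives the same value as summing $\phi$ along $\sum_{j<N} R \circ f^j$ steps of $F$ starting on the base. The sketch below checks this on a toy example. Everything specific in it is a hypothetical choice made for illustration (a three-point base permuted cyclically, the return times, and the observable); it uses the convention that the levels above $x$ are $0, \dots, R(x)-1$.

```python
def make_tower(f, R):
    """Return the tower map F(x, l) for a base map f and return time R."""
    def F(point):
        x, level = point
        if level < R(x) - 1:
            return (x, level + 1)      # climb one level
        return (f(x), 0)               # top level: return to the base via f
    return F


# Hypothetical finite base: three points permuted cyclically, return times 1, 3, 2.
f = lambda x: (x + 1) % 3
R = lambda x: (1, 3, 2)[x]
F = make_tower(f, R)

phi = lambda point: 10 * point[0] + point[1]            # arbitrary observable on the tower
Phi = lambda x: sum(phi((x, j)) for j in range(R(x)))   # induced observable on the base


def birkhoff_base(x, n):
    """Return (sum_{j<n} Phi(f^j x), sum_{j<n} R(f^j x))."""
    total, time = 0, 0
    for _ in range(n):
        total += Phi(x)
        time += R(x)
        x = f(x)
    return total, time


def birkhoff_tower(point, n):
    """Return sum_{i<n} phi(F^i point)."""
    total = 0
    for _ in range(n):
        total += phi(point)
        point = F(point)
    return total


induced_sum, elapsed = birkhoff_base(0, 5)
print(induced_sum == birkhoff_tower((0, 0), elapsed))  # True
```

In the proof, the remaining work is to control the random time change `elapsed`, which is why the estimate on $\sum_{j<N} R \circ f^j$ is needed.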

(e) ASIP for nonuniformly expanding systems

Let $(M, d)$ be a locally compact separable bounded metric space with Borel probability measure $\eta$ and let $T : M \to M$ be a nonsingular transformation for which $\eta$ is ergodic. Let $Y \subset M$ be a measurable subset with $\eta(Y) > 0$. We suppose that there is an at most countable measurable partition $\{Y_j\}$ with $\eta(Y_j) > 0$, and that there exist integers $R_j \ge 1$, and constants $\lambda > 1$; $C, D > 0$ and $\gamma \in (0, 1)$ such that for all $j$,
  • (1) $T^{R_j} : Y_j \to Y$ is a (measure-theoretic) bijection.
  • (2) $d(T^{R_j} x, T^{R_j} y) \ge \lambda\, d(x, y)$ for all $x, y \in Y_j$.
  • (3) $d(T^k x, T^k y) \le C\, d(T^{R_j} x, T^{R_j} y)$ for all $x, y \in Y_j$, $k < R_j$.
  • (4) $g_j = \frac{d(\eta|_{Y_j} \circ (T^{R_j})^{-1})}{d\eta|_Y}$ satisfies $|\log g_j(x) - \log g_j(y)| \le D\, d(x, y)^\gamma$ for almost all $x, y \in Y$.
  • (5) $\sum_j R_j\, \eta(Y_j) < \infty$.
We say that a dynamical system $T$ satisfying (1)–(5) is nonuniformly expanding.
Define the return time function $R : Y \to \mathbb{Z}^+$ by $R|_{Y_j} \equiv R_j$. Condition (5) says that $\int_Y R\, d\eta < \infty$. The map $f : Y \to Y$ given by $f(y) = T^{R(y)}(y)$ is the corresponding induced map. It can be shown (see Young [41, Theorem 1]) that there is a unique invariant probability measure $m$ on $M$ that is equivalent to $\eta$.
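To see a return time function and induced map in a concrete case, the sketch below computes $R(y)$ and $f(y) = T^{R(y)}(y)$ numerically. The choice of map is an assumption made for illustration and is not taken from this section: it is the intermittent map $T(x) = x(1 + (2x)^a)$ on $[0, \tfrac12)$ and $T(x) = 2x - 1$ on $[\tfrac12, 1]$, with $Y = [\tfrac12, 1]$ and $R$ the first return time to $Y$. The parameter value and function names are hypothetical, and the sketch does not verify conditions (1)–(5).

```python
def intermittent_map(x, a=0.5):
    """Intermittent map on [0, 1]: neutral fixed point at 0, expanding elsewhere."""
    if x < 0.5:
        return x * (1.0 + (2.0 * x) ** a)
    return 2.0 * x - 1.0


def first_return(y, a=0.5, max_iter=10**6):
    """Return (R(y), f(y)) for the first return of y to Y = [1/2, 1]."""
    x = intermittent_map(y, a)
    n = 1
    while x < 0.5:
        if n >= max_iter:
            raise RuntimeError("no return within max_iter iterations")
        x = intermittent_map(x, a)
        n += 1
    return n, x


# Points of Y closer to 1/2 land closer to the neutral fixed point and return later.
for y in (0.9, 0.6, 0.51, 0.501):
    print(y, first_return(y))
```

The sets $Y_j = \{R = j\}$ are the partition elements in this picture, and the growth of $R$ near $y = \tfrac12$ is what the integrability assumptions on $R$ control.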
We can now state and prove a precise version of Theorem 1.2.
Theorem 2.9 Let $T : M \to M$ be a nonuniformly expanding map satisfying (1)–(5) above. Assume moreover that the return time function $R$ lies in $L^{2+\delta}(Y)$. Let $\phi : M \to \mathbb{R}$ be a mean zero Hölder observation. Then $\phi$ satisfies the ASIP.
Proof Let $\Delta = \{ (y, \ell) : y \in Y,\ \ell = 0, \dots, R(y) - 1 \}$, so $\Delta$ is the disjoint union of $R_j$ copies of each $Y_j$. Define a measure $\mu$ on $\Delta$ by setting $\mu|_{Y_j \times \{\ell\}} = m|_{Y_j} / \bar R$.
Define $F : \Delta \to \Delta$ by setting $F(y, \ell) = (y, \ell + 1)$ for $0 \le \ell < R(y) - 1$ and $F(y, R(y) - 1) = (fy, 0)$. Define the separation time $s : \Delta \times \Delta \to \mathbb{N}$ as in the previous section.
By shrinking $\gamma$ if necessary, we may suppose that $\phi$ is $\gamma$-Hölder for the same $\gamma$ that appears in condition (4). Define the metric $d_\theta$ on $\Delta$ with $\theta = 1/\lambda^\gamma$. It follows from condition (2) that $d(x, y) \le \operatorname{diam}(Y) / \lambda^{s(x,y)}$ for all $x, y \in Y$. Hence $f$ and $\{Y_j\}$ separate points in $Y$ and the required distortion condition on $g_j$ is immediate, so $\Delta$ is a Young tower with $\Lambda = Y$ and $\Delta_{j,0} = Y_j$. If $x, y$ lie in the same partition element of $\Delta$, then write $x = F^\ell x_0$, $y = F^\ell y_0$ so $d(f x_0, f y_0) \le \operatorname{diam}(Y) / \lambda^{s(x,y)}$. By condition (3),
$$d(x, y) \le C\, d(f x_0, f y_0) \le C \operatorname{diam}(Y) / \lambda^{s(x,y)} = C \operatorname{diam}(Y)\, [d_\theta(x, y)]^{1/\gamma}.$$
Hence, there is a constant $C \ge 1$ such that $d(x, y) \le C\, [d_\theta(x, y)]^{1/\gamma}$ for all $x, y \in \Delta$.
Define the projection $\pi : \Delta \to M$ by $\pi(y, \ell) = T^\ell y$. Then $\pi$ is a measure-preserving isomorphism and it follows as above that $d(\pi(x), \pi(y))^\gamma \le C\, d_\theta(x, y)$, for all $x, y \in \Delta$.
In particular, since $\phi : (M, d) \to \mathbb{R}$ is $\gamma$-Hölder, it follows that $\phi \circ \pi : (\Delta, d_\theta) \to \mathbb{R}$ is Lipschitz. By Theorem 2.8, the ASIP holds for $\phi \circ \pi$ on $\Delta$. Since $\pi$ is a measure-preserving semiconjugacy, the ASIP holds for $\phi$ on $M$.
Remark 2.10 As already pointed out in [21], the CLT for nonuniformly expanding maps holds under slightly weaker hypotheses using [29, Theorem 1.1]. Instead of requiring that $R \in L^{2+\delta}$, it suffices that $R \in L^2$.
Remark 2.11 The ASIP is said to be degenerate if $\sigma^2 = 0$. It follows from previous work in connection with the CLT [40, 41] that the ASIPs obtained in this paper are degenerate if and only if $\phi = \psi \circ T - \psi$ where $\psi \in L^2(M)$.
Moreover, by a Livšic regularity result of Bruin et al. [9], such an $L^2$ function $\psi$ has a version that is Hölder on $\bigcup_{j=0}^{\ell} T^j Y$ for each fixed $\ell$. (It is easy to construct examples where the ASIP is degenerate but $\psi$ does not have a version that is continuous on the whole of $M$.) In particular, if $T$ has a periodic point $x \in Y$ of period $k$ and $\sum_{j=0}^{k-1} \phi(T^j x) \ne 0$, then the ASIP is nondegenerate.

Nonuniformly expanding semiflows

We continue to assume that $T : M \to M$ is a nonuniformly expanding map satisfying conditions (1)–(5). Suppose that $h : M \to \mathbb{R}^+$ lies in $L^1(M)$. Regarding $h$ as a roof function, we form the suspension $M^h = \{ (x, u) \in M \times [0, \infty) : 0 \le u \le h(x) \} / \sim$ where $(x, h(x)) \sim (Tx, 0)$. The suspension semiflow $T_t : M^h \to M^h$ is given by $T_t(x, u) = (x, u + t)$ computed modulo identifications. We call $T_t : M^h \to M^h$ a nonuniformly expanding semiflow. We say that an observation $\psi : M^h \to \mathbb{R}$ is Hölder if $\psi$ is bounded and $\sup_{(x,u) \ne (y,u)} |\psi(x, u) - \psi(y, u)| / d(x, y) < \infty$.
Corollary 2.12 Let $T_t : M^h \to M^h$ be a nonuniformly expanding semiflow.
Assume moreover that the return time function $R$ lies in $L^{2+\delta}(Y)$ and that the roof function $h : M \to \mathbb{R}^+$ is Hölder. Let $\psi : M^h \to \mathbb{R}$ be a mean zero Hölder observation. Then $\psi$ satisfies the ASIP. That is, there exists $\epsilon > 0$, a family of random variables $\{S_t\}$ and a Brownian motion $W$ with variance $\sigma^2 \ge 0$ such that $\{\int_0^t \psi \circ T_s\, ds\} =_d \{S_t\}$, and $S_t = W(t) + O(t^{\frac12 - \epsilon})$ as $t \to \infty$, almost everywhere.
Proof According to [29, Theorem 4.2] (Theorem B.1), it suffices that (i) $h \in L^{2+\delta}(Y)$, (ii) $\phi(x) = \int_0^{h(x)} \psi(x, u)\, du$ satisfies the ASIP on $Y$, and (iii) $h$ satisfies the ASIP on $Y$. Hence, the result is immediate from Theorem 2.9.
Remark 2.13 We have not striven for greatest generality in the statements of Theorem 2.9 and Corollary 2.12. However, it is clear from the proof that in Theorem 2.9 we can relax the assumption that $\phi$ is Hölder. It is sufficient that $\phi$ is such that $\Phi(x) = \sum_{\ell=0}^{R(x)-1} \phi(T^\ell x)$ lies in the space of weighted Lipschitz observations in Subsection (b) for an appropriate choice of weight function. Taking the weight function to be the return time function, it suffices that $\phi$ is Hölder on $T^\ell Y_j$ for all $j \ge 1$, $0 \le \ell < R(j) - 1$, with $L^\infty$ norm and Hölder constant independent of $j, \ell$.
Similarly, the hypotheses that $\psi$ and $h$ are Hölder can be weakened in Corollary 2.12. For example, provided $\psi$ is Hölder, it suffices that $h$ is Hölder on $T^\ell Y_j$ for all $j \ge 1$, $0 \le \ell < R(j) - 1$, with $L^\infty$ norm and Hölder constant independent of $j, \ell$.

3 Nonuniformly hyperbolic systems

In this section, we show how to prove the ASIP for Lipschitz observations of a dynamical system that is nonuniformly hyperbolic in the sense of Young [40]. Instead of using the original set up, we make four assumptions (A1)–(A4) that are distilled from those in [40]. In doing so, we bypass the differential structure, and certain conclusions in [40] become assumptions here, particularly (A4) below.
Let $T : M \to M$ be a diffeomorphism (possibly with singularities) defined on a Riemannian manifold $(M, d)$. We assume from the start that $T$ preserves a “nice” probability measure $m$ (one of the conclusions in Young [40] is that $m$ is an SRB measure). Assumption (A4) contains the properties of $m$ that we require for the ASIP. We fix a subset $\Lambda \subset M$ and a family of subsets of $M$ that we call “stable disks” $\{W^s\}$ that are disjoint and cover $\Lambda$. If $x$ lies in a stable disk, we label the disk $W^s(x)$.
  • (A1) There is a partition $\{\Lambda_j\}$ of $\Lambda$ and integers $R_j \ge 1$ such that for all $x \in \Lambda_j$ we have $T^{R_j}(W^s(x)) \subset W^s(T^{R_j} x)$.
Define the return time function $R : \Lambda \to \mathbb{Z}^+$ by $R|_{\Lambda_j} = R_j$ and the induced map $f : \Lambda \to \Lambda$ by $f(x) = T^{R(x)}(x)$. Form the discrete suspension map $F : \Delta \to \Delta$ where $F(x, \ell) = (x, \ell + 1)$ for $\ell < R(x) - 1$ and $F(x, R(x) - 1) = (fx, 0)$. We define a separation time $s : \Lambda \times \Lambda \to \mathbb{N}$ by defining $s(x, x')$ to be the greatest integer $n \ge 0$ such that $f^k x, f^k x'$ lie in the same partition element of $\Lambda$ for $k = 0, \dots, n$. (If $x, x'$ do not lie in the same partition element, then we take $s(x, x') = 0$.) For general points $p = (x, \ell)$, $q = (x', \ell') \in \Delta$, define $s(p, q) = s(x, x')$ if $\ell = \ell'$ and $s(p, q) = 0$ otherwise.
This defines a separation time $s : \Delta \times \Delta \to \mathbb{N}$. We have the projection $\pi : \Delta \to M$ given by $\pi(x, \ell) = T^\ell x$ and satisfying $\pi \circ F = T \circ \pi$.
  • (A2) There is a distinguished subset or “unstable leaf” $W^u \subset \Lambda$ such that each stable disk intersects $W^u$ in precisely one point, and there exist constants $C \ge 1$, $\alpha \in (0, 1)$ such that
    • (i) $d(T^n x, T^n y) \le C \alpha^n$, for all $y \in W^s(x)$, all $n \ge 0$, and
    • (ii) $d(T^n x, T^n y) \le C \alpha^{s(x,y)}$ for all $x, y \in W^u$ and all $0 \le n < R$.
Remark 3.1 We note that Young [40] uses a separation time $s_0$ defined in terms of the underlying diffeomorphism $T : M \to M$ whereas our separation time $s$ is defined in terms of the induced map $f : \Lambda \to \Lambda$. In particular, [40, conditions (iii) and (iv), p. 589] guarantee that $s_0 \ge s$ and moreover that $s_0 - (R - 1) \ge s$. Hence [40, assumption (P4)(a)] ($d(T^n x, T^n y) \le C \alpha^{s_0(x,y) - n}$ for $0 \le n < s_0(x, y)$) implies our assumption (A2)(ii).
There is also a separation time in [40] that is denoted $s$. This is different from our separation time and plays no role in this paper.
Let $\bar\Lambda = \Lambda / \sim$ where $x \sim x'$ if $x \in W^s(x')$. Similarly, define the partition $\{\bar\Lambda_j\}$ of $\bar\Lambda$. We obtain a well-defined return time function $R : \bar\Lambda \to \mathbb{Z}^+$ and induced map $f : \bar\Lambda \to \bar\Lambda$. Let $F : \bar\Delta \to \bar\Delta$ denote the corresponding suspension map. We note that this can be viewed as the quotient of $F : \Delta \to \Delta$ where $(x, \ell)$ is identified with $(x', \ell')$ if $\ell = \ell'$ and $x' \in W^s(x)$. Let $\bar\pi : \Delta \to \bar\Delta$ denote the natural projection.
The separation time on $\Delta$ drops down to a separation time on $\bar\Delta$ (and agrees with the natural separation time defined using $f : \bar\Lambda \to \bar\Lambda$ and the partition $\{\bar\Lambda_j\}$).
  • (A3) The map $f : \bar\Lambda \to \bar\Lambda$ and partition $\{\bar\Lambda_j\}$ separate points in $\bar\Lambda$.
It follows that $d_\theta(p, q) = \theta^{s(p,q)}$ defines a metric on $\bar\Delta$ for each $\theta \in (0, 1)$.
  • (A4) There exist $F$-invariant probability measures $\tilde m$ on $\Delta$ and $\bar m$ on $\bar\Delta$ such that
    • (i) $\pi : \Delta \to M$ and $\bar\pi : \Delta \to \bar\Delta$ are measure-preserving ($\pi$ takes $\tilde m$ to $m$ and $\bar\pi$ takes $\tilde m$ to $\bar m$); and
    • (ii) $F : \bar\Delta \to \bar\Delta$ is a Young tower (in the sense of Section 2(d)).
We say that an observation $\psi : \Delta \to \mathbb{R}$ depends only on future coordinates if $\psi(p) = \psi(q)$ whenever $p \sim q$ where $\sim$ is the equivalence relation on $\Delta$ arising from quotienting along stable disks. Such an observation drops down to an observation $\psi : \bar\Delta \to \mathbb{R}$. The following result shows that any Hölder observation on $M$ is related to a Lipschitz observation on $\bar\Delta$ (cf. [37, 8]).
Lemma 3.2 Suppose that $\phi : M \to \mathbb{R}$ is $\gamma$-Hölder with respect to the metric $d$. Then there exist functions $\psi, \chi : \Delta \to \mathbb{R}$ such that
  • (i) $\phi \circ \pi = \psi + \chi - \chi \circ F$,
  • (ii) $\chi$ is bounded,
  • (iii) $\psi$ depends only on future coordinates,
  • (iv) $\psi : \bar\Delta \to \mathbb{R}$ is Lipschitz with respect to the metric $d_\theta$, for $\theta = \alpha^{\gamma/2}$.
Proof Given $p = (x, \ell) \in \Delta$, define $\hat p = (\hat x, \ell)$ where $\hat x$ is the unique point in $W^s(x) \cap W^u$ (see (A2)). Define
$$\chi(p) = \sum_{j=0}^\infty \phi(\pi F^j p) - \phi(\pi F^j \hat p).$$
Note that $\pi F^j p = T^j \pi p = T^{j+\ell} x$ and similarly $\pi F^j \hat p = T^{j+\ell} \hat x$. Since $x$ and $\hat x$ lie in the same stable disk $W^s$, it follows from (A2)(i) that
$$\begin{aligned}
|\chi(p)| &\le \sum_{j=0}^\infty |\phi(\pi F^j p) - \phi(\pi F^j \hat p)| \le |\phi|_\gamma \sum_{j=0}^\infty d(T^{j+\ell} x, T^{j+\ell} \hat x)^\gamma \\
&\le |\phi|_\gamma C^\gamma \sum_{j=0}^\infty \alpha^{j\gamma} = |\phi|_\gamma C^\gamma (1 - \alpha^\gamma)^{-1}.
\end{aligned}$$
Define $\psi = \phi \circ \pi - \chi + \chi \circ F$. Then $\psi(p) = \sum_{j=0}^\infty \phi(\pi F^j \hat p) - \phi(\pi F^j \widehat{Fp})$ depends only upon future coordinates. It remains to check that $\psi$ is Lipschitz with respect to the metric $d_\theta$. In fact, we prove that $\psi$ is Lipschitz with respect to $d_{\theta^{1/2}}$ where $\theta = \alpha^\gamma$.
For any $N \ge 1$, $p, q \in \Delta$,
$$\begin{aligned}
|\psi(p) - \psi(q)| &\le \sum_{j=0}^{N} |\phi(\pi F^j \hat p) - \phi(\pi F^j \hat q)| + \sum_{j=0}^{N-1} |\phi(\pi F^j \widehat{Fp}) - \phi(\pi F^j \widehat{Fq})| \\
&\quad + \sum_{j=N+1}^{\infty} |\phi(\pi F^j \hat p) - \phi(\pi F^{j-1} \widehat{Fp})| + \sum_{j=N+1}^{\infty} |\phi(\pi F^j \hat q) - \phi(\pi F^{j-1} \widehat{Fq})|.
\end{aligned} \tag{3.1}$$
Suppose that $d_\theta(p, q) = d_\theta(\hat p, \hat q) \le \theta^{2N}$. We show that each of these four terms is bounded by $\theta^N \approx d_{\theta^{1/2}}(p, q)$ up to a constant.
Starting with the third term in (3.1), we note that $F \hat p = \widehat{Fp}$ unless $p = (x, R(x) - 1)$, in which case $F \hat p = (f \hat x, 0)$ and $\widehat{Fp} = (\widehat{fx}, 0)$. Then $\pi F^j \hat p = T^{j-1}(f \hat x)$ and $\pi F^{j-1} \widehat{Fp} = T^{j-1}(\widehat{fx})$. Since $f \hat x$ and $\widehat{fx}$ lie in the same stable disk $W^s$, we have $|\phi(\pi F^j \hat p) - \phi(\pi F^{j-1} \widehat{Fp})| \le |\phi|_\gamma C^\gamma \alpha^{(j-1)\gamma}$ so that $\sum_{j=N+1}^\infty |\phi(\pi F^j \hat p) - \phi(\pi F^{j-1} \widehat{Fp})| \le C \theta^N$ as required. Similarly for the fourth term in (3.1).
Next, we consider the first term in (3.1). By assumption, $s(p, q) \ge 2N$ so separation does not take place during the calculation. Write $p = (x, \ell)$, $q = (y, \ell)$. Then $\pi F^j \hat p = T^{j+\ell} \hat x = T^L f^J \hat x$ where $J \le j$ and $L < R(f^J \hat x)$. Similarly, $\pi F^j \hat q = T^L f^J \hat y$.
Hence by (A2)(ii),
$$\begin{aligned}
|\phi(\pi F^j \hat p) - \phi(\pi F^j \hat q)| &\le |\phi|_\gamma\, d(T^L f^J \hat x, T^L f^J \hat y)^\gamma \le |\phi|_\gamma C^\gamma \alpha^{s(f^J \hat x, f^J \hat y)\gamma} \\
&= |\phi|_\gamma C^\gamma \alpha^{[s(\hat x, \hat y) - J]\gamma} \le |\phi|_\gamma C^\gamma \alpha^{[s(\hat x, \hat y) - j]\gamma} \le |\phi|_\gamma C^\gamma \theta^{2N - j},
\end{aligned}$$
so that $\sum_{j=0}^{N} |\phi(\pi F^j \hat p) - \phi(\pi F^j \hat q)| \le C \theta^N$ as required. Similarly for the second term in (3.1).
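The construction of $\chi$ and $\psi$ in this proof can be tried out in a purely symbolic model. The sketch below is an illustration under assumptions that are not in the paper: it replaces $F$ by the left shift on two-sided 0-1 sequences, takes the "stable disk" of $x$ to be the set of sequences with the same coordinates at indices $i \ge 0$, and takes the "unstable leaf" to be the sequences whose past is identically zero, so that $\hat x$ is $x$ with its past zeroed out. The observable `phi` and all function names are hypothetical.

```python
def coord(x, i):
    """i-th coordinate of a two-sided sequence stored as a dict (default symbol 0)."""
    return x.get(i, 0)


def shift(x):
    """Left shift: (sigma x)_i = x_{i+1}."""
    return {i - 1: b for i, b in x.items()}


def hat(x):
    """Replace the past (i < 0) by zeros, keeping the future coordinates of x."""
    return {i: b for i, b in x.items() if i >= 0}


def phi(x):
    """A hypothetical observable depending on coordinates -2, ..., 1."""
    return coord(x, -2) * coord(x, 0) + 3 * coord(x, -1) + coord(x, 1)


def chi(x, terms=4):
    """chi(x) = sum_j [phi(sigma^j x) - phi(sigma^j hat(x))].

    sigma^j x and sigma^j hat(x) agree at all indices >= -j, and phi only
    reads indices >= -2, so every term with j >= 2 vanishes.
    """
    y, total = hat(x), 0
    for _ in range(terms):
        total += phi(x) - phi(y)
        x, y = shift(x), shift(y)
    return total


def psi(x):
    """psi = phi - chi + chi o sigma."""
    return phi(x) - chi(x) + chi(shift(x))


# Two sequences with the same future (indices >= 0) but different pasts.
x = {-2: 1, -1: 1, 0: 1, 1: 0, 2: 1}
y = {-2: 0, -1: 0, 0: 1, 1: 0, 2: 1}
print(phi(x), phi(y), psi(x), psi(y))  # 4 0 4 4
```

Although `phi` distinguishes the two points, `psi` does not: it depends only on future coordinates, while differing from `phi` by the coboundary `chi - chi o shift`.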
Remark 3.3Although Lemma  3.2 is modelled on the treatments in [8, 31, we have not defined a metric on Δ   and hence the usual regularity statement about χ   is missing.
Theorem 3.4Suppose that T : M M   satisfies (A1)–(A4) and assume that R L 2 + δ ( Λ )   for some δ > 0   . Let φ : M R   be a mean zero Hölder observation. Then φ   satisfies the ASIP.
Proof Since $\pi : \Delta \to M$ is measure preserving, it suffices to prove the ASIP for the lift $\tilde\phi = \phi\circ\pi : \Delta \to \mathbb{R}$. By Lemma 3.2, there exists $\psi : \Delta \to \mathbb{R}$ depending only on future coordinates such that $\tilde\phi_N - \psi_N$ is uniformly bounded, and it suffices to prove the ASIP for $\psi$. Since the projection $\bar\pi : \Delta \to \bar\Delta$ is measure preserving, it suffices to prove the ASIP for $\psi$ at the level of $\bar\Delta$. Finally, Lemma 3.2 guarantees that $\psi : \bar\Delta \to \mathbb{R}$ is Lipschitz with respect to $d_\theta$, so it suffices to prove the ASIP for Lipschitz observations on $\bar\Delta$, which is a Young tower by (A4)(ii). Now apply Theorem 2.8.

Nonuniformly hyperbolic flows

Given an $L^1$ roof function $h : M \to \mathbb{R}^+$, we define a suspension flow $T_t : M^h \to M^h$ in the same way that we defined the semiflow in Section 2(e). If $T : M \to M$ satisfies (A1)–(A4), we say that $T_t : M^h \to M^h$ is a nonuniformly hyperbolic flow.
Corollary 3.5 Let $T_t : M^h \to M^h$ be a nonuniformly hyperbolic flow. Assume moreover that the return time function $R$ lies in $L^{2+\delta}(Y)$ and that the roof function $h : M \to \mathbb{R}^+$ is Hölder. Let $\psi : M^h \to \mathbb{R}$ be a mean zero Hölder observation. Then $\psi$ satisfies the ASIP.
Proof This follows immediately from Theorem 3.4, applying Theorem B.1.
Remark 3.6 The weakened hypotheses mentioned in Remark 2.13 apply equally in the nonuniformly hyperbolic setting.

4 Applications

In this section, we indicate a wide range of applications to which the results in this paper apply.
We begin with nonuniformly expanding systems that can be modelled by a Young tower as in Section 2. In the literature it is standard to speak of return time asymptotics in the form $m\{y \in Y : R(y) \ge n\} = O(n^{-\gamma})$. (Recall from Section 2 that $Y$ is the subset used for inducing, equivalently the base of the Young tower.)
Proposition 4.1 If $m\{R \ge n\} = O(n^{-\gamma})$ for some $\gamma > 2$, then $R \in L^{2+\delta}(Y)$ for $\delta \in (0, \gamma - 2)$.
Proof This is immediate from the inequality $E[R^{2+\delta}] \le \sum_{n=0}^{\infty} m\{R^{2+\delta} \ge n\} = \sum_{n=0}^{\infty} m\{R \ge n^{1/(2+\delta)}\}$. By hypothesis the terms are $O(n^{-\gamma/(2+\delta)})$, which is summable when $\gamma/(2+\delta) > 1$, that is, when $\delta < \gamma - 2$.
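The tail-sum inequality behind Proposition 4.1 can be checked numerically on a toy model. The sketch below is only an illustration under stated assumptions: the return time is a hypothetical integer-valued variable with tail exactly $n^{-\gamma}$, truncated at a finite `cutoff` so that every sum is finite, and the function names are not taken from the paper.

```python
def tail(n: int, gamma: float, cutoff: int) -> float:
    """m{R >= n} for a hypothetical return time with tail n^(-gamma), truncated at cutoff."""
    if n <= 1:
        return 1.0
    return float(n) ** (-gamma) if n <= cutoff else 0.0


def tail_sum_exponent(gamma: float, delta: float) -> float:
    """Decay exponent of the terms m{R >= n^(1/(2+delta))}; the series converges if it exceeds 1."""
    return gamma / (2.0 + delta)


def moment_and_tail_bound(gamma: float, delta: float, cutoff: int = 40):
    """Return (E[R^(2+delta)], sum over n >= 0 of m{R^(2+delta) >= n}) for the truncated model."""
    p = 2.0 + delta
    values = range(1, cutoff + 1)
    # Point masses m{R = r} recovered from the tail function.
    pmf = {r: tail(r, gamma, cutoff) - tail(r + 1, gamma, cutoff) for r in values}
    powers = {r: float(r) ** p for r in values}
    moment = sum(pmf[r] * powers[r] for r in values)
    # Terms with n above cutoff^p vanish, so the sum can stop there.
    n_max = int(float(cutoff) ** p)
    bound = sum(
        sum(pmf[r] for r in values if powers[r] >= n)
        for n in range(n_max + 1)
    )
    return moment, bound
```

For instance, with `gamma = 3.0` any `delta` below `1.0` gives `tail_sum_exponent(gamma, delta) > 1`, matching the range $\delta \in (0, \gamma - 2)$ in the proposition.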
Many maps satisfy the condition in Proposition 4.1:
(i) the Alves-Viana map [3] $T : S^1 \times I \to S^1 \times I$,
$$T(\omega, x) = (16\omega,\; a - x^2 + \varepsilon \sin(2\pi\omega)),$$
when $0$ is preperiodic for the map $x \mapsto a - x^2$ and $\varepsilon$ is small enough.
(ii) the Liverani-Saussol-Vaienti (Pomeau-Manneville) maps [27] $T : [0,1] \to [0,1]$,
$$T x = \begin{cases} x(1 + 2^\alpha x^\alpha), & 0 \le x < \tfrac12, \\ 2x - 1, & \tfrac12 \le x < 1, \end{cases}$$
for $0 < \alpha < \tfrac12$.
(iii) certain classes of multimodal maps, Bruin et al. [10].
(iv) a class of expanding circle maps $T : S^1 \to S^1$ of degree $d > 1$ with a neutral fixed point, Young [41, Section 6]: $T$ is $C^1$ on $S^1$ and $C^2$ on $S^1 \setminus \{0\}$, $T' > 1$ on $S^1 \setminus \{0\}$, $T(0) = 0$, $T'(0) = 1$, and for $x \ne 0$, $x T''(x) \sim |x|^\alpha$ for $0 < \alpha < \tfrac12$.
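As a concrete illustration of example (ii), the following sketch implements the intermittent map and iterates it. The names `lsv_map` and `orbit` are hypothetical, and floating-point orbits are only a rough stand-in for true orbits, so this is an aid for experimentation rather than part of any proof.

```python
def lsv_map(x: float, alpha: float) -> float:
    """One step of the map in (ii): x(1 + 2^alpha x^alpha) on [0, 1/2), 2x - 1 otherwise."""
    if not 0.0 <= x <= 1.0:
        raise ValueError("x must lie in [0, 1]")
    if x < 0.5:
        # x * (1 + 2^alpha * x^alpha), written with (2x)^alpha for brevity.
        return x * (1.0 + (2.0 * x) ** alpha)
    return 2.0 * x - 1.0


def orbit(x0: float, alpha: float, n: int) -> list:
    """Return [x0, T x0, ..., T^n x0] for the map above."""
    points = [x0]
    for _ in range(n):
        points.append(lsv_map(points[-1], alpha))
    return points
```

Orbits started near the neutral fixed point $0$ leave its neighbourhood slowly, which is the mechanism behind the polynomial return time tails for these maps.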
Applying Theorem 1.2, we obtain the ASIP for Hölder observations for the systems in (i)–(iv) above. For example, in (iii) and (iv) we obtain the ASIP under the same conditions for which [10] and [41] obtain the CLT.
Next, we recall examples of nonuniformly hyperbolic systems that have been modelled by towers. Consider the following classes of $C^{1+\varepsilon}$ diffeomorphisms treated in Young [40] (see also Baladi [4, §4.3]):
(v) Lozi maps and certain piecewise hyperbolic maps [40, 13].
(vi) a class of Hénon maps [6, 7].
(vii) some partially hyperbolic diffeomorphisms with a mostly contracting direction [12, 18].
In these examples, the return time asymptotics are exponential so certainly $R \in L^{2+\delta}$. By Theorem 3.4, we obtain the ASIP for Hölder observations for the systems in (v)–(vii) above.

Billiard maps and Lorentz flows

Finally, we consider the application to the planar periodic Lorentz gas discussed in the introduction. Under the finite horizon condition, Young [40] demonstrated that the billiard map (which is the Poincaré map for the flow) is nonuniformly hyperbolic with exponential return time asymptotics. As a result, Young established exponential decay of correlations for such billiard maps, resolving a long-standing (and controversial) open question. Chernov [14] extended Young's method to obtain the same result for infinite horizons.
For our purposes, the weaker conclusion that $R \in L^{2+\delta}$ is again sufficient. Hence, by the results in [14, 40], the first statement of Theorem 1.3 is an immediate consequence of Theorem 3.4.
For the flow itself, the finite horizon condition is crucial since even the CLT is unlikely in the infinite horizon case. Assuming finite horizons, the roof function $h$ is uniformly bounded and piecewise Hölder. Since $h$ is not uniformly Hölder, Corollary 3.5 does not apply directly, but the result is easily modified as in Remarks 2.13 and 3.6 to include such roof functions. Hence, we obtain the second statement of Theorem 1.3.

A ASIP for functions of mixing sequences

Here is a special case of Philipp & Stout [32, Theorem 7.1] adapted to dynamical systems terminology. The notation is as in Section 2(c).
Theorem A.1 (Philipp & Stout) Assume that there exist $\delta \in (0, 2]$, $\sigma^2 > 0$ and $C > 0$ such that for all $k, N \ge 1$,
  • (i) $v \in L^{2+\delta}(\Lambda)$ and $\int_\Lambda v \, dm = 0$,
  • (ii) $\int_\Lambda v_N^2 \, dm = \sigma^2 N + O(N^{1-\delta/30})$,
  • (iii) $\sum_{a \in \alpha_0^{k-1}} \int_a \bigl| v - \frac{1}{m(a)} \int_a v \, dm \bigr|^{2+\delta} dm \le C k^{-(2+7/\delta)(2+\delta)}$,
  • (iv) $| m(a \cap f^{-(N+k)}(b)) - m(a) m(b) | \le C N^{-168(1+2/\delta)}$ for all $a \in \alpha_0^{k-1}$ and all measurable sets $b$.
Then $v_N = W(N) + O(N^{1/2-\delta/600})$.
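To get a feel for how demanding the hypotheses of Theorem A.1 are, the exponents can be tabulated as functions of $\delta$. The helper below is a hypothetical convenience, not part of the theorem; it simply evaluates the four expressions appearing in (ii), (iii), (iv) and the conclusion, using exact rational arithmetic.

```python
from fractions import Fraction


def philipp_stout_exponents(delta) -> dict:
    """Evaluate the exponents in Theorem A.1 for a given delta in (0, 2]."""
    delta = Fraction(delta)
    if not 0 < delta <= 2:
        raise ValueError("delta must lie in (0, 2]")
    return {
        # (ii): the variance error term is O(N^(1 - delta/30)).
        "variance_error": 1 - delta / 30,
        # (iii): the approximation error decays like k^(-(2 + 7/delta)(2 + delta)).
        "approximation_rate": (2 + 7 / delta) * (2 + delta),
        # (iv): the mixing error decays like N^(-168(1 + 2/delta)).
        "mixing_rate": 168 * (1 + 2 / delta),
        # Conclusion: the ASIP error is O(N^(1/2 - delta/600)).
        "asip_error": Fraction(1, 2) - delta / 600,
    }
```

Even at the largest admissible value $\delta = 2$ the mixing rate in (iv) is polynomial of order 336, while the gain over $N^{1/2}$ in the conclusion is only $N^{-1/300}$.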

B ASIP for suspensions

Suppose that $(\Lambda, m)$ is a probability space and that $f : \Lambda \to \Lambda$ is a measure-preserving transformation. Let $h : \Lambda \to \mathbb{R}^+$ be a roof function and suppose that $f_t : \Lambda^h \to \Lambda^h$ is the corresponding suspension (semi)flow as in Section 2(e). The following result is a special case of [30, Theorem 4.2].
Theorem B.1 (Melbourne & Török) Let $\delta > 0$. Suppose that $h \in L^{2+\delta}(\Lambda)$ and that $\sum_{j=0}^{N-1} h \circ f^j = N \bar h + o(N^{1-\delta})$ as $N \to \infty$ almost everywhere.
Suppose that $\psi : \Lambda^h \to \mathbb{R}$ lies in $L^\infty(\Lambda^h)$ and has mean zero. Define $\phi : \Lambda \to \mathbb{R}$ by $\phi(x) = \int_0^{h(x)} \psi(f_t x) \, dt$. If $\phi$ satisfies the ASIP on $\Lambda$ with variance $\sigma_1^2$, then $\psi$ satisfies the ASIP on $\Lambda^h$ with variance $\sigma^2 = \sigma_1^2 / \bar h$.
Theorem B.1 is easily modified for discrete suspensions. Let $R : \Lambda \to \mathbb{Z}^+$ be an $L^1$ return time function and form the discrete suspension map $F : \Delta \to \Delta$ as in Section 2(d).
Corollary B.2 Let $\delta > 0$. Suppose that $R \in L^{2+\delta}(\Lambda)$ and that $\sum_{j=0}^{N-1} R \circ f^j = N \bar R + o(N^{1-\delta})$ as $N \to \infty$ almost everywhere.
Suppose that $\phi : \Delta \to \mathbb{R}$ lies in $L^\infty(\Delta)$ and has mean zero. Define $\Phi : \Lambda \to \mathbb{R}$ by $\Phi(x) = \sum_{j=0}^{R(x)-1} \phi(f_j x)$. If $\Phi$ satisfies the ASIP on $\Lambda$ with variance $\sigma_1^2$, then $\phi$ satisfies the ASIP on $\Delta$ with variance $\sigma^2 = \sigma_1^2 / \bar R$.
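The passage from an observable on the tower to the induced observable on the base in Corollary B.2 can be sketched in a few lines. This is an illustration under assumptions: points of $\Delta$ are represented as pairs `(x, level)` with `0 <= level < R(x)`, and the names `induced_observable` and `suspension_variance` are hypothetical.

```python
from typing import Callable


def induced_observable(
    phi: Callable[[float, int], float],
    return_time: Callable[[float], int],
) -> Callable[[float], float]:
    """Build Phi(x) as the sum of phi over the levels 0, ..., R(x) - 1 above x."""

    def big_phi(x: float) -> float:
        return sum(phi(x, level) for level in range(return_time(x)))

    return big_phi


def suspension_variance(sigma1_sq: float, mean_return_time: float) -> float:
    """Variance on the tower obtained from the base variance: sigma^2 = sigma_1^2 / R-bar."""
    if mean_return_time <= 0:
        raise ValueError("mean return time must be positive")
    return sigma1_sq / mean_return_time
```

With the constant observable `phi = 1` the induced observable reduces to the return time itself, which is a quick sanity check on the indexing of levels.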

Acknowledgements

This research was supported in part by EPSRC Grant GR/S11862/01. IM is greatly indebted to UH for the use of e-mail, given that pine is currently not supported on the University of Surrey network.
References

  1. J. Aaronson. An Introduction to Infinite Ergodic Theory. Math. Surveys and Monographs 50, Amer. Math. Soc., 1997.
  2. J. Aaronson and M. Denker. Local limit theorems for partial sums of stationary sequences generated by Gibbs-Markov maps. Stoch. Dyn. 1 (2001) 193–237.
  3. J. Alves, S. Luzzatto and V. Pinheiro. Markov structures and decay of correlations for non-uniformly expanding dynamical systems. Ann. Inst. H. Poincaré, Anal. Non Linéaire. To appear.
  4. V. Baladi. Positive Transfer Operators and Decay of Correlations. Advanced Series in Nonlinear Dynamics 16, World Scientific, Singapore, 2000.
  5. V. Baladi. Decay of correlations. Smooth Ergodicity Theory and its Applications (A. Katok et al., ed.), Proc. Symp. Pure Math. 69, Amer. Math. Soc., 2001, pp. 297–325.
  6. M. Benedicks and L.-S. Young. Absolutely continuous invariant measures and random perturbations for certain one-dimensional maps. Ergod. Th. & Dynam. Sys. 12 (1992) 13–37.
  7. M. Benedicks and L.-S. Young. Sinai-Bowen-Ruelle measures for certain Hénon maps. Invent. Math. 112 (1993) 541–576.
  8. R. Bowen. Equilibrium States and the Ergodic Theory of Anosov Diffeomorphisms. Lecture Notes in Math. 470, Springer, Berlin, 1975.
  9. H. Bruin, M. Holland and M. Nicol. Livsic regularity for Markov systems. Preprint.
  10. H. Bruin, S. Luzzatto and S. van Strien. Decay of correlations in one-dimensional dynamics. Ann. Sci. École Norm. Sup. 36 (2003) 621–646.
  11. L. A. Bunimovich, Y. G. Sinaĭ and N. I. Chernov. Statistical properties of two-dimensional hyperbolic billiards. Uspekhi Mat. Nauk 46 (1991) 43–92.
  12. A. Castro. Backward inducing and exponential decay of correlations for partially hyperbolic attractors with mostly contracting direction. Ph. D. Thesis, IMPA (1998).
  13. N. Chernov. Statistical properties of piecewise smooth hyperbolic systems in high dimensions. Discrete Contin. Dynam. Systems 5 (1999) 425–448.
  14. N. Chernov. Decay of correlations and dispersing billiards. J. Statist. Phys. 94 (1999) 513–556.
  15. N. Chernov and L.-S. Young. Decay of correlations for Lorentz gases and hard balls. Hard ball systems and the Lorentz gas. Encyclopaedia Math. Sci. 101, Springer, Berlin, 2000, pp. 89–120.
  16. J.-P. Conze and S. Le Borgne. Méthode de martingales et flot géodésique sur une surface de courbure constante négative. Ergod. Th. & Dynam. Sys. 21 (2001) 421–441.
  17. M. Denker and W. Philipp. Approximation by Brownian motion for Gibbs measures and flows under a function. Ergod. Th. & Dynam. Sys. 4 (1984) 541–552.
  18. D. Dolgopyat. On dynamics of mostly contracting diffeomorphisms. Commun. Math. Phys. 213 (2000) 181–201.
  19. M. J. Field, I. Melbourne and A. Török. Decay of correlations, central limit theorems and approximation by Brownian motion for compact Lie group extensions. Ergod. Th. & Dynam. Sys. 23 (2003) 87–110.
  20. M. I. Gordin. The central limit theorem for stationary processes. Soviet Math. Dokl. 10 (1969) 1174–1176.
  21. S. Gouëzel. Statistical properties of a skew product with a curve of neutral points. Preprint, 2004.
  22. S. Gouëzel. Vitesse de décorrélation et théorèmes limites pour les applications non uniformément dilatantes. Ph. D. Thesis, Ecole Normale Supérieure, 2004.
  23. H. Hennion. Sur un théorème spectral et son application aux noyaux lipchitziens. Proc. Amer. Math. Soc. 118 (1993) 627–634.
  24. F. Hofbauer and G. Keller. Ergodic properties of invariant measures for piecewise monotonic transformations. Math. Z. 180 (1982) 119–140.
  25. G. Keller. Un théorème de la limite centrale pour une classe de transformations monotones par morceaux. C. R. Acad. Sci. Paris 291 (1980) 155–158.
  26. C. Liverani. Central limit theorem for deterministic systems. International Conference on Dynamical Systems (F. Ledrappier, J. Lewowicz and S. Newhouse, eds.), Pitman Research Notes in Math. 362, Longman Group Ltd, Harlow, 1996, pp. 56–75.
  27. C. Liverani, B. Saussol and S. Vaienti. A probabilistic approach to intermittency. Ergod. Th. & Dynam. Sys. 19 (1999) 671–685.
  28. I. Melbourne and M. Nicol. Statistical properties of endomorphisms and compact group extensions. J. London Math. Soc. 70 (2004) 427–446.
  29. I. Melbourne and A. Török. Statistical limit theorems for suspension flows. Israel J. Math. 144 (2004) 191–210.
  30. I. Melbourne and A. Török. Central limit theorems and invariance principles for time-one maps of hyperbolic flows. Commun. Math. Phys. 229 (2002) 57–71.
  31. W. Parry and M. Pollicott. Zeta Functions and the Periodic Orbit Structure of Hyperbolic Dynamics. Astérisque 187-188, Société Mathématique de France, Montrouge, 1990.
  32. W. Philipp and W. F. Stout. Almost Sure Invariance Principles for Partial Sums of Weakly Dependent Random Variables. Memoirs of the Amer. Math. Soc. 161, Amer. Math. Soc., Providence, RI, 1975.
  33. M. Pollicott and R. Sharp. Invariance principles for interval maps with an indifferent fixed point. Commun. Math. Phys. 229 (2002) 337–346.
  34. M. Ratner. The central limit theorem for geodesic flows on $n$-dimensional manifolds of negative curvature. Israel J. Math. 16 (1973) 181–197.
  35. D. Ruelle. Thermodynamic Formalism. Encyclopedia of Math. and its Applications 5, Addison Wesley, Massachusetts, 1978.
  36. Y. G. Sinaĭ. Dynamical systems with elastic reflections. Ergodic properties of dispersing billiards. Uspehi Mat. Nauk 25 (1970) 141–192.
  37. Y. G. Sinaĭ. Gibbs measures in ergodic theory. Russ. Math. Surv. 27 (1972) 21–70.
  38. M. Viana. Stochastic dynamics of deterministic systems. Col. Bras. de Matemática, 1997.
  39. C. P. Walkden. Invariance principles for iterated maps that contract on average. Preprint, 2003.
  40. L.-S. Young. Statistical properties of dynamical systems with some hyperbolicity. Ann. of Math. 147 (1998) 585–650.
  41. L.-S. Young. Recurrence times and rates of mixing. Israel J. Math. 110 (1999) 153–188.