
In this paper we are interested in parametrizations and combinatorial descriptions of positive definite kernels on the set

N_{0}

of non-negative integers. Positive definite kernels are complex valued maps

K : N_{0} \times N_{0} \to C

with the property that for each

n > 0

and each choice of elements

p_{1}, \dots, p_{n} \in N_{0}

and complex numbers

λ_{1}, \dots, λ_{n}

, we have

\begin{matrix} \sum_{k, j = 1}^{n} K (p_{k}, p_{j}) λ_{j} {\bar{λ}}_{k} \geq 0 . \end{matrix}

(1.1)

A fundamental result of Kolmogorov [5] provides a Hilbert space interpretation of positive definite kernels as Gram kernels: there exists a Hilbert space

ℋ

and elements

v (n) \in ℋ

n \geq 0

, such that

\begin{matrix} K (i, j) = 〈 v (j), v (i) 〉 . \end{matrix}

(1.2)
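The converse direction, that every Gram kernel is positive definite, is a one-line computation. The sketch below assumes the inner product is linear in its first argument (a convention the text does not fix explicitly):

```latex
\sum_{k,j=1}^{n} K(p_k, p_j)\, \lambda_j \bar{\lambda}_k
  = \sum_{k,j=1}^{n} \langle v(p_j), v(p_k) \rangle\, \lambda_j \bar{\lambda}_k
  = \Big\langle \sum_{j=1}^{n} \lambda_j v(p_j), \sum_{k=1}^{n} \lambda_k v(p_k) \Big\rangle
  = \Big\| \sum_{j=1}^{n} \lambda_j v(p_j) \Big\|^{2} \geq 0 .
```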

Two of the best known examples of positive definite kernels are those of Toeplitz type, for which

K (i + l, j + l) = K (i, j)

i, j, l \in N_{0},

and those of Hankel type, for which

K (i, j + l) = K (i + l, j)

i, j, l \in N_{0} .

In both these cases the representation (1.2) can be improved by more specialized descriptions that might be called operator models of the kernels. Thus, if

K

is Toeplitz (for simplicity we assume

K (0, 0) = 1

and all the inequalities in (1.1) are strict) then there exists an isometric operator

W

, written in upper Hessenberg form, such that

\begin{matrix} K (i, j) = e_{0} W^{j - i} e_{0}^{*}, \end{matrix}

(1.3)

where

e_{0} = [\begin{matrix} 1 & 0 & . . . \end{matrix}]

. Likewise, if

K

is Hankel (with the same simplifications as above) then there exists a symmetric operator

J

, written in tridiagonal form, such that

\begin{matrix} K (i, j) = e_{0} J^{i + j} e_{0}^{*} . \end{matrix}

(1.4)

Our goal is to extend both models (1.3) and (1.4) to arbitrary positive definite kernels on

N_{0}

without Toeplitz or Hankel assumptions. These models will produce parametrizations of the kernels and we will give combinatorial descriptions of these parametrizations in terms of lattice paths.

2 Isometric Hessenberg models and Dyck paths

In this section we show that any positive definite kernel on

N_{0}

has a Hessenberg model and then we show how to relate this model to the set of Dyck paths. In order to simplify the notation we consider only positive definite kernels for which all the inequalities in (1.1) are strict, in which case we say that the kernel is strictly positive definite, and we also assume

K (l, l) = 1

for all

l \geq 0

. Both these assumptions can be easily removed. In addition, all our considerations can be easily adapted to kernels

K : N_{0} \times N_{0} \to ℒ (ℰ)

, where

ℒ (ℰ)

denotes the set of linear bounded operators on the Hilbert space

ℰ

. We now introduce the elements necessary for the presentation of the results. For a complex number

γ

with

| γ | \leq 1

we define its defect by

d_{γ} = (1 - | γ |^{2})^{1 / 2}

and its Julia matrix by

J (γ) = [\begin{matrix} γ & d_{γ} \\ d_{γ} & - \bar{γ} \end{matrix}] .

We note that the Julia matrix is unitary and this construction can be extended to certain families of complex numbers as follows. Let

Γ = {γ_{k, j}}_{0 \leq k < j}

be a family of complex numbers such that

| γ_{k, j} | < 1

for all

k < j

. For simplicity we will write

d_{k, j}

instead of

d_{γ_{k, j}}

. We can now describe the Hessenberg model. First we define for

k < j

V_{k, j} (Γ) = (J (γ_{k, k + 1}) \oplus I_{j - k - 1}) (I_{1} \oplus J (γ_{k, k + 2}) \oplus I_{j - k - 2}) \dots (I_{j - k - 1} \oplus J (γ_{k, j})),

where

I_{l}

denotes the

l \times l

identity matrix. Then we introduce the operators

W_{k} (Γ)

on the Hilbert space

l^{2} (N_{0})

of square-summable sequences by the formula:

W_{k} (Γ) = s - {lim}_{j \to \infty} (V_{k, j} (Γ) \oplus 0), k \geq 0,

where

s - lim

denotes the strong operator limit. It is easily seen that each

W_{k} (Γ)

is an isometry with upper Hessenberg matrix with respect to the standard basis of

l^{2} (N_{0})

, that is, if

(W_{k} (Γ))_{i, j}

denotes the

(i, j)

entry of

W_{k} (Γ)

, then

(W_{k} (Γ))_{i, j} = 0

for

i > j + 1

. It is also useful to consider the unitary matrices

U_{k, j} (Γ)

defined recursively by

U_{k, k} (Γ) = I_{1}

and for

k < j

U_{k, j} (Γ) = V_{k, j} (Γ) (U_{k + 1, j} (Γ) \oplus I_{1}) .
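The definitions of the Julia matrix, of V_{k, j} (Γ) and of U_{k, j} (Γ) can be tried out numerically. The following is a minimal sketch in plain Python, not an implementation taken from the references: the helper names (`julia`, `direct_sum`, `V`, `U`, `is_unitary`) and the sample values of the parameters are illustrative assumptions. It checks that U_{0, 2} (Γ) is unitary and that its (0, 0) entry equals γ_{0, 1} γ_{1, 2} + d_{0, 1} γ_{0, 2} d_{1, 2}.

```python
import math

def julia(g):
    """Julia matrix J(g) = [[g, d], [d, -conj(g)]], with defect d = sqrt(1 - |g|^2)."""
    g = complex(g)
    d = math.sqrt(1.0 - abs(g) ** 2)
    return [[g, d], [d, -g.conjugate()]]

def identity(n):
    return [[1.0 if r == c else 0.0 for c in range(n)] for r in range(n)]

def direct_sum(a, b):
    """Block-diagonal matrix a ⊕ b (either block may be 0 x 0)."""
    na, nb = len(a), len(b)
    out = [[0.0] * (na + nb) for _ in range(na + nb)]
    for r in range(na):
        out[r][:na] = a[r]
    for r in range(nb):
        out[na + r][na:] = b[r]
    return out

def matmul(a, b):
    return [[sum(a[r][k] * b[k][c] for k in range(len(b)))
             for c in range(len(b[0]))] for r in range(len(a))]

def V(gamma, k, j):
    """V_{k,j}: ordered product of the factors I_t ⊕ J(γ_{k,k+1+t}) ⊕ I_{j-k-1-t}."""
    out = identity(j - k + 1)
    for t in range(j - k):
        block = direct_sum(identity(t), julia(gamma[(k, k + 1 + t)]))
        out = matmul(out, direct_sum(block, identity(j - k - 1 - t)))
    return out

def U(gamma, k, j):
    """U_{k,k} = I_1 and U_{k,j} = V_{k,j} (U_{k+1,j} ⊕ I_1) for k < j."""
    if k == j:
        return identity(1)
    return matmul(V(gamma, k, j), direct_sum(U(gamma, k + 1, j), identity(1)))

def is_unitary(m, tol=1e-12):
    """Check m m^* = I entrywise up to the tolerance."""
    n = len(m)
    return all(
        abs(sum(m[r][k] * complex(m[c][k]).conjugate() for k in range(n))
            - (1.0 if r == c else 0.0)) <= tol
        for r in range(n) for c in range(n))

# Illustrative parameters (arbitrary values of modulus < 1).
gamma = {(0, 1): 0.3 + 0.2j, (0, 2): -0.5j, (1, 2): 0.1 - 0.4j}
defect = lambda g: math.sqrt(1.0 - abs(g) ** 2)
k02 = U(gamma, 0, 2)[0][0]
closed_form = (gamma[0, 1] * gamma[1, 2]
               + defect(gamma[0, 1]) * gamma[0, 2] * defect(gamma[1, 2]))
print(is_unitary(U(gamma, 0, 2)), abs(k02 - closed_form) < 1e-12)
```

Matrices are stored as nested lists so that the sketch needs nothing beyond the standard library; for larger experiments a numerical array library would be the more natural choice.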

We can now prove the existence of an isometric Hessenberg model for any strictly positive definite kernel on

N_{0}

Theorem 2.1. If

K

is a strictly positive definite kernel on

N_{0}

with

K (l, l) = 1

for all

l \geq 0

, then there exists a family

{W_{k}}_{k \geq 0}

of isometric Hessenberg operators such that for

j > i

\begin{matrix} K (i, j) = e_{0} W_{i} (Γ) W_{i + 1} (Γ) \dots W_{j - 1} (Γ) e_{0}^{*} . \end{matrix}

(2.1)

Proof. This is just a reformulation of Theorem 2.3 in [2] (see also [3] , Chapter 1). Thus, by Theorem 1.3 in [2] , there exists a uniquely determined family $Γ = {γ_{k, j}}_{0 \leq k < j}$ of complex numbers such that $K (i, j) = {(U_{i, j} (Γ))}_{0, 0}$ for $i < j$ . Then it is easily seen from the definitions that $e_{0} W_{i} (Γ) W_{i + 1} (Γ) \dots W_{j - 1} (Γ) e_{0}^{*} = {(U_{i, j} (Γ))}_{0, 0} .$ □

When

K

is a Toeplitz kernel, (2.1) reduces to (1.3) and the parameters

γ_{k, j}

satisfy

γ_{i + l, j + l} = γ_{i, j}

for

i < j

l \geq 1

. The numbers

γ_{n} = γ_{0, n}

n \geq 1

, are called the Szegö parameters of

K

(other names, such as Schur parameters, reflection coefficients, or Verblunsky parameters, are currently used in the literature), and they play a central role in the theory of orthogonal polynomials on the unit circle and its many applications; see [8] and, for a recent account, [6] (which also contains a detailed discussion of the Hessenberg model in the Toeplitz case).

Next we explain the connection between Hessenberg models and Dyck paths. A Dyck path of length

2 k

is a path in the positive quadrant of the lattice

Z^{2}

which starts at

(0, 0)

, ends at

(2 k, 0)

, and consists of rise steps

↗

and fall steps

↘

(see Figure 1). For more information on Dyck paths and their combinatorics, see [7] .

Figure 1 . A Dyck path of length

8

and height

3

Let

D_{k}

be the set of Dyck paths of length

2 k

and let

A_{k}

be the set of points

(l, q)

q > 0

, with the property that there exists

p \in D_{k}

with

(l, q) \in p

. It is seen that

A_{k} = {(j + i, j - i) | 0 \leq i < j \leq k} .

Also, we notice that if

p \in D_{k}

and

x = (l, q) \in p

, then there are only four types of behaviour of

p

about

x

: (I) a rise step followed by a fall step; (II) a fall step followed by a rise step; (III) two consecutive rise steps; (IV) two consecutive fall steps (see Figure 2).

Figure 2 . Behaviour of a Dyck path about a vertex

x \in A_{k}

Consequently, for each pair

i, j

with

0 \leq i < j \leq k

we define the function

a_{i, j} : D_{k} \to C

a_{i, j} (p) = \begin{cases} 1 & \text{if } x = (j + i, j - i) \notin p; \\ γ_{i, j} & \text{if } x = (j + i, j - i) \in p \text{ and (I) holds}; \\ - \bar{γ}_{i, j} & \text{if } x = (j + i, j - i) \in p \text{ and (II) holds}; \\ d_{i, j} & \text{if } x = (j + i, j - i) \in p \text{ and either (III) or (IV) holds}. \end{cases}

Let

p

be a Dyck path in

D_{k}

with the property that

(2 l, 0) \in p

. The restriction of

p

from

(2 l, 0)

to

(2 k, 0)

is called a Dyck subpath starting at

(2 l, 0)

in

D_{k}

. The set of all possible Dyck subpaths starting at

(2 l, 0)

in

D_{k}

is denoted by

D_{k}^{l}

and there exists a bijection between

D_{k}^{l}

and

D_{k - l}

. This implies that the number of elements in

D_{k}^{l}

is given by the Catalan number

C_{k - l} = \frac{1}{k - l + 1} (\begin{matrix} 2 (k - l) \\ k - l \end{matrix});

also,

D_{k}^{0} = D_{k}

. If

q \in D_{k}^{l}

then there could be many Dyck paths whose restrictions at

(2 l, 0)

coincide with

q

, however if

p_{1}

and

p_{2}

are two such Dyck paths then

a_{i, j} (p_{1}) = a_{i, j} (p_{2})

for

j + i > 2 l

. We will write

a_{i, j} (q)

in order to denote this common value.
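The combinatorial facts used above (the Catalan count of Dyck paths and the description of the vertex set A_{k}) are easy to confirm by brute force for small k. The sketch below is only an illustration; the encoding of a path as a string of 'U' (rise) and 'D' (fall) steps and the function names are assumptions made here.

```python
from math import comb

def dyck_paths(k):
    """Yield every Dyck path of length 2k as a string of 'U' (rise) and 'D' (fall)."""
    def extend(prefix, height, remaining):
        if remaining == 0:
            yield prefix
            return
        # A rise is allowed only if the path can still return to height 0.
        if height + 1 <= remaining - 1:
            yield from extend(prefix + "U", height + 1, remaining - 1)
        # A fall is allowed only if the path stays in the positive quadrant.
        if height > 0:
            yield from extend(prefix + "D", height - 1, remaining - 1)
    yield from extend("", 0, 2 * k)

def vertices(path):
    """Lattice points (x, y) visited by the path, including both endpoints."""
    pts, y = [(0, 0)], 0
    for x, step in enumerate(path, start=1):
        y += 1 if step == "U" else -1
        pts.append((x, y))
    return pts

def catalan(n):
    return comb(2 * n, n) // (n + 1)

def positive_vertices(k):
    """A_k: all points of positive height lying on some Dyck path of length 2k."""
    return {pt for p in dyck_paths(k) for pt in vertices(p) if pt[1] > 0}

for k in range(1, 6):
    expected = {(j + i, j - i) for j in range(1, k + 1) for i in range(j)}
    print(k, len(list(dyck_paths(k))) == catalan(k), positive_vertices(k) == expected)
```

Since a Dyck subpath starting at (2 l, 0) is a translate of a path of length 2 (k - l), the same enumeration with k replaced by k - l lists D_{k}^{l}.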

We now describe the structure of the strictly positive definite kernels on

N_{0}

Theorem 2.2. The kernel

K

on

N_{0}

with

K (l, l) = 1

for all

l

is strictly positive definite if and only if there is a family

{γ_{k, j}}_{0 \leq k < j}

of complex numbers,

| γ_{k, j} | < 1

for all

k < j

, such that

\begin{matrix} K (l, m) = \sum_{q \in D_{m}^{l}} \prod_{l \leq i < j \leq m} a_{i, j} (q) . \end{matrix}

(2.2)

Proof. Half of this result was proved in [1] , but we give some details here for completeness. Assume that

K

is strictly positive definite. By Theorem 1.3 and Theorem 2.3 in [2] there exists a uniquely determined family

Γ = {γ_{k, j}}_{0 \leq k < j}

of complex numbers such that

K (l, m) = (U_{l, m} (Γ))_{0, 0}

, the

(0, 0)

entry of the matrix

U_{l, m} (Γ)

. It is convenient to visualize this relation by means of a so-called transmission line, as shown in Figure 3 for

K (0, 2)

and

K (0, 3)

Figure 3 . Transmission line for

K (0, 3)

Thus, if

1

is the input at

A

then at

B

we read off the expression of

K (0, 3)

in terms of the parameters

γ_{0, 1}

γ_{0, 2}

γ_{0, 3}

γ_{1, 2}

γ_{1, 3}

γ_{2, 3}

and their defects,

\begin{matrix} K (0, 3) = & γ_{0, 1} γ_{1, 2} γ_{2, 3} + γ_{0, 1} d_{1, 2} γ_{1, 3} d_{2, 3} + d_{0, 1} γ_{0, 2} d_{1, 2} γ_{2, 3} \\ & - d_{0, 1} γ_{0, 2} {\bar{γ}}_{1, 2} γ_{1, 3} d_{2, 3} + d_{0, 1} d_{0, 2} γ_{0, 3} d_{1, 3} d_{2, 3} . \end{matrix}

Likewise, if the input at

C

is

1

, then the output at

B

is now the expression of

K (0, 2)

K (0, 2) = γ_{0, 1} γ_{1, 2} + d_{0, 1} γ_{0, 2} d_{1, 2}

(for more details see [3] ). Each path in the transmission line contributes an additive term in

K (l, m)

. Going from a path in the transmission line to a Dyck path is easy: each box associated with a Julia matrix corresponds to a point in

A_{k}

, see Figure 4. It is also clear that each additive term in

K (l, m)

is given by

\prod_{l \leq i < j \leq m} a_{i, j} (q)

for some

q \in D_{m}^{l}

. This gives (2.2).

Conversely, given a family

Γ = {γ_{k, j}}_{0 \leq k < j}

of complex numbers with

| γ_{k, j} | < 1

for all

k < j

, we define

K (l, m) = \sum_{q \in D_{m}^{l}} \prod_{l \leq i < j \leq m} a_{i, j} (q) .

By the first part of the proof, this gives

K (l, m) = (U_{l, m} (Γ))_{0, 0}

, and it remains to show that

K

is a strictly positive definite kernel on

N_{0}

. By Theorem 2.1,

K (i, j) = e_{0} W_{i} (Γ) W_{i + 1} (Γ) \dots W_{j - 1} (Γ) e_{0}^{*}

for

i < j

. This relation implies that

K

is a positive definite kernel. Also, by Proposition 1.7 in [2] ,

det {[K (l, m)]}_{l, m = 0}^{n} = \prod_{0 \leq i < j \leq n} d_{i, j}^{2} > 0,

so that

K

is a strictly positive definite kernel on

N_{0}

. □
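Formula (2.2) can be checked against the explicit expansions of K (0, 2) and K (0, 3) given in the proof. The following sketch enumerates Dyck paths, multiplies the labels a_{i, j} along each path, and sums; the path encoding, the function names and the numerical values of the parameters are illustrative assumptions, not part of the original argument.

```python
import math

def dyck_paths(k):
    """Yield every Dyck path of length 2k as a string of 'U' (rise) and 'D' (fall)."""
    def extend(prefix, height, remaining):
        if remaining == 0:
            yield prefix
            return
        if height + 1 <= remaining - 1:
            yield from extend(prefix + "U", height + 1, remaining - 1)
        if height > 0:
            yield from extend(prefix + "D", height - 1, remaining - 1)
    yield from extend("", 0, 2 * k)

def path_weight(path, gamma, l=0):
    """Product of a_{i,j}(q) over the positive-height vertices of a subpath starting at (2l, 0)."""
    heights = [0]
    for step in path:
        heights.append(heights[-1] + (1 if step == "U" else -1))
    w = 1.0
    for x in range(1, len(path)):
        h = heights[x]
        if h == 0:
            continue  # vertices on the axis carry no label
        j, i = (2 * l + x + h) // 2, (2 * l + x - h) // 2   # vertex is (j + i, j - i)
        g = complex(gamma[(i, j)])
        before, after = path[x - 1], path[x]
        if before == "U" and after == "D":        # type (I): rise then fall
            w *= g
        elif before == "D" and after == "U":      # type (II): fall then rise
            w *= -g.conjugate()
        else:                                     # types (III) and (IV)
            w *= math.sqrt(1.0 - abs(g) ** 2)
    return w

def kernel_entry(gamma, l, m):
    """Right-hand side of (2.2): sum over Dyck subpaths from (2l, 0) to (2m, 0)."""
    return sum(path_weight(q, gamma, l) for q in dyck_paths(m - l))

# Illustrative parameters (arbitrary values of modulus < 1).
gamma = {(0, 1): 0.3 + 0.2j, (0, 2): -0.5j, (0, 3): 0.25 + 0.1j,
         (1, 2): 0.1 - 0.4j, (1, 3): -0.6, (2, 3): 0.2j}
g = gamma
d = {key: math.sqrt(1.0 - abs(val) ** 2) for key, val in gamma.items()}
k03 = (g[0, 1] * g[1, 2] * g[2, 3] + g[0, 1] * d[1, 2] * g[1, 3] * d[2, 3]
       + d[0, 1] * g[0, 2] * d[1, 2] * g[2, 3]
       - d[0, 1] * g[0, 2] * g[1, 2].conjugate() * g[1, 3] * d[2, 3]
       + d[0, 1] * d[0, 2] * g[0, 3] * d[1, 3] * d[2, 3])
print(abs(kernel_entry(gamma, 0, 3) - k03) < 1e-12)
```

The same function with l > 0 evaluates the entries K (l, m) away from the first row, since the labels of a subpath only involve γ_{i, j} with l \leq i < j \leq m.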

Figure 4 . From a path in a transmission line to a Dyck path

Remarks

(a)

It is quite simple to remove the two restrictions on

K

considered in Theorem 2.2. First, formula (2.2) still provides a one-to-one correspondence between the set of positive definite kernels on

N_{0}

with

K (l, l) = 1

for all

l

and the set

S

of families

{γ_{k, j}}_{0 \leq k < j}

of complex numbers with the properties:

| γ_{k, j} | \leq 1

for all

k < j

; if

| γ_{k, j} | = 1

for some pair

(k, j)

, then

γ_{l, j} = 0

for

l < k

and

γ_{k, m} = 0

for

m > j

(b)

If we remove the assumption that

K (l, l) = 1

for all

l

, then the diagonal elements

K (l, l)

of the kernel

K

could be considered as parameters and there is a one-to-one correspondence between the positive definite kernels on

N_{0}

and the set

S_{+}

of pairs

({f_{l}}_{l \geq 0}, {γ_{k, j}}_{0 \leq k < j})

, where

f_{l} \geq 0

for all

l \geq 0

and

{γ_{k, j}}_{0 \leq k < j}

is an element of

S

with the additional property that if

f_{l} = 0

for some

l \geq 0

, then

γ_{k, l} = 0

and

γ_{l, m} = 0

for

k < l

and

m > l

. Formula (2.2) has to be replaced in this case with:

\begin{matrix} K (l, m) = f_{l}^{1 / 2} f_{m}^{1 / 2} \sum_{q \in D_{m}^{l}} \prod_{l \leq i < j \leq m} a_{i, j} (q) . \end{matrix}

(2.3)

(c)

In case

K

is a Toeplitz kernel we noticed already that

γ_{i + l, j + l} = γ_{i, j}

for

i < j

l \geq 1

and we denoted

γ_{n} = γ_{0, n}

n \geq 1

. We conclude that

a_{i + l, j + l} = a_{i, j}

for

i < j

l \geq 1

and formula (2.2) reduces to

\begin{matrix} K (0, n) = \sum_{p \in D_{n}} \prod_{0 \leq i < j \leq n} a_{i, j} (p) . \end{matrix}

(2.4)

We can compare this result with a classical formula of Verblunsky, according to which there exists a polynomial

V^{(n)} = V^{(n)} (γ_{1}, \dots, γ_{n - 1}; d_{1}, \dots, d_{n - 1})

with integer coefficients so that

K (0, n) = γ_{n} \prod_{k = 1}^{n - 1} d_{k}^{2} + V^{(n)}

(see [6] , in particular, pg. 60-61, for a comprehensive discussion of this formula).

We see that the term

γ_{n} \prod_{k = 1}^{n - 1} d_{k}^{2}

corresponds to the path

p_{0}

made of

n

consecutive rise steps followed by

n

consecutive fall steps. Consequently, we deduce from (2.4) that

V^{(n)} = \sum_{p \in D_{n} \setminus \{p_{0}\}} \prod_{0 \leq i < j \leq n} a_{i, j} (p),

an explicit formula that explains some of the features of

V^{(n)}

□

3 Near tridiagonal models

In this section we show that positive definite kernels do not have tridiagonal models. Instead we introduce a near tridiagonal model and then we show how this model is related to the set of Lukasiewicz paths. Again, in order to simplify the notation we consider only strictly positive definite kernels

K

and we assume

K (0, 0) = 1

. We denote by

D \subset l^{2} (N_{0})

the vector space generated by the standard basis of

l^{2} (N_{0})

and we call tridiagonal model of

K

a family

{J_{n}}_{n \geq 0}

of tridiagonal operators (not necessarily bounded),

J_{n} = [\begin{matrix} b_{0} (n) & c_{1} (n) & 0 \\ a_{1} (n) & b_{1} (n) & c_{2} (n) \\ 0 & a_{2} (n) & b_{2} (n) & . . . \\ 0 & . . . & . . . \end{matrix}],

such that

J_{0} = I

and

\begin{matrix} K (i, j) = e_{0} J_{1}^{*} \dots J_{i}^{*} J_{j} \dots J_{1} e_{0}^{*}, i, j \geq 0 . \end{matrix}

(3.1)

Also, in analogy with the Hankel case, we require

a_{k} (n) > 0

k, n \geq 1

. Since each

J_{n}

is tridiagonal,

J_{n} D \subset D

so (3.1) makes sense. However, we have the following result.

Theorem 3.1. There are strictly positive definite kernels with no tridiagonal model.

Proof. We consider a strictly positive definite kernel

K

with

K (0, 1) = K (0, 2) = K (1, 2) = 0, K (0, 3) \neq 0

(it is easy to construct such a kernel using, for instance, Theorem 2.2 ). Let

{J_{n}}_{n \geq 1}

be a tridiagonal model of

K

. Then we deduce that

b_{0} (1) = c_{1} (2) = b_{1} (2) = 0,

which implies

\begin{matrix} K (0, 3) & = & e_{0} J_{3} J_{2} J_{1} e_{0}^{*} \\ & = & b_{0} (3) (b_{0} (2) b_{0} (1) + c_{1} (2) a_{1} (1)) + c_{1} (3) (a_{1} (2) b_{0} (1) + b_{1} (2) a_{1} (1)) = 0, \end{matrix}

a contradiction showing that

K

has no tridiagonal model. □
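One concrete kernel of the kind used in this proof can be written down from the parametrization of Theorem 2.2. The choice below is only an illustration (the value 1/2 is arbitrary, and only the 4 \times 4 corner of the kernel is displayed): all parameters with indices at most 3 vanish except γ_{0, 3}.

```latex
\gamma_{0,1} = \gamma_{0,2} = \gamma_{1,2} = \gamma_{1,3} = \gamma_{2,3} = 0, \quad
\gamma_{0,3} = \tfrac{1}{2}
\;\Longrightarrow\;
K(0,3) = d_{0,1}\, d_{0,2}\, \gamma_{0,3}\, d_{1,3}\, d_{2,3} = \tfrac{1}{2},
\qquad
[K(l,m)]_{l,m=0}^{3} =
\begin{bmatrix}
1 & 0 & 0 & \tfrac{1}{2} \\
0 & 1 & 0 & 0 \\
0 & 0 & 1 & 0 \\
\tfrac{1}{2} & 0 & 0 & 1
\end{bmatrix},
\qquad
\det [K(l,m)]_{l,m=0}^{3} = \tfrac{3}{4} = d_{0,3}^{2} > 0 .
```

Here K (0, 1) = K (0, 2) = K (1, 2) = 0 while K (0, 3) \neq 0, which is exactly the configuration that a tridiagonal model cannot reproduce.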

We now look for a model that is as close as possible to being tridiagonal and that reduces to (1.4) in case the kernel

K

is Hankel. Thus, we consider operators (not necessarily bounded) with matrix still of Hessenberg form

\begin{matrix} J_{n} = [\begin{matrix} b_{0} (1) & c_{0, 1} (n) & c_{0, 2} (n) \\ a_{1} (1) & b_{1} (2) & c_{1, 2} (n) \\ 0 & a_{2} (2) & b_{2} (3) & . . . \\ 0 & . . . & . . . \end{matrix}], n \geq 1, \end{matrix}

(3.2)

with respect to the standard basis of

l^{2} (N_{0})

, and with the additional conditions:

\begin{matrix} \begin{matrix} a_{k} (k) > 0, k \geq 1, \\ c_{i, j} (1) = 0, j \geq 2, 0 \leq i < j - 1, \\ c_{i, j} (n) = 0, j \geq n, 0 \leq i < j - 1, \\ c_{k - 1, k} (n) = a_{k} (k), k \geq n, \\ c_{i, j} (n) = c_{i, j} (n - 1), j < n - 1, 0 \leq i < j - 1 . \end{matrix} \end{matrix}

(3.3)

We see that

J_{n} D \subset D

for each

n \geq 1

. We call such a family

{J_{n}}_{n \geq 1}

a near tridiagonal model of the kernel

K

provided that

\begin{matrix} K (i, j) = e_{0} J_{1}^{*} \dots J_{i}^{*} J_{j} \dots J_{1} e_{0}^{*}, i, j \geq 0 . \end{matrix}

(3.4)

Theorem 3.2. Any strictly positive definite kernel has a near tridiagonal model.

Proof. Let

K

be a kernel and

K_{n} = {[K (i, j)]}_{0 \leq i, j \leq n}

. Then

K

is strictly positive definite if and only if

K_{n} > 0

n \geq 1

(as already mentioned we can assume without loss of generality that

K (0, 0) = 1

). Let

D_{n} = {[d_{i, j} (n)]}_{0 \leq i, j \leq n}

be the upper triangular Cholesky factor of

K_{n}

, therefore

K_{n} = D_{n}^{*} D_{n}

and

d_{i, i} (n) > 0

. The uniqueness of the Cholesky factor implies that

D_{n + 1} = [\begin{matrix} D_{n} & l_{n + 1} \\ 0 & d_{n + 1, n + 1} (n + 1) \end{matrix}],

and

d_{n, n} (k + 1) = d_{n, n} (k)

for

k \geq n

, so we can drop the label

n

in

d_{i, j} (n)

. We now construct the near tridiagonal model of

K

. Thus we prove by induction on

n

that there exist numbers

a_{k} (k)

b_{k - 1} (k)

c_{i, k - 1} (k)

0 \leq i \leq k - 1

k \leq n

, such that (3.3) holds and

\begin{matrix} D_{n} = [\begin{matrix} 1 & J_{2, 1} & J_{3, 2} J_{2, 1} & . . . & J_{n + 1, n} \dots J_{2, 1} \\ 0_{n} & 0_{n - 1} & 0_{n - 2} & . . . & 0_{0} \end{matrix}], \end{matrix}

(3.5)

where

J_{2, 1} = [\begin{matrix} b_{0} (1) \\ a_{1} (1) \end{matrix}], J_{k + 1, k} = [\begin{matrix} b_{0} (1) & c_{0, 1} (2) & . . . & c_{0, k - 1} (k) \\ a_{1} (1) & b_{1} (2) \\ 0 & a_{2} (2) & . . . \\ . . . & . . . \\ 0 & 0 & a_{k} (k) \end{matrix}],

and

0_{k}

denotes the column with

k

zero entries.

For

n = 1

we define

b_{0} (1) = d_{0, 1} and a_{1} (1) = d_{1, 1} > 0,

so that

D_{1} = [\begin{matrix} 1 & b_{0} (1) \\ 0 & a_{1} (1) \end{matrix}]

. Assume the statement is true up to

n

. We determine the numbers

a_{n + 1} (n + 1) > 0

b_{n} (n + 1)

, and

c_{k, n} (n + 1)

k = 0, \dots, n - 1

, such that

[\begin{matrix} l_{n + 1} \\ d_{n + 1, n + 1} \end{matrix}] = J_{n + 2, n + 1} J_{n + 1, n} \dots J_{2, 1} .

By the induction hypothesis

J_{n + 1, n} \dots J_{2, 1} = [\begin{matrix} l_{n} \\ d_{n, n} \end{matrix}],

so that we must have

\begin{matrix} [\begin{matrix} l_{n + 1} \\ d_{n + 1, n + 1} \end{matrix}] = [\begin{matrix} x_{0} + c_{0, n} (n + 1) d_{n, n} \\ x_{1} + c_{1, n} (n + 1) d_{n, n} \\ . . . \\ x_{n} + b_{n} (n + 1) d_{n, n} \\ a_{n + 1} (n + 1) d_{n, n} \end{matrix}], \end{matrix}

(3.6)

where

x_{0}

\dots

x_{n}

are numbers uniquely determined by

a_{k} (k) > 0

b_{k - 1} (k)

, and

c_{i, l} (k)

i < l < k \leq n

. Since

d_{n, n} > 0

, (3.6) uniquely determines the numbers

c_{0, n} (n + 1)

\dots

c_{n - 1, n} (n + 1)

b_{n} (n + 1)

and

a_{n + 1} (n + 1) = \frac{d_{n + 1, n + 1}}{d_{n, n}} > 0

such that (3.5) holds for

n + 1

. Now we can use all the numbers

a_{k} (k)

b_{k - 1} (k)

c_{i, l} (k)

in order to define

J_{n}

by (3.2), and then (3.5) shows that

{J_{n}}_{n \geq 1}

is a near tridiagonal model of

K

. □
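The inductive construction in this proof is effectively an algorithm: compute the upper triangular Cholesky factor and solve (3.6) column by column. The sketch below follows that scheme in plain Python; the function names, the array `J` holding the entries of the model, and the 4 \times 4 sample matrix are illustrative assumptions.

```python
import math

def upper_cholesky(K):
    """Upper triangular D with positive diagonal such that K = D^* D."""
    n = len(K)
    D = [[0.0] * n for _ in range(n)]
    for j in range(n):
        for i in range(j + 1):
            s = K[i][j] - sum(complex(D[k][i]).conjugate() * D[k][j] for k in range(i))
            D[i][j] = math.sqrt(s.real) if i == j else s / D[i][i]
    return D

def hessenberg_entries(K):
    """Return J with column m equal to (c_{0,m}, ..., c_{m-1,m}, b_m, a_{m+1}).

    Column m + 1 of D is obtained by applying the model to column m of D,
    so each new column of D determines one new column of parameters.
    """
    n = len(K)
    D = upper_cholesky(K)
    J = [[0.0] * (n - 1) for _ in range(n)]
    for m in range(n - 1):
        for i in range(m + 2):
            x = sum(J[i][k] * D[k][m] for k in range(m))   # the known part x_i in (3.6)
            J[i][m] = (D[i][m + 1] - x) / D[m][m]
    return J

# Sample strictly positive definite matrix with K(0, 0) = 1 (illustrative values).
K = [[1, 1, 5, -13],
     [1, 5, 5, 25],
     [5, 5, 61, 79],
     [-13, 25, 79, 1115]]
J = hessenberg_entries(K)
a = [J[m + 1][m] for m in range(3)]          # a_1, a_2, a_3
b = [J[m][m] for m in range(3)]              # b_0, b_1, b_2
c = {(i, m): J[i][m] for m in range(3) for i in range(m)}
print(a, b, c)
```

For this sample matrix the recovered subdiagonal entries a_{k} are positive, as the proof requires, because they are ratios of consecutive diagonal entries of the Cholesky factor.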

We notice that the label

n

of the numbers

a_{n} (n)

b_{n - 1} (n)

c_{i, l} (n)

is superfluous due to the conditions (3.3). We used it in order to have a uniform definition of

J_{n}

in (3.2) but we will drop it from now on. The proof of Theorem 3.2 gives a one-to-one correspondence between the set of strictly positive definite kernels on

N_{0}

with

K (0, 0) = 1

and the set

J

of families of numbers

{a_{k}, b_{k - 1}, c_{i, l} | k \geq 1, 0 \leq i < l}

. We will call these numbers the Jacobi parameters of

K

. In addition, we can easily characterize the strictly positive definite Hankel kernels by the additional conditions on the Jacobi parameters:

\begin{matrix} \begin{matrix} c_{i, l} = 0, l > i + 1, \\ c_{k, k + 1} = a_{k + 1}, k \geq 0 . \end{matrix} \end{matrix}

(3.7)

In this case the near tridiagonal model reduces to (1.4).

The next task is to establish an explicit formula for the Cholesky factors

D_{n}

in terms of the Jacobi parameters. First, we obtain a recursive relation for

D_{n}

Lemma 3.3. For

n \geq 1

D_{n} = F_{n} (1 \oplus D_{n - 1}),

where

F_{n} = [\begin{matrix} 1 & b_{0} & c_{0, 1} & c_{0, 2} & . . . & c_{0, n - 1} \\ 0 & a_{1} & b_{1} & c_{1, 2} & c_{1, n - 1} \\ 0 & 0 & a_{2} & b_{2} & . . . \\ 0 & 0 & 0 & a_{3} & . . . \\ . . . & . . . & . . . & b_{n - 1} \\ 0 & . . . & 0 & a_{n} \end{matrix}] .

Proof. We have $D_{0} = 1$ and then $D_{1} = [\begin{matrix} 1 & b_{0} \\ 0 & a_{1} \end{matrix}] = F_{1} [\begin{matrix} 1 & 0 \\ 0 & D_{0} \end{matrix}] .$ Assume the statement is true up to $n$ . Then $D_{n + 1} = [\begin{matrix} D_{n} \\ J_{n + 2, n + 1} \dots J_{2, 1} \\ 0 \end{matrix}] .$ By the induction hypothesis, $[\begin{matrix} D_{n} \\ 0 \end{matrix}] = [\begin{matrix} F_{n} (1 \oplus D_{n - 1}) \\ 0 \end{matrix}] = F_{n + 1} [\begin{matrix} 1 & 0 \\ 0 & D_{n - 1} \\ 0 & 0 \end{matrix}]$ and we notice that $J_{n + 2, n + 1} \dots J_{2, 1} = F_{n + 1} [\begin{matrix} 0 \\ J_{n + 1, n} \dots J_{2, 1} \end{matrix}],$ so that $D_{n + 1} = F_{n + 1} [\begin{matrix} 1 & 0 & 0 \\ 0 & D_{n - 1} \\ J_{n + 1, n} \dots J_{2, 1} \\ 0 & 0 \end{matrix}] = F_{n + 1} (1 \oplus D_{n}) .$ □
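The recursion of Lemma 3.3 can be run directly. The sketch below builds F_{n} from given Jacobi parameters and iterates D_{n} = F_{n} (1 \oplus D_{n - 1}); the dictionaries `a`, `b`, `c` and their numerical values are illustrative assumptions. It then checks that the (0, 3) entry of D_{3} equals b_{0}^{3} + 2 b_{0} a_{1} c_{0, 1} + a_{1} b_{1} c_{0, 1} + a_{1} a_{2} c_{0, 2} and that the last diagonal entry is a_{1} a_{2} a_{3}.

```python
def matmul(x, y):
    return [[sum(x[r][k] * y[k][s] for k in range(len(y)))
             for s in range(len(y[0]))] for r in range(len(x))]

def F(n, a, b, c):
    """F_n: column 0 is e_0; column k + 1 holds c_{i,k} (i < k), b_k (i = k), a_{k+1} (i = k + 1)."""
    m = [[0.0] * (n + 1) for _ in range(n + 1)]
    m[0][0] = 1.0
    for k in range(n):
        for i in range(k):
            m[i][k + 1] = c[(i, k)]
        m[k][k + 1] = b[k]
        m[k + 1][k + 1] = a[k + 1]
    return m

def cholesky_factor(n, a, b, c):
    """D_n from Lemma 3.3: D_0 = [1] and D_s = F_s (1 ⊕ D_{s-1})."""
    d = [[1.0]]
    for size in range(1, n + 1):
        padded = [[1.0] + [0.0] * size] + [[0.0] + row for row in d]
        d = matmul(F(size, a, b, c), padded)
    return d

# Illustrative Jacobi parameters (a_k > 0; b and c arbitrary).
a = {1: 2.0, 2: 3.0, 3: 0.5}
b = {0: 1.0, 1: -1.0, 2: 4.0}
c = {(0, 1): 2.0, (0, 2): -3.0, (1, 2): 1.5}

D3 = cholesky_factor(3, a, b, c)
k03 = (b[0] ** 3 + 2 * b[0] * a[1] * c[0, 1]
       + a[1] * b[1] * c[0, 1] + a[1] * a[2] * c[0, 2])
print(D3[0][3] == k03, D3[3][3] == a[1] * a[2] * a[3])
```

Since the first column of D_{n} is e_{0}, the first row of D_{n} coincides with the first row of the kernel, which is why D3[0][3] can be compared with K (0, 3).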

The matrices

F_{n}

have a very simple recursive multiplicative structure. Actually it is convenient to make the dependence of

F_{n}

on the Jacobi parameters more explicit and introduce

F_{m, k} = [\begin{matrix} 1 & b_{k} & c_{k, k + 1} & . . . & c_{k, m - 1} \\ 0 & a_{k + 1} & b_{k + 1} & c_{k + 1, m - 1} \\ 0 & a_{k + 2} \\ . . . & b_{m - 1} \\ 0 & a_{m} \end{matrix}]

for

m \geq 1

and

0 \leq k < m - 1

. In particular,

F_{n, 0} = F_{n}

. We show that the building blocks of

F_{m, k}

(consequently, of

D_{n}

) are the

2 \times 2

matrices

B_{k} = [\begin{matrix} 1 & b_{k} \\ 0 & a_{k + 1} \end{matrix}], k \geq 0,

and the

(l - k + 2) \times (l - k + 2)

matrices

C_{k, l} = [\begin{matrix} 1 & 0 & . . . & 0 & c_{k, l} \\ 0 & 1 & 0 & 0 \\ . . . & 0 \\ 0 & 0 & 1 \end{matrix}], 0 \leq k < l .

Lemma 3.4. For

m \geq 1

and

0 \leq k < m - 1

F_{m, k} = (1 \oplus F_{m, k + 1}) G_{m, k},

where

G_{m, k} = C_{k, m - 1} (C_{k, m - 2} \oplus 1) \dots (C_{k, k + 1} \oplus 1_{m - k - 2}) (B_{k} \oplus 1_{m - k - 1}) .

Proof. The proof is a straightforward calculation and is omitted. □

Once again it is convenient to visualize all those matrix multiplications by means of a transmission line picture similar to the one in Figure 3. Thus, Figure 5 illustrates how to calculate

D_{3}

by using Lemma 3.3 and Lemma 3.4 . In particular, if

1

is the input at

A

then at

B

we read the expression of

K (0, 3)

in terms of the Jacobi parameters,

K (0, 3) = b_{0}^{3} + b_{0} a_{1} c_{0, 1} + a_{1} c_{0, 1} b_{0} + a_{1} b_{1} c_{0, 1} + a_{1} a_{2} c_{0, 2} .

Figure 5 suggests a connection with weighted Lukasiewicz paths. A Lukasiewicz path of length

n

is a path in the positive quadrant of the lattice

Z^{2}

which starts at

(0, 0)

, ends at

(n, 0)

, and consists of rise unit steps, horizontal unit steps, and fall steps of arbitrary depth. Let

ℒ_{n, 0}

denote the set of Lukasiewicz paths of length

n

Figure 5 . Transmission line representation for

D_{3}

We also consider

ℒ_{n, k}

the set of paths of length

n

in the positive quadrant, starting at

(0, 0)

and consisting of the same type of steps as above, but ending at

(n, k)

. We introduce a weight on the elements of

ℒ_{n, k}

as follows. Let

p \in ℒ_{n, k}

consist of steps

p_{1}

\dots

p_{n}

. Then

w (p) = \prod_{k = 1}^{n} w (p_{k})

and

w (p_{k}) = \begin{cases} a_{l + 1} & \text{if } p_{k} \text{ is a rise step } (j, l) \to (j + 1, l + 1) \text{ for some } j \geq 0; \\ b_{l} & \text{if } p_{k} \text{ is a horizontal step } (j, l) \to (j + 1, l) \text{ for some } j \geq 0; \\ c_{i, l} & \text{if } p_{k} \text{ is a fall step } (j, l) \to (j + 1, i), i < l, \text{ for some } j \geq l . \end{cases}

Figure 6 . Passing from paths of length

n - 1

to paths of length

n

Theorem 3.5. The Cholesky factor

D_{n} = {[d_{i, j}]}_{0 \leq i, j \leq n}

is given by the formula

\begin{matrix} d_{i, j} = \sum_{p \in ℒ_{j, i}} w (p), i \leq j, (i, j) \neq (0, 0) . \end{matrix}

(3.8)

Proof. We can prove the statement by induction on

n

. For

n \leq 3

, (3.8) can be read off from Figure 5. The general induction step is provided by Lemma 3.3. Thus,

\begin{matrix} d_{0, n} & = & b_{0} d_{0, n - 1} + c_{0, 1} d_{1, n - 1} + \dots + c_{0, n - 1} d_{n - 1, n - 1} \end{matrix}

\begin{matrix} d_{1, n} & = & a_{1} d_{0, n - 1} + b_{1} d_{1, n - 1} + \dots + c_{1, n - 1} d_{n - 1, n - 1} \end{matrix}

\begin{matrix} . . . \end{matrix}

\begin{matrix} d_{n, n} & = & a_{n} d_{n - 1, n - 1}, \end{matrix}

and these relations are precisely those obtained by passing from weighted paths of length

n - 1

to weighted paths of length

n

, as shown in Figure 6. □
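Formula (3.8) can be tested by listing the weighted paths explicitly. In the sketch below a path is stored as its sequence of heights, and a rise from level l to level l + 1 is weighted by a_{l + 1}, which is the indexing under which the recurrences displayed in the proof hold (so that n consecutive rises give a_{1} \dots a_{n} = d_{n, n}). The function names and the sample parameter values are illustrative assumptions.

```python
def lukasiewicz_paths(n, end):
    """Yield height sequences (h_0 = 0, ..., h_n = end) with unit rises,
    horizontal steps, and falls of arbitrary depth, staying at height >= 0."""
    def extend(seq):
        if len(seq) == n + 1:
            if seq[-1] == end:
                yield tuple(seq)
            return
        h = seq[-1]
        for nxt in range(h + 2):      # fall to any lower level, stay, or rise by one
            yield from extend(seq + [nxt])
    yield from extend([0])

def weight(seq, a, b, c):
    """Product of the step weights along one path."""
    w = 1.0
    for h, nxt in zip(seq, seq[1:]):
        if nxt == h + 1:
            w *= a[nxt]               # rise from level h to level h + 1
        elif nxt == h:
            w *= b[h]                 # horizontal step at level h
        else:
            w *= c[(nxt, h)]          # fall from level h to level nxt < h
    return w

def cholesky_entry(i, j, a, b, c):
    """d_{i,j} as the sum of w(p) over paths of length j ending at height i."""
    return sum(weight(p, a, b, c) for p in lukasiewicz_paths(j, i))

# Illustrative Jacobi parameters (a_k > 0; b and c arbitrary).
a = {1: 2.0, 2: 3.0, 3: 0.5}
b = {0: 1.0, 1: -1.0, 2: 4.0}
c = {(0, 1): 2.0, (0, 2): -3.0, (1, 2): 1.5}

paths_03 = list(lukasiewicz_paths(3, 0))
print(len(paths_03), cholesky_entry(0, 3, a, b, c))
```

The five paths of length 3 ending on the axis correspond term by term to the five monomials in the expansion of K (0, 3) in terms of the Jacobi parameters.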

Remarks

(a)

For a Hankel kernel

K

, Theorem 3.5 reduces to well-known results in the combinatorial theory of orthogonal polynomials (on the real line) [4], [9]. Indeed, by (3.7) there are no fall steps of depth other than one. In this case, the summation in (3.8) is only over Motzkin paths, which is the classical formula in [4], [9]. It might be interesting to note that the corresponding formula for orthogonal polynomials on the unit circle, (2.4), involves summation over labelled configurations in

Z^{2}

rather than over weighted paths.

(b)

There are other significant differences between the two parametrizations discussed in this paper. For instance, the parameters

{γ_{k, j}}

have the following inheritance property: the parameters of the kernel

K^{(1)} = {[K (l, m)]}_{1 \leq l, m}

are precisely

{γ_{k, j}}_{1 \leq k < j}

. The Jacobi parameters do not have such a property. Another difference involves computations of determinants. Thus, we have already noticed the formula (we assume

K (0, 0) = 1

)

det {[K (l, m)]}_{l, m = 0}^{n} = \prod_{k = 1}^{n} f_{k} \prod_{0 \leq i < j \leq n} d_{i, j}^{2},

and from Lemma 3.3 we deduce

det {[K (l, m)]}_{l, m = 0}^{n} = \prod_{k = 1}^{n} a_{k}^{2 (n - k + 1)},

which does not involve all the Jacobi parameters (up to

n

). So the first determinant formula is much tighter in its parameters.

(c)

Theorem 3.5 gives, in particular, that

\begin{matrix} K (0, n) = \sum_{p \in ℒ_{n, 0}} w (p), n \geq 1 . \end{matrix}

(3.9)

Since the Jacobi parameters do not have the inheritance property mentioned above, we cannot have formulae of this type for any

K (l, n)

. Instead we have the following construction.

For

n

fixed we consider admissible steps in the positive quadrant of the following types: between the vertical lines

x = 0

and

x = n

only steps of Lukasiewicz type are allowed, and between the vertical lines

x = n

and

x = 2 n

only reflections with respect to the line

x = n

of Lukasiewicz steps are allowed (and they are weighted with the complex conjugate of the weight of the reflected Lukasiewicz step, see Figure 7).

Figure 7 . Admissible steps for

n = 2

Denote by

K_{n, k}

the set of paths in the positive quadrant of

Z^{2}

made of admissible steps, starting at

(0, 0)

and ending at

(n + k, 0)

0 \leq k \leq n

. In particular,

K_{n, 0} = ℒ_{n, 0}

. The weights are defined correspondingly. With these elements, we deduce from Theorem 3.5 that

\begin{matrix} K (l, n) = \sum_{p \in K_{n, l}} w (p), 0 \leq l \leq n . \end{matrix}

(3.10)

(d)

We notice that the proof of Theorem 2.2 gives a formula for the Cholesky factor

D_{n}

in terms of Dyck type paths analogous to formula (3.8).

□

References

[1] T. Banks, T. Constantinescu, and N. El-Sissi, Tensor algebras and displacement structure. IV. Invariant kernels, math.FA/0410491, to appear in Linear Alg. Appl.
[2] T. Constantinescu, Schur analysis of positive block-matrices, in I. Schur Methods in Operator Theory and Signal Processing (I. Gohberg, Ed.), Birkhäuser, Basel, 1986, pp. 191-206.
[3] T. Constantinescu, Schur Parameters, Factorization and Dilation Problems, Birkhäuser, Basel, 1996.
[4] P. Flajolet, Combinatorial aspects of continued fractions, Discrete Math., 32(1980), 125-161.
[5] A. N. Kolmogorov, Sur l'interpolation et l'extrapolation des suites stationaire, C. R. Acad. Sci. (Paris), 208(1939), 2043-2045.
[6] B. Simon, Orthogonal Polynomials on the Unit Circle, Colloquium Publications, 54, Amer. Math. Soc., Providence, Rhode Island, 2004.
[7] R. P. Stanley, Enumerative Combinatorics, Vol. 2, Cambridge Univ. Press, Cambridge, 1999.
[8] G. Szegö, Orthogonal Polynomials, Colloquium Publications, 23, Amer. Math. Soc., Providence, Rhode Island, 1939.
[9] G. Viennot, A combinatorial theory for general orthogonal polynomials with extensions and applications, in: Orthogonal polynomials and applications (Bar-le-Duc, 1984), 139-157, Lecture Notes in Math., 1171, Springer, Berlin, 1985.

Department of Mathematics, University of Texas at Dallas, Richardson, TX 75083 E-mail address : tiberiu@utdallas.edu Department of Mathematics, University of Texas at Dallas, Richardson, TX 75083 E-mail address : nae021000@utdallas.edu

T. Constantinescu

Nermine El-Sissi