Algorithmic definition of means acting on positive numbers and operators
Miklós PálfiaI would like to thank Dénes Petz and Nándor Sieben for their help and support.
March 6, 2005
Abstract
Means are used in several applications from electronic engeneering to information theory, however there is no general theorem on how to extend a given
mean function to multiple variable forms. In this article we would like to present a theorem, which gives one possible solution for this problem, for every
mean function, acting on positive numbers and operators.
1 Introduction
The use of mean functions falls back to the early ages of mathematics, but they are used widely nowdays aswell. For example they have great importance in statistics, but also in volume visualization, in imaging surgery.
The study of electrical network connections implied the introduction of parallel sum of two positive semidefinite matrices in [2] . Formerly, Anderson defined a matrix operation, called shorted operation to a subspace, for each positive semidefinite matrix. Anderson and Trapp in [3] , have extended the theory of parallel addition and shorted operation to bounded linear positive operators on a Hilbert space and demonstrated its importance in operator theory. They have studied fundamental properties of these operations.
The axiomatic theory of means, for pair of positive operators, have been developed by Kubo and Ando in [7] . This theory has found a number of applications in operator theory.
Several steps have been taken in the theory of means, but a general idea has not been laid down yet, on how to extend, or define an
variable version of a given general
mean function acting on numbers and operators. We would like to present a theory which gives a possible solution and a frame theory for further studies.
Definition 1.1.
A two variable function M :
according to [
11]
, is called a mean function if
-
(i)
for every
.
-
(ii)
for every
.
-
(iii)
If
, then
.
-
(iv)
If
and
, then
.
-
(v)
is continuous.
The geometric mean
, the arithmetic mean
and the harmonic mean
are the most known examples but there are many other means aswell.
The above definition of means can be easily extended to positive ordered operators and matrices, by replacing the numbers with them.
2 Extending a mean to multiple variables
The definition of
given in the first section, can be extended to multiple variable functions. The following definition is one possibility of extension.
Definition 2.1.
An
variable function
may be called a (
variable) mean function if
-
(i')
for every
.
-
(ii')
is independent from the ordering of
.
-
(iii')
.
-
(iv')
If
, then
.
-
(v')
If
, then
.
-
(vi')
is continuous.
Our goal is to algorithmically define an
variable mean by using its less than
variable forms. Firstly we will use the
variable form of
, to define the
variable one, as an iteration's limit.
Definition 2.2.
Let
and
be an
variable mean function. Let us consider the
class variations of
. There are
different variations. Let us define an iteration as
|
(2.2)
|
where
is the
th
different variation of
(
are chosen from
, which could be done in
different ways).
Theorem 2.1.
The
sequences defined in definition 2.2 are convergent and their limits are the same, which could be defined as the
mean of the
numbers.
-
Proof.
The iteration given in definition 2.2 has a contractive-like property by (iii'), which means that the sequence
is monotonic increasing,
is monotonic decreasing. Hence the limits
and
exist. From condition (iv'), one can see that the series' minimal and maximal elements are given by the following:
|
(2.4)
|
where
are the smallest
numbers,
are the largest
numbers from the series
. By 2.3 and 2.4
and
explicit dependence on
and
is given
|
(2.6)
|
and by (vi') one can write
|
(2.8)
|
which yields
|
(2.10)
|
and this is only true when
.
The above theorem and its proof yields the following remarks.
Remark 2.1.
The iteratively defined mean function
, is invariant to the initial ordering of the
variations.
Remark 2.2.
The iteration 2.2 leaves the mean of the starting n numbers invariant through the sequence.
It is easy to verify the next two theorems, which have stressed importance in inequalities of means and operator means in our given context.
Corollary 2.2.
Two different, iteratively defined mean function
and
, is in the same relation as their two variable forms (
implies
).
Corollary 2.3.
Theorem 2.1 and the proof also works for ordered positive operators, acting on a
Hilbert space, and for
matrices.
We will show with some examples, that theorem 2.1 gives a sufficient definition of the
variable mean.
Corollary 2.4.
Theorem 2.1 applied on the
variable arithmetic, geometric and harmonic mean, gives the corresponding
variable mean.
-
Proof.
According to the given serie's convergence in theorem 2.1 it is enough to prove that the minimum's or maximum's limit is the
variable mean. We will prove it only for the arithmetic mean. Let us consider the numbers
and the
arithmetic mean. By theorem 2.1, the sequence
can explicitly be written and proven by induction with 2.2 , 2.3 and 2.4 :
|
(2.11)
|
For the geometric mean the proof can be extended using the logarithmic function and its inverse for the limit:
|
(2.12)
|
The proof for the harmonic mean can be given by inverses:
|
(2.13)
|
Our main idea of extending means to multiple variables is based on theorem 2.1 , which is theoretically enough but in practice is very insufficient. For example if we would like to compute
numbers or matrices
mean, we should use the two variable main definition
and extend the other ones from one to another.
In the next section we will prove that
can be directly extended from the corresponding
.
3 Extending
directly from
Let us define the following iteration:
Definition 3.1.
Let
and
be a two variable mean function,
|
(3.1)
|
Theorem 3.1.
The iteration given in definition 3.1 for all
is convergent and
where
is defined by theorem 2.1 .
-
Proof.
Firstly we begin with proving the convergence. It is clear that the
is always the first element (
) and the
is always the last element (
) of the series in definition 3.1 . Hence (iii) and definition 3.1 ,
is increasing and
is decreasing. This yields:
|
(3.3)
|
and by (v):
|
(3.5)
|
which give
|
(3.7)
|
Considering the characteristics of the definition 3.1 , this can only be true when
. Secondly we will prove the limit. For
the theorem is clear, because the two iterations, defined in theorems 2.1 and in 3.1 , are the same. Our next step is to prove for
, if it is true for
.
Let us consider the definition of
in theorem 2.1 . Comparing the
(which is the first element
) in theorem 3.1 , and
(which equals
by 2.3 ), we can see that
, because of the inductional condition and the definition of
as a limit in theorem 2.1 . The same can be applied for
and
which yields
.
Hence
and
are minoring and majoring, for every
,
and
, but
, so
.
Furthermore there is special property in the iteration in definition 3.1 .
Definition 3.2.
Let
and
be a graph, with n verteces and edges given as that, there is one cycle in
, which contains all verteces and edges (so it is at the same time a Hamiltonian and an Euler-cycle ). This implies that in
, every vertex has two edges and all of them are bound together. Let us consider an optional one to one correspondence between
numbers and
-s verteces. Taking every edge in
as an
(where
is a mean function and
,
are assigned to the two ending points of the edge as previously given), we can define an iteration with an optional
mappings,
|
(3.8)
|
Theorem 3.2.
Every different iteration given in definition 3.2 , converge to the limit
mean function, and the iteration independently from the mapping
converge on a higher or equally rate as the iteration given in definition 3.1 .
-
Proof.
For
, it is easy to see that the theorem is true, because the iterations given in definitions 3.1 and 3.2 , are the same.
Assume that the theorem is true for
variable. Let us expand from an
variable iteration defined in 3.2 with an optional mapping, to
variable.
This can be done as replacing one edge with two edges and one vertex (mapped to a new number). Let us do this expansion in the following way. Take the first smallest
numbers from
and set up on them the iteration given in definition 3.1 . From the inductional condition this iteration will have the slowest convergence rate, which means that its minimal and maximal elements will minor and major every other iteration in definition 3.2 . Let us replace the edge which gives the maximal element of the iteration given in definition 3.1 , with the two new edges and the remaining number (which is the greatest number out of the
) as a vertex. This two edges with the corresponding two
will give the new iterations given for
numbers greatest two elements. This replacement cannot be done better in any other optional mapped iteration given in definition 3.2 aswell. But considering the inductional condition this yields that any
variable iteration given in definition 3.2 , cannot minor and major, with its maximal and minimal elements, the iteration given in definition 3.1 , hence theorem 3.2 is proven.
Corollary 3.3.
The above theorems also work for ordered operators acting on a
Hilbert space, and
matrices aswell.
We will have to consider further examinations to define the above iterational definitions for inorderable matrices. In the next section we will study this problem.
4 Extending theorems 3.1 and 3.2 to unordered matrices and operators
The problem is with positive matrices and operators which satisfy
and
. For the above matrices and operators, the function
's and its arguments' relation is not explained and highly depend on the main characteristics of
, so the given iterations in the above theorems must be specified.
Theorem 4.1.
For any
positive operators or matrices the iteration given in definition 3.1 is convergent, as defined in theorem 3.1 .
-
Proof.
If
does not hold, than the iteration in definition 3.1 converges for all
, because after
steps, the iteration will surely alter all of the
-s (from one to another), so the iteration will converge.
If
does hold, we will have to define (according to [11] ) the following construction. Let
and
be monotone sequences as,
|
(4.1)
|
Let
,
and
,
. Let us set the iteration given in definition 3.1 up on
,
and
. According to the first part of the proof, the
and
series are convergent for any
, as given in theorem 3.1 . Considering the definition of sequences
and
in 4.1 , it is easy to verify by condition (iii), that for any
, the
series are minoring and
series are majoring the series
for any
.
Taking the limit
, we get
and
.
Hence
and
are Cauchy sequences in index
and condition (vi'), they are convergent and
|
(4.2)
|
But
and
are minoring and majoring every
for any
and
, so the limit
exist and by 4.2 ,
|
(4.3)
|
and theorem 4.1 is proven.
Corollary 4.2.
Using the above proof, theorems 2.1 and 3.2 work for the unordered
-s.
5 Consequences
By the theorems given in our examinations generalize the extension of the two variable mean functions and gives a frame theory, which may be used in the future studies related to the extension of means to multiple variables.
An important outcome is, that these theorems are applying for operators and matrices and guarantee the existence of one possible extension.
It is known, that in several situations, there are more than one possible generalization of a mean. One example is the logarithmic mean,
|
(5.1)
|
which has several extended forms according to [5] , [8] , [9] , but our theorems may leave only one form valid. However with some means, it appears to be quite difficult to give the iterations limit in a closed form.
References
-
T. Ando, C-K. Li and R. Mathias, Geometric means, Linear Algebra Appl.
-
W. N. Jr. Anderson and R. J. Duffin, Series and parallel addition of matrices, J. Math. Anal. Appl. 26(1969), 576594.
-
W. N. Jr. Anderson and G. E. Trapp, Shorted Operators II, Siam J. Appl. Math. 28(1975), 6071.
-
R. Bhatia, Matrix Analysis, Springer-Verlag, New York, 1996.
-
B. C. Carlson, The logarithmic mean, Amer. Math. Monthly 79, 615-618 (1972).
-
F. Hiai and H. Kosaki, Means of Hilbert space operators, Lecture Notes In Maths. 1820, Springer, 2003.
-
F. Kubo, T. Ando, Means of positive linear operators, Math. Ann. 246(1980), 205-224.
-
S. Mustonen, Logarithmic mean for several arguments,
-
E. Neuman, The weighted logarithmic mean, J. Math. Anal. Appl. 188(1994), 885-900.
-
M. K. Vamanamurthy and M. Vuorinen, Inequalities for means. J. Math. Anal. Appl. 183(1994), 155-166.
-
D. Petz, Means of positive numbers and operators, preprint (2004).