Random Trees and General Branching Processes
Anna Rudas
*
, Balint Toth
*
, Benedek Valko
*
,
∘
*
Institute of Mathematics, TU Budapest
∘
Alfréd Rényi Institute of Mathematics, Budapest
November 27, 2006
Abstract
We consider a tree that grows randomly in time. Each time a new vertex appears, it chooses exactly one of the existing vertices and attaches to it. The probability that the new vertex chooses vertex
is proportional to
, a weight function of the actual degree of
. The weight function
is the parameter of the model.
In [3] and [9] the authors derive the asymptotic degree distribution for a model that is equivalent to the special case, when the weight function is linear. The proof therein strongly relies on the linear choice of
.
We give the asymptotical degree distribution for a wide range of weight functions. Moreover, we provide the asymptotic distribution of the tree itself as seen from a randomly selected vertex. The latter approach is new and gives full insight to the limiting structure of the tree.
Our proof is robust and we believe that the method may be used to answer several other questions related to the model. It relies on the fact that considering the evolution of the random tree in continuous time, the process may be viewed as a general branching process, this way classical results can be applied.
1 Introduction
In models of randomly growing networks, the concept of preferential attachment means that when a new vertex appears, it is more likely that it attaches to an already existing vertex if the latter already has more neighbors. One of the realizations of this idea is the Barabási Albert graph. In this model, a tree grows in discrete time-steps: at each step we add a new vertex and connect it to an already existing vertex. This choice is made randomly, using probabilities exactly proportional to the degree of the existing vertices. This model reproduces phenomena observed in certain real-world networks (see [1] ), the power-law decay of the degree sequence, for example. This was proved in a mathematically precise way in [3] and, independently, in [9] .
The probability with which a newly appearing vertex chooses its neighbor could depend on the degrees of the existing vertices in a more general way. We define a weight function,
, and let the probability be proportional to this function of the degree. In [3] and [9] ,
is linear, their techniques strongly depend on martingales that are apparent in the system only in this special case.
General weight functions are considered in the model of Krapivsky and Redner [7] . There
, and non-rigorous results are obtained, showing the different behavior for
and
. In the first region the limiting object does not have a non-trivial degree sequence:
a single dominant vertex appears which is linked to almost every other vertex, the others having only finite degree. This statement is made precise and proved rigorously in [11] . The weight functions we consider will be such that this does not happen.
Our main results are the following. We determine the asymptotic distribution of the degree sequence, which equivalently gives the limiting distribution of the degree of a (uniformly) randomly selected vertex. We also look deeper into the structure of the tree: we give the asymptotic distribution of the subtree under a randomly selected vertex. Moreover, we present the asymptotic distribution of the whole tree, seen from a randomly selected vertex.
The latter approaches are new and give full insight to the limiting structure of the random tree.
The key of our method is to place the process into continuous time in such a way that it fits in to the well-developed theory of general branching processes (see Section 5 ). The asymptotic behavior of the continuous time model gives that of the discrete time model, as pointed out in Section 3 . A special case of our model (when
is linear) is equivalent to those studied in [3] and [9] . The robustness of our method is apparent in the fact that it gives a.s. results for a wide class of weight functions, it generalizes the results mentioned above.
The class of weight functions for which our theorems hold are not fully identified by the condition we make: it is sufficient, but we do not claim that it is also necessary.
In the present paper we do not intend to analyse all the ways we beleive this new approach can be used. We are planning to apply this technique to answer many other questions related to the randomly growing tree model.
The paper is organized as follows. We introduce the terminology and notation in the next section, then in Section 3 we define the model. We state our results in Section 4 and show the simplifications that arise in the linear case. After that, in Section 5 we give a brief introduction to the theory of general branching processes, and quote the theorems that we rely on. We present our proofs in Section 6 . We comment on the asymptotic growth of the tree in the last section.
2 Rooted ordered trees: terminology and notation
Throughout the paper it will be convenient to use genealogical phrasing. We will consider our tree evolving as a genealogical tree: where the individuals are the vertices and the parent child relations are the edges of the tree. It will be also convenient to follow the `birth orders' among children of the same parent. For this purpose we will consider our trees always as rooted ordered trees (sometimes called family trees or rooted planar trees). In the following paragraphs we introduce the (commonly known) terminology for these trees and also some additional notations needed for our theorems.
2.1 Vertices, individuals, trees
We will label the vertices of a rooted ordered tree using a subset of
denotes the root of the tree, its children are labelled with
, and in general the children of
are labelled by
.
Thus if a vertex has the label
then this means that it is the
child of its parent, which is the
child of its own parent and so on. If
and
we will use the shorthanded notation
for the concatenation
, and with a slight abuse of notation for
we use
for
.
We will identify a rooted ordered tree with the set of labels of its vertices, since this already contains the necessary information about the edges. It is clear that a
may represent a rooted ordered tree if and only if
and for each
we have
as well as
, if
.
The set of finite rooted ordered trees will be denoted by
. We think about
as an oriented tree with edges pointing from parents to children. The degree of a vertex
is just the number of its children in
:
The
generation of
is
where
iff
.
The
ancestor of
with
is
if
and
if
.
The subtree rooted at a vertex
is:
this is just the progeny of
viewed as a rooted ordered tree. Also, (again with a slight abuse of notations) for an
with
we use the notation
. This would be the new label given to
in the subtree
.
Consider a
. An ordering
of the elements of
is called historical if it gives a possible 'birth order' of the vertices in
, formally if for each
we have
. The set of all historical orderings of
will be denoted
. For a fixed
the rooted ordered trees
|
(1)
|
give the evolution of
in this historical ordering
.
2.2 Random trees
Throughout the paper we will use Greek letters to denote random elements (of various distributions) selected from
and
:
Our results will deal with some asymptotic properties of a randomly chosen vertex in a certain random tree. We will investigate the asymptotic distribution of its degree, its progeny and also the progeny of its the
ancestor. In order to study the latter object, we introduce rooted ordered trees with a marked vertex in generation
:
is identified with
, since generation 0 consists of only the root,
. We can use the elements of
to describe the progeny of the
ancestor of a random vertex:
is an ordered tree rooted in the
ancestor of the selected point and
is the position of the random vertex in this tree. Clearly, if
describes the progeny of the
ancestor, then for
the progeny of the
ancestor is described by
.
Thus if
is a distribution on
which describes the progeny of the
ancestor of a chosen vertex, then, if
, the distribution of the progeny of the
ancestor is:
The sequence
of probability measures on
,
is called consistent if for any
, the identity
holds.
Without presenting the precise formulation, it is clear that a consistent sequence
of probability measures on
gives full insight to the limiting structure of the tree as seen from a random vertex, see Remark after Theorem 2 .
We call a probability measure
on
steady if
|
(2)
|
It is easy to check that in this case, for any
, the similar identity
follows. Equivalently, for any bounded function
and any
,
where
is a random element of
with distribution
, and on the left hand side
is a random vertex selected uniformly from the
-th generation of
. (We don't have to worry about the fact that
may be empty, since in that case the expression
is automatically 0.) Immediate consequences of this property are that the expected size of the
generation is 1 for any
(choose
identically 1), and therefore the expected size of the whole tree is infinite.
Backward extensions: Given a steady probability measure
on
, define the probability measures
on
,
, by
One can easily check that, due to the steadiness of the distribution
, the sequence of probability measures
on
,
, is consistent.
3 The Random Tree Model
Given a weight function
, let
be a Markovian pure birth process with
and birth rates
Let
be the density of the point process corresponding to the pure birth process
and
the (formal) Laplace transform of
:
|
(3)
|
The rightmost expression of
is easily computed, given the fact that the intervals between successive jumps of
are independent exponentially distributed random variables of parameters
respectively. Let
|
(4)
|
Throughout this paper we impose the following condition on the weight function
:
|
(M)
|
We are now ready to define our randomly growing tree
which will be a continuous time, time-homogeneous Markov chain on the countable state space
, with initial state
.
The jump rates are the following: if for a
we have
then the process may jump to
with rate
where
and
. This means that each existing vertex
`gives birth to a child' with rate
independently of the others.
Note that condition M implies
and hence it follows that the Markov chain
is well defined for
, it does not blow up in finite time.
We define the total weight of a tree
as
Described in other words, the Markov chain
evolves as follows: assuming
, at time
a new vertex is added to it with total rate
which is attached with an oriented edge (pointing towards the newly added vertex) to the already existing vertex
with probability
Therefore, if we only look at our process at the stopping times when a new vertex is just added to the randomly growing tree:
then we get the discrete time model:
has the same distribution as the discrete time model at time
.
It will sometimes be convenient to refer to the vertices in the order of their birth, not their genealogical code: let
denote the vertex that appeared at
. Of course we will always have
and
.
4 Results
4.1 Statement of results
From condition M it follows that the equation
has a unique root
.
Now we are ready to state our first theorem.
From Theorem 1 several statements follow, regarding the asymptotic behavior of our random tree as seen from a randomly selected vertex
, chosen uniformly from
. As typical examples we determine the asymptotic distribution of the number of children, respectively, that of the whole subtree under the randomly chosen vertex, its
ancestor, respectively. That is: the asymptotic distribution of
,
and
.
In order to formulate these consequences of Theorem 1 we need to introduce some more notation. Let
and one of its historical orderings
be fixed. The historical sequence of total weights are defined as
|
(5)
|
for
while the respective weights of the appearing vertices are defined as
|
(6)
|
for
. Since
is the degree of
's parent just before
appeared,
is the rate with which our random tree process jumps from
to
.
Given the weight function
satisfying condition ( M ) and
defined as before define
|
(7)
|
|
(8)
|
Theorem 2.
Consider a weight function
which satisfies condition (M) and let
be defined as before. Then the following limits hold almost surely:
-
(a)
For any fixed
-
(b)
For any fixed
-
(c)
For any fixed
Furthermore, the functions
are probability distributions on
and
, respectively, and
is steady (i.e. identity 2 holds).
Remarks. 1. Parts (a), (b) and (c) of Theorem 2 , in turn, give more and more information about the asymptotic shape of the randomly growing tree
, as seen from a random vertex
chosen with uniform distribution. Part (a) identifies the a.s. limit as
, of the degree distribution of
. Part (b) identifies the a.s. limit as
, of the distribution of the progeny of
. Finally, part (c) does the same for the distribution of the progeny of the
ancestor of the randomly selected vertex with the position of this vertex marked.
2. From part (c) it is easy to derive the asymptotic distribution of the progeny of the
ancestor of the randomly selected vertex (as a rooted ordered tree without any marked vertices):
The limit is the size-biased version of
, with the biasing done by the size of the
generation.
3. Since the distribution
is steady, part (c) identifies the asymptotic distribution of the whole family tree of the randomly selected vertex
(relatives of arbitrary degree included).
Hence asymptotically, as
, the tree
viewed from a random vertex
will have the following structure (we omit the precise formulation):
– there exists an infinite path of ancestors
`going back in time', – we have finite ordered random trees rooted at each vertex of this path, – the tree rooted at
with the position of
marked on it has distribution
on
where
.
4.2 Linear weight function
In the linear case
all computations are rather explicit. In this case the asymptotic degree distribution
(computed in [3] , [9] ) is reproduced, of course. But even in this simplest case the asymptotic distribution
of the subtree under a randomly selected vertex seems to be new.
For sake of completeness, in the rest of this section we perform these (explicit and straightforward) computations for the linear case. Multiplying the rate function with a positive constant only means the rescaling of time in our model thus it is enough to consider
(with
). In this case it is straightforward to compute that condition (M) holds,
,
and
. Thus both Theorems 1 and 2 hold.
For the asymptotic degree distribution we get
where we used the shorthanded notation
For the calculation of
first we show that the sum which defines it contains identical elements. In order to avoid heavy notation, during the following computations we will use
and
instead of
.
Clearly, for any
(Actually, the first equality holds for every weight function
.) It is also easy to see that for any
thus for any
Therefore
In the
case (i.e. if we consider random tree proposed in [1] ) the previous calculations give
and
The value of
cannot be written as the function of degrees of
only, but one can compute it using the values
for
. For a given
and
let us introduce the following notations (these will not be used in the other parts of the paper):
| |
| |
It is a simple exercise to prove that for a
with
The proof is left to the reader.
5 Branching Processes
The random tree model, defined in continuous time, has the big advantage that it fits into the framework of the well-established theory of general branching processes. We give a brief introduction to the fundamentals and state the theorems that we rely on in our proofs.
We do not give a broad survey on the most general types of branching processes here, we choose to focus on the results which may be applied to our process. For more details see the monograph [4] or the papers [5] , [10] , [12] and the references therein. For a survey on branching processes, trees and superprocesses, see [8] .
In the case of a general branching process, there is a population in which each individual reproduces at ages according to i.i.d. copies of a random point process
on
. We denote by
the
-measure of
, this the random number of children an individual has up to time
.
The individuals in the population are labelled with the elements of
, as described in Section 3 (see 2.1 ). The basic probability space is
where
are identical spaces on which
are distributed like
.
For each
there is a
shift defined on
by
in plain words,
is the life of the progeny of
, regarding
as the ancestor.
The birth times
of the individuals are defined in the obvious way:
and if
with
then
|
(9)
|
The branching process is often counted by a random characteristic, this can be any real-valued process
. For each individual
,
is defined by
in plain words
denotes the value of
evaluated on the progeny of
, regarding
as the ancestor, at the time when
is of age
. We can think about
as a `score' given to
when its age is
. With this,
is the branching process counted by the random characteristic
(the `total score' of the population at time
).
For our applications we only consider random characteristics which are 0 for
and for
they are equal to a bounded deterministicfunction of the rooted tree.
This means that only those individuals contribute to
which are born up to time
and their contribution is a deterministic function of their progeny tree. (Random characteristics may be defined in a more general way, see e.g. [4] , [5] .) One of the important examples is
when
is just the total number of individuals born up to time
.
The Laplace-transform of
will be of great importance, we denote this random variable by:
|
(10)
|
We shall be interested in supercritical, Malthusian processes, meaning that there exists a finite
(the so-called Malthusian parameter) for which
and also
|
(12)
|
(The last property means that the process is Malthusian and the first means that it is supercritical.) Also, we require the reproduction to be non-lattice, which means that the jumps of
cannot be supported by any lattice
,
with probability one.
We quote here a weaker form of Theorem 6.3 from [10] , using its extension which appears in Section 7 of the same paper. This way the conditions of the original theorem are fulfilled automatically.
Theorem A (Nerman, [10] ).
Consider a supercritical, Malthusian branching process with Malthusian parameter
, counted by two random characteristics
and
which have the properties described above (i.e. they are 0 for
and a deterministic bounded function of the progeny tree for
). Suppose that there exists a
for which
Then almost surely
|
(13)
|
where
.
For a reader interested in how the Malthusian parameter and
play a role in the theory, we give a short indication. This part may be skipped without any confusion.
The key observation is the so-called basic decomposition, namely that
|
(14)
|
The advantage of this formula is that if we know the sequence
(all the birth-times of the children of the root), then
has the same conditional distribution as
.
Therefore, using the notation
, taking expectation on both sides of 14 in two steps (first conditionally on
, then taking expectation regarding
, we get
|
(15)
|
where we used the notation
. Taking the Laplace transform of both sides,
|
(16)
|
so formally
|
(17)
|
Note that
. This shows that if there is a positive interval below
where the Laplace transform is finite, then
has a simple pole at
(it is easy to check that
and
). So taking series expansion and inverse Laplace transform results that
|
(18)
|
where
is the finite constant defined in ( 12 ). This means that the ratio of the expectations of
and
indeed tends to
. To get the almost sure convergence of
to the same limit needs a lot more work and of course its proof is more elaborate (see [10] for example).
6 Proofs
-
Proof of Theorem 1 .
Consider the continuous time branching process where the reproduction process
is the Markovian pure birth process
, with rate function
, described at the beginning of Section 3 .
Clearly, the time-evolution of the population has the same distribution as the evolution of the continuous time Random Tree Model corresponding to the weight function
. The vertices are the respective individuals and edges are the parent-child relations.
It is also not hard to see that the function
for the branching process is the same as
which means that by condition (M) we may apply Theorem A with appropriate random characteristics. Given any bounded function
, setting the characteristics
as
and
we get exactly the statement of Theorem 1 .
-
Proof of Theorem 2 .
(a) Apply Theorem 1 with the function
This gives that
almost surely. By the definition of
(see ( 9 )):
Since
and
is the sum of independent exponentially distributed random variables with parameters
, we get
This completes the proof of part (a) of the Theorem. Using the identity
and the fact that
is finite for every
with probability 1 it is straightforward to prove that
is indeed a probability distribution on
.
(b) Let
be fixed and denote
. We apply Theorem 1 with
. We need to compute
Consider the following random stopping times:
| |
| |
That is:
is the birth time of the first vertex not in
, while
is the minimum of
and the time when we first have
, if the latter ever happens. Since
we get that
| |
| |
Note that by the definition we always have
. The event
means that there is a
when
. On this event
gives the time when we first have
and
gives the appearance of the next vertex. Given the event
, the conditional distribution of
is exponential with parameter
and it is (conditionally) independent of
. This leads to
Now, it is clear that the following two events are actually the same
This implies that
| |
| |
For
see 3 and the definition below it. Given
fixed
(See 3 , 5 and 6 for the definitions.) Also, if
is fixed then conditionally on the event
the random variables
,
, are independent and exponentially distributed with parameters
,
, respectively. This is an easy exercise: it may be proved by using the `lack of memory' of the exponential distribution and the fact that the minimum of independent exponentially distributed random variables with parameters
is also exponentially distributed with parameter
. Hence it is straightforward to get
Collecting our previous calculations part (b) of Theorem 2 follows.
Using similar considerations as in the end of the proof of part (a) it is apparent that
is a probability distribution on
.
(c) This is straightforward since for any
and
we have
| |
The statement now follows from part (b).
The only thing left to prove is that
satisfies ( 2 ), i.e. it is steady. First observe, that if
is fixed and
is a uniformly chosen random vertex in
then the distribution of
(which is a probability distribution on
) is steady. (This follows by simple counting.) Equation ( 2 ) is linear in
, therefore mixtures of steady distributions are also steady. Thus, if
is a uniformly chosen random vertex in
then the distribution of
(which is a random probability distribution on
) is also steady. By part (b) with probability one these distributions converge (in distribution) to
and from this an easy consideration shows that
must satisfy ( 2 ).
7 Asymptotic growth
In our theorems we determined the asymptotic ratio of vertices in
satisfying certain properties. It is also natural to ask if one can prove results about the asymptotic number of the respective vertices. As we have seen, this essentially requires to study the asymptotic behavior of
for a suitable random characteristic
. This has been done in the framework of general branching processes, we shall give a short overview of the relevant results.
We have already seen that
|
(19)
|
where
is the constant defined in ( 12 ). Thus we need to divide
by
to get something non-trivial. Let us quote a weaker form of Theorem 5.4 of [10] .
Theorem B (Nerman, [10] ).
Consider a supercritical, Malthusian branching process with Malthusian parameter
. Suppose that condition (M) holds and
is a random characteristic with properties described in Section 4 . Then almost surely
where
is a random variable not depending on
.
The necessary and sufficient condition for the random variable
to be a.s. positive is the so-called
property of the reproduction process
:
|
(L)
|
We quote Theorem 5.3 of [5] .
Theorem C (Jagers-Nerman, [5] ).
Consider a supercritical, Malthusian branching process with Malthusian parameter
. If condition ( L ) holds then
a.s. and
; otherwise
a.s.
Remark. This theorem is the generalization of the Kesten-Stigum theorem, which states this fact for Galton-Watson processes (see [6] ).
Theorem B applies for the random tree model if condition (M) is fulfilled. We do not intend to identify the necessary and sufficient condition on the weight function
which would guarantee that the corresponding reproduction process possesses property (L). Still it is worth pointing out that if
as
, then this property holds, this by Theorem C,
is a.s. positive.
Lemma 1.
If a weight function
satisfies condition (M) and
, as
, then the corresponding branching process satisfies condition (L).
-
Proof.
We will prove the existence of the second moment of
from which condition (L) trivially follows. Since
, we need
|
(20)
|
The random variables
are independent exponentials for
with parameters
, respectively, thus a simple computation yields that the expression in ( 20 ) is equal to
| |
Transforming the double sum on the right, we get
| |
where we also used
. On the other hand,
| |
Since
and
, we have
| |
if
is large enough. This leads to
by condition (M) which completes the proof of the lemma.
The distribution of
is usually hard to determine from the weight function
, however, one can characterize its moment generating function
. Using the idea of the basic decomposition ( 14 ) one can write the following equation for
By Theorem 4
|
(21)
|
which gives
|
(22)
|
It can be proved that this equation characterizes
as there is no other bounded function satisfying it with a right derivative
at 0. (See Theorem 6.8.3 in [4] .) In the linear case (
) the distribution of
may be explicitly calculated.
Using the fact that
is now a Markov process (which is an easy exercise to prove) and applying standard martingale techniques one can derive a partial differential equation for
. Solving this pde and using the limit ( 21 ) we get the momentum generating function of
from which one can identify the distribution of
as a Gamma distribution with parameters
.
Acknowledgements: We are thankful to J-F. Le Gall for having called our attention to his review paper [8] and the reference [5] therein. This work was partially supported by the Hungarian Scientific Research Fund (OTKA) grants no. T037685 and TS40719. References
-
Barabási, A.-L., Albert, R., Emergence of scaling in random networks, Science, 286 (1999), 509-512
-
Bollobás, B., Mathematical results on scale-free random graphs, in: S. Bornholdt and H.G. Schuster, editors, Handbook of Graphs and Networks, pp 1-34, Wiley, 2002.
-
Bollobás, B., Riordan, O., Spencer, J., Tusnády, G., The degree sequence of a scale-free random graph process, Random Structures and Algorithms, 18 (2001), 279-290.
-
Jagers, P., Branching Processes with Biological Applications, Wiley, 1975
-
Jagers, P., Nerman, O., The growth and composition of branching populations, Adv. Appl. Prob., 16, (1984) 221-259
-
Kesten, H. Stigum, B., A limit theorem for multidimensional Galton-Watson processes. Ann. Math. Statist. 37 (1966) 1211-1223
-
Krapivsky, P. L., Redner, S., Organization of Growing Random Networks, Physical Review E, 63, 066123 (2001)
-
Le Gall, J-F., Processus de branchement, arbres et superprocessus, in: Pier J-P., editor, Development of mathematics 1950-2000, pp 763-793, Birkhäuser, 2000.
-
Móri, T., On random trees, Studia Sci. Math. Hungar., 39 (2002),143155.
-
Nerman, O., On the convergence of supercritical (C-M-J) branching processes, Z. Wahrscheinlichkeitstheorie verw. Geb., 57, (1981), 365-395
-
Olivieri, R., Spencer, J., Connectivity transitions in networks with super-linear preferential attachment, Internet Mathematics, to appear
-
Olofsson, P., The
condition for general branching processes, J. Appl. Prob., 35, (1998) 537-544
Anna Rudas & Balint Toth Institute of Mathematics Technical University Budapest Egry Jozsef u. 1.
H-1111 Budapest, Hungary {rudasa,balint}@math.bme.hu Benedek Valko Renyi Inst. of Math. Hung. Acad. of Sciences Realtanoda u. 13-15 H-1053 Budapest, Hungary valko@renyi.hu