Navigating in the Cayley graphs of 
 
and 
 
 
 T. R. RileySupport from NSF grant 0404767 is gratefully acknowledged.
October 2004
 Abstract
 We give a non-deterministic algorithm that expresses elements of 
 
, for 
 
, as words in a finite set of generators, with the length of these words at most a constant times the word metric. We show that the non-deterministic time-complexity of the subtractive version of Euclid's algorithm for finding the greatest common divisor of 
 
integers  
 
 is at most a constant times 
 
 where  
 
. This leads to an elementary proof that for 
 
the word metric in 
 
is biLipschitz equivalent to the logarithm of the matrix norm – an instance of a theorem of Mozes, Lubotzky and Raghunathan. And we show constructively that there exists 
 
such that for all 
 
and primes 
 
, the diameter of the Cayley graph of 
 
with respect to the generating set  
 
 is at most 
 
. 
2000 Mathematics Subject Classification: 20F05 Key words and phrases: special linear, normal form, diameter, Cayley graph, Euclid's algorithm  
 1  Introduction
 This paper concerns expressing elements of 
 
and 
 
, for 
 
, as words in the generating set  
 
 consisting of the  
 
elementary matrices  
 
 that have 
 
's along the diagonal, the off-diagonal 
 
-entry 
 
, and all other entries 0. 
What gets our study off the ground is an explicit means of writing powers  
 
 in 
 
, for 
 
, as products of 
 
matrices in  
 
. This is explained in Section   2 and involves expressing 
 
as a sum of Fibonacci numbers. It is used (in Section   3 ) in a study of the non-deterministic time complexity of the subtractive Euclid's algorithm for finding the greatest common divisor of integers  
 
. This differs from the standard Euclid's algorithm in that in each step one integer is added to or subtracted from another, rather than a remainder on division taken. 
Yao and Knuth [?]  proved that the average number of steps to compute 
 
by the (deterministic) subtractive version of Euclid's algorithm, where 
 
 is uniformly distributed in the range 
 
, is 
 
. We show that the worst–case non–deterministic complexity of Euclid's algorithm for computing the g.c.d. of 
 
integers 
 
is 
 
, where  
 
. 
Theorem   3.1 Suppose 
 
is an  
 
-tuple of integers, not all zero, and 
 
. Define  
 
. There is a constant 
 
, independent of 
 
and  
 
, such that there is a sequence of no more than 
 
additions and subtractions of one entry from another, after which all but one entry in the  
 
-tuple are zero. 
The innovation is to use the compression techniques of Section   2 to accelerate repeated additions or subtractions of one entry to or from another. By contrast, the non–deterministic complexity is 
 
 in the case 
 
– we supply a group theoretic proof of this, presumably well-known, result. A vivid example is that it requires  
 
 steps (additions and subtractions) to convert 
 
to 
 
, but 
 
can be reduced to 
 
in 
 
steps. 
Then, in Section   4 , we run our accelerated version of Euclid's algorithm of Section   3 on the columns of a matrix 
 
 in 
 
, in the course of reducing 
 
 to the identity by row operations. This leads to a new proof of an instance of a celebrated theorem of Mozes, Lubotzky and Raghunathan [?] , [?] , and we contribute information about the constants: 
Theorem   4.1 Fix 
 
. Let 
 
denote the word length of 
 
, with respect to a fixed finite generating set. There exist 
 
such that for all 
 
 
 
Moreover, if the generating set is  
 
then  
 
is independent of   
 
and  
 
for a constant 
 
that is independent of  
 
. 
 Our proof of Theorem   4.1 is constructive (as are the proofs of the results in Sections   2 and  3 it appeals to) and amounts to an effective algorithm for finding a normal form for 
 
for 
 
– that is, for every 
 
, a word  
 
 on a fixed finite generating set and representing 
 
. 
Equivalently, a normal form is a choice for all 
 
 of path in the Cayley graph from the identity to 
 
. By homogeneity, it amounts to a means of navigating between any two vertices in the graph. 
Our normal form for 
 
is of linearly bounded length ; that is, there exists 
 
such that for all 
 
, the length of  
 
 is at most   
 
 times the length 
 
of the shortest word that represents 
 
. This is because, the length of  
 
 is at most 
 
, on account of its role in the proof of Theorem   4.1 , and 
 
. 
The author's original motivation for embarking on the work in this article was a potential application to the construction of van  Kampen diagrams to establish certain isoperimetric functions (concerning filling loops with discs): 
a long-standing claim of Thurston, originally quoted in [?]  and repeated in [?,§5.A8̇] , is that 
 
admits a quadratic isoperimetric function for all 
 
. By contrast, Epstein and Thurston showed that the minimal isoperimetric function for 
 
grows at least exponentially  [?,Chapter 10] . By a theorem of Gromov  [?,§5A7̇] , 
 
admits an exponential isoperimetric function. The author hopes the normal form will be of use towards proving Thurston's assertion and giving an elementary proof of Gromov's result. 
However there may be formidable obstacles; the geometry of the normal form has to be complicated in the following sense. For 
 
, no normal form for 
 
of linearly bounded length can (either synchronously or asynchronously ) fellow-travel. This result was proved by Epstein and Thurston  [?,Chapter 10] to show that 
 
is not automatic for 
 
; they use isoperimetric inequalities concerning filling 
 
-spheres with 
 
-balls (we mentioned the case 
 
above). 
Finally, in Section   5 , we apply similar techniques to 
 
. We find a normal form and prove the following result about Cayley graph diameter. Theorem   5.1 There exists 
 
such that for all 
 
and primes 
 
,  
 
 We remark, for comparison, that a lower bound on the diameter of a constant times 
 
 follows from  
 
 because  
 
. 
Define  
 
 In Lemma   5.2 (which is due to M.  Kassabov) we show using elementary, constructive means that every  
 
 equals a word in  
 
 and  
 
 of length at most  
 
. Applying this to Theorem   5.1 we get: 
Corollary 1.1
There exists 
 
such that for all 
 
and primes 
 
, 
 
 
 Lubotzky  [?]  explains a non-constructive proof that given 
 
and a generating set  
 
 for 
 
, there exists 
 
, that will depend on  
 
 and potentially (see Problem   1.5 below) on  
 
, such that 
 
 for all primes 
 
: since 
 
enjoys Property  (T ) for 
 
, the graphs  
 
 are a family of expanders, and the result follows. This article supplies an elementary, constructive proof that avoids the big guns of Property (T ) and Selberg's Theorem. The prior absence of such a proof is lamented on of [?,page102] . 
The argument above can be made quantitative as follows to yield a result that is weaker than that of Theorem   5.1 in that it gives  
 
 in place of the  
 
 term in the estimate. Kassabov  [?] , extending methods of Shalom  [?] , [?] , shows that the Kazhdan constant for 
 
with respect to  
 
 is at least  
 
. Define 
 
. 
The first non-zero eigenvalue  
 
 of the discrete Laplacian on 
 
is 
 
 where 
 
 is the spectral gap; 
 
by [?] ; and  
 
by [?,Proposition 5.24] . In fact, any upper bound on diameter obtained this way is also an upper bound on mixing time of the random walk on the Cayley graph, and so Theorem   5.1 suggests that, as is often the case, mixing time and diameter differ for  
 
. 
It is an open question 
 
 [?,Problem8.1.3] whether the  
 
 of Corollary   1.1 can be improved to  
 
. Such a result would be best possible because  
 
. Lubotzky, himself, gets close by proving with Babai and Kantor: 
Proposition 1.2
 [
?] 
, [
?,Proposition8.1.7] 
There exists 
 
such that for all 
 
and primes 
 
, there is a set 
 
of three generators for 
 
such that 
 
In fact, 
 
can be taken to be  
 
where  
 
and  
 
are defined above and  
 
.  
The proof in [?]  appeals to Selberg's Theorem, but the proof in [?]  is constructive and elementary save that “unnatural” generators of 
 
are used in place of  
 
 and  
 
. An alternative route to Corollary   1.1 is to apply Lemma   5.2 to Proposition   1.2 . 
The following problems provide a wider context for the study of diameters of Cayley graphs of 
 
. 
Problem 1.3
Fix 
 
. Does 
 
enjoy uniform Property (T )? 
 
Problem 1.4
 (An Independence Problem for 
 
.) Fix 
 
. Is  
 
a family of expanders? 
 
Problem 1.5
Fix 
 
. Does there exist 
 
such that for all generating sets  
 
for 
 
and all primes 
 
 
 
For fixed 
 
, an affirmative answer to Problem   1.3 would imply an affirmative answer to  1.4 , and that, in turn, would imply an affirmative answer to  1.5 . In the case 
 
, the same implications apply between  1.5 and the following analogues of  1.3 and  1.4 : does 
 
enjoy uniform Property ( 
 
) with respect to congruence subgroups (“The Selberg Property” [?] ), and is  
 
 a family of expanders? 
More details can be found in [?]  and [?] ; groups in which the analogue of Problem   1.3 has a negative answer are constructed in [?] ; the original (more general) independence problems are in [?] ; and a rare example of an independence result is due to Gamburd [?]  who (roughly speaking) finds a large class of generating sets  
 
 for 
 
and primes 
 
 for which 
 
forms a family of expanders. 
We briefly mention related results for 
 
and 
 
when 
 
. 
Property ( 
 
) is enjoyed by 
 
as a consequence of Selberg's Theorem (see [?] , [?] , [?] ), and so for any fixed finite generating set  
 
 for 
 
, we find 
 
 is a family of expanders. So there exists 
 
such that 
 
 for all primes 
 
. This proof (explained in [?] ) is not constructive and neither is the only other known proof, which uses the circle method for lifting elements of 
 
to elements of 
 
with short word representations  [?] . But Larsen  [?]  has given an algorithm that produces word representations of length 
 
. In common with this article, representing powers such as  
 
 by short words is key, and the subtractive version of Euclid's algorithm plays a role. 
Another constructive result is due to Gamburd and Shahshahani [?]  and is in the direction of Problem   1.5 in the case 
 
. They give an algorithm that produces paths in Cayley graphs to prove the following uniform diameter bound: for all primes 
 
, and for all finite sets  
 
 of elements of 
 
such that 
 
 is a  
 
-dense subgroup of 
 
 
 
 where 
 
and 
 
 depends on  
 
. This has been recently improved by Dinai [?]  who shows that for all 
 
, there exists 
 
such that  
 
 for all generating sets  
 
 for 
 
. 
Acknowledgements. I am grateful to Tsachik  Gelander, Martin  Kassabov and Alex  Lubotzky for explaining background to the subject of diameters of the Cayley graphs of 
 
to me, and to Karen Vogtmann for encouragement to investigate Thurston's claims about isoperimetric function for 
 
. 
I additionally wish to thank Martin  Kassabov for providing Lemma   5.2 , improving a lemma in an earlier version of this article. 
 2  Compressing powers  
 
  This section is devoted to proving the following result about representing powers  
 
 in 
 
by words of length 
 
. 
Proposition 2.1
Suppose 
 
with 
 
, with  
 
, and with 
 
. There exists a word  
 
such that  
 
in 
 
and  
 
 
 It suffices to prove the result for 
 
and 
 
, which we do by giving  
 
 explicitly in the second of the two lemmas below. The first lemma addresses the case where 
 
 is a Fibonacci number (defined recursively by  
 
), and will be superseded by the second lemma. The detailed calculation in the proof of the first lemma is key to understanding the proof of the second. 
Lemma 2.2
For non-negative integers 
 
, the words 
 
 
and   
 
 
equal  
 
and  
 
, respectively, in 
 
.  
Proof. We multiply out the first of these words from right to left as follows. 
The calculation for the second is very similar. The notation for each step shown is 
 
. 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 The following result can be proved by an easy induction. 
Zeckendorf 's Theorem [?] , [?] . Every positive integer 
 
can be expressed in a unique way as  
 |  | (1) | 
with 
 
and 
 
for all 
 
. 
 In fact,  
 
 is the largest Fibonacci number no bigger than 
 
, and  
 
 is the largest no bigger than  
 
, and so on. Recall that  
 
for all 
 
, where 
 
, and so  
 
Thus, as 
 
, 
 |  | (3) | 
 
Lemma 2.3
Suppose 
 
is a positive integer expressed as in ( 1 ). Write  
 
where  
 
are the even numbers amongst  
 
and  
 
are the odd numbers. Let 
 
be the integer such that either  
 
or  
 
. Let  
 
be the word 
 
in which  
 
if 
 
and is the empty string otherwise, and  
 
if  
 
and is the empty string otherwise. Let  
 
be the word obtained from  
 
by replacing every  
 
and  
 
by  
 
and  
 
, respectively. Define 
 
Then  
 
equals  
 
in 
 
and has length 
 |  | (4) | 
  
Proof. Lemma   2.2 is a special case of this lemma: when  
 
 we find  
 
 and  
 
, and when  
 
 we find  
 
 and  
 
. Multiply out  
 
 from right to left, as follows, using a more general and concise version of the calculation used to establish Lemma   2.2 . All the sums are over 
 
. 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 The length of  
 
 is 
 
, from which we get ( 4 ) by using ( 3 ),  
 
 and 
 
. 
 3  Accelerating the subtractive version of Euclid's algorithm
  The subtractive version of Euclid's algorithm for finding the greatest common divisor of an  
 
-tuple of integers differs from the standard Euclid's algorithm in that at each step an addition or subtraction is made rather than a remainder taken. That is, in one step an  
 
-tuple 
 
is produced from the previous 
 
-tuple 
 
, as follows. Take 
 
 and 
 
 so that  
 
 and  
 
 have the greatest and second greatest absolute values amongst  
 
. (To resolve dead-heats, take 
 
 minimal, and then take 
 
 minimal amongst the remaining indices.) Define  
 
, with the sign chosen so that  
 
, and define  
 
 for all 
 
. Stop when all but one entry is zero and output the absolute value of that entry. 
For example, in 6 steps the algorithm gives 
 
: 
 
 
 There is a non–deterministic version of this algorithm in which obtaining 
 
from 
 
by adding one entry to another or by subtracting one entry from another constitutes a step. Again, the algorithm stops when all but one entry is zero, and the output is the absolute value of that entry. 
Yao and Knuth  [?]  proved that the average number of steps to compute 
 
by the (deterministic) subtractive version of Euclid's algorithm, where 
 
 is uniformly distributed in the range 
 
, is 
 
. We will show that the worst–case non–deterministic complexity of Euclid's algorithm for computing the  
 
of 
 
integers 
 
is 
 
, where  
 
. (In particular, the greatest common divisor of two integers 
 
can be calculated non-deterministically in 
 
steps by starting with 
 
.) That is, we prove: 
Theorem 3.1
Suppose 
 
is an  
 
-tuple of integers, not all zero, and 
 
. Define  
 
. There is a constant 
 
, independent of 
 
and  
 
, such that there is a sequence of no more than 
 
additions and subtractions of one entry to or from another, after which all but one entry in the  
 
-tuple are zero. 
Proof. First consider running the standard Euclid's algorithm on the first two entries  
 
 and  
 
 in the  
 
-tuple. This proceeds via a sequence 
 
of pairs of integers finishing with a pair 
 
one of which is zero. The pair 
 
is obtained from 
 
by replacing the entry with the larger absolute value by the remainder on division by the other. So for all 
 
there is some integer  
 
 such that either (  
 
 and  
 
), or (  
 
 and  
 
). 
It takes the standard subtractive algorithm  
 
 steps to get from 
 
to 
 
. But, as 
 
, Proposition   2.1 gives us a word  
 
 that has length at most 
 
and that, reading right-to-left, describes a sequence of steps with the same effect. (The step described by the letter   
 
 corresponds to left-multiplying the transpose of the  
 
-tuple. The entries  
 
 in the  
 
-tuple may be disturbed in the course of these steps, but are recovered.) Define  
 
. Then  
 
 and 
 
. So it is possible to get from 
 
to 
 
in 
 
 steps where  
 
 But, as 
 
for all 
 
, and 
 
, this is at most 
 |  |  | 
 |  | (5) | 
Now  
 
 by an easy induction. So by inequality  ( 2 ) of Section   2  
 
 and thus 
 
. This inequality together with ( 5 ) shows there exists 
 
such that 
 
. 
Obtain the bound claimed in the theorem by next arguing as above for  
 
and whichever or the first and second entries in the  
 
-tuple is now non-zero, and then similarly for  
 
, and so on, until finally for  
 
. 
The proof above can be developed into a deterministic algorithm to calculate 
 
. What are needed are the  
 
 together with the words  
 
 of Lemma   2.3 . But those  
 
 are built using the expression for  
 
 of Zeckendorf 's Theorem. Whilst is not hard to write routines to supply the  
 
 and the expressions as per Zeckendorf 's Theorem, it is not clear that producing a deterministic algorithm to calculate 
 
in this manner has any computational advantages. 
Theorem   3.1 fails when 
 
(it is likely the following results are well known, but we include them for completeness and for the contrast): 
Proposition 3.2
To convert 
 
to 
 
or 
 
by successively subtracting one entry from, or adding one entry to, the other, requires 
 
steps. 
Proof. The number of steps required is at least the distance from  
 
 to the identity in the word metric on 
 
with respect to the generating set  
 
. This is because reading a word 
 
 that represents  
 
 from right to left would give a sequence of steps that transforms  
 
 to  
 
. 
But such a word 
 
 descends to a word  
 
 in the images 
 
 
, 
 
 
 of  
 
,  
 
 under the natural map 
 
. And 
 
, presented by 
 
, where  
 
 Now,  
 
 and so  
 
. And  
 
 is of minimal length amongst all words in  
 
 that represent 
 
 
 in the free product 
 
. So the minimal length of words in  
 
 that equal 
 
 
 in 
 
is 
 
. 
Corollary 3.3
The (worst case ) non-deterministic time complexity of the subtractive version of Euclid's algorithm for finding the greatest common divisor of two integers 
 
with  
 
is between 
 
and 
 
. 
 In the next section we will need the following more technical result that is proved in the same way as Theorem   3.1 . 
Theorem   3.1  
 
Suppose 
 
is an  
 
-tuple of integers, not all zero, where 
 
, and suppose  
 
. Define  
 
. 
There is a constant 
 
, independent of 
 
and  
 
, such that there is a sequence of no more than 
 
steps after which the first 
 
entries in the  
 
-tuple are unchanged, all but one of the remaining entries in the  
 
-tuple are zero, and that remaining entry is 
 
. 
 4  The Mozes-Lubotzky-Raghunathan Theorem
  In this section we give an elementary proof of the following result which is an instance of a theorem of Mozes, Lubotzky and Raghunathan on irreducible lattices in semi-simple Lie groups of rank at least 2. In [?]  they proved the case addressed below before generalising it to lattices in other Lie groups in [?] . We add information about the constants. (For a matrix 
 
 with real entries, 
 
 denotes the sup-norm, the maximum of the absolute values of the entries.) 
Theorem 4.1
Fix 
 
. Let 
 
denote the word length of 
 
, with respect to a fixed finite generating set. There exist 
 
such that for all 
 
 
 
Moreover, if the generating set is  
 
then  
 
is independent of  
 
and  
 
for a constant 
 
that is independent of  
 
. 
Proof. One easily checks that if the first part of the theorem holds for one finite generating set for 
 
then it holds for all. We will work with the generating set  
 
. 
The first inequality is straightforward. The sup-norm of a matrix that is the product of 
 
 matrices in  
 
 is at most  
 
, and  
 
 grows exponentially with 
 
. 
The second inequality will take more work. Suppose 
 
. 
Below, is a (well-known) procedure for reducing 
 
 to the identity by row operations. Each row operation corresponds to left-multiplication by some  
 
 and so a word  
 
 that equals 
 
 in 
 
can be extracted. 
- 
 
(
 
)
Convert 
 
 to an upper triangular matrix whose diagonal entries are all 
 
, as follows. 
 
- 
 
(  
 
)
Run Euclid's algorithm on the first column. This will leave all entries zero except one that is 
 
, because 
 
. Let   
 
 be the row containing the non-zero entry in the first column. 
If 
 
then premultiply by  
 
, which reverses the signs of the entries in row 1 and then interchanges rows 
 
and  
 
. 
 
-  
(  
 
)
Run Euclid's algorithm on the entries in rows 
 
to  
 
 of second column, leaving all zero except one that is 
 
and lies in row  
 
. If 
 
then premultiply by  
 
. 
-  
. 
. 
.
-  
(  
 
)
Run Euclid's algorithm on the entries in rows 
 
and   
 
 of the 
 
-st column, to make one entry 
 
and the other  
 
. Then, if necessary, premultiply by  
 
 to get an upper triangular matrix. 
 
 
-  
(
 
)
Get a matrix 
 
for which all the entries on the diagonal are 1 by premultipling by at most 
 
matrices  
 
 that reverse the signs of all the entries in rows 
 
 and 
 
. 
- 
(
 
)
Clear all the above–diagonal entries in 
 
, one column at a time, as follows. 
- 
 
(  
 
)
Premultiply by  
 
. 
- 
(  
 
)
Premultiply by  
 
. 
- 
. 
. 
.
-  
(  
 
)
Premultiply by  
 
. 
 
 
 
As it stands, the number of  
 
 used in the procedure above may wildly exceed 
 
 on account of steps (
 
) and (
 
). However, we can accelerate (  
 
)–(  
 
) as per Theorem   3.1  
 
. Define 
 
. Performing (  
 
) then takes at most 
 
steps and leaves a matrix  
 
such that  
 
. And, proceeding inductively, (  
 
) costs at most 
 
steps and leaves a matrix  
 
 with  
 
. 
So there is a constant 
 
such that for all 
 
, 
 |  |  | 
 |  |  | 
 These inequalites and induction can be used to establish that for  
 
 |  | (6) | 
and for all 
 
 |  |  | 
 So there is a constant 
 
, independent of 
 
 and  
 
, such that the contribution of (
 
) to 
 
is at most 
 
. 
The contribution of step (
 
) to 
 
is at most  
 
. To assess the contribution of step (
 
), first note that 
 
 for all 
 
 because in the course of step (
 
), row 
 
 is not disturbed after (  
 
). So, by inequality ( 6 ) and by compressing each of the 
 
terms  
 
 as per Proposition   2.1 , we see that the effect of step (
 
) can be achieved whilst contributing at most  
 |  | (7) | 
to 
 
, for some constant 
 
independent of 
 
 and  
 
. But the summation term in ( 7 ) is at most a constant times  
 
, which is at most  
 
 as 
 
. The outstanding claims of the theorem then follow. 
 5  The diameter of 
 
.
  We adopt the notation 
 
 and 
 
. 
Theorem 5.1
There exists 
 
such that for all 
 
and primes 
 
,  
 
 
Proof. Suppose 
 
. We reduce 
 
 to the identity matrix by successively premultiplying by matrices in  
 
 in a similar manner to that used to prove Theorem   4.1 . 
First lift the entries in the first column of 
 
 to 
 
and run the accelerated version of the subtractive version of Euclid's algorithm on the first column of the matrix. Then swap two rows (changing the sign of one) to move the non-zero entry to the first row. Next run the accelerated version of the subtractive version of Euclid's algorithm on the lift to 
 
of all but the first entry of second column, and move the non-zero entry to place 2,2 in the matrix. Continue similarly through all the columns. By Theorem   3.1  
 
, the cost is at most a constant times  
 
 We now have an upper triangular matrix 
 
such that every diagonal entry is non-zero. As 
 
 is prime and the diagonal entries in the matrix are non-zero, we can clear all the 
 
entries above the diagonal by premultiplying by matrices of the form  
 
 where 
 
. By Proposition   2.1 the effect of premultiplying by  
 
 can be achieved by premultiplying by a sequence of at most a constant times 
 
 matrices in  
 
. So we can reduce 
 
 to a diagonal matrix 
 
with total cost at most a constant times 
 
. 
As  
 
 and  
 
 are invertible in  
 
, we can convert 
 
 to 
 
by premultiplying by  
 
 as follows, 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 Using Proposition   2.1 the same effect can achieved by pre-multiplying by at most a constant times 
 
 matrices in  
 
. Applying this same process to the second and third rows, and then the third and fourth, and so on we reduce the matrix to the identity, at a total cost of at most a constant times 
 
. 
To deduce Corollary   1.1 we use the following lemma, due to M.  Kassabov, concerning the matrices  
 
 and  
 
 given in Section   1 . 
Lemma 5.2
For all  
 
with 
 
it is possible to express  
 
as a word in  
 
and  
 
of length at most  
 
. 
Proof. We will drop the subscripts from  
 
 and  
 
. For all 
 
 we have  
 
 where the indices are in  
 
 and are taken modulo  
 
. So it suffices to express all  
 
, for 
 
, as words in  
 
 and  
 
 of length at most  
 
. 
For 
 
 define  
 
. Then  
 
 Now  
 
 which, due to cancellations, equals a word of length 
 
in  
 
 and  
 
, and is 
 |  |  | 
 For 
 
, we calculate that  
 
 is 
 |  |  | 
 So  
 
 can be expressed as a word of length 
 
in  
 
. 
Tim R. Riley Mathematics Department, 10 Hillhouse Avenue, P.O. Box 208283, New Haven, CT 06520-8283, USA tim.riley@yale.edu, http://www.math.yale.edu/users/riley/