Pairs of separably closed fields and exotic groups
Abstract.
We look at simple groups associated primarily with the general theory of Moufang buildings, and to analyze their relation to stability theory in the model theoretic sense. As it becomes quite technical in the details, a lengthy introduction surveys the developments at a less detailed level.
The text, beginning from the second section, first deals with some model theoretic algebra of fields, followed by an extended study of three associated families of simple groups coming from the theory of Tits buildings, Moufang polygons, and Timmesfeld’s theory of exotic analogs of .
The field theoretic part is fundamental (§ 2). The rest of the paper relates this to group theoretic constructions, with two sections surveying the consequences for the original Tits and Timmesfeld theory before concentrating on the more exotic groups associated with Moufang polygons.
A good deal of the group theoretical material is expository, aimed to make the relevant structural information meaningful to those coming from the direction of model theory.
1. Introduction
Our aim here is to construct some simple stable groups which are not algebraic (hence, “exotic”). These are not, strictly speaking, “new” groups, but instances of a phenomenon discovered by Tits long ago, in connection with the classification of buildings of spherical type [Tits]. He called them groups of “mixed type”. We became aware of this much later, while looking into the classification of Moufang polygons given in [TW] and discussed below in § 5. Moufang polygons can be classified broadly speaking into algebraic (associated with algebraic groups), classical (in a historical sense), and mixed, reusing the term introduced by Tits to reflect both their similarities to the algebraic case, and the use of two fields rather than one in their construction; but in the case of Moufang polygons the meaning of the term becomes a bit broader.
So we have on the one hand the groups identified by Tits, which are analogs of algebraic groups in Lie rank at least , but with a coordinatization involving two fields , and we have also various groups associated with Moufang polygons which are analogs of algebraic groups in Lie rank , but associated with a considerably more intricate collection of coordinatizing structures (including some of Tits’ original type, constructed from a pair of fields). There is also a rank analog of due to Timmesfeld, which we will also consider.
A very natural program is then the following:
-
(a)
Construct some stable algebraic structures of the sorts used by Tits, Tits/Weiss, or Timmesfeld.
-
(b)
Deduce the existence of the corresponding stable simple groups.
This turns out to be more subtle than appears at first. So we aim not only to carry this through in some cases, but also to point out some issues that others might want to explore further.
In the Tits setting, things work out neatly but with more delicacy than one might expect. An ample supply of coordinatizing structures for Tits’ purposes is afforded by Theorem 2.1, and in a generalized form, by Theorem 2.2. We cover some cases relevant to the Timmesfeld construction and an interesting case from the Tits/Weiss classification. However one is not quite done at this point.
One might expect that a general interpretability result would allow for the systematic treatment of step (b) above. This seems not to be the case (see Question 4.12). On the other hand, in the context of the groups of Tits’ type associated to a pair of fields , this is the case.
The problem in general is that when one moves beyond Tits’ original setting, the groups are defined as those generated by a collection of subgroups. This is perhaps clearest in the rank 1 case (the Timmesfeld construction), which is given explicitly as a subgroup of whose diagonal subgroup is generated by elements whose coordinates lie in an additive subgroup of . The situation in rank 2 is much the same, but the notation involved is a good deal more complicated.
In fact, one may take a slightly different point of view on all of this, one that emerges most clearly in the rank 2 setting (Tits/Weiss). This becomes more technical. We describe this now, but the reader might prefer to look first at the more concrete rank 1 setting of § 4 where everything can be worked out in detail, from first principles, and only then return to a consideration of rank .
In any case, in the rank 2 setting, there are at least two groups naturally associated with a given Moufang polygon, and it becomes important to distinguish them, and to consider more generally the groups lying between them. The first group is the full automorphism group of the Moufang polygon. The second group, called the little projective group, is defined (by analogy with Chevalley groups) as the subgroup generated by the so-called root groups, which are the fundamental building blocks of the group from the point of view of either the Chevalley theory or the theory of Moufang polygons, and in the classical cases are copies of the additive group of the field. These groups appear in the Moufang theory as subgroups of the automorphism group of the Moufang polygon, and then the group they generate is one of the main groups of interest within the automorphism group, and is certainly the smallest group of interest, for our purposes.
In most cases the latter group is simple, and is the socle of the full automorphism group (its unique minimal normal subgroup). Between this group and the full automorphism group there are some other groups which are interpretable in the coordinate system for the group, and whose commutator subgroup is our simple group. So if we start with a stable coordinate system then we can associate a stable group with a simple socle to it, but in passing to the commutator subgroup, while we gain simplicity, we may lose definability.
Accordingly, our exposition becomes more elaborate than we had expected, as we sort through these issues. To complicate matters, our sources for the three cases take varying points of view, from the explicit matrix theoretic point of view of Timmesfeld, to the style of Chevalley (and Steinberg) in terms of generators and relations in the Tits/Weiss setting, and (for the part that concerns us) much more directly in terms of the structure of algebraic groups in the Tits setting. So we have the choice of unifying our perspective or staying close to our sources as we go along. We try to unify the description, but at the same time we do need to quote specific material from each source.
The paper is aimed at model theorists with an interest in a variety of related topics. We have arranged it as follows:
In § 2, Theorems 2.1 and 2.2 give the supply of stable “coordinate systems” with which we work. This is self-contained and is closely related to well-known work on the model theory of separably closed fields. Here the first theorem serves as warm-up to the second, and provides enough information to deal with the groups of mixed type as originally considered by Tits. We describe such groups in § 3 and prove that we do indeed get stable simple groups of this type by passing to the context of separably closed fields and applying Theorem 2.1.
Now Theorem 2.2 is of interest because the algebraic systems considered are the natural parametrizing systems for the groups which interest us. At the same time, the groups themselves cannot always be defined in a first order way from these structures. A point of considerable technical interest is that in some cases, enriching the original parametrizing structures to richer structures of the same kind may make the group first order definable.
We then pass to the opposite extreme—rank —in the following section, working out the details of Timmesfeld’s construction and the consequences for the issues of definability and interpretability that concern us here. Everything can be seen very simply by repeating standard computations (either from the point of view of Chevalley theory, or from the point of view of elementary linear algebra in two dimensions). The unsettling phenomenon of a conflict between the desired simplicity and the desired interpretability appears at this stage. One can say more precisely how the initial coordinate system should be expanded to make the simple group definable, but then the issue of stability has to be approached afresh, and the situation becomes much more complex. Perhaps someone will investigate this further.
The last three sections discuss the related groups of automorphisms of Moufang polygons at some length. At this point the notation becomes noticeably more burdensome. Here we encounter everything that we have seen in the original Tits construction together with the complications that became visible in rank 1—and not much else, fortunately, other than some rather specific notation. At this point one needs to work rather concretely in the notation of root systems in order to sort out the details. Readers will probably find our presentation either excessively terse or excessively detailed, depending on their degree of familiarity with that notation. The ultimate result, which is a theme throughout much of the latter part of Tits/Weiss—though not put in these terms—is that in rank one has to deal with two separate instances of the rank theory, and otherwise things are rather similar to the case of algebraic groups.
In more detail, the content of the last three sections runs as follows: In § 5 we give an overview of what is done by Tits and Weiss in [TW], and the notation used. Their goal is to give a classification theorem in terms of concrete coordinatizations by algebraic systems. This background material discusses what is common across all cases prior to the introduction of coordinates.
The next two sections then look into two particular cases of the classification of Moufang polygons as given by Tits and Weiss. The first concerns Moufang hexagons, where we encounter examples already noticed by Tits as rank 2 analogues of the algebraic group of exceptional type (so, in his notation). The second, more subtle example, treated in the last section, concerns the Moufang quadrangles of so-called “indifferent type,” which are those most closely related to the Timmesfeld construction in rank 1. Our summary of the situation, above, focuses on this case: this is the setting which inherits the specific difficulties associated with the rank 1 case.
The classification of Moufang polygons involves further families which could be investigated model theoretically; they tend to involve structure incompatible with stability, but compatible, in principle, with simplicity. The interested reader may want to look further in that direction, and in particular investigate the problem of building coordinate systems of the various types which are simple in the model theoretic sense.
We imagine that most readers will be interested either in looking into § 2 and taking much of the rest on faith (particularly from § 5 onward), or else taking § 2 on faith and looking into the following group theoretic issues (including the definability issues that arise). Either approach should be perfectly feasible. Most of what we have to say in the group theoretic part is intended to be expository, but it was not always evident where to find clear statements in the literature of the facts most directly relevant to the model theoretic issues.
Up to this point, we have been very vague about the details, in order not to become lost in them. In the remainder of this introduction we give a more precise account of the main points (and the key definitions) concerning the original construction of Tits, the lower rank constructions of Timmesfeld and Tits/Weiss, and the role of the model theory of separably closed fields in the construction of stable coordinate systems of the appropriate types.
1.1. The Tits construction: [Tits, (10.3.2)]
Tits constructs analogs of (abstractly) simple algebraic groups over algebraically closed fields, in certain very special cases, defined from a suitable pair of fields with . The point of view taken is that of Chevalley, with a small twist. This relies on the description of these groups in terms of root systems and their Dynkin diagrams, which may be summarized very rapidly as follows. This is either a reminder, or a few points of reference for the discussion afterward.
We begin with the algebraic group , which in algebraic terms is a -split simple algebraic group of adjoint type. The -dimensional subgroups are isomorphic to the additive or multiplicative group of . A maximal torus is a product of a certain number of copies of the multiplicative group of ; that number is the Lie rank. The copies of the additive group of invariant under the action of are the root groups (with respect to ); these are permuted by the group ; the action of on the root groups can be identified with the action of a finite reflection group acting on real Euclidean space (a Coxeter group) and these are classified by the Dynkin diagrams of types –. The root groups then correspond to a finite set of vectors invariant under the action of (these vectors encode the homomorphisms from to which gives the action of on the corresponding root group).
From the Dynkin diagram, or the root system and the action of , one can recover the construction of the group from the field ; this is the description of as a Chevalley group. We will see this concretely in the case of rank 2 in §§ 6, LABEL:Sec:Indifferent, where in the latter case the construction is a generalization of the one described by Tits, and additional complications arise.
For our purposes it is important that the roots will always have either one or two root lengths. The setting for the Tits construction involves a simple split algebraic group of adjoint type over a field associated with a root system in which, in fact, two root lengths occur. Furthermore we require the characteristic to be “exceptional” in a certain sense (in a familiar sense from the point of view of finite group theory, and explained by Tits in terms of s special isogenies, [Tits, (5.7.3)]). The restriction on root lengths means concretely that the Dynkin diagram is of type , , , or , and the restriction on the characteristic then means that the characteristic is unless we have type , in which case the characteristic will be .111Here the classification by Dynkin diagrams can be treated simply as a set of labels for the cases of interest, until we come down to the rank case. Tits mainly deals with the case of in [Tits]; he is able to identify types , with groups he has treated from another point of view [Tits, (2), p. 204], and is mentioned in passing but lies outside the scope of that monograph.
In this setting, one fixes a second field with
With the original algebraic group, one builds a group containing , and contained in , much as one might construct as a Chevalley group.
Namely, we consider a Borel subgroup with -split, we extend the groups and to groups and in a manner to be described momentarily, and then we set , so that is isomorphic to the usual Weyl group . The group is then defined as the group generated by and .
The group is an exact analog of the maximal unipotent subgroup of a Borel subgroup from the point of view of Chevalley. Namely, is generated by the root subgroups, which are copies of , subject to the Chevalley commutator relations determined by the root system. One adjusts this construction by taking the root groups for long roots to be copies of the additive group of the smaller field , and the root groups for the short roots to correspond to the larger field. One may then check that the Chevalley commutator formula makes sense (using the precise data in that formula, and the particular value of the characteristic).
At this point, one could reasonably proceed as follows: using the same modified notion of root group based on a pair of fields , take the group inside generated by all the long root groups over and the short root groups over . However, Tits proceeds in a different way. which connects up directly with his theory of BN-pairs. Before following him on this path, we discuss why one might do that.
1.1.1. BN-pairs and the Bruhat decomposition
In the first place, Tits’ BN-pair theory gives a direct route toward connecting the new groups with the subject of his monograph [Tits]. In the second place, the data and are explicitly given direct analogs of the usual groups and . On the other hand the group generated by them is potentially obscure; a priori it might very well be , for example. But the BN-pair theory implies a so-called Bruhat decomposition
which is the double coset decomposition of with respect to . (More properly, is replaced by a representative in , but the corresponding double coset is well-defined.) Comparing this to the corresponding decomposition of , we see that is , which is reassuring. And more generally, the Bruhat decomposition can be read as saying that is built from in exactly the way that is built from .
1.1.2. The groups
The groups obtained in this manner are (in the Dynkin notation) the families , , and the exceptional groups , . The groups are variations on the algebraic group . Further variations are possible: these correspond to Moufang quadrangles of indifferent type in the sense of Tits and Weiss, discussed in § 1.2. Rather than taking a pair of fields , we take a large field and two additive subgroups , , with
where now is a vector space over and is a vector space over the field generated by . We then proceed to build a group in the manner of Tits, using to parametrize long root groups, for short root groups. This is not the description used by Tits and Weiss however; they build its associated Moufang polygon and then compute the subgroup of the automorphism group generated by the corresponding root subgroups (parametrized by and rather than and ).
We have some unfinished business to attend to. On the one hand, we need to complete the definition of the groups . On the other hand, we should say a bit more as to how one actually obtains the BN-pair properties, or at least the Bruhat decomposition; this is the only way one has of seeing that these groups are in fact new groups, and Tits refers a little vaguely to Chevalley for this point, in [Tits], though elsewhere he gave the argument explicitly (in the Chevalley context).
1.1.3. (definition, concluded)
We have described as the subgroup of generated by modified root subgroups. Tits defines the torus , as the subgroup of whose elements act sensibly on the root groups: that is, the elements of which leave the root groups of invariant. In other words, these are the elements which act via multiplication by an element of on the long root groups.
In particular the group normalizes the group , and so we can define a “Borel subgroup” . (This is the largest available torus inside .) One could consider other constructions defining the torus in a different way. In the rank case this point is the subject of extended calculations in [TW]; however the full automorphism group also contains elements inducing automorphisms of the coordinate system, which in the cases of interest to us are certain field automorphisms, and these will not appear in an algebraic group.
It is then reasonably clear that the “Borel subgroup” is interpretable in the pair ; more concretely, its underlying set is definable in if we take to be equipped with a predicate for the subfield . It then follows from the Bruhat decomposition that the same applies to , and thus stability of the coordinate system will give rise to stability of the group; the converse also holds (indeed is interpretable in ).
1.1.4. and
We come back to the point that the groups and give a Bruhat decomposition for , indicating how this goes in the setting of Chevalley groups, and how it relates the theory of BN-pairs. For brevity we will now write , , , , and for the various groups involved in the definition of . So the Bruhat decomposition is
As Tits mentions in [Tits, 10.3.2], a key ingredient is the fact that the nilpotent group can be written as the product of its root subgroups, taken in any order. Another ingredient is the fact that the Bruhat decomposition holds in rank one (in , , or the projective versions of these groups). To this one adds some observations about the operation of the reflections corresponding to a simple root on the set o positive roots, and the fact that opposite root groups generate a rank one subgroup.
We run over some of the more formal aspects of this argument, taking as our initial goal the Bruhat decomposition. As is generated by and , and with contained in , the double coset decomposition exhibited is contained in and in order to show that it is , it suffices to show that it is closed under multiplication by (representatives for) and under multiplication by , the latter point being evident. Also, as is generated by reflections corresponding to simple roots , it suffices to check that sets of the form are contained again in the double cosets exhibited. What is claimed, in fact, is the following:
This is one of the fundamental axioms in the theory of BN-pairs, in fact, so the question is how to verify it.
This can be further reduced by similar formal manipulations, since and normalizes , to a consideration of , and then even further by consideration of the structure of . Namely, may be written as where is the root group corresponding to , and where is the product of the remaining root groups, which is itself invariant under . One reduces quickly to a consideration of . Then either or carries into another root group group contained in . In the first case and one finds that . In the second case one applies the same reasoning to in place of , but one also uses the Bruhat decomposition for the rank one group generated by and .
The last details are found in the proofs of [St, Lemma 25, § 3; (b) p. 34] or [Tits-BN, (16), p. 323].
1.2. Tits-Weiss and Timmesfeld: subtleties
So far, everything proceeds according to plan. Now complications arise as we encounter some variations corresponding to Lie ranks 1 or 2, where the underlying algebraic systems are of a more general type.
For us, the most interesting case concerns Moufang quadrangles of “indifferent” type, similar to the buildings associated with Tits’ groups , but more general. Most of the complexity of this case, as far as the model theory is concerned, can be traced back to the rank groups associated with simple roots in this setting, which turn out to be the groups Timmesfeld calls and (we are in characteristic , so we do not need to distinguish and ).
There are interesting comments about the history and the differing emphases of the various approaches taken to this subject by [Tits], [TW], and [Ti] to be found in Richard Weiss’ review of [Ti] in the AMS Bulletin [WeissBAMS]. In particular, the following has considerable relevance here:
In a spherical building, groups of rank one appear as groups generated by pairs of “opposite” root groups, …. In the classification of Moufang buildings, in fact, these subgroups are avoided to the maximal extent possible. The philosophy of abstract root groups is just the opposite—groups of rank one are enshrined in the hypothesis themselves and play a central role in the whole theory.
We will approach the rank 2 case via the rank 1 case, in order to encounter the model theoretic issues in their simplest “pure” state. This means in particular that we will be crossing over between two rather different points of view.
We are again in characteristic with an imperfect field , and we begin in rank . In the Timmesfeld setting—or rather, the special case of interest to us here—we will have an additive subgroup of containing and invariant under multiplication by . Timmesfeld’s description of his group involves generation by two “root subgroups” parametrized by , but as we will check later, we can give a description similar to the one given by Tits above.
We begin with a single root group (where is not necessarily a subfield) which we may take to be the upper unitriangular matrices with coefficients in the additive group . If we followed Tits’ construction we would also define a torus at this point. In fact we will take the root group and its opposite, and the group they generate, and then compute the torus generated as a subgroup of . This turns out to be parametrized by the multiplicative subgroup of which is generated by the nonzero elements of . This is the point at which nondefinability enters into the picture.
On other hand, after this detour we could start afresh and define as the particular group of diagonal matrices just mentioned, then define , and let be the group generated by and a suitable Weyl group element. The usual Weyl group element
will do (and we can omit the minus sign, as the characteristic is ). This preserves the connection with the Tits construction; but we will in fact take Timmesfeld’s definition as our point of departure.
As there is only one pair of roots, the field does not play much of a role here, and it could be replaced by the subfield generated by .
On the other hand, the torus is not the strict analog of the one considered by Tits. The direct analog of Tits’ in this context would be the subgroup of the diagonal group which normalizes . But this is , since is a vector space over . So that torus would depend on the choice of .
Notice that it is the small torus which is a maximal torus in the simple group . But in general it is the larger torus , defined in the manner of Tits, which is definable from the coordinate system, so here we have a definable group with simple socle, stable if is, and the commutator subgroup of this group is simple, but not necessarily stable.
All of this can be checked by direct computations which we will make, and which are the usual computations made over a field in the context of Chevalley groups. In particular one verifies the Bruhat decomposition in this context, and that leads to a proof of the BN-pair axioms also in rank (carried out in a different way in [TW]).
Turning to this rank case, let us call the group associated by Tits and Weiss to the coordinate system . Namely, one defines by strict analogy with the case of Chevalley groups, as in the algebraic group , with and parametrizing the long and short root groups respectively, and using the Chevalley commutator relation to define the group law.
In an algebraic group setting one may then take the opposite group and the group they generate; or in the setting of Moufang polygons one may define the corresponding Moufang polygon (with some effort) and then consider the group generated by root subgroups. From this point of view one also computes the torus (with considerable effort in this setting). This gives a simple group which is not necessarily first order definable, because the torus itself is not necessarily definable, and in fact rank one groups of type and are involved. The analysis of Tits and Weiss determines both the minimal torus (splitting the normalizer of the group as in the corresponding simple group) and the maximal torus (giving a similar splitting, but in the full automorphism group of the Moufang polygon)222Tits and Weiss give in [TW, § 37] a complete description of the automorphism group of the polygon for the various Moufang examples, which involves an “algebraic part” and a subgroup coming from automorphisms of the field ; here, by full automorphism group we will mean the “algebraic part”..
The result is that inside the automorphism group of the Moufang polygon, and above the group generated by root subgroups, we have a family of groups, corresponding to a family of “tori” (in a very broad sense, allowing actions by field automorphisms on the coordinate system).
The smallest of these groups is simple but not necessarily definable over the coordinate system (in the first order sense), while the largest is rather too large for any of our purposes; but in between one can find a definable group whose commutator subgroup is the corresponding simple group (i.e., the associated torus is abelian). Here, definability refers to definability in the structure .
In particular when the coordinate system is stable, the closest we come, in general, to building a stable simple group is to build a stable group with simple commutator subgroup.
On the other hand, as yet we have no negative results in the more challenging cases. In particular we do not know whether some of the simple groups which are not interpretable in the associated algebraic systems might themselves be stable, for other reasons.
This last is not intrinsically a group theoretic question, since the simple group of interest is definable from a coordinatizing structure expanding by the torus of the group and its action on the root subgroups (and conversely, this structure can be recovered from the group, if one is careful about the formulation). The torus can be made a little more concrete as it is a product of 1-dimensional tori which can be taken separately and come from rank 1 subgroups of Timmesfeld’s type.
So stability of the simple group is equivalent to stability of the structure together with the two 1-dimensional tori associated with the rank 1 subgroups corresponding to simple roots, and their actions on all the root groups. In this sense, one can set aside the simple group and work with an expanded language of fields instead.
1.3. Some model theory of fields
A few introductory remarks about the model theory of fields are also in order, just to set the scene properly. From our perspective, what was intriguing was the central role of imperfect fields in all of these constructions, and the known fact that separably closed fields have stable theories. This is what suggested the current line of investigation, and, in particular, our interest in the case of Moufang quadrangles of indifferent type.
The question as to whether every stable field is in fact separably closed is of long standing (see for example [KrP-SFW]). This question has been placed in a broader framework by Shelah and others, and occurs now in a number of formulations generally all going by the name of Shelah’s conjecture for (e.g.) dependent fields [HaHJ-SDF]. This broader question is being actively pursued at present and leads into very different issues outside stability theory. But certainly in the present state of knowledge the only definite source of constructions of stable simple groups in which fields can be interpreted will pass through the theory of separably closed fields. If one enlarges the scope to simple unstable theories, then some other constructions from the theory of Moufang polygons would come into play, involving automorphisms and various semilinear or quadratic forms.
We turn now to the details, beginning with the model theoretic algebra that produces a good supply of stable structures suitable for use as coordinatizing structures, with three cases: Timmesfeld’s rank one groups , and the two families of rank two groups and the indifferent type for .. With that in hand we will take up the three sorts of groups of interest, starting with Tits’ theory over pairs of fields, where matters are simplest in some fundamental sense (though with the usual apparatus of algebraic groups, root systems, and also BN-pairs all in the mix). Then we pass to the rank 1 case as a relatively transparent context where real problems of definability arise, before coming finally to the most interesting case, Moufang polygons of indifferent type, where the groups to be constructed are stable, with simple socle equal to the commutator subgroup, and nonalgebraic.
1.4. Main results of the paper
As explained before, our aim was to study from a model-theoretic point of view examples of “exotic groups,” preferably simple ones. We concentrated on three cases: , and groups obtained as automorphism groups of the Moufang polygons of [TW] coordinatized by an indifferent set, and in particular the groups generated in that setting by the root groups associated with the Moufang polygon (and a fixed apartment).
The first point is that stable coordinatizing systems exist in all three cases. This is proved by fixing an imperfect separably closed field of the appropriate characteristic, and studying their model theory in various enrichments, by subfields of , or by vector spaces between and . The results are valid in arbitrary characteristic, and the main result in that section is:
Theorem 2.2. Let be a separably closed field in characteristic , and
a chain of subfields of containing . Furthermore, for let be an additive subgroup of which contains and is a vector space over , and which satisfies, in addition, the following two conditions:
-
(1)
,
-
(2)
Any subset of which is linearly independent over is -independent over .
Then the structure is stable, and the complete theory is given by the properties stated together with simple numerical invariants: the dimensions of both and over , as finite values or the formal symbol .
One also obtains a variation of this result by slightly modifying the vector spaces (Theorem 2.6).
These results will be applied in characteristic to a pair of fields and in characteristic to two fields and two additive subgroups meeting the additional requirements.
Let us first start with two results on the groups (“à la Tits”).
Theorem 3.3. Suppose that is of adjoint type (centerless) and split over Then for , the group is simple.
For the Tits groups, stability of the group is equivalent to stability of the coordinatizing pair of fields, and we have
Theorem 3.5. Suppose is simple of type of type , , , or and is a pair of fields with
and the appropriate characteristic ( for type , and otherwise).
Then the following hold:
-
(1)
If the pair of fields is a stable structure, then the groups and are stable.
-
(2)
If is separably closed then and are stable groups.
(The converse of item (1) is proved separately for type and , see Theorem LABEL:bidef:G2UKk for , and Theorem LABEL:Thm:C2:U(k,K) for .)
In particular for the case of groups of type we achieved our goal, and obtain a class of automorphism groups of Moufang hexagons which are both stable and simple.
Coming now to the rank one case (Timmesfeld’s exotic simple groups of type ), we have the following standard facts:
Theorem 4.2. Let be an imperfect field of characteristic and an additive subgroup satisfying
where is a vector space over . Let be the subgroup of with coordinates in the multiplicative subgroup of generated by . Let and .
Then we have the Bruhat decomposition
In particular, is the group of upper unitriangular matrices in , and is the diagonal subgroup.
Furthermore, is simple.
The definability theoretic properties of the group are more subtle and lead us to consider a slight generalization where is a subgroup of the diagonal matrices in a larger group over a field. We may take to contain the diagonal matrices of .
Corollary 4.9. Given a (slightly generalized) Timmesfeld group , with additive group and torus , there is a structure with a field and a subgroup of such that the following are equivalent:
-
(1)
The group is stable.
-
(2)
The structure with the field structure on and the additive and multiplicative subgroups and is stable.
In particular, when is the subgroup of consisting of diagonal matrices (and is ), the corresponding group may be taken to be the subgroup of generated by the nonzero elements of .
This is the point at which one realizes that is likely to be undefinable in first order terms relative to its natural coordinatization by , and examples falling under Theorem 2.2 confirm this.
We now deal with our second example, associated to hexagonal systems of type 1/F, and which turns out to coincide with .
Theorem LABEL:Thm:G2(k,K). Suppose is a pair of fields with
Then the following hold:
-
(1)
The group is stable if and only if the pair of fields is a stable structure.
-
(2)
If is separably closed then is a stable simple group.
Theorem LABEL:bidef:G2UKk. Let be a pair of fields in characteristic with
and let in the sense of . Then each of and is definable in the other.
This immediately gives
Theorem LABEL:thm:hexagon. The group is stable
(model-theoretically simple, NTP2, NSOP1,
…) if and only if the pair of fields is stable
(resp. model-theoretically simple, …).
Now we turn to our real interest: the rank two case, and specifically automorphism groups of certain Moufang hexagons (§ 6) and Moufang quadrangles (§ 4).
Theorem LABEL:Thm:C2:U(k,K). Let be a weak indifferent set and let be the group in the sense of . Then each of and is definable in the other, where
on .
Theorem LABEL:Thm:Indifferent:Definability:U. Let be a weak indifferent set, a maximal torus of , and a subgroup of normalizing the group and containing .
Let be the structure
consisting of the group with the subset , the abstract group with its multiplication, and the following additional structure:
-
(1)
the map defined by ;
-
(2)
actions of on and on which correspond to the actions of on two root subgroups , with the two simple roots, where is short and is long.
Then the group is interdefinable with .
In particular, is stable if and only if is stable.
2. Stable pairs of fields and related structures
Results
For our applications, we need to work with pairs of fields, or with more general structures (but again, in pairs). But what can be done with pairs of fields can also be done, in the same way, with more than two nested fields, and with the more general coordinatizing systems called indifferent sets. Our first result in this line will be the following, which we will need in characteristic and , and with , so that we have two distinct fields at our disposal:
Theorem 2.1.
Let be a separably closed field of characteristic , and let
be a chain of subfields of containing , viewed as a structure with predicates for the fields. Then the theory of this structure is stable.
Furthermore, this theory is axiomatized by the stated properties together with a specification of the dimensions (as finite values or the formal symbol ).
The method of proof will pass through an elimination of quantifiers in an appropriate language—the language customarily used for quantifier elimination in separably closed fields, reviewed below, together with the appropriate unary predicates.
This result already supports the Tits constructions, including some in rank 2, notably in the case of , which was first described in [Tits, § 10.3, p. 205 (Remark)].
But as we have explained, we need a more varied supply of coordinatizing structures, involving some additive subgroups as well as subfields—in characteristic . The following will be sufficient for our current purposes, though as previously discussed, the question of stability of the associated simple groups would require even more elaborate coordinatizing structures, at this greater level of generality.
The relevant value of in the next theorem will be , as we will be working mainly with the two additive groups and .
Theorem 2.2.
Let be a separably closed field in characteristic , and
a chain of subfields of containing . Furthermore, for let be an additive subgroup of which contains and is a vector space over , and which satisfies, in addition, the following two conditions:
-
(1)
,
-
(2)
Any subset of which is linearly independent over is -independent over .
Then the structure is stable, and the complete theory is given by the properties stated, together with simple numerical invariants: the dimensions of both and over , as finite values or the formal symbol .
Algebraic preliminaries
Definition 2.3.
Let be fields of characteristic .
-
(1)
A subset of is -independent in if for every finite subset of ; otherwise, it is said to be -dependent. A maximal -independent subset of is called a -basis of , and one then has .
-
(2)
A subset of is -independent over in if whenever is a finite subset of . Note that if , we could equally say: is -independent in .
-
(3)
The degree of imperfection of the field is such that . Equivalently, it is the cardinality of a -basis if has a finite -basis, and the symbol otherwise.
Notation 2.4.
Let be a field of characteristic .
-
(1)
For each , we fix an enumeration , , of the -monomials in , i.e., of all monomials on where the exponents are between and . Without loss of generality, for each .
-
(2)
The -functions on are defined in the following way:
-
(3)
if is not -independent in , or if is not -dependent on in ; else,
-
(4)
the values of the are uniquely defined by the condition
-
(5)
Let be the language of fields , and let the language be . Observe that the inverse of the Frobenius map is -quantifier-free definable on : if , then .
-
(6)
Let be a -independent subset of . For each and , we denote by the corresponding restriction of . If , we will say that the -functions are well-defined at when . Similarly, the iterates of the are said to (all) be well-defined at if for all .
-
(7)
Suppose we have a nested sequence of fields
We define the language
where the are unary predicates for the subfields , and the function symbols are interpreted as the usual functions on the field , and outside. If is a -basis of , then denotes the -functions restricted to .
We now collect some useful results, mostly classical (and trivial if the degree of imperfection of is finite). We will give most of the proofs, though briefly. More detailed proofs can be found at various points in [B], [D] or [Sr86].
Remark 2.5.
-
(1)
Let be a subfield of . Then the following are equivalent:
-
(a)
is a separable extension of
-
(b)
is closed under the -functions of
-
(c)
the elements of any (or, some) -basis of stay -independent in .
In this case, the -functions of and of agree on .
-
(a)
-
(2)
Let be -independent. Assume that the iterates of the -functions are well-defined at the element of , and let denote the set of these iterates. Then is closed under the -functions. Hence has -basis , is a separable extension of , and is closed under the -functions of .
-
(3)
Let be a subfield of closed under the -functions of . Assume that is a -basis of such that is a -basis of . Let be closed under the -functions. Then is closed under the -functions of .
Note that in general it is not true that if and are -substructures of , then so is the field . For example, take -independent, and consider , .
-
(4)
Let be a subfield of closed under the -functions of , and let . If is the closure of under the -functions of , then is countably generated over .
-
(5)
The -functions of extend uniquely to the separable closure of .
-
(6)
Suppose the subfield of is an -substructure of , and let . Then the -substructure of generated by is countably generated over .
-
(7)
Let . Let be a -basis of over , and a -basis of over . Then is a -basis of and
-
(8)
Let and be separably closed fields, and an -substructure. Suppose that and are both saturated of the same cardinality with . Let be a -basis of such that is a -basis of , and let be a -basis of containing . If is a bijection, then extends to an isomorphism .
Proof.
(1) See [B] or a similar (general) text.
(2) If is a -independent -tuple in and are two elements of , then and belong to the ring generated over by the elements
, for . |
Moreover, ; this gives the first assertion, and the second follows by (1).
(3) By (2), is closed under the -functions, and the result follows by (1).
(4) Let be as above, and extend a -basis of to a -basis of . Let be the set of -iterates of . As this set of functions is countable, the set is countable, and involves only countably many elements of . That is, there is a countable subset of such that all iterates of the -functions are well-defined at .
Now by (3), is closed under the -functions of , and contains .
(5) We know that for each , , so that there are polynomials , depending only on the minimal polynomial of over , such that for all . Given a -basis of the polynomials determine uniquely the values of the iterates of the .
(6) For each , select a -basis of such that is a -basis of . Then, using (4), we build an increasing sequence of subfields of , where is the -closure of , and for , with , is the -closure of . Since each is countable by (4), so is , and by (4), since and are closed under the functions , so is .
(7) As is a -basis of over , is a -basis of over , and therefore and is a -basis of .
Observe now that
(8) This is a straightforward back-and-forth argument, using the stability of the theory of separably closed fields of infinite degree of imperfection.
By (5), extends to some defined on . Assume now that we have an isomorphism extending , with -substructures of respectively, such that and a -basis of .
Let . By (4), the -closure of is countably generated over , and adding countably many elements of if necessary, we may assume that is a -basis of ; let be countable, closed under the , and such that .
By saturation of , there is some which realizes , and as is independent from over , is separable over , and is stationary, it follows that realizes . This proves one direction, and the other is symmetric. ∎
Proofs
We now give the proofs of Theorems 2.1 and 2.2. We restate Theorem 2.1 in a more explicit form as follows:
Theorem 2.1. Let be a separably closed field of characteristic , and let
be a chain of subfields of containing , viewed as a structure with predicates for the fields. Then the theory of this structure is stable.
Furthermore, this theory is axiomatized by the stated properties together with a specification of the dimensions (as finite values or the formal symbol ), and admits elimination of quantifiers in the associated language , with the predicates and the functions interpreted naturally.
Proof.
Since all contain and are contained in , the sequence
is a series of purely inseparable extensions.
Let be theory stating that the sequence of fields has the stated properties, and which, in addition, specifies the degrees . We show first that this theory is complete and allows quantifier elimination.
Let be an -substructure of . Then , each extension , is separable, and for , and are linearly disjoint over , since is purely inseparable.
We may assume that is saturated of cardinality , and we fix another model of containing which is also saturated of cardinality .
Let be a -basis of , a -basis of , …, a -basis of ; extend to a -basis of , to a -basis of (this is possible because and are linearly disjoint over , so that are purely inseparable extensions), …, to a -basis of (again, use purely inseparable).
Then is a -basis of , and for each , is a -basis of . Do the same with to obtain corresponding -bases , and observe that necessarily, either is finite, or , by saturation of and .
Thus, if is a bijection between and which sends each to , then extends to an isomorphism of fields , which is the identity on , and sends each to : use (8) and (7) in Remark 2.5. This shows that is complete and eliminates quantifiers in .
Now we show stability of . Let , and let be the -substructure of generated by We may assume the -bases are chosen to contain a -basis of over extending . By Remark 2.5(6) (and (4)), there is a countable -substructure of containing and such that and is a -basis of . By elimination of quantifiers, is entirely determined by , and because and by Remark 2.5(2), there are at most such types. Thus the theory is stable. ∎
The proof of Theorem 2.2 is very similar. Again, we reformulate it in more precise terms:
Theorem 2.2. Let be a separably closed field in characteristic , and
a chain of subfields of containing . Furthermore, for let be an additive subgroup of which contains and is a vector space over , and which satisfies, in addition, the following two conditions:
-
(1)
,
-
(2)
Any subset of which is linearly independent over is -independent over .
Then the structure is stable, and the complete theory is given by the properties stated together with simple numerical invariants: the dimensions of both and over , as finite values or the formal symbol , and admits elimination of quantifiers in the associated language , with the predicates and the functions interpreted naturally.
Proof.
Let
be a substructure. As in the proof of Theorem 2.1, the sequence
is purely inseparable, and each is separable.
As usual, we suppose that the -structure is saturated, of cardinality , and that is another such model containing . By saturation, any of the invariants which are not finite take on the value in both of the -structures and .
The only change in what follows, relative to the proof of Theorem 2.1, will lie in the initial choice of -bases , and , so as to respect the additional structure.
Let be a -basis of over and extend it to a -basis of over . For , let be a -basis of over , such that is an -basis of the -vector space . Extend to a -basis of over in such a way that is a -basis of the -vector space ; this is possible because is a -basis of the purely inseparable extension of , so that are also purely inseparable extensions of . Choose -bases within similarly.
As in Theorem 2.1, if is a bijection for , then extends to an -isomorphism , which is an -isomorphism, and sends to for . This gives completeness of the theory and also quantifier elimination for this language because of the -functions and conditions (1) and (2) on .
The proof that the theory is stable goes much as before.
Let and be as above, with , and let . By Remark 2.5(4)(6), we know that there is some countable containing , closed under the -functions, containing a -basis of , and such that is closed under the -functions. By stability of , there is some countable substructure of , which is separably closed, and such that does not fork over , and enlarging we may assume that contains as an -substructure. There are possibilities for , and -many -formulas saying that does not fork over , so that there are at most types over . Thus the theory is stable. ∎
As an easy corollary, we obtain
Theorem 2.6.
Let satisfy the hypotheses of Theorem 2.2, and let be additive subgroups of , with a finite-dimensional -vector space contained in . Then the -structure
is stable.
Proof.
As the are finite dimensional over , both and are definable (with parameters) in the -structure . ∎
Lemma 2.7.
Let be a field in characteristic , and let
an increasing chain of additive subgroups of . Suppose that for all with , . Consider the structure
where the are given as subgroups of and
is the function .
Then there are -definable fields such that
and each is a vector space over .
Proof.
The element in is clearly definable, hence the -th power map is definable. The restriction of multiplication to is a partial binary operation defined by the relation
Define as the multiplicative stabilizer of under :
This is a definable subfield of which contains .
Note that the structure on induces the corresponding structure on all and hence we have definable subfields such that is a vector space over , where in addition contains if , and contains . Let . Then . ∎
From a model theoretic point of view, the reduced structure just on the is more convenient for interpretability results as the additional structure may then be treated as coming for free. Normally it would seem prudent, model theoretically, not to add undefinable structure to a given coordinate system. In practice, that can be either highly undesirable or extremely convenient. In the context of Theorem 2.2, adding undefinable fields is harmless, and also at times extremely convenient. We will see instances of the latter eventually (notably in the setting of rank 1 groups).
We conclude this section with some related questions. which concern the choice of the vector spaces in Theorem 2.2, which gives a good understanding of the most extreme case. Beyond that case, there may well be other natural theories of similar kinds.
Problems 2.8.
-
(1)
For and , choose -independent elements of over , and consider . Is stable? (The case is covered by Theorem 2.2).
Note that the union is definable in the specified language via the formula . This tends to suggest a level of complexity that may be incompatible with stability. In the structure as we have defined it, modulo all elements of have a finite “support” in the set (and in any model, elements with arbitrary finite supports will occur). Whether this translates concretely into definable complexity remains unclear.
-
(2)
Let be separably closed fields of infinite degree of imperfection, with . Let be a subset of consisting of elements -independent over , and set
Is stable?
Note that again a -saturated model will not be of the same form, even if .
3. Groups of mixed type according to Tits [Tits]; or a variation
In the present section we will discuss the groups of mixed type over pairs of fields in the spirit of [Tits] (with some slight variation). continuing on from the broad discussion in the introduction, § 1.1. In this context, by applying Theorem 2.1, we can identify some simple stable groups which are not algebraic but which one might reasonably call “algebraic over two intimately connected fields.” In Tits’ monograph the focus was on rank at least as far as classification is concerned, but the constructions make sense in rank , and in particular the case of was covered in [Tits, § 10.3, p. 205 (Remark)].
Definition 3.1.
Let be a Chevalley group associated with a root system with roots of two lengths: that is, type , , , or .
Fix a pair of fields satisfying
where if the type is , and otherwise.
For in the root system, define to be if is short and if is long.
Let be the group generated by the root subgroups . The notation indicates that we follow Tits’ construction of , but not exactly. The question is what part of the torus to take from and as we will see in Lemma LABEL:Lem:largetorus that there is some latitude in this respect in the case of , and more generally .
Remark 3.2.
The group has a BN-pair
where is generated by root subgroups for positive, is generated by the corresponding root tori, which can be defined as the intersection of the rank 1 group with , or more directly as the groups in the notation of [St, Lemma 19]. Then may be defined as (which normalizes and has as quotient the Weyl group of ). That it constitutes a BN-pair can be proved with the classical arguments, using the fact that can be written as a product of the root groups taken in any order, and that the result holds for subgroups of the type of and (treated more generally in § 1.1).
Theorem 3.3.
Suppose that is of adjoint type (centerless) and split over Then for , the group is simple.
Proof.
We use the Tits simplicity criterion for groups with a BN-pair, as can be found in § 29 of [Hum], see in particular Theorem 29.5.
Since our groups have BN-pairs, it suffices to check the following points:
-
(a)
is solvable and centerless.
-
(b)
The set of generators of corresponding to the simple roots does not deompose into a union of disjoint, nontrivial, commuting subsets.
-
(c)
contains no nontrivial normal subgroup of the full group .
-
(d)
is perfect.
Of these four points, the first is the clear, and the second is a basic fact about the classification of the associated root systems. In terms of the usual Dynkin diagram representation it means the diagram is connected. (In the rank two case with which we will be principally concerned, it means that the two simple roots are nonorthogonal—so that the corresponding generators of the Weyl group do not commute.)
The third point may be argued as follows: The group has a conjugate for which , so any normal subgroup of the full group contained in would be contained in . Then and centralizes , forcing as the torus acts faithfully on . This last point depends on the fact that the group has no center.
The proof that the group is perfect reduces to the condition for the root subgroups , since the root groups generate the full group. This computation can take place in the rank group , which is or over one of the fields or . Here we may work concretely with the group of strictly upper triangular matrices in and the group of diagonal matrices.
Writing for
and for the diagonal matrix with entries , we have the commutator law
Now we have only to choose so that is nonzero and to get the general element of as a commutator. ∎
We gave the final computation explicitly as it will serve again in the more general setting of Timmesfeld’s rank one groups, below.
Remark 3.4.
There are no exceptions over , in fact, though for what one might call accidental reasons. Over , our definitions only allow one group, the algebraic group , and it is simple, for reasons like the ones we give but more delicate [St, Lemma 32].
Of these, types and recur below in the context of Moufang polygons (Moufang hexagons and Moufang quadrangles, respectively). Type is a particular case of the class of Moufang quadrangles said to be of indifferent type. As we will see, in a fairly precise sense, the class of groups associated to Moufang polygons of indifferent type is related to the narrower class of groups in exactly the way that Timmesfeld’s groups are related to the usual groups over fields.
From our point of view the interest of these groups lie in the following:
Theorem 3.5.
Suppose is simple of type of type , , , or and is a pair of fields with
and the appropriate characteristic ( for type , and otherwise).
Then the following hold:
-
(1)
If the pair of fields is a stable structure, then the groups and are stable.
-
(2)
If is separably closed then and are stable groups.
We will look into this in a sharper form for the case of in § 6. Clearly (1) relates to a couple of claims about interpretability and (2) then follows via Theorem 2.1. But we give a proof of this form, in general.
It is also important to note that we set out with the expectation that something similar would occur in the analogous cases (at greater generality) in ranks 1 and 2, particularly in view of Theorem 2.2, but this is not the case: as already explained in the introduction, things become more subtle in rank 1 and then in rank 2 they remain equally subtle (but no worse).
Proof.
In view of Theorem 2.1 it suffices to prove the first point. For that we will use some coarse definability arguments. One should perhaps prove a bi-interpretability result characterizing definability exactly but it is not necessary for our purposes.
To show that the group is definable from the coordinate system (in first order terms) we work inside the algebraic group , which is certainly definable. It suffices to show that the underlying sets of and are also definable, in the coordinate system , as the group multiplication is inherited.
In view of the Bruhat decompositions
with varying over a finite set of representatives, it suffices to show that and are definable.
Relative to the extended coordinate system the root groups are definable (parametrized by one of the fields). The group is the product (in any order) of its root subgroups, so it is definable.
The root tori that generate are root tori of or , hence definable in the coordinate system. So is definable and is definable. The torus is a definable subgroup of (in the pair of fields ), so is definable.∎
Remark 3.6.
It turns out that the condition given in Theorem 3.5(1) is also necessary: one interprets the pair of fields in and in , using the commutator relations. We will give the precise computations in two cases, see Theorem LABEL:bidef:G2UKk for , and Theorem LABEL:Thm:C2:U(k,K) for . (In fact, in the case of we even prove outright definability.)
Theorem 3.5 sets out the model for what we try to do in this paper. This turns out to be more demanding than we initially expected. Theorem 2.2 prepares the ground by making an ample supply of some coordinate systems needed to generalize Tits’ construction in rank 2, but the definability issues are more severe as well. Namely, when one defines a group as “the group generated by” something, and the coordinate system defines the generators, then the algebraist may be reasonably happy with that (particularly if a Bruhat decomposition results, and one can tell from that what group one has), but the model theorist needs to worry about the definability of the constituents of the Bruhat decomposition as well. One might reasonably object that if we had followed Tits we would also have defined not only the subgroup but the torus and the group as well, from the coordinate system, and the issue would disappear. We will see next why this is clearly not the case when we take up Timmesfeld’s construction in rank 1, and then we will see why the difficulties that appear in rank 1 reappear in rank 2. The only reason they do not appear in higher ranks is that the coordinate systems that appear in higher rank are of a particularly simple type, and in particular the only rank 1 groups that occur in that construction are , , and .
4. The rank 1 case according to Timmesfeld [Ti]
Timmesfeld presents a very general theory of groups generated by abstract root groups which includes the automorphism groups of most Moufang buildings, and starts off in rank 1 in what amounts to the study of split BN-pairs of rank 1 from another point of view. In particular, even the more exotic rank 1 groups arising as groups generated by pairs of opposite root groups in the context of Moufang buildings are captured by his theory. We are interested in the ones which arise in the specific case of Moufang quadrangles of indifferent type, which we will come to in the next section. In that case, we arrive at the particular rank 1 groups with which Timmesfeld begins his discussion in [Ti], namely his Example 1.5, as specialized further in [Ti, Example 1.6 (2), p. 6].
In the presentation below, we begin with the explicit definition, but work out in detail the standard calculations in the manner of Chevalley or [St], in their minimalist form ( matrices). These calculations are identical to the usual calculations in , but we must pay close attention to where the entries of the matrices lie—and, in particular, which diagonal matrices are actually obtained in Timmesfeld’s setting, and whether or not that set is first order definable from the initial data.
Definition 4.1 ( according to Timmesfeld).
We begin with an imperfect field of characteristic and an additive subgroup satisfying
where is a vector space over . We then define the group to be the subgroup of generated by upper and lower unitriangular matrices in with coefficients in .
That is, we have the “root groups” , consisting of the elementary matrices
respectively, with . And we consider the group .
There is a good deal to be said about the group . Our main concern is with a criterion for stability, which naturally leads us to consider related definability issues, notable the definability of the subgroup of diagonal matrices. This last issue turns out to recur substantially, afterward, in our discussion of rank 2 groups, as some of them contain Timmesfeld’s groups. And for that matter, it is implicit in our treatment of Tits’ construction (where we avoided beginning with a description of the torus), though in that construction the rank 1 tori involved were just the multiplicative groups of the two fields . Here things become more delicate.
We begin with the Bruhat decomposition. As a point of notation, we will denote by the set of non-zero elements of the additive group . We make elementary calculations but keep track particularly of the diagonal matrices that appear.
Theorem 4.2.
Let be an imperfect field of characteristic and an additive subgroup satisfying
where is a vector space over . Let be the subgroup of with coordinates in the multiplicative subgroup of generated by . Let and .
Then we have the Bruhat decomposition
In particular, is the group of upper unitriangular matrices in , and is the diagonal subgroup.
Furthermore, is simple.
Proof.
Given any in , there is a unique such that , and we write ; then . (Even though we are in characteristic , we use the minus sign since the computation works in any characteristic). With , we find that
is an element of the Weyl group of , and that the elements are diagonal matrices in of the form , for .
It follows that the subgroup of diagonal elements of contains all elements of the group . From the formula , we deduce that
so that , and in fact,
Now we check that these calculations give
by formal manipulations, as in the case of fields.
On the one hand, we know that both and the element lie in , so the inclusion from right to left holds. In the opposite direction it suffices to check that the right hand side is closed under multiplication by and , which is obvious for . Hence for it suffices to check closure under multiplication by , which reduces to the following relations:
It now follows that is the full diagonal subgroup of and that is the full subgroup of upper unitriangular matrices of , since this is clear in the case of the subgroup , and the double coset is disjoint from it.
For the simplicity of the group we use the BN-pair and follow the line of [Tits-BN, (16), p. 323]. We first show that the group is perfect for . It suffices to show that is contained in the commutator subgroup, since then the conjugate is also contained in the commutator subgroup, and these two groups generate .
We claim in fact that . We have
which for fixed and not equal to or represents a general element of . The claim follows.
Now consider a normal subgroup of .
If is contained in then is contained in the conjugate and hence in the intersection, which is the group of diagonal matrices in . We then have , so is in , that is, is trivial.
So suppose now is not contained in . Then the group contains properly, and is a union of double cosets, so by the Bruhat decomposition ; hence the quotient is isomorphic to a quotient of , and in particular is solvable. On the other hand as is perfect the quotient is also perfect, and a perfect solvable group is trivial. So in this case .
Thus is simple. For a statement from a broader point of view see [Ti, I (2.10)]. ∎
One should notice at this point that the torus is likely to be undefinable in any natural language (at least, a priori; this is an interesting question in itself). Accordingly, even if the structure is stable we run the risk that the group is not. But there is a closely related group which is definable from the coordinate system, and has as its commutator subgroup: namely, the normalizer of in . So we examine this.
4.3.
The normalizer of
For the present we fix the notation as in Timmesfeld’s setting and consider within .
Remark 4.4.
The full diagonal subgroup of normalizes , and the group has the Bruhat decomposition
with .
The point here is that diagonal matrices act on and on by multiplication by , so leaves and invariant. Then the Bruhat decomposition for gives the Bruhat decomposition for .
The interest of this group is that it is definable over in view of the Bruhat decomposition, and its commutator subgroup is since is abelian. Thus we have a definable stable group with simple commutator subgroup associated to any stable coordinate system ; this depends intrinsically on as well as , though it would be very natural to take for the field generated by to get a more canonical construction (in similar settings in rank 2, this is actually part of the standard approach).
We note that in our definition of Tits’ groups we preferred to follow Timmesfeld, and rather than defining a torus in advance, let it be computed in the group generated by root subgroups. As the coordinate system used was a pair of fields, the rank 1 subgroups and appearing there were not problematic. But we will need to keep these extra complications—and the need in some cases to sacrifice simplicity for definability—firmly in mind going forward.
Lemma 4.5.
In Timmesfeld’s setting, the normalizer in of is .
Proof.
We work first in . Let denote the full subgroup of diagonal matrices. This also normalizes . It suffices to check that the normalizer in of is . We have noticed that the normalizer contains this group.
Let belong to the normalizer of in . If normalizes then it lies in the Borel subgroup of (where is the full set of strictly upper triangular matrices). Hence after multiplying by an element of we may suppose , and write for . In that case consider . Since and lie in the normalizer of , the group is also contained in the normalizer of . But is a simple group, so we find these two groups are equal and .
If normalizes then normalizes and we conclude similarly.
So suppose . As the torus acts transitively on the root groups of (which correspond to the points of the projective line other than , ) we may adjust by and suppose that is conjugated into a root group of the form where . But then adjusting by this element of we may again take to normalize , and conclude as before. ∎
Thus the family of groups normalizing in is parametrized by the family of groups lying between and . We would like to take to be definable in , ideally, but we would be perfectly happy as long as is stable. Here is to be taken either as an abstract multiplicative group with an action on (corresponding to the action on in ), or as the image of the action in , or more concretely as the multiplicative subgroup of whose action on is given by multiplication. Note that in the second interpretation the action of is multiplication by and in the third interpretation the multiplicative subgroup is actually the corresponding subgroup of .
An attractive choice for the intermediate torus is the multiplicative group of the field generated by . This will often not be definable in , but we can work equally well with . And there are good chances that will be equal to in concrete cases; this leads to interesting questions.
Again: the choice of gives a simple group; the choice of gives a group definable in the original structure with as commutator subgroup; and the choice gives a group which in general is not definable in , but is definable in ; and if Theorem 2.2 applies to , it will also apply to . And as always, what we encounter here recurs in much the same form in rank 2.
We formalize the foregoing discussion further as follows:
Theorem 4.6.
Let be an imperfect field of characteristic and an additive subgroup satisfying
where is a vector space over . Let be a group lying between the group and the group . Let be
Let .
Then the following hold:
-
(1)
The group is definable in the structure , where is the multiplication map on , and is the squaring map.
-
(2)
Conversely, this structure is definable in .
Proof.
1. One builds the group from , , and the action. One then builds the group as
since normalizes . On the right side elements are uniquely represented either by pairs in or by triples in (since ). Multiplication on this set is then determined by multiplication in and multiplication by on the right. This is trivial for the map from to and in the case of it reduces to the expression of in terms of the Bruhat decomposition, given in the proof of Theorem 4.2 as
We may set aside the minus signs as superfluous. We need the operation of multiplicative inversion on , which comes from squaring followed by the action of on , and the coordinate of in , which is .
2. and are, respectively, the centralizers in of any of their nontrivial elements. So gives and and the action of on . This gives as a subset of .
The element allows us to define the function used in the proof of Theorem 4.2 to compute the map from in to in . Thus we have the map from to multiplication by on . This then gives both the set as a subset of , and its action on by multiplication. That is, is the image of under , the image of under the corresponding element of is , and the squaring map is given by . ∎
Corollary 4.7.
A group of the form in Timmesfeld’s setting is stable if and only if the coordinatizing structure
is stable.
Now let us give a coordinatization that looks more normal from the algebraic point of view.
Theorem 4.8.
Let be an imperfect field of characteristic , an additive subgroup of , and a multiplicative subgroup of which contains . Suppose that . Then the structure
in which gives the multiplication on and gives the squaring map from to , is bi-interpretable with a structure
where is a field satisfying Timmesfeld’s conditions:
and .
Proof.
We have the multiplication on and the squaring map to . The restriction of multiplication from to is given, as a partial function, by iff .
Let be the multiplicative stabilizer of in :
This is definable from and is a field containing . Let be with its field structure, taken as an isomorphic copy of with an embedding corresponding to the squaring map to . We then have the structure
with the multiplication on inducing the remaining structure. ∎
Corollary 4.9.
In the (slightly generalized) Timmesfeld setting, the following are equivalent:
-
(1)
is stable.
-
(2)
The structure with , the multiplication on , and the squaring map from to , is stable.
-
(3)
The structure , with as above is stable.
In the last clause, note also that the group in the sense of is also in the sense of )
One can do something quite similar in the rank indifferent case, in principle; namely there will be two rank 1 groups of Timmesfeld type and the condition is that both are stable (i.e., both exist within a single stable structure).
Let us come back now to the case of , and consider the problem of stability. This raises interesting questions of model theoretic algebra. We are considering structures
where is the subgroup of generated by , given as an additional element of structure. By proper choice of , the problem of stability for becomes the problem of stability for structures of this kind. If is definable in there is no difficulty (Theorem 2.2). If happens to be the multiplicative group of the field generated by , then we may take the ambient field to be , apply Theorem 2.2 to that, and in this way force to be definable. It is not yet clear how often that is the case. So the questions are of two sorts: when is in fact the multiplicative group of a field, and in general, when is the expanded structure stable?
Lemma 4.10.
Let be an imperfect field of characteristic , and an additive subgroup of with
and a vector space over . Suppose in addition that contains a subfield of codimension (as a vector space over ). Then is the multiplicative group of the field generated by , and every element of is the product of two elements of .
Proof.
We write for some , with a field.
Then generates the field . The claim follows. ∎
Note therefore that we can always make definable, in this setting by including the field in the coordinate system. In the context of Theorem 2.2, the theorem will continue to apply.
In particular, we have the following:
Corollary 4.11.
Let be an imperfect field of characteristic , with . Let be -independent elements of , and consider . Then every nonzero element of is the product of 2 elements of , and therefore is definable in .
Proof.
As is a subfield of of codimension , Lemma 4.10 applies. ∎
Issues of stability in the groups have led us to consider issues of definability in the underlying coordinate systems. It is clear that the field generated by plays a special role here. One has in general the question of definability of in some particular coordinate system, but when working in the context of separably closed fields, which is the only concrete case known currently, we have observed that this field should be added to the coordinate system and one should consider the issue of definability of in the extended coordinate system, and in particular the question as to whether always coincides with the multiplicative group of , a question which reduces to the case of finite dimensional over .
Question 4.12.
Let be an imperfect field of characteristic , and an additive subgroup of containing which is a vector space over .
-
(1)
Is the multiplicative group of ? Does this hold at least if is separably closed?
-
(2)
If this is not the case, and the field is separably closed, is it possible for to be definable in nonetheless?
-
(3)
Can be stable when is not stable?
The first question, restricted to the case of separably closed, is the main question at present. In the event of a negative solution, the second question should be taken as the natural refinement. Finally, in a situation in which is not definable in any structure covered by Theorem 2.2, the third question remains. This is not strictly a group theoretic question but a question about extending Theorem 2.2 to include certain multiplicative subgroups as well as additive subgroups, which seems very difficult.
Since question (1) reduces to the finite dimensional case and one can in principle make detailed computations in that case, it would be of interest to take up the minimal open cases, in which has dimension over , or more generally, where contains a subfield of codimension . This seems accessible.
5. The rank 2 case: Automorphism groups of Moufang polygons
Our own introduction to this subject came via the elegant work of Tits and Weiss in [TW] concerning certain rank groups (or rather, the geometries on which they act). So now we come, finally, to what was our point of departure. In practice we will focus on two of the cases which they consider, where the results of Theorem 2.2, or the special case Theorem 2.1, are directly applicable. In one case the group considered is the group already considered by Tits (though we give it a slightly different definition, one should bear in mind). In the other case it is a substantial generalization of the Tits group of type in which the pair of fields used by Tits is replaced by a pair of suitably chosen abelian subgroups of fields in the manner of Timmesfeld.
Here we run over the point of view of [TW], though as we find the groups easier to work with as subgroups of algebraic groups, we will adopt Tits’ point of view for the more concrete discussions afterward. So this section indicates only how these groups were originally identified, within the scope of a broad classification project (a project initially proposed in [Tits] in a remark toward the end of the monograph).
The subject of [Tits] is the theory of buildings, the geometries on which simple algebraic groups, classical groups, and some other groups act naturally; a classification is given in dimension at least , which can be taken as a classification of the corresponding groups. These geometries generalize projective geometry, and just as high dimensional projective geometries satisfy the Desargues condition and can then be classified, all the higher dimensional buildings satisfy a related Moufang condition, and are thus called Moufang buildings. Tits proposed the problem of classifying all Moufang buildings in dimension or higher; or more specifically, classifying them in dimension specifically and then reducing the higher dimensional classification to that one. The project is carried through in [TW], with some surprises along the way.
In rank 2 the Moufang buildings are called Moufang polygons. They are combinatorial point-line geometries which are naturally represented as bipartite graphs where the parts are the points and lines, and the edge relation is incidence. One may also interpret the same graph with the points taken as lines and the lines taken as points, which would be treated as a dual geometry. Accordingly the automorphisms are taken to leave the points and lines invariant, and any graph automorphism which switches the parts would be called an anti-automorphism in the geometric terminology. Tits and Weiss consider in great detail the structure of the geometric automorphism group of a Moufang polygon and in particular a certain subgroup which is almost always simple and which includes the usual Chevalley groups along with many other groups with a very similar structure. In particular, the theory begins with a definition of root subgroups directly in terms of the action of the automorphism group on the graph, and is by definition the subgroup generated by a certain family of root groups (those associated with the vertices of an“apartment”, which is a cycle of minimal length in the graph).
As in the case of Chevalley groups, one may define a “maximal unipotent” subgroup generated by half of the root groups (taking a path which covers half of the cycle), which turns out to be a nilpotent group generated with the root groups as generators and a generalized Chevalley commutator formula as defining relations. We will consider some cases in which these commutator relations are the ones realized in some Chevalley groups.
Namely, we consider the Moufang hexagons which correspond to type and more specifically to the groups , and then the richer family of Moufang quadrangles of indifferent type which correspond to type , and are realized in (or since we work in characteristic ). In general, the polygon is called an -gon if the shortest cycle length is : geometrically an -gon has points and lines and forms a cycle of length in the incidence graph. In particular the group is the (noncommuting) product of root groups in a Moufang -gon.
The main result of [TW] is a classification theorem for Moufang polygons. Accordingly the various things known about Chevalley groups must not only be generalized, but proved in detail from first principles in a combinatorial setting. This complicates matters relative to the theory of Chevalley groups or algebraic groups, where the main facts are proved algebraically and may even be taken as belonging in part to the initial definition of the group (as in [St]).
But in addition to this, [TW] contains detailed studies of the automorphism groups in all of the cases identified in the classification theorem, including that of the (mostly) simple group as well as the full automorphism group and the quotient , which can be viewed as a group of automorphisms of . One of the main results of this analysis is the BN-pair structure for all of the groups between and . As we have seen in the case of Timmesfeld’s groups , we have reasons to consider larger groups than from the point of view of definability—though we will set aside the portion of which corresponds to nontrivial automorphisms of the coordinate system, which is not useful from the point of view of first order definability, and which does not appear in the corresponding algebraic group (when there is one).
Remark 5.1.
A very general lemma of [TW, (7.5)] states that a Moufang polygon is uniquely determined by the associated automorphism group and its sequence of root subgroups . In particular the Chevalley commutator formula in determines the group .
We now describe the groups corresponding to the coordinate systems of indifferent type, which generalize Timmesfeld’s systems .
Definition 5.2.
A weak indifferent set is a triple , where is a field of characteristic , and are additive subgroups of for which
is a vector space over , and is a vector space over the field generated by .
If a weak indifferent set satisfies the additional constraint that the field is generated by the set then it is called an indifferent set.
It is customary to use indifferent sets in the strong sense in the literature, and we are introducing the terminology weak indifferent set here to emphasize the variation. The distinction is not very significant from an algebraic perspective as there would be no harm in replacing the large field in a weak indifferent set by the field generated by . However, from a model theoretic point of view, the notion of weak indifferent set is axiomatizable, and the notion of indifferent set is not, so there is some advantage to allowing the broader notion into the formalism. It does not create any new examples of groups, however.
It is tempting to call a weak indifferent set an indifferent pair (even though it is a triple) because the groups play the roles previously played by the pair of fields in mixed type groups.
Definition 5.3.
Let be a weak indifferent set. Then
is the subgroup of generated by the subgroups for a short root, and by for a long root. We call these groups the root subgroups of (which will require a little justification).
Remark 5.4.
The group is defined by analogy with , replacing the pair by an indifferent set. As such it should more properly be denoted
and we may make use of that heavier notation if the point requires emphasis.
We have also identified a suitable torus in the verification of the BN-pair condition and the Bruhat property, and so we could also have followed the route taken by Tits in defining . But in any case it is important to us (and to [TW]) that this group is generated by its root subgroups.
There is some pathology in this construction, inherited from the rank 1 case, which will require close attention to the torus that appears in , and to other tori that normalize this group.
The definition of weak indifferent pair ensures that this group has more or less the same properties as where and , are fields. We recall the relevant properties now.
First, the Chevalley commutator formula makes sense: that is, for positive roots , , and writing , for the root groups relative to or (as specified), the formula giving coordinates of elements of in the root groups of lie in the corresponding root groups of . However: this only works because in the special characteristics we consider, some terms in the general Chevalley commutator formula vanish, and the corresponding entries do not occur. So this actually is what makes everything work.
At the same time, the rank 1 groups become or in the sense of Timmesfeld.
One gets the BN-pair property, the corresponding Bruhat decomposition, and simplicity as previously. The computations we made in rank 1 close the gap between the usual and the Timmesfeld variations, and the rest of the argument for the BN-pair is formal, modulo the rank 1 case.
Notice also that lies between and .
Lemma 5.5.
The groups and are simple (for , ).
Proof.
We use the Tits simplicity criterion for groups with a BN-pair, as can be found in § 29 of [Hum], see in particular Theorem 29.5. Since our groups have BN-pairs, it suffices to check the following points: (a) is solvable and centerless; (b) the set of generators of corresponding to the simple roots does not decompose into a union of disjoint, nontrivial, commuting subsets; (c) contains no nontrivial normal subgroup of the full group ; and (d) is perfect.
Of these four points, the first two are clear since there are only two simple roots and the corresponding reflections do not commute ( is a dihedral group of order greater than ). The other two points were noticed in the proof of the rank 1 case (Theorem 4.2), and the proofs given there continue to work. We repeat the main points. The group has a conjugate for which , so any normal subgroup of the full group contained in would be contained in , after which it follows easily that it centralizes , hence lies in , hence is trivial. The proof that the group is perfect reduces to the condition for the root subgroups , which is already shown in the rank case.∎
Lemma 5.6.
The groups and are the groups of [TW] corresponding to the Moufang hexagons of type and the Moufang quadrangles of indifferent type in the sense of [TW].
Proof.
We suppose the field . By [DemT, Thm. 6.1], if is the universal Steinberg group with the same presentation as then is simple.
Since and the groups of type or indifferent type are generated by root groups satisfying the same relations, both are homomorphic images of . Furthermore both groups are simple by Theorem 3.3, Remark 3.4, Lemma LABEL:lem:Gdag=G2kK and [TW, (37.3)]. So the kernel in both cases is and the two quotients are isomorphic. ∎