In mathematics, many sets of transformations form a group under function composition; for example, the rotations around a point in the plane. It is often useful to consider the group as an abstract group, and to say that one has a group action of the abstract group that consists of performing the transformations of the group of transformations. The reason for distinguishing the group from the transformations is that, generally, a group of transformations of a structure acts also on various related structures; for example, the above rotation group acts also on triangles by transforming triangles into triangles.
Formally, a group action of a group G on a set S is a group homomorphism from G to some group (under function composition) of functions from S to itself.
If a group acts on a structure, it will usually also act on objects built from that structure. For example, the group of Euclidean isometries acts on Euclidean space and also on the figures drawn in it; in particular, it acts on the set of all triangles. Similarly, the group of symmetries of a polyhedron acts on the vertices, the edges, and the faces of the polyhedron.
A group action on a vector space is called a representation of the group. In the case of a finite-dimensional vector space, it allows one to identify many groups with subgroups of the general linear group GL(n, K), the group of the invertible matrices of dimension n over a field K.
The symmetric group Sn acts on any set with n elements by permuting the elements of the set. Although the group of all permutations of a set depends formally on the set, the concept of group action allows one to consider a single group for studying the permutations of all sets with the same cardinality.
If G is a group with identity element e, and X is a set, then a (left) group action α of G on X is a function
that satisfies the following two axioms:[1]
Identity: | |
Compatibility: |
for all g and h in G and all x in X.
The group G is then said to act on X (from the left). A set X together with an action of G is called a (left) G-set.
It can be notationally convenient to curry the action α, so that, instead, one has a collection of transformations αg : X → X, with one transformation αg for each group element g ∈ G. The identity and compatibility relations then read
and
with ∘ being function composition. The second axiom then states that the function composition is compatible with the group multiplication; they form a commutative diagram. This axiom can be shortened even further, and written as αg ∘ αh = αgh.
With the above understanding, it is very common to avoid writing α entirely, and to replace it with either a dot, or with nothing at all. Thus, α(g, x) can be shortened to g⋅x or gx, especially when the action is clear from context. The axioms are then
From these two axioms, it follows that for any fixed g in G, the function from X to itself which maps x to g⋅x is a bijection, with inverse bijection the corresponding map for g−1. Therefore, one may equivalently define a group action of G on X as a group homomorphism from G into the symmetric group Sym(X) of all bijections from X to itself.[2]
Likewise, a right group action of G on X is a function
that satisfies the analogous axioms:[3]
Identity: | |
Compatibility: |
(with α(x, g) often shortened to xg or x⋅g when the action being considered is clear from context)
Identity: | |
Compatibility: |
for all g and h in G and all x in X.
The difference between left and right actions is in the order in which a product gh acts on x. For a left action, h acts first, followed by g second. For a right action, g acts first, followed by h second. Because of the formula (gh)−1 = h−1g−1, a left action can be constructed from a right action by composing with the inverse operation of the group. Also, a right action of a group G on X can be considered as a left action of its opposite group Gop on X.
Thus, for establishing general properties of group actions, it suffices to consider only left actions. However, there are cases where this is not possible. For example, the multiplication of a group induces both a left action and a right action on the group itself—multiplication on the left and on the right, respectively.
Let G be a group acting on a set X. The action is called faithful or effective if g⋅x = x for all x ∈ X implies that g = eG. Equivalently, the homomorphism from G to the group of bijections of X corresponding to the action is injective.
The action is called free (or semiregular or fixed-point free) if the statement that g⋅x = x for some x ∈ X already implies that g = eG. In other words, no non-trivial element of G fixes a point of X. This is a much stronger property than faithfulness.
For example, the action of any group on itself by left multiplication is free. This observation implies Cayley's theorem that any group can be embedded in a symmetric group (which is infinite when the group is). A finite group may act faithfully on a set of size much smaller than its cardinality (however such an action cannot be free). For instance the abelian 2-group (Z / 2Z)n (of cardinality 2n) acts faithfully on a set of size 2n. This is not always the case, for example the cyclic group Z / 2nZ cannot act faithfully on a set of size less than 2n.
In general the smallest set on which a faithful action can be defined can vary greatly for groups of the same size. For example, three groups of size 120 are the symmetric group S5, the icosahedral group A5 × Z / 2Z and the cyclic group Z / 120Z. The smallest sets on which faithful actions can be defined for these groups are of size 5, 7, and 16 respectively.
The action of G on X is called transitive if for any two points x, y ∈ X there exists a g ∈ G so that g ⋅ x = y.
The action is simply transitive (or sharply transitive, or regular) if it is both transitive and free. This means that given x, y ∈ X the element g in the definition of transitivity is unique. If X is acted upon simply transitively by a group G then it is called a principal homogeneous space for G or a G-torsor.
For an integer n ≥ 1, the action is n-transitive if X has at least n elements, and for any pair of n-tuples (x1, ..., xn), (y1, ..., yn) ∈ Xn with pairwise distinct entries (that is xi ≠ xj, yi ≠ yj when i ≠ j) there exists a g ∈ G such that g⋅xi = yi for i = 1, ..., n. In other words the action on the subset of Xn of tuples without repeated entries is transitive. For n = 2, 3 this is often called double, respectively triple, transitivity. The class of 2-transitive groups (that is, subgroups of a finite symmetric group whose action is 2-transitive) and more generally multiply transitive groups is well-studied in finite group theory.
An action is sharply n-transitive when the action on tuples without repeated entries in Xn is sharply transitive.
The action of the symmetric group of X is transitive, in fact n-transitive for any n up to the cardinality of X. If X has cardinality n, the action of the alternating group is (n − 2)-transitive but not (n − 1)-transitive.
The action of the general linear group of a vector space V on the set V ∖ {0} of non-zero vectors is transitive, but not 2-transitive (similarly for the action of the special linear group if the dimension of v is at least 2). The action of the orthogonal group of a Euclidean space is not transitive on nonzero vectors but it is on the unit sphere.
The action of G on X is called primitive if there is no partition of X preserved by all elements of G apart from the trivial partitions (the partition in a single piece and its dual, the partition into singletons).
Assume that X is a topological space and the action of G is by homeomorphisms.
The action is wandering if every x ∈ X has a neighbourhood U such that there are only finitely many g ∈ G with g⋅U ∩ U ≠ ∅.[4]
More generally, a point x ∈ X is called a point of discontinuity for the action of G if there is an open subset U ∋ x such that there are only finitely many g ∈ G with g⋅U ∩ U ≠ ∅. The domain of discontinuity of the action is the set of all points of discontinuity. Equivalently it is the largest G-stable open subset Ω ⊂ X such that the action of G on Ω is wandering.[5] In a dynamical context this is also called a wandering set.
The action is properly discontinuous if for every compact subset K ⊂ X there are only finitely many g ∈ G such that g⋅K ∩ K ≠ ∅. This is strictly stronger than wandering; for instance the action of Z on R2 ∖ {(0, 0)} given by n⋅(x, y) = (2nx, 2−ny) is wandering and free but not properly discontinuous.[6]
The action by deck transformations of the fundamental group of a locally simply connected space on an covering space is wandering and free. Such actions can be characterized by the following property: every x ∈ X has a neighbourhood U such that g⋅U ∩ U = ∅ for every g ∈ G ∖ {eG}.[7] Actions with this property are sometimes called freely discontinuous, and the largest subset on which the action is freely discontinuous is then called the free regular set.[8]
An action of a group G on a locally compact space X is called cocompact if there exists a compact subset A ⊂ X such that X = G ⋅ A. For a properly discontinuous action, cocompactness is equivalent to compactness of the quotient space G \ X.
Now assume G is a topological group and X a topological space on which it acts by homeomorphisms. The action is said to be continuous if the map G × X → X is continuous for the product topology.
The action is said to be proper if the map G × X → X × X defined by (g, x) ↦ (x, g⋅x) is proper.[9] This means that given compact sets K, K′ the set of g ∈ G such that g⋅K ∩ K′ ≠ ∅ is compact. In particular, this is equivalent to proper discontinuity G is a discrete group.
It is said to be locally free if there exists a neighbourhood U of eG such that g⋅x ≠ x for all x ∈ X and g ∈ U ∖ {eG}.
The action is said to be strongly continuous if the orbital map g ↦ g⋅x is continuous for every x ∈ X. Contrary to what the name suggests, this is a weaker property than continuity of the action.[citation needed]
If G is a Lie group and X a differentiable manifold, then the subspace of smooth points for the action is the set of points x ∈ X such that the map g ↦ g⋅x is smooth. There is a well-developed theory of Lie group actions, i.e. action which are smooth on the whole space.
If g acts by linear transformations on a module over a commutative ring, the action is said to be irreducible if there are no proper nonzero g-invariant submodules. It is said to be semisimple if it decomposes as a direct sum of irreducible actions.
Consider a group G acting on a set X. The orbit of an element x in X is the set of elements in X to which x can be moved by the elements of G. The orbit of x is denoted by G⋅x:
The defining properties of a group guarantee that the set of orbits of (points x in) X under the action of G form a partition of X. The associated equivalence relation is defined by saying x ~ y if and only if there exists a g in G with g⋅x = y. The orbits are then the equivalence classes under this relation; two elements x and y are equivalent if and only if their orbits are the same, that is, G⋅x = G⋅y.
The group action is transitive if and only if it has exactly one orbit, that is, if there exists x in X with G⋅x = X. This is the case if and only if G⋅x = X for all x in X (given that X is non-empty).
The set of all orbits of X under the action of G is written as X / G (or, less frequently, as G \ X), and is called the quotient of the action. In geometric situations it may be called the orbit space, while in algebraic situations it may be called the space of coinvariants, and written XG, by contrast with the invariants (fixed points), denoted XG: the coinvariants are a quotient while the invariants are a subset. The coinvariant terminology and notation are used particularly in group cohomology and group homology, which use the same superscript/subscript convention.
If Y is a subset of X, then G⋅Y denotes the set {g⋅y : g ∈ G and y ∈ Y}. The subset Y is said to be invariant under G if G⋅Y = Y (which is equivalent G⋅Y ⊆ Y). In that case, G also operates on Y by restricting the action to Y. The subset Y is called fixed under G if g⋅y = y for all g in G and all y in Y. Every subset that is fixed under G is also invariant under G, but not conversely.
Every orbit is an invariant subset of X on which G acts transitively. Conversely, any invariant subset of X is a union of orbits. The action of G on X is transitive if and only if all elements are equivalent, meaning that there is only one orbit.
A G-invariant element of X is x ∈ X such that g⋅x = x for all g ∈ G. The set of all such x is denoted XG and called the G-invariants of X. When X is a G-module, XG is the zeroth cohomology group of G with coefficients in X, and the higher cohomology groups are the derived functors of the functor of G-invariants.
Given g in G and x in X with g⋅x = x, it is said that "x is a fixed point of g" or that "g fixes x". For every x in X, the stabilizer subgroup of G with respect to x (also called the isotropy group or little group[10]) is the set of all elements in G that fix x: This is a subgroup of G, though typically not a normal one. The action of G on X is free if and only if all stabilizers are trivial. The kernel N of the homomorphism with the symmetric group, G → Sym(X), is given by the intersection of the stabilizers Gx for all x in X. If N is trivial, the action is said to be faithful (or effective).
Let x and y be two elements in X, and let g be a group element such that y = g⋅x. Then the two stabilizer groups Gx and Gy are related by Gy = gGxg−1. Proof: by definition, h ∈ Gy if and only if h⋅(g⋅x) = g⋅x. Applying g−1 to both sides of this equality yields (g−1hg)⋅x = x; that is, g−1hg ∈ Gx. An opposite inclusion follows similarly by taking h ∈ Gx and x = g−1⋅y.
The above says that the stabilizers of elements in the same orbit are conjugate to each other. Thus, to each orbit, we can associate a conjugacy class of a subgroup of G (that is, the set of all conjugates of the subgroup). Let (H) denote the conjugacy class of H. Then the orbit O has type (H) if the stabilizer Gx of some/any x in O belongs to (H). A maximal orbit type is often called a principal orbit type.
Orbits and stabilizers are closely related. For a fixed x in X, consider the map f : G → X given by g ↦ g⋅x. By definition the image f(G) of this map is the orbit G⋅x. The condition for two elements to have the same image is In other words, f(g) = f(h) if and only if g and h lie in the same coset for the stabilizer subgroup Gx. Thus, the fiber f−1({y}) of f over any y in G⋅x is contained in such a coset, and every such coset also occurs as a fiber. Therefore f induces a bijection between the set G / Gx of cosets for the stabilizer subgroup and the orbit G⋅x, which sends gGx ↦ g⋅x.[11] This result is known as the orbit-stabilizer theorem.
If G is finite then the orbit-stabilizer theorem, together with Lagrange's theorem, gives in other words the length of the orbit of x times the order of its stabilizer is the order of the group. In particular that implies that the orbit length is a divisor of the group order.
This result is especially useful since it can be employed for counting arguments (typically in situations where X is finite as well).
A result closely related to the orbit-stabilizer theorem is Burnside's lemma: where Xg is the set of points fixed by g. This result is mainly of use when G and X are finite, when it can be interpreted as follows: the number of orbits is equal to the average number of points fixed per group element.
Fixing a group G, the set of formal differences of finite G-sets forms a ring called the Burnside ring of G, where addition corresponds to disjoint union, and multiplication to Cartesian product.
The notion of group action can be encoded by the action groupoid G′ = G ⋉ X associated to the group action. The stabilizers of the action are the vertex groups of the groupoid and the orbits of the action are its components.
If X and Y are two G-sets, a morphism from X to Y is a function f : X → Y such that f(g⋅x) = g⋅f(x) for all g in G and all x in X. Morphisms of G-sets are also called equivariant maps or G-maps.
The composition of two morphisms is again a morphism. If a morphism f is bijective, then its inverse is also a morphism. In this case f is called an isomorphism, and the two G-sets X and Y are called isomorphic; for all practical purposes, isomorphic G-sets are indistinguishable.
Some example isomorphisms:
With this notion of morphism, the collection of all G-sets forms a category; this category is a Grothendieck topos (in fact, assuming a classical metalogic, this topos will even be Boolean).
We can also consider actions of monoids on sets, by using the same two axioms as above. This does not define bijective maps and equivalence relations however. See semigroup action.
Instead of actions on sets, we can define actions of groups and monoids on objects of an arbitrary category: start with an object X of some category, and then define an action on X as a monoid homomorphism into the monoid of endomorphisms of X. If X has an underlying set, then all definitions and facts stated above can be carried over. For example, if we take the category of vector spaces, we obtain group representations in this fashion.
We can view a group G as a category with a single object in which every morphism is invertible.[15] A (left) group action is then nothing but a (covariant) functor from G to the category of sets, and a group representation is a functor from G to the category of vector spaces.[16] A morphism between G-sets is then a natural transformation between the group action functors.[17] In analogy, an action of a groupoid is a functor from the groupoid to the category of sets or to some other category.
In addition to continuous actions of topological groups on topological spaces, one also often considers smooth actions of Lie groups on smooth manifolds, regular actions of algebraic groups on algebraic varieties, and actions of group schemes on schemes. All of these are examples of group objects acting on objects of their respective category.