vEnhance's avatar

Dec 10, 2014

🖉 Representation Theory, Part 1: Irreducibles and Maschke's Theorem

Good luck to everyone taking the December TST tomorrow!

The goal of this post is to give the reader a taste of representation theory, a la Math 55a. In theory, this post should be accessible to anyone with a knowledge of group actions and abstract vector spaces.

Fix a ground field kk (for all vector spaces). In this post I will introduce the concept of representations and irreducible representations. Using these basic definitions I will establish Maschke’s Theorem, which tells us that irreducibles and indecomposables are the same thing.

1. Definition and examples

Let GG be a group.

Definition. A representation of GG consists of a pair ρ=(V,ρ)\rho = (V, \cdot_\rho) where VV is a vector space over kk and ρ\cdot_\rho is a (left) group action of GG on VV which is linear in VV. If VV is finite-dimensional then the dimension of ρ\rho is just the dimension of VV.

Explicitly the conditions on ρ\cdot_\rho are that

1ρv=vg1ρ(g2ρv)=(g1g2)ρvg1ρ(v1+v2)=gρv1+gρv2g1ρ(cv)=c(gρv). \begin{aligned} 1 \cdot_\rho v &= v \\ g_1 \cdot_\rho (g_2 \cdot_\rho v) &= (g_1g_2) \cdot_\rho v \\ g_1 \cdot_\rho (v_1 + v_2) &= g \cdot_\rho v_1 + g \cdot_\rho v_2 \\ g_1 \cdot_\rho (cv) &= c(g \cdot_\rho v). \end{aligned}

Note that another equivalent phrasing is that ρ\rho is a homomorphism from GG to the general linear group GL(V)\operatorname{GL}(V); however, we will not use this phrasing.

By abuse of notation, we occasionally refer to ρ\rho by just its underlying vector space VV in the case that ρ\cdot_\rho is clear from context. We may also abbreviate gρvg \cdot_\rho v as just gvg \cdot v.

A simple example of a nontrivial representation is the following.

Example. If V=knV = k^{n} and G=SnG = S_n, then an example of an action is ρ=(V,ρ)\rho = (V, \cdot_\rho) is simply

σρ<x1,,xn>=<xσ(1),,xσ(n)>\sigma \cdot_\rho \left<x_1, \dots, x_n\right> = \left<x_{\sigma(1)}, \dots, x_{\sigma(n)}\right>

meaning we permute the basis elements of VV. We denote this representation by refln\operatorname{refl}_n.

Let us give another useful example.

Definition. Let XX be a set acted on by GG. We define the vector space

Fun(X){maps Xk}\operatorname{Fun}(X) \coloneqq \left\{ \text{maps } X \rightarrow k \right\}

with the standard addition of functions.

Example. We define a representation on Fun(X)\operatorname{Fun}(X) by the following action: every fFun(X)f \in \operatorname{Fun}(X) gets sent to a gfFun(X)g \cdot f \in \operatorname{Fun}(X) by

(gFun(X)f)(x)=f(g1Xx).(g \cdot_{\operatorname{Fun}(X)} f)(x) = f\left( g^{-1} \cdot_X x \right).

By abuse of notation we will let Fun(X)\operatorname{Fun}(X) refer both to the vector space and the corresponding representation.

Now that we have two nontrivial examples, we also give a trivial example.

Definition. Let GG be a group. We define the trivial representation trivG\text{triv}_G, or just triv\text{triv}, as the representation triv_G=(k,triv)\text{triv}\_G = (k, \cdot_\text{triv}), where

gtriva=ag \cdot_\text{triv} a = a

for every aka \in k. In other words, GG acts trivially on kk.

2. Homomorphisms of representations

First, as a good budding algebraist (not really) I should define how these representations talk to each other.

Definition. Let ρ1=(V1,ρ1)\rho_1 = (V_1, \cdot_{\rho_1}) and ρ2=(V2,ρ2)\rho_2 = (V_2, \cdot_{\rho_2}) be representations of the same group GG. A homomorphism of GG-representations is a linear map T:V1V2T : V_1 \rightarrow V_2 which respects the GG-action: for any gGg \in G and vVv \in V,

gρ2T(v)=T(gρ1v).g \cdot_{\rho_2} T(v) = T\left( g \cdot_{\rho_1} v \right).

The set of all these homomorphisms is written HomG(ρ1,ρ2)\operatorname{Hom}_G(\rho_1, \rho_2), which is itself a vector space over kk.

(Digression: For those of you that know category theory, you might realize by now that representations correspond to functors from a category G\mathcal G (corresponding to the group GG) into Vectk\textbf{Vect}_k and that homomorphisms of representations are just natural transformations.)

To see an example of this definition in action, we give the following as an exercise.

Proposition 1. Let ρ=(V,ρ)\rho = \left( V, \cdot_\rho \right). We define the GG-invariant space ρGV\rho^G \subseteq V to be

ρG{vVgρv=v,gG}.\rho^G \coloneqq \left\{ v \in V \mid g \cdot_\rho v = v , \forall g \in G \right\}.

Then there is a natural bijection of vector spaces HomG(trivG,ρ)ρG\operatorname{Hom}_G(\text{triv}_G, \rho) \simeq \rho^G.

Proof: Let ρ=(V,ρ)\rho = (V, \cdot_\rho). The set HomG(trivG,ρ)\operatorname{Hom}_G(\text{triv}_G, \rho) consists of maps T:kVT : k \rightarrow V with gρT(a)=T(gtriva)=T(a)g \cdot_\rho T(a) = T(g \cdot_{\text{triv}} a) = T(a) for every aka \in k. Since T:kVT : k \rightarrow V is linear, it is uniquely defined by T(1)T(1) (since T(a)=aT(1)T(a) = a T(1) in general). So gρT(1)=T(1)g \cdot_\rho T(1) = T(1), i.e. T(1)ρGT(1) \in \rho^G, is necessary and sufficient. Thus the bijection is just TT(1)T \mapsto T(1). \Box

This proposition will come up again at the end of Part 4.

3. Subrepresentations, irreducibles, and Maschke’s theorem

Now suppose I’ve got a representation ρ=(V,ρ)\rho = (V, \cdot_\rho).

Definition. Suppose we have a subspace WVW \subseteq V which is ρ\rho-invariant, meaning that gρwWg \cdot_\rho w \in W for every wWw \in W and gGg \in G. Then we can construct a representation of GG on WW by restricting the action to WW:

ρ=(W,ρW).\rho' = \left( W, \cdot_\rho|_W \right).

In that case the resulting ρ\rho is called a subrepresentation of VV.

Every ρ\rho has an obvious subrepresentation, namely ρ\rho itself, as well as a stupid subrepresenation on the zero-dimensional vector space {0}\{0\}. But it’s the case that some representations have interesting subrepresentations.

Example. Consider the representation refl=(kn,)\operatorname{refl} = (k^n, \cdot) of SnS_n on knk^n defined in the first section. For all n2n \ge 2, refl\operatorname{refl} is not irreducible.

Proof: Consider the subspace WknW \subset k^n given by W={(x1,x2,,xn)x1++xn=0.}W = \left\{ (x_1, x_2, \dots, x_n) \mid x_1 + \dots + x_n = 0. \right\} then WW is invariant under refl\operatorname{refl}, so we have a subrepresentation of refl\operatorname{refl}, which we’ll denote refl0\operatorname{refl}_0. \Box

This motivates the ideas of irreducibles.

Definition. A representation ρ\rho is irreducible if it has no nontrivial subrepresentations.

Of course the first thing we ask is whether any representation decomposes as a product of irreducible representations. But what does it mean to compose two representations, anyways? It’s just the “natural” definition with the direct sum.

Definition. Let ρ1=(W1,ρ1)\rho_1 = (W_1, \cdot_{\rho_1}) and ρ2=(W2,ρ2)\rho_2 = (W_2, \cdot_{\rho_2}) be representations and suppose we have V=W1W2V = W_1 \oplus W_2. Then we define the representation ρ=ρ1ρ2\rho = \rho_1 \oplus \rho_2 by ρ=(V,ρ)\rho = (V, \cdot_\rho) where

gρ(w1+w2)=(gρ1w1)+(gρ2w2).g \cdot_\rho (w_1 + w_2) = (g \cdot_{\rho_1} w_1) + (g \cdot_{\rho_2} w_2).

Just like every integer decomposes into prime factors, we hope that every representation decomposes into irreducibles. But this is too much to hope for.

Example. Let G=S2G = S_2, let k=F2k = \mathbb F_2 be the finite field of order 22 (aka Z/2Z{\mathbb Z}/2{\mathbb Z}), and consider refl=(k2,)\operatorname{refl} = (k^2, \cdot), which is not irreducible. However, we claim that we cannot write refl=ρ1ρ2\operatorname{refl} = \rho_1 \oplus \rho_2 for any nontrivial ρ1\rho_1 and ρ2\rho_2.

Proof: This is a good concrete exercise.

Assume not, and let V1V_1 and V2V_2 be the underlying vector spaces of ρ1\rho_1 and ρ2\rho_2. By nontriviality, dimV1=dimV2=1\dim V_1 = \dim V_2 = 1, and in particular we have that as sets, V1=V2=2\left\lvert V_1 \right\rvert = \left\lvert V_2 \right\rvert = 2. Take the only nonzero elements (a,b)V1(a,b) \in V_1 and (c,d)V2(c,d) \in V_2. Since V1V_1 is invariant under refl\operatorname{refl}, (b,a)V1(b,a) \in V_1, so (a,b)=(b,a)    (a,b)=(1,1)V1(a,b) = (b,a) \implies (a,b) = (1,1) \in V_1. Similarly, (1,1)V2(1,1) \in V_2, which is impossible. \Box

So we hoped for perhaps too much. However, with seemingly trivial modifications we can make the above example work.

Example. In the same example as above, suppose we replace kk with any field which does not have characteristic 22. Then ρ\rho does decompose.

Proof: Consider the following two subspaces of V=k2V = k^2:

W1={<a,a>ak}W2={<a,a>ak}. \begin{aligned} W_1 &= \left\{ \left<a, a\right> \mid a \in k \right\} \\ W_2 &= \left\{ \left<a, -a\right> \mid a \in k \right\}. \end{aligned}

It’s easy to see that both W1W_1 and W2W_2 are both invariant under ρ\rho. Moreover, if chark2\operatorname{char} k \neq 2 then we in fact have V=W1W2V = W_1 \oplus W_2 because <x,y>=12<x+y,x+y>+12<xy,yx>\left<x,y\right> = \frac{1}{2} \left<x+y, x+y\right> + \frac{1}{2} \left<x-y, y-x\right> for any x,ykx,y \in k. So if we let ρ1=(W1,ρ1)\rho_1 = (W_1, \cdot_{\rho_1}) be the subrepresentation corresponding to W1W_1, and define ρ2\rho_2 on W2W_2 similarly, then we have ρ=ρ1ρ2\rho = \rho_1 \oplus \rho_2. \Box

Thus the only thing in the way of the counterexample was the fact that chark=2\operatorname{char} k = 2. And it turns out in general this is the only obstacle, a result called Maschke’s Theorem.

Theorem 2 (Maschke’s Theorem). Suppose that GG is a finite group, and chark\operatorname{char} k does not divide G\left\lvert G \right\rvert. Then every finite-dimensional representation decomposes as a direct sum of irreducibles.

Before proceeding to the proof, I’ll draw an analogy between the proof that every positive integer mm decomposes as the product of primes. We use by strong induction on mm; if mm is prime we are done, and if mm is composite there is a nontrivial divisor dmd \mid m, so we apply the inductive hypothesis to dd and m/dm/d and combine these factorizations. We want to mimic the proof above in our proof of Maschke’s Theorem, but we have a new obstacle: we have to show that somehow, we can “divide”.

So why is it that we can divide in certain situations? The idea is that we want to be able to look at an “average” of the form 1GgGgv\frac{1}{\left\lvert G \right\rvert} \sum_{g \in G} g \cdot v because this average has the nice property of being GG-invariant. We’ll use this to obtain our proof of Maschke’s Theorem.

Proof: We proceed by induction on the dimension of the representation ρ\rho. Let ρ=(V,ρ)\rho = (V, \cdot_\rho) be a representation and assume its not irreducible, so it has a nontrivial subspace WW which is ρ\rho-invariant. It suffices to prove that there exists a subspace WVW' \subset V such that WW' is also ρ\rho-invariant and V=WWV = W \oplus W', because then we can apply the inductive hypothesis to the subrepresentations induced by WW and WW'.

Let π:VW\pi : V \rightarrow W be any projection of VV onto WW. We consider the averaging map T:VVT : V \rightarrow V by T(v)=1GgGg1ρπ(gρv).T(v) = \frac{1}{\left\lvert G \right\rvert} \sum_{g \in G} g^{-1} \cdot_\rho \pi(g \cdot_\rho v). We’ll use the following properties of the map.

Exercise. Show that the map TT has the following three properties.

  • For any wWw \in W, T(w)=wT(w) = w.
  • For any vVv \in V, T(w)WT(w) \in W.
  • THomG(ρ,ρ)T \in \operatorname{Hom}_G(\rho, \rho).

As with any projection map TT, we must have V=kerTImTV = \ker T \oplus \operatorname{Im} T. But ImT=W\operatorname{Im} T = W. Moreover, because the map TT is GG-invariant, it follows that kerT\ker T is ρ\rho-invariant. Hence taking W=kerTW' = \ker T completes the proof. \Box

This completes our proof of Maschke’s Theorem, telling us how all irreducibles decompose. Said another way, Maschke’s Theorem tells us that any finite-dimensional representation ρ\rho can be decomposed as ραIrrep(G)ραnα\bigoplus_{\rho_\alpha \in \operatorname{Irrep}(G)} \rho_{\alpha}^{\oplus n_\alpha} where nαn_\alpha is some nonnegative integer, and Irrep(G)\operatorname{Irrep}(G) is the set of all (isomorphism classes of) irreducibles representations.

You may wonder whether the decomposition is unique, and if so what we can say about the “exponents” nαn_\alpha. In the next post I’ll show how to compute the exponents nαn_\alpha (which in particular gives uniqueness).

Thanks to Dennis Gaitsgory, who taught me this in his course Math 55a. Thanks also to the MOPpers at PUMaC 2014 who let me run this by them during a sleepover; several improvements were made to the original draft as a result. My notes for Math 55a can be found at my website. Thanks also to N for pointing out an error in my proof of Maschke’s Theorem.