vEnhance's avatar

Jul 05, 2015

🖉 Linnik's Theorem for Sato-Tate Laws on CM Elliptic Curves

Here I talk about my first project at the Emory REU. Prerequisites for this post: some familiarity with number fields.

1. Motivation: Arithmetic Progressions

Given a property PP about primes, there’s two questions we can ask:

  1. How many primes x\le x are there with this property?
  2. What’s the least prime with this property?

As an example, consider an arithmetic progression aa, a+da+d, …, with a<da < d and gcd(a,d)=1\gcd(a,d) = 1. The strong form of Dirichlet’s Theorem tells us that basically, the number of primes a(modd)\equiv a \pmod d is 1d\frac 1d the total number of primes. Moreover, the celebrated Linnik’s Theorem tells us that the first prime is O(dL)O(d^L) for a fixed LL, with record L=5L = 5.

As I talked about last time on my blog, the key ingredients were:

  • Introducing Dirichlet characters χ\chi, which are periodic functions modulo qq. One uses this to get the mod qq into the problem.
  • Introducing an LL-function L(s,χ)L(s, \chi) attached to χ\chi.
  • Using complex analysis (Cauchy’s Residue Theorem) to boil the proof down to properties of the zeros of L(s,χ)L(s, \chi).

With that said, we now move to the object of interest: elliptic curves.

2. Counting Primes

Let EE be an elliptic curve over Q\mathbb Q, which for our purposes we can think of concretely as a curve in Weirestrass form y2=x3+Ax+By^2 = x^3 + Ax + B where the right-hand side has three distinct complex roots (viewed as a polynomial in xx). If we are unlucky enough that the right-hand side has a double root, then the curve ceases to bear the name “elliptic curve” and instead becomes singular.

Picture of an elliptic curve.
Picture of an elliptic curve.

Here’s a natural number theoretic question: for any rational prime pp, how many solutions does EE have modulo pp?

To answer this it’s helpful to be able to think over an arbitrary field FF. While we’ve written our elliptic curve EE as a curve over Q\mathbb Q, we could just as well regard it as a curve over C\mathbb C, or as a curve over Q(2)\mathbb Q(\sqrt 2). Even better, since we’re interested in counting solutions modulo pp, we can regard this as a curve over Fp\mathbb F_p. To make this clear, we will use the notation E/FE/F to signal that we are thinking of our elliptic curve over the field FF. Also, we write #E(F)\#E(F) to denote the number of points of the elliptic curve over FF (usually when FF is a finite field). Thus, the question boils down to computing #E(Fp)\#E(\mathbb F_p).

Anyways, the question above is given by the famous Hasse bound, and in fact it works over any number field!

Theorem 1 (Hasse Bound)

Let KK be a number field, and let E/KE/K be an elliptic curve. Consider any prime ideal pOK\mathfrak p \subseteq \mathcal O_K which is not ramified. Then we have #E(Fp)=Np+1ap\#E(\mathbb F_\mathfrak p) = \mathrm{N}\mathfrak p + 1 - a_\mathfrak p where ap2Np\left\lvert a_\mathfrak p \right\rvert \le 2\sqrt{\mathrm{N}\mathfrak p}.

Here Fp=OK/p\mathbb F_\mathfrak p = \mathcal O_K / \mathfrak p is the field of Np\mathrm{N}\mathfrak p elements. The extra “+1+1” comes from a point at infinity when you complete the elliptic curve in the projective plane.

Here, the ramification means what you might guess. Associated to every elliptic curve over Q\mathbb Q is a conductor NN, and a prime pp is ramified if it divides NN. The finitely many ramified primes are the “bad” primes for which something breaks down when we take modulo pp (for example, perhaps the curve becomes singular).

In other words, for the Q\mathbb Q case, except for finitely many bad primes pp, the number of solutions is p+1+O(p)p + 1 + O(\sqrt p), and we even know the implied OO-constant to be 22.

Now, how do we predict the error term?

3. The Sato-Tate Conjecture

For elliptic curves over Q\mathbb Q, we the Sato-Tate conjecture (which recently got upgraded to a theorem) more or less answers the question. But to state it, I have to introduce a new term: an elliptic curve E/QE/\mathbb Q, when regarded over C\mathbb C, can have complex multiplication (abbreviated CM). I’ll define this in just a moment, but for now, the two things to know are

  • CM curves are “special cases”, in the sense that a randomly selected elliptic curve won’t have CM.
  • It’s not easy in general to tell whether a given elliptic curve has CM.

Now I can state the Sato-Tate result. It is most elegantly stated in terms of the following notation: if we define ap=p+1#E(Fp)a_p = p + 1 - \#E(\mathbb F_p) as above, then there is a unique θp[0,π]\theta_p \in [0,\pi] which obeys ap=2pcosθp.a_p = 2 \sqrt p \cos \theta_p.

Theorem 2 (Sato-Tate)

Fix an elliptic curve E/QE/\mathbb Q which does not have CM (when regarded over C\mathbb C). Then as pp varies across unramified primes, the asymptotic probability that θp[α,β]\theta_p \in [\alpha, \beta] is 2π[α,β]sin2θp.\frac{2}{\pi} \int_{[\alpha, \beta]} \sin^2\theta_p. In other words, θp\theta_p is distributed according to the measure sin2θ\sin^2\theta.

Now, what about the CM case?

4. CM Elliptic Curves

Consider an elliptic curve E/QE/\mathbb Q but regard it as a curve over C\mathbb C. It’s well known that elliptic curves happen to have a group law: given two points on an elliptic curve, you can add them to get a third point. (If you’re not familiar with this, Wikipedia has a nice explanation). So elliptic curves have more structure than just their set of points: they form an abelian group; when written in Weirerstrass form, the identity is the point at infinity.

Group law on an elliptic curve.
Group law on an elliptic curve.

Letting A=(A,+)A = (A, +) be the associated abelian group, we can look at the endomorphisms of EE (that is, homomorphisms AAA \rightarrow A). These form a ring, which we denote End(E)\operatorname{End}(E). An example of such an endomorphism is anaa \mapsto n \cdot a for an integer nn (meaning a++aa+\dots+a, nn times). In this way, we see that ZEnd(E)\mathbb Z \subseteq \operatorname{End}(E).

Most of the time we in fact have End(E)Z\operatorname{End}(E) \cong \mathbb Z. But on occasion, we will find that End(E)\operatorname{End}(E) is congruent to OK\mathcal O_K, the ring of integers of a number field KK. This is called complex multiplication by KK.

Intuitively, this CM is special (despite being rare), because it means that the group structure associated to EE has a richer set of symmetry. For CM curves over any number field, for example, the Sato-Tate result becomes very clean, and is considerably more straightforward to prove.

Here’s an example. The elliptic curve E:y2=x317xE : y^2 = x^3 - 17 x of conductor N=26172N = 2^6 \cdot 17^2 turns out to have End(E)Z[i]\operatorname{End}(E) \cong \mathbb Z[i] i.e. it has complex multiplication has Z[i]\mathbb Z[i]. Throwing out the bad primes 22 and 1717, we compute the first several values of apa_p, and something bizarre happens. For the 33 mod 44 primes we get

a3=0a7=0a11=0a19=0a23=0a31=0 \begin{aligned} a_{3} &= 0 & a_{7} &= 0 & a_{11} &= 0 \\ a_{19} &= 0 & a_{23} &= 0 & a_{31} &= 0 \end{aligned}

and for the 11 mod 44 primes we have

a5=4a13=6a29=4a37=12a41=8 \begin{aligned} a_5 &= 4 \\ a_{13} &= 6 \\ a_{29} &= 4 \\ a_{37} &= 12 \\ a_{41} &= -8 \end{aligned}

Astonishingly, the vanishing of apa_p is controlled by the splitting of pp in Z[i]\mathbb Z[i]! In fact, this holds more generally. It’s a theorem that for elliptic curves E/QE/\mathbb Q with CM, we have End(E)OK\operatorname{End}(E) \cong \mathcal O_K where KK is some quadratic imaginary number field which is also a PID, like Z[i]\mathbb Z[i]. Then OK\mathcal O_K governs how the apa_p behave:

Theorem 3 (Sato-Tate Over CM)

Let E/QE/\mathbb Q be a fixed elliptic curve with CM by OK\mathcal O_K. Let p\mathfrak p be a unramified prime of OK\mathcal O_K.

  1. If p\mathfrak p is inert, then ap=0a_\mathfrak p = 0 (i.e. θp=12π\theta_\mathfrak p = \frac{1}{2}\pi).
  2. If p\mathfrak p is split, then θp\theta_\mathfrak p is uniform across [0,π][0, \pi].

I’m told this is much easier to prove than the usual Sato-Tate.

But there’s even more going on in the background. If I look again at apa_p where p1(mod4)p \equiv 1 \pmod 4, I might recall that pp can be written as the sum of squares, and construct the following table:

papx2+y25422+1213632+2229422+52371262+1241842+52531472+22611262+52731682+32891052+82 \begin{array}{rrl} p & a_p & x^2+y^2 \\ 5 & 4 & 2^2 + 1^2 \\ 13 & 6 & 3^2 + 2^2 \\ 29 & 4 & 2^2 + 5^2 \\ 37 & 12 & 6^2 + 1^2 \\ 41 & -8 & 4^2 + 5^2 \\ 53 & 14 & 7^2 + 2^2 \\ 61 & 12 & 6^2 + 5^2 \\ 73 & -16 & 8^2 + 3^2 \\ 89 & -10 & 5^2 + 8^2 \\ \end{array}

Each apa_p is double one of the terms! There is no mistake: the apa_p are also tied to the decomposition of p=x2+y2p = x^2+y^2. And this works for any number field.

What’s happening? The main idea is that looking at a prime ideal p=(x+yi)\mathfrak p = (x+yi), apa_\mathfrak p is related to the argument of the complex number x+yix+yi in some way. Of course, there are lots of questions unanswered (how to pick the ±\pm sign, and which of xx and yy to choose) but there’s a nice way to package all this information, as I’ll explain momentarily.

(Aside: I think the choice of having xx be the odd or even number depends precisely on whether pp is a quadratic residue modulo 1717, but I’ll have to check on that.)

5. LL-Functions

I’ll just briefly explain where all this is coming from, and omit lots of details (in part because I don’t know all of them). Let E/QE/\mathbb Q be an elliptic curve with CM by OK\mathcal O_K. We can define an associated LL-function

L(s,E/K)=p(1ap(Np)s+12+1(Np)2s) L(s, E/K) = \prod_\mathfrak p \left( 1 - \frac{a_\mathfrak p}{(\mathrm{N}\mathfrak p)^{s+\frac{1}{2}}} + \frac{1}{(\mathrm{N}\mathfrak p)^{2s}} \right)

(actually this isn’t quite true actually, some terms change for ramified primes p\mathfrak p).

At the same time there’s a notion of a Hecke Grössencharakter ξ\xi on a number field KK – a higher dimensional analog of the Dirichlet characters we used on Z\mathbb Z to filter modulo qq. For our purposes, think of it as a multiplicative function which takes in ideals of OK\mathcal O_K and returns complex numbers of norm 11. Like Dirichlet characters, each ξ\xi gets a Hecke LL-function L(s,ξ)=p(1ξ(p)(Np)s)L(s, \xi) = \prod_\mathfrak p \left( 1 - \frac{\xi(\mathfrak p)}{(\mathrm{N}\mathfrak p)^s} \right) which again extends to a meromorphic function on the entire complex plane.

Now the great theorem is:

Theorem 4 (Deuring)

Let E/QE/\mathbb Q have CM by OK\mathcal O_K. Then L(s,E/K)=L(s,ξ)L(s,ξ)L(s,E/K) = L(s, \xi)L(s, \overline{\xi}) for some Hecke Grössencharakter ξ\xi.

Using the definitions given above and equating the Euler products at an unramified p\mathfrak p gives

1ap(Np)s+12+1(Np)2s=(1ξ(p)(Np)s)(1ξ(p)(Np)s) 1 - \frac{a_\mathfrak p}{(\mathrm{N}\mathfrak p)^{s+\frac{1}{2}}} + \frac{1}{(\mathrm{N}\mathfrak p)^{2s}} = \left( 1 - \frac{\xi(\mathfrak p)}{(\mathrm{N}\mathfrak p)^s} \right) \left( 1 - \frac{\overline{\xi(\mathfrak p)}}{(\mathrm{N}\mathfrak p)^s} \right)

Upon recalling that ap=2Npcosθpa_\mathfrak p = 2 \sqrt{\mathrm{N}\mathfrak p} \cos \theta_\mathfrak p, we derive ξ(p)=exp(±iθp).\xi(\mathfrak p) = \exp(\pm i \theta_\mathfrak p). This is enough to determine the entire ξ\xi since ξ\xi is multiplicative.

So this is the result: let E/QE/\mathbb Q be an elliptic curve of conductor NN. Given our quadratic number field KK, we define a map ξ\xi from prime ideals of OK\mathcal O_K to the unit circle in C\mathbb C by

p{exp(±iθp)gcd(Np,N)=10gcd(Np,N)>1. \mathfrak p \mapsto \begin{cases} \exp(\pm i \theta_\mathfrak p) & \gcd(\mathrm{N}\mathfrak p, N) = 1 \\ 0 & \gcd(\mathrm{N}\mathfrak p, N) > 1. \end{cases}

Thus ξ\xi is a Hecke Grössencharakter for some choice of ±\pm at each p\mathfrak p.

It turns out furthermore that ξ\xi has frequency 11, which roughly means that the argument of ξ((π))\xi\left( (\pi) \right) is related to 11 times the argument of π\pi itself. This fact is what explains the mysterious connection between the apa_p and the solutions above.

6. Linnik-Type Result

With this in mind, I can now frame the main question: suppose we have an interval [α,β][0,π][\alpha, \beta] \subset [0,\pi]. What’s the first prime pp such that θp[α,β]\theta_p \in [\alpha, \beta]? We’d love to have some analog of Linnik’s Theorem here.

This was our project and the REU, and Ashvin, Peter and I proved that

Theorem 5. If a rational EE has CM then the least prime pp with θp[α,β]\theta_p \in [\alpha,\beta] is (Nβα)A.\ll \left( \frac{N}{\beta-\alpha} \right)^A.

I might blog later about what else goes into the proof of this… but Deuring’s result is one key ingredient, and a proof of an analogous theorem for non-CM curves would have to be very different.