vEnhance's avatar

Jul 31, 2016

🖉 Vinogradov's Three-Prime Theorem (with Sammy Luo and Ryan Alweiss)

This was my final paper for 18.099, seminar in discrete analysis, jointly with Sammy Luo and Ryan Alweiss.

We prove that every sufficiently large odd integer can be written as the sum of three primes, conditioned on a strong form of the prime number theorem.

1. Introduction

In this paper, we prove the following result:

Theorem 1 (Vinogradov)

Every sufficiently large odd integer NN is the sum of three prime numbers.

In fact, the following result is also true, called the “weak Goldbach conjecture”.

Theorem 2 (Weak Goldbach conjecture)

Every odd integer N7N \ge 7 is the sum of three prime numbers.

The proof of Vinogradov’s theorem becomes significantly simpler if one assumes the generalized Riemann hypothesis; this allows one to use a strong form of the prime number theorem (Theorem 9). This conditional proof was given by Hardy and Littlewood in the 1923’s. In 1997, Deshouillers, Effinger, te Riele and Zinoviev showed that the generalized Riemann hypothesis in fact also implies the weak Goldbach conjecture by improving the bound to 102010^{20} and then exhausting the remaining cases via a computer search.

As for unconditional proofs, Vinogradov was able to eliminate the dependency on the generalized Riemann hypothesis in 1937, which is why the Theorem 1 bears his name. However, Vinogradov’s bound used the ineffective Siegel-Walfisz theorem; his student K. Borozdin showed that 33153^{3^{15}} is large enough. Over the years the bound was improved, until recently in 2013 when Harald Helfgott claimed the first unconditional proof of Theorem 2, see here.

In this exposition we follow Hardy and Littlewood’s approach, i.e. we prove Theorem 1 assuming the generalized Riemann hypothesis, following the exposition of Rhee. An exposition of the unconditional proof by Vinogradov is given by Rouse.

2. Synopsis

We are going to prove that a+b+c=NΛ(a)Λ(b)Λ(c)12N2G(N)(1)\sum_{a+b+c = N} \Lambda(a) \Lambda(b) \Lambda(c) \asymp \frac12 N^2 \mathfrak G(N) \qquad (1) where

G(N)pN(11(p1)2)pN(1+1(p1)3) \mathfrak G(N) \coloneqq \prod_{p \mid N} \left( 1 - \frac{1}{(p-1)^2} \right) \prod_{p \nmid N} \left( 1 + \frac{1}{(p-1)^3} \right)

and Λ\Lambda is the von Mangoldt function defined as usual. Then so long as 2N2 \nmid N, the quantity G(N)\mathfrak G(N) will be bounded away from zero; thus (1) will imply that in fact there are many ways to write NN as the sum of three distinct prime numbers.

The sum (1) is estimated using Fourier analysis. Let us define the following.

Definition 3. Let T=R/Z\mathbb T = \mathbb R/\mathbb Z denote the circle group, and let e:TCe : \mathbb T \rightarrow \mathbb C be the exponential function θexp(2πiθ)\theta \mapsto \exp(2\pi i \theta). For αT\alpha\in\mathbb T, {α}\{\alpha\} denotes the minimal distance from α\alpha to an integer.

Note that e(θ)1=Θ({θ})|e(\theta)-1|=\Theta(\{\theta\}).

Definition 4. For αT\alpha \in \mathbb T and x>0x > 0 we define S(x,α)=nxΛ(n)e(nα).S(x, \alpha) = \sum_{n \le x} \Lambda(n) e(n\alpha).

Then we can rewrite (1) using SS as a “Fourier coefficient”:

Proposition 5. We have

a+b+c=NΛ(a)Λ(b)Λ(c)=αTS(N,α)3e(Nα)dα.(2) \sum_{a+b+c = N} \Lambda(a) \Lambda(b) \Lambda(c) = \int_{\alpha \in \mathbb T} S(N, \alpha)^3 e(-N\alpha) d\alpha. \qquad (2)

Proof: We have S(N,α)3=a,b,cNΛ(a)Λ(b)Λ(c)e((a+b+c)α),S(N,\alpha)^3=\sum_{a,b,c\leq N}\Lambda(a)\Lambda(b)\Lambda(c)e((a+b+c)\alpha), so

αTS(N,α)3e(Nα)dα=αTa,b,cNΛ(a)Λ(b)Λ(c)e((a+b+c)α)e(Nα)dα=a,b,cNΛ(a)Λ(b)Λ(c)αTe((a+b+cN)α)dα=a,b,cNΛ(a)Λ(b)Λ(c)I(a+b+c=N)=a+b+c=NΛ(a)Λ(b)Λ(c), \begin{aligned} \int_{\alpha \in \mathbb T} S(N, \alpha)^3 e(-N\alpha) d\alpha &= \int_{\alpha \in \mathbb T} \sum_{a,b,c\leq N} \Lambda(a)\Lambda(b)\Lambda(c)e((a+b+c)\alpha) e(-N\alpha) d\alpha \\ &= \sum_{a,b,c\leq N}\Lambda(a)\Lambda(b)\Lambda(c) \int_{\alpha \in \mathbb T}e((a+b+c-N)\alpha) d\alpha \\ &= \sum_{a,b,c\leq N}\Lambda(a)\Lambda(b)\Lambda(c)I(a+b+c=N) \\ &= \sum_{a+b+c=N}\Lambda(a)\Lambda(b)\Lambda(c), \end{aligned}

as claimed. \Box

In order to estimate the integral in Proposition 5, we divide T\mathbb T into the so-called “major” and “minor” arcs. Roughly,

  • The “major arcs” are subintervals of T\mathbb T centered at a rational number with small denominator.
  • The “minor arcs” are the remaining intervals.

These will be made more precise later. This general method is called the Hardy-Littlewood circle method, because of the integral over the circle group T\mathbb T.

The rest of the paper is structured as follows. In Section 3, we define the Dirichlet character and other number-theoretic objects, and state some estimates for the partial sums of these objects conditioned on the Riemann hypothesis. These bounds are then used in Section 4 to provide corresponding estimates on S(x,α)S(x, \alpha). In Section 5 we then define the major and minor arcs rigorously and use the previous estimates to given an upper bound for the integral over both areas. Finally, we complete the proof in Section 6.

3. Prime number theorem type bounds

In this section, we collect the necessary number-theoretic results that we will need. It is in this section only that we will require the generalized Riemann hypothesis.

As a reminder, the notation f(x)g(x)f(x)\ll g(x), where ff is a complex function and gg a nonnegative real one, means f(x)=O(g(x))f(x)=O(g(x)), a statement about the magnitude of ff. Likewise, f(x)=g(x)+O(h(x))f(x)=g(x)+O(h(x)) simply means that for some CC, f(x)g(x)Ch(x)|f(x)-g(x)|\leq C|h(x)| for all sufficiently large xx.

3.1. Dirichlet characters

In what follows, qq denotes a positive integer.

Definition 6. A Dirichlet character modulo qq χ\chi is a homomorphism χ:(Z/q)×C×\chi : (\mathbb Z/q)^\times \rightarrow \mathbb C^\times. It is said to be trivial if χ=1\chi = 1; we denote this character by χ0\chi_0.

By slight abuse of notation, we will also consider χ\chi as a function ZC\mathbb Z \rightarrow \mathbb C^\ast by setting χ(n)=χ(n(modq))\chi(n) = \chi(n \pmod q) for gcd(n,q)=1\gcd(n,q) = 1 and χ(n)=0\chi(n) = 0 for gcd(n,q)>1\gcd(n,q) > 1.

Remark 7. The Dirichlet characters form a multiplicative group of order ϕ(q)\phi(q) under multiplication, with inverse given by complex conjugation. Note that χ(m)\chi(m) is a primitive ϕ(q)\phi(q)-th root of unity for any m(Z/q)×m \in (\mathbb Z/q)^\times, thus χ\chi takes values in the unit circle.

Moreover, the Dirichlet characters satisfy an orthogonality relation

Experts may recognize that the Dirichlet characters are just the elements of the Pontryagin dual of (Z/q)×(\mathbb Z/q)^\times. In particular, they satisfy an orthogonality relationship

1ϕ(q)χ mod qχ(n)χ(a)={1n=a(modq)0otherwise(3) \frac{1}{\phi(q)} \sum_{\chi \text{ mod } q} \chi(n) \overline{\chi(a)} = \begin{cases} 1 & n = a \pmod q \\ 0 & \text{otherwise} \end{cases} \qquad (3)

and thus form an orthonormal basis for functions (Z/q)×C(\mathbb Z/q)^\times \rightarrow \mathbb C.

3.2. Prime number theorem for arithmetic progressions

Definition 8. The generalized Chebyshev function is defined by ψ(x,χ)=nxΛ(n)χ(n).\psi(x, \chi) = \sum_{n \le x} \Lambda(n) \chi(n).

The Chebyshev function is studied extensively in analytic number theory, as it is the most convenient way to phrase the major results of analytic number theory. For example, the prime number theorem is equivalent to the assertion that ψ(x,χ0)=nxΛ(n)x\psi(x, \chi_0) = \sum_{n \le x} \Lambda(n) \asymp x where q=1q = 1 (thus χ0\chi_0 is the constant function 11). Similarly, Dirichlet’s theorem actually asserts that any q1q \ge 1,

ψ(x,χ)={x+oq(x)χ=χ0 trivialoq(x)χχ0 nontrivial. \psi(x, \chi) = \begin{cases} x + o_q(x) & \chi = \chi_0 \text{ trivial} \\ o_q(x) & \chi \neq \chi_0 \text{ nontrivial}. \end{cases}

However, the error term in these estimates is quite poor (more than x1εx^{1-\varepsilon} for every ε\varepsilon). However, by assuming the Riemann Hypothesis for a certain “LL-function” attached to χ\chi, we can improve the error terms substantially.

Theorem 9 (Prime number theorem for arithmetic progressions)

Let χ\chi be a Dirichlet character modulo qq, and assume the Riemann hypothesis for the LL-function attached to χ\chi.

  1. If χ\chi is nontrivial, then

    ψ(x,χ)x(logqx)2.\psi(x, \chi) \ll \sqrt{x} (\log qx)^2.

  2. If χ=χ0\chi = \chi_0 is trivial, then

    ψ(x,χ0)=x+O(x(logx)2+logqlogx).\psi(x, \chi_0) = x + O\left( \sqrt x (\log x)^2 + \log q \log x \right).

Theorem 9 is the strong estimate that we will require when putting good estimates on S(x,α)S(x, \alpha), and is the only place in which the generalized Riemann Hypothesis is actually required.

3.3. Gauss sums

Definition 10. For χ\chi a Dirichlet character modulo qq, the Gauss sum τ(χ)\tau(\chi) is defined by τ(χ)=a=0q1χ(a)e(a/q).\tau(\chi)=\sum_{a=0}^{q-1}\chi(a)e(a/q).

We will need the following fact about Gauss sums.

Lemma 11. Consider Dirichlet characters modulo qq. Then:

  1. We have τ(χ0)=μ(q)\tau(\chi_0) = \mu(q).
  2. For any χ\chi modulo qq, τ(χ)q\left\lvert \tau(\chi) \right\rvert \le \sqrt q.

3.4. Dirichlet approximation

We finally require Dirichlet approximation theorem in the following form.

Theorem 12 (Dirichlet approximation)

Let αR\alpha \in \mathbb R be arbitrary, and MM a fixed integer. Then there exists integers aa and q=q(α)q = q(\alpha), with 1qM1 \le q \le M and gcd(a,q)=1\gcd(a,q) = 1, satisfying αaq1qM.\left\lvert \alpha - \frac aq \right\rvert \le \frac{1}{qM}.

4. Bounds on S(x,α)S(x, \alpha)

In this section, we use our number-theoretic results to bound S(x,α)S(x,\alpha).

First, we provide a bound for S(x,α)S(x,\alpha) if α\alpha is a rational number with “small” denominator qq.

Lemma 13. Let gcd(a,q)=1\gcd(a,q) = 1. Assuming Theorem 9, we have S(x,a/q)=μ(q)ϕ(q)x+O(qx(logqx)2)S(x, a/q) = \frac{\mu(q)}{\phi(q)} x + O\left( \sqrt{qx} (\log qx)^2 \right) where μ\mu denotes the Möbius function.

Proof: Write the sum as S(x,a/q)=nxΛ(n)e(na/q).S(x, a/q) = \sum_{n \le x} \Lambda(n) e(na/q). First we claim that the terms gcd(n,q)>1\gcd(n,q) > 1 (and Λ(n)0\Lambda(n) \neq 0) contribute a negligibly small logqlogx\ll \log q \log x. To see this, note that

  • The number qq has logq\ll \log q distinct prime factors, and
  • If peqp^e \mid q, then Λ(p)++Λ(pe)=elogp=log(pe)<logx\Lambda(p) + \dots + \Lambda(p^e) = e\log p = \log(p^e) < \log x.

So consider only terms with gcd(n,q)=1\gcd(n,q) = 1. To bound the sum, notice that

e(na/q)=b mod qe(b/q)1(ban)=b mod qe(b/q)(1ϕ(q)χ mod qχ(b)χ(an)) \begin{aligned} e(n \cdot a/q) &= \sum_{b \text{ mod } q} e(b/q) \cdot \mathbf 1(b \equiv an) \\ &= \sum_{b \text{ mod } q} e(b/q) \left( \frac{1}{\phi(q)} \sum_{\chi \text{ mod } q} \chi(b) \overline{\chi(an)} \right) \end{aligned}

by the orthogonality relations. Now we swap the order of summation to obtain a Gauss sum:

e(na/q)=1ϕ(q)χ mod qχ(an)(b mod qχ(b)e(b/q))=1ϕ(q)χ mod qχ(an)τ(χ). \begin{aligned} e(n \cdot a/q) &= \frac{1}{\phi(q)} \sum_{\chi \text{ mod } q} \overline{\chi(an)} \left( \sum_{b \text{ mod } q} \chi(b) e(b/q) \right) \\ &= \frac{1}{\phi(q)} \sum_{\chi \text{ mod } q} \overline{\chi(an)} \tau(\chi). \end{aligned}

Thus, we swap the order of summation to obtain that

S(x,α)=nxgcd(n,q)=1Λ(n)e(na/q)=1ϕ(q)nxgcd(n,q)=1χ mod qΛ(n)χ(an)τ(χ)=1ϕ(q)χ mod qτ(χ)nxgcd(n,q)=1Λ(n)χ(an)=1ϕ(q)χ mod qχ(a)τ(χ)nxgcd(n,q)=1Λ(n)χ(n)=1ϕ(q)χ mod qχ(a)τ(χ)ψ(x,χ)=1ϕ(q)(τ(χ0)ψ(x,χ0)+1χ mod qχ(a)τ(χ)ψ(x,χ)). \begin{aligned} S(x, \alpha) &= \sum_{\substack{n \le x \\ \gcd(n,q) = 1}} \Lambda(n) e(n \cdot a/q) \\ &= \frac{1}{\phi(q)} \sum_{\substack{n \le x \\ \gcd(n,q) = 1}} \sum_{\chi \text{ mod } q} \Lambda(n) \overline{\chi(an)} \tau(\chi) \\ &= \frac{1}{\phi(q)} \sum_{\chi \text{ mod } q} \tau(\chi) \sum_{\substack{n \le x \\ \gcd(n,q) = 1}} \Lambda(n) \overline{\chi(an)} \\ &= \frac{1}{\phi(q)} \sum_{\chi \text{ mod } q} \overline{\chi(a)} \tau(\chi) \sum_{\substack{n \le x \\ \gcd(n,q) = 1}} \Lambda(n)\overline{\chi(n)} \\ &= \frac{1}{\phi(q)} \sum_{\chi \text{ mod } q} \overline{\chi(a)} \tau(\chi) \psi(x, \overline\chi) \\ &= \frac{1}{\phi(q)} \left( \tau(\chi_0) \psi(x, \chi_0) + \sum_{1 \neq \chi \text{ mod } q} \overline{\chi(a)} \tau(\chi) \psi(x, \overline\chi) \right). \end{aligned}

Now applying both parts of Lemma 11 in conjunction with Theorem 9 gives

S(x,α)=μ(q)ϕ(q)(x+O(x(logqx)2))+O(x(logx)2)=μ(q)ϕ(q)x+O(qx(logqx)2) \begin{aligned} S(x,\alpha) &= \frac{\mu(q)}{\phi(q)} \left( x + O\left( \sqrt x (\log qx)^2 \right) \right) + O\left( \sqrt x (\log x)^2 \right) \\ &= \frac{\mu(q)}{\phi(q)} x + O\left( \sqrt{qx} (\log qx)^2 \right) \end{aligned}

as desired. \Box

We then provide a bound when α\alpha is “close to” such an a/qa/q.

Lemma 14. Let gcd(a,q)=1\gcd(a,q) = 1 and βT\beta \in \mathbb T. Assuming Theorem 9, we have

S(x,a/q+β)=μ(q)ϕ(q)(nxe(βn))+O((1+{β}x)qx(logqx)2). S(x, a/q + \beta) = \frac{\mu(q)}{\phi(q)} \left( \sum_{n \le x} e(\beta n) \right) + O\left( (1+\{\beta\}x) \sqrt{qx} (\log qx)^2 \right).

Proof: For convenience let us assume xZx \in \mathbb Z. Let α=a/q+β\alpha = a/q + \beta. Let us denote Err(x,α)=S(x,α)μ(q)ϕ(q)x\text{Err}(x, \alpha) = S(x,\alpha) - \frac{\mu(q)}{\phi(q)} x, so by Lemma 13 we have Err(x,α)qx(logx)2\text{Err}(x,\alpha) \ll \sqrt{qx}(\log x)^2. We have

S(x,α)=nxΛ(n)e(na/q)e(nβ)=nxe(nβ)(S(n,a/q)S(n1,a/q))=nxe(nβ)(μ(q)ϕ(q)+Err(n,α)Err(n1,α))=μ(q)ϕ(q)(nxe(nβ))+1mx1(e((m+1)β)e(mβ))Err(m,α)+e(xβ)Err(x,α)e(0)Err(0,α)μ(q)ϕ(q)(nxe(nβ))+(1mx1{β}Err(m,α))+Err(0,α)+Err(x,α)μ(q)ϕ(q)(nxe(nβ))+(1+x{β})O(qx(logqx)2) \begin{aligned} S(x, \alpha) &= \sum_{n \le x} \Lambda(n) e(na/q) e(n\beta) \\ &= \sum_{n \le x} e(n\beta) \left( S(n, a/q) - S(n-1, a/q) \right) \\ &= \sum_{n \le x} e(n\beta) \left( \frac{\mu(q)}{\phi(q)} + \text{Err}(n, \alpha) - \text{Err}(n-1, \alpha) \right) \\ &= \frac{\mu(q)}{\phi(q)} \left( \sum_{n \le x} e(n\beta) \right) + \sum_{1 \le m \le x-1} \left( e( (m+1)\beta) - e( m\beta ) \right) \text{Err}(m, \alpha) \\ &\qquad + e(x\beta) \text{Err}(x, \alpha) - e(0) \text{Err}(0, \alpha) \\ &\le \frac{\mu(q)}{\phi(q)} \left( \sum_{n \le x} e(n\beta) \right) + \left( \sum_{1 \le m \le x-1} \{\beta\} \text{Err}(m, \alpha) \right) + \text{Err}(0, \alpha) + \text{Err}(x, \alpha) \\ &\ll \frac{\mu(q)}{\phi(q)} \left( \sum_{n \le x} e(n\beta) \right) + \left( 1+x\left\{ \beta \right\} \right) O\left( \sqrt{qx} (\log qx)^2 \right) \end{aligned}

as desired. \Box

Thus if α\alpha is close to a fraction with small denominator, the value of S(x,α)S(x, \alpha) is bounded above. We can now combine this with the Dirichlet approximation theorem to obtain the following general result.

Corollary 15. Suppose M=N2/3M = N^{2/3} and suppose αa/q1qM\left\lvert \alpha - a/q \right\rvert \le \frac{1}{qM} for some gcd(a,q)=1\gcd(a,q) = 1 with qMq \le M. Assuming Theorem 9, we have S(x,α)xφ(q)+x56+εS(x, \alpha) \ll \frac{x}{\varphi(q)} + x^{\frac56+\varepsilon} for any ε>0\varepsilon > 0.

Proof: Apply Lemma 14 directly. \Box

5. Estimation of the arcs

We’ll write f(α)S(N,α)=nNΛ(n)e(nα)f(\alpha) \coloneqq S(N,\alpha)=\sum_{n \le N} \Lambda(n)e(n\alpha) for brevity in this section.

Recall that we wish to bound the right-hand side of (2) in Proposition 5. We split [0,1][0,1] into two sets, which we call the “major arcs” and the “minor arcs.” To do so, we use Dirichlet approximation, as hinted at earlier.

In what follows, fix M=N2/3K=(logN)10.\begin{aligned} M &= N^{2/3} \\ K &= (\log N)^{10}. \end{aligned}

5.1. Setting up the arcs

Definition 16. For qKq \le K and gcd(a,q)=1\gcd(a,q) = 1, 1aq1 \le a \le q, we define

M(a,q)={αTαaq1M}. \mathfrak M(a,q) = \left\{ \alpha \in \mathbb T \mid \left\lvert \alpha - \frac aq \right\rvert \le \frac 1M \right\}.

These will be the major arcs. The union of all major arcs is denoted by M\mathfrak M. The complement is denoted by m\mathfrak m.

Equivalently, for any α\alpha, consider q=q(α)Mq = q(\alpha) \le M as in Theorem 12. Then αM\alpha \in \mathfrak M if qKq \le K and αm\alpha \in \mathfrak m otherwise.

Proposition 17. M\mathfrak M is composed of finitely many disjoint intervals M(a,q)\mathfrak M(a,q) with qKq \le K. The complement m\mathfrak m is nonempty.

Proof: Note that if q1,q2Kq_1, q_2 \le K and a/q1b/q2a/q_1 \neq b/q_2 then aq1bq21q1q23qM\left\lvert \frac{a}{q_1} - \frac{b}{q_2} \right\rvert \ge \frac{1}{q_1q_2} \gg \frac{3}{qM}. \Box

In particular both M\mathfrak M and m\mathfrak m are measurable. Thus we may split the integral in (2) over M\mathfrak M and m\mathfrak m. This integral will have large magnitude on the major arcs, and small magnitude on the minor arcs, so overall the whole interval [0,1][0,1] it will have large magnitude.

5.2. Estimate of the minor arcs

First, we note the well known fact ϕ(q)q/logq\phi(q) \gg q/\log q. Note also that if q=q(α)q=q(\alpha) as in the last section and α\alpha is on a minor arc, we have q>(logN)10q > (\log N)^{10}, and thus ϕ(q)(logN)9\phi(q) \gg (\log N)^{9}.

As such Corollary 3.3 yields that f(α)Nϕ(q)+N.834N(logN)9f(\alpha) \ll \frac{N}{\phi(q)}+N^{.834} \ll \frac{N}{(\log N)^9}.

Now,

mf(α)3e(Nα)dαmf(α)3dαN(logN)901f(α)2dα=N(logN)901f(α)f(α)dα=N(logN)9nNΛ(n)2N2(logN)8, \begin{aligned} \left\lvert \int_{\mathfrak m}f(\alpha)^3e(-N\alpha) d\alpha \right\rvert &\le \int_{\mathfrak m}\left\lvert f(\alpha)\right\rvert ^3 d\alpha \\ &\ll \frac{N}{(\log N)^9} \int_{0}^{1}\left\lvert f(\alpha)\right\rvert ^2 d\alpha \\ &=\frac{N}{(\log N)^9}\int_{0}^{1}f(\alpha)f(-\alpha) d\alpha \\ &=\frac{N}{(\log N)^9}\sum_{n \le N} \Lambda(n)^2 \\ &\ll \frac{N^2}{(\log N)^8}, \end{aligned}

using the well known bound nNΛ(n)2NlogN\sum_{n \le N} \Lambda(n)^2 \ll \frac{N}{\log N}. This bound of N2(logN)8\frac{N^2}{(\log N)^8} will be negligible compared to lower bounds for the major arcs in the next section.

5.3. Estimate on the major arcs

We show that Mf(α)3e(Nα)dαN22G(N).\int_{\mathfrak M}f(\alpha)^3e(-N\alpha) d\alpha \asymp \frac{N^2}{2} \mathfrak G(N). By Proposition 17 we can split the integral over each interval and write

Mf(α)3e(Nα)dα=q(logN)101aqgcd(a,q)=11/qM1/qMf(a/q+β)3e(N(a/q+β))dβ. \int_{\mathfrak M} f(\alpha)^3e(-N\alpha) d\alpha = \sum_{q \le (\log N)^{10}} \sum_{\substack{1 \le a \le q \\ \gcd(a,q)=1}} \int_{-1/qM}^{1/qM}f(a/q+\beta)^3e(-N(a/q+\beta)) d\beta.

Then we apply Lemma 14, which gives

f(a/q+β)3=(μ(q)ϕ(q)nNe(βn))3+(μ(q)ϕ(q)nNe(βn))2O((1+{β}N)qNlog2qN)+(μ(q)ϕ(q)nNe(βn))O((1+{β}N)qNlog2qN)2+O((1+{β}N)qNlog2qN)3. \begin{aligned} f(a/q+\beta)^3 &= \left(\frac{\mu(q)}{\phi(q)}\sum_{n \le N}e(\beta n) \right)^3 \\ &+ \left(\frac{\mu(q)}{\phi(q)}\sum_{n \le N}e(\beta n)\right)^2 O\left((1+\{\beta\}N)\sqrt{qN} \log^2 qN\right) \\ &+ \left(\frac{\mu(q)}{\phi(q)}\sum_{n \le N}e(\beta n)\right) O\left((1+\{\beta\}N)\sqrt{qN} \log^2 qN\right)^2 \\ &+ O\left((1+\{\beta\}N)\sqrt{qN} \log^2 qN\right)^3. \end{aligned}

Now, we can do casework on the side of N.9N^{-.9} that {β}\{\beta\} lies on.

  • If {β}N.9\{\beta\} \gg N^{-.9}, we have nNe(βn)2e(β)11{β}N.9\sum_{n \le N}e(\beta n) \ll \frac{2}{|e(\beta)-1|} \ll \frac{1}{\{\beta\}} \ll N^{.9}, and (1+{β}N)qNlog2qNN5/6+ε(1+\{\beta\}N)\sqrt{qN} \log^2 qN \ll N^{5/6+\varepsilon}, because certainly we have {β}<1/M=N2/3\{\beta\}<1/M=N^{-2/3}.
  • If on the other hand {β}N.9\{\beta\}\ll N^{-.9}, we have nNe(βn)N\sum_{n \le N}e(\beta n) \ll N obviously, and O(1+{β}N)qNlog2qN)N3/5+εO(1+\{\beta\}N)\sqrt{qN} \log^2 qN) \ll N^{3/5+\varepsilon}.

As such, we obtain

f(a/q+β)3(μ(q)ϕ(q)nNe(βn))3+O(N79/30+ε) f(a/q+\beta)^3 \ll \left( \frac{\mu(q)}{\phi(q)}\sum_{n \le N}e(\beta n) \right)^3 + O\left(N^{79/30+\varepsilon}\right)

in either case. Thus, we can write

Mf(α)3e(Nα)dα=q(logN)101aqgcd(a,q)=11/qM1/qMf(a/q+β)3e(N(a/q+β))dβ=q(logN)101aqgcd(a,q)=11/qM1/qM[(μ(q)ϕ(q)nNe(βn))3+O(N79/30+ε)]e(N(a/q+β))dβ=q(logN)10μ(q)ϕ(q)3Sq(1aqgcd(a,q)=1e(N(a/q)))(1/qM1/qM(nNe(βn))3e(Nβ)dβ)+O(N59/30+ε). \begin{aligned} &\qquad \int_{\mathfrak M} f(\alpha)^3e(-N\alpha) d\alpha \\ &= \sum_{q \le (\log N)^{10}} \sum_{\substack{1 \le a \le q \\ \gcd(a,q)=1}} \int_{-1/qM}^{1/qM} f(a/q+\beta)^3e(-N(a/q+\beta)) d\beta \\ &= \sum_{q \le (\log N)^{10}} \sum_{\substack{1 \le a \le q \\ \gcd(a,q)=1}} \int_{-1/qM}^{1/qM}\left[\left(\frac{\mu(q)}{\phi(q)}\sum_{n \le N}e(\beta n)\right)^3 + O\left(N^{79/30+\varepsilon}\right)\right]e(-N(a/q+\beta)) d\beta \\ &= \sum_{q \le (\log N)^{10}} \frac{\mu(q)}{\phi(q)^3} S_q \left(\sum_{\substack{1 \le a \le q \\ \gcd(a,q)=1}} e(-N(a/q))\right) \left( \int_{-1/qM}^{1/qM}\left(\sum_{n \le N}e(\beta n)\right)^3e(-N\beta) d\beta \right ) \\ &\qquad +O\left(N^{59/30+\varepsilon}\right). \end{aligned}

just using MN2/3M \le N^{2/3}. Now, we use nNe(βn)=1e(βN)1e(β)1{β}.\sum_{n \le N}e(\beta n) = \frac{1-e(\beta N)}{1-e(\beta)} \ll \frac{1}{\{\beta\}}. This enables us to bound the expression

1/qM11/qM(nNe(βn))3e(Nβ)dβ1/qM11/qM{β}3dβ=21/qM1/2β3dβq2M2. \int_{1/qM}^{1-1/qM}\left (\sum_{n \le N}e(\beta n)\right) ^ 3 e(-N\beta)d\beta \ll \int_{1/qM}^{1-1/qM}\{\beta\}^{-3} d\beta = 2\int_{1/qM}^{1/2}\beta^{-3} d\beta \ll q^2M^2.

But the integral over the entire interval is

01(nNe(βn))3e(Nβ)dβ=01a,b,cNe((a+b+cN)β)a,b,cN1(a+b+c=N)=(N12). \begin{aligned} \int_{0}^{1}\left(\sum_{n \le N}e(\beta n) \right)^3 e(-N\beta)d\beta &= \int_{0}^{1} \sum_{a,b,c \le N} e((a+b+c-N)\beta) \\ &\ll \sum_{a,b,c \le N} \mathbf 1(a+b+c=N) \\ &= \binom{N-1}{2}. \end{aligned}

Considering the difference of the two integrals gives

1/qM1/qM(nNe(βn))3e(Nβ)dβN22q2M2+N(logN)cN4/3, \int_{-1/qM}^{1/qM}\left(\sum_{n \le N}e(\beta n) \right)^3 e(-N\beta) d\beta - \frac{N^2}{2} \ll q^2 M^2 + N \ll (\log N)^c N^{4/3},

for some absolute constant cc.

For brevity, let Sq=1aqgcd(a,q)=1e(N(a/q)).S_q = \sum_{\substack{1 \le a \le q \\ \gcd(a,q)=1}} e(-N(a/q)). Then

Mf(α)3e(Nα)dα=q(logN)10μ(q)ϕ(q)3Sq(1/qM1/qM(nNe(βn))3e(Nβ)dβ)+O(N59/30+ε)=N22q(logN)10μ(q)ϕ(q)3Sq+O((logN)10+cN4/3)+O(N59/30+ε)=N22q(logN)10μ(q)ϕ(q)3+O(N59/30+ε). \begin{aligned} \int_{\mathfrak M} f(\alpha)^3e(-N\alpha) d\alpha &= \sum_{q \le (\log N)^{10}} \frac{\mu(q)}{\phi(q)^3}S_q \left( \int_{-1/qM}^{1/qM}\left(\sum_{n \le N}e(\beta n)\right)^3e(-N\beta) d\beta \right ) \\ &\qquad +O\left(N^{59/30+\varepsilon}\right) \\ &= \frac{N^2}{2}\sum_{q \le (\log N)^{10}} \frac{\mu(q)}{\phi(q)^3}S_q + O((\log N)^{10+c} N^{4/3}) + O(N^{59/30+\varepsilon}) \\ &= \frac{N^2}{2}\sum_{q \le (\log N)^{10}} \frac{\mu(q)}{\phi(q)^3} + O(N^{59/30+\varepsilon}). \end{aligned}

The inner sum is bounded by ϕ(q)\phi(q). So,

q>(logN)10μ(q)ϕ(q)3Sqq>(logN)101ϕ(q)2, \left\lvert \sum_{q>(\log N)^{10}} \frac{\mu(q)}{\phi(q)^3} S_q \right\rvert \le \sum_{q>(\log N)^{10}} \frac{1}{\phi(q)^2},

which converges since ϕ(q)2qc\phi(q)^2 \gg q^c for some c>1c > 1. So

Mf(α)3e(Nα)dα=N22q=1μ(q)ϕ(q)3Sq+O(N59/30+ε). \int_{\mathfrak M} f(\alpha)^3e(-N\alpha) d\alpha = \frac{N^2}{2}\sum_{q = 1}^\infty \frac{\mu(q)}{\phi(q)^3}S_q + O(N^{59/30+\varepsilon}).

Now, since μ(q)\mu(q), ϕ(q)\phi(q), and 1aqgcd(a,q)=1e(N(a/q))\sum_{\substack{1 \le a \le q \\ \gcd(a,q)=1}} e(-N(a/q)) are multiplicative functions of qq, and μ(q)=0\mu(q)=0 unless qq is squarefree,

q=1μ(q)ϕ(q)3Sq=p(1+μ(p)ϕ(p)3Sp)=p(11(p1)3a=1p1e(N(a/p)))=p(11(p1)3a=1p1(p1(pN)1))=pN(11(p1)2)pN(1+1(p1)3)=G(N). \begin{aligned} \sum_{q = 1}^\infty \frac{\mu(q)}{\phi(q)^3} S_q &= \prod_p \left(1+\frac{\mu(p)}{\phi(p)^3}S_p \right) \\ &= \prod_p \left(1-\frac{1}{(p-1)^3} \sum_{a=1}^{p-1} e(-N(a/p))\right) \\ &= \prod_p \left(1-\frac{1}{(p-1)^3}\sum_{a=1}^{p-1} (p\cdot \mathbf 1(p|N) - 1)\right) \\ &= \prod_{p|N}\left(1-\frac{1}{(p-1)^2}\right) \prod_{p \nmid N}\left(1+\frac{1}{(p-1)^3}\right) \\ &= \mathfrak G(N). \end{aligned}

So,

Mf(α)3e(Nα)dα=N22G(N)+O(N59/30+ε). \int_{\mathfrak M} f(\alpha)^3e(-N\alpha) d\alpha = \frac{N^2}{2}\mathfrak{G}(N) + O(N^{59/30+\varepsilon}).

When NN is odd,

G(N)=pN(11(p1)2)pN(1+1(p1)3)m3(m2m1mm1)=12, \mathfrak{G}(N) = \prod_{p|N}\left(1-\frac{1}{(p-1)^2}\right) \prod_{p \nmid N}\left(1+\frac{1}{(p-1)^3}\right) \geq \prod_{m\geq 3}\left(\frac{m-2}{m-1}\frac{m}{m-1}\right) = \frac{1}{2},

so that we have Mf(α)3e(Nα)dαN22G(N),\int_{\mathfrak M} f(\alpha)^3e(-N\alpha) d\alpha \asymp \frac{N^2}{2}\mathfrak{G}(N), as desired.

6. Completing the proof

Because the integral over the minor arc is o(N2)o(N^2), it follows that

a+b+c=NΛ(a)Λ(b)Λ(c)=01f(α)3e(Nα)dαN22G(N)N2. \sum_{a+b+c=N} \Lambda(a)\Lambda(b)\Lambda(c) = \int_{0}^{1} f(\alpha)^3 e(-N\alpha) d \alpha \asymp \frac{N^2}{2}\mathfrak{G}(N) \gg N^2.

Consider the set SNS_N of integers pkNp^k\leq N with k>1k>1. We must have pN12p \le N^{\frac{1}{2}}, and for each such pp there are at most O(logN)O(\log N) possible values of kk. As such,

SNπ(N1/2)logNN1/2. |S_N| \ll \pi(N^{1/2}) \log N \ll N^{1/2}.

Thus

a+b+c=NaSNΛ(a)Λ(b)Λ(c)(logN)3SNlog(N)3N3/2, \sum_{\substack{a+b+c=N \\ a\in S_N}} \Lambda(a)\Lambda(b)\Lambda(c) \ll (\log N)^3 |S|N \ll \log(N)^3 N^{3/2},

and similarly for bSNb\in S_N and cSNc\in S_N. Notice that summing over aSNa\in S_N is equivalent to summing over composite aa, so

p1+p2+p3=NΛ(p1)Λ(p2)Λ(p3)=a+b+c=NΛ(a)Λ(b)Λ(c)+O(log(N)3N3/2)N2, \sum_{p_1+p_2+p_3=N} \Lambda(p_1)\Lambda(p_2)\Lambda(p_3) = \sum_{a+b+c=N} \Lambda(a)\Lambda(b)\Lambda(c) + O(\log(N)^3 N^{3/2}) \gg N^2,

where the sum is over primes pip_i. This finishes the proof.