As part of the 18.099 Discrete Analysis reading group at MIT,
I presented section 4.7 of Tao-Vu’s Additive
Combinatorics textbook.
Here were the notes I used for the second half of my presentation.
1. Synopsis
We aim to prove the following result.
Theorem 1 (Bourgain)
Assume N ≥ 2 N \ge 2 N ≥ 2 is prime and A , B ⊆ Z = Z N A, B \subseteq Z = \mathbb Z_N A , B ⊆ Z = Z N . Assume that
δ ≫ ( log log N ) 3 log N \delta \gg \sqrt{\frac{(\log \log N)^3}{\log N}} δ ≫ log N ( log log N ) 3
is such that min { P Z A , P Z B } ≥ δ \min\left\{ \mathbf P_ZA, \mathbf P_ZB \right\} \ge \delta min { P Z A , P Z B } ≥ δ .
Then A + B A+B A + B contains a proper arithmetic progression of length at least
exp ( C δ 2 log N 3 ) \exp\left( C\sqrt[3]{\delta^2 \log N} \right) exp ( C 3 δ 2 log N )
for some absolute constant C > 1 C > 1 C > 1 .
The methods that we used with Bohr sets fail here,
because in the previous half of yesterday’s lecture we took advantage of
Parseval’s identity in order to handle large convolutions,
always keeping two 1 ^ ∗ \widehat 1_\ast 1 ∗ term’s inside the ∑ \sum ∑ sign.
When we work with A + B A+B A + B this causes us to be stuck.
So, we instead use the technology of Λ ( p ) \Lambda(p) Λ ( p ) constants and dissociated sets.
2. Previous results
As usual, let Z Z Z denote a finite abelian group. Recall that
Definition 2. Let S ⊆ Z S \subseteq Z S ⊆ Z and 2 ≤ p ≤ ∞ 2 \le p \le \infty 2 ≤ p ≤ ∞ .
The Λ ( p ) \Lambda(p) Λ ( p ) constant of S S S , denoted ∥ S ∥ Λ ( p ) \left\lVert S \right\rVert_{\Lambda(p)} ∥ S ∥ Λ ( p ) , is defined as
∥ S ∥ Λ ( p ) = sup c : S → C c ≢ 0 ∥ ∑ ξ ∈ S c ( ξ ) e ( ξ ⋅ x ) ∥ L p ( Z ) ∥ c ∥ ℓ 2 ( S ) .
\left\lVert S \right\rVert_{\Lambda(p)} =
\sup_{\substack{c : S \rightarrow \mathbb C \\ c \not\equiv 0}}
\frac{\left\lVert \displaystyle\sum_{\xi \in S} c(\xi) e(\xi \cdot x) \right\rVert_{L^p(Z)}}
{\left\lVert c \right\rVert_{\ell^2(S)}}.
∥ S ∥ Λ ( p ) = c : S → C c ≡ 0 sup ∥ c ∥ ℓ 2 ( S ) ξ ∈ S ∑ c ( ξ ) e ( ξ ⋅ x ) L p ( Z ) .
Definition 3. If S ⊆ Z S \subseteq Z S ⊆ Z ,
we say S S S is a dissociated set if all 2 ∣ S ∣ 2^{|S|} 2 ∣ S ∣ subset sums of S S S are distinct.
For such sets we have the Rudin’s inequality (yes,
Walter ) which states that
Lemma 4 (Rudin’s inequality)
If S S S is dissociated then
∥ S ∥ Λ ( p ) ≪ p . \left\lVert S \right\rVert_{\Lambda(p)} \ll \sqrt p. ∥ S ∥ Λ ( p ) ≪ p .
Disassociated sets come up via the so-called “cube covering lemma”:
Lemma 5 (Cube covering lemma)
Let S ⊆ Z S \subseteq Z S ⊆ Z and d ≥ 1 d \ge 1 d ≥ 1 . Then we can partition
S = D 1 ⊔ D 2 ⊔ ⋯ ⊔ D k ⊔ R S = D_1 \sqcup D_2 \sqcup \dots \sqcup D_k \sqcup R S = D 1 ⊔ D 2 ⊔ ⋯ ⊔ D k ⊔ R
such that
Each D i D_i D i is dissociated of size d + 1 d+1 d + 1 ,
There exists η 1 \eta_1 η 1 , … \dots … , η d \eta_d η d such that R R R is contained in a d d d -cube, i.e.
it’s covered by c 1 η 1 + ⋯ + c d η d c_1\eta_1 + \dots + c_d\eta_d c 1 η 1 + ⋯ + c d η d , where c i ∈ { − 1 , 0 , 1 } c_i \in \{-1,0,1\} c i ∈ { − 1 , 0 , 1 } .
Finally, we remind the reader that
Lemma 6 (Parseval)
We have
∥ f ∥ L 2 Z = ∥ f ^ ∥ ℓ 2 Z . \left\lVert f \right\rVert_{L^2Z} = \left\lVert \widehat f \right\rVert_{\ell^2Z}. ∥ f ∥ L 2 Z = f ℓ 2 Z .
Since we don’t have Bohr sets anymore, the way we detect progressions is to use the pigeonhole principle.
In what follows, let T n f T^n f T n f be the shift of x x x by n n n , id est T n f ( x ) = f ( x − n ) T^nf(x) = f(x-n) T n f ( x ) = f ( x − n ) .
Proposition 7 (Pigeonhole gives arithmetic progressions)
Let f : Z → R ≥ 0 f : Z \rightarrow \mathbb R_{\ge 0} f : Z → R ≥ 0 , J ≥ 1 J \ge 1 J ≥ 1 and suppose r ∈ Z r \in \mathbb Z r ∈ Z is such that
E Z max 1 ≤ j ≤ J ∣ T j r f − f ∣ < E Z f . \mathbf E_Z \max_{1 \le j \le J} \left\lvert T^{jr}f - f \right\rvert < \mathbf E_Z f. E Z 1 ≤ j ≤ J max T j r f − f < E Z f .
Then supp ( f ) \operatorname{supp}(f) supp ( f ) contains an arithmetic of length j j j and spacing r r r .
Proof: Apply the pigeonhole principle to find an x x x such that
max 1 ≤ j ≤ J ∣ T j r f ( x ) − f ( x ) ∣ < f ( x ) . \max_{1 \le j \le J} \left\lvert T^{jr}f(x) - f(x) \right\rvert < f(x). 1 ≤ j ≤ J max T j r f ( x ) − f ( x ) < f ( x ) .
Then the claim follows. □ \Box □
3. Periodicity
Proposition 8 (Estimate for max h ∈ H ∣ T h f ∣ \max_{h \in H} |T^hf| max h ∈ H ∣ T h f ∣ for supp ( f ^ ) \operatorname{supp}(\widehat f) supp ( f ) dissociated)
Let f : Z → R f : Z \rightarrow \mathbb R f : Z → R , supp ( f ^ ) ⊆ S ⊆ Z \operatorname{supp}(\widehat f) \subseteq S \subseteq Z supp ( f ) ⊆ S ⊆ Z with S S S dissociated.
Then for any set H H H with ∣ H ∣ > 1 |H| > 1 ∣ H ∣ > 1 we have
∥ max h ∈ H ∣ T h f ∣ ∥ L 2 Z ≪ log ∣ H ∣ ∥ f ∥ L 2 Z .
\left\lVert \max_{h \in H} \left\lvert T^h f \right\rvert \right\rVert_{L^2Z}
\ll \sqrt{\log|H|} \left\lVert f \right\rVert_{L^2Z}.
h ∈ H max T h f L 2 Z ≪ log ∣ H ∣ ∥ f ∥ L 2 Z .
Proof: Let p > 2 p > 2 p > 2 be large and note
∥ max h ∈ H ∣ T h f ∣ ∥ L 2 Z ≤ ∥ max h ∈ H ∣ T h f ∣ ∥ L p Z ≤ ∥ ( ∑ h ∈ H ∣ T h f ∣ p ) 1 / p ∥ L p Z = ( E Z ( ∑ h ∈ H ∣ T h f ∣ p ) ) 1 / p = ( ∑ h ∈ H E Z ∣ T h f ∣ p ) 1 / p = ( ∑ h ∈ H E Z ∣ f ∣ p ) 1 / p = ∣ H ∣ 1 / p ∥ ∑ ξ f ^ ( ξ ) e ( ξ ⋅ x ) ∥ L p Z ≤ ∣ H ∣ 1 / p ∥ S ∥ Λ ( p ) ∥ f ^ ∥ ℓ 2 Z
\begin{aligned}
\left\lVert \max_{h \in H} \left\lvert T^h f \right\rvert \right\rVert_{L^2Z}
&\le \left\lVert \max_{h \in H} \left\lvert T^h f \right\rvert \right\rVert_{L^pZ} \\
&\le \left\lVert \left( \sum_{h \in H} \left\lvert T^h f \right\rvert^p \right)^{1/p} \right\rVert_{L^pZ} \\
&= \left( \mathbf E_Z \left( \sum_{h \in H} \left\lvert T^h f \right\rvert^p \right) \right)^{1/p} \\
&= \left( \sum_{h \in H} \mathbf E_Z \left\lvert T^h f \right\rvert^p \right)^{1/p} \\
&= \left( \sum_{h \in H} \mathbf E_Z \left\lvert f \right\rvert^p \right)^{1/p} \\
&= \left\lvert H \right\rvert^{1/p}
\left\lVert \sum_\xi \widehat f(\xi) e(\xi \cdot x) \right\rVert_{L^pZ} \\
&\le \left\lvert H \right\rvert^{1/p} \left\lVert S \right\rVert_{\Lambda(p)}
\left\lVert \widehat f \right\rVert_{\ell^2Z} \\
\end{aligned}
h ∈ H max T h f L 2 Z ≤ h ∈ H max T h f L p Z ≤ ( h ∈ H ∑ T h f p ) 1/ p L p Z = ( E Z ( h ∈ H ∑ T h f p ) ) 1/ p = ( h ∈ H ∑ E Z T h f p ) 1/ p = ( h ∈ H ∑ E Z ∣ f ∣ p ) 1/ p = ∣ H ∣ 1/ p ξ ∑ f ( ξ ) e ( ξ ⋅ x ) L p Z ≤ ∣ H ∣ 1/ p ∥ S ∥ Λ ( p ) f ℓ 2 Z
Then by Parseval and Rudin,
∥ max h ∈ H ∣ T h f ∣ ∥ L 2 Z ≤ ∣ H ∣ 1 / p ∥ S ∥ Λ ( p ) ∥ f ∥ L 2 Z ≪ ∣ H ∣ 1 / p p ∥ f ∥ L 2 Z .
\begin{aligned}
\left\lVert \max_{h \in H} \left\lvert T^h f \right\rvert \right\rVert_{L^2Z}
&\le \left\lvert H \right\rvert^{1/p} \left\lVert S \right\rVert_{\Lambda(p)}
\left\lVert f \right\rVert_{L^2Z} \\
&\ll \left\lvert H \right\rvert^{1/p} \sqrt p \left\lVert f \right\rVert_{L^2Z}.
\end{aligned}
h ∈ H max T h f L 2 Z ≤ ∣ H ∣ 1/ p ∥ S ∥ Λ ( p ) ∥ f ∥ L 2 Z ≪ ∣ H ∣ 1/ p p ∥ f ∥ L 2 Z .
We may then take p ≪ log H p \ll \log H p ≪ log H . □ \Box □
We combine these two propositions into the following lemma which applies if
f ^ \widehat f f has nonzero values of “uniform” size.
Lemma 9 (Uniformity estimate for shifts)
Let f : Z → R f : Z \rightarrow \mathbb R f : Z → R and J , d > 1 J, d > 1 J , d > 1 .
Suppose that f ^ \widehat f f is “uniform in size” across its support, in the sense that
sup ξ ∈ supp ( f ^ ) ∣ f ^ ( ξ ) ∣ inf ξ ∈ supp ( f ^ ) ∣ f ^ ( ξ ) ∣ ≤ 2016.
\frac {\sup_{\xi \in \operatorname{supp}(\widehat f)} \left\lvert \widehat f(\xi) \right\rvert}
{\inf_{\xi \in \operatorname{supp}(\widehat f)} \left\lvert \widehat f(\xi) \right\rvert} \le 2016.
inf ξ ∈ supp ( f ) f ( ξ ) sup ξ ∈ supp ( f ) f ( ξ ) ≤ 2016.
Then one can find S ⊆ Z S \subseteq Z S ⊆ Z such that ∣ S ∣ = d |S| = d ∣ S ∣ = d and for all r ∈ Z r \in Z r ∈ Z ,
E Z max 1 ≤ j ≤ J ∣ T j r f − f ∣ ≪ ( ∑ ξ ∣ f ^ ( ξ ) ∣ ) ( log J d + J d max η ∈ S ∥ η ⋅ r ∥ R / Z ) .
\mathbf E_Z \max_{1 \le j \le J} \left\lvert T^{jr}f - f \right\rvert
\ll \left( \sum_\xi \left\lvert \widehat f(\xi) \right\rvert \right)
\left( \sqrt{\frac{\log J}{d}} + Jd\max_{\eta \in S}
\left\lVert \eta \cdot r \right\rVert_{\mathbb R/\mathbb Z} \right).
E Z 1 ≤ j ≤ J max T j r f − f ≪ ξ ∑ f ( ξ ) ( d log J + J d η ∈ S max ∥ η ⋅ r ∥ R / Z ) .
Proof: Use the cube covering lemma to put
supp ( f ^ ) = D 1 ⊔ ⋯ ⊔ D k ⊔ R \operatorname{supp}(\widehat f) = D_1 \sqcup \dots \sqcup D_k \sqcup R supp ( f ) = D 1 ⊔ ⋯ ⊔ D k ⊔ R where R R R is
contained in the cube of S = { η 1 , … , η d } S = \left\{ \eta_1, \dots, \eta_d \right\} S = { η 1 , … , η d } and ∣ D i ∣ = d + 1 |D_i| = d+1 ∣ D i ∣ = d + 1 for 1 ≤ i ≤ k 1 \le i \le k 1 ≤ i ≤ k .
Accordingly, we decompose f f f over its Fourier transform as
f = f 1 + ⋯ + f k + g f = f_1 + \dots + f_k + g f = f 1 + ⋯ + f k + g
by letting f i f_i f i be supported on D i D_i D i and g ( x ) g(x) g ( x ) supported on R R R .
First, we can bound the “leftover” bits in R R R :
E Z max 1 ≤ j ≤ J ∣ T j r g − g ∣ = E Z max 0 ≤ j ≤ J ∑ ξ ∈ R ∣ f ^ ( ξ ) ⋅ ( e ( ξ ⋅ ( x + j r ) ) − e ( ξ ⋅ x ) ) ∣ ≤ E Z max 0 ≤ j ≤ J ∑ ξ ∈ R ∣ f ^ ( ξ ) ∣ ∣ ( e ( ξ ⋅ ( x + j r ) ) − e ( ξ ⋅ x ) ) ∣ ≤ ( ∑ ξ ∈ R ∣ f ^ ( ξ ) ∣ ) max 0 ≤ j ≤ J ξ ∈ R ∣ ( e ( ξ ⋅ ( x + j r ) ) − e ( ξ ⋅ x ) ) ∣ ≤ ( ∑ ξ ∈ R ∣ f ^ ( ξ ) ∣ ) max 0 ≤ j ≤ J ξ ∈ R ∣ e ( ξ ⋅ j r ) − 1 ∣ ≤ ( ∑ ξ ∈ R ∣ f ^ ( ξ ) ∣ ) 2 π max 0 ≤ j ≤ J ξ ∈ R ∥ ξ ⋅ j r ∥ R / Z
\begin{aligned}
\mathbf E_Z \max_{1 \le j \le J} \left\lvert T^{jr} g - g \right\rvert
&= \mathbf E_Z \max_{0 \le j \le J} \sum_{\xi \in R}
\left\lvert \widehat f(\xi) \cdot (e(\xi \cdot (x+jr)) - e(\xi \cdot x)) \right\rvert \\
&\le \mathbf E_Z \max_{0 \le j \le J} \sum_{\xi \in R}
\left\lvert \widehat f(\xi) \right\rvert
\left\lvert (e(\xi \cdot (x+jr)) - e(\xi \cdot x)) \right\rvert \\
&\le \left( \sum_{\xi \in R} \left\lvert \widehat f(\xi) \right\rvert \right)
\max_{\substack{0 \le j \le J \\ \xi \in R}}
\left\lvert (e(\xi \cdot (x+jr)) - e(\xi \cdot x)) \right\rvert \\
&\le \left( \sum_{\xi \in R} \left\lvert \widehat f(\xi) \right\rvert \right)
\max_{\substack{0 \le j \le J \\ \xi \in R}} \left\lvert e(\xi \cdot jr) - 1 \right\rvert \\
&\le \left( \sum_{\xi \in R} \left\lvert \widehat f(\xi) \right\rvert \right) 2\pi
\max_{\substack{0 \le j \le J \\ \xi \in R}}
\left\lVert \xi \cdot jr \right\rVert_{\mathbb R/\mathbb Z}
\end{aligned}
E Z 1 ≤ j ≤ J max T j r g − g = E Z 0 ≤ j ≤ J max ξ ∈ R ∑ f ( ξ ) ⋅ ( e ( ξ ⋅ ( x + j r )) − e ( ξ ⋅ x )) ≤ E Z 0 ≤ j ≤ J max ξ ∈ R ∑ f ( ξ ) ∣ ( e ( ξ ⋅ ( x + j r )) − e ( ξ ⋅ x )) ∣ ≤ ξ ∈ R ∑ f ( ξ ) 0 ≤ j ≤ J ξ ∈ R max ∣ ( e ( ξ ⋅ ( x + j r )) − e ( ξ ⋅ x )) ∣ ≤ ξ ∈ R ∑ f ( ξ ) 0 ≤ j ≤ J ξ ∈ R max ∣ e ( ξ ⋅ j r ) − 1 ∣ ≤ ξ ∈ R ∑ f ( ξ ) 2 π 0 ≤ j ≤ J ξ ∈ R max ∥ ξ ⋅ j r ∥ R / Z
Since the ξ ∈ R \xi \in R ξ ∈ R are covered by a cube of S = { η 1 , … , η d } S = \left\{ \eta_1, \dots, \eta_d \right\} S = { η 1 , … , η d } , we get
E Z max 1 ≤ j ≤ J ∣ T j r g − g ∣ ≤ ( ∑ ξ ∈ R ∣ f ^ ( ξ ) ∣ ) 2 π J d max 0 ≤ j ≤ J η ∈ S ∥ η ⋅ j r ∥ R / Z .
\mathbf E_Z \max_{1 \le j \le J} \left\lvert T^{jr} g - g \right\rvert
\le \left( \sum_{\xi \in R} \left\lvert \widehat f(\xi) \right\rvert \right) 2\pi Jd
\max_{\substack{0 \le j \le J \\ \eta \in S}}
\left\lVert \eta \cdot jr \right\rVert_{\mathbb R/\mathbb Z}.
E Z 1 ≤ j ≤ J max T j r g − g ≤ ξ ∈ R ∑ f ( ξ ) 2 π J d 0 ≤ j ≤ J η ∈ S max ∥ η ⋅ j r ∥ R / Z .
Let’s then bound the contribution over each dissociated set.
We’ll need both the assumption of uniformity and the proposition we proved for dissociated sets.
E Z max 1 ≤ j ≤ J ∣ T j r f i − f i ∣ ≤ 2 E Z max 0 ≤ j ≤ J ∣ T j r f i ∣ ≤ 2 ∥ max 0 ≤ j ≤ J ∣ T j r f i ∣ ∥ L 2 Z . ≪ log ( J ) ∥ f i ∥ L 2 Z = log ( J ) ∑ ξ ∈ D i ∣ f ^ ( ξ ) ∣ 2 ≪ log J D ∑ ξ ∈ D i ∣ f ^ ( ξ ) ∣
\begin{aligned}
\mathbf E_Z \max_{1 \le j \le J} \left\lvert T^{jr} f_i - f_i \right\rvert
&\le 2\mathbf E_Z \max_{0 \le j \le J} \left\lvert T^{jr} f_i \right\rvert \\
&\le 2\left\lVert \max_{0 \le j \le J} \left\lvert T^{jr} f_i \right\rvert \right\rVert_{L^2Z}. \\
&\ll \sqrt{\log(J)} \left\lVert f_i \right\rVert_{L^2Z} \\
&= \sqrt{\log(J)} \sqrt{\sum_{\xi \in D_i} \left\lvert \widehat f(\xi) \right\rvert^2 } \\
&\ll \sqrt{\frac{\log J}{D}} \sum_{\xi \in D_i} \left\lvert \widehat f(\xi) \right\rvert
\end{aligned}
E Z 1 ≤ j ≤ J max T j r f i − f i ≤ 2 E Z 0 ≤ j ≤ J max T j r f i ≤ 2 0 ≤ j ≤ J max T j r f i L 2 Z . ≪ log ( J ) ∥ f i ∥ L 2 Z = log ( J ) ξ ∈ D i ∑ f ( ξ ) 2 ≪ D log J ξ ∈ D i ∑ f ( ξ )
where the last step is by uniformity of ξ ^ \widehat \xi ξ . Now combine everything with triangle inequality.
□ \Box □
4. Proof of main theorem
Without loss of generality P Z A = P Z B = δ \mathbf P_ZA = \mathbf P_ZB = \delta P Z A = P Z B = δ .
Of course, we let f = 1 A ∗ 1 B f = 1_A \ast 1_B f = 1 A ∗ 1 B so E Z f = δ 2 \mathbf E_Z f = \delta^2 E Z f = δ 2 .
We will have parameters d ≥ 1 d \ge 1 d ≥ 1 , M ≥ 1 M \ge 1 M ≥ 1 ,
and J ≥ exp ( C δ 2 log N 3 ) J \ge \exp(C\sqrt[3]{\delta^2 \log N}) J ≥ exp ( C 3 δ 2 log N ) which we will select at the end.
Our goal is to show there exists some integer r r r such that
E Z max 1 ≤ j < J ∣ T j r f − f ∣ < δ 2 . \mathbf E_Z \max_{1 \le j < J} \left\lvert T^{jr} f - f \right\rvert < \delta^2. E Z 1 ≤ j < J max T j r f − f < δ 2 .
Now we cannot apply the uniformity estimate directly since f f f is probably not uniform,
and therefore we impose a dyadic decomposition on the base group Z Z Z ; let
Z 0 = { ξ ∈ Z : 1 2 δ 2 < ∣ f ^ ( ξ ) ∣ ≤ δ 2 } Z 1 = { ξ ∈ Z : 1 4 δ 2 < ∣ f ^ ( ξ ) ∣ ≤ 1 2 δ 2 } Z 2 = { ξ ∈ Z : 1 8 δ 2 < ∣ f ^ ( ξ ) ∣ ≤ 1 4 δ 2 } ⋮ Z M − 1 = { ξ ∈ Z : 2 − M δ 2 < ∣ f ^ ( ξ ) ∣ ≤ 2 − M + 1 δ 2 } Z e r r = { ξ ∈ Z : ∣ f ^ ( ξ ) ∣ < 2 − M δ 2 }
\begin{aligned}
Z_0 &= \left\{ \xi \in Z :
\frac{1}{2} \delta^2 < \left\lvert \widehat f(\xi) \right\rvert \le \delta^2 \right\} \\
Z_1 &= \left\{ \xi \in Z :
\frac14\delta^2 < \left\lvert \widehat f(\xi) \right\rvert \le \frac{1}{2}\delta^2 \right\} \\
Z_2 &= \left\{ \xi \in Z :
\frac18\delta^2 < \left\lvert \widehat f(\xi) \right\rvert \le \frac14\delta^2 \right\} \\
&\vdots \\
Z_{M-1} &= \left\{ \xi \in Z :
2^{-M} \delta^2 < \left\lvert \widehat f(\xi) \right\rvert \le 2^{-M+1} \delta^2 \right\} \\
Z_{\mathrm{err}} &= \left\{ \xi \in Z :
\left\lvert \widehat f(\xi) \right\rvert < 2^{-M} \delta^2 \right\} \\
\end{aligned}
Z 0 Z 1 Z 2 Z M − 1 Z err = { ξ ∈ Z : 2 1 δ 2 < f ( ξ ) ≤ δ 2 } = { ξ ∈ Z : 4 1 δ 2 < f ( ξ ) ≤ 2 1 δ 2 } = { ξ ∈ Z : 8 1 δ 2 < f ( ξ ) ≤ 4 1 δ 2 } ⋮ = { ξ ∈ Z : 2 − M δ 2 < f ( ξ ) ≤ 2 − M + 1 δ 2 } = { ξ ∈ Z : f ( ξ ) < 2 − M δ 2 }
Then as before we can decompose via Fourier transform to obtain
f = f 0 + f 1 + ⋯ + f M − 1 + f e r r f = f_0 + f_1 + \dots + f_{M-1} + f_{\mathrm{err}} f = f 0 + f 1 + ⋯ + f M − 1 + f err
so that f ^ i \widehat f_i f i is supported on Z i Z_i Z i .
Now we can apply the previous lemma to get for each 0 ≤ m < M 0 \le m < M 0 ≤ m < M :
E Z max 1 ≤ j ≤ J ∣ T j r f m − f m ∣ ≪ ( ∑ ξ ∈ Z m ∣ f ^ ( ξ ) ∣ ) ( log J d + J d max η ∈ S m ∥ η ⋅ r ∥ R / Z )
\mathbf E_Z \max_{1 \le j \le J} \left\lvert T^{jr} f_m - f_m \right\rvert
\ll \left( \sum_{\xi \in Z_m} \left\lvert \widehat f(\xi) \right\rvert \right)
\left( \sqrt{\frac{\log J}{d}} + Jd\max_{\eta \in S_m}
\left\lVert \eta \cdot r \right\rVert_{\mathbb R/\mathbb Z} \right)
E Z 1 ≤ j ≤ J max T j r f m − f m ≪ ξ ∈ Z m ∑ f ( ξ ) ( d log J + J d η ∈ S m max ∥ η ⋅ r ∥ R / Z )
for some S m S_m S m ; hence by summing and using the fact that
∑ ξ ∈ Z ∣ f ^ ( ξ ) ∣ = ∑ ξ ∈ Z ∣ 1 ^ A ( ξ ) ∣ ∣ 1 ^ B ( ξ ) ∣ ≤ ∥ 1 ^ A ∥ ℓ 2 Z ∥ 1 ^ B ∥ ℓ 2 Z = ∥ 1 A ∥ L 2 Z ∥ 1 B ∥ L 2 Z = P Z A P Z B = δ
\sum_{\xi \in Z} \left\lvert \widehat f(\xi) \right\rvert
= \sum_{\xi \in Z} \left\lvert \widehat 1_A(\xi) \right\rvert \left\lvert \widehat 1_B(\xi) \right\rvert
\le \left\lVert \widehat 1_A \right\rVert_{\ell^2Z} \left\lVert \widehat 1_B \right\rVert_{\ell^2Z}
= \left\lVert 1_A \right\rVert_{L^2Z} \left\lVert 1_B \right\rVert_{L^2Z}
= \sqrt{\mathbf P_ZA \mathbf P_ZB}
= \delta
ξ ∈ Z ∑ f ( ξ ) = ξ ∈ Z ∑ 1 A ( ξ ) 1 B ( ξ ) ≤ 1 A ℓ 2 Z 1 B ℓ 2 Z = ∥ 1 A ∥ L 2 Z ∥ 1 B ∥ L 2 Z = P Z A P Z B = δ
we obtain that
∑ 0 ≤ m < M E Z max 1 ≤ j ≤ J ∣ T j r f − f ∣ ≪ δ ( log J d + J d max η ∈ ⋃ S m ∥ η ⋅ r ∥ R / Z ) .
\sum_{0 \le m < M} \mathbf E_Z \max_{1 \le j \le J} \left\lvert T^{jr} f - f \right\rvert
\ll \delta \left( \sqrt{\frac{\log J}{d}}
+ Jd\max_{\eta \in \bigcup S_m} \left\lVert \eta \cdot r \right\rVert_{\mathbb R/\mathbb Z} \right).
0 ≤ m < M ∑ E Z 1 ≤ j ≤ J max T j r f − f ≪ δ ( d log J + J d η ∈ ⋃ S m max ∥ η ⋅ r ∥ R / Z ) .
As for the “error” term, we bound
E Z max 1 ≤ j ≤ J ∣ T j r f e r r − f e r r ∣ ≤ 2 E Z max 1 ≤ j ≤ J ∣ T j r f e r r ∣ ≤ 2 E Z ∑ 1 ≤ j ≤ J ∣ T j r f e r r ∣ ≤ 2 ∑ 1 ≤ j ≤ J E Z ∣ T j r f e r r ∣ ≤ 2 ∑ 1 ≤ j ≤ J E Z ∣ f e r r ∣ ≤ 2 J E Z ∣ f e r r ∣ ≤ 2 J ∥ f e r r ∥ L 2 Z = 2 J ∥ f ^ e r r ∥ ℓ 2 Z = 2 J ∑ ξ ∈ Z e r r ∣ f ^ e r r ( ξ ) ∣ 2 ≤ 2 J max ξ ∈ Z e r r ∣ f ^ e r r ( ξ ) ∣ ∑ ξ ∈ Z e r r ∣ f ^ e r r ( ξ ) ∣ ≤ 2 J 2 − M δ 2 ⋅ δ = 2 J 2 − M / 2 δ 3 / 2 ≤ 2 J 2 − M / 2 δ .
\begin{aligned}
\mathbf E_Z \max_{1 \le j \le J} \left\lvert T^{jr} f_{\mathrm{err}} - f_{\mathrm{err}} \right\rvert
&\le 2\mathbf E_Z \max_{1 \le j \le J} \left\lvert T^{jr} f_{\mathrm{err}} \right\rvert \\
&\le 2\mathbf E_Z \sum_{1 \le j \le J} \left\lvert T^{jr} f_{\mathrm{err}} \right\rvert \\
&\le 2\sum_{1 \le j \le J} \mathbf E_Z \left\lvert T^{jr} f_{\mathrm{err}} \right\rvert \\
&\le 2\sum_{1 \le j \le J} \mathbf E_Z \left\lvert f_{\mathrm{err}} \right\rvert \\
&\le 2J \mathbf E_Z \left\lvert f_{\mathrm{err}} \right\rvert \\
&\le 2J \left\lVert f_{\mathrm{err}} \right\rVert_{L^2Z} \\
&= 2J \left\lVert \widehat f_{\mathrm{err}} \right\rVert_{\ell^2 Z} \\
&= 2J \sqrt{\sum_{\xi \in Z_{\mathrm{err}}} \left\lvert \widehat f_{\mathrm{err}}(\xi) \right\rvert^2} \\
&\le 2J \sqrt{\max_{\xi \in Z_{\mathrm{err}}} \left\lvert \widehat f_{\mathrm{err}}(\xi) \right\rvert
\sum_{\xi \in Z_{\mathrm{err}}} \left\lvert \widehat f_{\mathrm{err}}(\xi) \right\rvert} \\
&\le 2J \sqrt{2^{-M}\delta^2 \cdot \delta} \\
&= 2J 2^{-M/2} \delta^{3/2} \\
&\le 2J 2^{-M/2} \delta.
\end{aligned}
E Z 1 ≤ j ≤ J max T j r f err − f err ≤ 2 E Z 1 ≤ j ≤ J max T j r f err ≤ 2 E Z 1 ≤ j ≤ J ∑ T j r f err ≤ 2 1 ≤ j ≤ J ∑ E Z T j r f err ≤ 2 1 ≤ j ≤ J ∑ E Z ∣ f err ∣ ≤ 2 J E Z ∣ f err ∣ ≤ 2 J ∥ f err ∥ L 2 Z = 2 J f err ℓ 2 Z = 2 J ξ ∈ Z err ∑ f err ( ξ ) 2 ≤ 2 J ξ ∈ Z err max f err ( ξ ) ξ ∈ Z err ∑ f err ( ξ ) ≤ 2 J 2 − M δ 2 ⋅ δ = 2 J 2 − M /2 δ 3/2 ≤ 2 J 2 − M /2 δ .
Thus, putting these altogether we need to find R ≠ 0 R \neq 0 R = 0 such that
log J d + J d max η ∈ ⋃ S m ∥ η ⋅ r ∥ R / Z + 2 J ⋅ 2 − M / 2 ≪ δ .
\sqrt{\frac{\log J}{d}} + Jd \max_{\eta\in\bigcup S_m}
\left\lVert \eta \cdot r \right\rVert_{\mathbb R/\mathbb Z} + 2J \cdot2^{-M/2} \ll \delta.
d log J + J d η ∈ ⋃ S m max ∥ η ⋅ r ∥ R / Z + 2 J ⋅ 2 − M /2 ≪ δ .
Now set M ≍ log J M \asymp \log J M ≍ log J and d ≍ δ − 2 log J d \asymp \delta^{-2} \log J d ≍ δ − 2 log J ,
so the first and third terms are less than 1 3 c δ \frac13 c \delta 3 1 cδ , since by hypothesis
δ ≫ ( log log N ) 3 log N \delta \gg \sqrt{\frac{(\log \log N)^3}{\log N}} δ ≫ log N ( log log N ) 3
from which we deduce
J ≫ exp ( C δ 2 log N 3 ) = exp ( C log log N ) ≥ ( log N ) C ≫ δ − 1 .
J \gg \exp\left( C\sqrt[3]{\delta^2\log N} \right)
= \exp\left( C\log \log N \right)
\ge (\log N)^C
\gg \delta^{-1}.
J ≫ exp ( C 3 δ 2 log N ) = exp ( C log log N ) ≥ ( log N ) C ≫ δ − 1 .
Thus it suffices that
max η ∈ S ∥ η ⋅ r ∥ R / Z ≪ δ 3 J log J \max_{\eta\in S} \left\lVert \eta \cdot r \right\rVert_{\mathbb R/\mathbb Z} \ll \frac{\delta^3}{J \log J} η ∈ S max ∥ η ⋅ r ∥ R / Z ≪ J log J δ 3
where S = ⋃ S m S = \bigcup S_m S = ⋃ S m . Note ∣ S ∣ ≤ d M ≪ ( log J δ ) 2 \left\lvert S \right\rvert \le dM \ll \left( \frac{\log J}{\delta} \right)^2 ∣ S ∣ ≤ d M ≪ ( δ l o g J ) 2 .
Now we recall the result that
Bohr ( S , ρ ) ≥ ∣ Z ∣ ρ ∣ S ∣ \operatorname{Bohr}(S, \rho) \ge |Z| \rho^{|S|} Bohr ( S , ρ ) ≥ ∣ Z ∣ ρ ∣ S ∣
and so it suffices for us that
N ⋅ ( c 1 δ 3 J log J ) c 2 ( δ − 1 log J ) 2 > 1 N \cdot \left( \frac{c_1 \delta^3}{J \log J} \right) ^{c_2 \left( \delta^{-1} \log J \right)^2} > 1 N ⋅ ( J log J c 1 δ 3 ) c 2 ( δ − 1 l o g J ) 2 > 1
for constants c 1 c_1 c 1 and c 2 c_2 c 2 . Then J = exp ( C δ 2 log N 3 ) J = \exp(C\sqrt[3]{\delta^2 \log N}) J = exp ( C 3 δ 2 log N ) works now.