In this chapter, we convert the additive large sieve inequality (Theorem 16.5), which involves characters of the additive group, into a result about Dirichlet characters.
Section17.1A specialization of the additive large sieve
We will use the additive large sieve inequality (Theorem 16.5) in the special case
\begin{equation}
S = \{a/q: 1 \leq q \leq Q, 0 \leq a \lt q, \gcd(a,q) = 1\}.\tag{17.1.1}
\end{equation}
Note that if \(a/q, a'/q' \in S\) are distinct and \(m \in \ZZ\text{,}\) then
We now ask the question: what if we replace the exponentials in the large sieve by the primitive Dirichlet characters of all moduli \(q \leq Q\text{?}\) (One can prove a stronger inequality in which some terms correspond to imprimitive characters, but we won't need this.)
Theorem17.2.Bombieri–Davenport.
Fix positive integers \(Q,N\text{.}\) For any \(a_n \in \CC\) for \(M \lt n \leq M+N\text{,}\) we have
As in the proof of the functional equation for Dirichlet \(L\)-functions (Chapter 7), we use the expansion of primitive Dirichlet characters in terms of Gauss sums:
Summing over \(1 \leq q \leq Q\) and \(\chi\) primitive gives the left side of (17.2.1).
To obtain an upper bound, we sum over \(1 \leq q \leq Q\) and all\(\chi\text{,}\) primitive or not. By orthogonality of characters for the group \((\ZZ/q\ZZ)^\times\) (Theorem 5.10), this yields
as an upper bound for the left side of (17.2.1). Applying Theorem 17.1 gives the right side of (17.2.1), completing the proof.
Section17.3An application of the large sieve
We will use the large sieve crucially in the Bombieri–Vinogradov theorem, but first let us illustrate its use with one of its original applications, due to Linnik.
Definition17.3.
Let \(\{a_n\}_n\) be a sequence of complex numbers \(a_n\) with finite support. Let \(P\) be a set of primes. For each \(p \in P\text{,}\) we wish to exclude a set of residue classes \(\Omega_p\) of size \(\omega(p)\text{.}\) That is, we wish to compute \(Z\text{,}\) the sum of \(a_n\) over those \(n\) which do not reduce to a class in \(\Omega_p\) for any \(p \in P\text{.}\)
So far, we are in the setup of the sieve of Eratosthenes (Definition 12.6), but with slightly different notation. The difference is that we do not require \(\omega(p)\) to be as small as before; that's what makes this a “large sieve”.
Theorem17.4.
With notation as in Definition 17.3, suppose that the support of \(a_n\) belongs to an interval of length \(N\) and that \(\omega(p) \lt p\) for all \(p \in P\text{.}\) Let \(h\) be the multiplicative function with \(h(q) = 0\) for \(q\) not squarefree and
We first reduce to the case where \(q\) is prime. Suppose \(q = q_1q_2\) and we know the desired result for both \(q_1\) and \(q_2\text{.}\) By the Chinese remainder theorem,
It thus remains to prove the case where \(q\) is prime; we leave this case as an exercise (Exercise 17.4.3).
Here is Linnik's application of the large sieve.
Definition17.6.
For \(p\) prime, let \(q(p)\) be the least positive integer which is not a quadratic residue modulo \(p\text{.}\) It is conjectured that \(q(p) = O(p^\epsilon)\) for any \(\epsilon > 0\text{,}\) but unconditionally this is only known for \(\epsilon > e^{-1/2}/4 \cong 0.152\text{.}\) On the other hand, under the generalized Riemann Hypothesis (Conjecture 11.4), one can even improve this to \(q(p) = O(\log^2 p)\text{.}\)
Theorem17.7.Linnik.
For any fixed \(\epsilon>0\text{,}\) there exists \(c = c(\epsilon)\) such that for any \(N\text{,}\) there are at most \(c\) primes \(p \leq N\) such that \(q(p) > N^\epsilon\text{.}\)
For convenience, we will prove instead that for some \(c = c(\epsilon)\text{,}\) for any \(N\) there are at most \(c\) primes \(p\leq \sqrt{N}\) with \(q(p) > N^\epsilon\text{.}\)
Let \(P\) be the set of primes \(p \leq \sqrt{N}\) such that \(\legendre{n}{p} = 1\) for all \(n \leq N^\epsilon\text{,}\) and let \(\Omega_p\) be the classes of quadratic nonresidues mod \(p\text{.}\) (This is indeed a large sieve because \(\omega(p) = (p-1)/2\text{,}\) so \(h(p) = (p-1)/(p+1) \sim 1/2\) as \(p \to \infty\text{,}\) whereas in our earlier examples \(\omega(p)\) was bounded.)
We will now sieve on the set \(\{1, \dots, N\}\text{,}\) i.e., take \(a_n = 1\) for \(1 \leq n \leq N\) and \(a_n = 0\) otherwise. The resulting sifted set includes all \(n \leq N\) with no prime divisors greater than \(N^\epsilon\text{;}\) if we let \(Z_\epsilon\) be the number of these, then Theorem 17.4 applied with \(Q = \sqrt{N}\) yields
On the other hand, if we let \(X_\epsilon\) be the number of primes \(p \leq \sqrt{N}\) with \(q(p) > N^\epsilon\text{,}\) then because \(h(p) \geq 1/3\) for all \(p\text{,}\)
To conclude, we need to show that \(Z_\epsilon \geq cN\) for some \(c>0\text{.}\) In fact it can be shown that \(Z_\epsilon \sim cN\) for some \(N\text{,}\) but as we don't care about the particular constant, it will suffice to exhibit a special class of numbers being counted by \(Z_\epsilon\) which are sufficiently numerous. Namely, take \(n = m p_1 \cdot p_k \leq N\) with \(N^{\epsilon - \epsilon^2} \lt p_j \lt N^\epsilon\) for \(j=1, \dots, k =
\epsilon^{-1}\text{;}\) then
Prove the following multivariate version of the additive large sieve inequality (but without optimizing the constant). Fix \(\delta > 0\) and \(d \geq 1\text{,}\) and let \(\alpha_i = (\alpha_{i,1},
\dots, \alpha_{i,d}) \in \RR^d/\ZZ^d\) be points which are \(\delta\)-spaced, in the sense that the distance from each \(\alpha_{i,k}-\alpha_{j,k}\) to the nearest integer is at least \(\delta\) (whenever \(i \neq j\) and \(1 \leq k \leq d\)). Prove that there exists \(c = c(d)\) (independent of \(\delta\) and the \(\alpha_i\)) such that for any \(a_n \in \CC\) with \(n\) running over \(\{1, \dots, N\}^d\text{,}\)
\begin{equation*}
\sum_i \left| \sum_n a_n \exp(2 \pi i (n \cdot \alpha_i)) \right|^2
\leq c (\delta^{-d} + N^d) \sum_n |a_n|^2.
\end{equation*}
2.
Prove directly (by expanding the squares) that if we take take all characters, not just the primitive ones, of a single modulus \(q\text{,}\) then the large sieve inequality holds with the constant \(q+N\text{.}\) (This is not very useful in practice.)
3.
Prove Lemma~\ref{L:linnik} in the case that \(q\) is prime.
There is no loss of generality in assuming that there is at most one \(n\) in each residue class modulo \(p\text{,}\) and none in the classes in \(\Omega_p\text{,}\) such that \(a_n \neq 0\text{.}\) Then use orthogonality of characters on \(\ZZ/q\ZZ\text{.}\)