Skip to main content

Chapter 4 Dirichlet characters and \(L\)-functions

In this chapter, we introduce some special multiplicative functions, the Dirichlet characters. We then study their corresponding Dirichlet series, the Dirichlet \(L\)-functions, which behave quite similarly to the Riemann zeta function.

Section 4.1 Dirichlet characters

Definition 4.1.

For \(N\) a positive integer, a Dirichlet character of level \(N\) is an arithmetic function \(\chi\) which factors through a homomorphism \((\ZZ/N\ZZ)^\times \to \CC\) on integers \(n \in \NN\) coprime to \(N\text{,}\) and is zero on integers not coprime to \(N\text{;}\) such a function is completely multiplicative (Definition 3.7). Note that the nonzero values must all be \(N\)-th roots of unity, and that the characters of level \(N\) form a group under termwise multiplication.
For each level \(N\text{,}\) there is a Dirichlet character taking the value 1 at all \(n\) coprime to \(N\text{;}\) it is called the principal character of level \(N\) (or the trivial character of level \(N\)). A nonprincipal Dirichlet character of level \(N\) is given by the Legendre–Jacobi symbol
\begin{equation*} \chi(n) = \legendre{n}{N}. \end{equation*}
The sum is invariant under multiplication by \(\chi(m)\) for any \(m \in \NN\) coprime to \(N\text{,}\) but if \(\chi\) is nonprincipal, then we can choose \(m\) with \(\chi(m) \neq 1\text{.}\)

Definition 4.3.

Sometimes a Dirichlet character of level \(N\) can be written as the termwise product of the principal character of level \(N\) with a character of some level \(N' \lt N\) (of course \(N'\) must divide \(N\)). We say the character is imprimitive in this case and primitive otherwise.

Section 4.2 \(L\)-series

Definition 4.4.

The Dirichlet series associated to a Dirichlet character \(\chi\) of level \(N\) is called a Dirichlet \(L\)-series of level \(N\) (or Dirichlet \(L\)-function of level \(N\)), denoted \(L(s,\chi)\text{.}\) (It may also be denoted \(L_\chi(s)\) so that one can refer to \(L_\chi\) as a function without explicitly naming the variable.) Since \(\chi\) is completely multiplicative, \(L(s,\chi)\) formally factors as
\begin{equation} \prod_p (1 - \chi(p) p^{-s})^{-1}.\tag{4.2.1} \end{equation}
In particular, if \(\chi\) is imprimitive corresponding to the character \(\chi'\) of level \(N'\text{,}\) then
\begin{equation} L(s,\chi) = L(s,\chi') \prod_{p | N, p \not| N'} (1 - \chi'(p) p^{-s}).\tag{4.2.2} \end{equation}
(Note that (4.2.2) reduces to \(L(s,\chi) = L(s,\chi')\) if \(N\) and \(N'\) have the same prime factors, e.g., if \(N'\) is prime and \(N = (N')^2\text{.}\)) In particular, the abscissa of absolute convergence of the principal character of level \(N\text{,}\) and hence of each of the characters of level \(N\text{,}\) is 1, and the product representation (4.2.1) is valid for \(\Real(s) > 1\text{.}\) In particular, \(L(s,\chi) \neq 0\) for \(\Real(s) > 1\text{.}\)
If \(\chi\) is principal, then by (4.2.2),
\begin{equation*} L(s,\chi) = \zeta(s) \prod_{p | N} (1 - p^{-s}), \end{equation*}
and the claims about \(L(s,\chi)\) follow from what we already know about \(\zeta\text{.}\) So assume hereafter that \(\chi\) is nonprincipal. By partial summation, we can write
\begin{equation} L(s,\chi) = \sum_{n=1}^\infty (\chi(1) + \cdots + \chi(n))(n^{-s} - (n+1)^{-s}).\tag{4.2.3} \end{equation}
Since \(\chi(1) + \cdots + \chi(N) = 0\) by Lemma 4.2, the quantities \(\chi(1) + \cdots + \chi(n)\) are bounded for all \(n\text{.}\) Meanwhile,
\begin{align*} n^{-s} - (n+1)^{-s} \amp= n^{-s} (1 - (1 + 1/n)^{-s})\\ \amp= s n^{-s-1} + O(n^{-s-2}), \end{align*}
where the implied constant in the big \(O\) can be taken uniform over \(s\) in a compact set. Consequently, the sum representation for \(L(s,\chi)\) given by (4.2.3) converges uniformly for \(\Real(s) \geq \epsilon\) for any \(\epsilon > 0\text{.}\) This yields the claim.

Section 4.3 Nonvanishing of \(L\)-functions on \(\Real(s) = 1\)

Much as we used nonvanishing of \(\zeta\) on the line \(\Real(s) = 1\) to study the prime number theorem, we will use nonvanishing of \(L\)-functions on that line to study the prime number theorem in arithmetic progressions. An additional wrinkle, though, is that we have to do some extra work to understand what is going on at \(s=1\) itself; see next section.
Note that for \(\Real(s) > 1\text{,}\) we have
\begin{equation} \log f_N(s) = \sum_{p: (p,N) = 1} \sum_{n=1}^\infty \left( \sum_\chi \chi(p^n) \right) p^{-ns},\tag{4.3.1} \end{equation}
which is a Dirichlet series with nonnegative real coefficients. (The sum over \(\chi\) is invariant under multiplication by \(\chi(p^n)\) for any single \(\chi\text{,}\) so either the sum is zero or all of the summands are equal to 1.) We may thus apply Lemma 4.6.
This tells us a lot about nonvanishing of individual \(L\)-functions, but not quite everything.
Let \(N\) be the level of \(\chi\text{.}\) Then \(f_N(s)\) is a product of functions, one of which is \(L(s,\chi)\text{,}\) all of which are holomorphic at \(s\text{.}\) By Theorem 4.7, \(f_N(s)\) has no zero at \(s\text{,}\) so none of the factors can either.
It will take a bit more work to deal with \(s=1\text{;}\) see Section 4.4.

Section 4.4 Nonvanishing for \(L\)-functions at \(s=1\)

At \(s=1\) (the so-called critical point for Dirichlet \(L\)-functions), life is a bit more complicated; to deduce that none of the \(L(1, \chi)\) vanish, we need to know that the function \(f_N(s)\) in Theorem 4.7 has a simple pole, rather than being holomorphic, at \(s=1\text{.}\)

Definition 4.9.

We say a Dirichlet character is real if it takes values in \(\pm 1\text{,}\) and nonreal (or complex) otherwise.
Let \(N\) be the level of \(\chi\text{.}\) If \(L(1,\chi) = 0\text{,}\) then also \(L(1, \overline{\chi}) = 0\text{,}\) where \(\overline{\chi}\) denotes the complex conjugate character. But then \(f_N(s)\) is the product of one factor with a simple pole at \(s=1\) (coming from the principal character), two factors with zeroes at \(s=1\) (coming from \(\chi\) and \(\overline{\chi}\)), and a bunch of factors which are holomorphic at \(s=1\text{.}\) This would force \(f_N(s)\) to have a zero at \(s=1\text{,}\) contradicting Theorem 4.7.
For the real characters, the above argument fails because \(\overline{\chi}\) and \(\chi\) are the same character, so they don't give two different contributions to \(f_N(s)\text{.}\) Instead, we use a different trick. (There are a number of proofs of this result; see Exercise 4.6.5 for a second approach.)
Assume on the contrary that \(L(1,\chi) = 0\text{.}\) Define
\begin{equation*} \psi(s) = \frac{L(s,\chi) L(s,\chi_0)}{L(2s,\chi_0)}, \end{equation*}
where \(\chi_0\) is the principal character of level \(N\text{.}\) Then the numerator of \(\psi\) is holomorphic for \(\Real(s) > 0\text{,}\) because \(L(s,\chi)\) counterbalances the simple pole of \(L(s,\chi_0)\) at \(s=1\text{.}\) On the other hand, the denominator of \(\psi\) is holomorphic and nonzero for \(\Real(s) > 1/2\text{;}\) moreover, it extends meromorphically to a neighborhood of \(s=1/2\) with a simple pole at \(s=1/2\text{.}\) Therefore \(\psi\) is holomorphic for \(\Real(s) > 1/2\text{,}\) and extends holomorphically to a neighborhood of \(1/2\) with a simple zero at \(s=1/2\text{.}\)
However, \(\psi(s)\) admits the formal factorization
\begin{equation*} \psi(s) = \prod_{p: \chi(p) = 1} \left( \frac{1 + p^{-s}}{1 - p^{-s}} \right) \end{equation*}
and so expands as a Dirichlet series with nonnegative real coefficients and constant coefficient 1. The product factorization converges absolutely for \(\Real(s) > 1\text{,}\) so the Dirichlet series does too. But \(\psi\) is holomorphic for \(\Real(s) > 1/2\text{,}\) so Theorem 3.4 implies that the Dirichlet series converges absolutely on \(\Real(s) > 1/2\text{.}\)
This yields \(\psi(s) \geq 1\) for \(s > 1/2\text{,}\) whereas \(\lim_{s \to (1/2)^+} \psi(s) = 0\text{,}\) contradiction.

Remark 4.12.

The proofs above have the merit that one could rewrite them without using complex analysis, in order to obtain a complex analysis-free proof of Dirichlet's theorem. (Dirichlet was working before the properties of complex analytic functions were completely understood, so his proofs tend to only involve real \(s\text{.}\)) However, in the context of complex analysis one can also interpret them as follows. Suppose at least one of the \(L(s,\chi)\) had a zero at \(s=1\text{;}\) then \(f_N(s)\) would be holomorphic on \(\Real(s) > 0\text{.}\) Since \(\log f_N(s)\) has nonnegative real coefficients, so does \(f_N(s)\) by formal exponentiation. Theorem 3.4 would then imply that the Dirichlet series for \(f_N(s)\) converges absolutely for \(\Real(s) > 0\text{.}\) However, since the Dirichlet series for \(\log f_N(s)\) diverges for \(s = 1 - 1/\phi(N)\) (Exercise 4.6.3), so does the series for \(f_N(s)\text{,}\) contradiction.

Section 4.5 Historical aside: Dirichlet's class number formula

Continuing the thread from Remark 4.12, we recall Dirichlet's original approach to Theorem 4.11: for \(\chi\) a real nonprincipal Dirichlet character, one can express the value \(L(1,\chi)\) in terms of a important numerical invariant, the class number of binary quadratic forms of a given discriminant. That number evidently being positive, Dirichlet could not only establish the nonvanishing of \(L(1,\chi)\text{,}\) but also its sign (but see Exercise 4.6.5 for another approach to the latter).
Nowadays, one typically expresses this in the language of algebraic number theory. If you are not familiar with this language, feel free to ignore the rest of this section.
Let \(K\) be a quadratic number field, and let \(\chi_K\) be the character such that
\begin{equation*} \chi_K(p) = \begin{cases} 1 \amp \mbox{ is unramified and split in } \\ -1 \amp \mbox{ is unramified and inert in } \\ 0 \amp \mbox{ is ramified in } \end{cases} \end{equation*}
One then proves that \(L(1,\chi_K)\) is equal to the class number \(h_K\) of \(K\) times the regulator \(R_K\text{.}\) (The latter equals 1 if \(K\) is imaginary quadratic, and otherwise is equal to a fixed normalization factor times the logarithm of the fundamental unit of \(K\text{.}\))
The point here is that up to multiplication by Euler factors for the ramified primes, \(L(1, \chi_0) L(1, \chi_K)\) is equal to the Dedekind zeta function \(\zeta_K\) of \(K\text{,}\) defined by
\begin{equation*} \zeta_K(s) := \sum_{\gotha} \Norm(\gotha)^{-s}, \end{equation*}
for \(\gotha\) running over nonzero ideals of the ring of integers \(\gotho_K\text{.}\) For a general number field \(K\text{,}\) \(\zeta_K\) has a simple pole at 1, whose residue is the class number of \(K\) times the regulator of \(K\) times a normalization factor (determined by the number of real and complex places of \(K\)); the point is that each factor in this product is visibly nonzero.
One sometimes turns this around and tries to use analytic information about \(\zeta_K\) to get information about the product \(h_K R_K\text{.}\) It is quite difficult to separate the two factors in this expression; indeed, one can make a good case that they really are simply two separate factors in the computation of the volume of a certain compact topological group, the Arakelov class group of \(R\text{,}\) whose group of components is isomorphic to the usual class group.
A notable exception is that there is no regulator for an imaginary quadratic field, so you can get good bounds in this case. For instance, the Brauer–Siegel theorem says that the class number of an imaginary quadratic field of discriminant \(D\) is at least \(c_\epsilon D^{1/2 - \epsilon}\) for any \(\epsilon>0\text{,}\) though unfortunately the constant \(c_\epsilon\) cannot be effectively determined from \(\epsilon\text{.}\) The best effective results are due to Goldfeld [7], who proves an effective lower bound which is polynomial in \(\log(D)\text{;}\) this is a far cry from the truth, but can be used to give a new resolution of Gauss's conjecture that there are exactly nine imaginary quadratic fields of class number 1 [26], or to determine all imaginary quadratic number fields with class number up to 100 [27].

Exercises 4.6 Exercises

2.

Let \(f\) be a meromorphic function on some neighborhood of \(\Real(s) \geq L\text{,}\) with a pole of order \(e > 0\) at \(s=L\) and no other poles. Suppose that \(\log f(s)\) is represented by a Dirichlet series with nonnegative real coefficients and abscissa of absolute convergence \(\leq L\text{.}\) Prove that every zero of \(f\) on the line \(\Real(s) = L\) has multiplicity \(\leq e/2\text{.}\) (For \(e=1\text{,}\) this implies Lemma 4.6.)

3.

Prove directly that the Dirichlet series in (4.3.1) does not converge for \(s=1 - 1/\phi(N)\text{.}\)
Hint.
For every Dirichlet character \(\chi\) of order \(N\text{,}\) \(\chi^{\phi(N)}\) is principal.

4.

Let \(\chi\) be a real nonprincipal character. Use Dirichlet's hyperbola method (Exercise 3.4.9) to show that
\begin{equation*} \sum_{n \leq x} f(n) n^{-1/2} = 2 L(1,\chi) x^{1/2}+ O(1), \end{equation*}
for \(f(n) = \sum_{d|n} \chi(d)\text{.}\)

5.

Use the previous exercise to show that \(L(1,\chi) > 0\text{,}\) thus giving a second proof of Theorem 4.11.
Hint.
Prove that \(f(n) \geq 1\) if \(n\) is a perfect square, and \(f(n) \geq 0\) otherwise.