6 Geometry of numbers
In this chapter we will develop the machinery necessary for proving Theorem 4.0.3.
A lattice \(\Lambda \subset \mathbb {R}^n\) is a subgroup (under addition) generated by \(n\) linearly independent vectors.
If \(\Lambda \) is a lattice in \(\mathbb {R}^n\) then
where \(e_i\) are linearly independent vectors over \(\mathbb {R}\), i.e. there does not exist \(r_i \in \mathbb {R}\) such that \(\sum _i r_ie_i =0\). So its not enough that the \(e_i\) be independent over \(\mathbb {Q}\), so \((1,0)\) and \((\pi ,0)\) do not generate a lattice in \(\mathbb {R}^2\).
If \(\Lambda \subset \mathbb {R}^n\) is a lattice generated by \(e_i\) then
is called the fundamental domain of \(\Lambda \).
Note that if we \(\lambda \in \Lambda \) and let \(P(\Lambda )+\lambda =\{ x+ \lambda \mid x \in P(\Lambda )\} \) then
Let \(\Lambda \subset \mathbb {R}^n\) be a lattice. Then the volume of \(P(\Lambda )\) does not depend on the choice of basis of \(\Lambda \). Moreover, if \(\{ e_i\} \) is the basis, then
(here the right hand side is the determinant of the matrix whose columns are given by the \(e_i\)).
The second statement is just linear algebra, so we will only prove the first. Let \(f_i\) denote a second basis of \(\Lambda \) and let \(M(e_i), M(f_i)\) denote the matrices whose columns are given by \(e_i\) and \(f_i\) respectively. Then
where \(A\) is a \(n \times n\) matrix with entries in \(\mathbb {Z}\). Similarly ,
Therefore,
Now, since \(M(e_i),M(f_i)\) are non-degenerate we have \(BA=I\) and therefore \(\det (A)=\pm 1\).
Let \(S \subset \mathbb {R}^n\) be a measurable set (i.e. \(\operatorname{Vol}(S)=|\idotsint _S dx_1\cdots dx_n|\) exists) and \(\Lambda \) is a lattice. Then if \(\operatorname{Vol}(S) {\gt} \operatorname{Vol}(P(\Lambda ))\) then there exist \(x,y \in S\) with \(x \neq y\) such that \(x-y \in \Lambda \).
Furthermore, if \(S\) is compact, then the same conclusion holds if \(\operatorname{Vol}(S) \geq \operatorname{Vol}(P(\Lambda ))\).
We begin by writing
therefore
From this it follows that
Now, if \(P(\Lambda ) \cap (S -\lambda )\) are all disjoint, then the sum their volume is \({\lt} \operatorname{Vol}(P(\Lambda ))\) contradicting our assumption that \(\operatorname{Vol}(S){\gt} \operatorname{Vol}(P(\Lambda ))\). Therefore two of these sets meet, say \(P(\Lambda ) \cap (S -\lambda )\) and \(P(\Lambda ) \cap (S -\mu )\) (with \(\lambda \neq \mu \)) and therefore we have some \(x-\lambda =y-\mu \) giving \(x-y=\lambda -\mu \in \Lambda \).
For the second part, if \(S\) is now compact with \(\operatorname{Vol}(S) \geq \operatorname{Vol}(P(\Lambda ))\). Then let \(S'=(1+\epsilon )S\) such that \(\operatorname{Vol}(S'){\gt} \operatorname{Vol}(P(\Lambda ))\). Then by the above, we can find \(x,y \in S'\) such that \(x-y \in \Lambda \). Let \(\Lambda _\epsilon \) denote the set of such \(x,y\).
Note that if \(\epsilon ' \leq \epsilon \) then \(\Lambda _{\epsilon '} \subset \Lambda _\epsilon \). Therefore \(\cap _{\epsilon {\gt}0} \Lambda _\epsilon \neq \emptyset \). So let \(\lambda \in \cap _{\epsilon {\gt}0} \Lambda _\epsilon \). We claim that \(\lambda =x-y\) for some \(x,y \in S\). Take \(\epsilon =1/n\) and write \(\lambda =x_n-y_n\) with \(x_n,y_n \in (1+1/n)S\) (which we can do by the first part). Since \(x_n,y_n \in 2S\) for all \(n\) and \(2S\) is compact. So \((x_n,y_n)\) form a sequence in a compact set, so there is a subsequence that converges to a point \((x,y)\). Since \(x_n,y_n \in (1+1/n)S\) we have \(x \in \bigcap (1+1/n)S=S\) and similarly \(y \in S\). Since \(\lambda =x_n-y_n\) for all \(n\) we see that in the limit \(\lambda =x-y\) which then gives the result.
A subset \(S \subset \mathbb {R}^n\) is called:
Convex if whenever \(x,y \in S\) then the line segment joining \(x\) and \(y\) is also contained in \(S\).
Centrally symmetric if whenever \(x \in S\) then \(-x \in S\).
Let \(S\) be a compact, convex and centrally symmetric subset of \(\mathbb {R}^n\) and \(\Lambda \) a lattice. If
then \(S\) contains a point of \(\Lambda \).
Consider the set
Then \(\operatorname{Vol}(\frac{1}{2}S) \geq \operatorname{Vol}(P(\Lambda ))\). So by Lemma 6.0.5 there exist \(x,y \in \frac{1}{2}S\) such that \(x-y \in \Lambda \). We claim that \(x-y \in S\).
Note that \(2x,2y \in S\). Now, since \(S\) is centrally symmetric, we have \(-2y \in S\). Furthermore, since \(S\) is convex,
Thus \(x-y \in S\).
Let now apply this to number theory. Let \(K\) be a number field with \([K:\mathbb {Q}]=n\). Then we have \(n\) embeddings of \(K \hookrightarrow \mathbb {C}\) and in fact if we let \(r_1\) be the number of real embeddings and \(r_2\) the number of complex conjugate embeddings then we have:
Let \(K\) be a number field with \(r_1\) real embeddings and \(r_2\) complex conjugate pairs of embeddings, then the canonical embedding is
given by
\begin{align*} x \mapsto & (\sigma _1(x),\dots ,\sigma _{r_1}(x),\sigma _{r_1+1}(x),\dots ,\sigma _{r_1+r_2}(x))\\ & \mapsto (\sigma _1(x),\dots ,\sigma _{r_1}(x),\Re \sigma _{r_1+1}(x),\Im \sigma _{r_1+1}(x), \dots ,\Re \sigma _{r_1+r_2}(x)), \Im \sigma _{r_1+1}(x) \end{align*}where the first \(r_1\) of the \(\sigma _i\) are the real embeddings then rest are the complex ones and \(\Re ,\Im \) denote real and imaginary parts.
Let \(K=\mathbb {Q}(\sqrt{-d})\) with \(d\) a square-free positive integer. Then the embedding is given by sending \(x+y\sqrt{-d}\) to
\[ (x,y\sqrt{-d}) \in \mathbb {R}^2. \]If \(K=\mathbb {Q}(\sqrt{d})\) with \(d\) a square-free positive integer. Then the embedding is given by sending \(x+y\sqrt{d}\) to
\[ (x+y\sqrt{d},x-y\sqrt{d}) \in \mathbb {R}^2. \]
Let \(K\) be a number field with \([K:\mathbb {Q}]=n\) and \(\Theta : K \to \mathbb {R}^n\) is canonical embedding. Then \(\Theta (\mathcal{O}_K)\) is a lattice in \(\mathbb {R}^n\) and if \(P=P(\Theta (\mathcal{O}_K))\) then
where \(r_2\) is the number of complex conjugate pairs of embeddings.
Furthermore, if \(\mathfrak {a}\subset \mathcal{O}_K\) is an ideal, then \(\Lambda _\mathfrak {a}:=\Theta (\mathfrak {a})\) is a sublattice of \(\Theta (\mathcal{O}_K)\). Moreover,
Let \(\{ e_i\} \) be an integral basis of \(\mathcal{O}_K\). Then
We want to compute
By definition
Now, note that
Using this we have
Doing simple row operations we can transform this into
and then to
Notice the power of \(1/2\) has changed. But now, if we look back at Proposition 2.2.16 we see that this is simply
Lets now look at the sublattice \(\Lambda _\mathfrak {a}\). Note that as additive abelian groups we have \(\mathcal{O}_K \cong \mathbb {Z}^n\) and \(\mathfrak {a}\) is a subgroup of index \(N(\mathfrak {a})\). Since \(\Theta \) is injective we have \(\Lambda _\mathfrak {a}\) is a subgroup of \(\Theta (\mathcal{O}_K)\) of index \(N(\mathfrak {a})\). From this it follows that
(compare this with the proof of Proposition 3.4.5) from which the result follows.
Let \(S_t \subset \mathbb {R}^{r_1} \times \mathbb {C}^{r_2} \cong \mathbb {R}^n\) be a subset given by points \((y_i,z_i) \in \mathbb {R}^{r_1} \times \mathbb {C}^{r_2}\) such that
Then \(S\) is compact, convex and centrally symmetric and moreover
\(S\) is closed and bounded and therefore is compact. \(S\) is also clearly symmetric. For \(\lambda \in (0,1)\) we have
\begin{align*} \sum _i |\lambda y_i+(1-\lambda )y_i’|+& 2 \sum _j |\lambda z_i+(1-\lambda )z_i’| \leq \sum _i |\lambda y_i|+|(1-\lambda )y_i’|+2 \sum _j |\lambda z_i|+|(1-\lambda )z_i’|\\ & = \lambda \sum _i | y_i|+(1-\lambda )\sum _j |z_i| + \lambda \sum _i | y_i’|+(1-\lambda )\sum _j |z_i’| \\ & \leq \lambda +(1-\lambda )=1 \end{align*}From this it follows that \(S\) is also convex.
Note that if \(r_1=1\) and \(r_2\) then \(S=[-t,t]\) which has volume (in this case length) \(2\). Similarly, if \(r_1=0\) and \(r_2=1\) then \(S\) is just a ball in \(\mathbb {C}\) of radius \(\frac{1}{2}\) so has volume (in this case area) \(\frac{\pi t^2}{4}\). We will prove the formula for the volume by induction on \((r_1,r_2)\).
Assume we know the formula for \((r_1,r_2)\). Lets look at the \((r_1+1,r_2)\) case. Here the set is given by points such that
which can be rewritten as
where \(y=y_{r_1+1}\). This set has volume
A slightly more involved, yet still elementary proof gives the \((r_1,r_2+1)\) and thus the result.
Finally, with this we can finally prove Theorem 4.0.3
Let \(K\) be a number field with \(r_1\) real embeddings and \(r_2\) conjugate pairs of complex embeddings. Let \([K:\mathbb {Q}]=n\) and let \(\mathfrak {a}\) be an ideal of \(\mathcal{O}_K\). Then there is an element \(a \in \mathfrak {a}\) such that
Let \(S_t\) be as in Lemma 6.0.11 and pick \(t\) such that
i.e. such that
Then by Lemma 6.0.7 there is an \(a \in \mathfrak {a}\) such that \(\Theta (a) \in S_t\). Then we have
(here we use Proposition 1.7.6). Now, using the arithmetic-geometric mean inequality
for \(z_i\) positive real numbers, we have
by definition of \(S_t\). Using (1) then gives the result.