1 Introduction
The construction of Eisenstein congruences has a long and consequential history. Interesting in their own right, their significance is amplified by the existence of Galois representations attached to the congruent forms, as the ones attached to Eisenstein series are always reducible, while the ones attached to cusp forms are often irreducible. Using various generalizations of the result known as Ribet’s Lemma, they lead to the construction of non-zero elements in Selmer groups. This direction was first explored by Ribet himself in the context of the group
$\operatorname {\mathrm {GL}}_2$
in [Reference Ribet45] and later used by many other authors in a variety of different settings (e.g., [Reference Brown16, Reference Skinner and Urban49, Reference Wiles60]).
In a different direction, such congruences can play a crucial role in proving modularity of deformations of reducible residual Galois representations
${\overline {\rho }}$
(see, e.g., [Reference Berger and Klosin6, Reference Berger and Klosin9, Reference Berger and Klosin10, Reference Calegari17, Reference Skinner and Wiles50, Reference Wake54, Reference Wake and Wang-Erickson56]). In [Reference Calegari17] Calegari introduced a method of proving modularity assuming
${\overline {\rho }}$
is unique up to isomorphism, which relies on proving the principality of the ideal of reducibility of the universal deformation ring R of
${\overline {\rho }}$
. This method was developed further by Berger and Klosin [Reference Berger and Klosin5, Reference Berger and Klosin6, Reference Berger and Klosin9] and Wake and Wang-Erickson [Reference Wake and Wang-Erickson56] and successfully applied in many contexts (see also [Reference Akers1, Reference Huang29]). It relies heavily on the ideas of Bellaiche and Chenevier [Reference Bellaïche and Chenevier4] and their study of generalized matrix algebras (GMAs).
In this article, we pursue both of these directions in the case of Klingen Eisenstein series of level one on the group
$\operatorname {\mathrm {Sp}}_4$
. More precisely, let
$k\geq 12$
be an even integer and
$\phi $
be a classical weight k Hecke eigenform of level
$1$
(i.e., on the group
$\operatorname {\mathrm {GL}}_{2/\mathbf {Q}}$
). Write
$E_{\phi }^{2,1}$
for the (appropriately normalized) Klingen Eisenstein series on
$\operatorname {\mathrm {Sp}}_4$
induced from
$\phi $
. It is a Siegel modular form of weight k and full level. Congruences between Klingen Eisenstein series and cusp forms have been studied previously by Kurokawa [Reference Kurokawa35, Reference Kurokawa36], Katsurada and Mizumoto [Reference Katsurada and Mizumoto32, Reference Mizumoto39], Takeda [Reference Takeda52], and Urban (unpublished). Katsurada and Mizumoto obtain congruences as an application of the doubling method. In this article, we produce congruences via a much shorter argument using results of Yamauchi [Reference Yamauchi61]. The trade-off is that while our proof is much shorter, we obtain congruences only modulo a prime
$\ell ,$
whereas Katsurada and Mizumoto obtain congruences modulo powers of
$\ell $
. However, the hypotheses required for our result are different and less restrictive than those needed in [Reference Katsurada and Mizumoto32]. We show that under certain conditions
$E_{\phi }^{2,1}$
is congruent to some cusp form f of the same weight and level with irreducible Galois representation (Theorem 3.5). This is the first main result of the article. These congruences are governed by the numerator of the (algebraic part) of the symmetric square L-function
$L_{\mathrm {alg}}(2k-2, \mathrm {Sym}^2\phi )$
of
$\phi $
. We also exhibit a concrete example when the assumptions of Theorem 3.5 are satisfied (see Example 3.6).
We then proceed to show that these congruences give rise (under some assumptions) to non-trivial elements in the Selmer group
$H_{2-k}:=H^1_f(\mathbf {Q}, \operatorname {\mathrm {ad}} \rho _{\phi }(2-k)\otimes \mathbf {Q}_{\ell }/\mathbf {Z}_{\ell })$
. Here,
$\rho _{\phi }$
is the Galois representation attached to
$\phi $
by Deligne and we use the Fontaine–Laffaille condition at
$\ell $
. Assuming the Vandiver Conjecture for
$\ell $
we also deduce the non-triviality of the Selmer group
$H^1_f(\mathbf {Q}, \operatorname {\mathrm {ad}}^0 \rho _{\phi }(2-k)\otimes \mathbf {Q}_{\ell }/\mathbf {Z}_{\ell })$
(Corollary 5.7 and Remark 5.8). This is our second main result and gives evidence for new cases of the Bloch–Kato conjecture. This conjecture was studied for other twists of
$\operatorname {\mathrm {ad}} \rho _\phi $
by [Reference Diamond, Flach and Guo20, Reference Klosin34]. In [Reference Urban53] Urban assumed the existence of Klingen Eisenstein congruences to prove a result toward the main conjecture of Iwasawa theory for the adjoint L-function.
To properly analyze these Selmer groups, we require some results on extensions of Fontaine–Laffaille modules whose proofs appear to be absent in the literature. In Section 4, we carefully study certain aspects of Fontaine–Laffaille theory, in particular, prove the Hom-tensor adjunction formula and give a precise definition of Selmer groups with coefficient rings of finite length.
Given the eigenvalue congruence
$E^{2,1}_{\phi }\equiv f$
(mod
$\ell $
), we also study deformations of a non-semi-simple Galois representation
${\overline {\rho }}: G_{\mathbf {Q}} \to \operatorname {\mathrm {GL}}_4(\overline {\mathbf {F}}_{\ell })$
whose semi-simplification arises from the Klingen Eisenstein series. Such a representation is reducible with two two-dimensional Jordan–Holder blocks and more precisely, one has

Conjecturally such representations should arise as mod
$\ell $
reductions of Galois representations attached to Siegel cusp forms which are congruent to
$E_{\phi }^{2,1}$
mod
$\ell $
. We assume that
$\dim H_{2-k}[\ell ]=1$
, where
$[\ell ]$
indicates
$\ell $
-torsion. This can be seen as a refinement of the uniqueness assumption of [Reference Skinner and Wiles50] similar to the one in [Reference Berger and Klosin6] and as in [Reference Berger and Klosin6, Reference Calegari17] we prove the principality of the reducibility ideal of the universal deformation. However, this principality cannot be achieved through the method of [Reference Berger and Klosin6] because the representation in question fails to satisfy the strong self-duality property required for the method of [loc.cit.]. Instead we improve on a recent result of Akers [Reference Akers1] which replaces the self-duality condition with a one-dimensionality assumption on the Selmer group
$H_{k-2}:=H^1_f(\mathbf {Q}, \operatorname {\mathrm {ad}} {\overline {\rho }}_{\phi }(k-2))$
of the “opposite” Tate twist of
$\operatorname {\mathrm {ad}} \rho _{\phi }$
. With these assumptions in place, we are able to show that the universal deformation ring R is a discrete valuation ring and prove a modularity result guaranteeing that the unique deformation of
${\overline {\rho }}$
indeed arises from a Siegel cusp form congruent to
$E_{\phi }^{2,1}$
(Theorem 6.20). This is the third main result of the article.
We then proceed to formulate conditions for non-cyclicity of the Selmer group
$H_{k-2}$
. While many results in the literature give bounds on the orders of Selmer groups (in particular, Corollary 5.7 gives such a lower bound on
$H_{2-k}$
), the structure of these groups is notoriously mysterious. In this article, we prove that if the (local) Klingen Eisenstein ideal
$J_{\mathfrak {m}}$
is not principal then
$H_{k-2}$
is not cyclic (Corollary 7.3). We further refine this result by providing a criterion for non-principality in terms of the depth of congruences between cusp forms and
$E^{2,1}_{\phi }$
(Corollary 7.5). An intriguing feature of these results is that
$H_{k-2}$
is non-critical, i.e., this Selmer group is not controlled by a critical L-value in the sense of Deligne.
2 Background and notation
Given a field
$F,$
we denote by
$G_F$
its absolute Galois group. Fix a rational prime
$\ell>2$
. If M is a topological
$\mathbf {Z}_\ell [G_{F}]$
-module, we will write
$M(n) = M \otimes \epsilon ^{n}$
for the n-th Tate twist where
$\epsilon $
denotes the
$\ell $
-adic cyclotomic character.
For each prime p, we fix an embedding
$\overline {\mathbf {Q}} \hookrightarrow \overline {\mathbf {Q}}_{p}$
. This is equivalent to choosing a prime
$\overline {p}$
of
$\overline {\mathbf {Q}}$
lying over p and fixes an isomorphism
$D_p \cong G_{\mathbf {Q}_p}$
, where
$D_p$
is the decomposition group of
$\overline {p}$
. We will denote by
$I_p\subset D_p$
the corresponding inertia group. We also fix an isomorphism
$\overline {\mathbf {Q}}_{\ell } \cong \mathbf {C}$
.
Let E denote a finite extension of
$\mathbf {Q}_{\ell }$
with valuation ring
$\mathcal {O}$
, uniformizer
$\lambda ,$
and residue field
$\mathbf {F}$
. For a continuous homomorphism
$\rho :G_F \to \operatorname {\mathrm {GL}}_n(\mathcal {O}),$
we write
${\overline {\rho }}: G_F\to \operatorname {\mathrm {GL}}_n(\mathbf {F})$
for the mod
$\lambda $
reduction of
$\rho $
.
For
$n\in \mathbf {Z}_+$
, we denote by
$\operatorname {\mathrm {Mat}}_n$
(resp.,
$\operatorname {\mathrm {GL}}_n$
) the affine group scheme over
$\mathbf {Z}$
of
$n\times n$
(resp., invertible) matrices. Given a matrix
$\gamma \in \operatorname {\mathrm {Mat}}_{2n}$
, we will write it as
$\gamma = \left [ \begin {matrix} a_{\gamma } & b_{\gamma }\\ c_{\gamma } & d_{\gamma } \end {matrix} \right ]$
, where the blocks are in
$\operatorname {\mathrm {Mat}}_{n}$
. Set
$\operatorname {\mathrm {GSp}}_{2n} = \left \{g \in \operatorname {\mathrm {GL}}_{2n} : \, ^t\!g J_{n} g = \mu _{n}(g) J_{n} , \mu _{n}(g) \in \operatorname {\mathrm {GL}}_1\right \},$
where
$J_n=\left [ \begin {matrix} 0_n & -1_n \\ 1_n & 0_n \end {matrix} \right ]$
, where
$1_{n}$
is the n by n identity matrix, and
$\mu _{n}: \operatorname {\mathrm {GL}}_{2n} \rightarrow \operatorname {\mathrm {GL}}_{1}$
is the homomorphism defined via the equation given in the definition. Write
$\operatorname {\mathrm {GSp}}^{+}_{2n}(\mathbf {R})$
for the subgroup of
$\operatorname {\mathrm {GSp}}_{2n}(\mathbf {R})$
consisting of elements g with
$\mu _{n}(g)> 0$
. We set
$\operatorname {\mathrm {Sp}}_{2n} = \ker (\mu _{n})$
and

Note that
$\operatorname {\mathrm {Sp}}_2 = \operatorname {\mathrm {SL}}_2$
, the subgroup scheme of
$\operatorname {\mathrm {GL}}_2$
of matrices of determinant one.
The Siegel upper half-space is given by

where we write
$y>0$
to indicate that y is positive definite. The group
$\operatorname {\mathrm {GSp}}_{2n}^{+}(\mathbf {R})$
acts on
$\mathfrak {h}_{n}$
via
$\gamma z = (a_{\gamma }z + b_{\gamma })(c_{\gamma } z + d_{\gamma })^{-1}$
.
For a function
$f: \mathfrak {h}_{n} \rightarrow \mathbf {C}$
set
$(f|_{\kappa } \gamma )(z) = \mu _{n}(\gamma )^{nk/2} j(\gamma ,z)^{-k} f(\gamma z)$
for
$\gamma \in \operatorname {\mathrm {GSp}}_{2n}^{+}(\mathbf {R})$
and
$z \in \mathfrak {h}_{n}$
, where
$j(\gamma ,z) = \det (c_{\gamma } z+ d_{\gamma })$
. A Siegel modular form of weight k and level
$\Gamma _{n}$
is a holomorphic function
$f: \mathfrak {h}_{n} \rightarrow \mathbf {C}$
satisfying
$(f|_{k}\gamma )(z) = f(z)$
for all
$\gamma \in \Gamma _{n}$
. If
$n=1$
, we also require the standard growth condition at the cusp. We denote the
$\mathbf {C}$
-vector space of Siegel modular forms of weight k and level
$\Gamma _n$
as
$M_{k}(\Gamma _n)$
. Any
$f \in M_{k}(\Gamma _{n})$
has a Fourier expansion of the form

where
$\Lambda _n$
is defined to be the set of n by n half-integral (diagonal entries are in
$\mathbf {Z}$
, off diagonal are allowed to lie in
$\frac {1}{2}\mathbf {Z}$
) positive semi-definite symmetric matrices and
$e(w):=e^{2\pi i w}$
. Given a ring
$A \subset \mathbf {C}$
, we write
$f \in M_{k}(\Gamma _{n};A)$
if
$a(T;f) \in A$
for all
$T \in \Lambda _{n}$
. Define the subspace
$S_k(\Gamma _n)=\ker \Phi \subset M_k(\Gamma _n)$
of cusp forms, where
$\Phi (f)(z) = \lim _{t \rightarrow \infty } f\left (\left [ \begin {matrix} z & 0 \\ 0 & it \end {matrix} \right ] \right ).$
We will now introduce certain Eisenstein series, which will play a prominent role in this article. For
$n \geq 1$
and
$0 \leq r \leq n$
define the parabolic subgroup

We define projections
$\star : \mathfrak {h}_{n} \to \mathfrak {h}_{r}$
,
$z = \left [ \begin {matrix} z^{\star } & * \\ * &* \end {matrix} \right ] \mapsto z^{\star }$
and
$\star : P_{n,r} \to \Gamma _{r}$
,
$\gamma \mapsto \gamma ^{\star } = \left [ \begin {matrix} a_1 & b_1 \\ c_1 & d_1 \end {matrix} \right ].$
Let
$\phi \in S_{k}(\Gamma _{1})$
. The Klingen Eisenstein series attached to
$\phi $
is the series

where
$z \in \mathfrak {h}_{2}$
. The Eisenstein series converges for
$k \geq 12$
(see [Reference Klingen33, Theorem 1, p. 67] for example). Note that [Reference Klingen33, Theorem 1, p. 67] gives
$\Phi (E_{\phi }^{2,1}) = \phi $
.
Given two Siegel modular forms
$f_1, f_2 \in M_{k}(\Gamma _n)$
with at least one a cusp form, set

where
$z=x+iy$
with
$x = (x_{\alpha , \beta })$
,
$y = (y_{\alpha , \beta }) \in \operatorname {\mathrm {Mat}}_{n}(\mathbf {R})$
,
$d\mu z = (\det y)^{-(n+1)} \prod _{\alpha \leq \beta } d x_{\alpha ,\beta } \prod _{\alpha \leq \beta } dy_{\alpha ,\beta }$
with
$dx_{\alpha ,\beta }$
and
$dy_{\alpha ,\beta }$
the usual Lebesgue measure on
$\mathbf {R}$
.
Given
$\gamma \in \operatorname {\mathrm {GSp}}_{2n}^{+}(\mathbf {Q})$
, we write
$T(\gamma )$
to denote the double coset
$\Gamma _{n} \gamma \Gamma _{n}$
and set
$T(\gamma ) f = \sum _{i} f|_{k} \gamma _{i}$
, where the
$\gamma _i$
are given by the finite decomposition
$\Gamma _{n} \gamma \Gamma _{n} = \coprod _{i} \Gamma _{n} \gamma _{i}$
and
$f \in M_{k}(\Gamma _n)$
. Let
$m>1$
. We define
$T^{(n)}(m)$
via

In particular, for p a prime, we have

We also define

The spaces
$M_{k}(\Gamma _{n})$
and
$S_{k}(\Gamma _{n})$
are both stable under the action of
$T^{(n)}(p)$
and
$T_{i}^{(n)}(p^2)$
for
$1 \leq i \leq n$
and all p. We say a nonzero
$f \in M_{k}(\Gamma _{n})$
is an eigenform if it is an eigenvector of
$T^{(n)}(p)$
and
$T_{i}^{(n)}(p^2)$
for all p and all
$1 \leq i \leq n$
. As we will be focused on the case
$n=2$
, we specialize to that case. We let
$\mathbf {T}'$
denote the
$\mathbf {Z}$
-subalgebra of
$\operatorname {\mathrm {End}}_{\mathbf {C}}(S_{k}(\Gamma _{2}))$
generated by the Hecke operators
$T^{(2)}(p)$
and
$T^{(2)}_1(p^2)$
for all primes p.
Recall that
$E/\mathbf {Q}_{\ell }$
denotes a finite extension with valuation ring
$\mathcal {O}$
and uniformizer
$\lambda $
. Given eigenforms
$f_1, f_2 \in M_{k}(\Gamma _n;\mathcal {O})$
, following the notation in [Reference Yamauchi61] we write
$f_1 \equiv _{\operatorname {\mathrm {ev}}} f_2\ \pmod {\lambda }$
if
$\lambda _{f_1}(T) \equiv \lambda _{f_2}(T)\ \pmod {\lambda }$
for all
$T \in \mathbf {T}'$
, where
$Tf_{i} = \lambda _{f_i}(T)f_i$
.
For an eigenform
$\phi \in S_k(\Gamma _1),$
we set

where
$\lambda _{\phi }(p)$
is the eigenvalue of
$T(p):=T^{(1)}(p)$
corresponding to
$\phi $
and
$\alpha _p, \beta _p$
denote the roots of
$X^2-\lambda _{\phi }(p)X+p^{k-1}$
. The symmetric square L-function converges in the right half-plane
$\Re (s)> k$
, satisfies a functional equation, and has analytic continuation to the entire complex plane.
For an eigenform
$f\in S_k(\Gamma _2),$
we define

where we write
$\lambda _f(p)$
is the eigenvalue of
$T^{(2)}(p)$
corresponding to f and
$\lambda _{f}(p^2)$
for the eigenvalue
$T^{(2)}(p^2)$
corresponding to f.
Theorem 2.1 [Reference Weissauer59, Theorem 1]
Let
$f \in S_{k}(\Gamma _2)$
be an eigenform. For a sufficiently large finite extension
$F/\mathbf {Q}_{\ell }$
, one has
$L_{p}(X,f, \operatorname {\mathrm {spin}}) \in F[X]$
for all primes
$p\neq \ell $
and there is a semisimple continuous representation
$\rho _{f}: G_{\mathbf {Q}} \rightarrow \operatorname {\mathrm {GL}}_4(F),$
which is unramified outside of
$\ell $
so that for
$p \neq \ell ,$
one has
$L_p(X,f;\operatorname {\mathrm {spin}}) = \det (1 - \rho _{f}(\operatorname {\mathrm {Frob}}_p)X)$
.
3 Congruence
We keep the notation of Section 2. Throughout this section, we fix an even weight
$k \geq 12$
and an odd prime
$\ell $
and make the following assumption.
Assumption 3.1 Given an even weight
$k \geq 12$
and prime
$\ell $
, assume that
$E/\mathbf {Q}_\ell $
is sufficiently large to contain the fields F from Theorem 2.1 for all forms
$f \in S_k(\Gamma _2)$
. We also assume that for every eigenform
$\phi \in S_k(\Gamma _1),$
the field E contains all the Hecke eigenvalues of
$\phi $
as well as the value
$L_{\mathrm {alg}}(2k-2, {\operatorname {Sym}^2}\phi )$
(see (3.1) for the definition). In addition, we suppose that E contains a primitive cube root of unity.
Recall that we denote the valuation ring of E by
$\mathcal {O}$
. Let
$\phi \in S_{k}(\Gamma _1)$
be a normalized eigenform and consider the Klingen Eisenstein series
$E_{\phi }^{2,1}$
. In this section, we show under certain conditions that
$E_{\phi }^{2,1}$
is eigenvalue-congruent to a cuspidal Siegel modular form with irreducible Galois representation.
Write

For T that are singular, i.e.,
$\det T = 0$
, one has T is unimodularly equivalent to
$\left [ \begin {matrix} n & 0 \\ 0 & 0 \end {matrix} \right ]$
for some
$n \in \mathbf {Z}_{\geq 0}$
. For such T, one has
$a(T;E^{2,1}_{\phi }) = a(n;\phi ),$
where
$\phi (z) = \sum _{n> 0} a(n;\phi ) e(nz)$
.
We use the following result to prove our congruence.
Corollary 3.2 [Reference Yamauchi61, Corollary 2.3]
Assume
$\ell \geq 7$
. Let g be a Hecke eigenform in
$M_{k}(\Gamma _2;\mathcal {O})$
with Fourier expansion
$g(z) = \sum _{T \in \Lambda _2} a(T;g)e(\operatorname {\mathrm {Tr}}(Tz))$
. Assume that
$\lambda \mid a(T;g)$
for all T with
$\det T =0$
and that there exists at least one
$T> 0$
with
$a(T;g) \in \mathcal {O}^{\times }$
. Then, there exists a Hecke eigenform
$f \in S_{k}(\Gamma _2;\mathcal {O})$
so that
$g \equiv _{\operatorname {\mathrm {ev}}} f \not \equiv _{\operatorname {\mathrm {ev}}} 0\ \pmod {\lambda } $
.
For
$T = \left [ \begin {matrix} m & r/2 \\ r/2 & n \end {matrix} \right ]$
, we say T is primitive if
$\gcd (m,n,r) = 1$
. We set
$\det (2T) = \Delta (T) \mathfrak {f}^2$
for a positive integer
$\mathfrak {f}$
and where
$-\Delta (T)$
is the discriminant of the quadratic field
$\mathbf {Q}(\sqrt {-\det (2T)})$
. We set
$\chi _{T} = \left (\frac {-\Delta (T)}{\cdot }\right )$
, the quadratic character associated with the field
$\mathbf {Q}(\sqrt {-\det (2T)})$
.
Define
$ \vartheta _{T}(z) = \sum _{a,b \in \mathbf {Z}^2} e(z(ma^2 + r ab + nb^2)) = \sum _{n \geq 0} b(n;\vartheta _{T}) e(nz). $
Given
$v \in \mathbf {Z}_{\geq 1}$
, set

One can check that
$\vartheta _{T}^{(v)} \in M_1(\Gamma (4\det T)),$
where
$\Gamma (N) = \ker \left (\operatorname {\mathrm {SL}}_2(\mathbf {Z}) \rightarrow \operatorname {\mathrm {SL}}_2(\mathbf {Z}/N\mathbf {Z})\right )$
and
$M_k(\Gamma (N))$
denotes the modular forms of weight k and level
$\Gamma (N)$
. Set

We have that
$D(s,\phi ,\vartheta _{T}^{(v)})$
converges in a right half-plane with meromorphic continuation to the entire complex plane [Reference Shimura47]. Set


and

We have each of these terms is algebraic (see [Reference Shimura47, Reference Sturm51, Reference Zagier62]. Moreover, we have via [Reference Zagier62, Equation (22)] that if
$\ell> k-1$
, then
$L_{\operatorname {\mathrm {alg}}}(k-1,\chi _{T})$
is
$\ell $
-integral.
Theorem 3.3 [Reference Mizumoto38]
Let
$\phi \in S_{k}(\Gamma _1)$
be a normalized eigenform with a Fourier expansion as above. Let
$T> 0$
be primitive. We have

where

Note that while this theorem is only stated for Fourier coefficients indexed by primitive T, we have that Fourier coefficients indexed by non-primitive T are an integral linear combination of Fourier coefficients indexed by primitive T by [Reference Mizumoto38, Equation (1.3)] so we only need to consider the primitive T to guarantee the hypotheses of Corollary 3.2 are satisfied.
Lemma 3.4 Assume
$\ell> 4k-7$
. Let
$f \in S_k(\Gamma _2;\mathcal {O})$
be an eigenform. If there exists a normalized eigenform
$\phi \in S_k(\Gamma _1;\mathcal {O})$
so that
$f \equiv _{\operatorname {\mathrm {ev}}} E_{\phi }^{2,1}\ \pmod {\lambda }$
and that
${\overline {\rho }}_\phi $
is irreducible, then
$\rho _f$
is irreducible.
Proof We know via [Reference Weissauer59] that if
$\rho _{f}$
is reducible, then the automorphic representation associated with f is either CAP or a weak endoscopic lift. Moreover, by [Reference Pitale and Schmidt42, Corollary 4.5] since
$f \in S_{k}(\Gamma _2)$
and
$k>2$
, the automorphic representation attached to f can be CAP only with respect to the Siegel parabolic, i.e., f is a classical Saito–Kurokawa lift. Suppose that f is a Saito–Kurokawa lift of
$\psi \in S_{2k-2}(\Gamma _1)$
. Then, we have
${\overline {\rho }}_{f}^{\operatorname {\mathrm {ss}}} = {\overline {\rho }}_{\psi } \oplus \overline {\epsilon }^{k-1} \oplus \overline {\epsilon }^{k-2}$
. Using the fact that
$f \equiv _{\operatorname {\mathrm {ev}}} E_{\phi }^{2,1}\ \pmod {\lambda }$
and that the eigenvalues of
$E_{\phi }^{2,1}$
are given by
$\lambda (p;E_{\phi }^{2,1}) = a(p;\phi ) + p^{k-2}a(p;\phi )$
, the Brauer–Nesbitt and Chebotarev Theorems give that
${\overline {\rho }}_{f}^{\operatorname {\mathrm {ss}}} = {\overline {\rho }}_{\phi } \oplus {\overline {\rho }}_{\phi }(k-2)$
, where recall that we write
${\overline {\rho }}_{\phi }(k-2)$
for
${\overline {\rho }}_{\phi } \otimes \overline {\epsilon }^{k-2}$
. This is a contradiction if
${\overline {\rho }}_{\phi }$
is irreducible. Thus, f cannot be a Saito–Kurokawa lift. It remains to show that the automorphic representation associated with f is not a weak endoscopic lift. The possible decompositions of
$\rho _{f}$
are given in [Reference Skinner and Urban48, Theorem 3.2.1] under the assumption that
$\ell> 4k-7$
. Of these, the only case remaining to check is Case B(v), which states if
$\rho _{f} = \sigma \oplus \sigma '$
with
$\sigma $
and
$\sigma '$
both two-dimensional, then
$\det (\sigma ) = \det (\sigma ')$
. In our case, this would require
$\det (\rho _{\phi }) = \det (\rho _{\phi }(k-2))$
, i.e.,
$\overline {\epsilon }^{k-1} = \overline {\epsilon }^{2k-3}$
, which is impossible by our assumption that
$\ell>4k-7$
. Thus,
$\rho _{f}$
is irreducible.
Theorem 3.5 Assume that
$\ell> 4k-7$
. Let
$\phi \in S_{k}(\Gamma _1; \mathcal {O})$
be a normalized eigenform. Suppose that
$\lambda \mid L_{\operatorname {\mathrm {alg}}}(2k-2, {\operatorname {Sym}^2} \phi )$
. Furthermore, assume there exists
$T_0> 0$
so that

Then, there exists an eigenform
$f \in S_{k}(\Gamma _2; \mathcal {O})$
so that

If in addition
${\overline {\rho }}_{\phi }$
is irreducible, then
$\rho _{f}$
is irreducible.
Proof Set
$H_{\phi }^{2,1}(z) = L_{\operatorname {\mathrm {alg}}}(2k-2,{\operatorname {Sym}^2} \phi ) E^{2,1}_{\phi }(z)$
. For
$T \geq 0$
, define
$c(T) = \operatorname {\mathrm {val}}_{\lambda }(a(T;H_{\phi }^{2,1})). $
Let
$c = \min _{T \geq 0} c(T)$
. Since
$H_{\phi }^{2,1} \in M_k(\Gamma _2)$
, the Fourier coefficients
$a(T;H_{\phi }^{2,1})$
have bounded denominators so c is well-defined [Reference Shimura46]. Moreover, our assumption that there is a
$T_0> 0$
with
$\operatorname {\mathrm {val}}_{\lambda }(a(T_0;H_{\phi }^{2,1})) = \operatorname {\mathrm {val}}_{\lambda }\left (L_{\operatorname {\mathrm {alg}}}(2k-2, {\operatorname {Sym}^2} \phi )a(T_0,E^{2,1}_{\phi })\right ) \leq 0$
gives that
$c \leq 0$
. Set

We have
$a(T;G_{\phi }^{2,1}) \in \mathcal {O}$
for all
$T \geq 0$
since
$c(T) - c \geq 0$
for all
$T \geq 0$
. Observe that for T with
$\det T=0$
, we have
$a(T;G^{2,1}_{\phi }) = \lambda ^{-c}L_{\operatorname {\mathrm {alg}}}(2k-2,{\operatorname {Sym}^2} \phi ) a(n;\phi )$
for some
$n \in \mathbf {Z}_{\geq 0}$
. Since
$a(n;\phi ) \in \mathcal {O}$
by assumption and
$-c\geq 0$
, this gives
$\lambda \mid a(T;G^{2,1}_{\phi })$
for all T with
$\det T = 0$
, i.e., all the Fourier coefficients indexed by singular T vanish modulo
$\lambda $
. Moreover, since
$c = c(\widetilde {T})$
for some
$\widetilde {T}$
, we have
$a(\widetilde {T};G_{\phi }^{2,1}) \in \mathcal {O}^{\times }$
for some
$\widetilde {T}$
. Since
$c\leq 0$
and
$\lambda \mid a(T;G_{\phi }^{2,1})$
for all singular T, we have
$\widetilde {T}>0$
. Thus, Corollary 3.2 and the fact that
$G^{2,1}_{\phi }$
and
$E^{2,1}_{\phi }$
have the same eigenvalues gives an eigenform
$f \in S_{k}(\Gamma _2;\mathcal {O})$
so that
$ E^{2,1}_{\phi } \equiv _{\operatorname {\mathrm {ev}}} f \not \equiv 0\ \pmod {\lambda }. $
By Lemma 3.4, we get that
$\rho _{f}$
is irreducible.
Example 3.6 Consider the space
$M_{26}(\Gamma _2)$
. This space has dimension seven and is spanned by
$E^{2,0}$
(Siegel Eisenstein series),
$E^{2,1}_{\phi }$
(Klingen Eisenstein series), three Saito–Kurokawa lifts, and two non-lift forms
$\Upsilon _1$
and
$\Upsilon _2$
, where here
$\phi \in S_{26}(\Gamma _1)$
is the unique newform given by

We have via [Reference Dummigan21] that

We consider
$\ell \in \{163, 187273\}$
and show that both primes produce an example for Theorem 3.5.
The Klingen Eisenstein series associated with
$\phi $
is given in the beta version of LMFDB. By considering the Fourier coefficients indexed by
$\left [ \begin {matrix} 1 & 0 \\ 0 & 0 \end {matrix} \right ]$
and
$\left [ \begin {matrix} 2 & 0 \\ 0 & 0 \end {matrix} \right ]$
, one can see that the Klingen Eisenstein series given there, say
$E^{\operatorname {\mathrm {LMFDB}}}_{\phi }$
, is given by

We have from LMFDB that

Consider
$G^{2,1}_{\phi }(z) = L_{\operatorname {\mathrm {alg}}}(50,{\operatorname {Sym}^2} \phi ) E^{2,1}_{\phi }(z)$
. We have for
$\ell $
as above that
$\ell \mid a(T;G^{2,1}_{\phi })$
for all T with
$\det T= 0$
and
$a\left (\left [ \begin {matrix} 1 & 1/2 \\ 1/2 & 1 \end {matrix} \right ]; G^{2,1}_{\phi }\right ) \not \equiv 0\ \pmod {\ell }$
. Thus, by Theorem 3.5, there exists a non-trivial Hecke eigenform
$f \in S_{k}(\Gamma _2; \mathbf {Z}_{\ell })$
with
$E^{2,1}_{\phi } \equiv _{\operatorname {\mathrm {ev}}} f\ \pmod {\ell }$
.
Consider first the prime
$\ell = 163$
and suppose that
${\overline {\rho }}^{\mathrm {ss}}_{\phi ,163}=\psi _1 \oplus \psi _2$
for some characters
$\psi _1, \psi _2$
. Since
${\overline {\rho }}_\phi $
is unramified for all
$p \neq \ell ,$
we see that
$\psi _1$
and
$\psi _2$
are each an integer power of
$\overline {\epsilon }$
(see the proof of Lemma 5.3). As
$163 \nmid a(163;\phi ),$
we know
$\phi $
is ordinary at
$163$
and we get
${\overline {\rho }}^{\mathrm {ss}}_{\phi ,163}=\overline {\epsilon }^{25} \oplus 1$
. By [Reference Ribet45, Proposition 2.1] we can find a lattice such that

One can use ordinarity of
$\phi $
to show that
$*$
gives an unramified
$163$
-extension of
$\mathbf {Q}(\zeta _{163})$
(see, e.g., the proof of Theorem 4.28 in [Reference Berger and Klosin10]). By Herbrand’s Theorem, this implies that
$163 \mid B_{26}$
. However, one can check this is not true, so we must have that
${\overline {\rho }}_{\phi ,163}$
is irreducible and so
$E^{2,1}_{\phi }$
must be congruent (modulo 163) to a cusp form f that is not a Saito–Kurokawa lift, i.e.,
$\rho _f$
is irreducible by Theorem 3.5. One uses LMFDB to check that
$f= \Upsilon _2 $
.
Now consider the case that
$\ell = 187273$
. In this case, it is less practical to calculate
$a(187273; \phi )$
, so we directly eliminate the possibility that
$E^{2,1}_{\phi }$
is congruent to a Saito–Kurokawa lift modulo
$187273$
. The space to consider is
$S_{50}(\Gamma _1)$
. This space has one Galois conjugacy class of newforms consisting of three newforms, call them
$\psi _1, \psi _2$
, and
$\psi _3$
. Each newform has a field of definition
$K_{\psi _{i}}$
generated by a root
$\alpha _{i}$
of

One has that
$\lambda (2, E^{2,1}_{\phi }) = -805306416$
and that
$\lambda (2,\psi _{i}) = 2^{49}+2^{48} + \alpha _{i}$
. One uses SAGE to check that
$\lambda (2,E^{2,1}_{\phi }) \not \equiv \lambda (2,\psi _{i})\ \pmod {187273}$
, so
$E^{2,1}_{\phi }$
must be congruent to a cusp form that is not a Saito–Kurokawa lift. One uses LMFDB to see that
$E^{2,1}_{\phi }\equiv _{\operatorname {\mathrm {ev}}} \Upsilon _1\ \pmod {187273}$
.
4 Extensions of Fontaine–Laffaille modules
In this section, we gather various facts (in particular, Propositions 4.8 and 4.20) about extensions of Fontaine–Laffaille modules, which we use in this article but which to the best of our knowledge have not been published elsewhere.
4.1 Definitions
We keep our assumption that
$\ell $
is an odd prime. We fix integers
$a,b$
such that
$0 \leq b-a \leq \ell -2$
. In this section, let E be an arbitrary finite extension of
$\mathbf {Q}_{\ell }$
with ring of integers
$\mathcal {O}$
, uniformizer
$\lambda ,$
and residue field
$\mathbf {F}$
. Write
$\mathrm {LCA}_{\mathcal {O}}$
(respectively,
$\mathrm {LCN}_{\mathcal {O}}$
) for the category of local complete Artinian (respectively, Noetherian)
$\mathcal {O}$
-algebras with residue field
$\mathbf {F}$
. For a category
$\mathcal {C,}$
we will write
$X \in \mathcal {C}$
to mean that X is an object of
$\mathcal {C}$
.
Definition 4.1 [Reference Kalloniatis31, Definition 2.3]/[Reference Booher13, Definition 4.1]
-
1. A Fontaine–Laffaille module is a finitely generated
$\mathbf {Z}_\ell $ -module M together with a decreasing filtration by
$\mathbf {Z}_\ell $ -module direct summands
$M^i$ for
$i \in \mathbf {Z}$ such that there exists
$k \leq l$ with
$M^i=M$ for
$i \leq k$ and
$M^{i+1}=0$ for
$i \geq l$ , and a collection of
$\mathbf {Z}_\ell $ -linear maps
$\phi ^i_M: M^i \to M$ such that
$\phi ^i_M|_{M^{i+1}}=\ell \phi ^{i+1}_M$ for all i and
$M=\sum _i \phi ^i_M(M^i)$ . The category of all Fontaine–Laffaille modules is denoted
$MF^{f}_{\mathbf {Z}_\ell }$ . Morphisms in this category are
$\mathbf {Z}_\ell $ -linear maps
$f: M \to N$ satisfying
$f(M^i) \subset N^i$ and
$f \circ \phi ^i_M=\phi ^i_N \circ f|_{M^i}$ for all i. We will write
$MF^{f}_{\operatorname {\mathrm {tor}}, \mathbf {Z}_\ell }$ for the full subcategory whose objects are of finite length as
$\mathbf {Z}_\ell $ -modules.
-
2. For a fixed interval
$[k,l],$ we denote the full subcategory of
$MF^{f}_{?,\mathbf {Z}_\ell }$ whose objects M have a filtration satisfying
$M^k=M$ and
$M^{l+1}=0$ by
$MF^{f, [k,l]}_{?,\mathbf {Z}_\ell }$ for
$?\in \{\emptyset , \text {tor}\}$ .
-
3. For any
$A \in \mathrm {LCA}_{\mathcal {O}}$ , a Fontaine–Laffaille module over A consists of an object
$M \in MF^{f, [a,b]}_{\operatorname {\mathrm {tor}}, \mathbf {Z}_\ell }$ together with a map
$\theta : A \to \mathrm {End}_{MF^{f, [a,b]}_{\operatorname {\mathrm {tor}}, \mathbf {Z}_\ell }}(M)$ that makes M into a free finitely generated module over A in such a way that
$M^i$ is an A-direct summand of M for each i. A morphism between two such objects is required to additionally preserve the A-structure. We will denote this category of Fontaine–Laffaille modules over A as
$MF^{f, [a,b]}_{\operatorname {\mathrm {tor}}, \mathbf {Z}_\ell } \otimes _{\mathbf {Z}_\ell } A$ .
-
4. For
$M \in MF^{f, [a,b]}_{\operatorname {\mathrm {tor}}, \mathbf {Z}_\ell } \otimes _{\mathbf {Z}_\ell } A,$ any integer i for which
$M^i/M^{i+1} \neq 0$ is called a Fontaine–Laffaille weight for M. The set of Fontaine–Laffaille weights for M will be denoted by
$\mathrm {FL}(M)$ .
Remark 4.2 We impose the stronger restriction on the length of the filtration as in [Reference Bloch and Kato12, Section 4] and [Reference Clozel, Harris and Taylor18, Section 2.4.1] compared to that in Section 1.1.2 of [Reference Diamond, Flach and Guo20] or [Reference Kalloniatis31, Definition 2.3] (which allow the length to be
$\ell -1$
).
Definition 4.3 We introduce the following examples of Fontaine–Laffaille modules:
-
1. If
$0 \in [a,b],$ we write
$\textbf {1} \in MF^{f, [a,b]}_{\mathbf {Z}_\ell }$ for the Fontaine–Laffaille module defined by
$\textbf {1}^i=\mathbf {Z}_\ell $ for
$i \leq 0$ and
$\textbf {1}^i=0$ for
$i>0$ . We set
$\phi ^i: \textbf {1}^i \to \textbf {1}$ to be given by
$x \mapsto \ell ^{-i} x$ for
$i \leq 0$ .
-
2. For any
$A \in \mathrm {LCA}_{\mathcal {O}}$ , we define
$M_{n, A} \in MF^{f, [a,b]}_{\operatorname {\mathrm {tor}}, \mathbf {Z}_\ell } \otimes _{\mathbf {Z}_\ell } A$ to be the free rank one A-module equipped with the filtration
$M_{n, A}^i=A$ for
$i \leq n$ ,
$M_{n, A}^{n+1}=0$ and
$\phi ^i: M_{n, A}^i \to M_{n, A}$ given by
$x \mapsto \ell ^{n-i} x$ for
$i \leq n$ . We put
$\mathbf {1}_A=M_{0, A}$ .
Definition 4.4 [Reference Booher13, Definition 4.9]
For
$M \in MF^{f, [a,b]}_{\operatorname {\mathrm {tor}}, \mathbf {Z}_\ell }$
and
$s \in \mathbf {Z}$
define
$M(s)$
to be the same underlying
$\mathbf {Z}_{\ell }$
-module, but change the filtration to
$M(s)^i=M^{i-s}$
for any
$i \in \mathbf {Z}$
. This means that
$M(s) \in MF^{f, [a+s,b+s]}_{\operatorname {\mathrm {tor}}, \mathbf {Z}_\ell }$
.
4.2 Extensions
To ease notation in the rest of this section, we put
$\mathcal {C}_A^I=MF_{\operatorname {\mathrm {tor}}, \mathbf {Z}_\ell }^{f, I} \otimes _{\mathbf {Z}_\ell } A$
for
$A \in \mathrm {LCA}_{\mathcal {O}}$
. Here,
$I=[a,b]$
.
Definition 4.5 (Definition/Lemma)
Given
$M, N \in \mathcal {C}_A^I$
define a filtration on the A-module
$\operatorname {\mathrm {Hom}}_{A}(M,N)$
by

and
$\mathbf {Z}_\ell $
-linear maps
$\phi ^i: \operatorname {\mathrm {Hom}}_{A}(M,N)^i \to \operatorname {\mathrm {Hom}}_{A}(M,N)$
by

(note that
$M=\sum \phi _M^j(M^j)$
) for
$ f \in \operatorname {\mathrm {Hom}}_{A}(M,N)^i$
and all
$m \in M^j$
and
$j \in \mathbf {Z}$
. We claim this defines a Fontaine–Laffaille structure and that
$\operatorname {\mathrm {Hom}}_{A}(M,N)\in MF_{\operatorname {\mathrm {tor}}, \mathbf {Z}_\ell }^{f, [a-b, b-a]} \otimes _{\mathbf {Z}_\ell } A.$
Proof First note that there exists a canonical A-module homomorphism
$\psi : M^\vee \otimes _A N \to \operatorname {\mathrm {Hom}}_{A}(M,N)$
, where
$M^\vee =\operatorname {\mathrm {Hom}}_A(M,A)$
. Definition 4.19 in [Reference Booher13] defines a Fontaine–Laffaille structure on
$M^\vee $
(and Lemmas 4.20 and 4.21 prove that this structure is well-defined and so we get an object in
$MF_{\operatorname {\mathrm {tor}}, \mathbf {Z}_\ell }^{f, [-b, -a]} \otimes _{\mathbf {Z}_\ell } A$
). Definition 4.17 in [Reference Booher13] then gives us the Fontaine–Laffaille structure on
$M^\vee \otimes _A N$
.
We claim that transferring this structure on
$M^\vee \otimes _A N$
via
$\psi $
to
$\operatorname {\mathrm {Hom}}_{A}(M,N)$
matches our definition. Recall from [Reference Booher13] that
$(M^\vee )^i=\{f\in \operatorname {\mathrm {Hom}}_{A}(M,A)| f(M^k) \subset \mathbf {1}_A^{i+k} \text { for all } k \in \mathbf {Z}\}$
and
$(M^\vee \otimes N)^n=\sum _{i+j=n} (M^\vee )^i \otimes _A N^j.$
We will first show that
$\psi ((M^\vee \otimes N)^n) \subset \operatorname {\mathrm {Hom}}_{A}(M,N)^n$
. Let
$f_i \otimes n_j \in (M^\vee )^i \otimes _A N^j$
. Then,
$\psi (f_i \otimes n_j): m \in M^k \mapsto f_i(m)n_j \in N^j$
. In fact, the image lies in
$N^{n+k}$
. This is clear for
$j \geq n+k$
. If
$j<n+k$
(and hence
$0<i+k$
) it follows since
$f_i(m) \in \mathbf {1}_A^{i+k}=0$
. To show the reverse inclusion
$\psi ((M^\vee \otimes N)^n) \supset \operatorname {\mathrm {Hom}}_{A}(M,N)^n$
consider
$f \in \operatorname {\mathrm {Hom}}_{A}(M,N)^n$
and let j be maximal among integers l such that
$f(M) \subset N^l$
. To satisfy
$f(M^k) \subset N^{k+n}$
for all integers
$k,$
we need
$f(M^k)=0$
for
$k+n>j$
by maximality of j. This means that we need f to factor through
$M/M^{1-i}$
for
$i:=n-j$
. By [Reference Booher13, Lemma 4.20] we have
$(M^\vee )^i=\operatorname {\mathrm {Hom}}_A(M/M^{1-i}, A)$
so we get

We conclude that
$f \in \psi ^{-1}((M^\vee )^i \otimes N^j) \subset \psi ^{-1}((M^\vee \otimes N)^n)$
.
Now, we check the
$\mathbf {Z}_\ell $
-linear maps: Recall from [Reference Booher13] that for
$f \in M^\vee $
, we have
$\phi ^i_{M^\vee }(f)(\phi ^j_M(m))=\phi ^{i+j}(f(m))$
for all
$m \in M^j$
and
$j \in \mathbf {Z}$
. We also have
$\phi ^n_{M^\vee \otimes _A N}=\sum _{i+j=n} \phi ^i_{M^\vee } \otimes \phi ^j_N.$
We claim that
$\phi ^n_{\operatorname {\mathrm {Hom}}_A(M,N)} \circ \psi =\psi \circ \phi ^n_{M^\vee \otimes _A N}: (M^\vee \otimes N)^n \to \operatorname {\mathrm {Hom}}_A(M,N).$
For this, one calculates that both sides map
$f \otimes n \in (M^\vee )^i \otimes N^{n-i}$
to the homomorphism, for which

for any
$m \in M^k$
(for
$\psi \circ \phi ^n_{M^\vee \otimes _A N}$
this uses
$\phi ^{n+k}_N|_{N^{n-i}}=\ell ^{k+i} \phi ^{n-i}_N$
for
$i+k \leq 0$
). This claim, combined with the results in [Reference Booher13] shows that the definition of
$\phi ^n_{\operatorname {\mathrm {Hom}}_A(M,N)}$
is well-defined and satisfies the requirements for
$\operatorname {\mathrm {Hom}}_A(M,N)$
to be a Fontaine–Laffaille module in
$ MF_{\operatorname {\mathrm {tor}}, \mathbf {Z}_\ell }^{f, [a-b, b-a]} \otimes _{\mathbf {Z}_\ell } A$
.
For
$M, N\in \mathcal {C}_A^I$
consider the map
$\phi -1: \operatorname {\mathrm {Hom}}_{A}(M,N)^0 \to \operatorname {\mathrm {Hom}}_{A}(M,N),$
which takes f to the homomorphism that sends
$m=\sum _j \phi ^j_{M}(m_j)$
to

Note that
$\ker (\phi -1)=\operatorname {\mathrm {Hom}}_{\mathcal {C}_A^I}(M,N).$
Proposition 4.6 [Reference Clozel, Harris and Taylor18, Lemma 2.4.2] and [Reference Kalloniatis31, Proposition 2.17]
Given
$M,N \in \mathcal {C}_A^I$
, we have an exact sequence of A-modules (note that
$\operatorname {\mathrm {Hom}}_{\operatorname {\mathrm {Fil}}, A}(M,N)$
in [Reference Kalloniatis31] equals
$\operatorname {\mathrm {Hom}}_{A}(M,N)^0$
)

Given
$M,N \in \mathcal {C}_A^I$
, we write
$\mathrm {FL}(M)>\mathrm {FL}(N)$
if there is an integer j such that all elements of
$\mathrm {FL}(M)$
are greater than or equal to j, and all elements of
$\mathrm {FL}(N)$
are strictly less than j.
Proposition 4.7 The extension group
$\operatorname {\mathrm {Ext}}^1_{\mathcal {C}_A^I}(M,N)$
is a finitely generated A-module. Furthermore, one has:
-
1. If
$\mathrm {FL}(M)>\mathrm {FL}(N)$ then
$\operatorname {\mathrm {Ext}}^1_{\mathcal {C}_A^I}(M,N) \cong \operatorname {\mathrm {Hom}}_A(M,N)$ , in particular, it is a free A-module and
$\mathrm {rk}_A(\operatorname {\mathrm {Ext}}^1_{\mathcal {C}_A^I}(M,N))=\mathrm {rk}_A(M) \mathrm {rk}_A(N)$ .
-
2. If
$\mathrm {FL}(M)<\mathrm {FL}(N)$ then
$\operatorname {\mathrm {Ext}}^1_{\mathcal {C}_A^I}(M,N)=0$ .
Proof This follows from Proposition 4.6. In particular,
$\operatorname {\mathrm {Ext}}^1_{\mathcal {C}_A^I}(M,N)$
is a quotient of the finitely generated A-module
$\operatorname {\mathrm {Hom}}_A(M,N)$
. The calculation on [Reference Kalloniatis31, p. 238] (“two notable cases”) is carried out for
$MF^{f, [0,\ell -1]}_{\operatorname {\mathrm {tor}}, \mathbf {Z}_\ell } \otimes _{\mathbf {Z}_\ell } A$
, but applies verbatim to
$\mathcal {C}_A^I$
. If
$\mathrm {FL}(M)>\mathrm {FL}(N)$
then this calculation shows that
$\operatorname {\mathrm {Hom}}_A(M,N)^0=0$
, while if
$\mathrm {FL}(M)<\mathrm {FL}(N)$
then one gets
$\operatorname {\mathrm {Hom}}_A(M,N)^0=\operatorname {\mathrm {Hom}}_A(M,N)$
.
Proposition 4.8 (Hom-tensor adjunction)
Let
$M, N \in \mathcal {C}_A^I$
. Assume that
$\operatorname {\mathrm {Hom}}_{A}(M,N)$
equipped with the filtration as in Definition 4.5 is an object in
$\mathcal {C}_A^I$
and that
$0 \in I$
. Then, there exists a canonical isomorphism of A-modules:

Proof The statement follows from the existence of the following commutative diagram with exact columns:

The exactness of both columns follows from Proposition 4.6. The second horizontal arrow is the usual isomorphism
$\psi $
of A-modules given by
$f\mapsto (a\mapsto af)$
(recall that the underlying module of the object
$\mathbf {1}_A$
is A) with the inverse map sending g to
$g(1)$
, where
$1$
is the multiplicative identity of A. The map
$\tilde \psi $
is defined by lifting an element of
$ \operatorname {\mathrm {Ext}}^1_{\mathcal {C}_A^I}(M,N)$
to
$\operatorname {\mathrm {Hom}}_{A}(M,N)$
and using
$\psi $
. The exactness of the first column ensures that such a map is well-defined.
The first horizontal arrow is the restriction
$\psi '$
of
$\psi $
to
$\operatorname {\mathrm {Hom}}_{A}(M,N)^0$
(note that
$\operatorname {\mathrm {Hom}}_{A}(M,N)^0$
is a subgroup of
$\operatorname {\mathrm {Hom}}_{A}(M,N)$
even though
$\phi -1$
is not necessarily injective). We need to check that
$\psi '$
lands in
$\operatorname {\mathrm {Hom}}_{ A}(\mathbf {1}_A, \operatorname {\mathrm {Hom}}_{A}(M,N))^0$
. By its definition, we need to check if
$f(\mathbf {1}_A^j) \subset \operatorname {\mathrm {Hom}}_{A}(M,N)^j$
. If
$j>0$
there is nothing to check as then
$\mathbf {1}_A^j=0$
, so assume that
$j\leq 0$
. Then,
$\mathbf {1}_A^j=A$
and
$\operatorname {\mathrm {Hom}}_{A}(M,N)^j\supset \operatorname {\mathrm {Hom}}_{A}(M,N)^0$
. So, it is enough to show that if
$f\in \operatorname {\mathrm {Hom}}_{A}(M,N)^0$
then
$\psi '(f)(A)\subset \operatorname {\mathrm {Hom}}_{A}(M,N)^0$
. Let
$a\in A$
. Then,
$\psi '(f)(a)=af,$
which clearly lies in
$\operatorname {\mathrm {Hom}}_{A}(M,N)^0$
as
$\operatorname {\mathrm {Hom}}_{A}(M,N)^0$
is an A-module.
Now, let
$g\in \operatorname {\mathrm {Hom}}_{ A}(\mathbf {1}_A, \operatorname {\mathrm {Hom}}_{A}(M,N))^0$
. We need to show that
$\psi ^{-1}(g)$
lands in
$\operatorname {\mathrm {Hom}}_{A}(M,N)^0$
. Again we need to consider
$\psi ^{-1}(g)(\mathbf {1}_A^j)$
. If
$j>0$
, then
$g=0$
, hence we are done. Assume that
$j\leq 0$
. Then,
$\mathbf {1}_A^j=A$
and
$\psi ^{-1}(g)=g(1)$
. As
$1\in \mathbf {1}_A^0$
and
$g \in \operatorname {\mathrm {Hom}}_{ A}(\mathbf {1}_A, \operatorname {\mathrm {Hom}}_{A}(M,N))^0$
we must have that
$g(1)\in \operatorname {\mathrm {Hom}}_{A}(M,N)^0$
. So, we are done again.
This shows that
$\psi '$
is a bijection, hence an isomorphism. Hence, by the second Four Lemma,
$\tilde \psi $
is injective, and since it is clearly surjective, it is an isomorphism.
4.3 Fontaine–Laffaille Galois representations
Fix an interval
$I=[a,b]$
with
$a, b \in \mathbf {Z}$
and
$b-a \leq \ell -2$
. In this section, we introduce certain categories of
$G_{\mathbf {Q}_\ell }$
-representations and define a covariant version
$V_I$
of the functor in [Reference Fontaine and Laffaille25] from the categories of Fontaine–Laffaille modules defined in Section 4.1 to these categories of Galois representations.
Let
$A_{\operatorname {\mathrm {cris}}}$
and
$B_{\operatorname {\mathrm {cris}}}$
denote the usual Fontaine’s
$\ell $
-adic period rings (see Definitions 7.3 and 7.7 in [Reference Fontaine and Ouyang26] and [Reference Fontaine24]). We recall that a
$\mathbf {Q}_\ell [G_{\mathbf {Q}_\ell }]$
-module V is called crystalline if
$\dim _{\mathbf {Q}_{\ell }}V = \dim _{\mathbf {Q}_{\ell }} H^0(\mathbf {Q}_{\ell }, V \otimes _{\mathbf {Q}_{\ell }} B_{\operatorname {\mathrm {cris}}})$
. Our convention is that the Hodge–Tate weight of the cyclotomic character is
$+1$
.
Definition 4.9 Let
$A\in \mathrm {LCA}_{\mathcal {O}}$
. We introduce the following categories:
-
(i)
$\mathrm {Rep}_{\mathbf {Z}_\ell }^{f}(G_{\mathbf {Q}_{\ell }})$ , the category of
$\mathbf {Z}_\ell [G_{\mathbf {Q}_{\ell }}]$ -modules that are finitely generated as
$\mathbf {Z}_\ell $ -modules.
-
(ii)
$\mathrm {Rep}_{\operatorname {\mathrm {tor}}, \mathbf {Z}_\ell }^{f}(G_{\mathbf {Q}_{\ell }})$ , the full subcategory of
$\mathrm {Rep}_{\mathbf {Z}_\ell }^{f}(G_{\mathbf {Q}_{\ell }})$ whose objects are required to be of finite length as
$\mathbf {Z}_\ell [G_{\mathbf {Q}_{\ell }}]$ -modules.
-
(iii)
$\mathrm {Rep}_{\mathbf {Z}_\ell }^{\mathrm {cris}, I}(G_{\mathbf {Q}_\ell })$ , the full subcategory of
$\mathrm {Rep}_{\mathbf {Z}_\ell }^{f}(G_{\mathbf {Q}_{\ell }})$ whose objects are isomorphic to
$T/T'$ , where T and
$T'$ are
$G_{\mathbf {Q}_{\ell }}$ -stable finitely generated submodules of a crystalline
$\mathbf {Q}_\ell $ -representation with Hodge–Tate weights in I.
-
(iv)
$\mathrm {Rep}_{\operatorname {\mathrm {tor}}, \mathbf {Z}_\ell }^{\mathrm {cris}, I}(G_{\mathbf {Q}_\ell })$ , the full subcategory of
$\mathrm {Rep}_{\operatorname {\mathrm {tor}}, \mathbf {Z}_\ell }^{f}(G_{\mathbf {Q}_{\ell }})$ whose objects are isomorphic to
$T/T'$ , where T and
$T'$ are
$G_{\mathbf {Q}_{\ell }}$ -stable lattices in a crystalline
$\mathbf {Q}_\ell $ -representation with Hodge–Tate weights in I.
-
(v)
$\mathrm {Rep}_{\mathrm {free}, A}^{\mathrm {cris}, I}(G_{\mathbf {Q}_\ell })$ , the category of free finite rank A-modules M with an A-linear
$G_{\mathbf {Q}_\ell }$ -action, for which there exists a crystalline representation of
$G_{\mathbf {Q}_\ell }$ defined over E with Hodge–Tate weights in I containing
$G_{\mathbf {Q}_\ell }$ -stable
$\mathcal {O}$ -lattices
$T' \subset T$ , and an
$\mathcal {O}$ -algebra map
$A \to \mathrm {End}_{\mathcal {O}}(T/T')$ such that M is isomorphic as an
$A[G_{\mathbf {Q}_\ell }]$ -module to
$T/T'$ . We will call objects of this category Fontaine–Laffaille A-representations (with weights in I).
Remark 4.10 Definition 4.9(v) matches Definition 2.1 in [Reference Kalloniatis31] .
Definition 4.11 [Reference Bloch and Kato12, p. 363] and [Reference Booher13, Definitions 4.7 and 4.9]
Similar to [Reference Booher13] we define the following two functors:
-
1. A covariant functor
$T_{\mathrm {cris}}: MF^{f, [2-\ell ,0]}_{\mathbf {Z}_\ell } \to \mathrm {Rep}_{\mathbf {Z}_\ell }^{f}(G_{\mathbf {Q}_\ell })$ defined via
$$ \begin{align*}T_{\mathrm{cris}}(M):=\ker\left(1-\phi^0_{A_{\mathrm{cris}} \otimes_{\mathbf{Z}_\ell} M}: \mathrm{Fil}^0(A_{\mathrm{cris}} \otimes_{\mathbf{Z}_\ell} M) \to A_{\mathrm{cris}} \otimes_{\mathbf{Z}_\ell} M \right).\end{align*} $$
-
2. A covariant functor
$V_I: MF^{f, [a,b]}_{\mathbf {Z}_\ell } \to \mathrm {Rep}_{\mathbf {Z}_\ell }^{f}(G_{\mathbf {Q}_\ell }),$ defined via
(4.2)$$ \begin{align} V_I(M)=T_{\mathrm{cris}}(M(-b))(- b).\end{align} $$
Recall that
$M(-b)$ was defined in Definition 4.4, while
$(-b)$ on the outside denotes the Tate twist as defined in Section 2.
Remark 4.12 We note that for
$?\in \{\emptyset , \operatorname {\mathrm {tor}}\},$
the category
$MF_{?, \mathbf {Z}_{\ell }}^{f,[a,b]}$
is a full subcategory of
$MF_{?, \mathbf {Z}_{\ell }}^{f,[a,a+\ell -2]}$
, since they are both full subcategories of
$MF_{?, \mathbf {Z}_{\ell }}^{f}$
(cf. Definition 4.1), so in particular (4.2) makes sense.
Remark 4.13 Note that
$V_I$
extends
$T_{\mathrm {cris}}$
to general I (in particular,
$V_{[2-\ell ,0]}=T_{\mathrm {cris}}$
). Also observe that for
$M \in MF^{f, [a,b]}_{\operatorname {\mathrm {tor}}, \mathbf {Z}_\ell }$
, we have
$M(-b) \in MF^{f, [2-\ell , 0]}_{\operatorname {\mathrm {tor}}, \mathbf {Z}_\ell }$
since
$M(-b)^{1}=M^{b+1}=0$
and
$M(-b)^{2-\ell }=M^{2-\ell +b}=M$
as
$b+2-\ell \leq a$
. In particular, the definition of
$V_I$
makes sense.
Compared to [Reference Booher13] we work with the more restrictive interval
$[2-\ell ,0]$
for
$T_{\mathrm {cris}}$
and correct a sign error in the Galois twist in [Reference Booher13, Definition 4.9]
Theorem 4.14 [Reference Bloch and Kato12, Theorem 4.3] [Reference Niziol41, Section 2] [Reference Diamond, Flach and Guo20, Section 1.1.2] [Reference Hattori27, Section 2.2] [Reference Booher13, Fact 4.10] and [Reference Kalloniatis31, Theorem 2.10]
We have:
-
(i) The covariant functor
$V_{[a,b]}:MF^{f, [a,b]}_{\mathbf {Z}_\ell } \to \mathrm {Rep}^f_{\mathbf {Z}_\ell }(G_{\mathbf {Q}_\ell })$ is well-defined, exact, and fully faithful.
-
(ii) For
$M \in MF^{f, [a,b]}_{\mathbf {Z}_\ell }$ , one has
$V_{[a,b]}(M)=\mathop {\varprojlim }\limits _n V_{[a,b]}(M/\ell ^n)$ .
-
(iii) The essential image of
$V_{[a,b]}$ is closed under formation of sub-objects, quotients, and finite direct sums. It is given by the subcategory
$\mathrm {Rep}_{\mathbf {Z}_\ell }^{\mathrm {cris}, [-b,-a]}(G_{\mathbf {Q}_\ell })$ . For
$M \in MF^{f, [a,b]}_{\operatorname {\mathrm {tor}}, \mathbf {Z}_\ell }$ , the lengths of M and
$V_I(M)$ as
$\mathbf {Z}_\ell $ -modules agree; in particular, the essential image of
$ MF^{f, [a,b]}_{\operatorname {\mathrm {tor}}, \mathbf {Z}_\ell }$ under
$V_{[a,b]}$ is
$\mathrm {Rep}_{\operatorname {\mathrm {tor}}, \mathbf {Z}_\ell }^{\mathrm {cris}, [-b,-a]}(G_{\mathbf {Q}_\ell })$ .
-
(iv) For
$A \in \mathrm {LCA}_{\mathcal {O}}$ , the functor
$V_{[a,b]}$ induces a functor from
$MF^{f, [a,b]}_{\operatorname {\mathrm {tor}}, \mathbf {Z}_\ell } \otimes _{\mathbf {Z}_\ell } A$ to the category of free finite rank A-modules with an A-linear
$G_{\mathbf {Q}_\ell }$ -action, which we will also denote by
$V_{[a,b]}$ . Its essential image is given by
$\mathrm {Rep}^{\mathrm {cris}, [-b,-a]}_{\mathrm {free}, A}(G_{\mathbf {Q}_\ell })$ . In fact,
$V_{[a,b]}$ gives an equivalence of categories between
$ MF^{f, [a,b]}_{\operatorname {\mathrm {tor}}, \mathbf {Z}_\ell } \otimes _{\mathbf {Z}_\ell } A $ and
$\mathrm {Rep}^{\mathrm {cris}, [-b,-a]}_{\mathrm {free}, A}(G_{\mathbf {Q}_\ell })$ .
Remark 4.15
-
(1) Note that for
$M \in MF^{f, [a,b]}_{\operatorname {\mathrm {tor}}, \mathbf {Z}_\ell }$ , we have
$V_{[a+s,b+s]}(M(s))=V_{[a,b]}(M)(-s)$ .
-
(2) For
$I=[a,b]=[0, \ell -2],$ the functor
$V_I$ agrees with that of the functor
$\mathbb {V}$ in [Reference Diamond, Flach and Guo20, p. 670] by [Reference Breuil14, Proposition 3.2.1.7]
-
(3) For
$M \in MF^{f, [a,b]}_{\operatorname {\mathrm {tor}}, \mathbf {Z}_\ell } \otimes _{\mathbf {Z}_\ell } A$ , the Hodge–Tate weights of
$V_I(M)$ (in the sense of Definition 4.9(3)) equal the negatives of the Fontaine–Laffaille weights of M, defined in Definition 4.1(3), due to our convention that the Hodge–Tate weight of the cyclotomic character is
$+1$ .
As an immediate consequence of the equivalence of categories in Theorem 4.14(iv), we obtain the following corollary.
Corollary 4.16 For any
$M, N \in MF_{\operatorname {\mathrm {tor}}, \mathbf {Z}_\ell }^{f, I} \otimes _{\mathbf {Z}_\ell } A,$
there is an isomorphism of A-modules

4.4 Local Selmer groups
Let
$I=[a,b]$
be an interval as in the previous section (so
$0\leq b-a\leq \ell -2$
) but we now also require that
$0 \in I$
(so that
$\mathbf {1} \in MF^{f, I}_{\mathbf {Z}_\ell }$
, see Definition 4.3).
For an extension between two objects
$M,N$
in
$\mathrm {Rep}_{A}(G_{\mathbf {Q}_\ell }) 0 \to M \to E \to N \to 0,$
we define the n-th Tate twist of the extension to be the extension
$0 \to M(n) \to E(n) \to N(n) \to 0.$
For a subgroup G of
$\mathrm {Ext}^1_{\mathrm {Rep}_{A}(G_{\mathbf {Q}_{\ell }})}(M,N),$
we define
$G(n)$
to consist of extensions which are the n-th Tate twists of the elements of G.
Given an extension
$\mathcal {E} \in \mathrm {Ext}^1_{MF^{f, I}_{\operatorname {\mathrm {tor}}, \mathbf {Z}_\ell } \otimes _{\mathbf {Z}_\ell } A}(M_3,M_1)$
represented by an exact sequence

we will write
$V_I(\mathcal {E})$
for the extension in
$\mathrm {Ext}^1_{\mathrm {Rep}^{\mathrm {cris}, -I}_{\mathrm {free}, A}(G_{\mathbf {Q}_{\ell }})} (V_I(M_3),V_I(M_1))$
represented by

This uses the exactness of the functor
$V_I$
(cf. Theorem 4.14(i)). Since we defined
$V_I(M)=T_{\mathrm {cris}}(M(-b))(-b)$
(see Equation (4.2)), we conclude the following lemma.
Lemma 4.17 For
$A \in \mathrm {LCA}_{\mathcal {O}}$
and
$M \in MF^{f, I}_{\operatorname {\mathrm {tor}}, \mathbf {Z}_\ell } \otimes _{\mathbf {Z}_\ell } A,$
we have

Note that the latter is naturally isomorphic to
$\mathrm {Ext}^1_{\mathrm {Rep}^{\mathrm {cris}, [0,\ell -2]}_{\mathrm {free}, A}(G_{\mathbf {Q}_{\ell }})}(A(b), T_{\mathrm {cris}}(M(-b)))$
and they give rise to the same subgroup of
$H^1(\mathbf {Q}_\ell , V_I(M))$
, see Definition 4.18.
Definition 4.18 For
$M \in MF^{f, I}_{\operatorname {\mathrm {tor}}, \mathbf {Z}_\ell } \otimes _{\mathbf {Z}_\ell } A,$
let
$H^1_{f, I}(\mathbf {Q}_\ell , V_I(M))=V_I(\mathrm {Ext}^1_{MF^{f, I}_{\operatorname {\mathrm {tor}}, \mathbf {Z}_\ell }\otimes _{\mathbf {Z}_\ell } A}(\mathbf {1}_A,M)) \subset H^1(\mathbf {Q}_\ell , V_I(M))$
.
Remark 4.19 This is a more precise version of the definition made in [Reference Berger and Klosin6, Section 5.2.1] (where the prime
$\ell $
was denoted by p). In [Reference Berger and Klosin6] we worked (implicitly) with
$I=[0,p-2]$
, but the results in [Reference Berger and Klosin6, Section 5] (in particular, Corollary 5.4 and Proposition 5.8 restated below) carry over to
$H^1_{f, I}$
defined here for general I.
T.B. and K.K. would like to clarify how certain definitions and results in some of our papers fit in with this more precise description of the groups
$H^1_{f, I}$
: In [Reference Berger and Klosin8] the relevant interval I is
$I=[1-k, k-1]$
for Section 5, and p should satisfy
$p-1>2k-2$
. The examples in Section 6 of [loc. cit] satisfy this stronger condition. Similarly, in [Reference Berger and Klosin9] one has
$I=[3-2k, 2k-3]$
(
$p-1>4k-6$
). In [Reference Berger and Klosin6, Section 6] the suitable interval I is such that
$\operatorname {\mathrm {Hom}}_{\mathcal {O}}(\tilde \rho _2, \tilde \rho _1)$
has Hodge–Tate weights in I. For
$i, j \in \{1, 2\},$
the local condition at
$v \mid p$
for the Selmer groups
$H^1_\Sigma (F, \operatorname {\mathrm {Hom}}_{\mathbf {F}}(\rho _i, \rho _j))$
is
$H^1_{f, I}(F_v, \operatorname {\mathrm {Hom}}_{\mathbf {F}}(\tilde \rho _i, \tilde \rho _j))$
. In [loc.cit.] Section 9, one has
$I=[-1,1]$
(
$p-1>2$
), in Section 10,
$I=[1-k,k-1]$
(
$p-1>2k-2$
). In [Reference Berger and Klosin7, Sections 7 and 8] the same comment applies as for [Reference Berger and Klosin6, Section 9]
In J.B.’s paper [Reference Brown16] the argument in Sections 8 and 9 to show the splitting at
$\ell $
of
$\begin {pmatrix} \overline {\epsilon }^{k-2}&*\\0&\overline {\epsilon }^{k-1} \end {pmatrix}$
by relating it to
$H^1_f(\mathbf {Q}_\ell , \mathbf {F}(-1))=0$
requires an interval I containing
$-1$
and
$2k-3$
, so would need
$p-1>2k-2$
. However, one could instead not twist and invoke Proposition 4.7.
Similar comments apply to other results in the literature, e.g., in [Reference Diamond, Flach and Guo20, Corollary 2.3] the expression
$H^1_f(\mathbf {Q}_\ell , \mathrm {ad}^0_{\kappa } \overline {L})$
is only indirectly defined by
$H^1_f(\mathbf {Q}_\ell , \mathrm {ad}_{\kappa } \overline {L})=H^1_f(\mathbf {Q}_\ell , \mathrm {ad}^0_{\kappa } \overline {L}) \oplus H^1_f(\mathbf {Q}_\ell , \kappa )$
. To define the Selmer group for the trace zero endomorphisms and prove this identity requires
$\mathrm {ad}^0_{\kappa }$
to lie in the essential image of the Fontaine–Laffaille functor, and therefore
$I=[1-k, k-1]$
should be specified, rather than
$I=[0, \ell -2]$
as in [Reference Diamond, Flach and Guo20, Section 1.1.2]
If
$M,N \in \mathrm {Rep}_{\mathrm {free}, A}^{\mathrm {cris}, I}(G_{\mathbf {Q}_{\ell }})$
, then
$M\oplus N\in \mathrm {Rep}_{\mathrm {free}, A}^{\mathrm {cris}, I}(G_{\mathbf {Q}_{\ell }})$
and it is clear that

because the extension groups as well as the functor
$V_I$
commute with direct sums.
Proposition 4.20 For any
$n \in [2-\ell , \ell -2]$
such that
$0, -n \in I,$
the group
$H^1_{f, I}(\mathbf {Q}_\ell , V_I(M_{-n, \mathbf {F}}))$
is independent of I. In fact, we have

where

and
$H^1_{\mathrm {fl}}(\mathbf {Q}_\ell , \mu _\ell )$
denotes the peu ramifiée classes, namely, those classes corresponding to
$\mathbf {Z}_\ell ^\times /(\mathbf {Z}_\ell ^\times )^\ell \subset \mathbf {Q}_\ell ^\times /(\mathbf {Q}_\ell ^\times )^\ell \cong H^1(\mathbf {Q}_\ell , \mathbf {F}(1))$
. For
$n \geq 0,$
we note that
$\dim _{\mathbf {F}} H^1_{f, I}(\mathbf {Q}_\ell , \mathbf {F}(n)) =1$
.
Remark 4.21
-
(1) Proposition 4.20 justifies writing
$H^1_{\Sigma }(\mathbf {Q}_\ell , V_I(M_n))$ as we did in [Reference Berger and Klosin8], without specifying the interval I, as long as I contains
$-n$ . Under the conditions of Proposition 4.23 (see comment after Proposition 5.1), once we have fixed a suitable interval
$I,$ we will also drop the subscript I in this article.
-
(2) Note that the definition of
$H^1_{f, I}(\mathbf {Q}_\ell , V_I(M_n))$ depends on
$n \in \mathbf {Z}$ , even though the coefficients
$V_I(M_n)=\mathbf {F}(n)$ only depend on
$n\ \mod {\ell -1}$ .
-
(3) [Reference Niziol41, Section 9.3] states a version of this result for the local crystalline cohomology of unramified extensions of
$\mathbf {Q}_\ell $ and with
$\mathbf {Z}_\ell /\ell ^m(n)$ coefficients for
$m \in \mathbf {Z}_{>0}$ .
Proof We first note that
$H^1(\mathbf {Q}_\ell , \mathbf {F}(n))$
is one-dimensional for
$n \neq 0,1$
, which follows from local Tate duality and the Euler characteristic formula (see, e.g., [Reference Washington58, Theorem 1 and Proposition 3]
For
$n=0,$
we refer the reader to [Reference Clozel, Harris and Taylor18, Corollary 2.4.4] for identifying
$H^1_{f, I}(\mathbf {Q}_\ell , \mathbf {F}(n))$
with
$H^1_{\operatorname {\mathrm {un}}}(\mathbf {Q}_\ell , \mathbf {F})$
. That
$H^1_{\operatorname {\mathrm {un}}}(\mathbf {Q}_\ell , \mathbf {F})$
is one-dimensional follows since
$\#H^1(G_{\mathbf {Q}_\ell }/I_\ell , \mathbf {F})=\#H^0(\mathbf {Q}_\ell , \mathbf {F})$
. Recall that

If
$n<0$
then by Proposition 4.7(ii)
$\mathrm {Ext}^1_{MF^{f, I}_{\operatorname {\mathrm {tor}}, \mathbf {Z}_\ell }\otimes _{\mathbf {Z}_\ell } \mathbf {F}}(M_{0, \mathbf {F}},M_{-n,\mathbf {F}})=0$
since the Fontaine–Laffaille weights satisfy the inequality
$-n>0$
.
On the other hand, if
$n> 0$
then
$H^1_{f, I}(\mathbf {Q}_\ell , V_I(M_{-n}))$
is one-dimensional by Proposition 4.7(i). For
$n>1,$
this equals
$H^1(\mathbf {Q}_\ell , \mathbf {F}(n))$
by our observation at the start of the proof.
For
$n=1,$
we have
$H^1(\mathbf {Q}_\ell , \mathbf {F}(1)) \cong \mathbf {Q}_\ell ^\times /(\mathbf {Q}_\ell ^\times )^\ell $
is two-dimensional, and one can identify the Fontaine–Laffaille extensions with the peu ramifiée classes (see, e.g., [Reference Breuil15, Lemma 8.1.3]
Remark 4.22 Note that
$[2-\ell , 0]$
contains both
$0$
and
$2-\ell $
(and is the only interval of this length that contains both). Then, since
$\mathbf {F}(-1)=\mathbf {F}(\ell -2)=V_{[2-\ell , 0]}(M_{2-\ell }),$
we get

corresponding to the crystalline non-split extension
$\begin {pmatrix} \overline {\epsilon }^{\ell -2}&*\\0&1 \end {pmatrix}$
. Note that
$1 \notin [2-\ell , 0]$
.
However, for all other intervals
$I \subset [2-\ell , \ell -2]$
of length
$\ell -2,$
we have
$1 \in I$
and so

by Proposition 4.20. This demonstrates that
$H^1_{f, I}(\mathbf {Q}_\ell , \mathbf {F}(n))$
is only independent of I for I containing
$-n$
.
Following [Reference Bloch and Kato12] for a
$\mathbf {Q}_\ell [G_{\mathbf {Q}_\ell }]$
-module V define
$H_{f}^1(\mathbf {Q}_\ell ,V)= \ker \left (H^1(\mathbf {Q}_\ell ,V) \rightarrow H^1(\mathbf {Q}_\ell , V \otimes _{\mathbf {Q}_{\ell }} B_{\operatorname {\mathrm {cris}}})\right ).$
Let V be a finite-dimensional E-vector space and
$T \subset V$
be a
$G_{\mathbf {Q}_\ell }$
-stable
$\mathcal {O}$
-lattice, i.e., T is a free
$\mathcal {O}$
-submodule of V that spans V as a vector space over E. We set
$W = V/T$
and
$W[\lambda ^{m}] = \{w \in W: \lambda ^{m}w = 0\} \cong T/\lambda ^{m} T$
for any
$m \in \mathbf {Z}_{>0}$
. Note that
$W[\lambda ^{m}]$
lies in
$\mathrm {Rep}^{\mathrm {cris}, -I}_{\mathcal {O}/\lambda ^m}(G_{\mathbf {Q}_\ell })$
if V is crystalline with Hodge–Tate weights in
$-I$
. We let
$H^1_{f}(\mathbf {Q}_\ell ,W)$
be the image of
$H^1_{f}(\mathbf {Q}_\ell ,V)$
under the natural map
$H^1(\mathbf {Q}_\ell ,V) \rightarrow H^1(\mathbf {Q}_\ell ,W)$
.
Proposition 4.23 [Reference Diamond, Flach and Guo20, Proposition 2.2]
Assume V is a crystalline
$E[G_{\mathbf {Q}_\ell }]$
-module as above with Hodge–Tate weights in
$-I=[-b,-a]$
(and
$0 \in I$
). For
$T \subset V$
and
$W=V/T$
as above, we then have
$H^1_{f}(\mathbf {Q}_\ell , W)= \mathop {\varinjlim }\limits _m H^1_{f,I}(\mathbf {Q}_\ell , W[\lambda ^m])$
.
Proof We note that the proof of [Reference Diamond, Flach and Guo20, Proposition 2.2] carries over from
$[0, \ell -2]$
to general I (in particular, one has Proposition 4.6) and apply the argument with (in their notation)
$V_1$
the trivial
$G_{\mathbf {Q}_\ell }$
-representation and
$V_2=V$
.
Corollary 4.24 [Reference Diamond, Flach and Guo20, (33)] and [Reference Berger and Klosin6, Corollary 5.4]
For every
$m \in \mathbf {Z}_{>0}$
, we have an exact sequence of
$\mathcal {O}$
-modules

Corollary 4.25 For
$n \in \mathbf {Z}$
with
$0, n \in I \subset [2-\ell , \ell -2]$
and
$n \neq 0,$
we have

Proof Note that
$H^0(\mathbf {Q}_\ell , E/\mathcal {O}(n)[\lambda ])=0$
since
$n \not \equiv 0\ \mod {\ell -1}$
. This implies
$H^0(\mathbf {Q}_\ell , E/\mathcal {O}(n))=0$
, hence, we are done by Corollary 4.24.
5 Selmer groups
5.1 Definitions
For M a topological
$\mathbf {Z}_\ell [G_{\mathbf {Q}}]$
-module set

for every prime p. Let
$E/\mathbf {Q}_{\ell }$
be a finite extension with valuation ring
$\mathcal {O}$
, uniformizer
$\lambda $
, and residue field
$\mathbf {F}$
. Let V be a finite-dimensional E-vector space on which one has a continuous E-linear
$G_{\mathbf {Q}}$
action. For finite primes p with
$p \neq \ell $
, we set

For
$p=\ell $
, we recall from Section 4 that

Let
$T \subset V$
be a
$G_{\mathbf {Q}}$
-stable
$\mathcal {O}$
-lattice. We set
$W = V/T$
and
$W[\lambda ^{n}] = \{w \in W: \lambda ^{n}w = 0\} \cong T/\lambda ^{n} T$
. For every
$p,$
we let
$H^1_{f}(\mathbf {Q}_p,W)$
be the image of
$H^1_{f}(\mathbf {Q}_p,V)$
under the natural map
$H^1(\mathbf {Q}_p,V) \rightarrow H^1(\mathbf {Q}_p,W)$
. We have
$H^1_{f}(\mathbf {Q}_p,W) = H^1_{\operatorname {\mathrm {un}}}(\mathbf {Q}_p, W)$
for all
$p \neq \ell $
, as long as V is unramified at p, which for us will always be the case.
We define the global Selmer group of W as

We note that as
$H^1_{f}(\mathbf {Q}_\ell , W)$
commutes with direct sums and so clearly does
$H^1_{\operatorname {\mathrm {un}}}(\mathbf {Q}_\ell , W)$
, we get that
$H^1_f(\mathbf {Q}, W)$
does as well.
Let
$I=[a,b]$
with
$a, b \in \mathbf {Z}$
and
$b-a \leq \ell -2$
and assume that
$0 \in I$
. If V is crystalline with Hodge–Tate weights in
$-I,$
we define

As noted in (4.4),
$H^1_{f}(\mathbf {Q}_\ell , W[\lambda ^n])$
also commutes with direct sums and so we get that
$H^1_{f, I}(\mathbf {Q}, W[\lambda ^n])$
does as well.
Proposition 5.1 Assume that the interval
$I=[a,b]$
contains
$0$
and V is
$E[G_{\mathbf {Q}}]$
-module, which is finite-dimensional as an E-vector space and a crystalline
$G_{\mathbf {Q}_{\ell }}$
-module with Hodge–Tate weights in
$-I$
. If
$H^0(\mathbf {Q}, W[\lambda ])=0$
then we have

Proof [Reference Berger and Klosin6, Proposition 5.8] proves the claim under the assumption
$H^0(\mathbf {Q}, W)=0$
.
Suppose we have
$\alpha \in H^0(\mathbf {Q}, W)$
. We know every element of W is annihilated by some power of
$\lambda $
, so if
$\alpha \neq 0$
there is an integer m so that
$\lambda ^{m} \alpha =0$
but
$\lambda ^{n} \alpha \neq 0$
for all
$0 < n < m$
. However, this gives
$\lambda ^{m-1} \alpha \in H^0(\mathbf {Q}, W[\lambda ]) =0$
, so it must be that
$\alpha = 0$
. Thus,
$H^0(\mathbf {Q}, W) = 0$
as desired.
After a suitable interval, I has been fixed, we will therefore also drop the subscript I and write
$H^1_{f}(\mathbf {Q}, W[\lambda ^n])$
.
Let G be a group, R a commutative ring with identity, and
$M_{i}$
finitely generated free R-modules with R-linear action given by
$\rho _{i}: G \rightarrow \operatorname {\mathrm {GL}}_R(M_i)$
for
$i=1,2$
. The action of G on
$\operatorname {\mathrm {Hom}}_{R}(\rho _2,\rho _1)$
is given by
$(g\cdot \varphi )(v) = \rho _1(v) \varphi (\rho _2(g^{-1})v).$
In particular, if
$\rho _1 = \rho _2 = \rho $
, we define the adjoint representation of
$\rho $
to be the
$R[G]$
-module
$\operatorname {\mathrm {ad}} \rho = \operatorname {\mathrm {Hom}}_R(\rho ,\rho )$
. We write
$\operatorname {\mathrm {ad}}^0\rho $
for the
$R[G]$
-submodule of
$\operatorname {\mathrm {ad}}\rho $
consisting of endomorphisms of trace zero.
If
$\rho $
is of rank n and
$2n \in R^\times $
then we have an isomorphism of
$R[G]$
-modules

5.2 Non-vanishing of a Selmer group
In this section, we explain how the congruence of a Siegel cusp form to the Klingen Eisenstein series in Section 3 leads to a non-zero element of
$H^1_f(\mathbf {Q}, \mathrm {ad}^0(\rho _{\phi , \lambda })(2-k) \otimes E/\mathcal {O})$
.
From now on, we fix the weight
$k \geq 12$
even and the prime
$\ell $
satisfying
$\ell>4k-5$
and impose Assumption 3.1 on the field
$E/\mathbf {Q}_\ell $
. Let
$\phi \in S_{k}(\Gamma _1)$
be a normalized eigenform. Let
$\rho _{\phi }$
be the
$\lambda $
-adic Galois representation associated with
$\phi $
and assume
${\overline {\rho }}_{\phi }$
is irreducible. Let
$f \in S_{k}(\Gamma _2)$
be an eigenform with irreducible Galois representation
$\rho _{f}$
so that f is eigenvalue congruent to
$E_{\phi }^{2,1}$
modulo
$\lambda $
.
The following result shows we can choose a lattice so that the residual Galois representation gives rise to a non-split extension.
Lemma 5.2 There exists a
$G_{\mathbf {Q}}$
-stable lattice in the space of
$\rho _f$
such that with respect to this lattice

Proof Using the compactness of
$G_{\mathbf {Q}}$
, one can show that there exists a
$G_{\mathbf {Q}}$
-stable lattice
$\Lambda '$
in the space of
$\rho _f$
. One uses Brauer–Nesbitt Theorem together with the Chebotarev Density Theorem to conclude that
${\overline {\rho }}_{f, \Lambda '}^{\mathrm {ss}}={\overline {\rho }}_{\phi }\oplus {\overline {\rho }}_{\phi }(k-2)$
. Now, the existence of the desired lattice which gives the non-split extension follows from Theorem 4.1 in [Reference Berger and Klosin9].
From now on, whenever we write
$\rho _f$
, we assume we have made a choice of lattice as in Lemma 5.2, so we consider
$\rho _f$
as a map from
$G_{\mathbf {Q}}$
to
$\operatorname {\mathrm {GL}}_4(\mathcal {O})$
.
We now choose the interval
$I=[3-2k,2k-3]$
so that it contains all the Hodge–Tate weights of
$\rho _f$
,
$\rho _\phi $
,
$\rho _\phi (k-2)$
,
$\operatorname {\mathrm {ad}} \rho _\phi (2-k)$
, and
$\operatorname {\mathrm {ad}} \rho _\phi (k-2)$
. Note that
$-I=I$
. We assume that
$\ell -2\geq 4k-6$
. When we write
$H^1_f$
from now on, this refers to
$H^1_{f, I}$
as defined in Section 5.1.
Let
$\rho $
be any of the representations above and write V for the representation space of
$\rho $
. We choose a
$G_{\mathbf {Q}}$
-stable lattice
$T\subset V$
and recall that the isomorphism class of the semi-simplification of the
$\mathbf {F}[G_{\mathbf {Q}}]$
-representation
$T/\lambda T$
is independent of the choice of T. It is well-known that if
$T/\lambda T$
is irreducible then the
$\mathcal {O}$
-length of
$H^1_f(\mathbf {Q}, W)$
is independent of T, where as before
$W=V/T$
. By Proposition 5.1, we then conclude that also the
$\mathcal {O}$
-length of
$H^1_f(\mathbf {Q},W[\lambda ^n])$
is independent of the choice of T as long as
$H^0(\mathbf {Q},W)=0$
.
Lemma 5.3 Under our assumptions (in particular,
${\overline {\rho }}_{\phi }$
irreducible and
$\ell>4k-5$
), the modulo
$\lambda $
reduction of
$\operatorname {\mathrm {ad}}^0\rho _{\phi }$
is irreducible.
Proof Assume the three-dimensional representation
$\operatorname {\mathrm {ad}}^0{\overline {\rho }}_{\phi }$
is reducible. Then, it either has a one-dimensional
$G_{\mathbf {Q}}$
-stable subspace or quotient. Since
$\operatorname {\mathrm {ad}}\rho _{\phi }$
and
$\mathbf {1}$
are self-dual, so is
$\operatorname {\mathrm {ad}}^0 {\overline {\rho }}_{\phi }$
. Hence, we can assume without loss of generality that
$\operatorname {\mathrm {ad}}^0{\overline {\rho }}_{\phi }$
has a
$G_{\mathbf {Q}}$
-stable line. Write
$\psi $
for the character by which
$G_{\mathbf {Q}}$
acts on the line.
As
${\overline {\rho }}_{\phi }$
is unramified away from
$\ell $
and the order of
$\psi $
is prime to
$\ell $
, we have
$\psi =\overline {\epsilon }^a$
for some integer
$a \in I$
. This would require
$H^0(\mathbf {Q}, \operatorname {\mathrm {ad}}^0 {\overline {\rho }}_{\phi }(-a)) \neq 0$
. Note that
$H^0(\mathbf {Q}, \operatorname {\mathrm {ad}} {\overline {\rho }}_{\phi }(-a))=\operatorname {\mathrm {Hom}}_{G_{\mathbf {Q}}}({\overline {\rho }}_\phi (a), {\overline {\rho }}_\phi ).$
If
$a\equiv 0$
(mod (
$\ell -1$
)), then this space is one-dimensional by Schur’s Lemma since
${\overline {\rho }}_{\phi }$
is irreducible. So,
$H^0(\mathbf {Q}, \operatorname {\mathrm {ad}}^0{\overline {\rho }}_{\phi })=0$
, contradiction.
If
$a\not \equiv 0$
(mod (
$\ell -1$
)), then
$H^0(\mathbf {Q}, \operatorname {\mathrm {ad}} {\overline {\rho }}_{\phi }(-a))=H^0(\mathbf {Q}, \operatorname {\mathrm {ad}}^0 {\overline {\rho }}_{\phi }(-a))\neq 0.$
This means that
${\overline {\rho }}_{\phi }$
is isomorphic to
${\overline {\rho }}_{\phi }(a)$
. Considering the determinant,
$\overline {\epsilon }^a$
must be the trivial character or the quadratic character
$\overline {\epsilon }^{(\ell -1)/2}$
. Both are ruled out since
$a \in I=[3-2k, 2k-3]$
by our assumption that
$\ell>4k-5$
.
Remark 5.4 From Lemma 5.3, we conclude that when
$\rho \in \{\rho _{\phi }, \rho _{\phi }(k-2), \operatorname {\mathrm {ad}}^0\rho _{\phi }(2-k), \operatorname {\mathrm {ad}}^0\rho _{\phi }(k-2)\}$
, the
$\mathcal {O}$
-lengths of
$H^1_f(\mathbf {Q}, W)$
and
$H^1_f(\mathbf {Q}, W[\lambda ^n])$
are independent of the choice of T. As we will ever only be interested in the order of these groups, the choice of T is immaterial and we will simply assume that such a choice was made. So, for example, we will use the notation
$H^1_f(\mathbf {Q}, \operatorname {\mathrm {ad}}^0\rho _{\phi , \lambda }(k-2)\otimes E/\mathcal {O})$
, thus assuming that when we write
$\operatorname {\mathrm {ad}}^0\rho _{\phi , \lambda }(k-2)$
, we have made a choice of a lattice for this representation. Likewise any one-dimensional representation
$\rho $
is irreducible, so the
$\mathcal {O}$
-length of
$H^1_f(\mathbf {Q}, \rho \otimes E/\mathcal {O})$
is independent of the choice of T.
For the representation
$\operatorname {\mathrm {ad}}\rho (m)$
,
$m\in \{k-2, 2-k\}$
(which is reducible), we choose a lattice which is a direct sum of a lattice inside
$\operatorname {\mathrm {ad}}^0\rho (m)$
and a lattice inside
$E(m)$
. So, from now on, whenever we write
$\operatorname {\mathrm {ad}}\rho (m)$
we mean such a lattice. Since the formation of Selmer groups commutes with direct sums, we then get

for
$m\in \{k-2,2-k\}$
. Note that the
$\mathcal {O}$
-length (and in particular, the non-triviality) of
$H^1_f(\mathbf {Q}, \operatorname {\mathrm {ad}} \rho (m)\otimes E/\mathcal {O})$
is independent of the choice of a lattice inside
$\operatorname {\mathrm {ad}}\rho _{\phi }(m)$
as long as it is the direct sum of lattices in
$\operatorname {\mathrm {ad}}^0\rho _{\phi }(m)$
and
$E(m)$
.
Theorem 5.5 With the set-up as above, we have
$H^1_f(\mathbf {Q}, \operatorname {\mathrm {ad}}\rho _{\phi }(2-k)\otimes E/\mathcal {O})\neq 0$
.
Proof We have via Lemma 5.2 that there is a lattice
$T_{f} \subset V_{f}$
so that the residual representation
${\overline {\rho }}_{f}: G_{\mathbf {Q}} \rightarrow \operatorname {\mathrm {GL}}_4(\mathbf {F})$
has the form

and is not semisimple. The fact that
$\psi $
as in (5.3) gives a non-trivial class
$[\psi ]$
in
$H^1(\mathbf {Q},\operatorname {\mathrm {Hom}}_{\mathbf {F}}({\overline {\rho }}_2, {\overline {\rho }}_1))= H^1(\mathbf {Q}, \operatorname {\mathrm {ad}}\rho _{\phi }(2-k)\otimes E/\mathcal {O}[\lambda ])$
is clear. We need to show that
$[\psi ]$
lies in
$H^1_f(\mathbf {Q}, \operatorname {\mathrm {ad}}\rho _{\phi }(2-k)\otimes E/\mathcal {O}[\lambda ])$
and that the latter group injects into
$H_f^1(\mathbf {Q}, \operatorname {\mathrm {ad}}\rho _{\phi }(2-k)\otimes E/\mathcal {O})$
.
We first show that
$[\psi ]$
satisfies the conditions to be in
$H^1_{f}(\mathbf {Q}, \operatorname {\mathrm {ad}}\rho _{\phi }(2-k)\otimes E/\mathcal {O}[\lambda ])$
. We have that
${\rho }_f$
is unramified at all primes
$p \neq \ell $
, so the local conditions are satisfied for all primes
$p \neq \ell $
.
Since f has level one and weight k,
$\rho _{f}|_{D_{\ell }}$
is crystalline with Hodge–Tate weights in
$[0, 2k-3] \subset I=-I$
. Hence,
${\overline {\rho }}_f$
(considered as a
$G_{\mathbf {Q}_{\ell }}$
-module) belongs to
$\mathrm {Rep}_{\mathrm {free}, \mathbf {F}}^{\mathrm {cris}, I}(G_{\mathbf {Q}_{\ell }})$
and gives rise to an element of
$\mathrm {Ext}^1_{ {\mathrm {Rep}_{\mathrm {free}, \mathbf {F}}^{\mathrm {cris}, I}(G_{\mathbf {Q}_{\ell }})}}({\overline {\rho }}_\phi (k-2),{\overline {\rho }}_\phi )\subset \operatorname {\mathrm {Ext}}^1_{\mathbf {F}[G_{\mathbf {Q}_{\ell }}]}(\rho _\phi (k-2) \otimes E/\mathcal {O}[\lambda ], \rho _\phi \otimes E/\mathcal {O}[\lambda ]).$
By our choice of
$I,$
we can use (4.3) and Proposition 4.8 to get a non-zero element in

As this extension maps to
$[\psi |_{G_{\mathbf {Q}_\ell }}]$
in
$H^1(\mathbf {Q}_{\ell }, \operatorname {\mathrm {ad}}\rho _{\phi }(2-k)\otimes E/\mathcal {O}[\lambda ])$
under the canonical isomorphism
$\operatorname {\mathrm {Ext}}^1_{\mathbf {F}[G_{\mathbf {Q}_{\ell }}]}(\mathbf {F}, \operatorname {\mathrm {ad}}\rho _{\phi }(2-k)\otimes E/\mathcal {O}[\lambda ])\cong H^1(\mathbf {Q}_{\ell }, \operatorname {\mathrm {ad}}\rho _{\phi }(2-k)\otimes E/\mathcal {O}[\lambda ])$
, we conclude that

Therefore, we have established that
$[\psi ] \in H^1_{f}(\mathbf {Q}, \operatorname {\mathrm {ad}}\rho _{\phi }(2-k)\otimes E/\mathcal {O}[\lambda ]).$
By Proposition 5.1, this group is isomorphic to
$H^1_{f}(\mathbf {Q}, \operatorname {\mathrm {ad}}\rho _{\phi }(2-k)\otimes E/\mathcal {O})[\lambda ]$
if
$H^0(\mathbf {Q}, \operatorname {\mathrm {ad}}\rho _{\phi }(2-k)\otimes E/\mathcal {O}[\lambda ])=0$
. The latter holds since

as
${\overline {\rho }}_\phi $
and
${\overline {\rho }}_\phi (k-2)$
are absolutely irreducible (by assumption) and non-isomorphic since
$k-2 \not \equiv 0, \frac {\ell -1}{2}\ \pmod {\ell -1}$
as
$\ell>4k-5$
and
$k \neq 2$
(cf. the proof of Lemma 5.3).
Lemma 5.6 Let n be an even integer satisfying
$3-2k<n \leq 0$
. Assuming
$\ell \nmid \#\operatorname {\mathrm {Cl}}_{\mathbf {Q}(\zeta _{\ell })^{+}}^{\overline {\epsilon }^{n}}$
, one has
$H^1_f(\mathbf {Q}, \mathbf {F}(n))=0$
and, if additionally
$n \neq 0$
,
$H^1_f(\mathbf {Q}, E/\mathcal {O}(n))=0$
.
Proof We see from Proposition 4.20 that any cohomology class in
$H^1_f(\mathbf {Q}, \mathbf {F}(n))$
must vanish when restricted to
$I_{\ell }$
. As all classes in
$H^1_f(\mathbf {Q}, \mathbf {F}(n))$
are unramified away from
$\ell $
, we get that they are unramified everywhere. Using inflation-restriction sequence where
$H=\operatorname {\mathrm {Gal}}(\mathbf {Q}(\zeta _{\ell })^{+}/\mathbf {Q}),$
we see that

Note that everywhere unramified classes map to homomorphisms that kill all the inertia groups. Hence, the image of
$H^1_f(\mathbf {Q}, \mathbf {F}(n))$
lands inside
$\operatorname {\mathrm {Hom}}\left (\operatorname {\mathrm {Cl}}_{\mathbf {Q}(\zeta _{\ell })^{+}}^{\overline {\epsilon }^{n}}, \mathbf {F}\right )=0$
.
Note that a torsion
$\mathcal {O}$
-module M is zero if and only if
$M[\lambda ]=0$
. Therefore, the vanishing of
$H^1_f(\mathbf {Q}, E/\mathcal {O}(n))$
follows from Proposition 5.1, which tells us that
$H^1_f(\mathbf {Q}, E/\mathcal {O}(n))[\lambda ]=H^1_f(\mathbf {Q}, \mathbf {F}(n))$
if
$H^0(\mathbf {Q},E/\mathcal {O}(n))=0$
. We know that
$H^0(\mathbf {Q}_\ell ,E/\mathcal {O}(n)[\lambda ])=H^0(\mathbf {Q},\mathbf {F}(n))=0$
for
$n \neq 0$
since
$n \not \equiv 0\ \pmod {\ell -1}$
under our assumption
$\ell>4k-5$
.
Corollary 5.7 Let
$\phi \in S_{k}(\Gamma _1)$
be as in Theorem 3.5 and assume the hypotheses of Theorem 3.5 are satisfied. Assuming
$\ell \nmid \#\operatorname {\mathrm {Cl}}_{\mathbf {Q}(\zeta _{\ell })^{+}}^{\overline {\epsilon }^{2-k}}$
, one has
$H^1_f(\mathbf {Q}, \operatorname {\mathrm {ad}}^0\rho _{\phi }(2-k)\otimes E/\mathcal {O})\neq 0$
.
Remark 5.8 If we assume Vandiver’s conjecture for the prime
$\ell $
, this gives that
$\ell \nmid \#\operatorname {\mathrm {Cl}}_{\mathbf {Q}(\zeta _{\ell })^{+}}^{\overline {\epsilon }^{2-k}}$
.
6 Modularity
We begin with the following commutative algebra result that will be useful in this section.
Lemma 6.1 If J is an ideal of
$\mathbf {F}[[X_1,\dots , X_n]]$
that is strictly contained in the maximal ideal, then
$\mathbf {F}[[X_1,\dots , X_n]]/J$
admits an
$\mathbf {F}$
-algebra surjection to
$\mathbf {F}[T]/T^2$
.
Proof For a positive integer
$k,$
let
$I_k$
be the ideal of
$\mathbf {F}[[X_1,\dots , X_k]]$
generated by all the monomials of degree at least 2. Set
$S_k:=\mathbf {F}[[X_1,\dots , X_k]]/I_k$
and write
$\phi _k: \mathbf {F}[[X_1,\dots , X_k]]\to S_k$
for the canonical
$\mathbf {F}$
-algebra surjection. If
$\phi _n(J)=0$
, then composing
$\phi _n$
with the map
$S_n\to \mathbf {F}[[T]]/T^2$
sending
$X_1$
to T and
$X_i$
for
$i>1$
to zero gives the desired surjection.
Now suppose
$\phi _n(J)\neq 0$
. Without loss of generality (renumbering the variables if necessary), we may assume then that J contains an element of the form
$u:=X_n+f(X_1,\dots , X_{n-1})+g(X_1,\dots , X_n)$
, where f is homogeneous of degree one and all the terms in g have degree at least 2. Note that we can assume without loss of generality that some power of
$X_n$
appears in g. (Indeed, if g contains no
$X_n$
then we replace u by
$u+u^2 \in J$
.) By Theorem 7.16(a) in [Reference Eisenbud23] there is a unique
$\mathbf {F}$
-algebra map from
$\mathbf {F}[[X_1,\dots , X_n]]$
to itself sending
$X_n$
to
$-f-g$
and
$X_i$
to itself for
$i<n$
. In other words, for any power series
$h(X_1,\dots , X_n)$
, the element
$h(X_1,\dots , X_{n-1}, -f-g)$
also lives in
$\mathbf {F}[[X_1, \dots , X_n]]$
and we denote it by
$h'(X_1, \dots , X_n)$
. Clearly,
$h-h'\in J$
.
Thus, for any power series
$h,$
where the smallest total degree of any term containing
$X_n$
is
$s,$
we have

for some power series
$h'$
with the smallest total degree of any term containing
$X_n$
equal to
$s'>s$
. By the same process, we get an
$h"$
such that
$h' \equiv h"\ \mod {J}$
and the smallest total degree of any term
$X_n$
in
$h"$
is strictly greater than
$s'$
. This way, we can construct a sequence of power series
$h_s$
where for every
$s,$
we have the smallest total degree of any term containing
$X_n$
being greater than or equal to s and such that
$h- h_s\in J$
for every s. We note that
$h_s$
is a Cauchy sequence with respect to the
$(X_1,\dots ,X_n)$
-adic topology (indeed, for
$t,u>s,$
we see that
$h_t-h_u$
lies in
$(X_1,\dots ,X_n)^s$
). Set
$h_0=\lim _{s\to \infty }h_s$
. As J is a closed ideal, we get that
$h_0-h\in J$
. For every
$s,$
we have

for some
$w_s \in \mathbf {F}[[X_1, \dots , X_{n-1}]]$
. Note that the
$w_s$
also form a Cauchy sequence since
$h_s$
does. Set
$w:= \lim _{s \to \infty } w_s \in \mathbf {F}[[X_1, \dots , X_{n-1}]]$
. Thus,
$h_0 \equiv w$
modulo
$\bigcap _s (X_n^s) \subset \bigcap _s (X_1, \ldots , X_n)^s=0$
, so
$h_0 \in \mathbf {F}[[X_1, \dots , X_{n-1}]]$
.
Hence, the natural
$\mathbf {F}$
-algebra map
$\psi _{n-1}:\mathbf {F}[[X_1,\dots , X_{n-1}]]\to \mathbf {F}[[X_1, \dots , X_n]]/J$
given by
$h_0 \mapsto h_0+J$
is surjective. Thus, we get an
$\mathbf {F}$
-algebra isomorphism
$\mathbf {F}[[X_1,\dots , X_n]]/J \to \mathbf {F}[[X_1, \dots , X_{n-1}]]/J_{n-1}$
, where
$J_{n-1}=\ker \psi _{n-1}$
.
If
$\phi _{n-1}(J_{n-1})\neq 0$
, continue this way obtaining a sequence of ideals
$J_{n-2},J_{n-3},\ldots $
. If at any stage (
$1\leq r\leq n-2$
), we get
$\phi _{n-r}(J_{n-r})=0$
, then we are done. Otherwise, we can eliminate all but one variable and get
$\mathbf {F}[[X_1,\dots , X_n]]/J\cong \mathbf {F}[[X_1]]/J_1$
and now we must have
$\phi _1(J_1)=0$
as otherwise
$J_1$
and hence J is maximal.
Recall that in the earlier sections we fixed the weight
$k \geq 12$
even and prime
$\ell> 4k-5$
and imposed Assumption 3.1 on the field
$E/\mathbf {Q}_\ell $
. We also fixed the Fontaine–Laffaille interval
$I=[3-2k, 2k-3]$
. Let
$\phi \in S_k(\Gamma _1)$
be a newform such that
${\overline {\rho }}_{\phi }$
is irreducible. The goal of this section is to prove a modularity theorem under the following assumption.
Assumption 6.2 For k and
$\phi $
as above, we assume that:
-
(i) there exists
$f \in S_k(\Gamma _2)$ such that
$f\equiv _{\mathrm {ev}}E^{2,1}_{\phi }$ (mod
$\lambda $ ), and
-
(ii)
$\#H^1_f(\mathbf {Q}, \mathrm {ad}^0\rho _{\phi }(2-k)\otimes _{\mathcal {O}}E/\mathcal {O})=\#\mathcal {O}/\lambda $ (recall that the left-hand side is independent of the choice of lattice, see Remark 5.4), and
-
(iii)
$H^1_f(\mathbf {Q}, \operatorname {\mathrm {ad}}^0{\overline {\rho }}_{\phi })=0.$
Remark 6.3 Assumption 6.2 (i) is satisfied under the assumptions of Theorem 3.5, and so is one inequality in Assumption 6.2 (ii) under the assumptions of Corollary 5.7.
We impose Assumption 6.2 and fix f as in Assumption 6.2 in what follows. We will write
$G_{\{\ell \}}$
for the Galois group of the maximal Galois extension of
$\mathbf {Q}$
unramified away from
$\ell $
. Let
$\rho _f: G_{\{\ell \}}\to \operatorname {\mathrm {GL}}_4(E)$
be as in Theorem 2.1. Lemma 3.4 gives that
$\rho _{f}$
is irreducible. We will use Mazur’s deformation theory and refer the reader to standard references, such as [Reference Cornell, Silverman and Stevens19, Reference Ramakrishna43] for the definitions and basic properties.
Definition 6.4 For
$B \in \mathrm {LCN}_{\mathcal {O}}$
, we say that a representation
$\rho :G_{\mathbf {Q}_\ell } \to \operatorname {\mathrm {GL}}_n(B)$
is Fontaine–Laffaille (with Hodge–Tate weights in
$-I$
) if
$\rho \otimes _B A$
lies in
$\mathrm {Rep}_{\mathrm {free}, A}^{\mathrm {cris}, -I}(G_{\mathbf {Q}_\ell })$
(see Definition 4.9(v)) for every Artinian quotient A of B. By Theorem 4.14(iv), this is equivalent to requiring
$\rho \otimes _B A$
to lie in the essential image of the Fontaine–Laffaille functor.
Remark 6.5 We know that any choice of
$\mathcal {O}$
-lattice
$\rho _L$
in
$\rho _\phi $
or
$\rho _f$
is Fontaine–Laffaille in this sense, since their restrictions to
$G_{\mathbf {Q}_\ell }$
lie in
$\mathrm { Rep}_{\mathbf {Z}_\ell }^{\mathrm {cris}, -I}(G_{\mathbf {Q}_{\ell }})$
and therefore in the essential image of the Fontaine–Laffaille functor by Theorem 4.14(iii). Since they are also free
$\mathcal {O}$
-modules this implies by Theorem 4.14 (iii) and (iv) that
$\rho _L \otimes B$
lies in
$\mathrm {Rep}_{\mathrm {free}, A}^{\mathrm {cris}, -I}(G_{\mathbf {Q}_\ell })$
for every Artinian quotient B of
$\mathcal {O}$
.
For any local complete Noetherian
$\mathcal {O}$
-algebra A with residue field
$\mathbf {F}$
by a deformation of a residual Galois representation
$\tau : G_{\{\ell \}} \to \operatorname {\mathrm {GL}}_n(\mathbf {F}),$
we will mean a strict equivalence class of lifts
$\tilde {\tau }:G_{\{\ell \}}\to \operatorname {\mathrm {GL}}_n(A)$
of
$\tau $
that are Fontaine–Laffaille at
$\ell $
. This deformation condition is introduced in [Reference Berger and Klosin6, Section 5.3] and [Reference Clozel, Harris and Taylor18, p. 35]
As is customary, we will denote a strict equivalence class of deformations by any of its members. If
$\tau $
has scalar centralizer then this deformation problem is representable by a local complete Noetherian
$\mathcal {O}$
-algebra which we will denote by
$R_{\tau }$
[Reference Ramakrishna44]. In particular, the identity map in
$\operatorname {\mathrm {Hom}}_{\mathcal {O}-\mathrm {alg}}(R_{\tau },R_{\tau })$
furnishes what is called the universal deformation
$\tau ^{\mathrm {univ}}: G_{\{\ell \}}\to \operatorname {\mathrm {GL}}_n(R_{\tau })$
.
Lemma 6.6 One has
$R_{{\overline {\rho }}_{\phi }}\cong R_{{\overline {\rho }}_{\phi }(k-2)}\cong \mathcal {O}$
. Furthermore,
$\rho _{\phi }$
(resp.,
$\rho _{\phi }(k-2)$
) is the unique deformation of
${\overline {\rho }}_{\phi }$
(resp.,
${\overline {\rho }}_{\phi }(k-2)$
) to
$\operatorname {\mathrm {GL}}_2(\mathcal {O})$
.
Proof We have

where the first equality follows from the fact that our deformation condition is the property of being Fontaine–Laffaille (see, e.g., [Reference Clozel, Harris and Taylor18, Section 2.4.1]), and the second one holds since we have
$H^1_f(\mathbf {Q}, \operatorname {\mathrm {ad}}{\overline {\rho }}_{\phi })=H^1_f(\mathbf {Q}, \operatorname {\mathrm {ad}}^0{\overline {\rho }}_{\phi }) \oplus H^1_f(\mathbf {Q}, \mathbf {F})=0$
and
$H^1_f(\mathbf {Q}, \mathbf {F})=0$
by Lemma 5.6 as we have imposed Assumption 6.2(iii).
By Theorem 7.16 in [Reference Eisenbud23] we know that any local complete Noetherian
$\mathcal {O}$
-algebra with residue field
$\mathbf {F}$
is a quotient of
$\mathcal {O}[[X_1, \dots , X_n]]$
for some positive integer n. Hence,
$S:=R_{{\overline {\rho }}_{\phi }}/(\lambda R_{{\overline {\rho }}_{\phi }})\cong \mathbf {F}[[X_1, \dots , X_n]]/J$
for some ideal J. Suppose first that J is not maximal. Then, by Lemma 6.1, we know that S admits a surjection
$\varphi $
to
$\mathbf {F}[T]/T^2$
. This contradicts (6.1), hence
$S=\mathbf {F}$
. We now use the complete version of Nakayama’s Lemma to conclude that the structure map
$\mathcal {O}\to R_{{\overline {\rho }}_{\phi }}$
is surjective (cf. [Reference Eisenbud23, Exercise 7.2] or [Reference Matsumura37, Theorem 8.4]). Let us briefly explain why this version applies here. As
$R_{{\overline {\rho }}_{\phi }}\otimes _{\mathcal {O}}\mathbf {F}\neq 0$
, we see that
$\lambda \in \mathfrak {m}$
, where
$\mathfrak {m}$
is the maximal ideal of
$R_{{\overline {\rho }}_{\phi }}$
. Hence,

The latter intersection is zero, since
$R_{{\overline {\rho }}_{\phi }}$
is complete, so separated with respect to
$\mathfrak {m}$
. Hence, (6.2) implies that
$R_{{\overline {\rho }}_{\phi }}$
is separated with respect to
$\lambda R_{{\overline {\rho }}_{\phi }}$
allowing for the application of the complete version of Nakayama’s Lemma.
As
$\rho _{\phi }$
is a deformation to
$\mathcal {O}$
, we conclude that
$R_{{\overline {\rho }}_{\phi }}=\mathcal {O}$
. This implies that if
$\rho : G_{\{\ell \}}\to \operatorname {\mathrm {GL}}_2(\mathcal {O})$
is any deformation of
${\overline {\rho }}_\phi $
, one has
$\rho \cong \rho _{\phi }$
. Similarly, if
$\rho : G_{\{\ell \}} \to \operatorname {\mathrm {GL}}_2(\mathcal {O})$
is a deformation of
${\overline {\rho }}_{\phi }(k-2)$
then
$\rho (2-k)$
is a deformation of
${\overline {\rho }}_{\phi }$
. Note that our choice of
$I=[3-2k, 2k-3]$
means that this twisting stays inside our category of Fontaine–Laffaille representations. Hence, we get that
$\rho (2-k)\cong \rho _{\phi }$
, and so we are done.
Remark 6.7 Note that the determinant of our deformations is automatically fixed as
$H^1_f(\mathbf {Q}, \operatorname {\mathrm {ad}}{\overline {\rho }}_{\phi })=H^1_f(\mathbf {Q}, \operatorname {\mathrm {ad}}^0{\overline {\rho }}_{\phi })$
under our assumptions. This means that all deformations
$\rho $
of
${\overline {\rho }}_{\phi }$
(respectively,
${\overline {\rho }}_{\phi }(k-2)$
) satisfy
$\det \rho =\epsilon ^{k-1}$
(respectively,
$\det \rho =\epsilon ^{2k-3}$
).
Remark 6.8 Regarding Assumption 6.2(iii), we note that if one additionally assumes that
${\overline {\rho }}_\phi $
is absolutely irreducible when restricted to
$\operatorname {\mathrm {Gal}}(\overline {\mathbf {Q}}/\mathbf {Q}(\sqrt {(-1)^{(\ell -1)/2}\ell })$
then [Reference Diamond, Flach and Guo20, Theorem 3.7] (see also [Reference Hida28, Theorem 5.20] relates
$H^1_f(\mathbf {Q}, \mathrm {ad}^0 \rho _\phi \otimes E/\mathcal {O})$
(via an
$R_{{\overline {\rho }}_{\phi }}=\mathbf {T}$
theorem) to a congruence ideal
$\eta _\phi ^\emptyset $
. One can use Proposition 5.1 to see that
$H^1_f(\mathbf {Q}, \operatorname {\mathrm {ad}}^0{\overline {\rho }}_{\phi })=H^1_f(\mathbf {Q}, \mathrm {ad}^0 \rho _\phi \otimes E/\mathcal {O})[\lambda ]=0$
if
$\eta _\phi ^\emptyset $
is coprime to
$\ell $
.
Lemma 6.9 Let G be a group and F be a field. For
$i\in \{1,2\}$
, let
$n_i \in \mathbf {Z}_+$
and
$\rho _i: G \to \operatorname {\mathrm {GL}}_{n_i}(F)$
be an irreducible representation with
$\rho _1\not \cong \rho _2$
. Let
$\rho : G \to \operatorname {\mathrm {GL}}_{n_1+n_2}(F)$
be a representation such that

Then,
$\rho $
has scalar centralizer.
Proof This is a simple consequence of Schur’s Lemma and the fact that
$\tilde {a}: g \to \rho _2(g)^{-1}a(g)$
defines a cocycle from G to
$\operatorname {\mathrm {Hom}}(\rho _2, \rho _1)$
which is not a coboundary.
Fix a lattice in the space of
$\rho _f$
as in Lemma 5.2, i.e., such that
${\overline {\rho }}_{f}=\left [ \begin {matrix} {\overline {\rho }}_{\phi } &* \\ &{\overline {\rho }}_{\phi }(k-2)\end {matrix} \right ]: G_{\{\ell \}}\to \operatorname {\mathrm {GL}}_4(\mathbf {F})$
is non-semisimple. For simplicity, we will write R for the universal deformation ring
$R_{{\overline {\rho }}_{f}}$
of
${\overline {\rho }}_{f}$
and
$\rho ^{\mathrm {univ}}: G_{\{\ell \}}\to \operatorname {\mathrm {GL}}_4(R)$
for the universal deformation. Note that the deformation problem is representable because
${\overline {\rho }}_{f}$
is non-semisimple with irreducible, mutually non-isomorphic Jordan–Holder factors, hence by Lemma 6.9, the centralizer of
${\overline {\rho }}_f$
consists of only scalar matrices. We say that a deformation
$\tilde \rho $
is upper-triangular if
$\tilde \rho $
is strictly equivalent to a deformation of
${\overline {\rho }}_{f}$
of the form
$\left [ \begin {matrix} *&*\\ 0&*\end {matrix} \right ]$
with the stars representing
$2\times 2$
blocks.
Lemma 6.10 There do not exist any non-trivial deformations of
${\overline {\rho }}_{f}$
into
$\operatorname {\mathrm {GL}}_4(\mathbf {F}[X]/X^2)$
that are upper-triangular.
Proof We use Proposition 7.2 in [Reference Berger and Klosin6] noting that Assumption 6.1(i) in [loc.cit.] is satisfied because we impose the current Assumption 6.2(ii). On the other hand, Assumption 6.1(ii) in [loc.cit.] is satisfied because of Lemma 6.6.
Definition 6.11 The smallest ideal I of R such that
$\text {tr}\hspace {2pt} \rho ^{\mathrm {univ}}$
is the sum of two pseudocharacters mod I will be called the reducibility ideal of R. We will denote this ideal by
$I_{\mathrm {re}}$
.
Proposition 6.12 Let
$I\subset R$
be an ideal such that
$R/I$
is an Artin ring. Then,
$I\supset I_{\mathrm {re}}$
if and only if
$\rho ^{\mathrm {univ}}$
(mod I) is upper-triangular.
Proof This is proved as Corollary 7.8 in [Reference Berger and Klosin6].
Corollary 6.13 The structure map
$\mathcal {O}\to R/I_{\mathrm {re}}$
is surjective and descends to an isomorphism
$\mathcal {O}/\lambda ^s \to R/I_{\mathrm {re}}$
for some
$s\in \mathbf {Z}_{\geq 0}\cup \{\infty \}$
. In fact, one has

Proof By Theorem 7.16 in [Reference Eisenbud23] we know that any local complete Noetherian
$\mathcal {O}$
-algebra with residue field
$\mathbf {F}$
is a quotient of
$\mathcal {O}[[X_1, \dots , X_n]]$
for some positive integer n. Hence,
$S:=R/(I_{\mathrm {re}}+\lambda R)\cong \mathbf {F}[[X_1, \dots , X_n]]/J$
for some ideal J. Suppose first that J is not maximal. Then, by Lemma 6.1, we know that S admits a surjection
$\varphi $
to
$\mathbf {F}[T]/T^2$
. This means that there exists a non-trivial (because the image of
$\varphi $
is not contained in
$\mathbf {F}$
) deformation of
$\rho $
to
$\mathbf {F}[T]/T^2$
which is upper-triangular (by Proposition 6.12), which contradicts Lemma 6.10. Thus, indeed,
$S=\mathbf {F}$
.
Hence, the structure map
$\mathcal {O}\to R/I_{\mathrm {re}}$
is surjective by the complete version of Nakayama’s Lemma (see the proof of Lemma 6.6). So,
$R/I_{\mathrm {re}}\cong \mathcal {O}/\lambda ^s$
for some
$s\in \mathbf {Z}_{\geq 0}\cup \{\infty \}$
.
The composition of
$\rho ^{\mathrm {univ}}$
with the map
$R\to R/I_{\mathrm {re}}$
gives rise to a deformation
$\rho _{\mathrm {re}}: G_{\{\ell \}}\to \operatorname {\mathrm {GL}}_4(R/I_{\mathrm {re}})=\operatorname {\mathrm {GL}}_4(\mathcal {O}/\lambda ^s)$
. By Proposition 6.12, this deformation is upper triangular, i.e., one has
$\rho _{\mathrm {re}}=\left [ \begin {matrix} *_1&*_2\\ &*_3\end {matrix} \right ].$
As the property of being Fontaine–Laffaille is preserved by subobjects and quotients, we see that
$*_1$
and
$*_3$
are Fontaine–Laffaille representations with values in
$\operatorname {\mathrm {GL}}_2(R/I_{\mathrm { re}})=\operatorname {\mathrm {GL}}_2(\mathcal {O}/\lambda ^s)$
. Thus, by Lemma 6.6, we can conclude that
$*_1=\rho _{\phi }$
,
$*_3=\rho _{\phi }(k-2)$
mod
$\lambda ^s$
. Hence, by (5.4) and Proposition 5.1,
$*_2$
gives rise to a class in
$H_f^1(\mathbf {Q},\mathrm {ad}^0\rho _{\phi }(2-k)\otimes _{\mathcal {O}}E/\mathcal {O})$
as
$\rho _{\mathrm {re}}$
is Fontaine–Laffaille. As
$\rho $
is non-semi-simple, we conclude that
$*_2$
is not annihilated by
$\lambda ^{s-1}$
, i.e., the class of
$*_2$
gives rise to a subgroup of
$H^1_f(\mathbf {Q},\mathrm { ad}^0\rho _{\phi }(2-k)\otimes _{\mathcal {O}}E/\mathcal {O})$
isomorphic to
$\mathcal {O}/\lambda ^s$
. Thus,
$s\leq 1$
as
$\#H^1_f(\mathbf {Q}, \mathrm { ad}^0\rho _{\phi }(2-k)\otimes _{\mathcal {O}}E/\mathcal {O})\leq \#\mathcal {O}/\lambda $
by Assumption 6.2(ii). Finally,
$s>0$
as
${\overline {\rho }}_{f}$
itself is reducible. This concludes the proof.
The following proposition does not use Assumption 6.2(ii).
Proposition 6.14 Assume that
$\dim H^1_f(\mathbf {Q}, \operatorname {\mathrm {ad}}{\overline {\rho }}_{\phi }(k-2))\leq 1$
. Then, the ideal
$I_{\mathrm {re}}$
is a principal ideal.
Proof Since
$\rho ^{\mathrm {univ}}$
is a trace representation in the sense of Section 1.3.3 of [Reference Bellaïche and Chenevier4] Lemma 1.3.7 in [loc.cit.] tells us that we can conjugate
$\rho ^{\mathrm {univ}}$
by a matrix
$P \in \operatorname {\mathrm {GL}}_2(R)$
(here we use that every finite type projective R-module is free since R is local) to get
$\rho ^{\mathrm {univ}}$
adapted to a data of GMA idempotents for
$R[G_{\{\ell \}}]/\ker \rho ^{\mathrm {univ}}$
. By [Reference Bellaïche and Chenevier4, Lemma 1.3.8] we then get an isomorphism of R-modules

for ideals
$B, C \subset R$
. By [Reference Bellaïche and Chenevier4, Proposition 1.5.1] we further know that
$I_{\mathrm {re}}=BC$
.
[Reference Bellaïche and Chenevier4, Theorem 1.5.5] proves that there are injections
$\operatorname {\mathrm {Hom}}_R(B, \mathbf {F}) \hookrightarrow H^1(G_{\{\ell \}}, \operatorname {\mathrm {ad}}{\overline {\rho }}_{\phi }(2-k))$
and
$\operatorname {\mathrm {Hom}}_R(C, \mathbf {F}) \hookrightarrow H^1(G_{\{\ell \}}, \operatorname {\mathrm {ad}}{\overline {\rho }}_{\phi }(k-2)).$
Arguing as in [Reference Akers1, Proposition 4.2] (see also [Reference Wake and Wang-Erickson55, Theorem 4.3.5 and Remark 4.3.6] one sees that the images are contained in the Selmer groups
$H^1_f(\mathbf {Q}, \operatorname {\mathrm {ad}}{\overline {\rho }}_{\phi }(2-k))$
and
$H^1_f(\mathbf {Q}, \operatorname {\mathrm {ad}}{\overline {\rho }}_{\phi }(k-2))$
, respectively. From Assumption 6.2 (ii) and Proposition 5.1, we see that
$H^1(\mathbf {Q}, \operatorname {\mathrm {ad}}{\overline {\rho }}_{\phi }(2-k)) \cong \mathbf {F}$
. Together with the assumption
$\dim H^1_f(\mathbf {Q}, \operatorname {\mathrm {ad}}{\overline {\rho }}_{\phi }(k-2))\leq 1,$
we deduce by Nakayama’s Lemma that both B and C, and therefore also
$I_{\mathrm {re}}$
are principal ideals of R. Note that Nakayama’s Lemma applies since B and C are ideals in R, which is Noetherian, hence they are finitely generated over R.
Remark 6.15 [Reference Akers1, Proposition 3.10] proves the principality of the reducibility ideal of the reduced Fontaine–Laffaille deformation ring
$R^{\mathrm {red}}$
for any residual representations with two Jordan-Hölder factors. Our argument (while relying on [Reference Akers1, Proposition 4.2] is slightly more general as it allows us to treat the case of non-reduced deformation rings.
Remark 6.16 By (5.2), we have

However, as opposed to the case of the
$(2-k)$
-twist of the trivial representation (cf. proof of Lemma 5.6), there is no simple relation between
$H^1_f(\mathbf {Q}, \mathbf {F}(k-2))$
and part of a class group except for the case
$k=2$
by Proposition 4.20. By the same proposition for
$2<k\leq \ell ,$
the group
$H^1_f(\mathbf {Q}, \mathbf {F}(k-2))$
requires no ramification condition at
$\ell $
, so equals
$H^1(G_{\{\ell \}}, \mathbf {F}(k-2))$
.
We have the following results about
$H^1(G_{\{\ell \}}, \mathbf {F}(n))$
for
$n>0$
.
Proposition 6.17 [Reference Berger and Klosin8, Proposition 6.5]
Suppose
$n \in \mathbf {Z}_{>0}$
and
$n \not \equiv 1\ \mod {\ell -1}$
. Assume that
$\ell \nmid \#\operatorname {\mathrm {Cl}}_{\mathbf {Q}(\zeta _{\ell })}^{\overline {\epsilon }^{n}}$
. Then,
$\dim H^1(G_{\{\ell \}}, \mathbf {F}(n)) \leq 1$
.
Proposition 6.18 Let
$n>0$
be an even integer. Assume
$\ell \nmid B_n$
(the n-th Bernoulli number) and
$n \not \equiv 0\ \mod {\ell -1}$
. Then,
$H^1(G_{\{\ell \}}, \mathbf {F}(n))=0$
.
Proof Since n is even and
$H^0(G_{\{\ell \}}, \mathbf {F}(n))=0$
as
$n \not \equiv 0\ \mod {\ell -1}$
we know
$\dim _{\mathbf {F}} H^1(G_{\{\ell \}}, \mathbf {F}(n))=\dim _{\mathbf {F}} H^2(G_{\{\ell \}}, \mathbf {F}(n))$
by [Reference Neukirch, Schmidt and Wingberg40, Corollary 8.7.5] (Euler Poincare characteristic). [Reference Assim3, Proposition 1.3] (condition
$(ii, \beta )$
) proves that
$H^2(G_{\{\ell \}}, \mathbf {F}(n))=0$
if
$n \not \equiv 1 \ \mod {\ell -1}$
(which is automatically satisfied for even n) and
$\ell \nmid \#\operatorname {\mathrm {Cl}}_{\mathbf {Q}(\zeta _{\ell })}^{\overline {\epsilon }^{1-n}}$
. By Herbrand’s Theorem (see, e.g., [Reference Washington57, Theorem 6.17] the latter follows from our assumption that
$\ell \nmid B_n$
(here we use again
$n \not \equiv 0\ \mod {\ell -1}$
).
Remark 6.19 Note that the assumption
$\ell \nmid B_{n}$
is stronger than
$\ell \nmid \#\operatorname {\mathrm {Cl}}_{\mathbf {Q}(\zeta _{\ell })}^{\overline {\epsilon }^{n}}$
in [Reference Berger and Klosin8, Proposition 6.5] As noted in the proof of Proposition 6.18,
$\ell \nmid B_{n}$
implies
$\ell \nmid \#\operatorname {\mathrm {Cl}}_{\mathbf {Q}(\zeta _{\ell })}^{\overline {\epsilon }^{\ell -n}}$
by Herbrand’s Theorem. By the “reflection theorem” [Reference Washington57, Theorem 10.9] this means that also
$\ell \nmid \operatorname {\mathrm {Cl}}_{\mathbf {Q}(\zeta _{\ell })}^{\overline {\epsilon }^{n}}$
.
This allows us to prove the following modularity theorem.
Theorem 6.20 Recall that we impose Assumptions 3.1 and 6.2. Furthermore, assume that
$\dim H^1_f(\mathbf {Q}, \mathrm { ad}{\overline {\rho }}_{\phi }(k-2))\leq 1$
. Then, the structure map
$\iota :\mathcal {O}\to R$
is an isomorphism. In particular, if
$\tau :G_{\mathbf {Q}}\to \operatorname {\mathrm {GL}}_4(E)$
is any continuous irreducible homomorphism unramified outside
$\ell $
, crystalline at
$\ell $
with Hodge–Tate weights in
$[3-2k,2k-3]$
and such that

then
$\tau \cong \rho ^{\mathrm {univ}}\cong \rho _f$
, i.e., in particular,
$\tau $
is modular.
Proof It follows from Corollary 6.13 that
$I_{\mathrm {re}}$
is a maximal ideal of R. As the deformation
$\rho _f$
induces a surjective map
$j: R\to \mathcal {O}$
, we get the following commutative diagram of
$\mathcal {O}$
-algebra maps:

As
$\overline {\iota }$
is an isomorphism, we get that so is
$\overline {j}$
. So, using the fact that
$I_{\mathrm {re}}$
is principal (Proposition 6.14), we can now apply Theorem 6.9 in [Reference Berger and Klosin5] to the right square to conclude that j is an isomorphism.
Now, let
$\tau $
be as in the statement of the theorem. Then,
$\tau $
factors through a representation of
$G_{\{\ell \}}$
. Using that
$\tau $
is irreducible, Theorem 4.1 in [Reference Berger and Klosin9] allows us to find a lattice in the space of
$\tau $
such that with respect to that lattice, one has

that is non-semi-simple. Using Remark 6.5, we see that this lattice is Fontaine–Laffaille, so the star gives rise to a non-zero element in
$H^1_f(\mathbf {Q}, \mathrm {ad}^0\rho _{\phi \textbf {}}(2-k)\otimes _{\mathcal {O}}E/\mathcal {O})$
. As the latter group has order
$\#\mathcal {O}/\lambda $
by Assumption 6.2(ii), we conclude that
$\overline {\tau }\cong \rho $
. In particular,
$\tau $
is a deformation of
$\rho $
. Hence,
$\tau $
gives rise to an
$\mathcal {O}$
-algebra map
$R\to \mathcal {O}$
, which must equal j by the first part of the theorem.
Remark 6.21 We return to Example 3.6 and note that Assumption 6.2 (i) holds, as discussed earlier. Since
$\ell =163$
or
$187273$
do not divide
$(2k-1)(2k-3)k!$
for
$k=26$
and
${\overline {\rho }}_\phi $
is irreducible, [Reference Diamond, Flach and Guo20, Lemma 2.5] proves that
${\overline {\rho }}_\phi $
stays irreducible when restricted to
$\operatorname {\mathrm {Gal}}(\overline {\mathbf {Q}}/\mathbf {Q}(\sqrt {(-1)^{(\ell -1)/2}\ell }))$
. Via Remark 6.8, we can therefore check that
$H^1_f(\mathbf {Q}, \operatorname {\mathrm {ad}}^0{\overline {\rho }}_{\phi })=0$
as
$\phi $
is the only cusp form of weight 26 and level 1, so in particular,
$\phi $
is not congruent mod
$\ell $
to other forms. Since, in addition,
$L_{\mathrm {alg}}(50, \mathrm {Sym}^2 \phi )$
has
$\ell $
-valuation 1 for both
$\ell =163$
and
$187273,$
the Bloch–Kato conjecture for
$\#H^1_f(\mathbf {Q}, \mathrm {ad}^0 \rho _\phi (2-k) \otimes E/\mathcal {O})=\#\mathcal {O}/\lambda $
(see [Reference Dummigan22, Conjecture (5.2) and (5)] would imply that Assumption (ii) holds.
We do not know how to check
$\dim H^1_f(\mathbf {Q}, \mathrm {ad}{\overline {\rho }}_{\phi }(k-2))\leq 1$
, as the corresponding divisible Selmer group is not critical (in the sense of Deligne). Note that
$\dim H^1_f(\mathbf {Q}, \mathrm {ad}{\overline {\rho }}_{\phi }(k-2))=\dim H^1_f(\mathbf {Q}, \mathrm {ad}^0{\overline {\rho }}_{\phi }(k-2))$
by Proposition 6.18, since neither prime
$\ell $
divides
$B_{24}$
.
7 (Non-)principality of Eisenstein ideals
In this section, we formulate conditions when the Eisenstein ideal of the local Hecke algebra acting on
$S_k(\Gamma _2)$
is non-principal and
$\dim _{\mathbf {F}}H^1_f(\mathbf {Q}, \operatorname {\mathrm {ad}}^0{\overline {\rho }}_{\phi }(k-2))>1.$
In particular, in that case,
$R\not \cong \mathcal {O}$
.
Let
$\mathbf {T}'$
be as in Section 2. Let
$\mathbf {T}$
denote the
$\mathcal {O}$
-subalgebra of
$\mathbf {T}'\otimes _{\mathbf {Z}}\mathcal {O}$
generated by the operators
$T^{(2)}(p)$
and
$T_1^{(2)}(p^2)$
for all primes
$p\nmid \ell $
. Since strong multiplicity, one holds in the level one case, we can choose an orthogonal basis
$\mathcal {N}'$
of
$S_{k}(\Gamma _{2})$
consisting of eigenforms for all the operators in
$\mathbf {T}$
.
Each
$g\in \mathcal {N}'$
gives rise to
$\psi _g\in \operatorname {\mathrm {Hom}}_{\mathcal {O}-\mathrm {alg}}(\mathbf {T}, \mathcal {O}),$
where
$\psi _g(T)=\lambda _g(T)$
, with
$\lambda _g(T)$
the eigenvalue of the operator T corresponding to g. Thus, we get a map
$\Psi : \mathcal {N}'\to \operatorname {\mathrm {Hom}}_{\mathcal {O}-\mathrm {alg}}(\mathbf {T}, \mathcal {O})$
given by
$g\mapsto \lambda _g$
, which by strong multiplicity, one is an injection.
Lemma 7.1 The natural
$\mathcal {O}$
-algebra map

is injective and has finite cokernel, i.e.,
$\mathbf {T}$
can be viewed as a lattice in
$\prod _{g\in \mathcal {N}'}\mathcal {O}$
.
Proof The injectivity follows from the fact that the elements of
$\mathcal {N}'$
form a basis.
We will now show that the map has finite cokernel. Note that the (set) map
$\Psi \otimes \overline {\mathbf {Q}}_\ell : \mathcal {N}'\to \operatorname {\mathrm {Hom}}_{\overline {\mathbf {Q}}_{\ell }-\mathrm {alg}}(\mathbf {T}\otimes \overline {\mathbf {Q}}_{\ell },\overline {\mathbf {Q}}_{\ell })\hookrightarrow \operatorname {\mathrm {Hom}}_{\overline {\mathbf {Q}}_{\ell }}(\mathbf {T}\otimes \overline {\mathbf {Q}}_{\ell }, \overline {\mathbf {Q}}_{\ell })$
given by
$g \mapsto \lambda _g \otimes \overline {\mathbf {Q}}_\ell $
is injective (because
$\Psi $
is injective), and strong multiplicity one implies that no non-trivial linear relation
$\sum _{g\in \mathcal {N}'} c_g \lambda _{g}=0$
can hold. Thus, the set
$\{\lambda _g\mid g\in \mathcal {N}'\}$
is a linearly independent subset of
$\operatorname {\mathrm {Hom}}_{\overline {\mathbf {Q}}_{\ell }}(\mathbf {T}\otimes \overline {\mathbf {Q}}_{\ell }, \overline {\mathbf {Q}}_{\ell })$
. Hence,

Tensoring the map (7.1) with
$\overline {\mathbf {Q}}_{\ell }$
we get a corresponding map
$\mathbf {T}\otimes \overline {\mathbf {Q}}_{\ell }\to \prod _{g\in \mathcal {N}'}\overline {\mathbf {Q}}_{\ell }$
, which is injective as (7.1) is. Thus, it must be surjective by (7.2). Hence, the map (7.1) has finite cokernel.
We now identify
$\mathbf {T}$
with the image of the map (7.1) and note that
$\mathbf {T}=\prod _{\mathfrak {m} \in \mathrm {MaxSpec}\mathbf {T}}\mathbf {T}_{\mathfrak {m}},$
where
$\mathbf {T}_{\mathfrak {m}}$
is the localization of
$\mathbf {T}$
at the maximal ideal
$\mathfrak {m}$
. Let
$\mathcal {N}$
be the subset of
$\mathcal {N}'$
consisting of all the
$g\in \mathcal {N}'$
which satisfy

We write
$\mathfrak {m}$
for the corresponding maximal ideal. Set
$J\subset \mathbf {T}$
to be the Eisenstein ideal, i.e., J is the ideal of
$\mathbf {T}$
generated by the set
$\{T^{(2)}(p)-(\text {tr}\hspace {2pt} \rho _{\phi }(\operatorname {\mathrm {Frob}}_p)+\text {tr}\hspace {2pt}\rho _{\phi }(k-2)(\operatorname {\mathrm {Frob}}_p))\mid p\neq \ell \}.$
Write
$J_{\mathfrak {m}}$
to be the image of J under the canonical map
$\mathbf {T}\to \mathbf {T}_{\mathfrak {m}}$
.
Recall that we fixed in Section 5.2 the weight
$k \geq 12$
even and prime
$\ell> 4k-5$
and imposed Assumption 3.1 on the field
$E/\mathbf {Q}_\ell $
. We also fixed the Fontaine–Laffaille interval
$I=[3-2k, 2k-3]$
. Let
$\phi \in S_k(\Gamma _1)$
be a newform such that
${\overline {\rho }}_{\phi }$
is irreducible.
For the rest of this section, we also impose Assumption 6.2 and fix the corresponding
$f \in S_k(\Gamma _2)$
. Then,
$f\in \mathcal {N}$
, i.e.,
$\mathbf {T}_{\mathfrak {m}}/J_{\mathfrak {m}}\neq 0$
. Let
$R=R_{{\overline {\rho }}_f}$
be the universal deformation ring defined in Section 6.
Theorem 7.2 Recall that we impose Assumptions 3.1 and 6.2. Then, there exists a surjective
$\mathcal {O}$
-algebra map
$\varphi : R\to \mathbf {T}_{\mathfrak {m}}$
such that
$\varphi (I_{\mathrm {re}})=J_{\mathfrak {m}}$
and
$J_{\mathfrak {m}}$
is a maximal ideal of
$\mathbf {T}_{\mathfrak {m}}$
. If, in addition,
$\dim _{\mathbf {F}} H^1_f(\mathbf {Q}, \mathrm {ad}{\overline {\rho }}_{\phi }(k-2))\leq 1$
, then all of the following are true:
-
• the map
$\varphi $ is an isomorphism;
-
• the Hecke ring
$\mathbf {T}_{\mathfrak {m}}$ is isomorphic to
$\mathcal {O}$ ;
-
• the Eisenstein ideal
$J_{\mathfrak {m}}$ is principal.
Proof Let
$g\in \mathcal {N}$
. Then, by Lemma 5.2, there exists a
$G_{\mathbf {Q}}$
-stable lattice with respect to which one has
${\overline {\rho }}_{g}=\left [ \begin {matrix} {\overline {\rho }}_{\phi }&* \\ & {\overline {\rho }}_{\phi }(k-2)\end {matrix} \right ]$
and is not semi-simple. Hence, the
$*$
gives rise to an element in
$H^1_f(\mathbf {Q}, W[\lambda ])$
, where
$W=\mathrm {ad}^0\rho _{\phi }(2-k)\otimes _{\mathcal {O}}E/\mathcal {O}$
.
By (5.4) and Proposition 5.1, we get
$H^1_f(\mathbf {Q}, W[\lambda ])=H^1_f(\mathbf {Q}, W)[\lambda ]$
. The latter group is cyclic by Assumption 6.2 (ii), so we must have that
${\overline {\rho }}_{g}\cong {\overline {\rho }}_f$
, and so after adjusting the basis, if necessary, we get that
$\rho _{g}$
is a deformation of
${\overline {\rho }}_f$
.
This implies that for every
$g\in \mathcal {N,}$
we get an
$\mathcal {O}$
-algebra (hence continuous) map
$\varphi _{g}:R\to \mathcal {O}$
with the property that
$\text {tr}\hspace {2pt}\rho ^{\mathrm { univ}}(\operatorname {\mathrm {Frob}}_p) \mapsto \lambda _{g}(T^{(2)}(p))$
. This property completely determines
$\varphi _{g}$
because R is topologically generated by the set
$\{\text {tr}\hspace {2pt}\rho ^{\mathrm { univ}}(\operatorname {\mathrm {Frob}}_p)\mid p\neq \ell \}$
by Proposition 7.13 in [Reference Berger and Klosin6]. Putting these maps together we get an
$\mathcal {O}$
-algebra map
$\varphi : R\to \prod _{g\in \mathcal {N}}\mathcal {O}$
whose image is an
$\mathcal {O}$
-subalgebra of
$\prod _{g\in \mathcal {N}}\mathcal {O}$
generated by
$\{T^{(2)}(p)\mid p\neq \ell \}$
. Note that
$\varphi (R)\subset \mathbf {T}_{\mathfrak {m}}$
. To see the opposite inclusion consider the characteristic polynomial
$f_p(X)\in R[X]$
of
$\rho ^{\mathrm {univ}}(\operatorname {\mathrm {Frob}}_p)$
for
$p\neq \ell $
. Combining Theorem 2.1 with the definition of
$L_p(X, f; \mathrm {spin}),$
we see that the coefficient at
$X^2$
is mapped by
$\varphi $
to
$T^{(2)}(p)^2-T^{(2)}(p^2)-p^{2k-4}\in \prod _{g\in \mathcal {N}}\mathcal {O}$
. As
$T^{(2)}(p)$
and
$p^{2k-4}$
both belong to
$\varphi (R)$
, so therefore must
$T^{(2)}(p^2)$
. We now use the fact [Reference Andrianov2, 3.3.38] and [Reference Johnson-Leung and Roberts30, p. 547] that

to conclude that
$T_1^{(2)}(p^2) \in \varphi (R)$
. Hence,
$\varphi (R)$
contains all the Hecke operators away from
$\ell $
, i.e.,
$\varphi (R)=\mathbf {T}_{\mathfrak {m}}$
. We denote the resulting
$\mathcal {O}$
-algebra epimorphism
$R\to \mathbf {T}_{\mathfrak {m}}$
again by
$\varphi $
. We claim that
$\varphi (I_{\mathrm {re}})\subset J_{\mathfrak {m}}$
.
Indeed, using the Chebotarev Density Theorem, one sees that

so
$I_{\mathrm {re}}\subset \varphi ^{-1}(J_{\mathfrak {m}})$
. As
$\varphi $
is a surjection, this implies that
$\varphi (I_{\mathrm {re}})\subset J_{\mathfrak {m}}$
. Hence,
$\varphi $
gives rise to a sequence of
$\mathcal {O}$
-algebra surjections
$R/I_{\mathrm {re}}\to \mathbf {T}_{\mathfrak {m}}/\varphi (I_{\mathrm {re}})\to \mathbf {T}_{\mathfrak {m}}/J_{\mathfrak {m}}$
. As
$R/I_{\mathrm {re}}=\mathbf {F}$
by Corollary 6.13 we conclude that all these surjections are isomorphisms (note that
$\mathbf {T}_{\mathfrak {m}}/J_{\mathfrak {m}}\neq 0$
), hence
$\varphi (I_{\mathrm {re}})=J_{\mathfrak {m}}$
and
$J_{\mathfrak {m}}$
is maximal. This proves the first claim.
Now assume in addition that
$\dim H^1_f(\mathbf {Q}, \mathrm {ad}{\overline {\rho }}_{\phi }(k-2))\leq 1$
. Then, Theorem 6.20 gives us that
$R=\mathcal {O}$
, so we get that
$\varphi $
is an isomorphism, and so
$R\cong \mathbf {T}_{\mathfrak {m}}\cong \mathcal {O}$
. Hence,
$J_{\mathfrak {m}}$
is a principal ideal.
Corollary 7.3 If
$J_{\mathfrak {m}}$
is not principal, then
$\dim _{\mathbf {F}}H^1_f(\mathbf {Q}, \operatorname {\mathrm {ad}}{\overline {\rho }}_{\phi }(k-2))>1.$
If in addition
$\ell \nmid B_{k-2}$
then
$\dim _{\mathbf {F}}H^1_f(\mathbf {Q}, \operatorname {\mathrm {ad}}^0{\overline {\rho }}_{\phi }(k-2))>1.$
Proof The first inequality is just a restatement of one of the claims of Theorem 7.2. The second follows from the first one and Proposition 6.18.
Proposition 7.4 For each
$g\in \mathcal {N,}$
write
$m_{g}$
for the largest positive integer m such that
$g\equiv E_{2,1}^{\phi }$
mod
$\lambda ^{m}$
. If

then
$J_{\mathfrak {m}}$
is not principal.
Proof Set
$A = \prod _{g\in \mathcal {N}}A_{g}$
, where
$A_{g}=\mathcal {O}$
for all
$g\in \mathcal {N}$
. Let
$\phi _{g} : A \to A_{g}$
be the canonical projection. Since, by Lemma 7.1,
$\mathbf {T}$
is a full rank
$\mathcal {O}$
-submodule of
$\prod _{g \in \mathcal {N}'} \mathcal {O,}$
we conclude that the local complete
$\mathcal {O}$
-subalgebra
$\mathbf {T}_{\mathfrak {m}} \subset A$
is of full rank as an
$\mathcal {O}$
-submodule and
$J_{\mathfrak {m}} \subset \mathbf {T}_{\mathfrak {m}}$
is an ideal of finite index. Set
$T_{g} = \phi _{g}(\mathbf {T}_{\mathfrak {m}})=A_{g}=\mathcal {O}$
and
$J_{g} = \phi _{g}(J_{\mathfrak {m}})=\lambda ^{m_{g}}\mathcal {O}$
. Hence, we are in the setup of Section 2 of [Reference Berger, Klosin and Kramer11]. Assume
$J_{\mathfrak {m}}$
is principal. Then, Proposition 2.3 in [Reference Berger, Klosin and Kramer11] gives us that

Note that one has

This equality, together with (7.4), contradicts the inequality (7.3).
Corollary 7.5 Let
$m_{g}$
be defined as in Proposition 7.4. If
$ \sum _{g\in \mathcal {N}}m_{g}>1$
then
$J_{\mathfrak {m}}$
is not principal and
$\dim _{\mathbf {F}}H^1_f(\mathbf {Q}, \operatorname {\mathrm {ad}}{\overline {\rho }}_{\phi }(k-2))>1.$
If in addition
$\ell \nmid B_{k-2}$
then
$\dim _{\mathbf {F}}H^1_f(\mathbf {Q}, \operatorname {\mathrm {ad}}^0{\overline {\rho }}_{\phi }(k-2))>1.$
Proof Note that from the proof of Theorem 7.2, we get that
$\mathbf {T}_{\mathfrak {m}}/J_{\mathfrak {m}}=\mathbf {F}$
, even without assuming
$\dim _{\mathbf {F}}H^1_f(\mathbf {Q}, \operatorname {\mathrm {ad}}{\overline {\rho }}_{\phi }(k-2))\leq 1$
. Assume that
$J_{\mathfrak {m}}$
is principal. Then, from (7.4) and (7.5), we conclude that
$\sum _{g\in \mathcal {N}}m_{g}=1$
, which contradicts our assumption. Hence,
$J_{\mathfrak {m}}$
is not principal. The Selmer group inequalities now follow from Corollary 7.3.
Remark 7.6 Corollary 7.3 directly ties the cyclicity of the non-critical Selmer group
$H^1_f(\mathbf {Q}, \operatorname {\mathrm {ad}}{\overline {\rho }}_{\phi }(k-2))$
with the principality of the Eisenstein ideal
$J_{\mathfrak {m}}$
. We note that Assumption 6.2(ii) implies the equality
$\mathbf {T}_{\mathfrak {m}}/J_{\mathfrak {m}}=\mathbf {F}$
. Contrary to what one might think, the existence of several forms
$g\equiv E_{2,1}^{\phi }$
mod
$\lambda $
does not preclude this equality. For example, if there are exactly two linearly independent eigenforms
$g_1, g_2 \in \mathcal {N}$
with
$m_{g_1}=m_{g_2}=1$
such that
$g_1 \not \equiv g_2\ \mod {\lambda ^2}$
then
$\mathbf {T}_{\mathfrak {m}} \cong \mathcal {O} \times _{\mathbf {F}} \mathcal {O}=\{(a,b) \in \mathcal {O} \times \mathcal {O} \mid a \equiv b\ \mod {\lambda }\}$
and in this case,
$J_{\mathfrak {m}}$
is the maximal ideal, i.e.,
$\mathbf {T}_{\mathfrak {m}}/J_{\mathfrak {m}}=\mathbf {F}$
, so Corollary 7.5 applies and
$\dim _{\mathbf {F}}H^1_f(\mathbf {Q}, \operatorname {\mathrm {ad}}{\overline {\rho }}_{\phi }(k-2))>1$
.
Acknowledgements
The authors would like to thank Jeremy Booher and Neil Dummigan for helpful discussions.