
Non-abelian amplification and bilinear forms with Kloosterman sums

Alexandru Pascadi

Mathematisches Institut, Endenicher Allee 60, 53115 Bonn, Germany

alexpascadi@gmail.com

Abstract.

We introduce a new method to bound bilinear (Type II) sums of Kloosterman sums with composite moduli $c$ , using Fourier analysis on $\mathrm{SL}_{2}(\mathbb{Z}/c\mathbb{Z})$ and an amplification argument with non-abelian characters. For sums of length $\sqrt{c}$ , our method produces a non-trivial bound for all moduli except near-primes, saving $c^{-1/12}$ for products of two primes of the same size. Combining this with previous results for prime moduli, we achieve savings beyond the Pólya–Vinogradov range for all moduli. We give applications to moments of twisted cuspidal $L$ -functions, and to large sieve inequalities for exceptional cusp forms with composite levels.

1. Introduction

1.1. Brief background

There is by now a fairly comprehensive history of bounds for bilinear forms with Kloosterman sums and their applications [3, 1, 2, 14, 24, 25, 11, 28, 32, 40, 39, 22, 46, 44]. In the simplest form, the objects of interest are the sums

\sum_{m\leq M}\sum_{n\leq N}\alpha_{m}\beta_{n}S(m,n;c),\qquad\text{where}\qquad S(m,n;c):=\sum_{x\in(\mathbb{Z}/c\mathbb{Z})^{\times}}e\left(\frac{mx+n\overline{x}}{c}\right),

(1.1)

for positive integers $c,M,N$ with $M,N\leq c$ and complex sequences $(\alpha_{m})$ , $(\beta_{n})$ ; here $e(t):=\exp(2\pi it)$ and $x\overline{x}\equiv 1\ (\textnormal{mod }c)$ . In this work, we are mainly concerned with the ‘Type II’ setting where $(\alpha_{m})$ and $(\beta_{n})$ are arbitrary sequences, and we search for an upper bound in terms of their $\ell^{2}$ norms $\|\alpha\|:=(\sum_{m}|\alpha_{m}|^{2})^{1/2}$ , $\|\beta\|:=(\sum_{n}|\beta_{n}|^{2})^{1/2}$ . This is equivalent to bounding the operator norm, or the largest singular value, of the $M\times N$ matrix $(S(m,n;c))_{m\leq M,n\leq N}$ .
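For small moduli, the sums in (1.1) can be evaluated directly from the definition. The following Python sketch is an illustration only (the function name `kloosterman` is a hypothetical helper, not part of the paper); it computes $S(m,n;c)$ by brute force over the units modulo $c$.

```python
import cmath
from math import gcd


def kloosterman(m: int, n: int, c: int) -> complex:
    """Brute-force S(m, n; c) = sum over units x mod c of e((m*x + n*xbar)/c).

    Intended for small c >= 2 only; the cost is linear in c.
    """
    total = 0j
    for x in range(1, c):
        if gcd(x, c) == 1:
            xbar = pow(x, -1, c)  # inverse of x modulo c (Python 3.8+)
            phase = (m * x + n * xbar) % c
            total += cmath.exp(2j * cmath.pi * phase / c)
    return total
```

Up to floating-point error, the output is real (pair $x$ with $-x$) and symmetric in $m,n$ (substitute $x\mapsto\overline{x}$); moreover $S(0,0;c)=\phi(c)$, and $S(1,0;p)=-1$ for a prime $p$.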

For the Type II sums, it is in general necessary to incorporate a coprimality constraint $(m,n,c)=1$ . In practice, since $S(gm,gn;gc)=\tfrac{\phi(gc)}{\phi(c)}S(m,n;c)$ , one can separately consider each value of $(m,n,c)$ ; therefore, in most bounds discussed henceforth, one can replace the restriction $(m,n,c)=1$ with the assumption that $(\alpha_{m})$ and $(\beta_{n})$ are $1$ -bounded (and the norms $\|\alpha\|$ , $\|\beta\|$ with $\sqrt{M}$ , $\sqrt{N}$ ).
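The scaling identity $S(gm,gn;gc)=\tfrac{\phi(gc)}{\phi(c)}S(m,n;c)$ holds because the phase depends only on $x$ modulo $c$, and each unit modulo $c$ has $\phi(gc)/\phi(c)$ lifts to a unit modulo $gc$. A minimal numerical check, with hypothetical helper names and small parameters, could look as follows.

```python
import cmath
from math import gcd


def phi(c: int) -> int:
    """Euler's totient function, by direct counting."""
    return sum(1 for x in range(1, c + 1) if gcd(x, c) == 1)


def kloosterman(m: int, n: int, c: int) -> complex:
    """Brute-force Kloosterman sum S(m, n; c) for small c >= 2."""
    return sum(
        cmath.exp(2j * cmath.pi * ((m * x + n * pow(x, -1, c)) % c) / c)
        for x in range(1, c)
        if gcd(x, c) == 1
    )


def check_scaling(g: int, m: int, n: int, c: int, tol: float = 1e-8) -> bool:
    """Test S(gm, gn; gc) == phi(gc)/phi(c) * S(m, n; c) numerically."""
    lhs = kloosterman(g * m, g * n, g * c)
    rhs = phi(g * c) / phi(c) * kloosterman(m, n, c)
    return abs(lhs - rhs) < tol
```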

There are two main ‘trivial’ bounds to beat, packaged together into the following inequality with a slightly more general setup. For any (integer) intervals $\mathcal{I},\mathcal{J}\subset\mathbb{Z}$ with $|\mathcal{I}|=M\leq c$ , $|\mathcal{J}|=N\leq c$ , any complex sequences $(\alpha_{m})_{m\in\mathcal{I}}$ , $(\beta_{n})_{n\in\mathcal{J}}$ , and any $a\in(\mathbb{Z}/c\mathbb{Z})^{\times}$ , one has

\mathop{\sum\sum}_{\begin{subarray}{c}m\in\mathcal{I},n\in\mathcal{J}\\ (m,n,c)=1\end{subarray}}\alpha_{m}\beta_{n}S(am,n;c)\ll\|\alpha\|\|\beta\|c^{o(1)}\min\left(c,\sqrt{MNc}\right).

(1.2)

The term $c$ comes from Fourier analysis (a.k.a. the Pólya–Vinogradov method; see also [14, Theorem 1.17] for more standard versions of the Pólya–Vinogradov bound) and is sharp when $M=N=c$ , while the term $\sqrt{MNc}$ comes from the pointwise Weil bound $S(am,n;c)\ll c^{o(1)}\sqrt{(m,n,c)c}$ , and performs better when $MN<c$ . The best one could hope for is the perfect-orthogonality bound $\|\alpha\|\|\beta\|c^{o(1)}\sqrt{(M+N)c}$ , but making any improvement over (1.2), particularly in the range $MN\approx c$ where the two trivial bounds match, is notoriously difficult. We note that while many applications [24, 25] require an improvement of the pointwise Weil bound when $MN\ll c$ (i.e., beyond the Pólya–Vinogradov range), some applications require an improvement of the Fourier-theoretic bound for larger values of $M,N\leq c^{1-o(1)}$ ; this is the case in our Section 9.
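For a prime modulus $p$ and $p\nmid mn$, the pointwise Weil bound takes the explicit form $|S(m,n;p)|\leq 2\sqrt{p}$. The sketch below (helper names are hypothetical, and the check is only feasible for small primes) verifies this explicit form exhaustively; it does not test the bilinear bound (1.2) itself.

```python
import cmath
from math import sqrt


def kloosterman_prime(m: int, n: int, p: int) -> complex:
    """Brute-force S(m, n; p) for a prime p; every x in 1..p-1 is a unit."""
    return sum(
        cmath.exp(2j * cmath.pi * ((m * x + n * pow(x, -1, p)) % p) / p)
        for x in range(1, p)
    )


def weil_bound_holds(p: int) -> bool:
    """Check |S(m, n; p)| <= 2*sqrt(p) for all m, n not divisible by p."""
    bound = 2 * sqrt(p) + 1e-9  # small slack for floating-point error
    return all(
        abs(kloosterman_prime(m, n, p)) <= bound
        for m in range(1, p)
        for n in range(1, p)
    )
```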

The first improvement of (1.2) when $MN\approx c$ was the celebrated breakthrough of Kowalski–Michel–Sawin [24], which requires a prime modulus $c=p$ , and which saves a factor of $p^{-1/64}$ when $M,N\asymp\sqrt{p}$ ; see also [25] for their follow-up work, which outperforms the pointwise Weil bound for $MN$ as small as $p^{3/4+o(1)}$ . These results rely on a shift-by- $ab$ trick of Vinogradov and Karatsuba, a Hölder step, and deep inputs of $\ell$ -adic cohomology; notably, the same bounds hold for more general algebraic trace functions, including hyper-Kloosterman sums.

Closer to our methods is an approach of Shkredov [38] for prime moduli $p$ , which relies (in the Type II setting) on non-abelian Fourier analysis [37, Lemma 22], expansion in $\textnormal{SL}_{2}(\mathbb{Z}/p\mathbb{Z})$ [37, Theorem 50], and combinatorics; this beats (1.2) in the full range $M,N\in(p^{1/2-\delta},p^{1-o(1)})$ with a small but effective power saving, and for sequences $(\alpha_{m})$ , $(\beta_{n})$ with more general additively-structured supports. We also mention some related additive-combinatorial approaches of Shparlinski–Zhang [40] for smooth sequences, of the author [32, §4] for additively-structured sequences, and of Kerr–Shparlinski–Wu–Xi [22] for Type I bilinear forms (where only the sequence $(\alpha_{m})$ is smooth).

For moduli with a suitable factorization, the best Type II bounds so far have come from the $q$ -van der Corput method [15], which relies on the twisted multiplicativity of Kloosterman sums, Cauchy–Schwarz, and a shifting trick; it was first applied in this setting by Blomer–Milićević [3]. The $q$ -van der Corput method can also be iterated, leading to strong results for smooth square-free moduli [46, 44]. Unfortunately, these arguments fail to handle certain types of composite moduli when $MN\approx c$ , including squares of primes and products of two distinct primes of the same size.

1.2. Main results

In this work, we develop a new method to bound bilinear forms with Kloosterman sums for essentially all composite moduli. Like Shkredov [38, 37], we rely on non-abelian Fourier analysis; unlike Shkredov, we use the normal subgroups of $\textnormal{SL}_{2}(\mathbb{Z}/c\mathbb{Z})$ to our advantage, and we avoid relying on $L^{2}$ -flattening results, to arrive at quantitatively-good power savings over (1.2) (up to $c^{-1/12}$ ; see Example 1.3). Our key innovation is a new type of amplification argument with non-abelian characters, detailed in Section 2.3, which may be of independent interest.

Combining our bounds with those of Kowalski–Michel–Sawin [24] (as well as Blomer–Milićević [3] for an optimization), we obtain a non-trivial result for general moduli beyond the Pólya–Vinogradov range, given in Theorem 7.4. (One could also combine our bounds with the results of Shkredov [38, Theorem 4] for prime moduli, to obtain a result like Theorem 1.1 which does not rely on algebraic geometry.) We state a particular case of this result below, when $M,N\ll c^{1/2+o(1)}$ .

Theorem 1.1.

Let $c,M,N\in\mathbb{Z}_{+}$ with $M,N\ll c^{1/2+o(1)}$ . Then for any complex sequences $(\alpha_{m})_{m\leq M}$ , $(\beta_{n})_{n\leq N}$ and $a\in(\mathbb{Z}/c\mathbb{Z})^{\times}$ , one has

\mathop{\sum_{m=1}^{M}\sum_{n=1}^{N}}_{(m,n,c)=1}\alpha_{m}\beta_{n}S(am,n;c)\ll\|\alpha\|\|\beta\|c^{1-\frac{1}{700}+o(1)}.

Moreover, if $|\alpha_{m}|\leq 1$ for all $m$ (so $\|\alpha\|\leq\sqrt{M}$ ), then

\mathop{\sum_{m=1}^{M}\sum_{n=1}^{N}}_{(n,c)=1}\alpha_{m}\beta_{n}S(am,n;c)\ll\sqrt{M}\|\beta\|c^{1-\frac{1}{276}+o(1)}.

Our main technical result leading to Theorem 1.1 is Theorem 7.1, which considers a factorization of the modulus into three parts, $c=dd^{\prime}e$ (but one can usually take $d^{\prime}=1$ or $e=1$ ). Below we state a particular case of Theorem 7.1, focusing on the same range $M,N\ll c^{1/2+o(1)}$ as in Theorem 1.1.

Theorem 1.2.

Let $c=dd^{\prime}e$ for some $d,d^{\prime},e\in\mathbb{Z}_{+}$ with $d^{\prime}\mid d$ and $(d,e)=1$ , and let $f$ be the largest integer such that $f^{2}\mid cd$ . Let $\mathcal{I},\mathcal{J}\subset\mathbb{Z}$ be intervals with $|\mathcal{I}|,|\mathcal{J}|\ll c^{1/2+o(1)}$ . Then for any complex sequences $(\alpha_{m})_{m\in\mathcal{I}}$ , $(\beta_{n})_{n\in\mathcal{J}}$ and $a\in(\mathbb{Z}/c\mathbb{Z})^{\times}$ , one has

\mathop{\sum\sum}_{\begin{subarray}{c}m\in\mathcal{I},n\in\mathcal{J}\\ (m,n,c)=1\end{subarray}}\alpha_{m}\beta_{n}S(am,n;c)\ll\|\alpha\|\|\beta\|c^{1+o(1)}\left(\frac{f}{\min(c,d^{2})}\right)^{\frac{1}{6}}.

Since $f\leq\sqrt{cd}$ , Theorem 1.2 automatically gives a non-trivial result when $d\in(c^{1/3+o(1)},c^{1-o(1)})$ . Unless $c$ has a prime factor larger than $c^{1-o(1)}$ , one can always find a factorization $c=dd^{\prime}e$ with $d$ in this range, $(d,e)=1$ , and $d^{\prime}\mid d$ , which makes the general result in Theorem 1.1 possible.
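To experiment with the factor $(f/\min(c,d^{2}))^{1/6}$ from Theorem 1.2, one can enumerate all admissible factorizations $c=dd^{\prime}e$ of a small modulus and keep the one giving the smallest factor. The Python sketch below does this by brute force; all function names are hypothetical, and the search ignores the $c^{o(1)}$ losses, so it only indicates which factorization the theorem favours.

```python
from math import gcd


def largest_f(n: int) -> int:
    """Largest integer f with f^2 dividing n (via trial division)."""
    f, p = 1, 2
    while p * p <= n:
        exponent = 0
        while n % p == 0:
            n //= p
            exponent += 1
        f *= p ** (exponent // 2)
        p += 1
    return f  # any leftover prime factor has exponent 1 and contributes nothing


def saving_factor(c: int, d: int, d_prime: int, e: int) -> float:
    """(f / min(c, d^2))^(1/6) with f^2 | c*d maximal; a value < 1 is a saving."""
    assert c == d * d_prime * e and d % d_prime == 0 and gcd(d, e) == 1
    return (largest_f(c * d) / min(c, d * d)) ** (1 / 6)


def best_factorization(c: int):
    """Return (factor, d, d', e) minimising the factor over admissible triples."""
    best = None
    for d in (x for x in range(1, c + 1) if c % x == 0):
        rest = c // d
        common = gcd(d, rest)  # d' must divide both d and c/d
        for d_prime in (x for x in range(1, common + 1) if common % x == 0):
            e = rest // d_prime
            if gcd(d, e) != 1:
                continue
            factor = saving_factor(c, d, d_prime, e)
            if best is None or factor < best[0]:
                best = (factor, d, d_prime, e)
    return best
```

For $c=p^{2}$ the search returns $d=d^{\prime}=p$, $e=1$ with factor $p^{-1/6}=c^{-1/12}$, while for a prime modulus no admissible factorization gives a factor below $1$.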

Example 1.3.

Let $d\mid c$ be such that $\tfrac{c}{d}$ is square-free. Then one can take $d^{\prime}:=(\tfrac{c}{d},d)$ , $e:=\tfrac{c}{dd^{\prime}}$ , $f=d$ in Theorem 1.2, so the saving over the trivial bound becomes $(d/\min(c,d^{2}))^{1/6}\asymp(\tfrac{1}{d}+\tfrac{d}{c})^{1/6}$ . If $d\asymp\sqrt{c}$ , this is roughly

c^{-\frac{1}{12}}.

In particular, this saving is achieved if $c\in\{p^{2},pq\}$ , where $p$ and $q$ are distinct primes with $p\asymp q$ . For the same values of $c$ , the more general Theorem 7.1 beats (1.2) in the range

M\asymp N\in[c^{\frac{5}{12}+o(1)},c^{\frac{5}{8}-o(1)}].

Notably, while the values $c\in\{p^{2},pq\}$ , $M,N\asymp\sqrt{c}$ give blind spots of the $q$ -van der Corput method, they happen to give one of the best cases for our methods; this case has until now constituted the remaining barrier towards the application in Theorem 1.5.

As a quick corollary of Theorem 1.2, we prove a trilinear-sum bound which includes a short averaging over $c$ with a given large divisor $q$ ; the point is that only a factorization of $q$ (rather than $c$ ) is assumed. Such sums arise in the spectral theory of automorphic forms, in particular in Section 9. Below is such a trilinear-sum bound, which is a particular case of Corollary 7.5.

Corollary 1.4.

Let $C\geq\tfrac{1}{2}$ , $q=dd^{\prime}e$ for some $d,d^{\prime},e\in\mathbb{Z}_{+}$ with $d^{\prime}\mid d$ and $(d,e)=1$ , and let $f$ be the largest integer such that $f^{2}\mid qd$ . Let $\mathcal{I},\mathcal{J}\subset\mathbb{Z}$ be intervals of lengths $|\mathcal{I}|,|\mathcal{J}|\ll C^{1/2+o(1)}$ . Then for any complex sequences $(\alpha_{m})_{m\in\mathcal{I}}$ , $(\beta_{n})_{n\in\mathcal{J}}$ , one has

\sum_{\begin{subarray}{c}C<c\leq 2C\\ q\mid c\end{subarray}}\left|\mathop{\sum\sum}_{\begin{subarray}{c}m\in\mathcal{I},n\in\mathcal{J}\\ (m,n,q)=1\end{subarray}}\alpha_{m}\beta_{n}S(m,n;c)\right|\ll\|\alpha\|\|\beta\|\frac{C^{2+o(1)}}{q}\left(\frac{f}{\min(C,d^{2})+\min(q,d^{2}\frac{C}{q})}\right)^{\frac{1}{6}}.

Remark.

Milićević, Qin and Wu [29] have simultaneously and independently obtained results similar to our Theorems 1.1 and 1.5 using substantially different methods. The two papers are complementary, each performing slightly better in different ranges and for different types of moduli, and both achieving power savings for bilinear sums of square-root length and general moduli. The methods in [29] (which use algebraic geometry and build on Kowalski–Michel–Sawin [24] and Blomer–Milićević [3]) obtain better savings for general moduli and remove the dependency on the Ramanujan–Petersson conjecture in the application to moments of twisted cuspidal $L$ -functions. Our methods (which use non-abelian Fourier analysis and are closer to the work of Shkredov [38, 37]) perform better and in longer ranges for specific classes of moduli $c$ (see Examples 1.3 and 7.2), can handle more general supports of the sequences $(\alpha_{m}),(\beta_{n})$ (intervals or other additively-structured subsets of $\mathbb{Z}/c\mathbb{Z}$ ), and find an application to exceptional-spectrum large sieve inequalities (Corollary 1.6).

Finally, we note that our methods might also lead to bounds for other exponential sums with $\textnormal{SL}_{2}(\mathbb{Z}/c\mathbb{Z})$ or $\textnormal{GL}_{2}(\mathbb{Z}/c\mathbb{Z})$ structure, such as

\ \sideset{}{{}^{*}}{\sum}_{x\in\mathbb{Z}/c\mathbb{Z}}e\left(\frac{mx+nF(x)}{c}\right),

where $F(x)$ is a suitable Möbius transformation (and the sum is restricted to those $x$ such that $F(x)$ is well-defined). Related methods for $\textnormal{SL}_{3}(\mathbb{Z}/c\mathbb{Z})$ could be worth investigating as well.

1.3. Applications

As our first application, we prove an asymptotic for the averaged second moment of modular $L$ -functions twisted by primitive Dirichlet characters modulo $q$ , where the modulus $q$ is arbitrary. Blomer–Milićević [1] established such an asymptotic for most moduli, more specifically whenever $q$ is not close to a prime or to a product of two primes of the same size. The missing ingredient in these cases has been precisely a power-saving bound for bilinear forms with Kloosterman sums modulo $q$ , where both sums have length $\approx\sqrt{q}$ . The case of prime moduli $q$ was established by Kowalski–Michel–Sawin [24, Theorem 1.5], and the remaining case can now be handled using our Theorem 1.1 (essentially in the setting from Example 1.3).

To state this application, we introduce some quick notation as in [3]. Given $q\in\mathbb{Z}_{+}$ , we write

\phi^{*}(q):=\sum_{d\mid q}\phi(d)\,\mu\left(\frac{q}{d}\right)

for the number of primitive characters modulo $q$ ; this vanishes if and only if $q\equiv 2\ (\textnormal{mod }4)$ , and is otherwise of size $q^{1-o(1)}$ . We write $\sum^{*}_{\chi\ \textnormal{mod }q}$ for a sum over all primitive characters modulo $q$ . Also following [3], given an $L$ -function $L(s)$ , we write $L_{q}(s)$ for the product $\prod_{p\mid q}L_{p}(s)$ over all local factors at primes dividing $q$ ; thus for example $\zeta_{q}(s)=\prod_{p\mid q}(1-p^{-s})^{-1}$ .
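The formula for $\phi^{*}(q)$ is easy to evaluate for small $q$. The following Python sketch (with hypothetical helper names) computes it directly from the divisor sum above and can be used to confirm that it vanishes exactly when $q\equiv 2\ (\textnormal{mod }4)$.

```python
from math import gcd


def phi(n: int) -> int:
    """Euler's totient function, by direct counting."""
    return sum(1 for x in range(1, n + 1) if gcd(x, n) == 1)


def mobius(n: int) -> int:
    """Möbius function via trial division."""
    result, p = 1, 2
    while p * p <= n:
        if n % p == 0:
            n //= p
            if n % p == 0:
                return 0  # n is divisible by p^2
            result = -result
        p += 1
    if n > 1:
        result = -result  # one remaining prime factor
    return result


def phi_star(q: int) -> int:
    """Number of primitive characters mod q: sum over d | q of phi(d) * mu(q/d)."""
    return sum(phi(d) * mobius(q // d) for d in range(1, q + 1) if q % d == 0)
```

For example, `phi_star(7)` equals $7-2=5$, `phi_star(9)` equals $\phi(9)-\phi(3)=4$, and `phi_star(6)` equals $0$.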

Theorem 1.5.

Let $f_{1},f_{2}$ be fixed holomorphic cuspidal newforms for $\textnormal{SL}_{2}(\mathbb{Z})$ with even weights $\kappa_{1},\kappa_{2}$ and Hecke eigenvalues $\lambda_{1}(n),\lambda_{2}(n)$ normalized as in (8.1). Provided that $\kappa_{1}\equiv\kappa_{2}\ (\textnormal{mod }4)$ , one has the asymptotic

\ \sideset{}{{}^{*}}{\sum}_{\chi\ \textnormal{mod }q}L(\tfrac{1}{2},f_{1}\otimes\chi)\overline{L(\tfrac{1}{2},f_{2}\otimes\chi)}=\frac{2\phi^{*}(q)}{\zeta(2)}M(f_{1},f_{2};q)+O_{f_{1},f_{2}}\left(q^{1-\frac{1}{674}+o(1)}\right),

with a main term of

M(f_{1},f_{2};q):=\begin{cases}P(1)\,L(1,\textnormal{sym}^{2}f_{1})\left(\log q+c(f_{1})+\frac{P^{\prime}(1)}{P(1)}\right),&f_{1}=f_{2},\\ Q(1)\,L(1,f_{1}\times f_{2}),&f_{1}\neq f_{2},\end{cases}

where $c(f_{1})$ is a constant depending only on $f_{1}$ and

P(s):=\frac{\zeta_{q}(2s)}{L_{q}(s,\textnormal{sym}^{2}f_{1})},\qquad\qquad Q(s):=\frac{\zeta_{q}(2s)}{L_{q}(s,f_{1}\times f_{2})}.

Remark.

Similar results can be obtained for Maass cusp forms, with some care in removing the dependency on the Ramanujan conjecture; see also [29]. The case of (non-cuspidal) Eisenstein series reduces to the result of Young [47] on fourth moments of Dirichlet $L$ -functions for prime moduli, extended to all moduli by Wu [45].

Our second application concerns the exceptional spectrum of the hyperbolic Laplacian on $\Gamma_{0}(q)\backslash\mathbb{H}$ , consisting of Maass cusp forms of level $q\in\mathbb{Z}_{+}$ with eigenvalues $\lambda<\tfrac{1}{4}$ . Selberg’s eigenvalue conjecture [34], one of the central open problems in the theory of $\textnormal{GL}_{2}$ automorphic forms, states that this exceptional spectrum is empty. However, unconditionally, exceptional forms often produce the worst contribution in applications of the Kuznetsov trace formula [11, 27] to analytic number theory problems [12, 43, 7, 32], losing exponential factors in the parameter $\theta:=(\tfrac{1}{4}-\lambda)^{1/2}$ . The best known pointwise bound is Kim–Sarnak $\theta\leq\tfrac{7}{64}$ [23, Appendix 2], but on-average results can also lead to savings in the $\theta$ -aspect, sometimes enough to match the conditional results [33, 17]. Following Deshouillers–Iwaniec [11], these on-average results often take the shape of large sieve inequalities for the Fourier coefficients of exceptional Maass forms, incorporating factors of $X^{2\theta}$ . While improvements are now possible [32] for exceptional-spectrum large sieve inequalities with special sequences $(\alpha_{n})_{n\leq N}$ , the savings in the $\theta$ -aspect for arbitrary sequences have been limited to $(\tfrac{q}{N})^{2\theta}$ , due to Deshouillers–Iwaniec [11, Theorem 5]. In fact, obtaining any power saving for arbitrary sequences when $N\asymp q$ is as hard as proving Selberg’s eigenvalue conjecture [32, §2].

In Theorem 9.4, we overcome this barrier at $(\tfrac{q}{N})^{2\theta}$ if $q$ is suitably-composite and $N$ is not too large, using Theorem 7.1. We note that in many applications [4, 28, 12, 9, 10], the level $q$ is a product of two factors of similar sizes, and $N\in(\sqrt{q},q)$ . We state below a particular case of Theorem 9.4, for $N\approx\sqrt{q}$ . We point the reader to Section 9 for more background and notation.

Corollary 1.6.

Let $q\in\mathbb{Z}_{+}$ have a divisor $d\asymp\sqrt{q}$ such that $\tfrac{q}{d}$ is square-free. Consider an orthonormal basis of Maass cusp forms for $\Gamma_{0}(q)$ , with Laplacian eigenvalues $\lambda_{j}$ and Fourier coefficients $(\rho_{j}(n))_{n\in\mathbb{Z}}$ around $\infty$ (normalized as in [11, 32]). Let $N\ll q^{\frac{1}{2}+o(1)}$ , and $(\alpha_{n})_{N<n\leq 2N}$ be a complex sequence supported on $(n,q)=1$ . Then with $\theta_{j}:=(\tfrac{1}{4}-\lambda_{j})^{1/2}\leq\tfrac{7}{64}$ , one has

\sum_{\lambda_{j}<\frac{1}{4}}q^{\frac{6}{5}\theta_{j}}\left|\sum_{N<n\leq 2N}\alpha_{n}\,\rho_{j}(n)\right|^{2}\ll(qN)^{o(1)}\|\alpha\|^{2}.

(1.3)

For reference, [11, Theorem 5] of Deshouillers–Iwaniec would include a factor of $(\tfrac{q}{N})^{2\theta_{j}}$ (which is $q^{\theta_{j}}$ when $N=\sqrt{q}$ ) in the left-hand side, so Corollary 1.6 wins a factor of $q^{\theta_{j}/5}$ in this case.

Remark.

It follows from the more general Theorem 9.4 that one can relax the condition that $\tfrac{q}{d}$ is square-free when some averaging over levels $q\leq Q$ with $d\mid q$ (and $d\asymp\sqrt{Q}$ ) is available. The sequence $(\alpha_{n})$ inside the large sieve may depend on $q$ in this case, unlike in [11, Theorem 6].

1.4. Acknowledgements

The author is deeply grateful to Valentin Blomer, James Maynard, Sary Drappeau, Philippe Michel, Emmanuel Kowalski, and Ilya Shkredov for helpful comments and discussions. This work was supported by the ERC Advanced Grant 101054336 and Germany’s Excellence Strategy grant EXC-2047/1-390685813. For a part of the duration of this project, the author was also supported by an EPSRC Scholarship, as well as a Campus France Scholarship.

2. Outline

2.1. Structure of the paper

Our proof of Theorem 1.2 has three main steps:

I. (Fourier analysis). In Section 4 (particularly, Proposition 4.10), we relate matrices of Kloosterman sums to the Fourier transform of certain functions at a special representation $\rho_{c}^{\circ}$ of $\textnormal{SL}_{2}(\mathbb{Z}/c\mathbb{Z})$ . This involves Fourier analysis on both abelian ( $\mathbb{Z}/c\mathbb{Z}$ , $\mathbb{R}$ ) and non-abelian ( $\textnormal{SL}_{2}(\mathbb{Z}/c\mathbb{Z})$ ) groups, and a Möbius inversion process for representations of $\textnormal{SL}_{2}(\mathbb{Z}/c\mathbb{Z})$ .

II. (Amplification). In Section 5 (particularly, Proposition 5.5), we upper bound the spectral norm of the above Fourier coefficients by a weighted count of solutions to an equation in $\textnormal{PSL}_{2}(\mathbb{Z}/d\mathbb{Z})$ , with $d\mid c$ . This is where our non-abelian amplification argument comes in.

III. (Combinatorics). In Section 6 (particularly, Proposition 6.4), we analyze this counting problem using elementary arguments. In particular, we do not rely on expansion techniques.

In Section 7, we combine these ingredients to deduce Theorem 1.2 and its variations. The applications to moments of twisted cuspidal $L$ -functions and large sieve inequalities for exceptional cusp forms are handled in Sections 8 and 9, respectively.

For the rest of this section, we give a brief informal overview of our argument, ignoring various technical details. We will use the symbols ‘ $\approx$ ’, ‘ $\lesssim$ ’ for identities and inequalities that are ‘morally’ true (and can be made rigorous with minor modifications, such as including $c^{o(1)}$ factors).

2.2. First steps: Fourier analysis

Let us focus on the balanced case $M=N$ . We begin by considering the $N\times N$ complex matrix

K:=\left(S(m,n;c)\right)_{m,n\leq N},

where $c,N\in\mathbb{Z}_{+}$ with $N\leq c$ . Our task is to bound the operator norm $\|K\|$ by less than $\min(c,N\sqrt{c})$ , to beat (1.2). We extend $K$ to a $c\times c$ matrix, and multiply it on both sides by the unitary matrix $(\tfrac{1}{\sqrt{c}}e(\tfrac{xy}{c}))_{x,y\in\mathbb{Z}/c\mathbb{Z}}$ , which preserves the operator norm and essentially amounts to taking a Fourier transform in the $m,n$ variables. Letting $H\approx\tfrac{c}{N}$ , a truncated version of Poisson summation yields

c\,U^{*}KU^{*}\approx\frac{1}{H^{2}}\left(\sum_{|h|\leq H}\mathcal{T}^{h}\right)\mathcal{S}\left(\sum_{|h|\leq H}\mathcal{T}^{h}\right),\qquad\text{where}\qquad\begin{cases}\mathcal{T}:=(\mathbbm{1}_{u=x+1})_{u,x\in\mathbb{Z}/c\mathbb{Z}},\\ \mathcal{S}:=(\mathbbm{1}_{xy=-1})_{x,y\in\mathbb{Z}/c\mathbb{Z}}.\end{cases}

By inserting a few more rows and columns, we can in fact work over the projective line $\mathbb{P}^{1}(\mathbb{Z}/c\mathbb{Z})$ rather than $\mathbb{Z}/c\mathbb{Z}$ . The matrices $\mathcal{T}$ and $\mathcal{S}$ then extend to $\rho_{c}(T)$ and $\rho_{c}(S)$ , where $T$ and $S$ are the usual generators of $\textnormal{SL}_{2}(\mathbb{Z}/c\mathbb{Z})$ (see (3.13)), and

\rho_{c}:\textnormal{SL}_{2}(\mathbb{Z}/c\mathbb{Z})\to\left\{\text{Unitary maps of }\mathbb{C}^{\mathbb{P}^{1}(\mathbb{Z}/c\mathbb{Z})}\right\}

is the $c^{1+o(1)}$ -dimensional permutation representation corresponding to the action of $\textnormal{SL}_{2}(\mathbb{Z}/c\mathbb{Z})$ on $\mathbb{P}^{1}(\mathbb{Z}/c\mathbb{Z})$ by Möbius transformations. It then remains to bound the spectral norm of the matrix

\frac{1}{H^{2}}\sum_{|h_{1}|,|h_{2}|\leq H}\rho_{c}(T^{h_{1}}ST^{h_{2}})

by less than $\min(1,N/\sqrt{c})$ . In this form, our task is actually impossible: the matrix above decomposes as a direct sum corresponding to the irreducible representations inside $\rho_{c}$ , one of which is the trivial representation—and this contributes exactly one singular value of size $1$ . Other small-dimensional subrepresentations of $\rho_{c}$ are also problematic for similar reasons.
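The permutation representation $\rho_{c}$ can be made concrete for small $c$. The Python sketch below enumerates $\mathbb{P}^{1}(\mathbb{Z}/c\mathbb{Z})$ as primitive pairs $(a,b)$ modulo scaling by units, and lets a $2\times 2$ matrix act on these classes. The helper names are hypothetical, and the choice $T=\left(\begin{smallmatrix}1&1\\0&1\end{smallmatrix}\right)$ , $S=\left(\begin{smallmatrix}0&-1\\1&0\end{smallmatrix}\right)$ is an assumption (the paper's convention is fixed in (3.13), which is not reproduced here); it is chosen so that $[x:1]\mapsto[x+1:1]$ and $[x:1]\mapsto[-1:x]$ match the matrices $\mathcal{T}$ and $\mathcal{S}$ above.

```python
from math import gcd

T = ((1, 1), (0, 1))   # assumed convention: x -> x + 1
S = ((0, -1), (1, 0))  # assumed convention: x -> -1/x


def canonical(a: int, b: int, c: int) -> tuple:
    """Smallest representative of the class of (a, b) under scaling by units mod c."""
    units = [u for u in range(1, c + 1) if gcd(u, c) == 1]
    return min(((u * a) % c, (u * b) % c) for u in units)


def projective_line(c: int) -> list:
    """Sorted list of canonical representatives of P^1(Z/cZ)."""
    return sorted({
        canonical(a, b, c)
        for a in range(c)
        for b in range(c)
        if gcd(gcd(a, b), c) == 1  # (a, b) is primitive modulo c
    })


def act(matrix: tuple, point: tuple, c: int) -> tuple:
    """Image of a point of P^1(Z/cZ) under a 2x2 integer matrix."""
    (p, q), (r, s) = matrix
    a, b = point
    return canonical(p * a + q * b, r * a + s * b, c)


def is_permutation(matrix: tuple, c: int) -> bool:
    """Check that the matrix permutes the points of P^1(Z/cZ)."""
    points = projective_line(c)
    return sorted(act(matrix, pt, c) for pt in points) == points
```

The number of points is $c\prod_{p\mid c}(1+\tfrac{1}{p})$ , consistent with the dimension $c^{1+o(1)}$ of $\rho_{c}$ ; for instance, $c=12$ gives $24$ points.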

This is where the coprimality constraint $(m,n,c)=1$ comes in. Incorporating this weight into the matrix $K$ and expanding it by Möbius inversion ultimately results in a ‘sifted’ representation,

K^{\circ}:=(S(m,n;c)\mathbbm{1}_{(m,n,c)=1})_{m\leq M,n\leq N}\qquad\rightsquigarrow\qquad\rho_{c}^{\circ}:\textnormal{SL}_{2}(\mathbb{Z}/c\mathbb{Z}),

where $\rho_{c}^{\circ}$ is essentially obtained by removing from $\rho_{c}$ the contribution of all subrepresentations isomorphic to

\textnormal{SL}_{2}(\mathbb{Z}/c\mathbb{Z})\xrightarrow{\text{Reduction mod }d}\textnormal{SL}_{2}(\mathbb{Z}/d\mathbb{Z})\xrightarrow{\rho_{d}}\left\{\text{Unitary maps of }\mathbb{C}^{\mathbb{P}^{1}(\mathbb{Z}/d\mathbb{Z})}\right\},

for $d\mid c$ . Although $\rho_{c}^{\circ}$ is not irreducible in general, it has the key property that all of its $c^{o(1)}$ irreducible subrepresentations are large, of dimension $c^{1-o(1)}$ (see Proposition 4.6).

2.3. The key step: Amplification

We are left to bound the spectral norm of the non-abelian Fourier coefficient

\widehat{F}(\rho)=\sum_{g\in\textnormal{SL}_{2}(\mathbb{Z}/c\mathbb{Z})}F(g)\rho(g),\qquad\quad F:=\frac{1}{H^{2}}\sum_{|h_{1}|,|h_{2}|\leq H}\mathbbm{1}_{T^{h_{1}}ST^{h_{2}}}:\textnormal{SL}_{2}(\mathbb{Z}/c\mathbb{Z})\to\mathbb{C},

where $\rho$ is any irreducible subrepresentation of $\rho_{c}^{\circ}$ . A natural approach is to use the trace method, i.e., to bound the top singular value $\|\widehat{F}(\rho_{c}^{\circ})\|$ by an even moment of all singular values, and then to expand the latter as a trace; this brings in the character $\chi:=\textnormal{Tr}\rho$ .

One can then attempt to use non-abelian Fourier analysis by summing over all irreducible characters $\chi^{\prime}$ of $\textnormal{SL}_{2}(\mathbb{Z}/c\mathbb{Z})$ . However, this sum must somehow amplify the contribution of $\chi^{\prime}=\chi$ compared to other irreducible characters of $\textnormal{SL}_{2}(\mathbb{Z}/c\mathbb{Z})$ , especially the small-dimensional ones—otherwise, our construction of $\rho_{c}^{\circ}$ by eliminating various subrepresentations from $\rho_{c}$ will have been useless.

If $\chi$ were an abelian character of $(\mathbb{Z}/c\mathbb{Z})^{\times}$ , i.e., a Dirichlet character, then following the ideas of Duke–Friedlander–Iwaniec [13], one could weight the sum by an amplifier of the shape

A(\chi^{\prime}):=\left|\sum_{\ell\in\mathcal{L}}\overline{\chi}^{\prime}(\ell)\chi(\ell)\right|^{2}=\sum_{\ell_{1},\ell_{2}\in\mathcal{L}}\overline{\chi}^{\prime}(\ell_{1}\ell_{2}^{-1})\chi(\ell_{1}\ell_{2}^{-1}),\qquad\quad\chi\in\widehat{(\mathbb{Z}/c\mathbb{Z})^{\times}},

where $\mathcal{L}$ is some set of positive integers (e.g., the primes in a dyadic interval). This $A(\chi^{\prime})$ has size $\approx|\mathcal{L}|^{2}$ when $\chi^{\prime}=\chi$ , and should typically obey square-root cancellation when $\chi^{\prime}\neq\chi$ .
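The behaviour of this abelian amplifier can be observed numerically. The sketch below is an illustration under simplifying assumptions: the modulus is an odd prime $p$ (so that characters can be built from a primitive root), $\mathcal{L}$ is a short list of primes not dividing $p$, and all function names are hypothetical. By orthogonality of characters, the sum of $A(\chi^{\prime})$ over all $\chi^{\prime}$ equals $(p-1)|\mathcal{L}|$ when the elements of $\mathcal{L}$ are distinct modulo $p$, so the average value is about $|\mathcal{L}|$ while the peak at $\chi^{\prime}=\chi$ is $|\mathcal{L}|^{2}$.

```python
import cmath


def primitive_root(p: int) -> int:
    """Smallest primitive root modulo an odd prime p (brute force)."""
    for g in range(2, p):
        if len({pow(g, k, p) for k in range(p - 1)}) == p - 1:
            return g
    raise ValueError("p must be an odd prime")


def dirichlet_characters(p: int) -> list:
    """All p - 1 Dirichlet characters mod p, each as a dict {unit: value}."""
    g = primitive_root(p)
    dlog = {pow(g, k, p): k for k in range(p - 1)}  # discrete logarithm table
    return [
        {x: cmath.exp(2j * cmath.pi * j * dlog[x] / (p - 1)) for x in range(1, p)}
        for j in range(p - 1)
    ]


def amplifier(chi_prime: dict, chi: dict, L: list, p: int) -> float:
    """A(chi') = |sum over l in L of conj(chi'(l)) * chi(l)|^2."""
    total = sum(chi_prime[l % p].conjugate() * chi[l % p] for l in L)
    return abs(total) ** 2
```

With `p = 101` and `L = [11, 13, 17, 19]`, the amplifier equals $16$ at $\chi^{\prime}=\chi$ and sums to $400$ over all $100$ characters.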

Inspired by this, we construct a general amplifier for irreducible representations of a finite non-abelian group $G$ —which is to the best of our knowledge the first instance of such a construction, and which might find applications to other problems. We set

A(\chi^{\prime}):=\left\|\sum_{\ell\in\mathcal{L}}\overline{\rho^{\prime}(\ell)}\otimes\rho(\ell)\right\|_{S^{2}}^{2}=\sum_{\ell_{1},\ell_{2}\in\mathcal{L}}\overline{\chi}^{\prime}(\ell_{1}\ell_{2}^{-1})\chi(\ell_{1}\ell_{2}^{-1}),\qquad\quad\rho^{\prime}\in\widehat{G},\ \chi^{\prime}:=\textnormal{Tr}\rho^{\prime},

where $\|\cdot\|_{S^{2}}$ denotes the Frobenius norm of a map (which is the $\ell^{2}$ norm of its singular values), and $\mathcal{L}$ is a well-chosen subset of $G$ . It is most convenient to pick $\mathcal{L}$ to be a normal subgroup of $G$ ; note that when $G=\textnormal{SL}_{2}(\mathbb{Z}/c\mathbb{Z})$ , this is only possible if $c$ is composite. We will in fact pick

\mathcal{L}=\Gamma_{c}(d):=\ker\left(\textnormal{SL}_{2}(\mathbb{Z}/c\mathbb{Z})\to\textnormal{SL}_{2}(\mathbb{Z}/d\mathbb{Z})\right),

for a suitable divisor $d$ of $c$ . The result of this amplification argument for the sixth moment of singular values is a bound of the shape (see Proposition 5.1)

\|\widehat{F}(\rho)\|^{6}\lesssim\frac{c^{3}H^{-6}}{\sum_{\ell\in\Gamma_{c}(d)}|\chi(\ell)|^{2}}\sum_{\begin{subarray}{c}|h_{1}|,\ldots,|h_{6}|\leq H\\ T^{h_{1}}S\cdots T^{h_{6}}S\in\Gamma_{c}(d)\end{subarray}}\chi(T^{h_{1}}S\cdots T^{h_{6}}S).

(2.1)

To go any further, we need to know the typical size of the character $\chi$ on $\Gamma_{c}(d)$ , based on the information that $\dim\chi\gg c^{1-o(1)}$ . This is a somewhat challenging computation involving Clifford theory, and depends on the factorizations of $c$ and $d$ ; see Lemmas 5.4 and 5.2.
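For very small moduli, the normal subgroup $\Gamma_{c}(d)$ can be listed explicitly. The following brute-force Python sketch (hypothetical helper names; the cost grows like $c^{4}$, so it is only meant for $c\leq 10$ or so) enumerates $\textnormal{SL}_{2}(\mathbb{Z}/c\mathbb{Z})$ and the kernel of reduction modulo $d$.

```python
from itertools import product


def sl2(c: int) -> list:
    """All matrices (a, b, c2, d2) over Z/cZ with determinant 1."""
    return [
        (a, b, c2, d2)
        for a, b, c2, d2 in product(range(c), repeat=4)
        if (a * d2 - b * c2) % c == 1 % c
    ]


def congruence_kernel(c: int, d: int) -> list:
    """Gamma_c(d): elements of SL_2(Z/cZ) congruent to the identity mod d."""
    assert c % d == 0, "d must divide c"
    identity = (1 % d, 0, 0, 1 % d)
    return [g for g in sl2(c) if tuple(x % d for x in g) == identity]
```

For example, $|\textnormal{SL}_{2}(\mathbb{Z}/9\mathbb{Z})|=648$ and $|\Gamma_{9}(3)|=27=p^{3}$ with $p=3$, whereas for a prime modulus the only choices are the trivial group $\Gamma_{p}(p)=\{I\}$ and the full group $\Gamma_{p}(1)$.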

Let us now focus on the case when $c=p^{2}$ is the square of a prime and $N\approx H\approx p$ ; we naturally pick $d=p$ . It turns out that $\chi$ typically has size $\approx p$ on $\Gamma_{p^{2}}(p)$ , and roughly $\approx p^{2}$ at $\pm I\in\textnormal{SL}_{2}(\mathbb{Z}/p^{2}\mathbb{Z})$ . To beat (1.2), it essentially remains to bound

\sum_{|h_{1}|,\ldots,|h_{6}|\leq p}\mathbbm{1}_{T^{h_{1}}S\cdots T^{h_{6}}S\equiv\pm I\ (\textnormal{mod }p^{2})}\stackrel{{\scriptstyle?}}{{<}}p^{4},\qquad\sum_{|h_{1}|,\ldots,|h_{6}|\leq p}\mathbbm{1}_{T^{h_{1}}S\cdots T^{h_{6}}S\equiv\pm I\ (\textnormal{mod }p)}\stackrel{{\scriptstyle?}}{{<}}p^{5}.

(2.2)

2.4. Final steps: Combinatorics

The estimates in (2.2) amount to counting the number of solutions to the system of congruences

\begin{cases}1-h_{2}h_{3}\equiv\mp(1-h_{5}h_{6})\\ h_{1}(1-h_{2}h_{3})+h_{3}\equiv\pm h_{5}\\ h_{4}(1-h_{2}h_{3})+h_{2}\equiv\pm h_{6}\end{cases}\ (\textnormal{mod }p^{2}\text{, respectively, }p),

with $|h_{1}|,\ldots,|h_{6}|\leq p$ . For the generic solutions, one can expect each congruence to cut down the total number of solutions $p^{6}$ by the size of the modulus, but one must also account for certain diagonal solutions where some $h_{i}=0$ . A careful but elementary analysis (which becomes more involved when the modulus $c$ is arbitrary) shows that these congruences have $\approx p^{2}$ solutions modulo $p^{2}$ and $\approx p^{3}$ solutions modulo $p$ ; see Proposition 6.4. Both of these counts are sharp, and save a factor of $p$ over the bounds required in (2.2). This saving is ultimately raised to the power $\tfrac{1}{6}$ in (2.1) (since we considered a sixth moment of singular values), and putting everything together yields

\left\|\left(S(m,n;p^{2})\right)_{m,n\leq p}\mathbbm{1}_{(m,n,p)=1}\right\|\lesssim p^{2-\frac{1}{6}},

as in Example 1.3. We note that for the other case $c=pq$ from Example 1.3, one can simplify the amplification argument by noting that all irreducible characters of $\textnormal{SL}_{2}(\mathbb{Z}/pq\mathbb{Z})$ are tensor products of irreducible characters of $\textnormal{SL}_{2}(\mathbb{Z}/p\mathbb{Z})$ and $\textnormal{SL}_{2}(\mathbb{Z}/q\mathbb{Z})$ , but the end result is the same.
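The counting problem in (2.2) can be explored numerically for tiny primes. The Python sketch below counts tuples $(h_{1},\ldots,h_{6})$ with $|h_{i}|\leq H$ and $T^{h_{1}}S\cdots T^{h_{6}}S\equiv\pm I$ modulo a given modulus, working with the matrices directly rather than with the congruence system. It assumes the convention $T^{h}S=\left(\begin{smallmatrix}h&-1\\1&0\end{smallmatrix}\right)$ (the paper's own convention is in (3.13), not reproduced here), all names are hypothetical, and the meet-in-the-middle shortcut is our own device for speed rather than the argument of Proposition 6.4. Small values of $p$ should not be expected to reflect the asymptotic sizes $\approx p^{2}$ and $\approx p^{3}$ accurately.

```python
from collections import Counter
from itertools import product


def mat_mul(A: tuple, B: tuple, mod: int) -> tuple:
    """Product of two 2x2 matrices with entries reduced modulo mod."""
    (a, b), (c, d) = A
    (e, f), (g, h) = B
    return (((a * e + b * g) % mod, (a * f + b * h) % mod),
            ((c * e + d * g) % mod, (c * f + d * h) % mod))


def word(hs: tuple, mod: int) -> tuple:
    """The product T^{h_1} S ... T^{h_k} S modulo mod (assumed convention)."""
    M = ((1 % mod, 0), (0, 1 % mod))
    for h in hs:
        M = mat_mul(M, ((h % mod, -1 % mod), (1 % mod, 0)), mod)
    return M


def count_brute(H: int, mod: int) -> int:
    """Count 6-tuples with |h_i| <= H whose word is +I or -I, by enumeration."""
    targets = {((1 % mod, 0), (0, 1 % mod)), ((-1 % mod, 0), (0, -1 % mod))}
    return sum(1 for hs in product(range(-H, H + 1), repeat=6)
               if word(hs, mod) in targets)


def count_mitm(H: int, mod: int) -> int:
    """Same count, matching words of length 3 against +-inverses of length 3."""
    halves = Counter(word(hs, mod) for hs in product(range(-H, H + 1), repeat=3))
    total = 0
    for ((a, b), (c, d)), mult in halves.items():
        inv = ((d % mod, -b % mod), (-c % mod, a % mod))  # valid since det = 1
        neg = tuple(tuple(-x % mod for x in row) for row in inv)
        total += mult * sum(halves[t] for t in {inv, neg})
    return total
```

Since $S^{2}=-I$ , the tuple $h_{1}=\cdots=h_{6}=0$ is always a solution, and every solution modulo $p^{2}$ is also a solution modulo $p$; both facts give quick sanity checks for the two counters.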

2.5. Comments on prime moduli

When $c=p$ is a prime and $d=p$ , the amplifier from Section 2.3 reduces to the ‘trivial’ choice

A(\chi^{\prime})=\overline{\chi}^{\prime}(I)\chi(I)=\dim\chi^{\prime}\dim\chi,

since $\mathcal{L}=\Gamma_{c}(c)=\{I\}$ . In this setting, (2.1) (with $6$ replaced by another even integer $q$ ) reads

\|\widehat{F}(\rho)\|^{q}\lesssim p^{2}H^{-q}\sum_{|h_{1}|,\ldots,|h_{q}|\leq H}\mathbbm{1}_{T^{h_{1}}S\cdots T^{h_{q}}S=I}.

This was observed, using a somewhat different language, by Shkredov [37, proofs of Lemmas 22 and 53]. Shkredov then relied on an $L^{2}$ -flattening lemma [37, Theorem 50], which stems from a result of Helfgott [19], to bound the right-hand side above by $O(p^{2}H^{-q}H^{q}p^{-3})=O(p^{-1})$ for a quite large value of $q$ depending on $\tfrac{\log p}{\log H}$ . This leads to a bound for bilinear (Type II) sums of Kloosterman sums with prime moduli [38, (6) of Theorem 4], with a power saving of $p^{-\delta}$ , where $\delta\approx\tfrac{1}{q}$ .

To obtain a more quantitatively-relevant power saving, competitive with [24, 25], one must use a smaller value of $q$ ; one would then need to solve a counting problem with few variables $h_{1},\ldots,h_{q}$ , as in Section 2.4. We do not know how to do this, but such an approach could produce good results when, e.g., $q\in\{8,10,12\}$ . In particular, assuming (6.2) for $q=8$ , one could prove a non-trivial bound for bilinear sums of Kloosterman sums with prime moduli $p$ and sequences of lengths $M\asymp N>p^{3/8+o(1)}$ ; interestingly, the same limit at $p^{3/8+o(1)}$ appears in the results of Kowalski–Michel–Sawin [25], so our work reaffirms the difficulty of this barrier.

Alternatively, to obtain non-trivial results at prime moduli, it might be possible to use a different choice of subset $\mathcal{L}\subset\textnormal{SL}_{2}(\mathbb{Z}/p\mathbb{Z})$ in the construction of the amplifier from Section 2.3. Indeed, although a normal subgroup is the most natural choice for $\mathcal{L}$ , it is possible that another conjugation-invariant subset might produce a useful amplifier when normal subgroups are not available.

3. Preliminaries

3.1. Analytic and arithmetic notation

We use the standard asymptotic notation from analytic number theory, indicating dependencies of implicit constants on a parameter $\varepsilon$ through subscripts. In particular, $f\ll_{\varepsilon}g$ and $f=O_{\varepsilon}(g)$ both mean $|f|\leq C_{\varepsilon}g$ for some constant $C_{\varepsilon}>0$ depending only on $\varepsilon$ ; $f\asymp_{\varepsilon}g$ means $f\ll_{\varepsilon}g\ll_{\varepsilon}f$ , $f=\Omega_{\varepsilon}(g)$ means $f\gg_{\varepsilon}g$ ; $f(x)=o(g(x))$ means $\tfrac{f(x)}{g(x)}\to 0$ as $x\to\infty$ ; $f(x)\ll x^{o(1)}g(x)$ is equivalent to the statement that $f(x)\ll_{\varepsilon}x^{\varepsilon}g(x)$ for all $\varepsilon>0$ . With this notation, the divisor bound reads $\sum_{d\mid c}1\ll c^{o(1)}$ .

We use the notation $\mathbbm{1}_{S}$ for both indicator functions of sets $S$ and truth values ( $0$ or $1$ ) of statements $S$ ; we also abbreviate $\mathbbm{1}_{x}:=\mathbbm{1}_{\{x\}}$ for singletons. We write $n\sim N$ for the range $N<n\leq 2N$ , $\|\alpha\|:=(\sum_{n}|\alpha_{n}|^{2})^{1/2}$ for the $\ell^{2}$ norm of a sequence $(\alpha_{n})_{n\in\mathcal{N}}$ for some $\mathcal{N}\subset\mathbb{Z}$ (or $\mathcal{N}\subset\mathbb{Z}/c\mathbb{Z}$ ), and $e(t):=\exp(2\pi it)$ for $t\in\mathbb{R}/\mathbb{Z}$ . Given a positive integer $c$ , we let $c\mathbb{Z}$ (resp., $c\mathbb{Z}_{+}$ ) be the set of integers (resp., positive integers) divisible by $c$ , and $\overline{x}$ be the inverse of $x$ modulo $c$ (here $c$ may be implied from context, e.g., in an exponential phase $e(\overline{x}/c)$ ). We let $\mu$ and $\phi$ be the Möbius and Euler totient functions. Given $a,b\in\mathbb{Z}_{+}$ , we write $(a,b)$ for their greatest common divisor (and similarly for more positive integers), and $(a,b^{\infty})$ for the greatest divisor of $a$ whose prime factors all divide $b$ . We write $p^{k}\|c$ when a prime power exactly divides a positive integer, meaning that $p^{k}\mid c$ but $p^{k+1}\nmid c$ . We will reserve the letter $\psi$ for functions on $\mathbb{Z}/c\mathbb{Z}$ , and $\Phi,\Psi$ for functions on $\mathbb{R}$ . We denote the Fourier transform of an $L^{1}$ function $\Phi:\mathbb{R}\to\mathbb{C}$ by

\widehat{\Phi}(\xi):=\int_{-\infty}^{\infty}\Phi(t)\,e(-t\xi)\,dt.

(3.1)

In particular, if $\Psi(t):=\Phi(At)e(Bt)$ for some $A>0$ and $B\in\mathbb{R}$ , then a change of variables yields

\widehat{\Psi}(\xi)=\int_{-\infty}^{\infty}\Phi(At)\,e(-t(\xi-B))\,dt=\frac{1}{A}\widehat{\Phi}\left(\frac{\xi-B}{A}\right),

(3.2)

and the Poisson summation identity reads

\sum_{n\in\mathbb{Z}}\Phi(n)=\sum_{k\in\mathbb{Z}}\widehat{\Phi}(k).

(3.3)
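As a quick numerical sanity check of (3.2) and (3.3), the following Python sketch compares both sides of Poisson summation for $\Psi(t)=\Phi(At)$ with $B=0$ . The test function $\Phi(t)=\exp(-\pi t^{2})$ , the truncation parameter `terms` and the function names are choices made here for illustration; the sketch relies on the standard fact that this Gaussian equals its own Fourier transform under the convention (3.1).

```python
import math

def gaussian(t):
    """Illustrative test function Phi(t) = exp(-pi t^2), equal to its own Fourier transform."""
    return math.exp(-math.pi * t * t)

def poisson_sides(A, terms=60):
    """Return (sum_n Psi(n), sum_k Psi_hat(k)) for Psi(t) = Phi(A t), truncated to |n|, |k| <= terms."""
    lhs = sum(gaussian(A * n) for n in range(-terms, terms + 1))
    # By (3.2) with B = 0: Psi_hat(xi) = (1/A) * Phi_hat(xi / A) = (1/A) * Phi(xi / A).
    rhs = sum(gaussian(k / A) / A for k in range(-terms, terms + 1))
    return lhs, rhs

if __name__ == "__main__":
    for A in (0.5, 1.0, 2.0):
        lhs, rhs = poisson_sides(A)
        print(f"A={A}: {lhs:.12f} vs {rhs:.12f}")
```

The two truncated sums should agree up to floating-point error, since the discarded tails are negligible for these values of $A$ .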

Given a map $M$ between finite-dimensional complex Hilbert spaces, we write its operator norm as

\|M\|:=\sup_{\|\vec{v}\|=1}\|M\vec{v}\|=\sup_{\|\vec{v}\|=\|\vec{w}\|=1}|\vec{w}^{T}M\vec{v}|.

(3.4)

On the Hilbert space $\mathbb{C}^{n}$ equipped with the Euclidean norm, we define $\|M\|_{S^{q}}$ as the $\ell^{q}$ norm of singular values of a map (or a matrix) $M$ , for $q\in[1,\infty]$ . In particular, we have

\|M\|_{S^{\infty}}=\|M\|\qquad\quad\text{and}\qquad\quad\|M\|_{S^{q}}=\textnormal{Tr}\left((MM^{*})^{q/2}\right)^{\frac{1}{q}}\text{ for }q\in 2\mathbb{Z}_{+},

(3.5)

where $M^{*}$ denotes the adjoint (conjugate transpose) of $M$ . We quickly record the following simple fact about projections and operator norms.

Lemma 3.1.

Let $V$ be a finite-dimensional complex Hilbert space, $W\subset V$ be a subspace, and $P_{W}:V\to V$ be the orthogonal projection onto $W$ . Suppose that $W$ is an invariant subspace of a linear map $M:V\to V$ (i.e., the restriction $M|_{W}:W\to W$ is well-defined). Then $\|M|_{W}\|=\|MP_{W}\|$ .

Proof.

By definition, we have $\|M|_{W}\|=\sup_{\vec{w}\in W,\|\vec{w}\|=1}\|M\vec{w}\|$ and $\|MP_{W}\|=\sup_{\vec{v}\in V,\|\vec{v}\|=1}\|MP_{W}\vec{v}\|$ . Since $P_{W}\vec{v}\in W$ with $\|P_{W}\vec{v}\|\leq\|\vec{v}\|=1$ for all $\vec{v}\in V$ with $\|\vec{v}\|=1$ , we have $\|MP_{W}\|\leq\|M|_{W}\|$ . On the other hand, for each $\vec{w}\in W$ with $\|\vec{w}\|=1$ , we have $P_{W}\vec{w}=\vec{w}$ , so $\|M|_{W}\|\leq\|MP_{W}\|$ . ∎

3.2. Bounds for Kloosterman sums

We now recall the Ramanujan and Weil bounds for Kloosterman sums, as well as some results of Kowalski–Michel–Sawin [24] and Blomer–Milićević [3].

Lemma 3.2 (Ramanujan bound).

For $c\in\mathbb{Z}_{+}$ and $n\in\mathbb{Z}$ , one has

|S(0,n;c)|\leq(n,c).

Proof.

This is a classical result which follows from Möbius inversion. ∎
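For small moduli, Lemma 3.2 can be checked by brute force. The Python sketch below evaluates $S(m,n;c)$ directly from the definition in (1.1); the function names, the tested range of moduli and the floating-point tolerance are illustrative assumptions, and the check is of course no substitute for the proof.

```python
import cmath
from math import gcd

def kloosterman(m, n, c):
    """S(m, n; c) as in (1.1): sum over units x mod c of e((m x + n xbar) / c)."""
    total = 0j
    for x in range(1, c + 1):
        if gcd(x, c) == 1:
            xbar = pow(x, -1, c)  # modular inverse; needs Python 3.8+
            total += cmath.exp(2j * cmath.pi * ((m * x + n * xbar) % c) / c)
    return total

def ramanujan_bound_holds(max_modulus, tol=1e-9):
    """Check |S(0, n; c)| <= gcd(n, c) for 2 <= c <= max_modulus and 0 <= n < c."""
    return all(
        abs(kloosterman(0, n, c)) <= gcd(n, c) + tol
        for c in range(2, max_modulus + 1)
        for n in range(c)
    )

if __name__ == "__main__":
    print(ramanujan_bound_holds(40))
```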

Lemma 3.3 (Weil bound).

For $c\in\mathbb{Z}_{+}$ and $m,n\in\mathbb{Z}$ , one has

S(m,n;c)\ll c^{o(1)}\sqrt{(m,n,c)c}.

Proof.

This is [21, Corollary 11.12] followed by the divisor bound. ∎

For the sake of completeness, we give a quick proof of the trivial bound from (1.2).

Proof of (1.2).

The second bound implicit in (1.2), with a term of $\sqrt{MNc}$ , follows immediately from Lemma 3.3 and Cauchy–Schwarz. For the first bound implicit in (1.2), we eliminate the constraint $(m,n,c)=1$ by Möbius inversion and use the identity $S(dm,dn;c)=\tfrac{\phi(c)}{\phi(c/d)}S(m,n;\tfrac{c}{d})$ to write

\mathop{\sum\sum}_{\begin{subarray}{c}m\in\mathcal{I},n\in\mathcal{J}\\ (m,n,c)=1\end{subarray}}\alpha_{m}\beta_{n}S(am,n;c)\ll c^{o(1)}\max_{d\mid c}d\left|\sum_{dm\in\mathcal{I}}\alpha_{dm}\sum_{dn\in\mathcal{J}}\beta_{dn}S(am,n;\tfrac{c}{d})\right|.

Now apply Cauchy–Schwarz in the sum over $m$ , and complete the sum over $m\ (\textnormal{mod }\tfrac{c}{d})$ to get

d\left|\sum_{dm\in\mathcal{I}}\alpha_{dm}\sum_{dn\in\mathcal{J}}\beta_{dn}S(am,n;\tfrac{c}{d})\right|\leq\|\alpha\|\left(d^{2}\sum_{m\ (\textnormal{mod }\frac{c}{d})}\left|\sum_{dn\in\mathcal{J}}\beta_{dn}S(am,n;\tfrac{c}{d})\right|^{2}\right)^{\frac{1}{2}}.

Expanding the square and the Kloosterman sums, then performing the sum over $m$ , one reaches

d^{2}\sum_{m\ (\textnormal{mod }\frac{c}{d})}\left|\sum_{dn\in\mathcal{J}}\beta_{dn}S(am,n;\tfrac{c}{d})\right|^{2}=dc\sum_{x\in(\mathbb{Z}/\frac{c}{d}\mathbb{Z})^{\times}}\left|\sum_{dn\in\mathcal{J}}\beta_{dn}e\left(\frac{nx}{c/d}\right)\right|^{2}.

Finally, complete the sum over $x\ (\textnormal{mod }\tfrac{c}{d})$ , expand the square, and perform the sum over $x$ to obtain

dc\sum_{x\in(\mathbb{Z}/\frac{c}{d}\mathbb{Z})^{\times}}\left|\sum_{dn\in\mathcal{J}}\beta_{dn}e\left(\frac{nx}{c/d}\right)\right|^{2}\leq c^{2}\|\beta\|^{2}.

Putting these bounds together completes our proof. ∎
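The identity $S(dm,dn;c)=\tfrac{\phi(c)}{\phi(c/d)}S(m,n;\tfrac{c}{d})$ used at the start of this proof, together with the inequality $\tfrac{\phi(c)}{\phi(c/d)}\leq d$ that produces the factor $d$ in the first display, can be tested numerically. The following Python sketch does so for all proper divisors $d$ of a given small modulus; the helper names and the tolerance are assumptions made for illustration.

```python
import cmath
from math import gcd

def kloosterman(m, n, c):
    """S(m, n; c): sum over units x mod c of e((m x + n xbar) / c)."""
    total = 0j
    for x in range(1, c + 1):
        if gcd(x, c) == 1:
            xbar = pow(x, -1, c)  # modular inverse; needs Python 3.8+
            total += cmath.exp(2j * cmath.pi * ((m * x + n * xbar) % c) / c)
    return total

def euler_phi(c):
    """Euler totient, by direct counting."""
    return sum(1 for x in range(1, c + 1) if gcd(x, c) == 1)

def max_identity_error(c):
    """Largest |S(dm, dn; c) - phi(c)/phi(c/d) * S(m, n; c/d)| over proper divisors d of c."""
    worst = 0.0
    for d in range(1, c):
        if c % d:
            continue
        q = c // d  # reduced modulus, at least 2 because d < c
        ratio = euler_phi(c) // euler_phi(q)  # size of each fibre of the reduction map on units
        assert ratio <= d
        for m in range(q):
            for n in range(q):
                lhs = kloosterman(d * m, d * n, c)
                rhs = ratio * kloosterman(m, n, q)
                worst = max(worst, abs(lhs - rhs))
    return worst

if __name__ == "__main__":
    print(max_identity_error(12))
```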

Theorem 3.4 (Kowalski–Michel–Sawin [24]).

Let $p$ be a prime and $M,N\in\mathbb{Z}$ be such that $1\leq N\leq M\leq p-1$ and $p^{1/4}<MN<p^{5/4}$ . Then for any complex sequences $(\alpha_{m})_{m\leq M}$ , $(\beta_{n})_{n\leq N}$ and any $a\in(\mathbb{Z}/p\mathbb{Z})^{\times}$ , one has

\sum_{m=1}^{M}\sum_{n=1}^{N}\alpha_{m}\beta_{n}S(am,n;p)\ll\|\alpha\|\|\beta\|p^{o(1)}\sqrt{MNp}\left(N^{-\frac{1}{2}}+(MN)^{-\frac{3}{16}}p^{\frac{11}{64}}\right).

Proof.

This is [24, Theorem 1.1] with $k=2$ and $M,N$ swapped. ∎

Remark.

The constraint $p^{1/4}<MN<p^{5/4}$ from Theorem 3.4 can be removed in light of the trivial bound (1.2). Indeed, if $MN\leq p^{1/4}$ , then

\sqrt{MNp}\cdot(MN)^{-\frac{3}{16}}\cdot p^{\frac{11}{64}}\geq\sqrt{MNp}\cdot p^{\frac{11}{64}-\frac{3}{64}}>\sqrt{MNp},

so the bound (1.2) is better. Similarly, if $MN\geq p^{5/4}$ , then

\sqrt{MNp}\cdot(MN)^{-\frac{3}{16}}\cdot p^{\frac{11}{64}}\geq p^{\frac{5}{4}(\frac{1}{2}-\frac{3}{16})}\cdot p^{\frac{1}{2}+\frac{11}{64}}=p^{\frac{25}{64}+\frac{43}{64}}>p.
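The exponent arithmetic in this remark can be reproduced exactly with rational numbers. The short Python sketch below evaluates the two exponents of $p$ at the endpoints $MN=p^{1/4}$ and $MN=p^{5/4}$ ; the variable names are illustrative only.

```python
from fractions import Fraction

# Exponent of p in (MN)^(-3/16) * p^(11/64) at the endpoint MN = p^(1/4).
small_range_exponent = Fraction(11, 64) - Fraction(3, 16) * Fraction(1, 4)

# Exponent of p in sqrt(MN p) * (MN)^(-3/16) * p^(11/64) at the endpoint MN = p^(5/4).
large_range_exponent = (
    Fraction(5, 4) * (Fraction(1, 2) - Fraction(3, 16)) + Fraction(1, 2) + Fraction(11, 64)
)

print(small_range_exponent, large_range_exponent)  # prints: 1/8 17/16
```

The first exponent is positive and the second exceeds $1$ , matching the two displayed inequalities.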

Theorem 3.5 (Blomer–Milićević [3]).

Let $c,d,M,N\in\mathbb{Z}_{+}$ be such that $d\mid c$ and $d$ is odd. Then for any complex sequences $(\alpha_{m})_{m\leq M}$ and $(\beta_{n})_{n\leq N}$ such that $|\alpha_{m}|\leq 1$ for all $m$ , and any $a\in(\mathbb{Z}/c\mathbb{Z})^{\times}$ , one has

\mathop{\sum_{m=1}^{M}\sum_{n=1}^{N}}_{(n,c)=1}\alpha_{m}\beta_{n}S(am,n;c)\ll\sqrt{M}\|\beta\|(MNc)^{\frac{1}{2}+o(1)}\left(\frac{c^{1/2}}{d^{1/2}M^{1/2}}+\frac{1}{d^{1/4}}+\frac{d^{1/4}}{N^{1/2}}\right).

Proof.

We note that [3, Theorem 5] does not include an $a$ -scalar inside the Kloosterman sum, but it holds in this slightly more general form with the same proof, and it is in fact applied this way in [3, p. 471, after (4.2)]. Dyadically summing instances of [3, Theorem 5] with $(q,r,s,M,K,\lambda(k))$ in loc. cit. replaced by $(c,c,\tfrac{c}{d},N,M,\alpha_{m})$ , one obtains the bound

\sum_{\begin{subarray}{c}n\leq N\\ (n,c)=1\end{subarray}}\left|\sum_{m\leq M}\alpha_{m}S(am,n;c)\right|^{2}\ll(cMN)^{o(1)}M^{2}Nc\left(\frac{c}{dM}+\frac{1}{\sqrt{d}}+\frac{\sqrt{d}}{N}\right).

The desired bound now follows from Cauchy–Schwarz in the shape

\left|\mathop{\sum_{m=1}^{M}\sum_{n=1}^{N}}_{(n,c)=1}\alpha_{m}\beta_{n}S(am,n;c)\right|^{2}\leq\|\beta\|^{2}\sum_{\begin{subarray}{c}n\leq N\\ (n,c)=1\end{subarray}}\left|\sum_{m\leq M}\alpha_{m}S(am,n;c)\right|^{2}.

(Since $(\beta_{n})$ can be chosen to attain equality in this Cauchy–Schwarz step, Theorem 3.5 is in fact a restatement of [3, Theorem 5].) ∎

3.3. Fourier analysis on finite groups

Here we recall some general facts and notation from the representation theory of finite groups; we point the reader to [35, 16, 42, 20] for more background. Let $G$ be a finite group with identity element $e$ . A (unitary) representation of $G$ is a homomorphism

\rho:G\to U(V),

where $V$ is a finite-dimensional complex Hilbert space and $U(V)$ is the group of unitary transformations of $V$ . In particular, $\rho(e)=\textnormal{Id}_{V}$ is the identity transformation on $V$ . Given a choice of orthonormal basis of $V\cong\mathbb{C}^{\dim\rho}$ , one may of course represent the transformations $\rho(g)$ for $g\in G$ as matrices in $\mathbb{C}^{\dim\rho\times\dim\rho}$ . We write

\dim\rho:=\dim V

for the dimension of $\rho$ . We say that two representations $\rho_{1}:G\to U(V_{1})$ , $\rho_{2}:G\to U(V_{2})$ are isomorphic iff there is an invertible linear map $M:V_{1}\to V_{2}$ such that $M\circ\rho_{1}(g)=\rho_{2}(g)\circ M$ for all $g\in G$ (since we normalize all representations to be unitary, the map $M$ can also be taken unitary).

Example 3.6.

We write $\mathbf{0}:G\to U(\{0\})$ for the zero representation given by $\mathbf{0}(g)=0\ \forall g\in G$ , and $\mathbf{1}:G\to U(\mathbb{C})$ for the trivial representation given by $\mathbf{1}(g)=\textnormal{Id}_{\mathbb{C}}\ \forall g\in G$ . Any action of $G$ on a finite set $X$ induces a permutation representation $\rho:G\to U(\mathbb{C}^{X})$ , defined by $(\rho(g)f)(x):=f(g^{-1}x)$ for $g\in G$ , $x\in X$ . The regular representation $R_{G}$ is the permutation representation induced by the action by left-multiplication on $X=G$ , so $\dim R_{G}=|G|$ .

Given two representations $\rho_{1}:G\to U(V_{1})$ and $\rho_{2}:G\to U(V_{2})$ , we write $\rho_{1}\oplus\rho_{2}:G\to U(V_{1}\oplus V_{2})$ and $\rho_{1}\otimes\rho_{2}:G\to U(V_{1}\otimes V_{2})$ for their direct sum and tensor product. The operations $\oplus$ and $\otimes$ have identity elements $\mathbf{0}$ and $\mathbf{1}$ respectively (up to isomorphism). Given $\rho:G\to U(V)$ , we write

\rho\oplus^{m}:=\underbrace{\rho\oplus\cdots\oplus\rho}_{m\text{ times}}

for all nonnegative integers $m$ ; when $m=0$ , we interpret this as the zero representation $\mathbf{0}$ . We use a similar notation for repeated direct sums of linear maps.

An invariant subspace $W$ of a representation $\rho:G\to U(V)$ is a subspace of $V$ such that $\rho(g)W\subset W$ for all $g\in G$ . For such $W$ , we define $\rho|_{W}:G\to U(W)$ by $\rho|_{W}(g):=\rho(g)|_{W}$ for all $g\in G$ , which is automatically unitary, and we say that $\rho|_{W}$ is a subrepresentation of $\rho$ . One can decompose $\rho\cong\rho|_{W}\oplus\rho|_{W^{\perp}}$ ; conversely, if $\rho\cong\rho_{1}\oplus\rho_{2}$ then $\rho_{1}$ and $\rho_{2}$ are isomorphic to subrepresentations of $\rho$ .

We say that a representation of $G$ is irreducible iff it is nonzero and has no nonzero subrepresentations other than itself. We write $\widehat{G}$ for a complete set of irreducible representations of $G$ up to isomorphism, which always includes the trivial representation $\mathbf{1}$ . Any representation $\rho$ of $G$ has a unique decomposition (up to permutation and isomorphism) into irreducible representations,

\rho\cong\bigoplus_{\rho^{\prime}\in\widehat{G}}\rho^{\prime}\oplus^{\textnormal{Mult}(\rho^{\prime},\rho)},

(3.6)

where $\textnormal{Mult}(\rho^{\prime},\rho)$ is called the multiplicity of $\rho^{\prime}$ inside $\rho$ . In particular, $\textnormal{Mult}(\rho^{\prime},R_{G})=\dim\rho^{\prime}$ .

Given two finite groups $G_{1},G_{2}$ and representations $\rho_{1}:G_{1}\to U(V_{1})$ and $\rho_{2}:G_{2}\to U(V_{2})$ , we write $\rho_{1}\boxtimes\rho_{2}:G_{1}\times G_{2}\to U(V_{1}\otimes V_{2})$ for the representation of $G_{1}\times G_{2}$ given by

(\rho_{1}\boxtimes\rho_{2})(g_{1},g_{2}):=\rho_{1}(g_{1})\otimes\rho_{2}(g_{2}),\qquad\quad g_{1}\in G_{1},\ g_{2}\in G_{2}.

The irreducible representations of $G_{1}\times G_{2}$ are (up to isomorphism) precisely those of the form $\rho_{1}\boxtimes\rho_{2}$ where $\rho_{1}\in\widehat{G}_{1}$ and $\rho_{2}\in\widehat{G}_{2}$ [35, §3.2].

Notation 3.7.

If $G_{1},G_{2},\rho_{1},\rho_{2}$ are as above, and $G_{1,2}$ is a group isomorphic to $G_{1}\times G_{2}$ by a fixed implicit map (such as (3.16)), we also use the notation $\rho_{1}\boxtimes\rho_{2}$ to describe representations of $G_{1,2}$ .

A character $\chi:G\to\mathbb{C}$ is any function of the form $\chi(g)=\textnormal{Tr}\rho(g)$ , where $\rho$ is a representation of $G$ ; note that characters are constant on conjugacy classes, that $\chi(e)=\dim\rho$ and $\chi(g^{-1})=\overline{\chi}(g)$ , and that isomorphic representations induce the same character. If $\rho_{1},\rho_{2}$ are two representations of $G$ with characters $\chi_{1},\chi_{2}$ , then $\textnormal{Tr}(\rho_{1}\oplus\rho_{2})=\chi_{1}+\chi_{2}$ and $\textnormal{Tr}(\rho_{1}\otimes\rho_{2})=\chi_{1}\chi_{2}$ . If $\rho_{1},\rho_{2}$ are representations of $G_{1},G_{2}$ with characters $\chi_{1},\chi_{2}$ (respectively), then $\textnormal{Tr}(\rho_{1}\boxtimes\rho_{2})(g_{1},g_{2})=\chi_{1}(g_{1})\chi_{2}(g_{2})$ . We say that $\chi$ is irreducible iff $\rho$ is, and write $\textnormal{Irr}(G)$ for the set of all irreducible characters of $G$ . The character table of $G$ satisfies the following orthogonality relations.

Lemma 3.8 (Character orthogonality).

One has

\sum_{g\in G}\chi_{1}(g)\overline{\chi}_{2}(g)=|G|\,\mathbbm{1}_{\chi_{1}=\chi_{2}},\qquad\qquad\chi_{1},\chi_{2}\in\textnormal{Irr}(G),

(3.7)

\sum_{\chi\in\textnormal{Irr}(G)}\chi(g_{1})\overline{\chi}(g_{2})=\begin{cases}\frac{|G|}{|C|},&g_{1},g_{2}\text{ belong to the same conjugacy class $C$ of $G$,}\\ 0,&g_{1},g_{2}\in G\text{ are not conjugate.}\end{cases}

(3.8)

Proof.

See, e.g., [16, Theorem 2.12 and Exercise 2.21]. ∎
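To make the two orthogonality relations concrete, the Python sketch below checks them on the character table of the symmetric group $S_{3}$ . The choice of $S_{3}$ as a test case, and all names in the code, are assumptions made for illustration; the table itself is the standard one (classes: identity, transpositions, 3-cycles).

```python
# Conjugacy classes of S_3, in the order: identity, transpositions, 3-cycles.
CLASS_SIZES = [1, 3, 2]
GROUP_ORDER = sum(CLASS_SIZES)

# Irreducible characters of S_3, listed by their values on the three classes.
CHARACTERS = {
    "trivial": [1, 1, 1],
    "sign": [1, -1, 1],
    "standard": [2, 0, -1],
}

def row_sum(chi1, chi2):
    """Left-hand side of (3.7): sum over g of chi1(g) * conj(chi2(g)); values here are real."""
    return sum(size * a * b for size, a, b in zip(CLASS_SIZES, chi1, chi2))

def column_sum(i, j):
    """Left-hand side of (3.8) for elements g1, g2 lying in classes number i and j."""
    return sum(chi[i] * chi[j] for chi in CHARACTERS.values())

if __name__ == "__main__":
    for name1, chi1 in CHARACTERS.items():
        for name2, chi2 in CHARACTERS.items():
            print(name1, name2, row_sum(chi1, chi2))
    for i in range(3):
        print([column_sum(i, j) for j in range(3)])
```

Distinct classes give column sum $0$ , while equal classes give $|G|/|C|$ , i.e. $6$ , $2$ and $3$ respectively.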

It follows from (3.7) and (3.6) that for an arbitrary character $\chi=\textnormal{Tr}\rho$ of $G$ , one has

\frac{1}{|G|}\sum_{g\in G}|\chi(g)|^{2}=\sum_{\rho^{\prime}\in\widehat{G}}\textnormal{Mult}(\rho^{\prime},\rho)^{2}.

(3.9)

We may also restrict a representation $\rho:G\to U(V)$ and its character $\chi=\textnormal{Tr}\rho$ to a subgroup $H\leq G$ , to obtain a representation $\rho|_{H}:H\to U(V)$ with character $\chi|_{H}=\textnormal{Tr}\rho|_{H}$ . If $\rho$ is irreducible, $\rho|_{H}$ is not necessarily irreducible. When $H$ is a normal subgroup, the structure of $\rho|_{H}$ can be better understood using Clifford theory [6].

Lemma 3.9 (Clifford).

Let $G$ be a group, $N\triangleleft G$ be a normal subgroup, and $\rho\in\widehat{G}$ be an irreducible representation. Then there exist positive integers $L,m,d$ with $\dim\rho=Lmd$ , and non-isomorphic irreducible representations $\sigma_{1},\ldots,\sigma_{L}\in\widehat{N}$ of dimension $d$ , all lying in the same orbit of the action of $G$ by conjugation (i.e., $(g\cdot\sigma)(n):=\sigma(gng^{-1})$ for $g\in G$ and $n\in N$ ), such that

\rho|_{N}\cong\bigoplus_{\ell=1}^{L}\sigma_{\ell}\oplus^{m}.

Proof.

See, e.g., [20, Theorem 6.5]. ∎

Given a function $F:G\to\mathbb{C}$ and a (not necessarily irreducible) representation $\rho:G\to U(V)$ , we define the Fourier coefficient $\widehat{F}(\rho):V\to V$ by

\widehat{F}(\rho):=\sum_{g\in G}F(g)\rho(g).

(3.10)

This obeys $\widehat{F_{1}*F_{2}}(\rho)=\widehat{F_{1}}(\rho)\widehat{F_{2}}(\rho)$ , where $(F_{1}*F_{2})(g):=\sum_{g_{1}g_{2}=g}F_{1}(g_{1})F_{2}(g_{2})$ denotes the convolution of two functions $F_{1},F_{2}:G\to\mathbb{C}$ . In particular, if $G=\mathbb{Z}/c\mathbb{Z}$ , the irreducible representations (which are all $1$ -dimensional since $G$ is abelian) are of the shape $\rho_{a}(g):=e(\tfrac{ag}{c})$ for $a,g\in\mathbb{Z}/c\mathbb{Z}$ . In this case, we write

\widehat{F}(a):=\widehat{F}(\rho_{-a})=\sum_{g\in\mathbb{Z}/c\mathbb{Z}}F(g)\,e\left(-\frac{ag}{c}\right).

(3.11)
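In the abelian case $G=\mathbb{Z}/c\mathbb{Z}$ , the multiplicativity of Fourier coefficients under convolution is easy to test numerically with the normalization (3.11). In the Python sketch below, functions on $\mathbb{Z}/c\mathbb{Z}$ are stored as lists of length $c$ , and the convolution sums $F_{1}(g_{1})F_{2}(g_{2})$ over $g_{1}+g_{2}=g$ ; the sample data and all names are hypothetical.

```python
import cmath

def fourier(F, a):
    """F_hat(a) = sum over g of F(g) * e(-a g / c), following (3.11); F is a list of length c."""
    c = len(F)
    return sum(F[g] * cmath.exp(-2j * cmath.pi * a * g / c) for g in range(c))

def convolve(F1, F2):
    """(F1 * F2)(g) = sum over g1 + g2 = g (mod c) of F1(g1) * F2(g2)."""
    c = len(F1)
    return [sum(F1[g1] * F2[(g - g1) % c] for g1 in range(c)) for g in range(c)]

def max_convolution_error(F1, F2):
    """Largest gap between the Fourier coefficients of F1 * F2 and the products of coefficients."""
    c = len(F1)
    conv = convolve(F1, F2)
    return max(abs(fourier(conv, a) - fourier(F1, a) * fourier(F2, a)) for a in range(c))

if __name__ == "__main__":
    F1 = [1, 2, 0, -1, 3]   # arbitrary sample values on Z/5Z
    F2 = [2, 0, 1, 1, -2]
    print(max_convolution_error(F1, F2))
```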

Lemma 3.10.

Let $F:G\to\mathbb{C}$ , $\rho:G\to U(V)$ be a representation, and $q\in[1,\infty)$ . Then one has

\|\widehat{F}(\rho)\|_{S^{q}}^{q}=\sum_{\rho^{\prime}\in\widehat{G}}\textnormal{Mult}(\rho^{\prime},\rho)\|\widehat{F}(\rho^{\prime})\|_{S^{q}}^{q},\qquad\qquad\|\widehat{F}(\rho)\|=\max_{\begin{subarray}{c}\rho^{\prime}\in\widehat{G}\\ \textnormal{Mult}(\rho^{\prime},\rho)>0\end{subarray}}\|\widehat{F}(\rho^{\prime})\|.

Proof.

By (3.6), there exists a unitary map $U$ (from $V$ to the direct sum of $\rho$ ’s irreducible invariant subspaces) such that for any $g\in G$ ,

U\rho(g)U^{*}=\bigoplus_{\rho^{\prime}\in\widehat{G}}{\rho^{\prime}(g)\oplus}^{\textnormal{Mult}(\rho^{\prime},\rho)},

using some implicit ordering of $\widehat{G}$ . But then, by (3.10), we have

U\widehat{F}(\rho)U^{*}=\sum_{g\in G}F(g)U\rho(g)U^{*}=\sum_{g\in G}F(g)\bigoplus_{\rho^{\prime}\in\widehat{G}}\rho^{\prime}(g)\oplus^{\textnormal{Mult}(\rho^{\prime},\rho)}=\bigoplus_{\rho^{\prime}\in\widehat{G}}\left(\sum_{g\in G}F(g)\rho^{\prime}(g)\right)\oplus^{\textnormal{Mult}(\rho^{\prime},\rho)}=\bigoplus_{\rho^{\prime}\in\widehat{G}}\widehat{F}(\rho^{\prime})\oplus^{\textnormal{Mult}(\rho^{\prime},\rho)},

and the conclusion follows from the fact that the multiset of singular values of a direct sum of matrices is the union of the multisets of singular values of those matrices. ∎

3.4. Facts about $\textnormal{SL}_{2}(\mathbb{Z}/c\mathbb{Z})$

Let $c\in\mathbb{Z}_{+}$ . Recall the special linear groups $\textnormal{SL}_{2}(\mathbb{Z})$ and $\textnormal{SL}_{2}(\mathbb{Z}/c\mathbb{Z})$ of matrices in $\mathbb{Z}^{2\times 2}$ (resp., $(\mathbb{Z}/c\mathbb{Z})^{2\times 2}$ ) with determinant $1$ , and the projective special linear groups,

\textnormal{PSL}_{2}(\mathbb{Z}):=\textnormal{SL}_{2}(\mathbb{Z})/\{\pm I\},\qquad\textnormal{PSL}_{2}(\mathbb{Z}/c\mathbb{Z}):=\textnormal{SL}_{2}(\mathbb{Z}/c\mathbb{Z})/_{\{\gamma I:\gamma\in\mathbb{Z}/c\mathbb{Z},\gamma^{2}=1\}}.

(3.12)

When the group $\textnormal{SL}_{2}(\mathbb{Z})$ , $\textnormal{PSL}_{2}(\mathbb{Z})$ , $\textnormal{SL}_{2}(\mathbb{Z}/c\mathbb{Z})$ or $\textnormal{PSL}_{2}(\mathbb{Z}/c\mathbb{Z})$ is understood from context, we write

I:=\begin{pmatrix}1&0\\ 0&1\end{pmatrix},\qquad T:=\begin{pmatrix}1&1\\ 0&1\end{pmatrix},\qquad S:=\begin{pmatrix}0&-1\\ 1&0\end{pmatrix},

(3.13)

which satisfy the relations $-S^{2}=-(ST)^{3}=I$ , and in the case of $\textnormal{SL}_{2}(\mathbb{Z}/c\mathbb{Z})$ or $\textnormal{PSL}_{2}(\mathbb{Z}/c\mathbb{Z})$ , $T^{c}=I$ . Note that $T$ and $S$ generate $\textnormal{SL}_{2}(\mathbb{Z})$ .
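The stated relations among $I$ , $T$ and $S$ can be confirmed by direct $2\times 2$ matrix multiplication. The Python sketch below does this over $\mathbb{Z}$ , and checks $T^{c}=I$ after reducing entries modulo $c$ ; the helper functions are written here for illustration only.

```python
I = ((1, 0), (0, 1))
T = ((1, 1), (0, 1))
S = ((0, -1), (1, 0))
MINUS_I = ((-1, 0), (0, -1))

def mat_mul(A, B, modulus=None):
    """Product of 2x2 integer matrices, optionally with entries reduced mod `modulus`."""
    (a, b), (c, d) = A
    (e, f), (g, h) = B
    result = ((a * e + b * g, a * f + b * h), (c * e + d * g, c * f + d * h))
    if modulus is not None:
        result = tuple(tuple(entry % modulus for entry in row) for row in result)
    return result

def mat_pow(A, exponent, modulus=None):
    """A raised to a nonnegative integer power by repeated multiplication."""
    result = I
    for _ in range(exponent):
        result = mat_mul(result, A, modulus)
    return result

if __name__ == "__main__":
    print(mat_pow(S, 2))              # S^2
    print(mat_pow(mat_mul(S, T), 3))  # (ST)^3
    print(mat_pow(T, 12, 12))         # T^c reduced mod c, here with c = 12
```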

Notation 3.11 (Projective line).

For $c\in\mathbb{Z}_{+}$ , we recall the projective line

\mathbb{P}^{1}(\mathbb{Z}/c\mathbb{Z}):=\left\{(x,y):x,y\in\mathbb{Z}/c\mathbb{Z},\ x(\mathbb{Z}/c\mathbb{Z})+y(\mathbb{Z}/c\mathbb{Z})=\mathbb{Z}/c\mathbb{Z}\right\}/_{\sim},

where $\sim$ is the equivalence relation generated by $(x,y)\sim(\alpha x,\alpha y)$ for $\alpha\in(\mathbb{Z}/c\mathbb{Z})^{\times}$ . We write the equivalence class of $(x,y)$ as $[x:y]$ , and we will typically use the letters $u,v$ to denote projective points in $\mathbb{P}^{1}(\mathbb{Z}/c\mathbb{Z})$ , reserving $x,y$ for elements of $\mathbb{Z}/c\mathbb{Z}$ . For $d\mid c$ , we write the natural map $\mathbb{P}^{1}(\mathbb{Z}/c\mathbb{Z})\to\mathbb{P}^{1}(\mathbb{Z}/d\mathbb{Z})$ which reduces both entries modulo $d$ as $u\mapsto u\ \textnormal{mod }d$ .

The group $\textnormal{PSL}_{2}(\mathbb{Z}/c\mathbb{Z})$ (and, through it, $\textnormal{SL}_{2}(\mathbb{Z}/c\mathbb{Z})$ ) acts on $\mathbb{P}^{1}(\mathbb{Z}/c\mathbb{Z})$ by

\begin{pmatrix}m&n\\ p&q\end{pmatrix}[x:y]:=[mx+ny:px+qy].

(3.14)

One can think of $\mathbb{P}^{1}(\mathbb{Z}/c\mathbb{Z})$ as $\mathbb{Z}/c\mathbb{Z}$ with a few additional points, which must be included to obtain a well-defined action. Indeed, any projective point $u\in\mathbb{P}^{1}(\mathbb{Z}/c\mathbb{Z})$ can be written as $u=[x:y]$ for some $x\in\{1,\ldots,c\}$ and $y\mid c$ with $(x,y)=1$ , and thus $|\mathbb{P}^{1}(\mathbb{Z}/c\mathbb{Z})|=c^{1+o(1)}$ . In particular, one can embed $\mathbb{Z}/c\mathbb{Z}\subset\mathbb{P}^{1}(\mathbb{Z}/c\mathbb{Z})$ by $x\mapsto[x:1]$ , and via this embedding, the generators from (3.13) act on elements of $\mathbb{Z}/c\mathbb{Z}$ by

Tx=x+1,\qquad\qquad Sy=-\overline{y},\qquad\qquad\text{for }x\in\mathbb{Z}/c\mathbb{Z},\ y\in(\mathbb{Z}/c\mathbb{Z})^{\times}.

We now briefly go over a few facts about the subgroups and representations of $\textnormal{SL}_{2}(\mathbb{Z}/c\mathbb{Z})$ .

Notation 3.12 (Reduction mod $d$ ).

Given a positive integer $d$ with $d\mid c$ , we denote by

\pi_{c,d}:\textnormal{SL}_{2}(\mathbb{Z}/c\mathbb{Z})\to\textnormal{SL}_{2}(\mathbb{Z}/d\mathbb{Z})

the natural epimorphism which ‘reads’ the entries of $g\in\textnormal{SL}_{2}(\mathbb{Z}/c\mathbb{Z})$ modulo $d$ . We write

\Gamma_{c}(d):=\ker\pi_{c,d}

for the congruence subgroup given by the kernel of this map (consisting of matrices of the form $I+dA$ , where one may view the entries of $A$ as elements of $\mathbb{Z}/\tfrac{c}{d}\mathbb{Z}$ ).

Lemma 3.13.

$\textnormal{SL}_{2}(\mathbb{Z}/c\mathbb{Z})$ acts transitively on $\mathbb{P}^{1}(\mathbb{Z}/c\mathbb{Z})$ (i.e., there is only one orbit). In fact, for $d\mid c$ , there is a bijection between $\mathbb{P}^{1}(\mathbb{Z}/d\mathbb{Z})$ and orbits of $\mathbb{P}^{1}(\mathbb{Z}/c\mathbb{Z})$ under $\Gamma_{c}(d)$ ,

\begin{array}[]{rcl}\Gamma_{c}(d)\backslash\mathbb{P}^{1}(\mathbb{Z}/c\mathbb{Z})&\longrightarrow&\mathbb{P}^{1}(\mathbb{Z}/d\mathbb{Z}),\\ \Gamma_{c}(d)\cdot u&\longmapsto&u\ \textnormal{mod }d.\end{array}

(3.15)

Proof.

For any $[x:y]\in\mathbb{P}^{1}(\mathbb{Z}/c\mathbb{Z})$ , there exist by definition $a,b\in\mathbb{Z}/c\mathbb{Z}$ with $ax+by\equiv 1\ (\textnormal{mod }c)$ , so $[x:y]=\left(\begin{smallmatrix}x&-b\\ y&a\end{smallmatrix}\right)[1:0]\in\textnormal{SL}_{2}(\mathbb{Z}/c\mathbb{Z})\cdot[1:0]$ . Thus the action of $\textnormal{SL}_{2}(\mathbb{Z}/c\mathbb{Z})$ on $\mathbb{P}^{1}(\mathbb{Z}/c\mathbb{Z})$ is transitive.

The map in (3.15) is well-defined since $(I+dA)u\ (\textnormal{mod }d)=u\ (\textnormal{mod }d)$ for any $I+dA\in\Gamma_{c}(d)$ . It is surjective since the original map $\mathbb{P}^{1}(\mathbb{Z}/c\mathbb{Z})\to\mathbb{P}^{1}(\mathbb{Z}/d\mathbb{Z})$ is surjective. To show that (3.15) is also injective, suppose that $u\ (\textnormal{mod }d)=v\ (\textnormal{mod }d)$ for some $u,v\in\mathbb{P}^{1}(\mathbb{Z}/c\mathbb{Z})$ ; we aim to show that $\Gamma_{c}(d)\cdot u=\Gamma_{c}(d)\cdot v$ . By the transitivity of the action of $\textnormal{SL}_{2}(\mathbb{Z}/c\mathbb{Z})$ , we can find $g\in\textnormal{SL}_{2}(\mathbb{Z}/c\mathbb{Z})$ such that $gv=[1:0]\in\mathbb{P}^{1}(\mathbb{Z}/c\mathbb{Z})$ , so

(gu)\ \textnormal{mod }d=(gv)\ \textnormal{mod }d=[1:0]\in\mathbb{P}^{1}(\mathbb{Z}/d\mathbb{Z}).

Write $gu=[xd+1:yd]$ for some $x,y\in\mathbb{Z}/c\mathbb{Z}$ . Since $gu\in\mathbb{P}^{1}(\mathbb{Z}/c\mathbb{Z})$ , we have $1=(xd+1,yd,c)=(xd+1,yd^{2},c)$ , so there exist $a,b\in\mathbb{Z}/c\mathbb{Z}$ with $a(xd+1)+byd^{2}\equiv 1\ (\textnormal{mod }c)$ , and in particular $a\equiv 1\ (\textnormal{mod }d)$ . Then,

gu=\begin{pmatrix}xd+1&-bd\\ yd&a\end{pmatrix}[1:0]\in\Gamma_{c}(d)\cdot gv=g\Gamma_{c}(d)\cdot v,

where the last equality is due to the normality of $\Gamma_{c}(d)$ . Hence $u\in\Gamma_{c}(d)\cdot v$ , as we wanted. ∎
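For small moduli, both statements of Lemma 3.13 can be verified by exhaustive enumeration. The Python sketch below represents each projective point by the smallest unit multiple of one of its coordinate pairs (a normalization chosen here for convenience), lists $\textnormal{SL}_{2}(\mathbb{Z}/c\mathbb{Z})$ and $\Gamma_{c}(d)$ by brute force, and tests transitivity and the orbit bijection; all function names are hypothetical.

```python
from itertools import product
from math import gcd

def normalize(x, y, c):
    """Canonical representative of [x : y] in P^1(Z/cZ): the least unit multiple of (x, y)."""
    return min(((a * x) % c, (a * y) % c) for a in range(1, c + 1) if gcd(a, c) == 1)

def projective_line(c):
    """All points of P^1(Z/cZ), as canonical pairs."""
    return {normalize(x, y, c) for x, y in product(range(c), repeat=2) if gcd(gcd(x, y), c) == 1}

def sl2(c):
    """All matrices (m, n; p, q) over Z/cZ with determinant 1, as tuples (m, n, p, q)."""
    return [g for g in product(range(c), repeat=4) if (g[0] * g[3] - g[1] * g[2]) % c == 1 % c]

def act(g, u, c):
    """The action (3.14) of g on the projective point u."""
    (m, n, p, q), (x, y) = g, u
    return normalize(m * x + n * y, p * x + q * y, c)

def is_transitive(c):
    """Does the orbit of [1 : 0] under SL_2(Z/cZ) cover all of P^1(Z/cZ)?"""
    return {act(g, normalize(1, 0, c), c) for g in sl2(c)} == projective_line(c)

def orbit_bijection_holds(c, d):
    """Do the orbits of Gamma_c(d) on P^1(Z/cZ) correspond bijectively to P^1(Z/dZ) via reduction?"""
    gamma = [g for g in sl2(c) if all((e - t) % d == 0 for e, t in zip(g, (1, 0, 0, 1)))]
    orbits = {frozenset(act(g, u, c) for g in gamma) for u in projective_line(c)}
    images = [{normalize(x, y, d) for x, y in orbit} for orbit in orbits]
    reached = set().union(*images)
    single_valued = all(len(image) == 1 for image in images)
    return single_valued and reached == projective_line(d) and len(orbits) == len(reached)

if __name__ == "__main__":
    print(is_transitive(6), orbit_bijection_holds(6, 2), orbit_bijection_holds(9, 3))
```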

By the Chinese remainder theorem ( $\mathbb{Z}/c\mathbb{Z}\cong\prod_{p^{k}\|c}\mathbb{Z}/p^{k}\mathbb{Z}$ ), combining the maps $\pi_{c,p^{k}}$ for $p^{k}\|c$ produces isomorphisms

\textnormal{SL}_{2}(\mathbb{Z}/c\mathbb{Z})\cong\prod_{p^{k}\|c}\textnormal{SL}_{2}(\mathbb{Z}/p^{k}\mathbb{Z}),\qquad\qquad\Gamma_{c}(d)\cong\prod_{\begin{subarray}{c}p^{k}\|c\\ p^{j}\|d\end{subarray}}\Gamma_{p^{k}}(p^{j}),

(3.16)

for $d\mid c$ (in the products above, it is understood that only primes which divide $c$ are included, so $k\geq 1$ , but we allow $j=0$ ). Since $|\textnormal{SL}_{2}(\mathbb{Z}/p^{k}\mathbb{Z})|=p^{3k}(1-\tfrac{1}{p^{2}})$ , it follows that

|\textnormal{SL}_{2}(\mathbb{Z}/c\mathbb{Z})|=c^{3}\prod_{\text{prime }p|c}\left(1-\frac{1}{p^{2}}\right)\asymp c^{3}\qquad\Rightarrow\qquad|\Gamma_{c}(d)|=\frac{|\textnormal{SL}_{2}(\mathbb{Z}/c\mathbb{Z})|}{|\textnormal{SL}_{2}(\mathbb{Z}/d\mathbb{Z})|}\asymp\frac{c^{3}}{d^{3}},

(3.17)

and that the irreducible representations of $\textnormal{SL}_{2}(\mathbb{Z}/c\mathbb{Z})$ can be parametrized as

\widehat{\textnormal{SL}}_{2}(\mathbb{Z}/c\mathbb{Z})=\left\{\mathop{\mathchoice{\vbox{\hbox{\scalebox{2.0}{$\displaystyle\boxtimes$}}}}{\vbox{\hbox{\scalebox{2.0}{$\displaystyle\boxtimes$}}}}{\vbox{\hbox{\scalebox{2.0}{$\displaystyle\boxtimes$}}}}{\vbox{\hbox{\scalebox{2.0}{$\displaystyle\boxtimes$}}}}}_{p^{k}\|c}\rho_{p,k}:\rho_{p,k}\in\widehat{\textnormal{SL}}_{2}(\mathbb{Z}/p^{k}\mathbb{Z})\right\}.

(3.18)
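The order formula in (3.17) can be compared against a brute-force count for small $c$ . The Python sketch below counts determinant-one matrices over $\mathbb{Z}/c\mathbb{Z}$ , counts those congruent to $I$ modulo $d$ , and evaluates the product formula exactly with rational arithmetic; the function names are illustrative assumptions.

```python
from fractions import Fraction
from itertools import product

def sl2_order(c):
    """Brute-force count of 2x2 matrices over Z/cZ with determinant 1."""
    return sum(1 for m, n, p, q in product(range(c), repeat=4) if (m * q - n * p) % c == 1 % c)

def gamma_order(c, d):
    """Brute-force count of determinant-1 matrices over Z/cZ that are congruent to I mod d."""
    return sum(
        1
        for m, n, p, q in product(range(c), repeat=4)
        if (m * q - n * p) % c == 1 % c
        and (m - 1) % d == 0 and n % d == 0 and p % d == 0 and (q - 1) % d == 0
    )

def sl2_order_formula(c):
    """c^3 times the product of (1 - 1/p^2) over primes p dividing c, as an exact fraction."""
    value, remaining, p = Fraction(c) ** 3, c, 2
    while remaining > 1:
        if remaining % p == 0:
            value *= 1 - Fraction(1, p * p)
            while remaining % p == 0:
                remaining //= p
        p += 1
    return value

if __name__ == "__main__":
    for c in range(1, 9):
        print(c, sl2_order(c), sl2_order_formula(c))
    print(gamma_order(6, 2), sl2_order(6) // sl2_order(2))
```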

Now let $p$ be a prime and $k\in\mathbb{Z}_{+}$ , and let us focus on understanding $\widehat{\textnormal{SL}}_{2}(\mathbb{Z}/p^{k}\mathbb{Z})$ .

Definition 3.14 (Primitive representations).

A representation $\rho:\textnormal{SL}_{2}(\mathbb{Z}/p^{k}\mathbb{Z})\to U(V)$ is called primitive iff its kernel does not contain $\Gamma_{p^{k}}(p^{k-1})$ . Equivalently (by the first isomorphism theorem), $\rho$ cannot be factored as $\rho^{\prime}\circ\pi_{p^{k},p^{k-1}}$ for some representation $\rho^{\prime}$ of $\textnormal{SL}_{2}(\mathbb{Z}/p^{k-1}\mathbb{Z})$ . A primitive (resp., non-primitive) character is one induced by a primitive (resp., non-primitive) representation.

Thus the primitive irreducible representations of $\textnormal{SL}_{2}(\mathbb{Z}/p^{k}\mathbb{Z})$ are ‘new’ at level $p^{k}$ , much like primitive Dirichlet characters or newforms in the theory of automorphic representations. We can easily isolate the ‘maximal’ non-primitive component of a representation using the following lemma.

Lemma 3.15.

Let $\rho:\textnormal{SL}_{2}(\mathbb{Z}/p^{k}\mathbb{Z})\to U(V)$ be a representation and

V_{f}:=\{v\in V:\rho(g)v=v,\ \forall g\in\Gamma_{p^{k}}(p^{k-1})\}.

Then $\rho|_{V_{f}}$ is non-primitive, and $\rho|_{V_{f}^{\perp}}$ is isomorphic to a direct sum of primitive irreducible representations.

Proof.

The fact that $V_{f}$ and thus $V_{f}^{\perp}$ are invariant subspaces of $V$ follows quickly from the fact that $\Gamma_{p^{k}}(p^{k-1})\triangleleft\textnormal{SL}_{2}(\mathbb{Z}/p^{k}\mathbb{Z})$ , so $\rho|_{V_{f}}$ and $\rho|_{V_{f}^{\perp}}$ are well-defined. By definition, $\rho|_{V_{f}}(g)=\textnormal{Id}_{V_{f}}$ for all $g\in\Gamma_{p^{k}}(p^{k-1})$ , so the kernel of $\rho|_{V_{f}}$ includes $\Gamma_{p^{k}}(p^{k-1})$ , i.e., $\rho|_{V_{f}}$ is non-primitive.

Now let $\rho|_{V_{0}}$ be any irreducible subrepresentation of $\rho|_{V_{f}^{\perp}}$ , where $V_{0}\subset V_{f}^{\perp}$ . Since $V_{0}\neq\{0\}$ and $V_{0}\cap V_{f}=\{0\}$ , we can find some $v\in V_{0}\setminus V_{f}$ , and thus some $g\in\Gamma_{p^{k}}(p^{k-1})$ such that $\rho(g)v\neq v$ . But then $\rho|_{V_{0}}(g)\neq\textnormal{Id}_{V_{0}}$ , so the kernel of $\rho|_{V_{0}}$ does not contain $\Gamma_{p^{k}}(p^{k-1})$ , i.e., $\rho|_{V_{0}}$ is primitive. ∎

The primitive irreducible representations of $\textnormal{SL}_{2}(\mathbb{Z}/p^{k}\mathbb{Z})$ are fairly complicated, but the following lemma will suffice for our purposes. This generalizes the classical spectral-gap result for $\textnormal{SL}_{2}(\mathbb{Z}/p\mathbb{Z})$ .

Lemma 3.16.

Any primitive irreducible representation $\rho$ of $\textnormal{SL}_{2}(\mathbb{Z}/p^{k}\mathbb{Z})$ has $\dim\rho\gg p^{k}$ .

Proof.

Complete tables with the dimensions of irreducible representations of $\textnormal{SL}_{2}(\mathbb{Z}/p^{k}\mathbb{Z})$ , including the case $p=2$ , were given by Nobs–Wolfart [31, p. 525] (who refer to primitive representations in the sense of our Definition 3.14 as having ‘level $k$ ’). For odd primes $p$ , these had been classified by Shalika [36, §4.3], Tanaka [41], and Kutzko [26].

For a more direct proof of the lower bound via Clifford theory, we refer to Bourgain–Gamburd [5, Lemma 7.1] (this assumes $p$ is odd, but an analogous argument applies if $p=2$ ). To summarize their argument when $k$ is even, Bourgain–Gamburd apply (a variant of) Lemma 3.9 with $G:=\textnormal{SL}_{2}(\mathbb{Z}/p^{k}\mathbb{Z})$ and $N:=\Gamma_{p^{k}}(p^{k/2})$ , to decompose $\rho|_{N}$ into irreducible representations $\sigma_{1},\ldots,\sigma_{L}\in\widehat{N}$ , all lying in the same orbit under $G$ -conjugation. But $N$ is abelian, so $\widehat{N}\cong N$ , and $\sigma_{1},\ldots,\sigma_{L}$ correspond to $G$ -conjugate elements $g_{1},\ldots,g_{L}\in N=\Gamma_{p^{k}}(p^{k/2})$ . Moreover, the primitivity condition that $\ker\rho$ does not contain $\Gamma_{p^{k}}(p^{k-1})$ implies that $g_{1},\ldots,g_{L}\not\in\Gamma_{p^{k}}(p^{(k/2)+1})$ . It follows that

\dim\rho\geq L=\frac{|G|}{|C_{G}(g_{1})|}\gg\frac{p^{3k}}{|C_{G}(g_{1})|},

where $C_{G}(g_{1})$ is the centralizer of $g_{1}$ in $G$ . It thus remains to bound $|C_{G}(g)|\ll p^{2k}$ for $g\in\Gamma_{p^{k}}(p^{k/2})\setminus\Gamma_{p^{k}}(p^{(k/2)+1})$ , which follows from an explicit matrix computation [5, Claim 7.1] (minor modifications are needed here if $p=2$ , but these only incur a constant-factor loss). ∎

4. Representations and Kloosterman matrices

Here we connect matrices of Kloosterman sums modulo $c$ to Fourier analysis on $\textnormal{SL}_{2}(\mathbb{Z}/c\mathbb{Z})$ .

4.1. The relevant representations

When digesting the notation below, the reader should keep in mind the informal outline from Section 2.2. We will first define the simpler representations $(\rho_{c},V_{c})$ which are connected to matrices of Kloosterman sums $S(m,n;c)$ , and then the more relevant subrepresentations $(\rho^{\circ}_{c},V_{c}^{\circ})$ which correspond to adding the restriction $(m,n,c)=1$ . In fact, the subspace $V^{\circ}_{c}\subset V_{c}$ will be constructed by sifting out ‘old’ subspaces isomorphic to $V_{d}$ for $d\mid c$ .

Definition 4.1 (Permutation representations of the projective action).

For $c\in\mathbb{Z}_{+}$ , we denote the permutation representation of $\textnormal{SL}_{2}(\mathbb{Z}/c\mathbb{Z})$ induced by the action (3.14) on $\mathbb{P}^{1}(\mathbb{Z}/c\mathbb{Z})$ by

\rho_{c}:\textnormal{SL}_{2}(\mathbb{Z}/c\mathbb{Z})\to U(V_{c}),\qquad\quad V_{c}:=\mathbb{C}^{\mathbb{P}^{1}(\mathbb{Z}/c\mathbb{Z})},

(4.1)

and its character by $\chi_{c}:=\textnormal{Tr}\rho_{c}$ . Hence $V_{c}$ is the space of functions $f:\mathbb{P}^{1}(\mathbb{Z}/c\mathbb{Z})\to\mathbb{C}$ , equipped with the standard inner product, and $(\rho_{c}(g)f)(u)=f(g^{-1}u)$ for $g\in\textnormal{SL}_{2}(\mathbb{Z}/c\mathbb{Z})$ , $u\in\mathbb{P}^{1}(\mathbb{Z}/c\mathbb{Z})$ . In particular, for any $u\in\mathbb{P}^{1}(\mathbb{Z}/c\mathbb{Z})$ , one has $\rho_{c}(g)\mathbbm{1}_{u}=\mathbbm{1}_{gu}$ .
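Since $\rho_{c}$ is a permutation representation, $\chi_{c}(g)$ counts the points of $\mathbb{P}^{1}(\mathbb{Z}/c\mathbb{Z})$ fixed by $g$ . The Python sketch below computes this character by brute force and averages $|\chi_{c}(g)|^{2}$ over the group, which by (3.9) equals the sum of squared multiplicities of the irreducible constituents of $\rho_{c}$ . The normalization of projective points and all function names are choices made here for illustration.

```python
from fractions import Fraction
from itertools import product
from math import gcd

def normalize(x, y, c):
    """Canonical representative of [x : y] in P^1(Z/cZ): the least unit multiple of (x, y)."""
    return min(((a * x) % c, (a * y) % c) for a in range(1, c + 1) if gcd(a, c) == 1)

def projective_line(c):
    """All points of P^1(Z/cZ), as a sorted list of canonical pairs."""
    return sorted({normalize(x, y, c) for x, y in product(range(c), repeat=2) if gcd(gcd(x, y), c) == 1})

def sl2(c):
    """All matrices (m, n; p, q) over Z/cZ with determinant 1, as tuples (m, n, p, q)."""
    return [g for g in product(range(c), repeat=4) if (g[0] * g[3] - g[1] * g[2]) % c == 1 % c]

def chi(g, c, points):
    """Character of the permutation representation: the number of points fixed by g."""
    m, n, p, q = g
    return sum(1 for (x, y) in points if normalize(m * x + n * y, p * x + q * y, c) == (x, y))

def mean_square_character(c):
    """Average of chi_c(g)^2 over SL_2(Z/cZ), as an exact fraction."""
    points, group = projective_line(c), sl2(c)
    return Fraction(sum(chi(g, c, points) ** 2 for g in group), len(group))

if __name__ == "__main__":
    for c in (2, 3, 5):
        print(c, mean_square_character(c))
```

For the small primes tested here the average comes out as $2$ , consistent with $\rho_{c}$ splitting into two non-isomorphic irreducible pieces when $c$ is prime.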

Definition 4.2 (Invariant subspaces).

For $c,d\in\mathbb{Z}_{+}$ with $d\mid c$ , define

V_{c}(d):=\left\{f\in V_{c}:\rho_{c}(n)f=f\quad\forall n\in\Gamma_{c}(d)\right\}\subset V_{c}.

In particular, $V_{c}(c)=V_{c}$ . Thus $V_{c}(d)$ is the space of complex-valued functions on $\mathbb{P}^{1}(\mathbb{Z}/c\mathbb{Z})$ which are constant on orbits of $\Gamma_{c}(d)$ , so by Lemma˜3.13,

V_{c}(d)\cong\mathbb{C}^{\Gamma_{c}(d)\backslash\mathbb{P}^{1}(\mathbb{Z}/c\mathbb{Z})}\cong\mathbb{C}^{\mathbb{P}^{1}(\mathbb{Z}/d\mathbb{Z})}=V_{d}.

(4.2)

Lemma 4.3.

For $c,d\in\mathbb{Z}_{+}$ with $d\mid c$ , $V_{c}(d)$ is an invariant subspace of $\rho_{c}$ . In fact, using Notation˜3.12, we have

\rho_{c}|_{V_{c}(d)}\cong\rho_{d}\circ\pi_{c,d}.

Proof.

The fact that $V_{c}(d)$ is an invariant subspace follows immediately from the normality of $\Gamma_{c}(d)$ . Now let $\Phi:V_{d}\to V_{c}(d)$ be the invertible linear map from ˜4.2, which relies on the bijection from ˜3.15. Then for any $g\in\textnormal{SL}_{2}(\mathbb{Z}/c\mathbb{Z})$ , one can easily check that $\rho_{c}(g)|_{V_{c}(d)}\circ\Phi=\Phi\circ\rho_{d}(\pi_{c,d}(g))$ : both maps take the basis vector $\mathbbm{1}_{u}\in V_{d}=\mathbb{C}^{\mathbb{P}^{1}(\mathbb{Z}/d\mathbb{Z})}$ to the $L^{2}$ -normalized function in $V_{c}(d)$ which is only nonzero on the orbit $g\Gamma_{c}(d)\cdot u=\Gamma_{c}(d)\cdot gu$ . ∎

In light of Lemma˜4.3, we will need to remove the contribution of ‘old’ representations $(\rho_{d},V_{d})$ to $(\rho_{c},V_{c})$ . To this end, it will be helpful to adopt the following convention for tensor products.

Notation 4.4 (Ordered tensor products).

If $c,c_{1},c_{2}\in\mathbb{Z}_{+}$ are such that $c=c_{1}c_{2}$ and $(c_{1},c_{2})=1$ , then the Chinese Remainder Theorem gives $\mathbb{P}^{1}(\mathbb{Z}/c\mathbb{Z})\cong\mathbb{P}^{1}(\mathbb{Z}/c_{1}\mathbb{Z})\times\mathbb{P}^{1}(\mathbb{Z}/c_{2}\mathbb{Z})$ by $u\mapsto(u\ \textnormal{mod }c_{1},u\ \textnormal{mod }c_{2})$ , so $V_{c}\cong V_{c_{1}}\otimes V_{c_{2}}$ . Since tensor products of vector spaces are defined up to isomorphism, it is not a great abuse of notation to write

V_{c}=V_{c_{1}}\otimes V_{c_{2}}.

In particular, given $f_{1}\in V_{c_{1}}$ and $f_{2}\in V_{c_{2}}$ , we view $f_{1}\otimes f_{2}$ as a function on $\mathbb{P}^{1}(\mathbb{Z}/c\mathbb{Z})$ with values $(f_{1}\otimes f_{2})(u)=f_{1}(u\ \textnormal{mod }c_{1})\cdot f_{2}(u\ \textnormal{mod }c_{2})$ . This notation extends to tensor products of subspaces $W_{1}\subset V_{c_{1}}$ , $W_{2}\subset V_{c_{2}}$ (so $W_{1}\otimes W_{2}\subset V_{c}$ ), and of linear transformations $T_{1}:V_{c_{1}}\to V_{c_{1}}$ , $T_{2}:V_{c_{2}}\to V_{c_{2}}$ .
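The identification in Notation 4.4 can be checked by brute force for small coprime moduli. The sketch below, for $c_{1}=4$ and $c_{2}=9$, verifies that reduction modulo $c_{1}$ and $c_{2}$ is a bijection and forms $f_{1}\otimes f_{2}$ as a function on $\mathbb{P}^{1}(\mathbb{Z}/c\mathbb{Z})$. As before, the model of $\mathbb{P}^{1}(\mathbb{Z}/c\mathbb{Z})$ and all helper names (`reduce_point`, `tensor`, and so on) are assumptions made for illustration.

```python
from math import gcd

def units(c):
    return [t for t in range(1, c + 1) if gcd(t, c) == 1]

def normalize(x, y, c):
    """Canonical representative of [x:y] modulo units (assumed model of P^1)."""
    return min(((t * x) % c, (t * y) % c) for t in units(c))

def proj_line(c):
    return sorted({normalize(x, y, c)
                   for x in range(c) for y in range(c)
                   if gcd(gcd(x, y), c) == 1})

def reduce_point(u, d):
    """Image of a point of P^1(Z/cZ) in P^1(Z/dZ), for d dividing c."""
    x, y = u
    return normalize(x % d, y % d, d)

c1, c2 = 4, 9
c = c1 * c2

# The map u -> (u mod c1, u mod c2) from the Chinese Remainder Theorem.
pairs = {u: (reduce_point(u, c1), reduce_point(u, c2)) for u in proj_line(c)}
is_bijection = (len(set(pairs.values())) == len(pairs)
                == len(proj_line(c1)) * len(proj_line(c2)))

def tensor(f1, f2):
    """(f1 (x) f2)(u) = f1(u mod c1) * f2(u mod c2)."""
    return {u: f1[a] * f2[b] for u, (a, b) in pairs.items()}

a0, b0 = proj_line(c1)[0], proj_line(c2)[0]
f1 = {a: int(a == a0) for a in proj_line(c1)}
f2 = {b: int(b == b0) for b in proj_line(c2)}
product_indicator = tensor(f1, f2)  # indicator of the unique point above (a0, b0)
print(is_bijection, len(pairs))
```

In particular, a tensor product of two basis vectors is again a basis vector, which is the mechanism behind the equality $\rho_{c_{1}}\boxtimes\rho_{c_{2}}=\rho_{c}$.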

We note that with the conventions from Notations˜3.7 and 4.4, for $c=c_{1}c_{2}$ with $(c_{1},c_{2})=1$ , the product $\rho_{c_{1}}\boxtimes\rho_{c_{2}}:\textnormal{SL}_{2}(\mathbb{Z}/c\mathbb{Z})\to U(V_{c_{1}}\otimes V_{c_{2}})=U(V_{c})$ is precisely the permutation representation $\rho_{c}$ (this is a genuine equality, not just an isomorphism). Moreover, a tensor product of invariant subspaces of $\rho_{c_{1}}$ and $\rho_{c_{2}}$ gives an invariant subspace of $\rho_{c}$ , and in fact $V_{c_{1}}(d_{1})\otimes V_{c_{2}}(d_{2})=V_{c}(d)$ for $d_{1}\mid c_{1}$ , $d_{2}\mid c_{2}$ , and $d=d_{1}d_{2}$ . Iterating this yields the factorizations

V_{c}=\bigotimes_{p^{k}\|c}V_{p^{k}},\qquad\qquad\rho_{c}=\mathop{\mathchoice{\vbox{\hbox{\scalebox{2.0}{$\displaystyle\boxtimes$}}}}{\vbox{\hbox{\scalebox{2.0}{$\displaystyle\boxtimes$}}}}{\vbox{\hbox{\scalebox{2.0}{$\displaystyle\boxtimes$}}}}{\vbox{\hbox{\scalebox{2.0}{$\displaystyle\boxtimes$}}}}}_{p^{k}\|c}\rho_{p^{k}}.

(4.3)

and, more generally, for $d\mid c$ ,

V_{c}(d)=\bigotimes_{\begin{subarray}{c}p^{k}\|c\\ p^{j}\|d\end{subarray}}V_{p^{k}}(p^{j}),\qquad\qquad\rho_{c}|_{V_{c}(d)}=\mathop{\mathchoice{\vbox{\hbox{\scalebox{2.0}{$\displaystyle\boxtimes$}}}}{\vbox{\hbox{\scalebox{2.0}{$\displaystyle\boxtimes$}}}}{\vbox{\hbox{\scalebox{2.0}{$\displaystyle\boxtimes$}}}}{\vbox{\hbox{\scalebox{2.0}{$\displaystyle\boxtimes$}}}}}_{\begin{subarray}{c}p^{k}\|c\\ p^{j}\|d\end{subarray}}\rho_{p^{k}}|_{V_{p^{k}}(p^{j})}.

(4.4)

Finally, we can define the representations $(\rho_{c}^{\circ},V_{c}^{\circ})$ .

Definition 4.5 (Sifted representations).

For a prime power $p^{k}$ , we let $V_{p^{k}}^{\circ}:=V_{p^{k}}(p^{k-1})^{\perp}\subset V_{p^{k}}$ be the orthogonal complement of $V_{p^{k}}(p^{k-1})$ inside $V_{p^{k}}$ (which is an invariant subspace of $\rho_{p^{k}}$ ). For $c\in\mathbb{Z}_{+}$ , we define

V_{c}^{\circ}:=\bigotimes_{p^{k}\|c}V_{p^{k}}^{\circ},\qquad\qquad\rho_{c}^{\circ}:=\rho_{c}|_{V_{c}^{\circ}},\qquad\qquad\chi_{c}^{\circ}:=\textnormal{Tr}\rho_{c}^{\circ}.

Proposition 4.6 (Decomposition of sifted representations).

For any $c\in\mathbb{Z}_{+}$ , one has

\rho_{c}^{\circ}=\mathop{\mathchoice{\vbox{\hbox{\scalebox{2.0}{$\displaystyle\boxtimes$}}}}{\vbox{\hbox{\scalebox{2.0}{$\displaystyle\boxtimes$}}}}{\vbox{\hbox{\scalebox{2.0}{$\displaystyle\boxtimes$}}}}{\vbox{\hbox{\scalebox{2.0}{$\displaystyle\boxtimes$}}}}}_{p^{k}\|c}\rho_{p^{k}}^{\circ}.

(4.5)

Moreover, each $\rho_{p^{k}}^{\circ}$ is isomorphic to a nonempty direct sum of primitive irreducible representations, and $\rho_{c}^{\circ}$ is isomorphic to a direct sum of $c^{o(1)}$ irreducible representations of dimensions $c^{1-o(1)}$ .

Proof.

The factorization in ˜4.5 follows immediately from ˜4.3 and Definition˜4.5. The fact that $\rho_{p^{k}}^{\circ}=\rho_{p^{k}}|_{V_{p^{k}}^{\circ}}$ is isomorphic to a direct sum of primitive irreducible representations is precisely the content of Lemma˜3.15, wherein $V_{f}=V_{p^{k}}(p^{k-1})$ and $V_{f}^{\perp}=V_{p^{k}}^{\circ}$ . One can easily construct a function in $V_{p^{k}}$ which is not constant on orbits of $\Gamma_{p^{k}}(p^{k-1})$ (e.g., the indicator $\mathbbm{1}_{u}$ of a single point), so $V_{p^{k}}(p^{k-1})\neq V_{p^{k}}$ , hence $V_{p^{k}}^{\circ}\neq\{0\}$ , and thus $\rho_{p^{k}}^{\circ}\neq 0$ .

Now write each $\rho_{p^{k}}^{\circ}$ as a direct sum of primitive irreducible representations $\rho_{p,k}$ of $\textnormal{SL}_{2}(\mathbb{Z}/p^{k}\mathbb{Z})$ (up to isomorphism), and expand the tensor product in ˜4.5. This expresses $\rho_{c}^{\circ}$ as a direct sum of representations (potentially with repetitions) of the shape

\rho=\mathop{\mathchoice{\vbox{\hbox{\scalebox{2.0}{$\displaystyle\boxtimes$}}}}{\vbox{\hbox{\scalebox{2.0}{$\displaystyle\boxtimes$}}}}{\vbox{\hbox{\scalebox{2.0}{$\displaystyle\boxtimes$}}}}{\vbox{\hbox{\scalebox{2.0}{$\displaystyle\boxtimes$}}}}}_{p^{k}\|c}\rho_{p,k},

which are irreducible, and have dimensions $\gg c^{1-o(1)}$ by Lemma˜3.16 and the divisor bound. Since $\dim\rho_{c}^{\circ}\leq\dim\rho_{c}=|\mathbb{P}^{1}(\mathbb{Z}/c\mathbb{Z})|=c^{1+o(1)}$ , the number of these representations is at most $c^{o(1)}$ . ∎

We now briefly analyze the orthogonal projections onto invariant subspaces of $V_{c}$ . It will turn out that the projection onto $V_{c}^{\circ}$ can be obtained by a Möbius-inversion-type process.

Definition 4.7 (Special projections).

For $c,d\in\mathbb{Z}_{+}$ with $d\mid c$ , we let $P_{c}(d),P_{c}^{\circ}:V_{c}\to V_{c}$ be the orthogonal projections onto $V_{c}(d)$ , respectively $V_{c}^{\circ}$ . In particular, $P_{c}(c)$ is the identity map on $V_{c}$ .

Lemma 4.8.

For $d\mid c$ , one has

P_{c}(d)=\bigotimes_{\begin{subarray}{c}p^{k}\|c\\ p^{j}\|d\end{subarray}}P_{p^{k}}(p^{j})=\frac{1}{|\Gamma_{c}(d)|}\sum_{n\in\Gamma_{c}(d)}\rho_{c}(n).

(4.6)

In particular, $P_{c}(d)$ commutes with $\rho_{c}(g)$ for any $g\in\textnormal{SL}_{2}(\mathbb{Z}/c\mathbb{Z})$ . The matrix representation of this map with respect to the standard basis of $V_{c}=\mathbb{C}^{\mathbb{P}^{1}(\mathbb{Z}/c\mathbb{Z})}$ has entries

P_{c}(d)_{u,v}=\frac{d^{2}\phi(c)}{c^{2}\phi(d)}\mathbbm{1}_{u\in\Gamma_{c}(d)\cdot v},\qquad\qquad u,v\in\mathbb{P}^{1}(\mathbb{Z}/c\mathbb{Z}).

(4.7)

The proof of Lemma˜4.8 is left to Appendix˜A.

Lemma 4.9.

For $c\in\mathbb{Z}_{+}$ , one has

P_{c}^{\circ}=\bigotimes_{p^{k}\|c}P_{p^{k}}^{\circ}=\sum_{d\mid c}\mu\left(\frac{c}{d}\right)P_{c}(d).

Proof.

The factorization as a tensor product follows immediately from Definitions˜4.5 and 4.7. Now for a prime power $p^{k}$ , recall that $P_{p^{k}}(p^{k})$ is the identity map on $V_{p^{k}}$ and $P_{p^{k}}(p^{k-1})$ is the orthogonal projection onto $V_{p^{k}}(p^{k-1})$ , so the orthogonal projection onto $V_{p^{k}}^{\circ}=V_{p^{k}}(p^{k-1})^{\perp}$ can be written as

P_{p^{k}}^{\circ}=P_{p^{k}}(p^{k})-P_{p^{k}}(p^{k-1}).

It follows from this and ˜4.6 that

\begin{aligned}
\bigotimes_{p^{k}\|c}P_{p^{k}}^{\circ}&=\bigotimes_{p^{k}\|c}\left(P_{p^{k}}(p^{k})-P_{p^{k}}(p^{k-1})\right)\\
&=\sum_{d\mid c}\mu\left(\frac{c}{d}\right)\bigotimes_{\begin{subarray}{c}p^{k}\|c\\ p^{j}\|d\end{subarray}}P_{p^{k}}(p^{j})=\sum_{d\mid c}\mu\left(\frac{c}{d}\right)P_{c}(d),
\end{aligned}

as claimed. ∎
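The Möbius-inversion formula of Lemma 4.9 can be sanity-checked numerically. The sketch below takes $c=12$ and builds each $P_{c}(d)$ as the averaging operator over fibres of the reduction map $\mathbb{P}^{1}(\mathbb{Z}/c\mathbb{Z})\to\mathbb{P}^{1}(\mathbb{Z}/d\mathbb{Z})$; identifying these fibres with the $\Gamma_{c}(d)$-orbits is an assumption based on the bijection used in ˜4.2. It then verifies in exact arithmetic that $\sum_{d\mid c}\mu(c/d)P_{c}(d)$ is idempotent, with trace $(6-3)(4-1)=9=\dim V_{4}^{\circ}\cdot\dim V_{3}^{\circ}$. All function names are hypothetical.

```python
from fractions import Fraction
from math import gcd

def units(c):
    return [t for t in range(1, c + 1) if gcd(t, c) == 1]

def normalize(x, y, c):
    """Canonical representative of [x:y] modulo units (assumed model of P^1)."""
    return min(((t * x) % c, (t * y) % c) for t in units(c))

def proj_line(c):
    return sorted({normalize(x, y, c)
                   for x in range(c) for y in range(c)
                   if gcd(gcd(x, y), c) == 1})

def mobius(n):
    result, p = 1, 2
    while p * p <= n:
        if n % p == 0:
            n //= p
            if n % p == 0:
                return 0          # n has a square factor
            result = -result
        p += 1
    return -result if n > 1 else result

def projection(d, points):
    """Orthogonal projection onto functions constant on fibres of reduction mod d."""
    keys = [normalize(x % d, y % d, d) for (x, y) in points]
    size = {}
    for key in keys:
        size[key] = size.get(key, 0) + 1
    return [[Fraction(1, size[ki]) if ki == kj else Fraction(0) for kj in keys]
            for ki in keys]

def matmul(A, B):
    n = len(A)
    return [[sum(A[i][k] * B[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

c = 12
points = proj_line(c)
n = len(points)
Q = [[Fraction(0)] * n for _ in range(n)]     # candidate for P_c^circ
for d in range(1, c + 1):
    if c % d == 0 and mobius(c // d) != 0:
        P = projection(d, points)
        for i in range(n):
            for j in range(n):
                Q[i][j] += mobius(c // d) * P[i][j]

is_projection = matmul(Q, Q) == Q
trace = sum(Q[i][i] for i in range(n))
print(is_projection, trace)
```

Only the divisors $d\in\{2,4,6,12\}$ contribute here, since $\mu(c/d)=0$ for $d\in\{1,3\}$.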

4.2. The Kloosterman matrix

Here we finally relate the abstract discussion in the preceding subsections to the classical Kloosterman sums.

Proposition 4.10 (From Kloosterman matrices to Fourier coefficients).

Let $c\in\mathbb{Z}_{+}$ , $\psi_{1},\psi_{2}:\mathbb{Z}/c\mathbb{Z}\to\mathbb{C}$ , and $K_{c}^{\psi_{1},\psi_{2}}\in\mathbb{C}^{\mathbb{Z}/c\mathbb{Z}\times\mathbb{Z}/c\mathbb{Z}}$ be the $c\times c$ complex matrix with entries

(K_{c}^{\psi_{1},\psi_{2}})_{m,n}:=\psi_{1}(m)\psi_{2}(n)\mathbbm{1}_{(m,n,c)=1}S(m,n;c).

(4.8)

Consider the function $F_{c}^{\psi_{1},\psi_{2}}:\textnormal{SL}_{2}(\mathbb{Z}/c\mathbb{Z})\to\mathbb{C}$ given by

F_{c}^{\psi_{1},\psi_{2}}:=\frac{1}{c^{2}}\sum_{h_{1},h_{2}\in\mathbb{Z}/c\mathbb{Z}}\widehat{\psi}_{1}(h_{1})\widehat{\psi}_{2}(h_{2})\,\mathbbm{1}_{T^{h_{1}}ST^{h_{2}}},

(4.9)

where $T$ and $S$ are as in ˜3.13. Then one has the inequality of operator norms

\|K_{c}^{\psi_{1},\psi_{2}}\|\leq c\|\widehat{F}_{c}^{\psi_{1},\psi_{2}}(\rho_{c}^{\circ})\|.

Remark.

In $\widehat{\psi}_{1}$ and $\widehat{\psi}_{2}$ , the Fourier transform is taken over $\mathbb{Z}/c\mathbb{Z}$ , as in ˜3.11. In $\widehat{F}^{\psi_{1},\psi_{2}}_{c}$ , the Fourier transform is taken over the non-abelian group $\textnormal{SL}_{2}(\mathbb{Z}/c\mathbb{Z})$ , as in ˜3.10.

Proof of Proposition˜4.10.

Let $U_{c}$ be the unitary $c\times c$ matrix with entries $(U_{c})_{u,v}=c^{-1/2}e(\tfrac{uv}{c})$ . By expanding the Kloosterman sums, we have

(U_{c}^{*}K_{c}^{\psi_{1},\psi_{2}}U_{c})_{u,v}=\frac{1}{c}\sum_{\begin{subarray}{c}m,n\in\mathbb{Z}/c\mathbb{Z}\\ x\in(\mathbb{Z}/c\mathbb{Z})^{\times}\end{subarray}}\mathbbm{1}_{(m,n,c)=1}\psi_{1}(m)\,e\left(\frac{m(x-u)}{c}\right)\psi_{2}(n)\,e\left(\frac{n(\overline{x}+v)}{c}\right),

for any $u,v\in\mathbb{Z}/c\mathbb{Z}$ . We then expand the indicator $\mathbbm{1}_{(m,n,c)=1}$ by Möbius inversion and Fourier analysis,

\mathbbm{1}_{(m,n,c)=1}=\sum_{d\mid c}\mu(d)\mathbbm{1}_{d\mid m}\mathbbm{1}_{d\mid n}=\sum_{d\mid c}\frac{\mu(d)}{d^{2}}\sum_{a,b\in\mathbb{Z}/d\mathbb{Z}}e\left(\frac{am}{d}\right)e\left(\frac{bn}{d}\right),

and evaluate the sums over $m,n$ to obtain

\begin{aligned}
(U_{c}^{*}K_{c}^{\psi_{1},\psi_{2}}U_{c})_{u,v}&=\frac{1}{c}\sum_{d\mid c}\frac{\mu(d)}{d^{2}}\sum_{x\in(\mathbb{Z}/c\mathbb{Z})^{\times}}\sum_{a,b\in\mathbb{Z}/d\mathbb{Z}}\widehat{\psi}_{1}\left(-x+u-\frac{ac}{d}\right)\widehat{\psi}_{2}\left(-\overline{x}-v-\frac{bc}{d}\right)\\
&=\frac{1}{c}\sum_{d\mid c}\frac{\mu(d)}{d^{2}}\sum_{x\in(\mathbb{Z}/c\mathbb{Z})^{\times}}\sum_{h_{1},h_{2}\in\mathbb{Z}/c\mathbb{Z}}\widehat{\psi}_{1}(h_{1})\widehat{\psi}_{2}(h_{2})\mathbbm{1}_{\begin{subarray}{c}x\equiv u-h_{1}\ (\textnormal{mod }\frac{c}{d})\\ -\overline{x}\equiv v+h_{2}\ (\textnormal{mod }\frac{c}{d})\end{subarray}},
\end{aligned}

where we substituted $h_{1}:=-x+u-\tfrac{ac}{d}$ , $h_{2}:=-\overline{x}-v-\tfrac{bc}{d}$ . Switching divisors $d\mapsto\tfrac{c}{d}$ , swapping sums, and evaluating the sum over $x$ (which gives either $0$ or $\phi(c)/\phi(d)$ solutions), we reach

(U_{c}^{*}K_{c}^{\psi_{1},\psi_{2}}U_{c})_{u,v}=\frac{1}{c}\sum_{h_{1},h_{2}\in\mathbb{Z}/c\mathbb{Z}}\widehat{\psi}_{1}(h_{1})\widehat{\psi}_{2}(h_{2})\sum_{d\mid c}\mu\left(\frac{c}{d}\right)\frac{d^{2}\phi(c)}{c^{2}\phi(d)}\mathbbm{1}_{(u-h_{1})(v+h_{2})\equiv-1\ (\textnormal{mod }d)}.

(4.10)

Let us keep this in mind. Separately, by ˜4.9 and 4.5, we have

\begin{aligned}
\widehat{F}_{c}^{\psi_{1},\psi_{2}}(\rho_{c}^{\circ})&=\frac{1}{c^{2}}\sum_{h_{1},h_{2}\in\mathbb{Z}/c\mathbb{Z}}\widehat{\psi}_{1}(h_{1})\widehat{\psi}_{2}(h_{2})\rho_{c}^{\circ}(T^{h_{1}}ST^{h_{2}})\\
&=\Bigg(\frac{1}{c^{2}}\sum_{h_{1},h_{2}\in\mathbb{Z}/c\mathbb{Z}}\widehat{\psi}_{1}(h_{1})\widehat{\psi}_{2}(h_{2})\rho_{c}(T^{h_{1}}ST^{h_{2}})\Bigg)\Bigg|_{V_{c}^{\circ}},
\end{aligned}

and thus by Lemma˜3.1,

\|\widehat{F}_{c}^{\psi_{1},\psi_{2}}(\rho_{c}^{\circ})\|=\|M_{c}^{\psi_{1},\psi_{2}}\|,

(4.11)

where $M_{c}^{\psi_{1},\psi_{2}}:V_{c}\to V_{c}$ is the map

M_{c}^{\psi_{1},\psi_{2}}=\frac{1}{c^{2}}\sum_{h_{1},h_{2}\in\mathbb{Z}/c\mathbb{Z}}\widehat{\psi}_{1}(h_{1})\widehat{\psi}_{2}(h_{2})\rho_{c}(T^{h_{1}}ST^{h_{2}})P_{c}^{\circ}.

By Lemma˜4.9 and the commutativity claim in Lemma˜4.8, we can further write

\begin{aligned}
M_{c}^{\psi_{1},\psi_{2}}&=\frac{1}{c^{2}}\sum_{h_{1},h_{2}\in\mathbb{Z}/c\mathbb{Z}}\widehat{\psi}_{1}(h_{1})\widehat{\psi}_{2}(h_{2})\rho_{c}(T^{h_{1}}ST^{h_{2}})\sum_{d\mid c}\mu\left(\frac{c}{d}\right)P_{c}(d)\\
&=\frac{1}{c^{2}}\sum_{h_{1},h_{2}\in\mathbb{Z}/c\mathbb{Z}}\widehat{\psi}_{1}(h_{1})\widehat{\psi}_{2}(h_{2})\sum_{d\mid c}\mu\left(\frac{c}{d}\right)\rho_{c}(T^{h_{1}})P_{c}(d)\rho_{c}(ST^{h_{2}}).
\end{aligned}

By Definitions˜4.1 and 4.7, we can represent this map as a matrix in $\mathbb{C}^{\mathbb{P}^{1}(\mathbb{Z}/c\mathbb{Z})\times\mathbb{P}^{1}(\mathbb{Z}/c\mathbb{Z})}$ with entries

(M_{c}^{\psi_{1},\psi_{2}})_{u,v}=\frac{1}{c^{2}}\sum_{h_{1},h_{2}\in\mathbb{Z}/c\mathbb{Z}}\widehat{\psi}_{1}(h_{1})\widehat{\psi}_{2}(h_{2})\sum_{d\mid c}\mu\left(\frac{c}{d}\right)\frac{d^{2}\phi(c)}{c^{2}\phi(d)}\mathbbm{1}_{T^{-h_{1}}u\in\Gamma_{c}(d)\cdot ST^{h_{2}}v}

(4.12)

for $u,v\in\mathbb{P}^{1}(\mathbb{Z}/c\mathbb{Z})$ ; compare this to ˜4.10. We will show that restricting the matrix $M_{c}^{\psi_{1},\psi_{2}}$ to those rows and columns indexed by $u,v\in\mathbb{Z}/c\mathbb{Z}\subset\mathbb{P}^{1}(\mathbb{Z}/c\mathbb{Z})$ (by the canonical embedding $x\mapsto[x:1]$ ) yields precisely the matrix $U_{c}^{*}K_{c}^{\psi_{1},\psi_{2}}U_{c}$ . Indeed, using the notation above, if $u,v\in\mathbb{Z}/c\mathbb{Z}$ , then $T^{-h_{1}}u=u-h_{1}=:x\in\mathbb{Z}/c\mathbb{Z}$ , $T^{h_{2}}v=v+h_{2}=:y\in\mathbb{Z}/c\mathbb{Z}$ , and we have $x\in\Gamma_{c}(d)\cdot Sy$ if and only if the equation

(I+dA)\begin{pmatrix}x\\ 1\end{pmatrix}=\alpha\begin{pmatrix}-1\\ y\end{pmatrix}

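has solutions in $I+dA\in\Gamma_{c}(d)$ and $\alpha\in(\mathbb{Z}/c\mathbb{Z})^{\times}$ . On the one hand, the existence of such solutions implies that $\left(\begin{smallmatrix}x\\ 1\end{smallmatrix}\right)\equiv\left(\begin{smallmatrix}-\alpha\\ \alpha y\end{smallmatrix}\right)\ (\textnormal{mod }d)$ , so $xy\equiv-1\ (\textnormal{mod }d)$ . On the other hand, if $xy\equiv-1\ (\textnormal{mod }d)$ , then $x$ is a unit modulo $d$ and $\left(\begin{smallmatrix}x\\ 1\end{smallmatrix}\right)\equiv-x\left(\begin{smallmatrix}-1\\ y\end{smallmatrix}\right)\ (\textnormal{mod }d)$ , so $[x:1]$ and $[-1:y]$ have the same image in $\mathbb{P}^{1}(\mathbb{Z}/d\mathbb{Z})$ , and hence lie in the same $\Gamma_{c}(d)$ -orbit by the bijection from ˜3.15. It follows that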

\mathbbm{1}_{T^{-h_{1}}u\in\Gamma_{c}(d)\cdot ST^{h_{2}}v}=\mathbbm{1}_{(u-h_{1})(v+h_{2})\equiv-1\ (\textnormal{mod }d)},\qquad\qquad u,v\in\mathbb{Z}/c\mathbb{Z}\subset\mathbb{P}^{1}(\mathbb{Z}/c\mathbb{Z}),

and then by comparing ˜4.10 and 4.12, we find that

(U_{c}^{*}K_{c}^{\psi_{1},\psi_{2}}U_{c})_{u,v}=c(M_{c}^{\psi_{1},\psi_{2}})_{u,v}\qquad\qquad u,v\in\mathbb{Z}/c\mathbb{Z}\subset\mathbb{P}^{1}(\mathbb{Z}/c\mathbb{Z}).

Since removing some rows and columns of a matrix can only decrease its spectral norm, we conclude that

\|K_{c}^{\psi_{1},\psi_{2}}\|=\|U_{c}^{*}K_{c}^{\psi_{1},\psi_{2}}U_{c}\|\leq c\|M_{c}^{\psi_{1},\psi_{2}}\|,

which, together with ˜4.11, completes our proof. ∎
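For readers who wish to test the key identity ˜4.10 numerically, here is a minimal Python sketch in the special case $\psi_{1}=\psi_{2}=1$, where $\widehat{\psi}_{i}(h)=c\,\mathbbm{1}_{h=0}$ and ˜4.10 reduces to $(U_{c}^{*}K_{c}U_{c})_{u,v}=\tfrac{1}{c}\sum_{d\mid c}\mu(\tfrac{c}{d})\tfrac{d^{2}\phi(c)}{\phi(d)}\mathbbm{1}_{uv\equiv-1\ (\textnormal{mod }d)}$. The modulus $c=12$ and the function names are choices made for this illustration only; the modular inverse uses `pow(x, -1, c)`, which requires Python 3.8 or later.

```python
import cmath
from math import gcd

def e(t):
    return cmath.exp(2j * cmath.pi * t)

def kloosterman(m, n, c):
    """S(m, n; c), summed over units x modulo c (for c >= 2)."""
    return sum(e((m * x + n * pow(x, -1, c)) / c)
               for x in range(1, c) if gcd(x, c) == 1)

def phi(n):
    return sum(1 for t in range(1, n + 1) if gcd(t, n) == 1)

def mobius(n):
    result, p = 1, 2
    while p * p <= n:
        if n % p == 0:
            n //= p
            if n % p == 0:
                return 0
            result = -result
        p += 1
    return -result if n > 1 else result

c = 12
# The matrix from (4.8) with psi_1 = psi_2 = 1.
K = [[kloosterman(m, n, c) if gcd(gcd(m, n), c) == 1 else 0
      for n in range(c)] for m in range(c)]

def conjugated_entry(u, v):
    """Entry (u, v) of U_c^* K U_c, where (U_c)_{u,v} = c^{-1/2} e(uv/c)."""
    return sum(e(-m * u / c) * K[m][n] * e(n * v / c)
               for m in range(c) for n in range(c)) / c

def predicted_entry(u, v):
    """Right-hand side of (4.10) when psi_1 = psi_2 = 1."""
    total = 0
    for d in range(1, c + 1):
        if c % d == 0 and (u * v + 1) % d == 0:
            total += mobius(c // d) * d * d * (phi(c) // phi(d))
    return total / c

max_error = max(abs(conjugated_entry(u, v) - predicted_entry(u, v))
                for u in range(c) for v in range(c))
print(max_error)
```

The printed discrepancy should be at the level of floating-point rounding. The check covers only the Kloosterman side ˜4.10, not the comparison with ˜4.12.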

Corollary 4.11.

Let $c,M,N\in\mathbb{Z}_{+}$ with $1\leq M,N\leq c$ , $a\in(\mathbb{Z}/c\mathbb{Z})^{\times}$ , and $\mathcal{I},\mathcal{J}\subset\mathbb{Z}$ be intervals of lengths $|\mathcal{I}|=M$ , $|\mathcal{J}|=N$ . Let $K_{c,a}^{\mathcal{I},\mathcal{J}}\in\mathbb{C}^{\mathcal{I}\times\mathcal{J}}$ be the $M\times N$ matrix indexed by $m\in\mathcal{I}$ and $n\in\mathcal{J}$ , with entries

(K_{c,a}^{\mathcal{I},\mathcal{J}})_{m,n}:=S(am,n;c)\mathbbm{1}_{(m,n,c)=1}.

(4.13)

Let $\varepsilon>0$ and $H_{1}:=c^{1+\varepsilon}M^{-1}$ , $H_{2}:=c^{1+\varepsilon}N^{-1}$ . Then there exist complex weights $\alpha_{h},\beta_{h}\ll 1$ such that for the function $F_{c,a}^{H_{1},H_{2}}:\textnormal{SL}_{2}(\mathbb{Z}/c\mathbb{Z})\to\mathbb{C}$ given by

F_{c,a}^{H_{1},H_{2}}:=\frac{1}{H_{1}H_{2}}\sum_{\begin{subarray}{c}|h_{1}|\leq H_{1}\\ |h_{2}|\leq H_{2}\end{subarray}}\alpha_{h_{1}}\beta_{h_{2}}\mathbbm{1}_{T^{\overline{a}h_{1}}ST^{h_{2}}},

(4.14)

one has

\|K_{c,a}^{\mathcal{I},\mathcal{J}}\|\leq c^{1+2\varepsilon}\|\widehat{F}_{c,a}^{H_{1},H_{2}}(\rho_{c}^{\circ})\|+O_{\varepsilon}(c^{-100}).

(4.15)

Remark.

Given ˜4.14, one can apply the triangle inequality for the operator norm to obtain

\begin{aligned}
\|\widehat{F}_{c,a}^{H_{1},H_{2}}(\rho_{c}^{\circ})\|&=\Bigg\|\frac{1}{H_{1}H_{2}}\sum_{\begin{subarray}{c}|h_{1}|\leq H_{1}\\ |h_{2}|\leq H_{2}\end{subarray}}\alpha_{h_{1}}\beta_{h_{2}}\,\rho_{c}^{\circ}(T^{\overline{a}h_{1}}ST^{h_{2}})\Bigg\|\\
&\leq\frac{1}{H_{1}H_{2}}\sum_{\begin{subarray}{c}|h_{1}|\leq H_{1}\\ |h_{2}|\leq H_{2}\end{subarray}}|\alpha_{h_{1}}\beta_{h_{2}}|\,\|\rho_{c}^{\circ}(T^{\overline{a}h_{1}}ST^{h_{2}})\|\ll 1,
\end{aligned}

(4.16)

since $\|\rho_{c}^{\circ}(T^{\overline{a}h_{1}}ST^{h_{2}})\|=1$ (as the norm of a unitary map). Plugging this into ˜4.15 recovers the trivial bound $\|K_{c,a}^{\mathcal{I},\mathcal{J}}\|\ll c^{1+o(1)}$ from ˜1.2. Our task in the later sections will therefore be to establish some power-saving spectral cancellation in the sum over $h_{1},h_{2}$ from ˜4.16.

Proof of Corollary˜4.11.

Let us write $[M]:=\{1,\ldots,M\}$ , $[N]:=\{1,\ldots,N\}$ , and $\mathcal{I}=[M]+r$ , $\mathcal{J}=[N]+s$ for some $r,s\in\mathbb{Z}$ . Since $M,N\leq c$ , we may identify $\mathcal{I}$ , $\mathcal{J}$ with their images in $\mathbb{Z}/c\mathbb{Z}$ . Let $\Phi:\mathbb{R}\to\mathbb{C}$ be a smooth function supported in $[-1,2]$ , such that $\Phi\geq\mathbbm{1}_{[0,1]}$ and $\Phi^{(j)}\ll_{j}1$ for $j\geq 0$ , and define $\psi_{1},\psi_{2}:\mathbb{Z}/c\mathbb{Z}\to\mathbb{C}$ by

\psi_{1}(m):=\sum_{\begin{subarray}{c}m^{\prime}\in\mathbb{Z}\\ a(m^{\prime}+r)\equiv m\ (\textnormal{mod }c)\end{subarray}}\Phi\left(\frac{m^{\prime}}{M}\right),\qquad\qquad\psi_{2}(n):=\sum_{\begin{subarray}{c}n^{\prime}\in\mathbb{Z}\\ n^{\prime}+s\equiv n\ (\textnormal{mod }c)\end{subarray}}\Phi\left(\frac{n^{\prime}}{N}\right).

(4.17)

Since $\Phi\geq\mathbbm{1}_{[0,1]}$ , we have $\psi_{1}\geq\mathbbm{1}_{a\mathcal{I}}$ and $\psi_{2}\geq\mathbbm{1}_{\mathcal{J}}$ (viewing these as functions on $\mathbb{Z}/c\mathbb{Z}$ ). But scaling a row or a column of a matrix by a constant in $[0,1]$ can only decrease its spectral norm, so with the notation from ˜4.8 we get

\|K_{c}^{\psi_{1},\psi_{2}}\|\geq\|K_{c,a}^{\mathcal{I},\mathcal{J}}\|.

So from Proposition˜4.10, it follows that

\|K_{c,a}^{\mathcal{I},\mathcal{J}}\|\leq c\|\widehat{F}_{c}^{\psi_{1},\psi_{2}}(\rho_{c}^{\circ})\|,

(4.18)

and it remains to compute $F_{c}^{\psi_{1},\psi_{2}}$ . For $h\in\mathbb{Z}/c\mathbb{Z}$ , we obtain from ˜4.17, ˜3.11, ˜3.3, and ˜3.2 that

\begin{aligned}
\widehat{\psi}_{1}(h)=\sum_{m\in\mathbb{Z}/c\mathbb{Z}}\psi_{1}(m)\,e\left(-\frac{hm}{c}\right)&=\sum_{m^{\prime}\in\mathbb{Z}}\Phi\left(\frac{m^{\prime}}{M}\right)e\left(-\frac{ah(m^{\prime}+r)}{c}\right)\\
&=Me\left(-\frac{rah}{c}\right)\sum_{k\in\mathbb{Z}}\widehat{\Phi}\left(M\Big(k+\frac{ah}{c}\Big)\right)\\
&=Me\left(-\frac{rah}{c}\right)\sum_{\begin{subarray}{c}h^{\prime}\in\mathbb{Z}\\ h^{\prime}\equiv ah\ (\textnormal{mod }c)\end{subarray}}\widehat{\Phi}\left(\frac{h^{\prime}}{c/M}\right),
\end{aligned}

and similarly for $\widehat{\psi}_{2}(h)$ (with $1$ in place of $a$ ). Plugging this into ˜4.9, we conclude that

\begin{aligned}
F_{c}^{\psi_{1},\psi_{2}}&=\frac{MN}{c^{2}}\sum_{h_{1},h_{2}\in\mathbb{Z}/c\mathbb{Z}}\sum_{\begin{subarray}{c}h_{1}^{\prime},h_{2}^{\prime}\in\mathbb{Z}\\ h_{1}^{\prime}\equiv ah_{1}\ (\textnormal{mod }c)\\ h_{2}^{\prime}\equiv h_{2}\ (\textnormal{mod }c)\end{subarray}}\widehat{\Phi}\left(\frac{h_{1}^{\prime}}{c/M}\right)\widehat{\Phi}\left(\frac{h_{2}^{\prime}}{c/N}\right)e\left(\frac{-rah_{1}-sh_{2}}{c}\right)\mathbbm{1}_{T^{h_{1}}ST^{h_{2}}}\\
&=\frac{c^{2\varepsilon}}{H_{1}H_{2}}\sum_{h_{1}^{\prime},h_{2}^{\prime}\in\mathbb{Z}}\widehat{\Phi}\left(\frac{h_{1}^{\prime}}{c/M}\right)\widehat{\Phi}\left(\frac{h_{2}^{\prime}}{c/N}\right)e\left(\frac{-rh_{1}^{\prime}-sh_{2}^{\prime}}{c}\right)\mathbbm{1}_{T^{\overline{a}h_{1}^{\prime}}ST^{h_{2}^{\prime}}}.
\end{aligned}

In the second line we used $ah_{1}\equiv h_{1}^{\prime}\ (\textnormal{mod }c)$ to rewrite both the phase and the exponent of $T$ , together with $\tfrac{MN}{c^{2}}=\tfrac{c^{2\varepsilon}}{H_{1}H_{2}}$ . Using the Schwartz decay of $\widehat{\Phi}$ , we can discard the contribution of the terms with $|h_{1}^{\prime}|>H_{1}$ or $|h_{2}^{\prime}|>H_{2}$ to $\|\widehat{F}_{c}^{\psi_{1},\psi_{2}}(\rho_{c}^{\circ})\|$ , up to an error of $O_{\varepsilon}(c^{-100})$ . Choosing

\alpha_{h}:=\widehat{\Phi}\left(\frac{h}{c/M}\right)e\left(-\frac{rh}{c}\right),\qquad\qquad\beta_{h}:=\widehat{\Phi}\left(\frac{h}{c/N}\right)e\left(-\frac{sh}{c}\right)

concludes our proof in light of ˜4.18 and ˜4.14. ∎

5. The amplification argument

Recall that Proposition˜4.10 and Corollary˜4.11 reduce the problem of bounding bilinear forms with Kloosterman sums to that of bounding Fourier coefficients of certain functions on $\textnormal{SL}_{2}(\mathbb{Z}/c\mathbb{Z})$ at a certain representation. One can then reduce to irreducible subrepresentations via Lemma˜3.10. To use Fourier analysis, we will pass to a sum over all irreducible representations of $\textnormal{SL}_{2}(\mathbb{Z}/c\mathbb{Z})$ . However, to avoid a critical loss, we insert an amplifier weight in this sum, as outlined in Section˜2.3. Making this work in a non-abelian setting is the key step in our argument.

5.1. Introducing the amplifier

We state the results in this subsection in fair generality, since they may be useful in other contexts. We recall the notation for Schatten norms from Section˜3.1.

Proposition 5.1 (Non-abelian amplification).

Let $G$ be a finite group, $N\triangleleft G$ , $F:G\to\mathbb{C}$ , $\rho\in\widehat{G}$ , $\chi:=\textnormal{Tr}\,\rho$ , and $q$ be an even positive integer. Then one has

\|\widehat{F}(\rho)\|_{S^{q}}^{q}\leq\frac{|G|}{\sum_{n\in N}|\chi(n)|^{2}}\sum_{\begin{subarray}{c}g_{1},\ldots,g_{q}\in G\\ g_{1}\cdots g_{q}\in N\end{subarray}}F(g_{1})\overline{F}(g_{2}^{-1})\cdots F(g_{q-1})\overline{F}(g_{q}^{-1})\chi(g_{1}\cdots g_{q}).

Remark.

In comparison, expanding $\|\widehat{F}(\rho)\|_{S^{q}}^{q}$ by ˜3.10 and 3.5 yields

\|\widehat{F}(\rho)\|_{S^{q}}^{q}=\sum_{g_{1},\ldots,g_{q}\in G}F(g_{1})\overline{F}(g_{2}^{-1})\cdots F(g_{q-1})\overline{F}(g_{q}^{-1})\chi(g_{1}\cdots g_{q}).

The upper bound from Proposition˜5.1 replaces $\chi$ with a function proportional to $\chi\mathbbm{1}_{N}$ , normalized so that equality is attained when $F=\overline{\chi}$ . The requirement that $\rho$ be irreducible is crucial.

We first prove Proposition˜5.1 using character orthogonality, which resembles more classical amplification arguments, and then give a more conceptual sketch of proof via induced representations. The first proof has the advantage that it may be generalized by considering other amplifiers.

Proof of Proposition˜5.1 via character orthogonality.

Write $F_{0}:=F$ and $F_{1}:G\to\mathbb{C}$ for the function $F_{1}(g):=\overline{F}(g^{-1})$ , so that $\widehat{F}_{1}(\rho)=\widehat{F}(\rho)^{*}$ . By considering the function

F_{\text{conv}}:=F_{0}*F_{1}*F_{0}*\cdots*F_{(\frac{q}{2}-1)\ \textnormal{mod }2},

where there are $\tfrac{q}{2}$ factors in the convolution, so that $\|\widehat{F}_{\text{conv}}(\rho)\|_{S^{2}}^{2}=\|\widehat{F}(\rho)\|_{S^{q}}^{q}$ , we see that it suffices to prove the desired result when $q=2$ .

Let $\mathbbm{1}_{N}:G\to\{0,1\}$ be the indicator function of $N$ , and consider the amplifier $A:\widehat{G}\to[0,\infty)$ given at $\rho^{\prime}\in\widehat{G}$ with $\chi^{\prime}:=\textnormal{Tr}\rho^{\prime}$ by

\begin{aligned}
A(\rho^{\prime})=\frac{1}{|N|}\left\|\widehat{\mathbbm{1}}_{N}\left(\overline{\rho}^{\prime}\otimes\rho\right)\right\|_{S^{2}}^{2}&=\frac{1}{|N|}\left\|\sum_{n\in N}\overline{\rho}^{\prime}(n)\otimes\rho(n)\right\|_{S^{2}}^{2}\\
&=\frac{1}{|N|}\textnormal{Tr}\left(\sum_{n_{1},n_{2}\in N}\overline{\rho}^{\prime}(n_{1}n_{2}^{-1})\otimes\rho(n_{1}n_{2}^{-1})\right)=\sum_{n\in N}\overline{\chi^{\prime}}(n)\chi(n),
\end{aligned}

where we implicitly used that $N$ is a group. In particular, $A(\rho)=\sum_{n\in N}|\chi(n)|^{2}$ is a positive integer multiple of $|N|$ , due to ˜3.9. It follows from this and nonnegativity that

\begin{aligned}
\left(\sum_{n\in N}|\chi(n)|^{2}\right)\|\widehat{F}(\rho)\|_{S^{2}}^{2}&\leq\sum_{\rho^{\prime}\in\widehat{G}}A(\rho^{\prime})\|\widehat{F}(\rho^{\prime})\|_{S^{2}}^{2}\\
&=\sum_{\rho^{\prime}\in\widehat{G}}\sum_{n\in N}\overline{\textnormal{Tr}\rho^{\prime}}(n)\chi(n)\|\widehat{F}(\rho^{\prime})\|_{S^{2}}^{2}=\sum_{n\in N}\chi(n)\sum_{\rho^{\prime}\in\widehat{G}}\overline{\textnormal{Tr}\rho^{\prime}}(n)\|\widehat{F}(\rho^{\prime})\|_{S^{2}}^{2}.
\end{aligned}

Expanding $\|\widehat{F}(\rho^{\prime})\|_{S^{2}}^{2}$ via ˜3.10 and 3.5 and then using ˜3.8, the sum over $\rho^{\prime}$ above becomes

\begin{aligned}
\sum_{\rho^{\prime}\in\widehat{G}}\overline{\textnormal{Tr}\rho^{\prime}}(n)\sum_{g_{1},g_{2}\in G}F(g_{1})\overline{F}(g_{2}^{-1})\textnormal{Tr}\rho^{\prime}(g_{1}g_{2})&=\sum_{g_{1},g_{2}\in G}F(g_{1})\overline{F}(g_{2}^{-1})\sum_{\chi^{\prime}\in\textnormal{Irr}(G)}\overline{\chi^{\prime}}(n)\chi^{\prime}(g_{1}g_{2})\\
&=\sum_{g_{1},g_{2}\in G}F(g_{1})\overline{F}(g_{2}^{-1})\frac{|G|}{|C_{n}|}\mathbbm{1}_{g_{1}g_{2}\in C_{n}},
\end{aligned}

where $C_{n}$ denotes the conjugacy class of $n$ in $G$ . Since $N$ can be partitioned into such conjugacy classes by normality, and since characters are constant on conjugacy classes, it follows that

\begin{aligned}
\left(\sum_{n\in N}|\chi(n)|^{2}\right)\|\widehat{F}(\rho)\|_{S^{2}}^{2}&\leq|G|\sum_{g_{1},g_{2}\in G}F(g_{1})\overline{F}(g_{2}^{-1})\sum_{n\in N}\chi(n)\frac{\mathbbm{1}_{g_{1}g_{2}\in C_{n}}}{|C_{n}|}\\
&=|G|\sum_{\begin{subarray}{c}g_{1},g_{2}\in G\\ g_{1}g_{2}\in N\end{subarray}}F(g_{1})\overline{F}(g_{2}^{-1})\chi(g_{1}g_{2}).
\end{aligned}

This settles the case $q=2$ , thus completing our proof. ∎
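The case $q=2$ of Proposition 5.1 is easy to test on a small example. The sketch below is an illustration chosen for this purpose and is not taken from the argument above: it uses $G=S_{4}$, the $3$-dimensional standard representation with character $\chi(g)=\#\{\text{fixed points of }g\}-1$, and the normal subgroups $N\in\{\{e\},V_{4},S_{4}\}$. It evaluates $\|\widehat{F}(\rho)\|_{S^{2}}^{2}$ through the character expansion from the Remark following the proposition, for a pseudo-random $F$ with Gaussian-integer values; all function names are hypothetical.

```python
from itertools import permutations
import random

G = list(permutations(range(4)))              # the symmetric group S_4
IDENTITY = tuple(range(4))

def mul(g, h):
    """Composition (g . h)(i) = g(h(i))."""
    return tuple(g[h[i]] for i in range(4))

def inv(g):
    out = [0] * 4
    for i, gi in enumerate(g):
        out[gi] = i
    return tuple(out)

def chi(g):
    """Character of the standard 3-dimensional irreducible representation."""
    return sum(1 for i in range(4) if g[i] == i) - 1

def twisted_sum(F, N):
    """Sum of F(g1) conj(F(g2^{-1})) chi(g1 g2) over pairs with g1 g2 in N."""
    return sum(F[g1] * F[inv(g2)].conjugate() * chi(mul(g1, g2))
               for g1 in G for g2 in G if mul(g1, g2) in N)

random.seed(0)
F = {g: complex(random.randint(-5, 5), random.randint(-5, 5)) for g in G}

# Klein four-group: the identity and the three double transpositions.
V4 = {g for g in G if mul(g, g) == IDENTITY and chi(g) in (3, -1)}

lhs = twisted_sum(F, set(G)).real             # equals ||F^(rho)||_{S^2}^2
results = {}
for name, N in [("trivial", {IDENTITY}), ("V4", V4), ("S4", set(G))]:
    amplifier = sum(chi(n) ** 2 for n in N)   # sum over N of |chi(n)|^2
    rhs = len(G) / amplifier * twisted_sum(F, N).real
    results[name] = (lhs, rhs)
print(results)
```

For $N=S_{4}$ the two sides agree, since $\sum_{g\in G}|\chi(g)|^{2}=|G|$ for irreducible $\chi$; for the smaller subgroups the right-hand side can only be larger.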

Remark.

After reducing to the case $q=2$ , one could also attempt to use the triangle inequality for Schatten norms and Cauchy–Schwarz, in the shape

\begin{aligned}
\left\|\sum_{g\in G}F(g)\rho(g)\right\|_{S^{2}}^{2}&\leq\left(\sum_{Ng_{0}\in N\backslash G}\left\|\sum_{g\in Ng_{0}}F(g)\rho(g)\right\|_{S^{2}}\right)^{2}\\
&\leq[G:N]\sum_{Ng_{0}\in N\backslash G}\left\|\sum_{g\in Ng_{0}}F(g)\rho(g)\right\|_{S^{2}}^{2}=\frac{|G|}{|N|}\sum_{\begin{subarray}{c}g_{1},g_{2}\in G\\ g_{1}g_{2}\in N\end{subarray}}F(g_{1})\overline{F}(g_{2}^{-1})\chi(g_{1}g_{2}).
\end{aligned}

This argument does not use that $N$ is normal or that $\rho$ is irreducible, but it produces a weaker bound in general. Indeed, compared to Proposition˜5.1, the bound above loses a factor of

\frac{1}{|N|}\sum_{n\in N}|\chi(n)|^{2},

which is a (potentially large) positive integer by ˜3.9; note that although $\chi$ is irreducible on $G$ , it is usually not irreducible on $N$ (e.g., when $N=\{e\}$ is the trivial subgroup, the factor lost is $(\dim\rho)^{2}$ ). This is a discrepancy which does not arise in the abelian setting, when all irreducible characters are $1$ -dimensional—which is why classical amplification arguments with Dirichlet characters can often be re-formulated by applying Cauchy–Schwarz to a sum over residues (see, e.g., [28]).

Proof sketch of Proposition˜5.1 using induced representations.

Starting from $\rho:G\to U(V)$ , consider the restricted representation $\rho|_{N}:N\to U(V)$ , and then the induced representation

R:=\mathrm{Ind}_{N}^{G}(\rho|_{N}).

This acts by translation on the space

W:=\{f\in V^{G}:f(ng)=\rho(n)f(g),\forall n\in N,g\in G\}\cong V^{G/N},

so $\dim R=\dim W=[G:N]\dim V=\tfrac{|G|}{|N|}\dim\rho$ . It can be shown using ˜3.9 that $\rho$ is a subrepresentation of $R$ , with

\textnormal{Mult}(\rho,R)=\frac{1}{|N|}\sum_{n\in N}|\chi(n)|^{2},

and therefore, by Lemma˜3.10,

\|\widehat{F}(R)\|_{S^{q}}^{q}=\sum_{\rho^{\prime}\in\widehat{G}}\textnormal{Mult}(\rho^{\prime},R)\|\widehat{F}(\rho^{\prime})\|_{S^{q}}^{q}\geq\frac{1}{|N|}\left(\sum_{n\in N}|\chi(n)|^{2}\right)\|\widehat{F}(\rho)\|_{S^{q}}^{q}.

To complete the proof, one can expand $\|\widehat{F}(R)\|_{S^{q}}^{q}$ via ˜3.10 and 3.5, and use the Frobenius formula

\textnormal{Tr}R(g)=\frac{1}{|N|}\sum_{\begin{subarray}{c}x\in G\\ x^{-1}gx\in N\end{subarray}}\chi(x^{-1}gx)=\frac{|G|}{|N|}\chi(g)\mathbbm{1}_{N}(g),

for all $g\in G$ , where the last equality uses the normality of $N$ . ∎

Remark.

In the particular case when $N=\{e\}$ is the trivial subgroup, the induced representation $R$ in the proof above is (isomorphic to) the regular representation $R_{G}$ . In this case, the conclusion of Proposition˜5.1 simply reads

\|\widehat{F}(\rho)\|_{S^{q}}^{q}\leq\frac{1}{\dim\rho}\|\widehat{F}(R_{G})\|_{S^{q}}^{q},

and similar ideas appear in [37, 30].

5.2. Passing to a counting problem

We now return to the setting when $G=\textnormal{SL}_{2}(\mathbb{Z}/c\mathbb{Z})$ and $N=\Gamma_{c}(d)$ for some $d\mid c$ . The goal is to pass from the spectral norm $\|\widehat{F}_{c,a}^{H_{1},H_{2}}(\rho_{c}^{\circ})\|$ in ˜4.15 to a count of solutions to a certain equation in $\textnormal{PSL}_{2}(\mathbb{Z}/d\mathbb{Z})$ . After applying Proposition˜5.1, we will need an upper bound for $\chi(g_{1}\cdots g_{q})$ , and a lower bound for the denominator $\sum_{n\in N}|\chi(n)|^{2}$ . For the specific characters $\chi_{c}$ and $\chi^{\circ}_{c}$ from Section˜4, such bounds are given in the following results.

Lemma 5.2.

Let $c\in\mathbb{Z}_{+}$ , $g\in\textnormal{SL}_{2}(\mathbb{Z}/c\mathbb{Z})$ , and $d$ be the largest divisor of $c$ such that

g\in\{\gamma\in\mathbb{Z}/c\mathbb{Z}:\gamma^{2}=1\}\cdot\Gamma_{c}(d).

Let $f\leq\sqrt{cd}$ be the largest positive integer such that $f^{2}\mid cd$ . Then one has

\chi_{c}(g)\ll c^{o(1)}f.

(5.1)

Remark.

The bound in ˜5.1 is sharp in terms of $c$ and $d$ , as can be seen by taking $g=T^{d}$ . In particular, if $c=p^{k}$ is a prime power, then $g=T^{p^{j}}$ has $p^{\left\lfloor(k+j)/2\right\rfloor}$ fixed points of the shape $[1:x]$ , where $p^{j}x^{2}\equiv 0\ (\textnormal{mod }p^{k})$ .
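The fixed-point count in this remark can be confirmed by brute force. Assuming that $T=\left(\begin{smallmatrix}1&1\\ 0&1\end{smallmatrix}\right)$ acts on column vectors, one has $T^{h}[1:x]=[1+hx:x]$, which equals $[1:x]$ exactly when $hx^{2}\equiv 0\ (\textnormal{mod }p^{k})$. The short Python sketch below (with a hypothetical function name) counts such $x$ for $h=p^{j}$ and compares the result with $p^{\lfloor(k+j)/2\rfloor}$.

```python
def affine_fixed_points(p, k, j):
    """Count x mod p^k with p^j * x^2 = 0 mod p^k, i.e. fixed points [1:x] of T^{p^j}."""
    modulus = p ** k
    return sum(1 for x in range(modulus) if (p ** j * x * x) % modulus == 0)

# Compare with the claimed count p^{floor((k + j) / 2)} over a small range.
checks = {(p, k, j): affine_fixed_points(p, k, j) == p ** ((k + j) // 2)
          for p in (2, 3, 5) for k in range(1, 5) for j in range(k + 1)}
print(all(checks.values()))
```

Here $p^{\lfloor(k+j)/2\rfloor}$ is precisely the quantity $f$ from Lemma 5.2 for $c=p^{k}$ and $d=p^{j}$.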

The proof of Lemma˜5.2 is left to Appendix˜A.

Proposition 5.3 (Lower bound for squared character sums).

Let $c\in\mathbb{Z}_{+}$ and $\chi$ be an irreducible character inside $\chi_{c}^{\circ}$ . Let $d,d^{\prime},e\in\mathbb{Z}_{+}$ be such that $d^{\prime}\mid d$ , $(d,e)=1$ , and $c=dd^{\prime}e$ . Then one has

\sum_{n\in\Gamma_{c}(d)}|\chi(n)|^{2}\gg\frac{c^{3-o(1)}}{d}.

Remark.

The lower bound in Proposition˜5.3 wins a factor of about $d^{2}$ over the ‘trivial’ bound of $|\Gamma_{c}(d)|\asymp\tfrac{c^{3}}{d^{3}}$ due to ˜3.9. This is because $|\chi(n)|$ typically has size $\gtrsim d$ when $n\in\Gamma_{c}(d)$ (note that when $d=c$ , one has $\Gamma_{c}(c)=\{I\}$ and $\chi(I)=\dim\chi$ ).

The proof of Proposition˜5.3 reduces to a local computation (i.e., for $c=p^{k}$ a prime power) given below, which builds on Lemma˜3.16. Indeed, one can rephrase Lemma˜3.16 as follows: if $\chi\in\textnormal{Irr}(\textnormal{SL}_{2}(\mathbb{Z}/p^{k}\mathbb{Z}))$ is primitive, then $|\chi(I)|^{2}\gg p^{2k}$ . Using a bit of Clifford theory, we can generalize this to averages of $|\chi|^{2}$ over the congruence subgroups $\Gamma_{p^{k}}(p^{j})$ , for some values of $j$ .

Lemma 5.4.

Let $p^{k}$ be a prime power and $j\in\mathbb{Z}$ be such that either $j=0$ or $\tfrac{k}{2}\leq j\leq k$ . Let $\chi$ be a primitive irreducible character of $\textnormal{SL}_{2}(\mathbb{Z}/p^{k}\mathbb{Z})$ . Then

\sum_{n\in\Gamma_{p^{k}}(p^{j})}|\chi(n)|^{2}\gg(k-j+1)^{-1}p^{3k-j}.

Remark.

We expect the bound in Lemma˜5.4 to be sharp, and to actually hold for all $0\leq j\leq k$ . This might follow from a more careful study of $\widehat{\Gamma}_{p^{k}}(p^{j})$ for $1\leq j<\tfrac{k}{2}$ , and it would imply Proposition˜5.3 (as well as Theorem˜1.2) for more flexible factorizations $c=de$ .

The proof of Lemma 5.4 is also left to Appendix A, a key ingredient being the fact that $\Gamma_{p^{k}}(p^{j})$ is abelian if $\tfrac{k}{2}\leq j\leq k$.

Proof of Proposition 5.3.

By (the proof of) Proposition˜4.6, $\rho_{c}^{\circ}$ is isomorphic to a direct sum of representations of the shape

\rho=\mathop{\mathchoice{\vbox{\hbox{\scalebox{2.0}{$\displaystyle\boxtimes$}}}}{\vbox{\hbox{\scalebox{2.0}{$\displaystyle\boxtimes$}}}}{\vbox{\hbox{\scalebox{2.0}{$\displaystyle\boxtimes$}}}}{\vbox{\hbox{\scalebox{2.0}{$\displaystyle\boxtimes$}}}}}_{p^{k}\|c}\rho_{p,k},

(5.2)

where each $\rho_{p,k}$ is a primitive irreducible representation of $\textnormal{SL}_{2}(\mathbb{Z}/p^{k}\mathbb{Z})$. This gives the decomposition of $\rho_{c}^{\circ}$ into irreducible representations, so any irreducible representation occurring inside $\rho_{c}^{\circ}$ must be of the form in (5.2). Now let $\chi:=\textnormal{Tr}\rho$ and $\chi_{p,k}:=\textnormal{Tr}\rho_{p,k}$ for such a representation. From (3.16) and the fact that $\chi(n)=\prod_{p^{k}\|c}\chi_{p,k}(\pi_{c,p^{k}}(n))$ for $n\in\textnormal{SL}_{2}(\mathbb{Z}/c\mathbb{Z})$, it follows that

\sum_{n\in\Gamma_{c}(d)}|\chi(n)|^{2}=\prod_{\begin{subarray}{c}p^{k}\|c\\ p^{j}\|d\end{subarray}}\,\sum_{n\in\Gamma_{p^{k}}(p^{j})}|\chi_{p,k}(n)|^{2}.

One can then apply Lemma 5.4 to obtain

\sum_{n\in\Gamma_{c}(d)}|\chi(n)|^{2}\gg\prod_{\begin{subarray}{c}p^{k}\|c\\ p^{j}\|d\end{subarray}}(k+1)^{-1}p^{3k-j}.

Note that the hypothesis on $j$ from Lemma˜5.4 is satisfied because whenever $p$ is a prime dividing $c$ with $p^{k}\|c$ and $p^{j}\|d$ , one of the following holds:

(1)

One has $p\mid e$ and $p\nmid d$ , so $j=0$ , or
(2)

One has $p\mid d$ and $p\nmid e$ , so $k=v_{p}(dd^{\prime})\leq 2j$ .

The desired conclusion then follows from the divisor bound. ∎
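The two arithmetic facts used in this proof are elementary and can be tested by brute force. The first is that every prime $p\mid c$ satisfies $j=0$ or $\tfrac{k}{2}\leq j\leq k$. The second is that $\prod p^{3k-j}=c^{3}/d$. The Python sketch below, with illustrative helper names of our own choosing, checks both over a small range of factorizations $c=dd^{\prime}e$.

```python
from math import gcd


def prime_power_factorization(n):
    """Return {p: exponent} for n >= 1 by trial division."""
    factors, p = {}, 2
    while p * p <= n:
        while n % p == 0:
            factors[p] = factors.get(p, 0) + 1
            n //= p
        p += 1
    if n > 1:
        factors[n] = factors.get(n, 0) + 1
    return factors


def check_factorization(d, d_prime, e):
    """For c = d*d'*e with d' | d and (d, e) = 1, check the hypothesis on j
    at every prime p | c, and the identity prod p^(3k-j) = c^3 / d."""
    assert d % d_prime == 0 and gcd(d, e) == 1
    c = d * d_prime * e
    fc, fd = prime_power_factorization(c), prime_power_factorization(d)
    product = 1
    for p, k in fc.items():
        j = fd.get(p, 0)
        if not (j == 0 or k <= 2 * j <= 2 * k):
            return False
        product *= p ** (3 * k - j)
    return product * d == c ** 3


all_ok = all(
    check_factorization(d, dp, e)
    for d in range(1, 40)
    for dp in range(1, d + 1) if d % dp == 0
    for e in range(1, 40) if gcd(d, e) == 1
)
print(all_ok)
```

The remaining factor $\prod(k+1)^{-1}$ is the reciprocal of the number of divisors of $c$, which is where the divisor bound enters.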

Finally, we can state the result of our amplification argument.

Proposition 5.5 (From Fourier coefficients to a counting problem).

Let $c\in\mathbb{Z}_{+}$ , $a\in(\mathbb{Z}/c\mathbb{Z})^{\times}$ , $H_{1},H_{2}\gg 1$ , and $F_{c,a}^{H_{1},H_{2}}$ be as in ˜4.14. Let $d,d^{\prime},e\in\mathbb{Z}_{+}$ be such that $(d,e)=1$ , $d^{\prime}\mid d$ , and $c=dd^{\prime}e$ . Then for any even positive integer $q$ , one has

\|\widehat{F}_{c,a}^{H_{1},H_{2}}(\rho_{c}^{\circ})\|_{S^{q}}^{q}\ll\frac{c^{o(1)}d}{(H_{1}H_{2})^{q/2}}\max_{\begin{subarray}{c}d\mid\tilde{d}\mid c\\ \tilde{f}^{2}\mid c\tilde{d}\end{subarray}}\tilde{f}\sum_{\begin{subarray}{c}h_{1},\ldots,h_{q}\in\mathbb{Z}\\ |h_{i}|\leq 2H_{j}\\ \forall i\equiv j\ (\textnormal{mod }2)\end{subarray}}\mathbbm{1}_{T^{\overline{a}h_{1}}ST^{h_{2}}S\cdots T^{\overline{a}h_{q-1}}ST^{h_{q}}S=I\text{ in }\textnormal{PSL}_{2}(\mathbb{Z}/\tilde{d}\mathbb{Z})},

where both variables $\tilde{d},\tilde{f}$ in the maximum are understood to be integers (note that $\tilde{f}\leq\sqrt{c\tilde{d}}$).

Proof.

By Proposition˜4.6, $\rho_{c}^{\circ}$ is a sum of $c^{o(1)}$ irreducible representations $\rho$ . By Lemma˜3.10, it suffices to prove the desired upper bound for each $\|\widehat{F}_{c,a}^{H_{1},H_{2}}(\rho)\|_{S^{q}}^{q}$ . The loss factor of $c^{o(1)}$ is acceptable here (but if one is only interested in the spectral norm, so $q=\infty$ , there is actually no loss at this step).

For each irreducible representation $\rho$ with $\textnormal{Mult}(\rho,\rho_{c}^{\circ})>0$ , we apply Propositions˜5.1 and 5.3 with $F:=F_{c,a}^{H_{1},H_{2}}$ to obtain

\|\widehat{F}(\rho)\|_{S^{q}}^{q}\ll\frac{c^{3}}{c^{3-o(1)}/d}\sum_{\begin{subarray}{c}g_{1},\ldots,g_{q}\in\textnormal{SL}_{2}(\mathbb{Z}/c\mathbb{Z})\\ g_{1}\cdots g_{q}\in\Gamma_{c}(d)\end{subarray}}F(g_{1})\overline{F}(g_{2}^{-1})\cdots F(g_{q-1})\overline{F}(g_{q}^{-1})\chi(g_{1}\cdots g_{q}),

where $\chi:=\textnormal{Tr}\rho$ . In fact, by Proposition˜5.1, the sum above is nonnegative if one replaces $\chi$ with any irreducible character of $\textnormal{SL}_{2}(\mathbb{Z}/c\mathbb{Z})$ . Summing over all such characters $\chi^{\prime}=\textnormal{Tr}\rho^{\prime}$ with weight $\textnormal{Mult}(\rho^{\prime},\rho_{c})$ (which is at least $1$ when $\rho^{\prime}=\rho$ ), we find that

\|\widehat{F}(\rho)\|_{S^{q}}^{q}\ll c^{o(1)}d\sum_{\begin{subarray}{c}g_{1},\ldots,g_{q}\in\textnormal{SL}_{2}(\mathbb{Z}/c\mathbb{Z})\\ g_{1}\cdots g_{q}\in\Gamma_{c}(d)\end{subarray}}F(g_{1})\overline{F}(g_{2}^{-1})\cdots F(g_{q-1})\overline{F}(g_{q}^{-1})\chi_{c}(g_{1}\cdots g_{q}).

Here $\rho_{c}$ is the original permutation representation from Definition˜4.1. In light of Lemma˜5.2, we ought to split the sum above based on the largest $\tilde{d}\mid c$ such that $g_{1}\cdots g_{q}\in\gamma\Gamma_{c}(\tilde{d})$ for some $\gamma\in\mathbb{Z}/c\mathbb{Z}$ with $\gamma^{2}=1$ ; note that by ˜3.12, this is equivalent to the equation $g_{1}\cdots g_{q}=I$ in $\textnormal{PSL}_{2}(\mathbb{Z}/\tilde{d}\mathbb{Z})$ . Then from the triangle inequality, Lemma˜5.2, and the divisor bound for $\tilde{d}$ , we find that

\|\widehat{F}(\rho)\|_{S^{q}}^{q}\ll c^{o(1)}d\max_{\begin{subarray}{c}d\mid\tilde{d}\mid c\\ \tilde{f}^{2}\mid c\tilde{d}\end{subarray}}\tilde{f}\sum_{g_{1},\ldots,g_{q}\in\textnormal{SL}_{2}(\mathbb{Z}/c\mathbb{Z})}|F(g_{1})F(g_{2}^{-1})\cdots F(g_{q-1})F(g_{q}^{-1})|\mathbbm{1}_{g_{1}\cdots g_{q}=I\text{ in }\textnormal{PSL}_{2}(\mathbb{Z}/\tilde{d}\mathbb{Z})}.

(5.3)

Now recalling (4.14), we can expand

|F(g)|=|F_{c,a}^{H_{1},H_{2}}(g)|\ll\frac{1}{H_{1}H_{2}}\sum_{\begin{subarray}{c}|h_{1}|\leq H_{1}\\ |h_{2}|\leq H_{2}\end{subarray}}\mathbbm{1}_{g=T^{\overline{a}h_{1}}ST^{h_{2}}}.

The conclusion follows by plugging this into ˜5.3, and noting that as $h,h^{\prime}$ vary in $[-H,H]\cap\mathbb{Z}$ , the difference $h-h^{\prime}$ varies in $[-2H,2H]\cap\mathbb{Z}$ , each value being attained $O(H)$ times. ∎
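The last observation in the proof above, on the differences $h-h^{\prime}$, is easy to confirm directly. The following Python sketch (the function name is ours, chosen for illustration) tabulates how often each difference occurs for $h,h^{\prime}\in[-H,H]\cap\mathbb{Z}$.

```python
from collections import Counter


def difference_multiplicities(H):
    """Count how often each value t = h - h' occurs for h, h' in [-H, H]."""
    return Counter(h - hp for h in range(-H, H + 1) for hp in range(-H, H + 1))


H = 7
mult = difference_multiplicities(H)
# Range of attained differences, and the largest multiplicity.
print(min(mult), max(mult), max(mult.values()))
```

Each difference $t$ with $|t|\leq 2H$ is attained exactly $2H+1-|t|$ times, which is $O(H)$ as used in the proof.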

6. Counting solutions in $\textnormal{PSL}_{2}(\mathbb{Z}/c\mathbb{Z})$

We now develop the final ingredient towards Theorem 1.2, as outlined in Section 2.4. Given $c,q\in\mathbb{Z}_{+}$ with $q$ even, $a_{1},a_{2}\in(\mathbb{Z}/c\mathbb{Z})^{\times}$, and $1\leq H_{1}\leq H_{2}\ll c$, we will count solutions $(h_{1},\ldots,h_{q})\in\mathbb{Z}^{q}$ to the system

\begin{cases}T^{a_{1}h_{1}}ST^{a_{2}h_{2}}S\cdots T^{a_{1}h_{q-1}}ST^{a_{2}h_{q}}S=I\text{ in }\textnormal{PSL}_{2}(\mathbb{Z}/c\mathbb{Z}),\\ |h_{i}|\leq H_{j},\text{ for all }i,j\text{ with }i\equiv j\ (\textnormal{mod }2).\end{cases}

(6.1)

In particular, the ranges of $h_{i}$ alone produce the trivial bound $\ll_{q}(H_{1}H_{2})^{q/2}$ for the number of solutions. Focusing on the case $a_{1}=a_{2}=1$ , below are some classes of solutions to ˜6.1:

$(i)$ .

Integer solutions (1). If $h_{1}=h_{\frac{q}{2}+1}=0$ (and similarly for cyclic permutations of this case), noting that $S^{2}=I$ in $\textnormal{PSL}_{2}(\mathbb{Z}/c\mathbb{Z})$ , ˜6.1 becomes

$T^{h_{2}}S\cdots ST^{h_{\frac{q}{2}}}=T^{-h_{q}}ST^{-h_{q-1}}S\cdots ST^{-h_{\frac{q}{2}+2}}.$

This has $\asymp_{q}H_{1}^{\left\lfloor(q-2)/4\right\rfloor}H_{2}^{\left\lceil(q-2)/4\right\rceil}$ diagonal solutions with $h_{i}=-h_{q-i+2}$ for $2\leq i\leq\tfrac{q}{2}$, which actually give solutions to (6.1) in $\textnormal{PSL}_{2}(\mathbb{Z})$.
$(ii)$ .

Integer solutions (2). If $h_{1}=h_{3}=\cdots=h_{q-1}=0$ , ˜6.1 becomes

$T^{h_{2}+h_{4}+\cdots+h_{q}}=I\qquad\iff\qquad h_{2}+h_{4}+\cdots+h_{q}\equiv 0\ (\textnormal{mod }c).$

This has $\asymp_{q}H_{2}^{(q-2)/2}$ solutions, which supersedes the diagonal contribution from $(i)$ .
$(iii)$ .

Generic terms. The product $T^{h_{1}}ST^{h_{2}}S\cdots T^{h_{q}}S$ can take $\approx c^{3}$ values in $\textnormal{PSL}_{2}(\mathbb{Z}/c\mathbb{Z})$ . If each matrix is attained roughly the same number of times (and such equidistribution ought to happen for large enough $q$ ), this gives an expected number of $\approx c^{-3}(H_{1}H_{2})^{q/2}$ solutions.

These heuristics can be formalized to produce a lower bound, imposing a limitation on our methods.

Lemma 6.1.

Let $c,q\in\mathbb{Z}_{+}$ with $q$ even, $a_{1},a_{2}\in(\mathbb{Z}/c\mathbb{Z})^{\times}$ , and $1\leq H_{1}\leq H_{2}\ll c$ . The number of solutions $(h_{1},\ldots,h_{q})\in\mathbb{Z}^{q}$ to ˜6.1 is at least

\gg_{q}H_{2}^{(q-2)/2}+\frac{(H_{1}H_{2})^{q/2}}{c^{3}}.

(6.2)

Proof.

The lower bound by the first term in ˜6.2 follows by considering the aforementioned integer solutions with $h_{1}=h_{3}=\cdots=h_{q-1}=0$ and $h_{2}+h_{4}+\cdots+h_{q}=0$ . To obtain a lower bound by the second term in ˜6.2, we let $k:=\tfrac{q+2}{2}$ , write $(H_{j},a_{j})$ for $(H_{1},a_{1})$ or $(H_{2},a_{2})$ depending on whether $j$ is odd or even, and apply Cauchy–Schwarz to obtain

	$\displaystyle\sum_{\begin{subarray}{c}h_{1},\ldots,h_{k}\in\mathbb{Z}\\ |h_{i}|\leq\frac{1}{2}H_{j}\\ \forall i\equiv j\ (\textnormal{mod }2)\end{subarray}}1$	$\displaystyle=\sum_{g\in\textnormal{PSL}_{2}(\mathbb{Z}/c\mathbb{Z})}\sum_{\begin{subarray}{c}h_{1},\ldots,h_{k}\in\mathbb{Z}\\ |h_{i}|\leq\frac{1}{2}H_{j}\\ \forall i\equiv j\ (\textnormal{mod }2)\end{subarray}}\mathbbm{1}_{T^{a_{1}h_{1}}S\cdots T^{a_{k}h_{k}}S=g}$
		$\displaystyle\ll c^{3/2}\Bigg(\sum_{g\in\textnormal{PSL}_{2}(\mathbb{Z}/c\mathbb{Z})}\Bigg(\sum_{\begin{subarray}{c}h_{1},\ldots,h_{k}\in\mathbb{Z}\\ |h_{i}|\leq\frac{1}{2}H_{j}\\ \forall i\equiv j\ (\textnormal{mod }2)\end{subarray}}\mathbbm{1}_{T^{a_{1}h_{1}}S\cdots T^{a_{k}h_{k}}S=g}\Bigg)^{2}\Bigg)^{1/2}$
		$\displaystyle=c^{3/2}\Bigg(\sum_{\begin{subarray}{c}h_{1},\ldots,h_{k}\in\mathbb{Z}\\ h_{1}^{\prime},\ldots,h_{k}^{\prime}\in\mathbb{Z}\\ |h_{i}|,|h_{i}^{\prime}|\leq\frac{1}{2}H_{j}\\ \forall i\equiv j\ (\textnormal{mod }2)\end{subarray}}\mathbbm{1}_{T^{a_{1}h_{1}}S\cdots T^{a_{k}h_{k}}S=T^{a_{1}h_{1}^{\prime}}S\cdots T^{a_{k}h_{k}^{\prime}}S\text{ in }\textnormal{PSL}_{2}(\mathbb{Z}/c\mathbb{Z})}\Bigg)^{1/2}.$

One can rewrite the last equation $T^{a_{1}h_{1}}S\cdots T^{a_{k}h_{k}}S=T^{a_{1}h_{1}^{\prime}}S\cdots T^{a_{k}h_{k}^{\prime}}S$ in $\textnormal{PSL}_{2}(\mathbb{Z}/c\mathbb{Z})$ as

T^{a_{1}(h_{1}-h_{1}^{\prime})}ST^{a_{2}h_{2}}S\cdots T^{a_{k-1}h_{k-1}}ST^{a_{k}(h_{k}-h_{k}^{\prime})}ST^{-a_{k-1}h_{k-1}^{\prime}}S\cdots T^{-a_{2}h_{2}^{\prime}}S=I.

Comparing this with ˜6.1, recalling that $k=\tfrac{q+2}{2}$ , and noting that $h_{1}-h_{1}^{\prime}$ takes each value in $\mathbb{Z}\cap[-H_{1},H_{1}]$ at most $O(H_{1})$ times (and similarly for $h_{k}-h_{k}^{\prime}$ ), we conclude that the desired count of solutions is at least

\gg_{q}\frac{1}{c^{3}H_{1}H_{k}}\Bigg(\sum_{\begin{subarray}{c}h_{1},\ldots,h_{k}\in\mathbb{Z}\\ |h_{i}|\leq\frac{1}{2}H_{j}\\ \forall i\equiv j\ (\textnormal{mod }2)\end{subarray}}1\Bigg)^{2}\gg_{q}\frac{1}{c^{3}H_{1}H_{k}}H_{1}^{2\left\lceil k/2\right\rceil}H_{2}^{2\left\lfloor k/2\right\rfloor}=\frac{H_{1}^{k-1}H_{2}^{k-1}}{c^{3}}.

Since $k-1=\tfrac{q}{2}$ , this completes our proof. ∎
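For small parameters, the first term of the lower bound can be observed by brute force. The Python sketch below is an illustration only, with helper names of our own. It interprets equality with $I$ in $\textnormal{PSL}_{2}(\mathbb{Z}/c\mathbb{Z})$ as congruence to $\gamma I$ with $\gamma^{2}=1$, as in the proofs of this section. It counts all solutions to (6.1), and separately counts the integer solutions with $h_{1}=h_{3}=\cdots=h_{q-1}=0$.

```python
from itertools import product


def mul(A, B, c):
    """Multiply 2x2 matrices stored as (a, b, e, f) = [[a, b], [e, f]] mod c."""
    a, b, e, f = A
    p, q, r, s = B
    return ((a*p + b*r) % c, (a*q + b*s) % c, (e*p + f*r) % c, (e*q + f*s) % c)


def word(hs, c, a1=1, a2=1):
    """T^{a1 h1} S T^{a2 h2} S ... T^{a2 hq} S modulo c."""
    M = (1 % c, 0, 0, 1 % c)
    for i, h in enumerate(hs, start=1):
        a = a1 if i % 2 == 1 else a2
        # T^x S = [[x, -1], [1, 0]]
        M = mul(M, ((a * h) % c, (-1) % c, 1 % c, 0), c)
    return M


def is_identity_in_psl2(M):
    """A determinant-1 matrix is gamma*I with gamma^2 = 1 exactly when its
    off-diagonal entries vanish and its diagonal entries agree."""
    a, b, e, f = M
    return b == 0 and e == 0 and a == f


def count_solutions(q, c, H1, H2, a1=1, a2=1):
    bounds = [H1 if i % 2 == 1 else H2 for i in range(1, q + 1)]
    ranges = [range(-H, H + 1) for H in bounds]
    return sum(is_identity_in_psl2(word(hs, c, a1, a2)) for hs in product(*ranges))


def count_type_ii(q, c, H2):
    """Tuples (h2, h4, ..., hq) in [-H2, H2] with h2 + h4 + ... + hq = 0 mod c."""
    return sum(sum(hs) % c == 0 for hs in product(range(-H2, H2 + 1), repeat=q // 2))


print(count_solutions(4, 7, 2, 3), count_type_ii(4, 7, 3))
```

The first printed count should be at least the second, since every tuple counted by `count_type_ii` extends (by zeros in the odd positions) to a solution of (6.1).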

Optimistically, we may expect the lower bound in Lemma 6.1 to be essentially sharp.

Conjecture 6.2.

Let $c,q\in\mathbb{Z}_{+}$ with $q$ even. For all $a_{1},a_{2}\in(\mathbb{Z}/c\mathbb{Z})^{\times}$ and $1\leq H_{1}\leq H_{2}\ll c$ , the number of solutions $(h_{1},\ldots,h_{q})\in\mathbb{Z}^{q}$ to ˜6.1 is at most

\ll_{q}c^{o(1)}\left(H_{2}^{(q-2)/2}+\frac{(H_{1}H_{2})^{q/2}}{c^{3}}\right).

(6.3)

Remark.

When $c$ is large enough in terms of $H_{1},H_{2},q$ (so in particular, the first term in (6.3) dominates), Conjecture 6.2 becomes a statement about $\textnormal{PSL}_{2}(\mathbb{Z})$, which follows from a somewhat tedious combinatorial computation. When $q$ is large enough in terms of $H_{1},H_{2},c$ (so in particular, the second term in (6.3) dominates), Conjecture 6.2 can be attacked using expansion methods; see [37, Lemma 53 and Theorem 50] for the case when $c$ is prime. However, note that (6.3) saves at best $c^{3}$ over the trivial bound of $(H_{1}H_{2})^{q/2}$, and this saving becomes $c^{3/q}$ in our final bounds; therefore, using a large value of $q$ ultimately produces a quantitatively weak power saving. On the other hand, if $q$ is too small, then combining Conjecture 6.2 with Proposition 5.5 gives information about a small moment of singular values, which produces a weak bound for the top singular value.

Because of this, Conjecture 6.2 is most relevant in the median range when $q\asymp 1$, say $q\in\{6,8,10\}$, and $H_{1},H_{2}\in[\sqrt{c},c]$. It seems very difficult to fully establish Conjecture 6.2 in these cases, but we can nevertheless make some partial progress towards it. When $c$ is prime, the cases $q\in\{4,6\}$ are also related to some computations of Shkredov [38, Lemma 15].

Lemma 6.3.

Conjecture 6.2 holds if $q=2$ or $q=4$.

Proof.

When $q=2$ , the equation in ˜6.1 reads $T^{a_{1}h_{1}}S=ST^{-a_{2}h_{2}}$ , which implies the entry-wise congruence

\begin{pmatrix}a_{1}h_{1}&-1\\ 1&0\end{pmatrix}\equiv\gamma\begin{pmatrix}0&-1\\ 1&-a_{2}h_{2}\end{pmatrix}\ (\textnormal{mod }c),

for some $\gamma\in\mathbb{Z}/c\mathbb{Z}$ with $\gamma^{2}=1$ . This actually forces $\gamma=1$ , and $h_{1},h_{2}\equiv 0\ (\textnormal{mod }c)$ . Since $H_{1},H_{2}\ll c$ , we obtain $O(1)$ choices of $h_{1},h_{2}$ , which matches the bound from ˜6.3.

When $q=4$ , the equation in ˜6.1 reads $T^{a_{1}h_{1}}ST^{a_{2}h_{2}}S=ST^{-a_{2}h_{4}}ST^{-a_{1}h_{3}}$ , which translates to

\begin{pmatrix}a_{1}a_{2}h_{1}h_{2}-1&-a_{1}h_{1}\\ a_{2}h_{2}&-1\end{pmatrix}\equiv\gamma\begin{pmatrix}-1&a_{1}h_{3}\\ -a_{2}h_{4}&a_{1}a_{2}h_{3}h_{4}-1\end{pmatrix}\ (\textnormal{mod }c),

for some $\gamma\in\mathbb{Z}/c\mathbb{Z}$ with $\gamma^{2}=1$. Since there are $c^{o(1)}$ such values of $\gamma$ (this follows from a local computation via Hensel’s lemma, and the divisor bound), we may fix $\gamma$ up to an acceptable loss. To establish the desired bound of $O(c^{o(1)}H_{2})$ for the number of solutions $(h_{1},h_{2},h_{3},h_{4})$ with $|h_{1}|,|h_{3}|\leq H_{1}$ and $|h_{2}|,|h_{4}|\leq H_{2}$, we split into three cases.

Case 1: $h_{1}=0$ . This forces $h_{3}\equiv 0\ (\textnormal{mod }c)$ , and each choice of $h_{2}$ induces a unique residue of $h_{4}\ (\textnormal{mod }c)$ . Since $H_{1},H_{2}\ll c$ , this gives a total of $O(c^{o(1)}H_{2})$ solutions.

Case 2: $h_{2}=0$ . This forces $h_{4}\equiv 0\ (\textnormal{mod }c)$ , and each choice of $h_{1}$ induces a unique residue of $h_{3}\ (\textnormal{mod }c)$ . Since $H_{1},H_{2}\ll c$ , this gives a total of $O(c^{o(1)}H_{1})$ solutions, and recall $H_{1}\leq H_{2}$ .

Case 3: $h_{1}h_{2}\neq 0$ . Then the congruence $a_{1}a_{2}h_{1}h_{2}-1\equiv-\gamma\ (\textnormal{mod }c)$ fixes the residue of $h_{1}h_{2}\ (\textnormal{mod }c)$ , leaving $O(1+\tfrac{H_{1}H_{2}}{c})$ possible values of $h_{1}h_{2}$ , each of which gives $O(c^{o(1)})$ choices of $h_{1},h_{2}$ by the divisor bound. This gives a total of $\ll c^{o(1)}(1+\tfrac{H_{1}H_{2}}{c})\ll c^{o(1)}H_{2}$ solutions. ∎
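The matrix entries used in the case $q=4$ come from a direct multiplication, and can be double-checked over the integers. The following Python sketch (with illustrative names of our own, and with $S$, $T$ taken as the standard generators, so that $T^{x}S$ has rows $(x,-1)$ and $(1,0)$) compares both products with closed-form entries at random integer inputs.

```python
import random


def mul(A, B):
    """Multiply two 2x2 integer matrices given as ((a, b), (c, d))."""
    (a, b), (c, d) = A
    (e, f), (g, h) = B
    return ((a*e + b*g, a*f + b*h), (c*e + d*g, c*f + d*h))


S = ((0, -1), (1, 0))


def T(x):
    return ((1, x), (0, 1))


def lhs(a1, a2, h1, h2):
    """T^{a1 h1} S T^{a2 h2} S."""
    return mul(mul(T(a1 * h1), S), mul(T(a2 * h2), S))


def rhs(a1, a2, h3, h4):
    """S T^{-a2 h4} S T^{-a1 h3}."""
    return mul(mul(S, T(-a2 * h4)), mul(S, T(-a1 * h3)))


random.seed(0)
ok = True
for _ in range(200):
    a1, a2, h1, h2, h3, h4 = (random.randint(-50, 50) for _ in range(6))
    ok &= lhs(a1, a2, h1, h2) == ((a1*a2*h1*h2 - 1, -a1*h1), (a2*h2, -1))
    ok &= rhs(a1, a2, h3, h4) == ((-1, a1*h3), (-a2*h4, a1*a2*h3*h4 - 1))
print(ok)
```

Comparing the off-diagonal entries of the two closed forms gives the congruences $h_{1}\equiv-\gamma h_{3}$ and $h_{2}\equiv-\gamma h_{4}$ modulo $c$ that drive Cases 1 and 2.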

Proposition 6.4 (Combinatorial count for $q=6$ ).

Let $c\in\mathbb{Z}_{+}$ , $a_{1},a_{2}\in(\mathbb{Z}/c\mathbb{Z})^{\times}$ , and $1\leq H_{1}\leq H_{2}\ll c$ . The number of solutions $(h_{1},\ldots,h_{6})\in\mathbb{Z}^{6}$ to ˜6.1 with $q=6$ is at most

\ll c^{o(1)}\left(H_{2}^{2}+\frac{(H_{1}H_{2})^{2}}{c}\right).

(6.4)

Remark.

Conjecture 6.2 would replace the second term in (6.4) with $\tfrac{(H_{1}H_{2})^{3}}{c^{3}}$. In particular, Proposition 6.4 establishes Conjecture 6.2 when $q=6$ and either $H_{1}^{2}\ll c$ or $H_{1},H_{2}\asymp c$.

Proof of Proposition 6.4.

To simplify the exposition, we focus on the case $a_{1}=a_{2}=1$ . The proof is almost completely unchanged when $a_{1},a_{2}\in(\mathbb{Z}/c\mathbb{Z})^{\times}$ are arbitrary. We may then write the equation in ˜6.1 (with $q=6$ ) as

T^{h_{1}}ST^{h_{2}}ST^{h_{3}}ST^{h_{4}}S=ST^{-h_{6}}ST^{-h_{5}}.

A short computation brings this to the entry-wise congruence

\begin{pmatrix}h_{1}h_{2}h_{3}h_{4}-h_{1}h_{4}-h_{3}h_{4}-h_{1}h_{2}+1&-h_{1}h_{2}h_{3}+h_{1}+h_{3}\\ h_{2}h_{3}h_{4}-h_{2}-h_{4}&-h_{2}h_{3}+1\end{pmatrix}\equiv\gamma\begin{pmatrix}-1&h_{5}\\ -h_{6}&h_{5}h_{6}-1\end{pmatrix}\ (\textnormal{mod }c)

for some $\gamma\in\mathbb{Z}/c\mathbb{Z}$ with $\gamma^{2}=1$ . As before, since there are $c^{o(1)}$ possible values of $\gamma$ , we may as well regard $\gamma$ as fixed. Since both sides have determinant $1$ , this is actually a system of three congruences:

\begin{cases}1-h_{2}h_{3}\equiv\gamma(h_{5}h_{6}-1)\ (\textnormal{mod }c),\\ h_{1}(1-h_{2}h_{3})+h_{3}\equiv\gamma h_{5}\ (\textnormal{mod }c),\\ h_{4}(1-h_{2}h_{3})+h_{2}\equiv\gamma h_{6}\ (\textnormal{mod }c).\end{cases}

(6.5)
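Only the necessity of these three congruences is needed for an upper bound, and it can be tested exhaustively for small moduli. The Python sketch below (illustrative, with helper names of our own, and with $a_{1}=a_{2}=1$) enumerates all residue tuples $(h_{1},\ldots,h_{6})$ modulo $c$, and checks that every solution of the matrix equation satisfies the system (6.5) for some $\gamma$ with $\gamma^{2}=1$.

```python
from itertools import product


def mul(A, B, c):
    """Multiply 2x2 matrices stored as (a, b, e, f) = [[a, b], [e, f]] mod c."""
    a, b, e, f = A
    p, q, r, s = B
    return ((a*p + b*r) % c, (a*q + b*s) % c, (e*p + f*r) % c, (e*q + f*s) % c)


def word(hs, c):
    """T^{h1} S T^{h2} S ... T^{h6} S modulo c, using T^x S = [[x, -1], [1, 0]]."""
    M = (1 % c, 0, 0, 1 % c)
    for h in hs:
        M = mul(M, (h % c, (-1) % c, 1 % c, 0), c)
    return M


def satisfies_system(h, gamma, c):
    """The three congruences of the system (6.5) for a fixed gamma."""
    h1, h2, h3, h4, h5, h6 = h
    u = 1 - h2 * h3
    return ((u - gamma * (h5 * h6 - 1)) % c == 0
            and (h1 * u + h3 - gamma * h5) % c == 0
            and (h4 * u + h2 - gamma * h6) % c == 0)


def all_solutions_satisfy_system(c):
    gammas = [g for g in range(c) if (g * g - 1) % c == 0]
    for h in product(range(c), repeat=6):
        a, b, e, f = word(h, c)
        if b == 0 and e == 0 and a == f:  # word is gamma*I, i.e. trivial in PSL_2
            if not any(satisfies_system(h, g, c) for g in gammas):
                return False
    return True


print(all_solutions_satisfy_system(5), all_solutions_satisfy_system(6))
```

The sketch deliberately tests one direction only: when $1-h_{2}h_{3}$ is not a unit modulo $c$, the three congruences need not determine the top-left entry.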

Our argument now requires some casework.

Case 1: One has $h_{j}=0$ for some $j\in\{1,\ldots,6\}$ . Since the original equation can also be written as $T^{h_{j-1}}ST^{h_{j}}ST^{h_{j+1}}ST^{h_{j+2}}ST^{h_{j+3}}ST^{h_{j+4}}S=I$ in $\textnormal{PSL}_{2}(\mathbb{Z}/c\mathbb{Z})$ (viewing indices modulo $6$ ), we may assume without loss of generality that $j=2$ , up to potentially swapping $H_{1}$ and $H_{2}$ in the final bound (so we momentarily forget that $H_{1}\leq H_{2}$ ). So let us say $h_{2}=0$ , which reduces ˜6.5 to

\begin{cases}1\equiv\gamma(h_{5}h_{6}-1)\ (\textnormal{mod }c),\\ h_{1}+h_{3}\equiv\gamma h_{5}\ (\textnormal{mod }c),\\ h_{4}\equiv\gamma h_{6}\ (\textnormal{mod }c).\end{cases}

(6.6)

Subcase 1.1: One has $h_{5}=0$ . Then for any values of $h_{1}$ and $h_{6}$ , the system in ˜6.6 leaves only $O(1)$ possibilities for $h_{3},h_{4}$ (since $H_{1},H_{2}\ll c$ ). This gives $O(H_{1}H_{2})$ solutions.

Subcase 1.2: One has $h_{6}=0$ . Then for any values of $h_{1}$ and $h_{5}$ , the system in ˜6.6 leaves only $O(1)$ possibilities for $h_{3},h_{4}$ . This gives $O(H_{1}^{2})$ solutions.

Subcase 1.3: One has $h_{5}h_{6}\neq 0$. Then the first congruence in (6.6) fixes $h_{5}h_{6}\ (\textnormal{mod }c)$, leaving $O(1+\tfrac{H_{1}H_{2}}{c})$ possibilities for the nonzero integer $h_{5}h_{6}$, each of which gives $O(c^{o(1)})$ possible values for $h_{5},h_{6}$ by the divisor bound. Once $h_{5}$ and $h_{6}$ are fixed, each value of $h_{1}$ produces $O(1)$ final solutions. This gives $O(c^{o(1)}(1+\tfrac{H_{1}H_{2}}{c})H_{1})$ solutions.

From Case 1, we obtain a total number of solutions of

\ll c^{o(1)}\left(H_{1}^{2}+H_{2}^{2}+\left(1+\frac{H_{1}H_{2}}{c}\right)(H_{1}+H_{2})\right),

which is $O(c^{o(1)}H_{2}^{2})$ once we remember that $H_{1}\leq H_{2}$ and $H_{1}\ll c$ . This is acceptable in ˜6.4.

Case 2: One has $h_{j}\neq 0$ for all $j\in\{1,\ldots,6\}$. We fix $d:=(1-h_{2}h_{3},c)=(h_{5}h_{6}-1,c)$ up to an acceptable $O(c^{o(1)})$ loss. Since $h_{5}h_{6}\equiv 1\ (\textnormal{mod }d)$, there are $O(1+\frac{H_{1}H_{2}}{d})$ ways to pick the nonzero integer $h_{5}h_{6}$, each of which gives $O(c^{o(1)})$ ways to pick $h_{5},h_{6}$ by the divisor bound.

Once $h_{5},h_{6}$ are fixed, we pick $h_{2},h_{3}$ subject to the system

\begin{cases}1-h_{2}h_{3}\equiv\gamma(h_{5}h_{6}-1)\ (\textnormal{mod }c),\\ h_{2}\equiv\gamma h_{6}=:r\ (\textnormal{mod }d),\end{cases}

(6.7)

which follows from (6.5). We can do this in either of the following ways:

•

Pick the nonzero integer $h_{2}h_{3}$ subject to its residue mod $c$ (due to the first congruence in ˜6.7) in $O(1+\tfrac{H_{1}H_{2}}{c})$ ways, and then $h_{2},h_{3}$ in $O(c^{o(1)})$ ways by the divisor bound.
•

For each choice of $h_{2}$ with $|h_{2}|\leq H_{2}$ and $h_{2}\equiv r\ (\textnormal{mod }d)$ , pick $h_{3}$ subject to its residue mod $\tfrac{c}{(h_{2},c)}$ (again, due to the first congruence in ˜6.7) in $O(1+\frac{H_{1}}{c}(h_{2},c))$ ways.

Finally, once $h_{5},h_{6},h_{2},h_{3}$ are fixed, (6.5) determines the residues of $h_{1}$ and $h_{4}$ modulo $\tfrac{c}{d}$, so there are $O((1+\tfrac{H_{1}}{c}d)(1+\tfrac{H_{2}}{c}d))$ choices of $h_{1},h_{4}$. From Case 2, we obtain a total number of solutions of

	$\displaystyle\ll c^{o(1)}\max_{d\mid c}\underbrace{\left(1+\frac{H_{1}H_{2}}{d}\right)}_{\text{from picking }h_{5},h_{6}}\underbrace{\min\Bigg(1+\frac{H_{1}H_{2}}{c},\max_{r\in(\mathbb{Z}/d\mathbb{Z})^{\times}}\sum_{\begin{subarray}{c}|h_{2}|\leq H_{2}\\ h_{2}\equiv r\ (\textnormal{mod }d)\end{subarray}}\left(1+\frac{H_{1}}{c}(h_{2},c)\right)\Bigg)}_{\text{from picking }h_{2},h_{3}}$
	$\displaystyle\times\underbrace{\left(1+\frac{H_{1}}{c}d\right)}_{\text{from picking }h_{1}}\ \underbrace{\left(1+\frac{H_{2}}{c}d\right)}_{\text{from picking }h_{4}}.$		(6.8)

To bound the sum over $h_{2}$ , we write

	$\displaystyle\sum_{\begin{subarray}{c}|h_{2}|\leq H_{2}\\ h_{2}\equiv r\ (\textnormal{mod }d)\end{subarray}}(h_{2},c)$	$\displaystyle\leq\sum_{\begin{subarray}{c}g\mid c\\ (g,d)=1\end{subarray}}g\sum_{\begin{subarray}{c}|h_{2}|\leq H_{2}\\ g\mid h_{2}\equiv r\ (\textnormal{mod }d)\end{subarray}}1$
		$\displaystyle=\sum_{\begin{subarray}{c}g\mid\frac{c}{d}\\ (g,d)=1\end{subarray}}g\sum_{\begin{subarray}{c}|h_{2}^{\prime}|\leq\frac{H_{2}}{g}\\ h_{2}^{\prime}\equiv\overline{g}r\ (\textnormal{mod }d)\end{subarray}}1$
		$\displaystyle\ll\sum_{\begin{subarray}{c}g\mid\frac{c}{d}\\ (g,d)=1\end{subarray}}g\left(1+\frac{H_{2}}{gd}\right)\ll c^{o(1)}\left(\frac{c}{d}+\frac{H_{2}}{d}\right),$

and the term $\tfrac{H_{2}}{d}$ can be omitted since $H_{2}\ll c$ . Plugging this into ˜6.8 gives a total count of

\ll c^{o(1)}\max_{d\mid c}\left(1+\frac{H_{1}H_{2}}{d}\right)\left(1+\frac{H_{2}}{c}d\right)\left(1+\frac{H_{1}}{c}d\right)\min\left(1+\frac{H_{1}H_{2}}{c},1+\frac{H_{2}}{d}+\frac{H_{1}}{c}\cdot\frac{c}{d}\right)

Since $H_{1}\leq H_{2}$ , the final term of $\tfrac{H_{1}}{c}\cdot\tfrac{c}{d}=\tfrac{H_{1}}{d}$ can be omitted. The bound above now becomes

		$\displaystyle\ll c^{o(1)}\max_{d\mid c}\left(1+\frac{H_{1}H_{2}}{d}\right)\left(1+\frac{H_{2}}{c}d\right)\max\left(\frac{H_{1}}{c}d,1\right)\left(1+\frac{H_{2}}{d}\min\left(\frac{H_{1}}{c}d,1\right)\right)$
		$\displaystyle=c^{o(1)}\max_{d\mid c}\left(1+\frac{H_{1}H_{2}}{d}\right)\left(1+\frac{H_{2}}{c}d\right)\left(\max\left(\frac{H_{1}}{c}d,1\right)+\frac{H_{2}}{d}\left(\frac{H_{1}}{c}d\cdot 1\right)\right)$
		$\displaystyle\leq c^{o(1)}\max_{d\mid c}\left(1+\frac{H_{1}H_{2}}{d}\right)\left(1+\frac{H_{2}}{c}d\right)\left(1+\frac{H_{1}}{c}d+\frac{H_{1}H_{2}}{c}\right).$

After expanding the expression inside the maximum, each term is either strictly increasing, constant, or strictly decreasing in $d\in[1,c]$ , so each term is maximized when $d=1$ or $d=c$ . It follows that we can bound the maximum over $d\mid c$ , up to a constant, by looking only at the extreme points $d=1$ and $d=c$ . This gives a total count of

	$\displaystyle\ll c^{o(1)}\max\left(H_{1}H_{2}\left(1+\frac{H_{2}}{c}\right)\left(1+\frac{H_{1}}{c}+\frac{H_{1}H_{2}}{c}\right),\left(1+\frac{H_{1}H_{2}}{c}\right)H_{2}\left(H_{1}+\frac{H_{1}H_{2}}{c}\right)\right)$
	$\displaystyle\ll c^{o(1)}\max\left(H_{1}H_{2}\left(1+\frac{H_{1}H_{2}}{c}\right),\left(1+\frac{H_{1}H_{2}}{c}\right)H_{2}H_{1}\right),$

where, to reach the last line, we used that $H_{1},H_{2}\ll c$ . This establishes the desired bound. ∎
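The endpoint argument in the final step of the proof above can be illustrated numerically. The expanded product has $2\cdot 2\cdot 3=12$ monomials in $d$, each monotone on $[1,c]$, so the maximum over $d\mid c$ is at most $12$ times the larger endpoint value. The Python sketch below, with function names of our own choosing, evaluates this ratio for a few sample parameters.

```python
def bound_expression(d, c, H1, H2):
    """(1 + H1*H2/d) * (1 + H2*d/c) * (1 + H1*d/c + H1*H2/c), as a float."""
    return (1 + H1 * H2 / d) * (1 + H2 * d / c) * (1 + H1 * d / c + H1 * H2 / c)


def endpoint_ratio(c, H1, H2):
    """Max of the expression over divisors d | c, divided by the larger of
    its two endpoint values (d = 1 and d = c)."""
    divisors = [d for d in range(1, c + 1) if c % d == 0]
    best = max(bound_expression(d, c, H1, H2) for d in divisors)
    endpoints = max(bound_expression(1, c, H1, H2), bound_expression(c, c, H1, H2))
    return best / endpoints


ratios = [endpoint_ratio(c, H1, H2)
          for c in (60, 360, 1001)
          for H1 in (1, 5, 30)
          for H2 in (30, 60)]
print(max(ratios))
```

Every ratio lies between $1$ and $12$, consistent with restricting to $d=1$ and $d=c$ at the cost of a constant factor.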

Finally, we may remove the restriction that $H_{1},H_{2}\ll c$ from the counting results in this section up to some additional factors.

Corollary 6.5.

Let $c\in\mathbb{Z}_{+}$ , $a_{1},a_{2}\in(\mathbb{Z}/c\mathbb{Z})^{\times}$ , and $1\leq H_{1}\leq H_{2}$ . Then the number of solutions to ˜6.1 with $q=4$ is at most

\ll c^{o(1)}\left(1+\frac{H_{1}^{2}}{c^{2}}\right)\left(1+\frac{H_{2}}{c}\right)H_{2},

(6.9)

and the number of solutions to ˜6.1 with $q=6$ is at most

\ll c^{o(1)}\left(1+\frac{H_{1}H_{2}}{c}+\frac{H_{1}^{3}H_{2}}{c^{3}}\right)H_{2}^{2}.

(6.10)

Proof.

Since the equation in ˜6.1 only depends on the residues of $h_{1},\ldots,h_{q}\ (\textnormal{mod }c)$ , we may as well count solutions to the system

\begin{cases}T^{a_{1}h_{1}^{\prime}}ST^{a_{2}h_{2}^{\prime}}S\cdots T^{a_{1}h_{q-1}^{\prime}}ST^{a_{2}h_{q}^{\prime}}S=I\text{ in }\textnormal{PSL}_{2}(\mathbb{Z}/c\mathbb{Z}),\\ |h_{i}^{\prime}|\leq\min(H_{j},c),\text{ for all }i,j\text{ with }i\equiv j\ (\textnormal{mod }2),\end{cases}

(6.11)

and multiply the final count by a factor of $\ll_{q}(1+\tfrac{H_{1}}{c})^{q/2}(1+\tfrac{H_{2}}{c})^{q/2}$ (indeed, each solution $(h_{1}^{\prime},\ldots,h_{q}^{\prime})$ to ˜6.11 induces at most this many solutions $(h_{1},\ldots,h_{q})$ to ˜6.1 with the same residues modulo $c$ , and all solutions to ˜6.1 can be obtained this way).

If $q=4$ , Lemma˜6.3 applied for $\min(H_{1},c)$ and $\min(H_{2},c)$ gives a total number of solutions of

\ll c^{o(1)}\left(1+\frac{H_{1}}{c}\right)^{2}\left(1+\frac{H_{2}}{c}\right)^{2}\min(H_{2},c),

and the bound in ˜6.9 follows by noting that $\min(H_{2},c)\asymp H_{2}\left(1+\frac{H_{2}}{c}\right)^{-1}$ .

Similarly, if $q=6$ , Proposition˜6.4 applied for $\min(H_{1},c)$ and $\min(H_{2},c)$ gives a total count of

		$\displaystyle\ll c^{o(1)}\left(1+\frac{H_{1}}{c}\right)^{3}\left(1+\frac{H_{2}}{c}\right)^{3}\left(\min(H_{2},c)^{2}+\frac{\min(H_{1},c)^{2}\min(H_{2},c)^{2}}{c}\right)$
		$\displaystyle\ll c^{o(1)}\left(1+\frac{H_{1}^{3}}{c^{3}}\right)\left(1+\frac{H_{2}}{c}\right)H_{2}^{2}+c^{o(1)}\left(1+\frac{H_{1}}{c}\right)\left(1+\frac{H_{2}}{c}\right)\frac{H_{1}^{2}H_{2}^{2}}{c}$
		$\displaystyle\ll c^{o(1)}\left(1+\frac{H_{2}}{c}+\frac{H_{1}^{3}}{c^{3}}+\frac{H_{1}^{3}H_{2}}{c^{4}}\right)H_{2}^{2}+c^{o(1)}\left(1+\frac{H_{2}}{c}+\frac{H_{1}H_{2}}{c^{2}}\right)\frac{H_{1}^{2}H_{2}^{2}}{c}.$

We note that the third and fourth terms in the first parenthesis above can be ignored: their contribution to the final bound is $\tfrac{H_{1}^{3}H_{2}^{2}}{c^{3}}+\tfrac{H_{1}^{3}H_{2}^{3}}{c^{4}}$ , which is superseded by the contribution of $\tfrac{H_{1}^{3}H_{2}^{3}}{c^{3}}$ from the third term in the second parenthesis. This gives a total count of

\ll c^{o(1)}\left(1+\frac{H_{2}}{c}+\frac{H_{1}^{2}}{c}+\frac{H_{1}^{2}H_{2}}{c^{2}}+\frac{H_{1}^{3}H_{2}}{c^{3}}\right)H_{2}^{2}.

(6.12)

This is bounded by (6.10), noting that $\tfrac{H_{1}^{2}H_{2}}{c^{2}}$ is the geometric mean of $\tfrac{H_{1}H_{2}}{c}$ and $\tfrac{H_{1}^{3}H_{2}}{c^{3}}$.  ∎
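Two elementary facts carry the bookkeeping in the proof above. The first is the comparison $\min(H,c)\asymp H(1+\tfrac{H}{c})^{-1}$. The second is the geometric-mean identity used in the last step. Both can be checked exactly with rational arithmetic, as in the following Python sketch (the function names are ours and illustrative).

```python
from fractions import Fraction


def smoothed_min(H, c):
    """H * (1 + H/c)^(-1) = H*c / (H + c), as an exact fraction."""
    return Fraction(H * c, H + c)


def geometric_mean_identity(H1, H2, c):
    """Check (H1^2 H2 / c^2)^2 == (H1 H2 / c) * (H1^3 H2 / c^3)."""
    middle = Fraction(H1 ** 2 * H2, c ** 2)
    return middle ** 2 == Fraction(H1 * H2, c) * Fraction(H1 ** 3 * H2, c ** 3)


# min(H, c) lies between the smoothed minimum and twice the smoothed minimum.
checks = [
    smoothed_min(H, c) <= min(H, c) <= 2 * smoothed_min(H, c)
    for c in range(1, 30)
    for H in range(1, 200)
]
print(all(checks), geometric_mean_identity(7, 11, 13))
```

The two-sided comparison holds with constants $1$ and $2$, so replacing $\min(H_{j},c)$ by $H_{j}(1+\tfrac{H_{j}}{c})^{-1}$ costs only a bounded factor.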

7. Bilinear forms with Kloosterman sums

We now combine the work in Sections 4, 5 and 6 to deduce our main results, Theorems 1.2 and 1.1.

7.1. Composite moduli

Here we prove a generalization of Theorem 1.2, which allows for larger values of $M,N$. We state our upper bound in two ways, to facilitate comparison with (1.2).

Theorem 7.1.

Let $c=dd^{\prime}e$ for some $d,d^{\prime},e\in\mathbb{Z}_{+}$ with $d^{\prime}\mid d$ and $(d,e)=1$, and $f\leq\sqrt{cd}$ be the largest integer with $f^{2}\mid cd$. Let $\mathcal{I},\mathcal{J}\subset\mathbb{Z}$ be intervals of lengths $|\mathcal{I}|=M$, $|\mathcal{J}|=N$, with $1\leq N\leq M\leq c$ (the assumption that $N\leq M$ is only included to shorten the statement of Theorem 7.1; one can of course swap $m$ and $n$ in the bilinear sum, up to swapping $M$ and $N$ in the upper bound). Then for any complex sequences $(\alpha_{m})_{m\in\mathcal{I}}$, $(\beta_{n})_{n\in\mathcal{J}}$ and $a\in(\mathbb{Z}/c\mathbb{Z})^{\times}$, one has

	$\displaystyle\mathop{\sum\sum}_{\begin{subarray}{c}m\in\mathcal{I},n\in\mathcal{J}\\ (m,n,c)=1\end{subarray}}\alpha_{m}\beta_{n}S(am,n;c)$	$\displaystyle\ll\|\alpha\|\|\beta\|c^{1+o(1)}\left(\frac{dM^{3}N}{c^{3}}+\frac{fM^{2}}{c^{2}}+\frac{f}{d^{2}}\right)^{\frac{1}{6}}$
		$\displaystyle=\|\alpha\|\|\beta\|c^{o(1)}\sqrt{MNc}\left(\frac{d}{N^{2}}+\frac{fc}{MN^{3}}+\frac{fc^{3}}{d^{2}M^{3}N^{3}}\right)^{\frac{1}{6}}.$
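The equality of the two forms of the bound is a routine rescaling, and it can be confirmed exactly by comparing sixth powers. The Python sketch below is only a sanity check, with function names of our own; it treats $c,d,f,M,N$ as arbitrary positive integers, since the identity is purely algebraic.

```python
from fractions import Fraction


def first_form_sixth_power(c, d, f, M, N):
    """Sixth power of c * (d M^3 N / c^3 + f M^2 / c^2 + f / d^2)^(1/6)."""
    inner = Fraction(d * M**3 * N, c**3) + Fraction(f * M**2, c**2) + Fraction(f, d**2)
    return c**6 * inner


def second_form_sixth_power(c, d, f, M, N):
    """Sixth power of sqrt(M N c) * (d/N^2 + f c/(M N^3) + f c^3/(d^2 M^3 N^3))^(1/6)."""
    inner = (Fraction(d, N**2) + Fraction(f * c, M * N**3)
             + Fraction(f * c**3, d**2 * M**3 * N**3))
    return (M * N * c)**3 * inner


samples = [(12, 2, 2, 7, 5), (1000, 10, 100, 300, 40), (91, 7, 7, 50, 50)]
print(all(first_form_sixth_power(*s) == second_form_sixth_power(*s) for s in samples))
```

The second form makes the saving over $\sqrt{MNc}$ explicit, which is the shape in which the bound is compared with (1.2).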

Example 7.2.

Suppose $M\asymp N$ . The smallest value of $N$ for which Theorem˜7.1 can beat the Weil bound in ˜1.2 is $N=c^{2/5+o(1)}$ , attained, e.g., when $c=pq$ where $p,q$ are distinct primes with $p\asymp q^{3/2}$ . The largest value of $N$ for which Theorem˜7.1 can beat the Fourier-theoretic bound in ˜1.2 is $N=c^{3/4-o(1)}$ , attained when $c$ has a divisor $d=c^{o(1)}$ such that $c/d$ is square-free.

Remark.

Additional savings are possible in Theorem˜7.1 in the unbalanced range $M>N$ . Firstly, the bound in ˜6.10 can be refined to ˜6.12, but we omit this optimization for the sake of simplicity. Secondly and more substantially, bounding the largest singular value of an $M\times N$ matrix by the sixth moment of its singular values (as we do) can be particularly lossy if $M>N$ , since then the singular values often exhibit concentration near their maximum; one can try to amend this by subtracting a suitable main term from the sixth moment, as in [18, Lemma 4.2].

Proof of Theorem 7.1.

Let $\varepsilon>0$ and $H_{1}:=c^{1+\varepsilon}M^{-1}$ , $H_{2}:=c^{1+\varepsilon}N^{-1}$ . Since $M\geq N$ , we have $H_{1}\leq H_{2}$ . Let $q$ be an even positive integer.

Then by the characterization of operator norms from ˜3.4, Corollary˜4.11 (using the notation from ˜4.13 and 4.14), the fact that $\|A\|\leq\|A\|_{S^{q}}$ for any linear map $A$ , and then Proposition˜5.5, we have that

$\displaystyle\left|\mathop{\sum\sum}_{\begin{subarray}{c}m\in\mathcal{I},n\in\mathcal{J}\\ (m,n,c)=1\end{subarray}}\alpha_{m}\beta_{n}S(am,n;c)\right|$	$\displaystyle\leq\|\alpha\|\|\beta\|\|K_{c,a}^{\mathcal{I},\mathcal{J}}\|$	(7.1)
	$\displaystyle\leq\|\alpha\|\|\beta\|\left(c^{1+2\varepsilon}\|\widehat{F}_{c,a}^{H_{1},H_{2}}(\rho_{c}^{\circ})\|+O_{\varepsilon}(c^{-100})\right)$
	$\displaystyle\ll_{\varepsilon}\|\alpha\|\|\beta\|\left(c^{1+3\varepsilon}\frac{d^{1/q}}{(H_{1}H_{2})^{1/2}}\mathscr{S}^{1/q}+O_{\varepsilon}(c^{-100})\right),$

where

\mathscr{S}:=\max_{\begin{subarray}{c}d\mid\tilde{d}\mid c\\ \tilde{f}^{2}\mid c\tilde{d}\end{subarray}}\tilde{f}\sum_{\begin{subarray}{c}h_{1},\ldots,h_{q}\in\mathbb{Z}\\ |h_{i}|\leq 2H_{j}\\ \forall i\equiv j\ (\textnormal{mod }2)\end{subarray}}\mathbbm{1}_{T^{\overline{a}h_{1}}ST^{h_{2}}S\cdots T^{\overline{a}h_{q-1}}ST^{h_{q}}S=I\text{ in }\textnormal{PSL}_{2}(\mathbb{Z}/\tilde{d}\mathbb{Z})}.

The inner sum is a count of solutions to an equation of the type ˜6.1 with $c$ replaced by $\tilde{d}$ , so we can estimate $\mathscr{S}$ using Corollary˜6.5. We will make use of the following quick fact. Recalling that $f$ is the maximal positive integer with $f^{2}\mid cd$ , we claim that for all integers $k\geq 0$ , one has

\max_{\begin{subarray}{c}d\mid\tilde{d}\mid c\\ \tilde{f}^{2}\mid c\tilde{d}\end{subarray}}\frac{\tilde{f}}{\tilde{d}^{k}}=\begin{cases}\frac{f}{d^{k}},&k\geq 1,\\ c,&k=0.\end{cases}

(7.2)

Indeed, when one appends a prime $p$ to $\tilde{d}$ , the numerator $\tilde{f}$ can increase by at most $p$ , so the expression $\tilde{f}/\tilde{d}^{k}$ cannot increase if $k\geq 1$ , and the maximum is attained when $\tilde{d}=d$ . On the other hand, if $k=0$ , then clearly $\tilde{f}\leq\sqrt{c\cdot c}=c$ , and the maximum is attained when $\tilde{d}=c$ .
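The claim (7.2) can also be tested exhaustively for small moduli. The Python sketch below (with illustrative helper names of our own) computes the left-hand side by running over all $\tilde{d}$ with $d\mid\tilde{d}\mid c$ and taking, for each, the largest $\tilde{f}$ with $\tilde{f}^{2}\mid c\tilde{d}$. It then compares the result with the claimed value.

```python
from fractions import Fraction
from math import isqrt


def largest_f(n):
    """Largest positive integer f with f^2 | n."""
    return max(f for f in range(1, isqrt(n) + 1) if n % (f * f) == 0)


def max_ratio(c, d, k):
    """Left side of (7.2): max of f~ / d~^k over d | d~ | c and f~^2 | c*d~."""
    return max(Fraction(largest_f(c * dt), dt ** k)
               for dt in range(d, c + 1, d) if c % dt == 0)


def claimed_value(c, d, k):
    """Right side of (7.2): c if k = 0, and f / d^k if k >= 1."""
    return Fraction(c) if k == 0 else Fraction(largest_f(c * d), d ** k)


holds = all(max_ratio(c, d, k) == claimed_value(c, d, k)
            for c in range(1, 81)
            for d in range(1, c + 1) if c % d == 0
            for k in range(0, 4))
print(holds)
```

For a fixed $\tilde{d}$ only the largest admissible $\tilde{f}$ matters, which is why the sketch does not loop over $\tilde{f}$ separately.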

Now set $q=6$ . From ˜6.10 and ˜7.2, we obtain that

\mathscr{S}\ll_{\varepsilon}c^{\varepsilon}\max_{\begin{subarray}{c}d\mid\tilde{d}\mid c\\ \tilde{f}^{2}\mid c\tilde{d}\end{subarray}}\tilde{f}\left(1+\frac{H_{1}H_{2}}{\tilde{d}}+\frac{H_{1}^{3}H_{2}}{\tilde{d}^{3}}\right)H_{2}^{2}\leq c^{\varepsilon}\left(c+\frac{fH_{1}H_{2}}{d}+\frac{fH_{1}^{3}H_{2}}{d^{3}}\right)H_{2}^{2}.

Plugging this into (7.1), we conclude that

	$\displaystyle\mathop{\sum\sum}_{\begin{subarray}{c}m\in\mathcal{I},n\in\mathcal{J}\\ (m,n,c)=1\end{subarray}}\alpha_{m}\beta_{n}S(am,n;c)$	$\displaystyle\ll_{\varepsilon}\|\alpha\|\|\beta\|c^{1+4\varepsilon}\frac{d^{1/6}}{(H_{1}H_{2})^{1/2}}\left(cH_{2}^{2}+\frac{fH_{1}H_{2}^{3}}{d}+\frac{fH_{1}^{3}H_{2}^{3}}{d^{3}}\right)^{1/6}$
		$\displaystyle=\|\alpha\|\|\beta\|c^{1+4\varepsilon}\left(\frac{cd}{H_{1}^{3}H_{2}}+\frac{f}{H_{1}^{2}}+\frac{f}{d^{2}}\right)^{1/6}.$

The desired bound follows by recalling that $H_{1}=c^{1+\varepsilon}M^{-1}$ , $H_{2}=c^{1+\varepsilon}N^{-1}$ . ∎

Remark.

Using ˜6.9 with $q=4$ instead of ˜6.10 with $q=6$ in the proof above leads to a final bound of

	$\displaystyle\mathop{\sum\sum}_{\begin{subarray}{c}m\in\mathcal{I},n\in\mathcal{J}\\ (m,n,c)=1\end{subarray}}\alpha_{m}\beta_{n}S(am,n;c)$	$\displaystyle\ll c^{o(1)}\|\alpha\|\|\beta\|c\left(\frac{dM^{2}N}{c^{2}}+\frac{fM^{2}}{c^{2}}+\frac{fN}{dc}+\frac{f}{d^{2}}\right)^{1/4}$
		$\displaystyle=c^{o(1)}\|\alpha\|\|\beta\|\sqrt{MNc}\left(\frac{d}{N}+\frac{f}{N^{2}}+\frac{fc}{dM^{2}N}+\frac{fc^{2}}{d^{2}M^{2}N^{2}}\right)^{1/4},$

which is weaker than Theorem˜7.1 in the main ranges of interest.

Proof of Theorem˜1.2.

Note that the result holds trivially if $c=O(1)$ . Since $M,N\ll c^{1/2+o(1)}$ and the result is symmetric in $M,N$ , we can assume without loss of generality that $N\leq M\leq c$ . One can then apply Theorem˜7.1, and since $M,N\ll c^{1/2+o(1)}$ , the upper bound becomes

\|\alpha\|\|\beta\|c^{1+o(1)}\left(\frac{d}{c}+\frac{f}{c}+\frac{f}{d^{2}}\right)^{\frac{1}{6}}.

The first term can be omitted in light of the bound $d\leq f$ (since $d^{2}\mid cd$ ). ∎

7.2. Near-prime moduli

Building towards an unconditional result for general moduli, we need to slightly develop the result of Kowalski–Michel–Sawin [24] from Theorem˜3.4.

Corollary 7.3 (Kowalski–Michel–Sawin bounds for near-primes).

Let $c=pq$ where $p$ is a prime, $q\in\mathbb{Z}_{+}$ , and $p\nmid q$ . Let $M,N\in\mathbb{Z}$ be integers such that $1\leq N\leq M\leq c$ . Then for any complex sequences $(\alpha_{m})_{m\leq M}$ and $(\beta_{n})_{n\leq N}$ , and any $a\in(\mathbb{Z}/c\mathbb{Z})^{\times}$ , one has

\displaystyle\mathop{\sum_{m=1}^{M}\sum_{n=1}^{N}}_{(m,n,c)=1}\alpha_{m}\beta_{n}S(am,n;c)\ll\|\alpha\|\|\beta\|c^{o(1)}\sqrt{MNc}\left(N^{-\frac{1}{2}}q+(MN)^{-\frac{3}{16}}c^{\frac{11}{64}}q^{\frac{53}{64}}\right).

Proof.

Using the twisted multiplicativity of Kloosterman sums and splitting the sums over $m,n$ according to their residues modulo $q$ , we can write

	$\displaystyle\mathop{\sum_{m=1}^{M}\sum_{n=1}^{N}}_{(m,n,pq)=1}\alpha_{m}\beta_{n}S(am,n;pq)$	$\displaystyle=\mathop{\sum_{m=1}^{M}\sum_{n=1}^{N}}_{(m,n,pq)=1}\alpha_{m}\beta_{n}S(a\overline{q}^{2}m,n;p)S(a\overline{p}^{2}m,n;q)$		(7.3)
		$\displaystyle=\mathop{\sum_{r=1}^{q}\sum_{s=1}^{q}}_{(r,s,q)=1}S(a\overline{p}^{2}r,s;q)\mathop{\sum_{m=1}^{M}\sum_{n=1}^{N}}_{(m,n,p)=1}\alpha_{m,r}\beta_{n,s}S(a\overline{q}^{2}m,n;p),$		(7.3)

where

\alpha_{m,r}:=\alpha_{m}\mathbbm{1}_{m\equiv r\ (\textnormal{mod }q)},\qquad\qquad\beta_{n,s}:=\beta_{n}\mathbbm{1}_{n\equiv s\ (\textnormal{mod }q)}.

First, we consider the contribution to ˜7.3 of those $m,n$ with $p\mid m$ (and $p\nmid n$ ) or $p\mid n$ (and $p\nmid m$ ). By the Weil and Ramanujan bounds from Lemmas˜3.3 and 3.2, this contribution is

	$\displaystyle\ll\mathop{\sum_{r=1}^{q}\sum_{s=1}^{q}}_{(r,s,q)=1}q^{\frac{1}{2}+o(1)}\mathop{\sum_{m=1}^{M}\sum_{n=1}^{N}}_{(m,n,p)=1}|\alpha_{m,r}\beta_{n,s}|$	$\displaystyle\ll q^{\frac{1}{2}+o(1)}\left(\sum_{m=1}^{M}|\alpha_{m}|\right)\left(\sum_{n=1}^{N}|\beta_{n}|\right)$
		$\displaystyle\ll\|\alpha\|\|\beta\|q^{o(1)}\sqrt{MNq}.$

Plugging this into ˜7.3 and applying the Weil bound from Lemma˜3.3, we find that

	$\displaystyle\mathop{\sum_{m=1}^{M}\sum_{n=1}^{N}}_{(m,n,pq)=1}\alpha_{m}\beta_{n}S(am,n;pq)\ll\mathop{\sum_{r=1}^{q}\sum_{s=1}^{q}}_{(r,s,q)=1}q^{\frac{1}{2}+o(1)}\left|\mathop{\sum_{m=1}^{M}\sum_{n=1}^{N}}_{p\nmid mn}\alpha_{m,r}\beta_{n,s}S(a\overline{q}^{2}m,n;p)\right|$		(7.4)
	$\displaystyle+\|\alpha\|\|\beta\|q^{o(1)}\sqrt{MNq}.$		(7.4)

The last bilinear sum over $m,n$ is now almost in the correct shape to apply Theorem˜3.4. Indeed, splitting it according to the residues of $m$ and $n$ modulo $p$ , we can rewrite it as

\mathop{\sum_{m=1}^{M}\sum_{n=1}^{N}}_{p\nmid mn}\alpha_{m,r}\beta_{n,s}S(a\overline{q}^{2}m,n;p)=\sum_{m^{\prime}=1}^{\min(M,p-1)}\ \sum_{n^{\prime}=1}^{\min(N,p-1)}\alpha_{m^{\prime},r}\beta_{n^{\prime},s}S(a\overline{q}^{2}m^{\prime},n^{\prime};p),

(7.5)

where

\alpha_{m^{\prime},r}:=\sum_{\begin{subarray}{c}m\leq M\\ m\equiv m^{\prime}\ (\textnormal{mod }p)\end{subarray}}\alpha_{m,r}=\sum_{\begin{subarray}{c}m\leq M\\ m\equiv m^{\prime}\ (\textnormal{mod }p)\\ m\equiv r\ (\textnormal{mod }q)\end{subarray}}\alpha_{m},

and $\beta_{n^{\prime},s}$ is defined similarly. Note that the last sum above contains at most one term, since $(p,q)=1$ and any $m\leq M\leq c=pq$ is determined by its residues modulo $p$ and $q$ ; so

\sum_{m^{\prime}=1}^{\min(M,p-1)}|\alpha_{m^{\prime},r}|^{2}=\sum_{m^{\prime}=1}^{\min(M,p-1)}\sum_{\begin{subarray}{c}m\leq M\\ m\equiv m^{\prime}\ (\textnormal{mod }p)\\ m\equiv r\ (\textnormal{mod }q)\end{subarray}}|\alpha_{m}|^{2}\leq\sum_{\begin{subarray}{c}m\leq M\\ m\equiv r\ (\textnormal{mod }q)\end{subarray}}|\alpha_{m}|^{2},

and similarly for $\sum_{n^{\prime}=1}^{\min(N,p-1)}|\beta_{n^{\prime},s}|^{2}$ . Applying Theorem˜3.4 (and the remark that follows it) for a bilinear sum with lengths $\min(M,p-1)$ and $\min(N,p-1)$ , and using the monotonicity of the bound from Theorem˜3.4 in $M$ and $N$ , we obtain

	$\displaystyle\sum_{m^{\prime}=1}^{\min(M,p-1)}\ \sum_{n^{\prime}=1}^{\min(N,p-1)}\alpha_{m^{\prime},r}\beta_{n^{\prime},s}S(a\overline{q}^{2}m^{\prime},n^{\prime};p)\ll\sqrt{\sum_{\begin{subarray}{c}m\leq M\\ m\equiv r\ (\textnormal{mod }q)\end{subarray}}|\alpha_{m}|^{2}\sum_{\begin{subarray}{c}n\leq N\\ n\equiv s\ (\textnormal{mod }q)\end{subarray}}|\beta_{n}|^{2}}$
	$\displaystyle\times\ p^{o(1)}\sqrt{MNp}\left(N^{-\frac{1}{2}}+(MN)^{-\frac{3}{16}}p^{\frac{11}{64}}\right).$
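The observation that each pair of residue classes modulo $p$ and modulo $q$ contains at most one $m\leq M$ relies on $M\leq pq$ . A minimal Python illustration follows; the helper name is hypothetical.

```python
from collections import Counter


def max_class_size(p, q, M):
    """Largest number of integers 1 <= m <= M sharing both residues (m mod p, m mod q)."""
    counts = Counter((m % p, m % q) for m in range(1, M + 1))
    return max(counts.values())
```

For coprime $p,q$ this returns $1$ whenever $M\leq pq$ , and the property fails as soon as $M=pq+1$ (where $m=1$ and $m=pq+1$ collide).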

Plugging this into ˜7.5 and ˜7.4, and applying Cauchy–Schwarz to the sum over $r,s$ , we find that

	$\displaystyle\mathop{\sum_{m=1}^{M}\sum_{n=1}^{N}}_{(m,n,c)=1}\alpha_{m}\beta_{n}S(am,n;c)\ll c^{o(1)}q^{\frac{3}{2}}\|\alpha\|\|\beta\|\sqrt{MNp}\left(N^{-\frac{1}{2}}+(MN)^{-\frac{3}{16}}p^{\frac{11}{64}}\right)$
	$\displaystyle+\|\alpha\|\|\beta\|c^{o(1)}\sqrt{MNq}.$

Finally, recalling that $pq=c$ and $M,N\leq c$ (which imply $\sqrt{MNq}\leq\sqrt{Mcq}\leq q^{3/2}\sqrt{Mp}$ ), the last term can be omitted, and we arrive at the desired bound. ∎

7.3. General moduli

Finally, we prove a generalization of Theorem˜1.1.

Theorem 7.4.

Let $\delta\in[0,\tfrac{1}{24}]$ , $c,M,N\in\mathbb{Z}_{+}$ with $M,N\leq c$ . Let $\tilde{M}:=\max(M,N)$ and $\tilde{N}:=\min(M,N)$ . Then for any complex sequences $(\alpha_{m})_{m\leq M}$ , $(\beta_{n})_{n\leq N}$ and any $a\in(\mathbb{Z}/c\mathbb{Z})^{\times}$ , one has

	$\displaystyle\mathop{\sum_{m=1}^{M}\sum_{n=1}^{N}}_{(m,n,c)=1}\alpha_{m}\beta_{n}S(am,n;c)$	$\displaystyle\ll\|\alpha\|\|\beta\|c^{o(1)}\sqrt{MNc}$		(7.6)
		$\displaystyle\times\left(\frac{c^{\frac{11+53\delta}{64}}}{(MN)^{\frac{3}{16}}}+\frac{c^{\frac{1-\delta}{6}}}{\tilde{N}^{\frac{1}{3}}}+\frac{c^{\frac{4-\delta}{12}}}{\tilde{M}^{\frac{1}{6}}\tilde{N}^{\frac{1}{2}}}+\frac{c^{\frac{11}{24}}}{(MN)^{\frac{1}{2}}}\right).$		(7.6)

Moreover, if $|\alpha_{m}|\leq 1$ for all $m$ (so $\|\alpha\|\leq\sqrt{M}$ ), then

	$\displaystyle\mathop{\sum_{m=1}^{M}\sum_{n=1}^{N}}_{(n,c)=1}\alpha_{m}\beta_{n}S(am,n;c)$	$\displaystyle\ll\sqrt{M}\|\beta\|c^{o(1)}\sqrt{MNc}$		(7.7)
		$\displaystyle\times\left(\frac{c^{\frac{11+53\delta}{64}}}{(MN)^{\frac{3}{16}}}+\frac{c^{\frac{1-\delta}{4}}}{\tilde{N}^{\frac{1}{2}}}+\frac{1}{c^{\frac{3}{16}}}+\frac{c^{\frac{1}{8}}}{\tilde{N}^{\frac{1}{3}}}+\frac{c^{\frac{11}{24}}}{(MN)^{\frac{1}{2}}}\right).$		(7.7)

Remark.

The bound ˜7.7 actually holds without the assumption that $M,N\leq c$ . Indeed, Theorem˜7.4 covers the case $M,N\leq c$ . If $M,N>c$ , then applying the (first) trivial bound from ˜1.2 for the sequences $(\alpha^{\prime}_{m^{\prime}})_{m^{\prime}\leq c}$ , $(\beta^{\prime}_{n^{\prime}})_{n^{\prime}\leq c}$ given by $\alpha^{\prime}_{m^{\prime}}:=\sum_{m\equiv m^{\prime}\ (\textnormal{mod }c)}\alpha_{m}$ and $\beta^{\prime}_{n^{\prime}}:=\mathbbm{1}_{(n^{\prime},c)=1}\sum_{n\equiv n^{\prime}\ (\textnormal{mod }c)}\beta_{n}$ leads to the bound

	$\displaystyle\mathop{\sum_{m=1}^{M}\sum_{n=1}^{N}}_{(n,c)=1}\alpha_{m}\beta_{n}S(am,n;c)\ll\|\alpha^{\prime}\|\|\beta^{\prime}\|c^{1+o(1)}$	$\displaystyle\ll\frac{M}{c}\sqrt{c}\sqrt{\frac{N}{c}}\|\beta\|c^{1+o(1)}$
		$\displaystyle\ll\sqrt{M}\|\beta\|c^{o(1)}\sqrt{MNc}\cdot c^{-3/16},$

so ˜7.7 still holds. If $N\leq c<M$ , then applying the (first) trivial bound from ˜1.2 for the sequences $(\alpha^{\prime}_{m^{\prime}})_{m^{\prime}\leq c}$ , $(\beta^{\prime}_{n})_{n\leq N}$ given by $\alpha^{\prime}_{m^{\prime}}:=\sum_{m\equiv m^{\prime}\ (\textnormal{mod }c)}\alpha_{m}$ and $\beta^{\prime}_{n}:=\beta_{n}\mathbbm{1}_{(n,c)=1}$ leads to the bound

	$\displaystyle\mathop{\sum_{m=1}^{M}\sum_{n=1}^{N}}_{(n,c)=1}\alpha_{m}\beta_{n}S(am,n;c)\ll\|\alpha^{\prime}\|\|\beta^{\prime}\|c^{1+o(1)}$	$\displaystyle\ll\frac{M}{c}\sqrt{c}\|\beta\|c^{1+o(1)}$
		$\displaystyle\ll\sqrt{M}\|\beta\|c^{o(1)}\sqrt{MNc}\cdot\frac{c^{\frac{1-\delta}{4}}}{N^{\frac{1}{2}}},$

so ˜7.7 still holds. An analogous argument covers the remaining case $M\leq c<N$ .

Proof of Theorem˜7.4.

We begin by noting a quick consequence of Theorem˜7.1. Suppose $c$ has a factorization $c=dd^{\prime}e$ with $d^{\prime}\mid d$ and $(d,e)=1$ such that $d\geq c^{1/2}$ . Then by combining Theorem˜7.1 (applied for $\tilde{M}$ , $\tilde{N}$ instead of $M,N$ ) with the bounds $f\leq\sqrt{cd}$ and then $d\geq c^{1/2}$ , we get

		$\displaystyle\mathop{\sum_{m=1}^{M}\sum_{n=1}^{N}}_{(m,n,c)=1}\alpha_{m}\beta_{n}S(am,n;c)$		(7.8)
		$\displaystyle\ll\|\alpha\|\|\beta\|c^{o(1)}\sqrt{MNc}\left(\frac{d}{\tilde{N}^{2}}+\frac{c^{\frac{3}{2}}d^{\frac{1}{2}}}{\tilde{M}\tilde{N}^{3}}+\frac{c^{\frac{11}{4}}}{M^{3}N^{3}}\right)^{\frac{1}{6}}$
		$\displaystyle\ll\|\alpha\|\|\beta\|c^{o(1)}\sqrt{MNc}\left(\frac{d^{\frac{1}{6}}}{\tilde{N}^{\frac{1}{3}}}+\frac{c^{\frac{1}{4}}d^{\frac{1}{12}}}{\tilde{M}^{\frac{1}{6}}\tilde{N}^{\frac{1}{2}}}+\frac{c^{\frac{11}{24}}}{(MN)^{\frac{1}{2}}}\right)=:\mathcal{B}(d).$

Note that this bound $\mathcal{B}(d)$ is increasing with $d\in[c^{1/2},c]$ , and that the right-hand side of ˜7.6 supersedes $\mathcal{B}(c^{1-\delta})$ . In particular,

\displaystyle\mathcal{B}(c^{3/4})=\|\alpha\|\|\beta\|c^{o(1)}\sqrt{MNc}\left(\frac{c^{\frac{1}{8}}}{\tilde{N}^{\frac{1}{3}}}+\frac{c^{\frac{5}{16}}}{\tilde{M}^{\frac{1}{6}}\tilde{N}^{\frac{1}{2}}}+\frac{c^{\frac{11}{24}}}{(MN)^{\frac{1}{2}}}\right).

A quick computation shows that since $\delta\leq\tfrac{1}{24}$ , one has

\frac{c^{\frac{5}{16}}}{\tilde{M}^{\frac{1}{6}}\tilde{N}^{\frac{1}{2}}}\leq\max\left(\frac{c^{\frac{11}{24}}}{(\tilde{M}\tilde{N})^{\frac{1}{2}}},\frac{c^{\frac{1-\delta}{4}}}{\tilde{N}^{\frac{1}{2}}}\right).

Therefore, the right-hand side of ˜7.7 supersedes $\mathcal{B}(c^{3/4})$ . We now split into cases depending on the factorization of the modulus $c$ .
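The "quick computation" above can be checked in exact arithmetic. The sketch below is illustrative only and rests on one assumption we introduce: writing $\tilde{M}=c^{\mu}$ with $\mu\in[0,1]$ , the factor $\tilde{N}^{-1/2}$ cancels on both sides, so the inequality reduces to a comparison of exponents of $c$ depending on $\mu$ and $\delta$ alone. The function names are hypothetical.

```python
from fractions import Fraction as F


def lhs_exponent(mu):
    """Exponent of c in c^(5/16) / M~^(1/6), where M~ = c^mu."""
    return F(5, 16) - mu / 6


def rhs_exponent(mu, delta):
    """Exponent of c in max(c^(11/24) / M~^(1/2), c^((1-delta)/4))."""
    return max(F(11, 24) - mu / 2, (1 - delta) / 4)


def inequality_holds(delta, steps=960):
    """Check the exponent inequality on the grid mu = k/steps, 0 <= k <= steps."""
    return all(
        lhs_exponent(F(k, steps)) <= rhs_exponent(F(k, steps), delta)
        for k in range(steps + 1)
    )
```

On this grid, the check passes for $\delta=\tfrac{1}{24}$ (with equality at $\mu=\tfrac{7}{16}$ ) and fails for $\delta=\tfrac{1}{20}$ , consistent with the threshold $\delta\leq\tfrac{1}{24}$ .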

Case 1: $c$ is divisible by a maximal prime power $p^{k}\geq c^{1-\delta}$ . Then let us write $c=p^{k}q$ , where $q$ is not necessarily a prime, but $q\leq c^{\delta}$ and $(p,q)=1$ .

Subcase 1.1: One has $k=1$ . Then we can apply Corollary˜7.3 (with $M,N$ replaced by $\tilde{M},\tilde{N}$ ), which gives the bound

\displaystyle\mathop{\sum_{m=1}^{M}\sum_{n=1}^{N}}_{(m,n,c)=1}\alpha_{m}\beta_{n}S(am,n;c)\ll\|\alpha\|\|\beta\|c^{o(1)}\sqrt{MNc}\left(\tilde{N}^{-\frac{1}{2}}c^{\delta}+(MN)^{-\frac{3}{16}}c^{\frac{11+53\delta}{64}}\right).

The first term here is superseded by the third term in ˜7.6 and the second term in ˜7.7, since $\tilde{M}\leq c$ and $\delta\leq\tfrac{1}{24}$ . The second term here appears directly in both ˜7.6 and 7.7.

Subcase 1.2: One has $k\geq 2$ . Then we let $d:=p^{\left\lceil k/2\right\rceil}q$ , $d^{\prime}:=p^{\left\lfloor k/2\right\rfloor}$ , and $e:=1$ , which gives a valid decomposition $c=dd^{\prime}e$ to use in our Theorem˜7.1. Moreover, since $k\geq 2$ , we have $\left\lceil k/2\right\rceil\leq 2k/3$ , so $d\leq p^{2k/3}q=c^{2/3}q^{1/3}$ , and thus

d\in[c^{1/2},c^{(2+\delta)/3}].

From ˜7.8 we thus obtain an upper bound of $\mathcal{B}(c^{(2+\delta)/3})$ , which is acceptable in both ˜7.6 and 7.7 since $\tfrac{2+\delta}{3}\leq\tfrac{3}{4}$ .

Case 2: All prime powers $p^{k}\mid c$ have $p^{k}<c^{1-\delta}$ .

Subcase 2.1: All prime powers $p^{k}\mid c$ have $p^{k}<c^{1/2}$ . Then we set $d^{\prime}:=1$ , and construct $d,e$ by a greedy algorithm. Initially, we take $d=e:=1$ . For each prime power $p^{k}\|c$ , we append $p^{k}$ to the smaller of $d$ and $e$ . Note that throughout this process, $d$ and $e$ cannot differ by a factor larger than $c^{1/2}$ . In the end, if $d<e$ , we swap $d$ and $e$ . This produces a factorization $c=de$ with $(d,e)=1$ and

d\in[c^{1/2},c^{3/4}].

Then ˜7.8 gives a bound of $\mathcal{B}(c^{3/4})$ , which is acceptable in both ˜7.6 and 7.7.
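The greedy construction of Subcase 2.1 can be sketched in Python as follows. This is only an illustration under the subcase's hypothesis that every prime power $p^{k}\|c$ satisfies $p^{k}<c^{1/2}$ ; the function names are hypothetical, and the prime powers are processed in increasing order of $p$ , which is one arbitrary choice among many.

```python
from math import gcd


def prime_power_factors(c):
    """Exact prime powers p^k || c, listed in increasing order of p."""
    factors, p = [], 2
    while p * p <= c:
        if c % p == 0:
            pk = 1
            while c % p == 0:
                c //= p
                pk *= p
            factors.append(pk)
        p += 1
    if c > 1:
        factors.append(c)
    return factors


def greedy_split(c):
    """Split c = d*e with gcd(d, e) = 1 by appending each prime power to the smaller part."""
    d, e = 1, 1
    for pk in prime_power_factors(c):
        if d <= e:
            d *= pk
        else:
            e *= pk
    if d < e:  # final swap so that d >= e
        d, e = e, d
    return d, e


def in_target_range(c, d):
    """Exact integer test of c^(1/2) <= d <= c^(3/4)."""
    return d * d >= c and d ** 4 <= c ** 3
```

For example, `greedy_split(2310)` returns `(110, 21)`, and $110$ indeed lies between $2310^{1/2}$ and $2310^{3/4}$ .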

Subcase 2.2: The largest prime power dividing $c$ is some $p^{k}\in[c^{1/2},c^{3/4})$ . Then we let $d:=p^{k}$ , $d^{\prime}:=1$ , and $e:=cp^{-k}$ , and ˜7.8 gives an acceptable bound of $\mathcal{B}(c^{3/4})$ once again.

Subcase 2.3: The largest prime power dividing $c$ is some $p^{k}\in[c^{3/4},c^{1-\delta})$ . On the one hand, ˜7.8 gives a bound of $\mathcal{B}(c^{1-\delta})$ , which is acceptable in ˜7.6; this completes the proof of ˜7.6.

Now assume (still within Subcase 2.3) that $|\alpha_{m}|\leq 1$ for all $m$ ; we aim to establish ˜7.7.

•

If $p=2$ , then writing $c=2^{k}q$ , we can factorize $c=dd^{\prime}e$ with $d:=2^{\left\lceil k/2\right\rceil}q$ , $d^{\prime}:=2^{\left\lfloor k/2\right\rfloor}$ , and $e:=1$ . Here $d\ll\sqrt{2^{k}q^{2}}=\sqrt{cq}\leq c^{3/4}$ , since $2^{k}\geq c^{1/2}$ implies $q\leq c^{1/2}$ . But then ˜7.8 gives an acceptable bound of $\mathcal{B}(c^{3/4})$ .

•

If $p>2$ , then we can use $d:=p^{k}$ in Theorem˜3.5. Since $d\in[c^{3/4},c^{1-\delta})$ , this gives the bound

\mathop{\sum_{m=1}^{M}\sum_{n=1}^{N}}_{(n,c)=1}\alpha_{m}\beta_{n}S(am,n;c)\ll\sqrt{M}\|\beta\|c^{o(1)}\sqrt{MNc}\left(\frac{c^{\frac{1}{8}}}{M^{\frac{1}{2}}}+\frac{1}{c^{\frac{3}{16}}}+\frac{c^{\frac{1-\delta}{4}}}{N^{\frac{1}{2}}}\right).

Since $M,N\geq\tilde{N}$ and $\tfrac{1}{8}\leq\tfrac{1-\delta}{4}$ , the first and the last terms inside the parentheses above are superseded by the term $c^{(1-\delta)/4}\tilde{N}^{-1/2}$ from ˜7.7. The second term appears directly in ˜7.7.

This covers all cases. ∎
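To make the case analysis concrete, the following Python sketch decides which subcase of the proof a given modulus $c\geq 2$ falls into, using exact integer comparisons for rational $\delta$ . It is an illustration only; the function names and the string labels are our own.

```python
from fractions import Fraction


def largest_prime_power(c):
    """Return (p, k) such that p^k is the largest exact prime power p^k || c."""
    best, n, p = (1, 0), c, 2
    while p * p <= n:
        if n % p == 0:
            k = 0
            while n % p == 0:
                n //= p
                k += 1
            if p ** k > best[0] ** best[1]:
                best = (p, k)
        p += 1
    if n > 1 and n > best[0] ** best[1]:
        best = (n, 1)  # leftover prime factor
    return best


def at_least_power(x, c, exponent):
    """Exact test of x >= c**exponent for a rational exponent in [0, 1]."""
    e = Fraction(exponent)
    return x ** e.denominator >= c ** e.numerator


def classify(c, delta):
    """Label of the subcase in the proof of Theorem 7.4 that applies to c."""
    p, k = largest_prime_power(c)
    pk = p ** k
    if at_least_power(pk, c, 1 - Fraction(delta)):
        return "1.1" if k == 1 else "1.2"
    if not at_least_power(pk, c, Fraction(1, 2)):
        return "2.1"
    if not at_least_power(pk, c, Fraction(3, 4)):
        return "2.2"
    return "2.3"
```

With $\delta=\tfrac{1}{24}$ , the moduli $101$ , $243$ , $2310$ , $222$ and $202$ land in Subcases 1.1, 1.2, 2.1, 2.2 and 2.3 respectively.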

Proof of Theorem˜1.1.

As before, we can assume without loss of generality that $M,N\leq c$ since the result is trivial when $c=O(1)$ . Then the result follows by applying Theorem˜7.4 with $M,N\ll c^{1/2+o(1)}$ , and using the optimal choices $\delta=\frac{3}{175}$ in ˜7.6, respectively $\delta=\frac{1}{69}$ in ˜7.7. ∎
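The two choices of $\delta$ can be verified in exact arithmetic. The sketch below is illustrative, with hypothetical names; it lists the exponents of $c$ carried by the bracketed terms of ˜7.6 and ˜7.7 when $M=N=c^{1/2}$ (ignoring $c^{o(1)}$ factors), and the largest exponent governs the saving over $\sqrt{MNc}$ .

```python
from fractions import Fraction as F


def exponents_7_6(delta):
    """Exponents of c in the four bracketed terms of (7.6) at M = N = c^(1/2)."""
    return [
        (11 + 53 * delta) / 64 - F(3, 16),
        (1 - delta) / 6 - F(1, 6),
        (4 - delta) / 12 - F(1, 12) - F(1, 4),
        F(11, 24) - F(1, 2),
    ]


def exponents_7_7(delta):
    """Exponents of c in the five bracketed terms of (7.7) at M = N = c^(1/2)."""
    return [
        (11 + 53 * delta) / 64 - F(3, 16),
        (1 - delta) / 4 - F(1, 4),
        F(-3, 16),
        F(1, 8) - F(1, 6),
        F(11, 24) - F(1, 2),
    ]


worst_7_6 = max(exponents_7_6(F(3, 175)))  # equals -1/700
worst_7_7 = max(exponents_7_7(F(1, 69)))   # equals -1/276
```

Since the first term increases with $\delta$ while the others do not, the largest exponent is minimized where the first term meets the next largest one, which happens exactly at $\delta=\tfrac{3}{175}$ and $\delta=\tfrac{1}{69}$ respectively.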

7.4. Averaging over moduli

Finally, let us prove a generalization of Corollary˜1.4.

Corollary 7.5.

Let $q=dd^{\prime}e$ for some $d,d^{\prime},e\in\mathbb{Z}_{+}$ with $d^{\prime}\mid d$ and $(d,e)=1$ , and $f\leq\sqrt{qd}$ be the largest integer with $f^{2}\mid qd$ . Let $C\geq\tfrac{1}{2}$ and $\mathcal{I},\mathcal{J}\subset\mathbb{Z}$ be intervals of lengths $|\mathcal{I}|=M$ , $|\mathcal{J}|=N$ , with $1\leq N\leq M\leq C$ . Let $(\alpha_{m})_{m\in\mathcal{I}},(\beta_{n})_{n\in\mathcal{J}}$ be complex sequences, and for each $c\sim C$ , let $(\alpha_{m}(c))_{m\in\mathcal{I}}$ , $(\beta_{n}(c))_{n\in\mathcal{J}}$ be such that $|\alpha_{m}(c)|\leq|\alpha_{m}|$ , $|\beta_{n}(c)|\leq|\beta_{n}|$ for all $m\in\mathcal{I},n\in\mathcal{J}$ . Then one has

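\displaystyle\sum_{\begin{subarray}{c}c\sim C\\ q\mid c\end{subarray}}\left|\mathop{\sum\sum}_{\begin{subarray}{c}m\in\mathcal{I},n\in\mathcal{J}\\ (m,n,q)=1\end{subarray}}\alpha_{m}(c)\beta_{n}(c)S(m,n;c)\right|\ll\|\alpha\|\|\beta\|\frac{C^{2+o(1)}}{q}\min\left\{\left(\frac{dM^{3}N}{C^{3}}+\frac{fM^{2}}{C^{2}}+\frac{f}{d^{2}}\right)^{\frac{1}{6}},\ \left(\frac{dM^{3}N}{qC^{2}}+\frac{fM^{2}}{qC}+\frac{fq}{d^{2}C}\right)^{\frac{1}{6}}\right\}.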

Proof of Corollary˜7.5.

Throughout this proof, we will use the notation

f_{a}:=\max_{\tilde{f}^{2}\mid a}\tilde{f},\qquad\qquad\|\alpha_{a*}\|:=\sqrt{\sum_{\begin{subarray}{c}m\in\mathcal{I}\\ a\mid m\end{subarray}}|\alpha_{m}|^{2}},\qquad\qquad\|\beta_{a*}\|:=\sqrt{\sum_{\begin{subarray}{c}n\in\mathcal{J}\\ a\mid n\end{subarray}}|\beta_{n}|^{2}},

for any $a\in\mathbb{Z}_{+}$ . In particular, the assumption of the present Corollary˜7.5 takes $f=f_{qd}$ . Note that $f_{a}\mid f_{ab}$ and $f_{a^{2}b}=af_{b}$ for any $a,b\in\mathbb{Z}_{+}$ , and that $f_{ab}=f_{a}f_{b}$ when $(a,b)=1$ .
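The three properties of $f_{a}$ just listed are easy to test by brute force. The Python sketch below is illustrative only; the function name `f` simply mirrors the notation $f_{a}$ .

```python
from math import gcd, isqrt


def f(a):
    """f_a: the largest integer whose square divides a."""
    return max(t for t in range(1, isqrt(a) + 1) if a % (t * t) == 0)


def check_properties(limit):
    """Verify f_a | f_ab, f_{a^2 b} = a f_b, and f_ab = f_a f_b for coprime a, b."""
    for a in range(1, limit + 1):
        for b in range(1, limit + 1):
            if f(a * b) % f(a) != 0:
                return False
            if f(a * a * b) != a * f(b):
                return False
            if gcd(a, b) == 1 and f(a * b) != f(a) * f(b):
                return False
    return True
```

All three properties follow from the fact that the exponent of each prime in $f_{a}$ is half the exponent of that prime in $a$ , rounded down.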

We can of course assume without loss of generality that $C\gg q$ , since otherwise the sum over $c$ is empty. For each $c\sim C$ with $q\mid c$ , we consider the sum

	$\displaystyle\mathcal{S}(c)$	$\displaystyle=\mathop{\sum\sum}_{\begin{subarray}{c}m\in\mathcal{I},n\in\mathcal{J}\\ (m,n,q)=1\end{subarray}}\alpha_{m}(c)\beta_{n}(c)S(m,n;c)$
		$\displaystyle=\sum_{\begin{subarray}{c}g\mid c\\ (g,q)=1\end{subarray}}\mathop{\sum\sum}_{\begin{subarray}{c}m\in\mathcal{I},n\in\mathcal{J}\\ (m,n,c)=g\end{subarray}}\alpha_{m}(c)\beta_{n}(c)S(m,n;c)=\sum_{\begin{subarray}{c}g\mid c\\ (g,q)=1\end{subarray}}\frac{\phi(c)}{\phi(c/g)}\mathop{\sum\sum}_{\begin{subarray}{c}m\in\mathcal{I},n\in\mathcal{J}\\ (m,n,c)=g\end{subarray}}\alpha_{m}(c)\beta_{n}(c)S(\tfrac{m}{g},\tfrac{n}{g};\tfrac{c}{g}),$

where the last equality follows from the identity $S(m,n;c)=\tfrac{\phi(c)}{\phi(c/g)}S(\tfrac{m}{g},\tfrac{n}{g};\tfrac{c}{g})$ . From the triangle inequality and the bound $\tfrac{\phi(c)}{\phi(c/g)}\leq g$ , we find that

\sum_{\begin{subarray}{c}c\sim C\\ q\mid c\end{subarray}}|\mathcal{S}(c)|\leq\sum_{\begin{subarray}{c}g\leq 2C/q\\ (g,q)=1\end{subarray}}g\sum_{\begin{subarray}{c}c\sim C\\ gq\mid c\end{subarray}}|\mathcal{S}(c;g)|,

(7.9)

where

\mathcal{S}(c;g):=\mathop{\sum\sum}_{\begin{subarray}{c}m\in\mathcal{I},n\in\mathcal{J}\\ g\mid(m,n)\\ (\frac{m}{g},\frac{n}{g},\frac{c}{g})=1\end{subarray}}\alpha_{m}(c)\beta_{n}(c)S(\tfrac{m}{g},\tfrac{n}{g};\tfrac{c}{g}).

We aim to apply Theorem˜7.1 (with $M,N,c\leftarrow\tfrac{M}{g},\tfrac{N}{g},\tfrac{c}{g}$ ) to bound each sum $\mathcal{S}(c;g)$ , and this requires a suitable factorization of the modulus $\tfrac{c}{g}$ . There are two ways to construct this from the assumed factorization $q=dd^{\prime}e$ , which correspond to placing ‘most’ of the factor $\tfrac{c}{gq}$ into $e$ or into $d$ .

Method 1. For each $c\sim C$ with $gq\mid c$ , consider the factorization

\frac{c}{g}=:c^{\prime}q=\tilde{d}d^{\prime}\tilde{e},\qquad\tilde{d}:=(c^{\prime},d^{\infty})d,\qquad\tilde{e}:=\frac{ec^{\prime}}{(c^{\prime},d^{\infty})},

which has $d^{\prime}\mid\tilde{d}$ and $(\tilde{d},\tilde{e})=1$ . We find that

f_{(c/g)\tilde{d}}^{2}\mid c^{\prime}q(c^{\prime},q^{\infty})d=(c^{\prime},q^{\infty})^{2}\frac{c^{\prime}}{(c^{\prime},q^{\infty})}qd\qquad\Rightarrow\qquad f_{(c/g)\tilde{d}}\leq(c^{\prime},q^{\infty})f_{c^{\prime}}f_{qd}=(c^{\prime},q^{\infty})f_{c^{\prime}}f,

so Theorem˜7.1 gives

	$\displaystyle\mathcal{S}(c;g)=\mathcal{S}(c^{\prime}gq;g)$	$\displaystyle\ll\|\alpha_{g*}\|\|\beta_{g*}\|\frac{c^{1+o(1)}}{g}\left(\frac{\tilde{d}M^{3}N}{c^{3}}+\frac{f_{(c/g)\tilde{d}}M^{2}}{c^{2}}+\frac{f_{(c/g)\tilde{d}}}{\tilde{d}^{2}}\right)^{\frac{1}{6}}g^{\frac{1}{6}}$
		$\displaystyle\ll\|\alpha_{g*}\|\|\beta_{g*}\|C^{1+o(1)}\left(\frac{dM^{3}N}{C^{3}}+\frac{fM^{2}}{C^{2}}+\frac{f}{d^{2}}\right)^{\frac{1}{6}}\left((c^{\prime},q^{\infty})f_{c^{\prime}}\right)^{\frac{1}{6}}.$

Therefore,

	$\displaystyle\sum_{\begin{subarray}{c}c\sim C\\ gq\mid c\end{subarray}}|\mathcal{S}(c;g)|$	$\displaystyle=\sum_{c^{\prime}\sim\frac{C}{gq}}|\mathcal{S}(c;g)|$
		$\displaystyle\ll\|\alpha_{g*}\|\|\beta_{g*}\|C^{1+o(1)}\left(\frac{dM^{3}N}{C^{3}}+\frac{fM^{2}}{C^{2}}+\frac{f}{d^{2}}\right)^{\frac{1}{6}}\sum_{c^{\prime}\sim\frac{C}{gq}}\left((c^{\prime},q^{\infty})f_{c^{\prime}}\right)^{\frac{1}{6}}.$

After applying Cauchy–Schwarz to the last sum, it remains to bound the sums $\sum_{c^{\prime}\sim C/(gq)}(c^{\prime},q^{\infty})$ and $\sum_{c^{\prime}\sim C/(gq)}f_{c^{\prime}}$ , both of which are $O(\tfrac{C^{1+o(1)}}{gq})$ . In particular, for the second sum, we can write

\sum_{c^{\prime}\sim\frac{C}{gq}}f_{c^{\prime}}\leq\sum_{f\ll\sqrt{\frac{C}{gq}}}f\sum_{c^{\prime}\sim\frac{C}{gq}}\mathbbm{1}_{f^{2}\mid c^{\prime}}\ll\sum_{f\leq\sqrt{\frac{C}{gq}}}\frac{C}{gqf}\ll\frac{C^{1+o(1)}}{gq}.

From this and ˜7.9, we conclude that

\sum_{\begin{subarray}{c}c\sim C\\ q\mid c\end{subarray}}|\mathcal{S}(c)|\ll\frac{C^{2+o(1)}}{q}\left(\frac{dM^{3}N}{C^{3}}+\frac{fM^{2}}{C^{2}}+\frac{f}{d^{2}}\right)^{\frac{1}{6}}\sum_{\begin{subarray}{c}g\leq 2C/q\\ (g,q)=1\end{subarray}}\|\alpha_{g*}\|\|\beta_{g*}\|.

Finally, the last sum is easily bounded by $C^{o(1)}\|\alpha\|\|\beta\|$ using Cauchy–Schwarz and the divisor bound. This establishes the bound from Corollary˜7.5 with the first term from the minimum.

Method 2. For each $c\sim C$ with $gq\mid c$ , consider the factorization

\frac{c}{g}=:c^{\prime}q=\tilde{d}d^{\prime}\tilde{e},\qquad\tilde{d}:=\frac{c^{\prime}d}{(c^{\prime},e^{\infty})},\qquad\tilde{e}:=e(c^{\prime},e^{\infty}),

which satisfies $d^{\prime}\mid\tilde{d}$ and $(\tilde{d},\tilde{e})=1$ . We find that

f_{(c/g)\tilde{d}}^{2}\mid(c^{\prime})^{2}qd\qquad\Rightarrow\qquad f_{(c/g)\tilde{d}}\leq c^{\prime}f_{qd}\ll\frac{fC}{q},

so Theorem˜7.1 gives

\displaystyle\mathcal{S}(c;g)=\mathcal{S}(c^{\prime}gq;g)\ll\|\alpha_{g*}\|\|\beta_{g*}\|C^{1+o(1)}\left(\frac{dM^{3}N}{qC^{2}}+\frac{fM^{2}}{qC}+\frac{fq}{d^{2}C}\right)^{\frac{1}{6}}(c^{\prime},e^{\infty})^{\frac{1}{3}}.

The second bound from Corollary˜7.5 now follows similarly as before from ˜7.9, since the sum over $c^{\prime}\sim\tfrac{C}{gq}$ ‘washes out’ the factor $(c^{\prime},e^{\infty})$ . ∎
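The two factorizations of $\tfrac{c}{g}=c^{\prime}q$ used in Methods 1 and 2 can be written out explicitly. The following Python sketch is an illustration with hypothetical function names; it takes $c^{\prime}$ , $d$ and $e$ as inputs (the factor $d^{\prime}$ is unchanged by both methods), and computes $(n,d^{\infty})$ by repeatedly extracting common factors.

```python
from math import gcd


def part_supported_on(n, d):
    """(n, d^infinity): the largest divisor of n composed only of primes dividing d."""
    result = 1
    g = gcd(n, d)
    while g > 1:
        result *= g
        n //= g
        g = gcd(n, d)
    return result


def method_1(c_prime, d, e):
    """Method 1: return (d~, e~), placing the d-part of c' into d and the rest into e."""
    s = part_supported_on(c_prime, d)
    return s * d, e * c_prime // s


def method_2(c_prime, d, e):
    """Method 2: return (d~, e~), placing the e-part of c' into e and the rest into d."""
    s = part_supported_on(c_prime, e)
    return c_prime * d // s, e * s
```

For instance, with $d=12$ , $d^{\prime}=4$ , $e=5$ and $c^{\prime}=10$ , both methods return $(\tilde{d},\tilde{e})=(24,25)$ , although in general the two outputs differ.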

Proof of Corollary˜1.4.

This follows from Corollary˜7.5 analogously to how Theorem˜1.2 follows from Theorem˜7.1. ∎

8. Moments of twisted modular $L$ -functions

Here we prove Theorem˜1.5, by inserting our bounds for bilinear forms with Kloosterman sums into the proofs from [3]. We begin by restating ˜7.7 in a shape more similar to [3, Theorem 5].

Corollary 8.1.

Let $r,q\in\mathbb{Z}_{+}$ with $r\mid q$ . Let $K,M\geq 1$ , $\tilde{K}:=\max(K,M)$ , $\tilde{M}:=\min(K,M)$ , and $(\lambda_{k})_{K\leq k\leq 2K}$ be a sequence with $|\lambda_{k}|\leq 1$ for all $k$ . Then one has

	$\displaystyle\sum_{\begin{subarray}{c}M\leq m\leq 2M\\ (m,q)=1\end{subarray}}\left|\sum_{K\leq k\leq 2K}\lambda_{k}S(k,m;r)\right|^{2}$	$\displaystyle\ll(qKM)^{o(1)}K^{2}Mr$
		$\displaystyle\times\left(\frac{r^{\frac{11+53\delta}{32}}}{(KM)^{\frac{3}{8}}}+\frac{r^{\frac{1-\delta}{2}}}{\tilde{M}}+\frac{1}{r^{\frac{3}{8}}}+\frac{r^{\frac{1}{4}}}{\tilde{M}^{\frac{2}{3}}}+\frac{r^{\frac{11}{12}}}{KM}\right).$

Proof.

One can of course assume without loss of generality that $M,K\in\mathbb{Z}_{+}$ , and extend the sum over $m$ to include all $m\in[M,2M]$ with $(m,r)=1$ . By duality, it suffices to establish the bound

	$\displaystyle\sum_{\begin{subarray}{c}M\leq m\leq 2M\\ (m,r)=1\end{subarray}}\beta_{m}\sum_{K\leq k\leq 2K}\lambda_{k}S(k,m;r)$	$\displaystyle\ll(qKM)^{o(1)}\|\beta\|K\sqrt{Mr}$
		$\displaystyle\times\left(\frac{r^{\frac{11+53\delta}{64}}}{(KM)^{\frac{3}{16}}}+\frac{r^{\frac{1-\delta}{4}}}{\tilde{M}^{\frac{1}{2}}}+\frac{1}{r^{\frac{3}{16}}}+\frac{r^{\frac{1}{8}}}{\tilde{M}^{\frac{1}{3}}}+\frac{r^{\frac{11}{24}}}{(KM)^{\frac{1}{2}}}\right),$

for any sequence $(\beta_{m})_{M\leq m\leq 2M}$ . But this is precisely the content of ˜7.7 with $(M,N,c)$ replaced by $(K,M,r)$ ; the remark after Theorem˜7.4 allows us to ignore the constraint $K,M\leq r$ . ∎

We can now prove an analogue of [3, Proposition 7]. We use the same normalization as in [3, (2.3)] for the Hecke eigenvalues $\lambda_{f}(n)$ of a holomorphic cuspidal newform $f$ for $\textnormal{SL}_{2}(\mathbb{Z})$ , so that

\lambda_{f}(n)\rho_{f}(1)=\sqrt{n}\rho_{f}(n),\qquad\text{where}\qquad f(z)=\sum_{n=1}^{\infty}\rho_{f}(n)(4\pi n)^{k/2}e(nz).

(8.1)

In particular, the Deligne bound [8] reads

\lambda_{f}(n)\ll n^{o(1)}.

(8.2)

Proposition 8.2.

Let $\varepsilon>0$ , $q,d\in\mathbb{Z}_{+}$ with $d\mid q$ , $\frac{1}{20}N\geq M\geq 1$ with $MN\leq q^{2+\varepsilon}$ , and let $\lambda_{1}(m)$ , $\lambda_{2}(n)$ be the Hecke eigenvalues of two (fixed) holomorphic cuspidal newforms for $\textnormal{SL}_{2}(\mathbb{Z})$ . Let $V_{1},V_{2}:\mathbb{R}\to\mathbb{C}$ be functions supported in $[1,2]$ with derivatives $V_{i}^{(j)}\ll_{j,\varepsilon}q^{\varepsilon}$ , and denote

S_{N,M,d,q}:=\frac{d}{(NM)^{1/2}}\sum_{r=1}^{2N/d}\sum_{\begin{subarray}{c}n\equiv m\ (\textnormal{mod }d)\\ (nm,q)=1\\ n\neq m\end{subarray}}\lambda_{1}(m)\lambda_{2}(n)V_{1}\left(\frac{m}{M}\right)V_{2}\left(\frac{n}{N}\right).

(8.3)

Then for any $\delta\in[0,\tfrac{1}{24}]$ , one has

S_{N,M,d,q}\ll_{\varepsilon}q^{O(\varepsilon)}\left(\frac{M^{\frac{5}{16}}q^{\frac{83+53\delta}{64}}}{N^{\frac{5}{16}}}+\frac{q^{\frac{7-\delta}{4}}}{\sqrt{N}}+\frac{M^{\frac{1}{2}}q^{\frac{21}{16}}}{N^{\frac{1}{2}}}+\frac{q^{\frac{13}{8}}M^{\frac{1}{6}}}{N^{\frac{1}{2}}}+q^{\frac{23}{24}}\right).

(8.4)

Proof.

We closely follow the proof in [3, §4]. In particular, we decompose $q=q_{d}q^{\prime}$ where $q^{\prime}$ is maximal with $(q^{\prime},d)=1$ . The bound [3, (4.2)] reads

S_{N,M,d,q}\ll\sqrt{N}\sum_{\begin{subarray}{c}g\mid f\mid q^{\prime}\\ r\mid d\end{subarray}}\frac{\mu^{2}(f)|\lambda_{2}(f/g)|}{fgr}\left(\sum_{\begin{subarray}{c}m\asymp M\\ (m,q)=1\end{subarray}}\Big|\sum_{n}S(\overline{fg}m,n;r)\lambda_{2}(n)V_{2}^{\circ}\left(\frac{nN}{fgr^{2}}\right)\Big|^{2}\right)^{1/2},

where $V_{2}^{\circ}$ is a transform of $V_{2}$ as in [3, (2.10)] (coming from an application of the Voronoi summation formula). Using the rapid decay of $V_{2}^{\circ}$ , we may truncate the sum over $n$ at

n\leq K_{f,g,r}:=q^{\varepsilon}\frac{fgr^{2}}{N},

up to an acceptable loss. Note that the resulting sum over $n$ vanishes unless $K_{f,g,r}\geq 1$ . From Corollary˜8.1, ˜8.2, and the divisor bound, we conclude that

	$\displaystyle S_{N,M,d,q}\ll_{\varepsilon}q^{O(\varepsilon)}\sqrt{N}\max_{\begin{subarray}{c}g\mid f\mid q^{\prime}\\ r\mid d\end{subarray}}\frac{1}{fgr}K_{f,g,r}\sqrt{Mr}\Bigg(\frac{r^{\frac{11+53\delta}{64}}}{(K_{f,g,r}M)^{\frac{3}{16}}}+\frac{r^{\frac{1-\delta}{4}}}{\min(K_{f,g,r},M)^{\frac{1}{2}}}+\frac{1}{r^{\frac{3}{16}}}$
	$\displaystyle+\frac{r^{\frac{1}{8}}}{\min(K_{f,g,r},M)^{\frac{1}{3}}}+\frac{r^{\frac{11}{24}}}{(K_{f,g,r}M)^{\frac{1}{2}}}\Bigg).$

Plugging in the definition of $K_{f,g,r}$ , we see that the expression inside the maximum is non-decreasing in $r$ and non-increasing in $f,g$ . Writing

K:=K_{1,1,q}=\frac{q^{2+\varepsilon}}{N}\geq M,

we find that

	$\displaystyle S_{N,M,d,q}$	$\displaystyle\ll_{\varepsilon}q^{O(\varepsilon)}\sqrt{N}\frac{1}{q}K\sqrt{Mq}\left(\frac{q^{\frac{11+53\delta}{64}}}{(KM)^{\frac{3}{16}}}+\frac{q^{\frac{1-\delta}{4}}}{M^{\frac{1}{2}}}+\frac{1}{q^{\frac{3}{16}}}+\frac{q^{\frac{1}{8}}}{M^{\frac{1}{3}}}+\frac{q^{\frac{11}{24}}}{(KM)^{\frac{1}{2}}}\right)$
		$\displaystyle\ll_{\varepsilon}q^{O(\varepsilon)}\frac{q^{\frac{3}{2}}M^{\frac{1}{2}}}{N^{\frac{1}{2}}}\left(\frac{q^{\frac{11+53\delta}{64}}}{(q^{2}M/N)^{\frac{3}{16}}}+\frac{q^{\frac{1-\delta}{4}}}{M^{\frac{1}{2}}}+\frac{1}{q^{\frac{3}{16}}}+\frac{q^{\frac{1}{8}}}{M^{\frac{1}{3}}}+\frac{q^{\frac{11}{24}}}{(q^{2}M/N)^{\frac{1}{2}}}\right),$

which reduces to the desired bound. ∎
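The final reduction to ˜8.4 is pure exponent bookkeeping, which can be mechanized. In the sketch below (illustrative only, with hypothetical names), a monomial $q^{a+b\delta}M^{u}N^{v}$ is encoded as the tuple $(a,b,u,v)$ , factors of $q^{\varepsilon}$ are ignored, and $K$ is replaced by $q^{2}/N$ .

```python
from fractions import Fraction as F


def multiply(x, y):
    """Multiply two monomials q^(a + b*delta) M^u N^v by adding exponent tuples."""
    return tuple(s + t for s, t in zip(x, y))


# Prefactor q^(3/2) M^(1/2) N^(-1/2) in front of the bracket.
PREFACTOR = (F(3, 2), F(0), F(1, 2), F(-1, 2))

# The five bracketed terms, with K = q^2 / N substituted.
BRACKET = [
    (F(11, 64) - F(3, 8), F(53, 64), F(-3, 16), F(3, 16)),
    (F(1, 4), F(-1, 4), F(-1, 2), F(0)),
    (F(-3, 16), F(0), F(0), F(0)),
    (F(1, 8), F(0), F(-1, 3), F(0)),
    (F(11, 24) - 1, F(0), F(-1, 2), F(1, 2)),
]

# The five terms on the right-hand side of (8.4), in the same order.
TARGET = [
    (F(83, 64), F(53, 64), F(5, 16), F(-5, 16)),
    (F(7, 4), F(-1, 4), F(0), F(-1, 2)),
    (F(21, 16), F(0), F(1, 2), F(-1, 2)),
    (F(13, 8), F(0), F(1, 6), F(-1, 2)),
    (F(23, 24), F(0), F(0), F(0)),
]

PRODUCTS = [multiply(PREFACTOR, term) for term in BRACKET]
```

Comparing `PRODUCTS` with `TARGET` entry by entry confirms that each term matches the corresponding term of ˜8.4.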

We can now prove the desired asymptotic for twisted moments of modular $L$ -functions.

Proof of Theorem˜1.5.

Let $\varepsilon>0$ and $\gamma:=\tfrac{1}{674}$ . We closely follow the proof in [3, §3], making no changes to the main term analysis from [3, §3.1]. Treating the off-diagonal term as in [3, §3.2], it remains to establish the bound

S_{N,M,d,q}\stackrel{{\scriptstyle?}}{{\ll_{\varepsilon}}}q^{1-\gamma+O(\varepsilon)},

(8.5)

for all $d\mid q$ and $N\geq M\geq 1$ with $MN\leq q^{2+\varepsilon}$ , using the notation from ˜8.3. As in [3, §3.3], we can easily discount the contribution of the range $M\leq N<20M$ using [3, (3.12)], so let us assume that $N\geq 20M$ . We will rely on the bounds

	$\displaystyle S_{N,M,d,q}$	$\displaystyle\ll_{\varepsilon}q^{O(\varepsilon)}(MN)^{\frac{1}{2}},$		(8.6)
	$\displaystyle S_{N,M,d,q}$	$\displaystyle\ll_{\varepsilon}q^{O(\varepsilon)}\left(\frac{(Nq)^{\frac{1}{2}}}{M^{\frac{1}{2}}}+\frac{N^{\frac{3}{4}}}{M^{\frac{1}{4}}}+\frac{N^{\frac{1}{4}}q^{\frac{3}{4}}}{M^{\frac{1}{4}}}+N^{\frac{1}{2}}q^{\frac{1}{4}}\right),$		(8.7)

from [3, (3.6) and (3.11)], as well as on our Proposition˜8.2 (instead of [3, Proposition 7]). First, the trivial bound ˜8.6 establishes ˜8.5 unless

M>\frac{q^{2-2\gamma}}{N},

(8.8)

so let us assume that we are in this range. We now split into cases depending on the size of $N$ .

Case 1. One has $N\leq q^{3/2-3\gamma}$ . Then by plugging ˜8.8 into ˜8.7, we obtain ˜8.5.

Case 2. One has $N\in(q^{3/2-3\gamma},q^{3/2-2\gamma}]$ . Then by plugging ˜8.8 into ˜8.7, we find that

S_{N,M,d,q}\ll_{\varepsilon}q^{O(\varepsilon)}\left(q^{1-\gamma}+\frac{N^{\frac{1}{4}}q^{\frac{3}{4}}}{M^{\frac{1}{4}}}\right),

which is acceptable in ˜8.5 unless

\frac{N}{M}>q^{1-4\gamma}.

Plugging this and $N>q^{3/2-3\gamma}$ into ˜8.4, we find that

S_{N,M,d,q}\ll_{\varepsilon}q^{O(\varepsilon)}\left(q^{\frac{63+53\delta}{64}+\frac{5\gamma}{4}}+q^{1-\frac{\delta}{4}+\frac{3\gamma}{2}}+q^{\frac{23}{24}+2\gamma}\right),

(8.9)

which is acceptable in ˜8.5 provided that

10\gamma\leq\delta\leq\frac{1-144\gamma}{53}.

This is precisely attained for our choice of $\gamma=\frac{1}{674}$ by taking $\delta:=\frac{10}{674}$ in Proposition˜8.2.
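The numerology behind $\gamma=\tfrac{1}{674}$ can be confirmed in exact arithmetic. The following Python sketch is illustrative only, with hypothetical variable names; it checks that the admissible interval for $\delta$ collapses to a single point at this value of $\gamma$ , and that every exponent in ˜8.9 is then at most $1-\gamma$ .

```python
from fractions import Fraction as F

gamma = F(1, 674)
delta = F(10, 674)

# Endpoints of the admissible range 10*gamma <= delta <= (1 - 144*gamma)/53.
lower = 10 * gamma
upper = (1 - 144 * gamma) / 53

# Exponents of q in the three terms of (8.9).
exponents_8_9 = [
    (63 + 53 * delta) / 64 + 5 * gamma / 4,
    1 - delta / 4 + 3 * gamma / 2,
    F(23, 24) + 2 * gamma,
]

interval_is_a_point = (lower == delta == upper)
bound_is_acceptable = max(exponents_8_9) <= 1 - gamma
```

Both flags evaluate to `True`; in fact the first two exponents equal $1-\gamma$ exactly, so no larger value of $\gamma$ is compatible with both constraints.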

Case 3. One has $N\in(q^{3/2-2\gamma},q^{3/2+\gamma})$ . Then ˜8.7 is useless because of the last term. We plug in $M\leq q^{2+\varepsilon}/N$ and then $N\geq q^{3/2-2\gamma}$ into ˜8.4 to find that

S_{N,M,d,q}\ll_{\varepsilon}q^{O(\varepsilon)}\left(q^{\frac{63+53\delta}{64}+\frac{5\gamma}{4}}+q^{1-\frac{\delta}{4}+\gamma}+q^{\frac{23}{24}+2\gamma}\right),

which is a stronger bound than ˜8.9. This completes our proof. ∎

9. Large sieve for exceptional cusp forms

Here we prove a generalization of Corollary˜1.6, which requires some background from the spectral theory of automorphic forms. We recall [11] that for $q\in\mathbb{Z}_{+}$ , the congruence subgroup $\Gamma_{0}(q)$ consists of those matrices in $\textnormal{SL}_{2}(\mathbb{Z})$ with bottom-left entries divisible by $q$ . Each cusp $\mathfrak{a}$ of the fundamental domain $\Gamma_{0}(q)\backslash\mathbb{H}$ is equivalent to a fraction of the form $\tfrac{u}{w}$ , where $u,w\in\mathbb{Z}_{+}$ , $w\mid q$ , $(u,w)=1$ , and $u\leq(w,\tfrac{q}{w})$ ; in particular, the cusp at $\infty$ is equivalent to $\tfrac{1}{q}$ . To such a cusp, one can associate a scaling matrix $\sigma_{\mathfrak{a}}\in\textnormal{PSL}_{2}(\mathbb{R})$ with $\sigma_{\mathfrak{a}}\infty=\mathfrak{a}$ , and via these scaling matrices, functions on $\Gamma_{0}(q)\backslash\mathbb{H}$ can be Fourier expanded around $\mathfrak{a}$ .

The discrete spectrum of the hyperbolic Laplacian $\Delta=-y^{2}(\partial_{x}^{2}+\partial_{y}^{2})$ is parametrized by Maass cusp forms: these are smooth functions $f:\Gamma_{0}(q)\backslash\mathbb{H}\to\mathbb{C}$ which are eigenfunctions of $\Delta$ , vanish at all cusps of $\Gamma_{0}(q)\backslash\mathbb{H}$ , and are square-integrable with respect to the Petersson inner product. Following the normalization of Deshouillers–Iwaniec [11], we write the Fourier expansion of $f$ at $z=x+iy\in\mathbb{H}$ around a cusp $\mathfrak{a}$ (with scaling matrix $\sigma_{\mathfrak{a}}$ ) as

f(\sigma_{\mathfrak{a}}z)=y^{1/2}\sum_{n\neq 0}\rho_{\mathfrak{a}}(n)K_{i\kappa}(2\pi|n|y)\,e(nx),

where $K$ is a Whittaker function as in [11, p. 264]. Altering the choice of scaling matrix $\sigma_{\mathfrak{a}}$ results in multiplying the Fourier coefficients $\rho_{\mathfrak{a}}(n)$ by an exponential phase $e(n\omega)$ , for some uniform $\omega\in\mathbb{R}/\mathbb{Z}$ .

The Kuznetsov trace formula [11, 27], as well as the large sieve inequalities that derive from it, involve an orthonormal basis of Maass cusp forms. The following notation will therefore be useful.

Notation 9.1.

Let $q\in\mathbb{Z}_{+}$ , $\mathfrak{a}$ be a cusp of $\Gamma_{0}(q)$ equivalent to $\tfrac{1}{s}$ for some $s\mid q$ with $(s,\tfrac{q}{s})=1$ , and $\sigma_{\mathfrak{a}}\in\textnormal{PSL}_{2}(\mathbb{R})$ be any scaling matrix for $\mathfrak{a}$ . (Footnote 7: The assumption that $\mathfrak{a}$ is equivalent to $\tfrac{1}{s}$ is true in most applications (note that this includes the cusp at $\infty$ ), and is only made for convenience; one can prove similar results at arbitrary cusps with small adjustments.) Consider an orthonormal basis $(f_{j})_{j\geq 1}$ of Maass cusp forms for $\Gamma_{0}(q)$ , with:

$(i)$ Laplacian eigenvalues $\lambda_{j}$ and spectral parameters $\theta_{j}:=\max(0,\tfrac{1}{4}-\lambda_{j})^{1/2}$ ;

$(ii)$ Fourier coefficients $(\rho_{j\mathfrak{a}}(n))_{n\in\mathbb{Z}}$ around the cusp $\mathfrak{a}$ , using the scaling matrix $\sigma_{\mathfrak{a}}$ .

Proposition 9.2.

Assume Notation 9.1, let $X,N\geq 1/2$ , and let $(\alpha_{n})_{n\sim N}$ be a complex sequence. Let $\Phi:\mathbb{R}\to[0,\infty)$ be a smooth function supported in $[\Omega(1),O(1)]$ , with $\int\Phi(t)\,dt\gg 1$ and $\Phi^{(j)}(t)\ll_{j}1$ . Then there exists $\omega\in\mathbb{R}/\mathbb{Z}$ (depending only on $\mathfrak{a}$ , $\sigma_{\mathfrak{a}}$ ) such that

\begin{aligned}
\sum_{\lambda_{j}<1/4}X^{2\theta_{j}}\left|\sum_{n\sim N}\alpha_{n}\,\rho_{j\mathfrak{a}}(n)\right|^{2}
&\ll(qN)^{o(1)}\left(1+\frac{N}{q}\right)\|\alpha_{n}\|^{2}\\
&\quad+\left|\sum_{c\in q\mathbb{Z}_{+}}\frac{1}{c}\sum_{m,n\sim N}\overline{\alpha_{m}e(m\omega)}\,\alpha_{n}e(n\omega)\,S(m,n;c)\,\Phi\left(\frac{\sqrt{mn}}{c}X\right)\right|.
\end{aligned}

(9.1)

Proof.

This is [32, Corollary I], which follows from the Kuznetsov trace formula and the regular-spectrum large sieve inequalities of Deshouillers–Iwaniec [11, Theorem 2]. We have implicitly used [32, Lemma B] to write down the Kloosterman sums and $c$ -supports for cusps $\mathfrak{a}$ equivalent to $\tfrac{1}{s}$ for some $s\mid q$ (the latter condition is written as $\mu(\mathfrak{a})=q^{-1}$ in loc. cit.). Note that we incur factors of $e(m\omega)$ and $e(n\omega)$ since we do not assume a special scaling matrix $\sigma_{\mathfrak{a}}$ (as we may), but this will be irrelevant in our computations since the sequence $(\alpha_{n})$ is arbitrary. ∎

In the right-hand side of (9.1), the sum over $c$ is really supported on $c\asymp NX$ due to the $\Phi$ -weight, and it vanishes if $q\gg NX$ with a large enough implied constant. Deshouillers–Iwaniec used this simple observation to deduce the following result, which combines [11, Theorems 2 and 5].

Theorem 9.3 (Deshouillers–Iwaniec [11]).

Assume Notation 9.1, let $N\geq\tfrac{1}{2}$ , and let $(\alpha_{n})_{n\sim N}$ be a complex sequence. Then one has

\sum_{\lambda_{j}<1/4}X^{2\theta_{j}}\left|\sum_{n\sim N}\alpha_{n}\,\rho_{j\mathfrak{a}}(n)\right|^{2}\ll(qN)^{o(1)}\left(1+\frac{N}{q}\right)\|\alpha\|^{2},

(9.2)

for any positive $X\ll 1+\frac{q}{N}$ .

Until now, if $\sqrt{q}\ll N\ll q$ , Theorem 9.3 has been the state-of-the-art exceptional-spectrum large sieve bound for general sequences $(\alpha_{n})$ and a single group $\Gamma_{0}(q)$ ; the same is true if one averages over levels $q\sim Q$ and allows the sequence $(\alpha_{n})$ to depend on $q$ .

We can now achieve an improvement of Theorem 9.3 when $q$ has a factorization as in Theorem 7.1, and similar results can be deduced for arbitrary levels $q$ using Theorem 7.4. We require a coprimality constraint $(n,q)=1$ for technical reasons, but this is usually harmless in applications. The resulting power savings are relatively small, but serve as a proof of concept that Theorem 9.3 is not a fundamental barrier.

Theorem 9.4 (Large sieve for composite levels).

Assume Notation 9.1, let $N\geq\tfrac{1}{2}$ , and let $(\alpha_{n})_{n\sim N}$ be a complex sequence supported on $(n,q)=1$ . Suppose that $q=dd^{\prime}e$ with $d^{\prime}\mid d$ and $(d,e)=1$ , and let $f\leq\sqrt{qd}$ be the largest integer with $f^{2}\mid qd$ . Then (9.2) holds for any positive

X\ll 1+\frac{q}{N}+\min\left(\frac{q^{2}}{d^{1/3}N^{7/3}},\frac{q^{3/2}}{f^{1/4}N^{3/2}},\frac{qd^{1/3}}{f^{1/6}N}\right)+\min\left(\frac{q^{7/4}}{d^{1/4}N^{2}},\frac{q^{7/5}}{f^{1/5}N^{7/5}},\frac{qd^{2/5}}{f^{1/5}N}\right).

(9.3)

Proof of Theorem 9.4.

We may assume without loss of generality that

1+\frac{q}{N}<X\ll\min\left(\frac{q^{2}}{d^{1/3}N^{7/3}},\frac{q^{3/2}}{f^{1/4}N^{3/2}},\frac{qd^{1/3}}{f^{1/6}N}\right)+\min\left(\frac{q^{7/4}}{d^{1/4}N^{2}},\frac{q^{7/5}}{f^{1/5}N^{7/5}},\frac{qd^{2/5}}{f^{1/5}N}\right),

(9.4)

since otherwise the result follows from Theorem 9.3. We apply Proposition 9.2 with a choice of $\Phi$ supported on $[2,4]$ , then separate variables in the smooth weight $\Phi(\cdot)$ via two-dimensional Fourier inversion, as in [32, Proof of Theorem 13], to arrive at

\begin{aligned}
\sum_{\lambda_{j}<1/4}X^{2\theta_{j}}\left|\sum_{n\sim N}\alpha_{n}\,\rho_{j\mathfrak{a}}(n)\right|^{2}
&\ll(qN)^{o(1)}\left(1+\frac{N}{q}\right)\|\alpha\|^{2}\\
&\quad+\sum_{\begin{subarray}{c}\frac{NX}{4}<c\leq NX\\ q\mid c\end{subarray}}\frac{1}{c}\sup_{\begin{subarray}{c}(\beta_{n})_{n\sim N}\\ |\beta_{n}|=|\alpha_{n}|\end{subarray}}\sup_{\begin{subarray}{c}(\gamma_{n})_{n\sim N}\\ |\gamma_{n}|=|\alpha_{n}|\end{subarray}}\left|\sum_{m,n\sim N}\beta_{m}\gamma_{n}S(m,n;c)\right|.
\end{aligned}

The sequences $(\beta_{n})$ , $(\gamma_{n})$ in the suprema arise by incorporating exponential phases $e(n\omega)$ into $(\alpha_{n})$ , partly from the choice of the scaling matrix $\sigma_{\mathfrak{a}}$ , and partly due to the separation of variables. The suprema are of course attained by some sequences $(\beta_{n})$ , $(\gamma_{n})$ supported on $(n,q)=1$ , so we can apply Corollary 7.5 with $M=N$ and $C\asymp NX$ , to obtain

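\begin{aligned}
\sum_{\lambda_{j}<1/4}X^{2\theta_{j}}\left|\sum_{n\sim N}\alpha_{n}\,\rho_{j\mathfrak{a}}(n)\right|^{2}
&\ll(qN)^{o(1)}\left(1+\frac{N}{q}\right)\|\alpha\|^{2}\\
&\quad+\|\alpha\|^{2}\frac{(NX)^{1+o(1)}}{q}\min\left(\left(\frac{dN}{X^{3}}+\frac{f}{X^{2}}+\frac{f}{d^{2}}\right)^{\frac{1}{6}},\left(\frac{dN^{2}}{qX^{2}}+\frac{fN}{qX}+\frac{fq}{d^{2}NX}\right)^{\frac{1}{6}}\right).
\end{aligned}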

We conclude by noting that

\frac{NX}{q}\left(\frac{dN}{X^{3}}+\frac{f}{X^{2}}+\frac{f}{d^{2}}\right)^{\frac{1}{6}}\ll 1\qquad\text{for}\qquad X\ll\min\left(\frac{q^{2}}{d^{1/3}N^{7/3}},\frac{q^{3/2}}{f^{1/4}N^{3/2}},\frac{qd^{1/3}}{f^{1/6}N}\right),

and

\frac{NX}{q}\left(\frac{dN^{2}}{qX^{2}}+\frac{fN}{qX}+\frac{fq}{d^{2}NX}\right)^{\frac{1}{6}}\ll 1\qquad\text{for}\qquad X\ll\min\left(\frac{q^{7/4}}{d^{1/4}N^{2}},\frac{q^{7/5}}{f^{1/5}N^{7/5}},\frac{qd^{2/5}}{f^{1/5}N}\right).

This covers the range in (9.4). ∎
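The exponent bookkeeping in the last two displays can be double-checked mechanically. The following Python sketch is not part of the argument: it stores each monomial in $q,d,f,N,X$ as a vector of rational exponents (the helper names `mono`, `mul`, `power`, `threshold_of` and the list `PAIRS` are our own, hypothetical choices), solves $\frac{NX}{q}T^{1/6}=1$ for $X$ for each term $T$ inside the sixth roots, and compares the result with the corresponding threshold in the minima. Checking term by term suffices, since a sum of three positive terms is comparable to its largest term.

```python
from fractions import Fraction as Fr

VARS = ("q", "d", "f", "N", "X")

def mono(**exps):
    """A monomial in q, d, f, N, X, stored as a dict of rational exponents."""
    return {v: Fr(exps.get(v, 0)) for v in VARS}

def mul(a, b):
    return {v: a[v] + b[v] for v in VARS}

def power(a, e):
    return {v: a[v] * e for v in VARS}

def threshold_of(term):
    """Solve (N X / q) * term^(1/6) = 1 for X and return X as a monomial."""
    m = mul(mono(N=1, X=1, q=-1), power(term, Fr(1, 6)))
    e = m["X"]
    assert e > 0, "the left-hand side should grow with X"
    rest = dict(m, X=Fr(0))        # m = rest * X^e
    return power(rest, -1 / e)     # hence X = rest^(-1/e)

# (term inside the sixth root, claimed threshold for X), in the order of the two displays
PAIRS = [
    (mono(d=1, N=1, X=-3),             mono(q=2, d=Fr(-1, 3), N=Fr(-7, 3))),
    (mono(f=1, X=-2),                  mono(q=Fr(3, 2), f=Fr(-1, 4), N=Fr(-3, 2))),
    (mono(f=1, d=-2),                  mono(q=1, d=Fr(1, 3), f=Fr(-1, 6), N=-1)),
    (mono(d=1, N=2, q=-1, X=-2),       mono(q=Fr(7, 4), d=Fr(-1, 4), N=-2)),
    (mono(f=1, N=1, q=-1, X=-1),       mono(q=Fr(7, 5), f=Fr(-1, 5), N=Fr(-7, 5))),
    (mono(f=1, q=1, d=-2, N=-1, X=-1), mono(q=1, d=Fr(2, 5), f=Fr(-1, 5), N=-1)),
]

all_match = all(threshold_of(term) == claimed for term, claimed in PAIRS)
print(all_match)
```

Exact rational arithmetic is used here instead of floating point, so that equality of exponents can be tested directly.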

Proof of Corollary 1.6.

If $q$ has a divisor $d\asymp\sqrt{q}$ such that $\tfrac{q}{d}$ is square-free, then we can take $d^{\prime}=(d,\tfrac{q}{d})$ and $f=d\asymp\sqrt{q}$ in Theorem 9.4, so (9.2) holds for any positive

X\ll 1+\frac{q}{N}+\min\left(\frac{q^{11/6}}{N^{7/3}},\frac{q^{11/8}}{N^{3/2}},\frac{q^{13/12}}{N}\right)+\min\left(\frac{q^{13/8}}{N^{2}},\frac{q^{13/10}}{N^{7/5}},\frac{q^{11/10}}{N}\right).

If additionally $N\ll q^{1/2+o(1)}$ (as Corollary 1.6 assumes), then we can take $X=q^{3/5}$ , since this is only larger by a factor of $q^{o(1)}$ than the second minimum above. ∎
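The arithmetic behind this specialization can be reproduced with a few lines of exact rational arithmetic. The sketch below is only a sanity check under the simplifying assumption that $d$, $f$ and $N$ are exact powers of $q$ (ignoring constants and $q^{o(1)}$ factors); the names `FIRST_MIN`, `SECOND_MIN`, `q_exponent` and `minima` are our own. It substitutes $d=q^{\delta}$, $f=q^{\varphi}$, $N=q^{\nu}$ into the six thresholds of (9.3) and reads off the resulting powers of $q$.

```python
from fractions import Fraction as Fr

# Exponents (q, d, f, N) of the six thresholds in the two minima of (9.3).
FIRST_MIN = [
    (Fr(2), Fr(-1, 3), Fr(0), Fr(-7, 3)),
    (Fr(3, 2), Fr(0), Fr(-1, 4), Fr(-3, 2)),
    (Fr(1), Fr(1, 3), Fr(-1, 6), Fr(-1)),
]
SECOND_MIN = [
    (Fr(7, 4), Fr(-1, 4), Fr(0), Fr(-2)),
    (Fr(7, 5), Fr(0), Fr(-1, 5), Fr(-7, 5)),
    (Fr(1), Fr(2, 5), Fr(-1, 5), Fr(-1)),
]

def q_exponent(threshold, delta, phi, nu):
    """Exponent of q after substituting d = q^delta, f = q^phi, N = q^nu."""
    a, b, c, e = threshold
    return a + b * delta + c * phi + e * nu

def minima(delta, phi, nu):
    """Exponents of q for the first and second minimum in (9.3)."""
    first = min(q_exponent(t, delta, phi, nu) for t in FIRST_MIN)
    second = min(q_exponent(t, delta, phi, nu) for t in SECOND_MIN)
    return first, second

half = Fr(1, 2)
# d = f = q^(1/2) with N left free: the powers of q in the display of the proof.
q_powers = [q_exponent(t, half, half, 0) for t in FIRST_MIN + SECOND_MIN]
print(q_powers)
# d = f = N = q^(1/2): the two minima, as powers of q.
print(minima(half, half, half))
```

With $\delta=\varphi=\nu=\tfrac{1}{2}$ , the two minima come out as $q^{7/12}$ and $q^{3/5}$ , both beyond the range $X\ll 1+\tfrac{q}{N}\asymp q^{1/2}$ of Theorem 9.3.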

Appendix A Some necessary computations in $\textnormal{SL}_{2}(\mathbb{Z}/c\mathbb{Z})$

Here we fill in some details involving explicit matrix computations, subgroups, and characters of $\textnormal{SL}_{2}(\mathbb{Z}/c\mathbb{Z})$ . In particular, we prove Lemmas 4.8, 5.4 and 5.2.

Proof of Lemma 4.8.

The first equality from (4.6) follows immediately from (4.4). Now let $T_{c}(d):V_{c}\to V_{c}$ denote the map in the (extreme) right-hand side of (4.6); we will show that $T_{c}(d)=P_{c}(d)$ .

The fact that $\Gamma_{c}(d)$ is a subgroup quickly implies that $T_{c}(d)$ is self-adjoint and that $T_{c}(d)^{2}=T_{c}(d)$ , so $T_{c}(d)$ is an orthogonal projection. Moreover, one has $\rho_{c}(n)T_{c}(d)=T_{c}(d)$ for any $n\in\Gamma_{c}(d)$ , so any $T_{c}(d)f\in T_{c}(d)V_{c}$ has $\rho_{c}(n)T_{c}(d)f=T_{c}(d)f$ , which shows $T_{c}(d)V_{c}\subset V_{c}(d)$ . Conversely, if $f\in V_{c}(d)$ , so $\rho_{c}(n)f=f$ for all $n\in\Gamma_{c}(d)$ , then clearly $f=T_{c}(d)f$ , which shows $V_{c}(d)\subset T_{c}(d)V_{c}$ . Thus $T_{c}(d)$ is the orthogonal projection onto $T_{c}(d)V_{c}=V_{c}(d)$ , i.e., $T_{c}(d)=P_{c}(d)$ .

The claim about commutativity follows directly from (4.6) and the normality of $\Gamma_{c}(d)$ .

Finally, let us prove (4.7). It follows from (4.6) and (3.17) that

P_{c}(d)_{u,v}=\frac{1}{|\Gamma_{c}(d)|}\sum_{n\in\Gamma_{c}(d)}\mathbbm{1}_{u=nv}=\frac{d^{3}}{c^{3}}\mathbbm{1}_{u\in\Gamma_{c}(d)\cdot v}|\Gamma_{c}(d)_{u}|,

(A.1)

where $\Gamma_{c}(d)_{u}$ denotes the stabilizer of $u$ inside $\Gamma_{c}(d)$ (indeed, once $u=n_{0}v$ for some $n_{0}\in\Gamma_{c}(d)$ , all other solutions to $u=nv$ satisfy $n_{0}n^{-1}u=u$ , so $n\in\Gamma_{c}(d)_{u}n_{0}$ ). By the normality of $\Gamma_{c}(d)$ , we see that for any $g\in\textnormal{SL}_{2}(\mathbb{Z}/c\mathbb{Z})$ ,

\begin{aligned}
|\Gamma_{c}(d)_{gu}|&=\#\{n\in\Gamma_{c}(d):ngu=gu\}\\
&=\#\{n\in\Gamma_{c}(d):(g^{-1}ng)u=u\}=|(g^{-1}\Gamma_{c}(d)g)_{u}|=|\Gamma_{c}(d)_{u}|,
\end{aligned}

so by the transitivity of the projective action,

|\Gamma_{c}(d)_{u}|=|\Gamma_{c}(d)_{[1:0]}|=\#\left\{\begin{pmatrix}q&r\\ s&t\end{pmatrix}\in\Gamma_{c}(d):[q:s]=[1:0]\right\}.

Note that $[q:s]=[1:0]$ simply means $(q,s)=(\alpha,0)$ for some unit $\alpha\in(\mathbb{Z}/c\mathbb{Z})^{\times}$ . This forces $s=0$ , $q\in(\mathbb{Z}/c\mathbb{Z})^{\times}$ , and $t=\overline{q}$ ; moreover, given such a choice of $q,s,t$ , any $r\in d\mathbb{Z}/c\mathbb{Z}$ gives a solution. Since there are $\phi(c)/\phi(d)$ choices of $q$ in the kernel of $(\mathbb{Z}/c\mathbb{Z})^{\times}\to(\mathbb{Z}/d\mathbb{Z})^{\times}$ , and $\tfrac{c}{d}$ choices of $r$ in $d\mathbb{Z}/c\mathbb{Z}\cong\mathbb{Z}/\tfrac{c}{d}\mathbb{Z}$ , we find that

|\Gamma_{c}(d)_{u}|=\frac{\phi(c)}{\phi(d)}\cdot\frac{c}{d},

and plugging this into (A.1) proves (4.7). ∎
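The stabilizer count above is easy to confirm by brute force for small moduli. The following Python sketch is an illustration only; it assumes, as in the computation above, that $\Gamma_{c}(d)$ consists of the matrices in $\textnormal{SL}_{2}(\mathbb{Z}/c\mathbb{Z})$ congruent to the identity modulo $d$ , and the function names `phi`, `stabilizer_size` and `predicted_size` are our own.

```python
from itertools import product
from math import gcd

def phi(n):
    """Euler's totient function, computed naively."""
    return sum(1 for a in range(1, n + 1) if gcd(a, n) == 1)

def stabilizer_size(c, d):
    """Count g = (q r; s t) in SL_2(Z/cZ) with g = I (mod d) and g[1:0] = [1:0]."""
    assert c % d == 0
    count = 0
    for q, r, s, t in product(range(c), repeat=4):
        if (q * t - r * s) % c != 1 % c:
            continue  # not in SL_2(Z/cZ)
        if q % d != 1 % d or t % d != 1 % d or r % d != 0 or s % d != 0:
            continue  # not congruent to the identity mod d
        if s != 0:
            continue  # g[1:0] = [q:s] equals [1:0] exactly when s = 0
        count += 1
    return count

def predicted_size(c, d):
    """The value phi(c)/phi(d) * c/d obtained in the proof."""
    return phi(c) // phi(d) * (c // d)

for c, d in [(4, 2), (8, 2), (9, 3), (12, 4), (12, 6), (12, 1), (6, 6)]:
    print(c, d, stabilizer_size(c, d), predicted_size(c, d))
```

The loop runs over all $c^{4}$ matrices, so it is only meant for very small $c$ .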

Proof of Lemma 5.4.

Set $G:=\textnormal{SL}_{2}(\mathbb{Z}/p^{k}\mathbb{Z})$ and $N:=\Gamma_{p^{k}}(p^{j})$ , so $N\triangleleft G$ . Say $\chi=\textnormal{Tr}\rho$ where $\rho\in\widehat{G}$ is primitive. By (3.9), we have

\frac{1}{|N|}\sum_{n\in N}|\chi(n)|^{2}=\sum_{\rho_{0}\in\widehat{N}}\textnormal{Mult}(\rho_{0},\rho|_{N})^{2}.

By Lemma 3.9, $\rho|_{N}$ contains $L$ irreducible representations of $N$ , each with multiplicity $m$ , for some positive integers $L,m$ . Thus

\sum_{\rho_{0}\in\widehat{N}}\textnormal{Mult}(\rho_{0},\rho|_{N})^{2}=Lm^{2},

and in light of (3.17), it remains to show that

Lm^{2}\gg(k-j+1)^{-1}p^{2j}.

(A.2)

If $j=0$ , this is a trivial statement. Suppose now that $\tfrac{k}{2}\leq j\leq k$ . Then

\begin{aligned}
N=\Gamma_{p^{k}}(p^{j})&=\left\{I+p^{j}A:A\in(\mathbb{Z}/p^{k-j}\mathbb{Z})^{2\times 2},\ \det(I+p^{j}A)\equiv 1\ (\textnormal{mod }p^{k})\right\}\\
&=\left\{I+p^{j}A:A\in(\mathbb{Z}/p^{k-j}\mathbb{Z})^{2\times 2},\ \textnormal{Tr}(A)\equiv 0\ (\textnormal{mod }p^{k-j})\right\}
\end{aligned}

is abelian, since $(I+p^{j}A)(I+p^{j}B)=I+p^{j}(A+B)$ for $j\geq\tfrac{k}{2}$ . In fact, this shows that

(N,\cdot)\cong\left(\left\{A\in(\mathbb{Z}/p^{k-j}\mathbb{Z})^{2\times 2}:\textnormal{Tr}(A)=0\right\},+\right)\cong(\widehat{N},\cdot).

In particular, all irreducible representations of $N$ are $1$ -dimensional, and can be expressed as

\sigma_{B}(I+p^{j}A):=e\left(\frac{\textnormal{Tr}(AB)}{p^{k-j}}\right),\qquad B\in(\mathbb{Z}/p^{k-j}\mathbb{Z})^{2\times 2},\ \textnormal{Tr}(B)=0.

(A.3)

Now equating dimensions and using Lemma 3.16, we find that

Lm=\dim\rho\gg p^{k}\qquad\iff\qquad Lm^{2}\gg\frac{p^{2k}}{L}.

(A.4)

We will finish by finding an upper bound on $L$ . By the conclusion of Lemma 3.9, all $L$ non-isomorphic representations in the decomposition of $\rho|_{N}$ lie in the same orbit of $G$ ’s action by conjugation. For $g\in G$ and $B,\sigma_{B}$ as in (A.3), we have

\begin{aligned}
\sigma_{B}(g(I+p^{j}A)g^{-1})=\sigma_{B}(I+p^{j}gAg^{-1})&=e\left(\frac{\textnormal{Tr}(gAg^{-1}B)}{p^{k-j}}\right)\\
&=e\left(\frac{\textnormal{Tr}(Ag^{-1}Bg)}{p^{k-j}}\right)=\sigma_{g^{-1}Bg}(I+p^{j}A).
\end{aligned}

In other words, the action of $G$ by conjugation on irreducible representations of $N$ corresponds to conjugation of the underlying matrices $B$ . It follows that $L$ is at most the maximal size of an orbit in the set

\left\{B\in(\mathbb{Z}/p^{k-j}\mathbb{Z})^{2\times 2}:\textnormal{Tr}(B)=0\right\}

under conjugation by $\textnormal{SL}_{2}(\mathbb{Z}/p^{k}\mathbb{Z})$ , or equivalently by $\textnormal{SL}_{2}(\mathbb{Z}/p^{k-j}\mathbb{Z})$ . Since conjugation preserves the determinant $\Delta=\det(B)$ , we find that

L\leq\max_{\Delta\in\mathbb{Z}/p^{k-j}\mathbb{Z}}\#\left\{B\in(\mathbb{Z}/p^{k-j}\mathbb{Z})^{2\times 2}:\textnormal{Tr}(B)=0,\ \det(B)=\Delta\right\}.

Writing $B=\left(\begin{smallmatrix}x&y\\ z&-x\end{smallmatrix}\right)$ , we further get

\displaystyle L\leq\max_{\Delta\in\mathbb{Z}/p^{k-j}\mathbb{Z}}\sum_{x,y,z\in\mathbb{Z}/p^{k-j}\mathbb{Z}}\mathbbm{1}_{-x^{2}-yz=\Delta}\leq p^{k-j}\max_{a\in\mathbb{Z}/p^{k-j}\mathbb{Z}}\sum_{y,z\in\mathbb{Z}/p^{k-j}\mathbb{Z}}\mathbbm{1}_{yz=a},

where we substituted $a:=-x^{2}-\Delta$ . Now given $a\in\mathbb{Z}/p^{k-j}\mathbb{Z}$ , write $a=p^{\ell}a^{\prime}$ for some $0\leq\ell\leq k-j$ and $a^{\prime}\in(\mathbb{Z}/p^{k-j-\ell}\mathbb{Z})^{\times}$ . The equation $yz=a$ then implies

y=p^{\ell_{y}}y^{\prime},\qquad z=p^{\ell_{z}}z^{\prime},\qquad y^{\prime}z^{\prime}=a^{\prime},

for some $\ell_{y},\ell_{z}\geq 0$ with $\ell_{y}+\ell_{z}=\ell$ , and some $y^{\prime}\in(\mathbb{Z}/p^{k-j-\ell_{y}}\mathbb{Z})^{\times}$ , $z^{\prime}\in(\mathbb{Z}/p^{k-j-\ell_{z}}\mathbb{Z})^{\times}$ . There are $\ell+1\leq k-j+1$ choices of $(\ell_{y},\ell_{z})$ , and for every choice of $(\ell_{y},\ell_{z},y^{\prime})$ , there are at most $p^{k-j-\ell_{z}}/p^{k-j-\ell}=p^{\ell_{y}}$ choices of $z^{\prime}$ (since $z^{\prime}\ (\textnormal{mod }p^{k-j-\ell})$ is fixed). Putting these counts together, we obtain

L\leq p^{k-j}(k-j+1)\max_{\ell_{y}+\ell_{z}=\ell\leq k}p^{k-j-\ell_{y}}p^{\ell_{y}}\ll(k-j+1)p^{2k-2j}.

Combining this with (A.4) establishes the desired bound from (A.2). ∎
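The two counting bounds used at the end of this proof can be tested numerically for small prime powers. In the Python sketch below (a sanity check, not part of the proof), we write $n$ for $k-j$ , and the function names `max_hyperbola_count` and `max_trace_zero_count` are our own. The first function computes $\max_{a}\#\{(y,z):yz=a\}$ in $\mathbb{Z}/p^{n}\mathbb{Z}$ , to be compared with $(n+1)p^{n}$ ; the second computes the largest number of trace-zero matrices with a given determinant, to be compared with $(n+1)p^{2n}$ .

```python
from collections import Counter

def max_hyperbola_count(p, n):
    """max over a of #{(y, z) mod p^n : y z = a}."""
    m = p ** n
    counts = Counter((y * z) % m for y in range(m) for z in range(m))
    return max(counts.values())

def max_trace_zero_count(p, n):
    """max over Delta of #{B = (x y; z -x) mod p^n : det(B) = Delta}."""
    m = p ** n
    counts = Counter(
        (-x * x - y * z) % m
        for x in range(m) for y in range(m) for z in range(m)
    )
    return max(counts.values())

for p, n in [(2, 1), (2, 2), (2, 3), (3, 1), (3, 2), (5, 1)]:
    print(p, n,
          max_hyperbola_count(p, n), (n + 1) * p ** n,
          max_trace_zero_count(p, n), (n + 1) * p ** (2 * n))
```

Both functions enumerate all residues, so the running time grows like $p^{3n}$ and only small cases are practical.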

Proof of Lemma 5.2.

From (4.3), it follows that $\chi_{c}(g)=\prod_{p^{k}\|c}\chi_{p^{k}}(\pi_{c,p^{k}}(g))$ . Working locally at a prime $p|c$ , with say $p^{k}\|c$ and $p^{j}\|d$ , we will establish the bound

\chi_{p^{k}}(g)\ll p^{\left\lfloor\frac{k+j}{2}\right\rfloor},

(A.5)

for all $g\in\textnormal{SL}_{2}(\mathbb{Z}/p^{k}\mathbb{Z})$ such that $p^{j}$ is the largest $p$ -power for which $g\in\{\gamma\in\mathbb{Z}/p^{k}\mathbb{Z}:\gamma^{2}=1\}\cdot\Gamma_{p^{k}}(p^{j})$ . Given (A.5), the desired bound in (5.1) follows from the divisor bound.

Since $\rho_{p^{k}}(g)$ is a permutation map, $\chi_{p^{k}}(g)=\textnormal{Tr}\rho_{p^{k}}(g)$ equals the number of fixed points of $g$ in $\mathbb{P}^{1}(\mathbb{Z}/p^{k}\mathbb{Z})$ , i.e., the number of solutions in $u\in\mathbb{P}^{1}(\mathbb{Z}/p^{k}\mathbb{Z})$ to $gu=u$ . Let us write $g=\left(\begin{smallmatrix}q&r\\ s&t\end{smallmatrix}\right)$ and $u=[x:y]$ for some integers $q,r,s,t,x,y$ with $qt-rs\equiv 1\ (\textnormal{mod }p^{k})$ and $(x,y,p)=1$ . Scaling both entries of $u$ by a unit in $(\mathbb{Z}/p^{k}\mathbb{Z})^{\times}$ , we can assume without loss of generality that $x=1$ or $y=1$ ; in fact, replacing $\left(\begin{smallmatrix}q&r\\ s&t\end{smallmatrix}\right)\leftrightarrow\left(\begin{smallmatrix}t&s\\ r&q\end{smallmatrix}\right)$ if necessary, we may assume that $y=1$ . Then the equality $gu=u$ means that for some $\alpha\in\mathbb{Z}$ , one has

\begin{pmatrix}q&r\\ s&t\end{pmatrix}\begin{pmatrix}x\\ 1\end{pmatrix}\equiv\alpha\begin{pmatrix}x\\ 1\end{pmatrix}\ (\textnormal{mod }p^{k})\qquad\Rightarrow\qquad qx+r\equiv(sx+t)x\ (\textnormal{mod }p^{k}).

This gives the quadratic congruence

sx^{2}+(t-q)x-r\equiv 0\ (\textnormal{mod }p^{k}).

Now from our assumption that $g\in\gamma\Gamma_{p^{k}}(p^{j})$ for some $\gamma\in\mathbb{Z}/p^{k}\mathbb{Z}$ with $\gamma^{2}=1$ , we know that $p^{j}\mid s$ , $p^{j}\mid r$ , and $p^{j}\mid t-q$ (since $t\equiv\gamma\equiv q\ (\textnormal{mod }p^{j})$ ). In fact, $p^{j}$ is the largest $p$ -power with this property (otherwise, we could pick some $\gamma\equiv q\ (\textnormal{mod }p^{j+1})$ such that $g\in\gamma\Gamma_{p^{k}}(p^{j+1})$ ). Therefore, letting $a_{2}:=sp^{-j}$ , $a_{1}:=(t-q)p^{-j}$ and $a_{0}:=-rp^{-j}$ , we find that

a_{2}x^{2}+a_{1}x+a_{0}\equiv 0\ (\textnormal{mod }p^{k-j}),

(A.6)

where $a_{0},a_{1},a_{2}$ are not all divisible by $p$ . It now remains to show that this equation has

O\left(p^{\left\lfloor\frac{k-j}{2}\right\rfloor}\right)

solutions in $x\ (\textnormal{mod }p^{k-j})$ ; every such solution will have $p^{j}$ lifts to $\mathbb{Z}/p^{k}\mathbb{Z}$ , inducing a total of $O(p^{\left\lfloor(k-j)/2\right\rfloor+j})=O(p^{\left\lfloor(k+j)/2\right\rfloor})$ fixed points $u=[x:1]$ of $g$ .

Case 1. $p\nmid a_{2}$ . Then given any two solutions $x_{0},x$ of (A.6), we can subtract the two congruences to obtain

p^{k-j}\mid a_{2}(x^{2}-x_{0}^{2})+a_{1}(x-x_{0})=(x-x_{0})(a_{2}(x+x_{0})+a_{1}).

(A.7)

Let $\ell:=\left\lceil(k-j)/2\right\rceil$ . By the pigeonhole principle, we must have $p^{\ell}\mid x-x_{0}$ or $p^{\ell}\mid a_{2}(x+x_{0})+a_{1}$ . Since $p\nmid a_{2}$ , either option uniquely determines $x\ (\textnormal{mod }p^{\ell})$ in terms of $x_{0}$ . Fixing $x_{0}$ , all solutions thus lie in at most two residue classes modulo $p^{\ell}$ , so there can be at most

2\cdot\frac{p^{k-j}}{p^{\ell}}=2p^{(k-j)-\left\lceil\frac{k-j}{2}\right\rceil}=2p^{\left\lfloor\frac{k-j}{2}\right\rfloor}

solutions in $x\ (\textnormal{mod }p^{k-j})$ .

Case 2. $p\nmid a_{1}$ . Given the previous case, we can assume $p\mid a_{2}$ . Then $p\nmid a_{2}(x+x_{0})+a_{1}$ , so from (A.7) we find that $p^{k-j}\mid x-x_{0}$ , forcing only one solution in $x\ (\textnormal{mod }p^{k-j})$ .

Case 3. $p\nmid a_{0}$ . Then (A.6) implies $p\nmid x$ , and by substituting $x\leftrightarrow\overline{x}\ (\textnormal{mod }p^{k-j})$ , we reduce to the case $p\nmid a_{2}$ . ∎
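The root count for the quadratic congruence can also be verified exhaustively for small prime powers. The Python sketch below is an illustration only, with $n$ standing for $k-j$ and with function names `count_roots` and `worst_case` of our own choosing. It runs over all coefficient triples $(a_{2},a_{1},a_{0})$ modulo $p^{n}$ that are not all divisible by $p$ , and records the largest number of roots, which the case analysis above bounds by $2p^{\lfloor n/2\rfloor}$ (that is, the $O(p^{\lfloor n/2\rfloor})$ bound with an explicit constant).

```python
from itertools import product

def count_roots(a2, a1, a0, p, n):
    """Number of x mod p^n with a2 x^2 + a1 x + a0 = 0 (mod p^n)."""
    m = p ** n
    return sum(1 for x in range(m) if (a2 * x * x + a1 * x + a0) % m == 0)

def worst_case(p, n):
    """Largest root count over coefficient triples not all divisible by p."""
    m = p ** n
    best = 0
    for a2, a1, a0 in product(range(m), repeat=3):
        if a2 % p == 0 and a1 % p == 0 and a0 % p == 0:
            continue  # excluded: the coefficients must not all be divisible by p
        best = max(best, count_roots(a2, a1, a0, p, n))
    return best

for p, n in [(2, 1), (2, 2), (2, 3), (3, 1), (3, 2), (5, 1)]:
    print(p, n, worst_case(p, n), 2 * p ** (n // 2))
```

For instance, $x^{2}\equiv 1\ (\textnormal{mod }8)$ has the four roots $1,3,5,7$ , which shows that the constant $2$ cannot be dropped in general.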

References

[1] Valentin Blomer, Étienne Fouvry, Emmanuel Kowalski, Philippe Michel, and Djordje Milićević. On moments of twisted $L$ -functions. Amer. J. Math., 139(3):707–768, 2017.
[2] Valentin Blomer, Étienne Fouvry, Emmanuel Kowalski, Philippe Michel, Djordje Milićević, and Will Sawin. The second moment theory of families of $L$ -functions—the case of twisted Hecke $L$ -functions. Mem. Amer. Math. Soc., 282(1394):v+148, 2023.
[3] Valentin Blomer and Djordje Milićević. The second moment of twisted modular $L$ -functions. Geom. Funct. Anal., 25(2):453–516, 2015.
[4] Enrico Bombieri, John B. Friedlander, and Henryk Iwaniec. Primes in arithmetic progressions to large moduli. Acta Math., 156(3-4):203–251, 1986.
[5] Jean Bourgain and Alex Gamburd. Expansion and random walks in ${\rm SL}_{d}(\mathbb{Z}/p^{n}\mathbb{Z})$ . I. J. Eur. Math. Soc. (JEMS), 10(4):987–1011, 2008.
[6] A. H. Clifford. Representations induced in an invariant subgroup. Ann. of Math. (2), 38(3):533–550, 1937.
[7] Régis de La Bretèche and Sary Drappeau. Niveau de répartition des polynômes quadratiques et crible majorant pour les entiers friables. J. Eur. Math. Soc., 22(5):1577–1624, 2020.
[8] Pierre Deligne. La conjecture de Weil. I. Inst. Hautes Études Sci. Publ. Math., (43):273–307, 1974.
[9] J.-M. Deshouillers and H. Iwaniec. Power mean values of the Riemann zeta function. Mathematika, 29(2):202–212, 1982.
[10] J.-M. Deshouillers and H. Iwaniec. Power mean-values for Dirichlet’s polynomials and the Riemann zeta-function. II. Acta Arith., 43(3):305–312, 1984.
[11] Jean-Marc Deshouillers and Henryk Iwaniec. Kloosterman sums and Fourier coefficients of cusp forms. Invent. Math., 70(2):219–288, 1982.
[12] Sary Drappeau, Kyle Pratt, and Maksym Radziwiłł. One-level density estimates for Dirichlet $L$ -functions with extended support. Algebra Number Theory, 17(4):805–830, 2023.
[13] William Duke, John Friedlander, and Henryk Iwaniec. Bilinear forms with Kloosterman fractions. Invent. Math., 128(1):23–43, 1997.
[14] Étienne Fouvry, Emmanuel Kowalski, and Philippe Michel. Algebraic trace functions over the primes. Duke Math. J., 163(9):1683–1736, 2014.
[15] Étienne Fouvry, Emmanuel Kowalski, Philippe Michel, and Will Sawin. Lectures on applied $\ell$ -adic cohomology. In Analytic methods in arithmetic geometry, volume 740 of Contemp. Math., pages 113–195. Amer. Math. Soc., Providence, RI, 2019.
[16] William Fulton and Joe Harris. Representation theory, volume 129 of Graduate Texts in Mathematics. Springer-Verlag, New York, 1991. A first course, Readings in Mathematics.
[17] Lasse Grimmelt and Jori Merikoski. On the greatest prime factor and uniform equidistribution of quadratic polynomials. Preprint, arXiv:2505.00493, 2025.
[18] Larry Guth and James Maynard. New large value estimates for Dirichlet polynomials. Ann. of Math., to appear. Preprint, arXiv:2405.20552, 2024.
[19] H. A. Helfgott. Growth and generation in ${\rm SL}_{2}(\mathbb{Z}/p\mathbb{Z})$ . Ann. of Math. (2), 167(2):601–623, 2008.
[20] I. Martin Isaacs. Character theory of finite groups. AMS Chelsea Publishing, Providence, RI, 2006. Corrected reprint of the 1976 original [Academic Press, New York; MR0460423].
[21] Henryk Iwaniec and Emmanuel Kowalski. Analytic number theory, volume 53. American Mathematical Society, Providence, RI, 2021.
[22] Bryce Kerr, Igor E. Shparlinski, Xiaosheng Wu, and Ping Xi. Bounds on bilinear forms with Kloosterman sums. J. Lond. Math. Soc. (2), 108(2):578–621, 2023.
[23] Henry H. Kim. Functoriality for the exterior square of ${\rm GL}_{4}$ and the symmetric fourth of ${\rm GL}_{2}$ . J. Amer. Math. Soc., 16(1):139–183, 2003. With Appendix 1 by Dinakar Ramakrishnan and Appendix 2 by Kim and Peter Sarnak.
[24] Emmanuel Kowalski, Philippe Michel, and Will Sawin. Bilinear forms with Kloosterman sums and applications. Ann. of Math. (2), 186(2):413–500, 2017.
[25] Emmanuel Kowalski, Philippe Michel, and Will Sawin. Stratification and averaging for exponential sums: bilinear forms with generalized Kloosterman sums. Ann. Sc. Norm. Super. Pisa Cl. Sci. (5), 21:1453–1530, 2020.
[26] Philip C. Kutzko. The characters of the binary modular congruence group. Bull. Amer. Math. Soc., 79:702–704, 1973.
[27] Nikolai V. Kuznetsov. The Petersson conjecture for cusp forms of weight zero and the Linnik conjecture. Sums of Kloosterman sums. Mat. Sb. (N.S.), 111(153)(3):334–383, 479, 1980.
[28] James Maynard. Primes in Arithmetic Progressions to Large Moduli I: Fixed Residue Classes. Mem. Amer. Math. Soc., 306(1542), 2025.
[29] Djordje Milićević, Xinhua Qin, and Xiaosheng Wu. Bilinear forms with Kloosterman sums and moments of twisted $L$ -functions. arXiv preprint, November 2025.
[30] Nikolay G. Moshchevitin and Ilya D. Shkredov. On a modular form of Zaremba’s conjecture. Pacific J. Math., 309(1):195–211, 2020.
[31] Alexandre Nobs and Jürgen Wolfart. Die irreduziblen Darstellungen der Gruppen $SL_{2}(Z_{p})$ , insbesondere $SL_{2}(Z_{2})$ . II. Comment. Math. Helv., 51(4):491–526, 1976.
[32] Alexandru Pascadi. Large sieve inequalities for exceptional Maass forms and the greatest prime factor of $n^{2}+1$ . Forum Math. Pi, to appear. Preprint, arXiv:2404.04239, 2025.
[33] Alexandru Pascadi. On the exponents of distribution of primes and smooth numbers. Preprint, arXiv:2505.00653, 2025.
[34] Atle Selberg. On the estimation of Fourier coefficients of modular forms. In Proc. Sympos. Pure Math., Vol. VIII, pages 1–15. Amer. Math. Soc., Providence, RI, 1965.
[35] Jean-Pierre Serre. Linear representations of finite groups, volume 42 of Graduate Texts in Mathematics. Springer-Verlag, New York-Heidelberg, French edition, 1977.
[36] Joseph A. Shalika. Representation of the two by two unimodular group over local fields. In Contributions to automorphic forms, geometry, and number theory, pages 1–38. Johns Hopkins Univ. Press, Baltimore, MD, 2004.
[37] I. D. Shkredov. On asymptotic formulae in some sum-product questions. Trans. Moscow Math. Soc., 79:231–281, 2018.
[38] I. D. Shkredov. Modular hyperbolas and bilinear forms of Kloosterman sums. J. Number Theory, 220:182–211, 2021.
[39] Igor E. Shparlinski. On sums of Kloosterman and Gauss sums. Trans. Amer. Math. Soc., 371(12):8679–8697, 2019.
[40] Igor E. Shparlinski and Tianping Zhang. Cancellations amongst Kloosterman sums. Acta Arith., 176(3):201–210, 2016.
[41] Shunichi Tanaka. Irreducible representations of the binary modular congruence groups ${\rm mod}\ p^{\lambda}$ . J. Math. Kyoto Univ., 7:123–132, 1967.
[42] Audrey Terras. Fourier analysis on finite groups and applications, volume 43 of London Mathematical Society Student Texts. Cambridge University Press, Cambridge, 1999.
[43] Berke Topacogullari. The shifted convolution of generalized divisor functions. Int. Math. Res. Not. IMRN, (24):7681–7724, 2018.
[44] Jie Wu and Ping Xi. Arithmetic exponent pairs for algebraic trace functions and applications. Algebra Number Theory, 15(9):2123–2172, 2021. With an appendix by Will Sawin.
[45] Xiaosheng Wu. The fourth moment of Dirichlet $L$ -functions at the central value. Math. Ann., 387(3-4):1199–1248, 2023.
[46] Ping Xi. Ternary divisor functions in arithmetic progressions to smooth moduli. Mathematika, 64(3):701–729, 2018.
[47] Matthew P. Young. The fourth moment of Dirichlet $L$ -functions. Ann. of Math. (2), 173(1):1–50, 2011.

