In mathematics, a modular form is a (complex) analytic function on the upper half-plane, , that satisfies:
The theory of modular forms therefore belongs to complex analysis. The main importance of the theory is its connections with number theory. Modular forms appear in other areas, such as algebraic topology, sphere packing, and string theory.
Modular form theory is a special case of the more general theory of automorphic forms, which are functions defined on Lie groups that transform nicely with respect to the action of certain discrete subgroups, generalizing the example of the modular group .
The term "modular form", as a systematic description, is usually attributed to Hecke.
Each modular form is attached to a Galois representation.^{[1]}
In general,^{[2]} given a subgroup of finite index, called an arithmetic group, a modular form of level and weight is a holomorphic function from the upper half-plane such that two conditions are satisfied:
where and the function is identified with the matrix The identification of such functions with such matrices causes composition of such functions to correspond to matrix multiplication. In addition, it is called a cusp form if it satisfies the following growth condition:
Modular forms can also be interpreted as sections of a specific line bundle on modular varieties. For a modular form of level and weight can be defined as an element of
where is a canonical line bundle on the modular curve
The dimensions of these spaces of modular forms can be computed using the Riemann–Roch theorem.^{[3]} The classical modular forms for are sections of a line bundle on the moduli stack of elliptic curves.
A modular function is a function that is invariant with respect to the modular group, but without the condition that f (z) be holomorphic in the upper half-plane (among other requirements). Instead, modular functions are meromorphic: they are holomorphic on the complement of a set of isolated points, which are poles of the function.
A modular form of weight k for the modular group
is a complex-valued function f on the upper half-plane H = {z ∈ C, Im(z) > 0}, satisfying the following three conditions:
Remarks:
A modular form can equivalently be defined as a function F from the set of lattices in C to the set of complex numbers which satisfies certain conditions:
The key idea in proving the equivalence of the two definitions is that such a function F is determined, because of the second condition, by its values on lattices of the form Z + Zτ, where τ ∈ H.
I. Eisenstein series
The simplest examples from this point of view are the Eisenstein series. For each even integer k > 2, we define G_{k}(Λ) to be the sum of λ^{−k} over all non-zero vectors λ of Λ:
Then G_{k} is a modular form of weight k. For Λ = Z + Zτ we have
and
The condition k > 2 is needed for convergence; for odd k there is cancellation between λ^{−k} and (−λ)^{−k}, so that such series are identically zero.
II. Theta functions of even unimodular lattices
An even unimodular lattice L in R^{n} is a lattice generated by n vectors forming the columns of a matrix of determinant 1 and satisfying the condition that the square of the length of each vector in L is an even integer. The so-called theta function
converges when Im(z) > 0, and as a consequence of the Poisson summation formula can be shown to be a modular form of weight n/2. It is not so easy to construct even unimodular lattices, but here is one way: Let n be an integer divisible by 8 and consider all vectors v in R^{n} such that 2v has integer coordinates, either all even or all odd, and such that the sum of the coordinates of v is an even integer. We call this lattice L_{n}. When n = 8, this is the lattice generated by the roots in the root system called E_{8}. Because there is only one modular form of weight 8 up to scalar multiplication,
even though the lattices L_{8} × L_{8} and L_{16} are not similar. John Milnor observed that the 16-dimensional tori obtained by dividing R^{16} by these two lattices are consequently examples of compact Riemannian manifolds which are isospectral but not isometric (see Hearing the shape of a drum.)
III. The modular discriminant
The Dedekind eta function is defined as
where q is the square of the nome. Then the modular discriminant Δ(z) = (2π)^{12} η(z)^{24} is a modular form of weight 12. The presence of 24 is related to the fact that the Leech lattice has 24 dimensions. A celebrated conjecture of Ramanujan asserted that when Δ(z) is expanded as a power series in q, the coefficient of q^{p} for any prime p has absolute value ≤ 2p^{11/2}. This was confirmed by the work of Eichler, Shimura, Kuga, Ihara, and Pierre Deligne as a result of Deligne's proof of the Weil conjectures, which were shown to imply Ramanujan's conjecture.
The second and third examples give some hint of the connection between modular forms and classical questions in number theory, such as representation of integers by quadratic forms and the partition function. The crucial conceptual link between modular forms and number theory is furnished by the theory of Hecke operators, which also gives the link between the theory of modular forms and representation theory.
When the weight k is zero, it can be shown using Liouville's theorem that the only modular forms are constant functions. However, relaxing the requirement that f be holomorphic leads to the notion of modular functions. A function f : H → C is called modular if it satisfies the following properties:
It is often written in terms of (the square of the nome), as:
This is also referred to as the q-expansion of f (q-expansion principle). The coefficients are known as the Fourier coefficients of f, and the number m is called the order of the pole of f at i∞. This condition is called "meromorphic at the cusp", meaning that only finitely many negative-n coefficients are non-zero, so the q-expansion is bounded below, guaranteeing that it is meromorphic at q = 0. ^{[note 2]}
Sometimes a weaker definition of modular functions is used – under the alternative definition, it is sufficient that f be meromorphic in the open upper half-plane and that f be invariant with respect to a sub-group of the modular group of finite index.^{[4]} This is not adhered to in this article.
Another way to phrase the definition of modular functions is to use elliptic curves: every lattice Λ determines an elliptic curve C/Λ over C; two lattices determine isomorphic elliptic curves if and only if one is obtained from the other by multiplying by some non-zero complex number α. Thus, a modular function can also be regarded as a meromorphic function on the set of isomorphism classes of elliptic curves. For example, the j-invariant j(z) of an elliptic curve, regarded as a function on the set of all elliptic curves, is a modular function. More conceptually, modular functions can be thought of as functions on the moduli space of isomorphism classes of complex elliptic curves.
A modular form f that vanishes at q = 0 (equivalently, a_{0} = 0, also paraphrased as z = i∞) is called a cusp form (Spitzenform in German). The smallest n such that a_{n} ≠ 0 is the order of the zero of f at i∞.
A modular unit is a modular function whose poles and zeroes are confined to the cusps.^{[5]}
The functional equation, i.e., the behavior of f with respect to can be relaxed by requiring it only for matrices in smaller groups.
Let G be a subgroup of SL(2, Z) that is of finite index. Such a group G acts on H in the same way as SL(2, Z). The quotient topological space G\H can be shown to be a Hausdorff space. Typically it is not compact, but can be compactified by adding a finite number of points called cusps. These are points at the boundary of H, i.e. in Q∪{∞},^{[note 3]} such that there is a parabolic element of G (a matrix with trace ±2) fixing the point. This yields a compact topological space G\H^{∗}. What is more, it can be endowed with the structure of a Riemann surface, which allows one to speak of holo- and meromorphic functions.
Important examples are, for any positive integer N, either one of the congruence subgroups
For G = Γ_{0}(N) or Γ(N), the spaces G\H and G\H^{∗} are denoted Y_{0}(N) and X_{0}(N) and Y(N), X(N), respectively.
The geometry of G\H^{∗} can be understood by studying fundamental domains for G, i.e. subsets D ⊂ H such that D intersects each orbit of the G-action on H exactly once and such that the closure of D meets all orbits. For example, the genus of G\H^{∗} can be computed.^{[6]}
A modular form for G of weight k is a function on H satisfying the above functional equation for all matrices in G, that is holomorphic on H and at all cusps of G. Again, modular forms that vanish at all cusps are called cusp forms for G. The C-vector spaces of modular and cusp forms of weight k are denoted M_{k}(G) and S_{k}(G), respectively. Similarly, a meromorphic function on G\H^{∗} is called a modular function for G. In case G = Γ_{0}(N), they are also referred to as modular/cusp forms and functions of level N. For G = Γ(1) = SL(2, Z), this gives back the afore-mentioned definitions.
The theory of Riemann surfaces can be applied to G\H^{∗} to obtain further information about modular forms and functions. For example, the spaces M_{k}(G) and S_{k}(G) are finite-dimensional, and their dimensions can be computed thanks to the Riemann–Roch theorem in terms of the geometry of the G-action on H.^{[7]} For example,
where denotes the floor function and is even.
The modular functions constitute the field of functions of the Riemann surface, and hence form a field of transcendence degree one (over C). If a modular function f is not identically 0, then it can be shown that the number of zeroes of f is equal to the number of poles of f in the closure of the fundamental region R_{Γ}.It can be shown that the field of modular function of level N (N ≥ 1) is generated by the functions j(z) and j(Nz).^{[8]}
The situation can be profitably compared to that which arises in the search for functions on the projective space P(V): in that setting, one would ideally like functions F on the vector space V which are polynomial in the coordinates of v ≠ 0 in V and satisfy the equation F(cv) = F(v) for all non-zero c. Unfortunately, the only such functions are constants. If we allow denominators (rational functions instead of polynomials), we can let F be the ratio of two homogeneous polynomials of the same degree. Alternatively, we can stick with polynomials and loosen the dependence on c, letting F(cv) = c^{k}F(v). The solutions are then the homogeneous polynomials of degree k. On the one hand, these form a finite dimensional vector space for each k, and on the other, if we let k vary, we can find the numerators and denominators for constructing all the rational functions which are really functions on the underlying projective space P(V).
One might ask, since the homogeneous polynomials are not really functions on P(V), what are they, geometrically speaking? The algebro-geometric answer is that they are sections of a sheaf (one could also say a line bundle in this case). The situation with modular forms is precisely analogous.
Modular forms can also be profitably approached from this geometric direction, as sections of line bundles on the moduli space of elliptic curves.
For a subgroup Γ of the SL(2, Z), the ring of modular forms is the graded ring generated by the modular forms of Γ. In other words, if M_{k}(Γ) be the ring of modular forms of weight k, then the ring of modular forms of Γ is the graded ring .
Rings of modular forms of congruence subgroups of SL(2, Z) are finitely generated due to a result of Pierre Deligne and Michael Rapoport. Such rings of modular forms are generated in weight at most 6 and the relations are generated in weight at most 12 when the congruence subgroup has nonzero odd weight modular forms, and the corresponding bounds are 5 and 10 when there are no nonzero odd weight modular forms.
More generally, there are formulas for bounds on the weights of generators of the ring of modular forms and its relations for arbitrary Fuchsian groups.
If f is holomorphic at the cusp (has no pole at q = 0), it is called an entire modular form.
If f is meromorphic but not holomorphic at the cusp, it is called a non-entire modular form. For example, the j-invariant is a non-entire modular form of weight 0, and has a simple pole at i∞.
New forms are a subspace of modular forms^{[9]} of a fixed weight which cannot be constructed from modular forms of lower weights dividing . The other forms are called old forms. These old forms can be constructed using the following observations: if then giving a reverse inclusion of modular forms .
A cusp form is a modular form with a zero constant coefficient in its Fourier series. It is called a cusp form because the form vanishes at all cusps.
There are a number of other usages of the term "modular function", apart from this classical one; for example, in the theory of Haar measures, it is a function Δ(g) determined by the conjugation action.
Maass forms are real-analytic eigenfunctions of the Laplacian but need not be holomorphic. The holomorphic parts of certain weak Maass wave forms turn out to be essentially Ramanujan's mock theta functions. Groups which are not subgroups of SL(2, Z) can be considered.
Hilbert modular forms are functions in n variables, each a complex number in the upper half-plane, satisfying a modular relation for 2×2 matrices with entries in a totally real number field.
Siegel modular forms are associated to larger symplectic groups in the same way in which classical modular forms are associated to SL(2, R); in other words, they are related to abelian varieties in the same sense that classical modular forms (which are sometimes called elliptic modular forms to emphasize the point) are related to elliptic curves.
Jacobi forms are a mixture of modular forms and elliptic functions. Examples of such functions are very classical - the Jacobi theta functions and the Fourier coefficients of Siegel modular forms of genus two - but it is a relatively recent observation that the Jacobi forms have an arithmetic theory very analogous to the usual theory of modular forms.
Automorphic forms extend the notion of modular forms to general Lie groups.
Modular integrals of weight k are meromorphic functions on the upper half plane of moderate growth at infinity which fail to be modular of weight k by a rational function.
Automorphic factors are functions of the form which are used to generalise the modularity relation defining modular forms, so that
The function is called the nebentypus of the modular form. Functions such as the Dedekind eta function, a modular form of weight 1/2, may be encompassed by the theory by allowing automorphic factors.
The theory of modular forms was developed in four periods:
Taniyama and Shimura identified a 1-to-1 matching between certain modular forms and elliptic curves. Robert Langlands built on this idea in the construction of his expansive Langlands program, which has become one of the most far-reaching and consequential research programs in math.
In 1994 Andrew Wiles used modular forms to prove Fermat’s Last Theorem. In 2001 all elliptic curves were proven to be modular over the rational numbers. In 2013 elliptic curves were proven to be modular over real quadratic fields. In 2023 elliptic curves were proven to be modular over about half of imaginary quadratic fields, including fields formed by combining the rational numbers with the square root of integers down to −5. ^{[1]}