Euclid's theorem is a fundamental statement in number theory that asserts that there are infinitely many prime numbers. It was first proven by Euclid in his work Elements. There are several proofs of the theorem.
Euclid offered a proof published in his work Elements (Book IX, Proposition 20),[1] which is paraphrased here.[2]
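Consider any finite list of prime numbers $p_1, p_2, \ldots, p_n$. It will be shown that there exists at least one additional prime number not included in this list. Let $P$ be the product of all the prime numbers in the list: $P = p_1 p_2 \cdots p_n$. Let $q = P + 1$. Then $q$ is either prime or not:
If $q$ is prime, then there is at least one more prime that is not in the list, namely $q$ itself, since $q$ is larger than every prime in the list.
If $q$ is not prime, then some prime factor $p$ divides $q$. If $p$ were in the list, it would divide $P$; since $p$ also divides $q = P + 1$, it would divide the difference $q - P = 1$. No prime number divides 1, so $p$ cannot be in the list, and at least one more prime exists beyond those in the list.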
This proves that for every finite list of prime numbers there is a prime number not in the list.[4] In the original work, Euclid denoted the arbitrary finite set of prime numbers as A, B, Γ. If taken literally, that would mean just three prime numbers.[5]
Euclid is often erroneously reported to have proved this result by contradiction beginning with the assumption that the finite set initially considered contains all prime numbers,[6] though it is actually a proof by cases, a direct proof method. The philosopher Torkel Franzén, in a book on logic, states, "Euclid's proof that there are infinitely many primes is not an indirect proof [...] The argument is sometimes formulated as an indirect proof by replacing it with the assumption 'Suppose $q_1, \ldots, q_n$ are all the primes'. However, since this assumption isn't even used in the proof, the reformulation is pointless."[7]
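The construction in Euclid's proof can be tried out directly. The following is a minimal sketch, not part of the historical proof: the function names `smallest_prime_factor` and `new_prime_from` are illustrative, and trial division is assumed to be fast enough for the small inputs used here.

```python
from math import prod

def smallest_prime_factor(n: int) -> int:
    """Return the smallest prime factor of n (n >= 2) by trial division."""
    d = 2
    while d * d <= n:
        if n % d == 0:
            return d
        d += 1
    return n  # no divisor up to sqrt(n), so n itself is prime

def new_prime_from(primes: list[int]) -> int:
    """Given a finite list of primes, return a prime that is not in the list.

    Follows Euclid's construction: q = (product of the list) + 1, then take
    any prime factor of q (here, the smallest one).
    """
    q = prod(primes) + 1
    return smallest_prime_factor(q)

print(new_prime_from([2, 3, 5]))             # 31, since 2*3*5 + 1 = 31 is prime
print(new_prime_from([2, 3, 5, 7, 11, 13]))  # 59, since 30031 = 59 * 509
```

The second call shows the "q is not prime" case: 30031 is composite, yet none of its prime factors is in the list.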
Several variations on Euclid's proof exist, including the following:
The factorial n! of a positive integer n is divisible by every integer from 2 to n, as it is the product of all of them. Hence, n! + 1 is not divisible by any of the integers from 2 to n, inclusive (it gives a remainder of 1 when divided by each). Hence n! + 1 is either prime or divisible by a prime larger than n. In either case, for every positive integer n, there is at least one prime bigger than n. The conclusion is that the number of primes is infinite.[8]
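The factorial argument can be checked numerically for small n. This is an illustrative sketch only; `prime_above` is a hypothetical helper name, and trial division limits it to modest values of n.

```python
from math import factorial

def smallest_prime_factor(n: int) -> int:
    """Return the smallest prime factor of n (n >= 2) by trial division."""
    d = 2
    while d * d <= n:
        if n % d == 0:
            return d
        d += 1
    return n

def prime_above(n: int) -> int:
    """Return a prime larger than n, taken as the smallest prime factor of n! + 1."""
    return smallest_prime_factor(factorial(n) + 1)

for n in range(1, 9):
    p = prime_above(n)
    # n! + 1 leaves remainder 1 on division by 2..n, so p must exceed n.
    assert p > n
    print(n, factorial(n) + 1, p)
```

For n = 4 the number 4! + 1 = 25 is not prime, but its prime factor 5 is still larger than 4, as the argument predicts.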
Another proof, by the Swiss mathematician Leonhard Euler, relies on the fundamental theorem of arithmetic: that every positive integer has a unique prime factorization. What Euler wrote (not with this modern notation and, unlike modern standards, not restricting the arguments in sums and products to any finite sets of integers) is equivalent to the statement that we have[9]
$$\prod_{p \in P_k} \frac{1}{1 - \frac{1}{p}} = \sum_{n \in N_k} \frac{1}{n},$$
where $P_k$ denotes the set of the first $k$ prime numbers, and $N_k$ is the set of the positive integers whose prime factors are all in $P_k$.
To show this, one expands each factor in the product as a geometric series, and distributes the product over the sum (this is a special case of the Euler product formula for the Riemann zeta function). Writing $P_k = \{p_1, \ldots, p_k\}$,
$$\begin{aligned}
\prod_{p \in P_k} \frac{1}{1 - \frac{1}{p}}
&= \prod_{p \in P_k} \sum_{i \ge 0} \frac{1}{p^i} \\
&= \left(\sum_{i \ge 0} \frac{1}{p_1^i}\right) \cdot \left(\sum_{i \ge 0} \frac{1}{p_2^i}\right) \cdots \left(\sum_{i \ge 0} \frac{1}{p_k^i}\right) \\
&= \sum_{e_1, \ldots, e_k \ge 0} \frac{1}{p_1^{e_1} p_2^{e_2} \cdots p_k^{e_k}} \\
&= \sum_{n \in N_k} \frac{1}{n}.
\end{aligned}$$
In the penultimate sum, every product of primes in $P_k$ appears exactly once, so the last equality is true by the fundamental theorem of arithmetic. In his first corollary to this result Euler denotes by a symbol similar to $\infty$ the "absolute infinity" and writes that the infinite sum in the statement equals the "value" $\log \infty$, to which the infinite product is thus also equal (in modern terminology this is equivalent to saying that the partial sum up to $x$ of the harmonic series diverges asymptotically like $\log x$). Then in his second corollary, Euler notes that the product
$$\prod_{n \ge 2} \frac{1}{1 - \frac{1}{n^2}}$$
converges to the finite value 2, and there are consequently more primes than squares. This proves Euclid's theorem.[10]
In the same paper (Theorem 19) Euler in fact used the above equality to prove a much stronger theorem that was unknown before him, namely that the series
$$\sum_{p \in P} \frac{1}{p}$$
is divergent, where $P$ denotes the set of all prime numbers (Euler writes that the infinite sum equals $\log \log \infty$, which in modern terminology is equivalent to saying that the partial sum up to $x$ of this series behaves asymptotically like $\log \log x$).
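The identity between the finite product over primes and the sum over integers built from those primes can be explored with exact rational arithmetic. The sketch below is a modern illustration, not Euler's own computation; the helper names and the truncation bound 2000 are arbitrary choices made for the example.

```python
from fractions import Fraction

def euler_product(primes):
    """Exact value of the product of 1 / (1 - 1/p) over the given primes."""
    result = Fraction(1)
    for p in primes:
        result *= Fraction(p, p - 1)  # 1 / (1 - 1/p) == p / (p - 1)
    return result

def is_smooth(n, primes):
    """True if every prime factor of n lies in `primes`."""
    for p in primes:
        while n % p == 0:
            n //= p
    return n == 1

def smooth_partial_sum(primes, bound):
    """Sum of 1/n over n <= bound whose prime factors all lie in `primes`."""
    return sum(Fraction(1, n) for n in range(1, bound + 1) if is_smooth(n, primes))

primes = [2, 3, 5, 7]
product = euler_product(primes)
partial = smooth_partial_sum(primes, 2000)
harmonic_10 = sum(Fraction(1, n) for n in range(1, 11))

print(product)                         # 35/8
print(float(partial), float(product))  # the truncated sum stays below the product
# Every n <= 10 has all its prime factors in {2, 3, 5, 7}, so the harmonic
# partial sum H_10 is a lower bound for the sum, and hence for the product.
assert harmonic_10 <= partial < product
```

Since the harmonic partial sums grow without bound, a lower bound of this kind cannot hold for all cut-offs if the product runs over a fixed finite set of primes that contains every prime.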
Paul Erdős gave a proof[11] that also relies on the fundamental theorem of arithmetic. Every positive integer has a unique factorization into a square-free number $r$ and a square number $s^2$. For example, $75{,}600 = 2^4 \cdot 3^3 \cdot 5^2 \cdot 7^1 = 21 \cdot 60^2$.
Let $N$ be a positive integer, and let $k$ be the number of primes less than or equal to $N$. Call those primes $p_1, \ldots, p_k$. Any positive integer $a$ which is less than or equal to $N$ can then be written in the form
$$a = \left(p_1^{e_1} p_2^{e_2} \cdots p_k^{e_k}\right) s^2,$$
where each $e_i$ is either 0 or 1. There are $2^k$ ways of forming the square-free part of $a$. And $s^2$ can be at most $N$, so $s \le \sqrt{N}$. Thus, at most $2^k \sqrt{N}$ numbers can be written in this form. In other words,
$$N \le 2^k \sqrt{N}.$$
Or, rearranging, $k$, the number of primes less than or equal to $N$, is greater than or equal to $\tfrac{1}{2} \log_2 N$. Since $N$ was arbitrary, $k$ can be as large as desired by choosing $N$ appropriately.
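The square-free decomposition and the resulting lower bound on the number of primes can be checked for small values. This is a rough sketch under the assumption that trial division and a simple sieve suffice; `squarefree_decomposition` and `prime_count` are illustrative names.

```python
from math import isqrt, log2

def squarefree_decomposition(n):
    """Return (r, s) with n == r * s * s and r square-free."""
    r, s, d = n, 1, 2
    while d * d <= r:
        while r % (d * d) == 0:  # strip every factor d^2 out of r
            r //= d * d
            s *= d
        d += 1
    return r, s

def prime_count(n):
    """Number of primes <= n, by a sieve of Eratosthenes."""
    if n < 2:
        return 0
    sieve = bytearray([1]) * (n + 1)
    sieve[0] = sieve[1] = 0
    for i in range(2, isqrt(n) + 1):
        if sieve[i]:
            sieve[i * i :: i] = bytearray(len(range(i * i, n + 1, i)))
    return sum(sieve)

print(squarefree_decomposition(75600))  # (21, 60)

for N in (10, 100, 1000, 10**6):
    k = prime_count(N)
    # Bound from the proof: k >= (1/2) * log2(N)
    assert k >= 0.5 * log2(N)
    print(N, k, 0.5 * log2(N))
```

The bound is very weak compared with the true count, but it is unbounded in N, which is all the proof needs.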
In the 1950s, Hillel Furstenberg introduced a proof by contradiction using point-set topology.[12]
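Define a topology on the integers $\mathbb{Z}$, called the evenly spaced integer topology, by declaring a subset $U \subseteq \mathbb{Z}$ to be an open set if and only if it is either the empty set, $\emptyset$, or it is a union of arithmetic sequences $S(a, b)$ (for $a \ne 0$), where
$$S(a, b) = \{an + b \mid n \in \mathbb{Z}\} = a\mathbb{Z} + b.$$
Then a contradiction follows from the property that a non-empty finite set of integers cannot be open and the property that the basis sets $S(a, b)$ are both open and closed, since
$$\mathbb{Z} \setminus \{-1, +1\} = \bigcup_{p \text{ prime}} S(p, 0)$$
cannot be closed because its complement is finite, but would be closed if there were only finitely many primes, since it would then be a finite union of closed sets.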
Juan Pablo Pinasco has written the following proof.[13]
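Let $p_1, \ldots, p_N$ be the smallest $N$ primes. Then by the inclusion–exclusion principle, the number of positive integers less than or equal to $x$ that are divisible by at least one of those primes is
$$\sum_{i} \left\lfloor \frac{x}{p_i} \right\rfloor - \sum_{i<j} \left\lfloor \frac{x}{p_i p_j} \right\rfloor + \sum_{i<j<k} \left\lfloor \frac{x}{p_i p_j p_k} \right\rfloor - \cdots + (-1)^{N+1} \left\lfloor \frac{x}{p_1 \cdots p_N} \right\rfloor. \qquad (1)$$
Dividing by $x$ and letting $x \to \infty$ gives
$$\sum_{i} \frac{1}{p_i} - \sum_{i<j} \frac{1}{p_i p_j} + \sum_{i<j<k} \frac{1}{p_i p_j p_k} - \cdots + (-1)^{N+1} \frac{1}{p_1 \cdots p_N}. \qquad (2)$$
This can be written as
$$1 - \prod_{i=1}^{N} \left(1 - \frac{1}{p_i}\right). \qquad (3)$$
If no other primes than $p_1, \ldots, p_N$ exist, then every integer greater than 1 is divisible by at least one of them, so the expression in (1) is equal to $\lfloor x \rfloor - 1$ and the expression in (2) is equal to 1. But the expression in (3) is not equal to 1, because each factor $1 - \frac{1}{p_i}$ is positive and so is their product. Therefore, there must be more primes than $p_1, \ldots, p_N$.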
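The inclusion–exclusion count and its limiting density can be computed exactly for a small set of primes. The following sketch uses illustrative names (`count_divisible`, `density`) and compares the formula with a brute-force count; it is a numerical check, not a substitute for the argument.

```python
from fractions import Fraction
from itertools import combinations
from math import prod

def count_divisible(x, primes):
    """Count integers in 1..x divisible by at least one prime, via inclusion-exclusion."""
    total = 0
    for size in range(1, len(primes) + 1):
        sign = 1 if size % 2 == 1 else -1  # odd-sized subsets add, even-sized subtract
        for subset in combinations(primes, size):
            total += sign * (x // prod(subset))
    return total

def density(primes):
    """Limiting proportion of such integers: 1 - prod(1 - 1/p)."""
    return 1 - prod((Fraction(p - 1, p) for p in primes), start=Fraction(1))

primes = [2, 3, 5]
brute = sum(1 for n in range(1, 101) if any(n % p == 0 for p in primes))

print(count_divisible(100, primes), brute)  # 74 74
print(density(primes))                      # 11/15
```

The density 11/15 is strictly less than 1, so a positive proportion of integers is divisible by none of 2, 3, 5; the same holds for any finite list of primes.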
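In 2010, Junho Peter Whang published the following proof by contradiction.[14] Let $k$ be any positive integer. Then according to Legendre's formula (sometimes attributed to de Polignac)
$$k! = \prod_{p \text{ prime}} p^{f(p, k)},$$
where
$$f(p, k) = \left\lfloor \frac{k}{p} \right\rfloor + \left\lfloor \frac{k}{p^2} \right\rfloor + \cdots.$$
In particular,
$$f(p, k) < \frac{k}{p} + \frac{k}{p^2} + \cdots = \frac{k}{p - 1} \le k.$$
But if only finitely many primes exist, then
$$\lim_{k \to \infty} \frac{\left(\prod_{p} p\right)^k}{k!} = 0$$
(the numerator of the fraction would grow singly exponentially while by Stirling's approximation the denominator grows more quickly than singly exponentially), contradicting the fact that for each $k$ the numerator is greater than or equal to the denominator.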
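Legendre's formula for the exponent of a prime in k! can be verified directly for small k. This is a minimal sketch; `legendre_exponent` and `primes_up_to` are names chosen for the example.

```python
from math import factorial, prod

def primes_up_to(n):
    """List of primes <= n by trial division (adequate for small n)."""
    return [m for m in range(2, n + 1)
            if all(m % d != 0 for d in range(2, int(m ** 0.5) + 1))]

def legendre_exponent(p, k):
    """Exponent of the prime p in k!: floor(k/p) + floor(k/p^2) + ..."""
    exponent = 0
    power = p
    while power <= k:
        exponent += k // power
        power *= p
    return exponent

k = 10
exponents = {p: legendre_exponent(p, k) for p in primes_up_to(k)}
print(exponents)  # {2: 8, 3: 4, 5: 2, 7: 1}

# Rebuilding k! from the exponents confirms the factorization.
assert prod(p ** e for p, e in exponents.items()) == factorial(k)
# Each exponent is at most k, the bound used in the proof.
assert all(e <= k for e in exponents.values())
```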
Filip Saidak gave the following proof by construction, which does not use reductio ad absurdum[15] or Euclid's lemma (that if a prime p divides ab then it must divide a or b).
Since each natural number greater than 1 has at least one prime factor, and two successive numbers n and (n + 1) have no factor in common, the product n(n + 1) has more distinct prime factors than the number n itself. So the chain of pronic numbers:
1×2 = 2 {2}, 2×3 = 6 {2, 3}, 6×7 = 42 {2, 3, 7}, 42×43 = 1806 {2, 3, 7, 43}, 1806×1807 = 3263442 {2, 3, 7, 43, 13, 139}, · · ·
provides a sequence of sets of primes that grows without bound.
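The chain of pronic numbers can be generated mechanically. In the sketch below, `saidak_prime_sets` is a hypothetical helper name; it factors only n + 1 at each step, since n and n + 1 share no prime factor, and it relies on trial division, so only the first few steps are practical.

```python
def prime_factors(n):
    """Set of distinct prime factors of n (n >= 2), by trial division."""
    factors = set()
    d = 2
    while d * d <= n:
        while n % d == 0:
            factors.add(d)
            n //= d
        d += 1
    if n > 1:
        factors.add(n)
    return factors

def saidak_prime_sets(steps):
    """Prime-factor sets of the chain 2, 2*3, 6*7, 42*43, ... (first `steps` terms)."""
    n = 2
    primes = prime_factors(n)
    sets = [set(primes)]
    for _ in range(steps - 1):
        # n and n + 1 are coprime, so n + 1 contributes at least one new prime.
        primes = primes | prime_factors(n + 1)
        n = n * (n + 1)
        sets.append(set(primes))
    return sets

for s in saidak_prime_sets(5):
    print(sorted(s))
# [2]
# [2, 3]
# [2, 3, 7]
# [2, 3, 7, 43]
# [2, 3, 7, 13, 43, 139]
```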
Suppose there were only $k$ primes ($p_1, \ldots, p_k$). By the fundamental theorem of arithmetic, any positive integer $n$ could then be represented as
$$n = p_1^{e_1} p_2^{e_2} \cdots p_k^{e_k},$$
where the non-negative integer exponents $e_i$ together with the finite-sized list of primes are enough to reconstruct the number. Since $p_i \ge 2$ for all $i$, it follows that $e_i \le \lg n$ for all $i$ (where $\lg$ denotes the base-2 logarithm). This yields an encoding for $n$ of the following size (using big O notation): $O(\text{prime list size} + k \lg \lg n) = O(\lg \lg n)$ bits. This is a much more efficient encoding than representing $n$ directly in binary, which takes $N = O(\lg n)$ bits. An established result in lossless data compression states that one cannot generally compress $N$ bits of information into fewer than $N$ bits. The representation above violates this by far when $n$ is large enough since $\lg \lg n = o(\lg n)$. Therefore, the number of primes must not be finite.[16]
Romeo Meštrović used an even-odd argument to show that if the number of primes is not infinite then 3 is the largest prime, a contradiction.[17]
Suppose that $p_1 = 2 < p_2 = 3 < p_3 < \cdots < p_k$ are all the prime numbers. Consider $P = 3 p_3 p_4 \cdots p_k$ and note that by assumption all positive integers relatively prime to it are in the set $S = \{1, 2, 2^2, 2^3, \ldots\}$. In particular, $2$ is relatively prime to $P$ and so is $P - 2$. However, this means that $P - 2$ is an odd number in the set $S$, so $P - 2 = 1$, or $P = 3$. This means that $3$ must be the largest prime number, which is a contradiction.
The above proof continues to work if $2$ is replaced by any prime $p_j$ with $j \in \{1, 2, \ldots, k - 1\}$: the product $P$ becomes $p_1 p_2 \cdots p_{j-1} \cdot p_{j+1} \cdots p_k$, and the even vs. odd argument is replaced with a divisible vs. not divisible by $p_j$ argument. The resulting contradiction is that $P - p_j$ must, simultaneously, equal $1$ and be greater than $1$,[a] which is impossible.
The theorems in this section simultaneously imply Euclid's theorem and other results.
Dirichlet's theorem states that for any two positive coprime integers a and d, there are infinitely many primes of the form a + nd, where n is also a positive integer. In other words, there are infinitely many primes that are congruent to a modulo d.
Let $\pi(x)$ be the prime-counting function that gives the number of primes less than or equal to $x$, for any real number $x$. The prime number theorem then states that $x / \log x$ is a good approximation to $\pi(x)$, in the sense that the limit of the quotient of the two functions $\pi(x)$ and $x / \log x$ as $x$ increases without bound is 1:
$$\lim_{x \to \infty} \frac{\pi(x)}{x / \log x} = 1.$$
Using asymptotic notation this result can be restated as
$$\pi(x) \sim \frac{x}{\log x}.$$
This yields Euclid's theorem, since
$$\lim_{x \to \infty} \frac{x}{\log x} = \infty.$$
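The quality of the approximation can be observed numerically. The sketch below counts primes with a sieve (the helper name `prime_count` is an assumption of the example) and prints the ratio of π(x) to x / log x for a few powers of ten; it illustrates the trend only and proves nothing about the limit.

```python
from math import isqrt, log

def prime_count(n):
    """Number of primes <= n, by a sieve of Eratosthenes."""
    if n < 2:
        return 0
    sieve = bytearray([1]) * (n + 1)
    sieve[0] = sieve[1] = 0
    for i in range(2, isqrt(n) + 1):
        if sieve[i]:
            sieve[i * i :: i] = bytearray(len(range(i * i, n + 1, i)))
    return sum(sieve)

for exponent in range(2, 7):
    x = 10 ** exponent
    count = prime_count(x)
    ratio = count / (x / log(x))  # log is the natural logarithm
    print(f"x = 10^{exponent}: pi(x) = {count}, ratio = {ratio:.4f}")
```

The ratio drifts slowly towards 1 as x grows, which is consistent with the theorem; the convergence is known to be slow.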
In number theory, Bertrand's postulate is a theorem stating that for any integer $n > 1$, there always exists at least one prime number $p$ such that $n < p < 2n$. Equivalently, writing $\pi(x)$ for the prime-counting function (the number of primes less than or equal to $x$), the theorem asserts that $\pi(x) - \pi(\tfrac{x}{2}) \ge 1$ for all $x \ge 2$.
This statement was first conjectured in 1845 by Joseph Bertrand[18] (1822–1900). Bertrand himself verified his statement for all numbers in the interval $[2, 3 \times 10^6]$. His conjecture was completely proved by Chebyshev (1821–1894) in 1852[19] and so the postulate is also called the Bertrand–Chebyshev theorem or Chebyshev's theorem.
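Bertrand's postulate, in its usual form (for every integer n > 1 there is a prime p with n < p < 2n), is easy to test for small n. The following sketch is only a finite check in the spirit of Bertrand's own verification; the function names are illustrative and the upper limit 2000 is arbitrary.

```python
def is_prime(n):
    """Primality test by trial division."""
    if n < 2:
        return False
    d = 2
    while d * d <= n:
        if n % d == 0:
            return False
        d += 1
    return True

def bertrand_prime(n):
    """Return the smallest prime p with n < p < 2n, or None if there is none."""
    for p in range(n + 1, 2 * n):
        if is_prime(p):
            return p
    return None

print([bertrand_prime(n) for n in range(2, 11)])
# [3, 5, 5, 7, 7, 11, 11, 11, 11]

# Finite check of the statement for 2 <= n < 2000.
assert all(bertrand_prime(n) is not None for n in range(2, 2000))
```

Because each interval (n, 2n) contains a prime, repeatedly doubling n produces an unending supply of distinct primes, which is how the theorem implies Euclid's.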