Introduction to twin primes and Brun's constant computation

 

(Click here for a Postscript version of this page and here for a pdf version)

1  Introduction

It's a very old fact (Euclid 325-265 B.C., in Book IX of the Elements) that the set of primes is infinite and a much more recent and famous result (by Jacques Hadamard (1865-1963) and Charles-Jean de la Vallee Poussin (1866-1962)) that the density of primes is ruled by the law
p(n) ~  n

log(n)
where the prime counting function p(n) is the number of prime numbers less than a given integer n. This result proved in 1896 is the celebrated prime numbers theorem and was conjectured earlier, in 1792, by young Carl Friedrich Gauss (1777-1855) and by Adrien-Marie Legendre (1752-1833) who studied the repartition of those numbers in published tables of primes.

This approximation may be usefully replaced by the more accurate logarithmic integral Li(n):
p(n) ~ Li(n)= ó
õ
n

2 
 dt

log(t)
.

However among the deeply studied set of primes there is a famous and fascinating subset for which very little is known and has generated some famous conjectures: the twin primes (the term prime pairs was used before [5]).

Definition 1 A couple of primes (p,q) are said to be twins if q=p+2. Except for the couple (2,3), this is clearly the smallest possible distance between two primes.

Example 2 (3,5),(5,7),(11,13),(17,19),(29,31),...,(419,421),... are twin primes.

2  Counting twin primes

As for the set of primes the most natural question is wether the set of twin primes is finite or not. But unlike prime numbers for which numerous and elementary proofs exist [10], the answer to this natural question is still unknown for twin primes ! Today, this problem remains one of the greatest challenge in mathematics and has occupied numbers of mathematicians. Of course, as we will see, there are some empirical and numerical results suggesting an answer and most mathematicians believe that there are infinitely many twin primes.

In 1849, Alphonse de Polignac (1817-1890) made the general conjecture that there are infinitely many primes distant from 2k. The case for which k=1 is the twin primes case.

It's now natural to introduce the twin prime counting function p2(n) which is the number of twin primes smaller than a given n.

2.1  Numerical results

Using huge table of primes (Glaisher 1878 [5], before computer age, enumerated p2(105)) and with intensive computations during modern period (Shanks and Wrench 1974 [12], Brent 1976 [1], Nicely 1996-2002 [9], Sebah 2001-2002 [11], see also [10]) it's possible to compute the exact values of p2(n) for large n and its conjectured approximation 2C2Li2(n) (see next section for the definition).

The following array includes the relative error e (in %) between the approximation and the real value.

n p2(n) 2C2Li2(n) e
10 2 5 150.00
102 8 14 75.00
103 35 46 31.43
104 205 214 4.39
105 1224 1249 2.04
106 8169 8248 0.97
107 58980 58754 -0.38
108 440312 440368 0.013
109 3424506 3425308 0.023
1010 27412679 27411417 -0.0046
1011 224376048 224368865 -0.0032
1012 1870585220 1870559867 -0.0013
1013 15834664872 15834598305 -0.00042
1014 135780321665 135780264894 -0.000042
1015 1177209242304 1177208491861 -0.000064
1016 10304195697298 10304192554496 -0.000031

At present time (2002), Pascal Sebah has reached p2(1016) and his values are confirmed by Thomas Nicely up to p2(4.1015) who used an independent approach and implementation.

2.1.1  Sieving twin primes

Because the most convenient (in fact the only available) way to compute p2(n) is to find all twin primes and just count them, it's of great importance to improve as much as possible such an algorithm. All known methods use variations on the historical Eratosthenes sieve.

In order to accelerate the sieve, a possible idea is to represent integers modulo a base m, so that any integer has the form mk+r with 0 £ r < m. Primes numbers are such as m and r are relatively primes and for any value of m, there are f(m) numbers r which are prime with m (this is the definition of Euler's f totient function).

Modulo 6  

For example modulo 6 all integers have one of the form
6k,6k+1,6k+2,6k+3,6k+4,6k+5
but 6k may not be prime, 6k+2 and 6k+4 are divisible by 2, 6k+3 is divisible by 3, therefore primes (except 2 and 3) must be of the form
6k+1,6k+5
or which is more convenient to sieve twin primes
( 6k+5,6k+7) .

This allows to sieve a proportion of only f(m)/m=2/6 or 33.3% of all the numbers.

Modulo 30  

The same kind of approach modulo 30 gives for candidates
30k+1,30k+7,30k+11,30k+13,30k+17,30k+19,30k+23,30k+29
and here we need to sieve only f(m)/m=8/30 or 26.66% of the integers to find all primes (except 2,3 and 5).

But remember that we are only trying to sieve twin primes hence are left only the candidate couples
( 30k+11,30k+13) ,( 30k+17,30k+19) ,(30k+29,30k+31)
and the proportion drops to 6/30 or 20% of all integers.

This suggest to introduce the function f2(m) which is the number of pairs of integer 0 £ r < m such as r and r+2 are relatively prime with m. We observe from the last two examples that
f2(6)=1,f2(30)=3.


   Example  

Let's illustrate this on a numerical example. The enumeration of twin primes modulo 30 up to 1010 gives, respectively, for each of the 3 previous couples:
9137078,9138015,9137584
twin primes. So that (don't forget to count the two couples (3,5) and (5,7)!)
p2(1010)=9137078+9138015+9137584+2=27412679.

And, for the same couples, up to 1012:
623517830,623557143,623510245
which produces
p2(1012)=623517830+623557143+623510245+2=1870585220.

It's interesting to observe that the contribution to the enumeration of the twin primes of each couple is almost equivalent. This was also observed during all numerical estimations for other modulo like 210, 2310, 30030, ... [11].

This result is well known when enumerating just prime numbers but may be conjectured for twin primes.

Other modulo  

In this table we show the proportion 2f2(m)/m of integer to sieve in order to count twin primes as a function of the modulo m:

m f2(m) %
2 . 50.0
6 1 33.3
30 3 20.0
210 15 14.3
2310 135 11.7
30030 1485 9.9
510510 22275 8.7

The smallest ratios are obtained for values of m which are the product of the first primes (2#=2,3#=2×3,5#=2×3×5,7#=2×3×5×7,...), that is the first values of the primorial # function. For example in [11], the sieves were made modulo 30030 and 510510, therefore less than 10% of the set of integers were considered by the algorithm. In some others implementations sieves modulo 6 or 30 are used.

2.2  Twin prime conjecture

Based on heuristic considerations, a law (the twin prime conjecture) was developed, in 1922, by Godfrey Harold Hardy (1877-1947) and John Edensor Littlewood (1885-1977) to estimate the density of twin primes.

According to the prime number theorem the probability that a number n is prime is about 1/log(n), therefore, if the probability that n+2 is also prime was independent of the probability for n, we should have the approximation
p2(n) ~  n

log2(n)
,
but a more careful analysis shows that this model is too simplified (an argument is given in [6]). In fact we have the following and more accurate conjecture (called conjecture B in[7]).

Conjecture 3 [Twin prime conjecture]For large values of n, the two following equivalent approximations are conjectured
p2(n) ~ 2C2Li2(n)=2C2 ó
õ
n

2 
 dt

log2(t)
(1)
or
p2(n) ~ 2C2  n

log2(n)
(2)

Note that C2 is the twin prime constant and is defined by
C2=
Õ
p ³ 3 
æ
è
 p(p-2)

(p-1)2
ö
ø
=0.6601618158468695739278121100145...

This last constant occurs in some asymptotic estimations involving primes and it's interesting to observe that it may be estimated using properties of the Riemann Zeta function to thousand of digits (Sebah computed it to more than 5000 digits).

Figure 1: p2(n) and 2C2Li2(n)

Remark 4 The function Li2(n) occuring in (1) may be related to the logarithmic integral Li(n) by the trivial relation
Li2(n)=Li(n)+  2

log(2)
-  n

log(n)
.

2.2.1  Generalizations

In fact, Hardy and Littlewood made a more general conjecture on the primes separated by a gap of d. A natural generalization of the twin primes is to search for primes distant of d=2k (which should be infinite for any d according to Polignac's conjecture). The case d=2 is the twin primes set, d=4 forms the cousin primes set, d=6 is the sexy primes set, ...

If we denote pd(n) the number of primes p £ n such as p+d is also prime (observe that here p and p+d may not be consecutive), Hardy-Littlewood's conjecture states (in [7]) that for d ³ 2:
pd(n) ~ 2C2Rd ó
õ
n

2 
 dt

log2(t)
,
with Rd ³ 1 being the rational number
Rd=
Õ
p|d,p > 2 
 p-1

p-2
,       p is prime.

The first values of the function Rd are

d 2 4 6 8 10 12 14 16 18 20
Rd 1 1 2 1 4/3 2 6/5 1 2 4/3

According to this conjecture the density of twin primes is equivalent to the density of cousin primes. For example, the exact computed values up to 1012 are: 

p2(1012) =1870585220
p4(1012) =1870585458, 

which can be compared to the predicted value 1870559867 by the conjecture. Marek Wolf has studied the function
W(n)=p2(n)-p4(n)
and its fractal properties and approximate dimension of 1.48 ([13]).

3  Brun's constant

3.1  From Euler's constant to Brun's constant

Euler's constant  

It's very natural to understand the nature of the harmonic numbers
H(n)=1+  1

2
+  1

3
+  1

4
+...+  1

n
when n becomes large and the sum takes all integers in account. We know since Euler, for instance, that
ê
ê
ê
H(n) ~ log(n)
H(n)-log(n) ~ g
so that the harmonic numbers tends to infinite like log(n). Note that g is Euler's constant and may be evaluated to million of digits.

Mertens' constant  

The next step is to take in account only the primes numbers in the sum that is
P(p)=1+  1

2
+  1

3
+  1

5
+...+  1

p
and we have the beautiful results that
ê
ê
ê
P(p) ~ log( log(p))
P(p)-log( log(p)) ~ M

Therefore the sum diverges (this was also observed by Euler) but at the very low rate log(log(p)) and M is the interesting Mertens' constant which may be evaluated to much less digits than g, say a few thousands.

Brun's Constant  

In the last step we only take in account the twin primes less than p in the sum
B2(p)= æ
è
 1

3
+  1

5
ö
ø
+ æ
è
 1

5
+  1

7
ö
ø
+ æ
è
 1

11
+  1

13
ö
ø
+...
and here comes the remarkable result due to Norwegian Mathematician Viggo Brun (1885-1978) in 1919 [2].

Theorem 5 The sum of the inverse of the twin primes converges to a finite constant B2.

We write this result as

lim
p®¥ 
B2(p)=B2.

Note that this theorem doesn't answer to the question of the infinitude of twin primes, it just says that the limit exists (and may or may not contains a finite number of terms !). The proof is rather complex and based on a majoration of the density of twin primes ; a more modern one may also be found in [8].

Unlike Euler's constant or Mertens' constant, Brun's constant is one of the hardest to evaluate and we are not even sure to know 9 digits of it. By mean of very intensive computations, we only have guaranteed minorations !

3.2  Estimation of Brun's constant

3.2.1  Direct estimation

In the following table we have try to estimate this constant by computing the partial sums B2(p) up to different values of p.

p B2(p) 
102 1.330990365719...
104 1.616893557432...
106 1.710776930804...
108 1.758815621067...
1010 1.787478502719...
1012 1.806592419175...
1014 1.820244968130...
1015 1.825706013240...
1016 1.830484424658...

From this, we observe that the convergence is extremely slow and irregular. If we expect to find even just a few digits, we have to make some assumptions.

3.2.2  Extrapolation

An easy consequence of the twin prime conjecture is that we may write the numbers B2(p) as (see [4] and [9])
B2(p)=B2-  4C2

log(p)
+O æ
è
 1

Öplog(p)
ö
ø
and thanks to this relation, the extrapolated value
B2*(p)=B2(p)+  4C2

log(p)
converges much faster to Brun's constant B2.

Let's take a look to numerical values:

p B2*(p) 
102 1.904399633290...
104 1.903598191217...
106 1.901913353327...
108 1.902167937960...
1010 1.902160356233...
1012 1.902160630437...
1014 1.902160577783...
1015 1.902160582249...
1016 1.902160583104...

which suggest that the value of B2 should be around 1.902160583... (a similar value was first proposed by Nicely after intensive computations and checked later by Sebah, see [9] and [11]).

The relation
B2(p) ~ B2-  4C2

log(p)
invites us to draw the function B2(p)=f( 1/log(p)) (see Figure 2) which should be a line with a negative slop with a value near -4C2 » -2.64064726.

Figure 2: B2(p)=f( [ 1/log(p)])

The intersection of the line with the vertical axis (that is p=¥) is Brun's constant if the twin prime conjecture is valid. And according to this line the direct estimation B2(p) should reach 1.9 not before the value p ~ 10530 which is far beyond any computational project !

4  Twin prime characterization

There is a result from Clement (1949, [3]) which permits to see if a couple (p,p+2) is a twin primes pair. This theorem extends Wilson's famous theorem on prime numbers.

Theorem 6 Let p ³ 3, the integers (p,p+2) form a twin primes pair if and only if
4( (p-1)!+1) º -p mod p(p+2)

Example 7 For p=17, 4( (p-1)!+1) = 83691159552004 º 306 mod 323 and -p º 306 mod 323, therefore (17,19) is a twin prime pair.

The huge value of the factorial makes this theorem of no practical use to find large twin primes.

4.1  Large twin primes

Today, thanks to modern computers, a lot of huge twin primes are known. Many of those primes are of the form k×2n±1 because there are efficient primality testing algorithms for such numbers when k is not too large.

The following theorem due to the French farmer François Proth (1852-1879) may be used.

Theorem 8 [Proth's theorem - 1878]Let N=k.2n+1 with k < 2n, if there is an integer a such as
a(N-1)/2 º -1 mod N
then N is prime.

To help finding large pairs, an idea is to take a value for n and then to start a sieve in order to reduce the set of possible values for the k. It should take a few hours to find twin primes with a few thousands digits.

For example the following numbers are twin prime pairs (some are given from [10]): 
459.28529±1       Dubner, 1993
594501.29999±1
6797727.215328±1       Forbes, 1995
697053813.216352±1       Indlekofer & Janai, 1994
318032361.2107001±1       Underbakke & Carmody & ..., 2001

The last one is a twin primes pair of more than 32000 digits !

References

[1]
R.P. Brent, Tables Concerning Irregularities in the Distribution of Primes and Twin Primes Up to 1011, Math. Comput., (1976), vol. 30, p. 379

[2]
V. Brun, La série 1/5+1/7+1/11+1/13+1/17+1/19+1/29+1/31+1/41+1/43+1/59+1/61+..., où les dénominateurs sont nombres premiers jumeaux est convergente ou finie, Bulletin des sciences mathématiques, (1919), vol. 43, p. 100-104 and p. 124-128

[3]
P.A. Clement, Congruences for sets of primes, American Mathematical Monthly, (1949), vol. 56, p. 23-25

[4]
C.E. Fröberg, On the sum of inverses of primes and twin primes, Nordisk Tidskr. Informationsbehandling (BIT), (1961), vol. 1, p. 15-20.

[5]
J.W.L. Glaisher, An enumeration of prime-pairs, Messenger of Mathematics, (1878), vol. 8, p. 28-33

[6]
G.H. Hardy and E. M. Wright, An Introduction to the Theory of Numbers, Oxford Science Publications, (1979)

[7]
G.H. Hardy and J.E. Littlewood, Some problems of 'Partitio Numerorum' III : On the expression of a number as a sum of primes, Acta Mathematica, (1922), vol. 44, p. 1-70

[8]
W.J. LeVeque, Fundamentals of Number Theory, New York, Dover, (1996)

[9]
T. Nicely, Enumeration to 1014 of the Twin Primes and Brun's Constant, Virginia J. Sci., (1996), vol. 46, p. 195-204

[10]
P. Ribenboim, The new Book of Prime Number Records, Springer, (1996)

[11]
P. Sebah, Counting Twin Primes and estimation of Brun's Constant up to 1016, Computational project at http://numbers.computation.free.fr/Constants/constants.html, (2002)

[12]
D. Shanks and J.W. Wrench, Brun's Constant, (1974), Math. Comput., vol. 28, p. 293-299

[13]
M. Wolf, On the Twin and Cousin primes, (1996), See http://www.ift.uni.wroc.pl/~mwolf/



File translated from TEX by TTH, version 3.01.
On 29 Jul 2002, 14:41.