Pythagoras' Constant :Ö 2

Ö2 = 1.41421356237309504880168872420969807856967187537694...

(Click here for a Postscript version of this page.)

There are certainly people who regard Ö2 as something perfectly obvious but jib at Ö-1. This is because they think they can visualise the former as something in physical space but not the latter. Actually Ö-1 is a much simpler concept.
Edward Charles Titchmarsh (1899-1963).

1 Introduction

The constant Ö2 is famous because it's probably one of the first irrational numbers discovered. According to the Greek philosopher Aristotle (384-322 BC), it was the Pythagoreans around 430 BC who first demonstrated the irrationality of the diagonal of the unit square and this discover was terrible for them because all their system was based on integers and fractions of integers. Later, about 2300 years ago, in Book X of the impressive Elements, Euclid (325-265 BC) showed the irrationality of every nonsquare integer (consult [7] for an introduction to early Greek Mathematics). This number was also studied by the ancient Babylonians. The history of the famous sign Ö goes back up to 1525 in a treatise named Coss where the German mathematician Christoff Rudolff (1499-1545) used a similar sign to represent square roots.

Definition 1 Ö 2 is the unique positive root of the polynomial equation :

x²-2=0.

Theorem 2 Ö 2 is an irrational and algebraic number.

Proof. Suppose that Ö 2=p/q where p and q are relatively primes, then p²=2q² therefore p is even and p=2p^¢ which leads to q²=2p^¢² and to the fact that q=2q^¢. This is in contradiction with p and q being relatively primes.

We will now introduce some of the techniques available to compute this number.

2 Sequences with integers

2.1 Elementary approach

An elementary and ancient algorithm consists in using the double sequence

ì
í
î

p_n+1=p_n+2q_n

q_n+1=p_n+q_n

(1)

which verifies

p_n+1
q_n+1
= p_n/q_n+2
p_n/q_n+1

so that :

lim
n®¥
x_n=
lim
n®¥
p_n
q_n
=Ö2.

It is of interest to note that this quite simple recursion only uses additions with integers and eventually a final division to convert the fraction into the usual decimal representation. Denoting the error by e_n=|Ö2-p_n/q_n|, one easily see that e_n+1 < e_n/5, which involves a geometrical convergence (that is, one digit will be added at every N iterations).

Starting with p₀=q₀=1 we obtain :

ê
ê
ê
ê
ê
ê
ê

x₁=1.(500¼),

x₃=1.41(666...),

x₆=1.4142(011...),

x₉=1.414213(624...),

x₃=17/12 was used by Mesopotamians to replace Ö2. It may be convenient to use a matrix representation of the sequence, we write the sequence like this :

æ
ç
è

p_n+1

q_n+1
ö
÷
ø = æ
ç
è

1
2

1
1
ö
÷
ø æ
ç
è

p_n

q_n
ö
÷
ø

so that

æ
ç
è

p_n+1

q_n+1
ö
÷
ø =Aⁿ æ
ç
è

p₁

q₁
ö
÷
ø = æ
ç
è

1
2

1
1
ö
÷
ø n

æ
ç
è

p₁

q₁
ö
÷
ø .

2.1.1 Quadratic convergence

This matrix representation may be used with n=2^p and thanks to the relation (exercise : show this by recurrence) :

Aⁿ= æ
ç
è

a_n
2b_n

b_n
a_n
ö
÷
ø

comes the following algorithm

ì
í
î

a_p+1=a_p²+2b_p²

b_p+1=2a_pb_p

starting with a₀=b₀=1. It's easy to see that this new process has a quadratic convergence rate and its first iterates are given here :

(3,2),(17,12),(577,408),(665857,470832),(886731088897,627013566048),...

and

ê
ê 886731088897
627013566048
-Ö2 ê
ê < 10^-24.

Note that, in each step of this algorithm, the product of large integers is required and this may be done very efficiently using fast multiplication methods. This method is related to Newton's iteration by observing that :

x_p+1= a_p+1
b_p+1
= a_p²+2b_p²
2a_pb_p
= 1
2
a_p
b_p
+ b_p
a_p
= x_p
2
+ 1
x_p
,

which will be derived by different means later in this essay (see section 6).

2.1.2 Cubic convergence and quintic convergence

The same approach leads to the Cubic sequence

ì
í
î

a_p+1=a_p( a_p²+6b_p²)

b_p+1=b_p( 3a_p²+2b_p²)

starting with a₀=b₀=1 the first iterates are given by :

(7,5),(1393,985),(10812186007,7645370045),...

and also to the Quintic sequence

ì
í
î

a_p+1=a_p( a_p⁴+20a_p²b_p²+20b_p⁴)

b_p+1=b_p( 5a_p⁴+20a_p²b_p²+4b_p⁴)

giving the iterates

(41,29),(1855077841,1311738121),...

We have generated algorithms of this nature at any order of convergence using in the sequence polynomials of higher degrees.

2.2 Modification of the sequence

If we set u_n=p_n+q_n and v_n=q_n then the double sequence (1) is equivalent to this new one :

ì
í
î

u_n+1=2u_n+u_n-1

v_n+1=u_n

and u_n/v_n=u_n/u_n-1=(p_n+q_n)/q_n converge to Ö2+1. So starting with u₀=1,u₁=2 gives the set of u_n:

(1,2,5,12,29,70,169,408,985,2378,5741,13860,33461,80782,195025,470832,...)

those numbers are sometime called Pell numbers and the sequence u_nis a Pell sequence. The first 41 numbers may be found in [10] as well as other similar sequences like Lucas sequences used in Primality tests. For example

u₁₅
u₁₄
-1= 470832
195025
-1=1.4142135623(637...)

and it was found with very little efforts.

This method is probably due to ancient Babylonians.

2.3 Improvement of the sequence

It's possible to improve this method by using a more general sequence :

ì
í
î

p_n+1=(A+B)p_n+2Aq_n

q_n+1=2Bp_n+(A+B)q_n

Easy manipulations show that

lim
n®¥
p_n
q_n
=   æ
Ö

A
B

and that for the error

e_n= ê
ê
ê   æ
Ö

A
B

-x_n ê
ê
ê ,

we have the bound

e_n+1 < e_n æ
ç
è   æ
Ö

A
B

-1 ö
÷
ø 2

.

The more A/B is close to 1 the more the speed of convergence is increased.

Example 3 To illustrate the method let's use the relation

Ö 2= 7
5
æ
Ö

50
49

which corresponds to A=50 and B=49 and the iterative system becomes :

p₁=q₁=1, ì
í
î

p_n+1=99p_n+100q_n

q_n+1=98p_n+99q_n

as for the error, we have

e_n+1 < e_n æ
ç
è æ
Ö

1+ 1
49

-1 ö
÷
ø 2

< e_n/9604.

Here are the first iterates :

ê
ê
ê
ê
ê
ê
ê

x₁=1.4,

x₂=1.414213(197...),

x₃=1.4142135623(637...),

x₄=1.41421356237309(481...)

and almost 4 new digits are added with each step.

3 Continued fraction

3.1 Elementary continued fraction

The idea is to write Ö2=1+1/a₁ with a₁=2.4142135...=2+1/a₂ and so on... This gives the well-known development :

Theorem 4 (Bombelli 1572, [4]).

Ö 2=1+ 1
2+1/2+...

It is more convenient to use the notation

Ö2=[1;2,2,2,2,...].

The development is periodic of period 1 with number {2}. It's a general result : square roots can always be represented with a periodic continued fraction development [5]. It's possible to show that the previous sequence also gives the continued development of Ö2. We give the first rational approximations :

æ
è p_k
q_k
ö
ø

k=1...
= æ
è 3
2
, 7
5
, 17
12
, 41
29
, 99
70
, 239
169
, 577
408
, 1393
985
, 3363
2378
, 8119
5741
, 19601
13860
, 47321
33461
,¼ ö
ø .

3.2 Other continued fractions

Faster converging continued fractions may be obtained, for example :

5Ö2-7=[0;14,14,14,14,...].

The development is periodic of period 1 with number {14}. This time, the first rational approximations are :

æ
è p_k
q_k
ö
ø

k=1...
= æ
è 1
14
, 14
197
, 197
2772
, 2772
39005
, 39005
548842
, 548842
7722793
, 7722793
108667944
,¼ ö
ø

and of course Ö2 » 7/5+p_k/(5q_k), the error is about 1/q_k². Observe that here p_k=q_k-1 which simplifies the evaluation of the approximations.

Other fast converging continued fractions are given by

29Ö2-41=[0;82,82,82,82,...]

and

169Ö2-239=[0;478,478,478,478,...].

It's possible to find interesting continued fractions for numbers of the form mÖ2-n and it's not hard to show that if the integers m and n are such as

n²-2m²=-1

(this is a nonlinear Diophantine equation and it's called a Pell's equation) then

mÖ2-n=[0;2n,2n,2n,2n,...].

From a fundamental result on Pell's equations the solutions are contained in the convergents of the simple continued fraction of Ö2 (there are exactly the convergents of even rank [5] in the case of Ö2). It's also possible to show that there are given by the sequence

ì
í
î

m_k+1=3m_k+2n_k

n_k+1=4m_k+3n_k

starting with (1,1).

Hence the first couples (m_k,n_k) are

(1,1),(5,7),(29,41),(169,239),(985,1393),(5741,8119),(33461,47321),...

Example 5 With the couple (33461,47321) we have the approximation

33461Ö 2-47321 » 1
2.47321

that is

Ö 2 » 47321
33461
+ 1
3166815962
=1.4142135623730950488(369...)

(19 correct digits).

4 Infinite products

The two well-known Infinite products for cosx and sinx are :

Theorem 6 (Euler 1748, [2])

cosx= æ
è 1- 4x²
p²
ö
ø æ
è 1- 4x²
9p²
ö
ø æ
è 1- 4x²
25p²
ö
ø ¼

sinx=x æ
è 1- x²
p²
ö
ø æ
è 1- x²
4p²
ö
ø æ
è 1- x²
9p²
ö
ø ¼

Setting x=p/4 in the cosine product leads to

1
Ö2
= æ
è 1- 1
4
ö
ø æ
è 1- 1
36
ö
ø æ
è 1- 1
100
ö
ø ...

therefore

Ö2= æ
è 2.2
1.3
ö
ø æ
è 6.6
5.7
ö
ø æ
è 10.10
9.11
ö
ø æ
è 14.14
13.15
ö
ø ...=
Õ
k ³ 0
(4k+2)²
(4k+1)(4k+3)
,
(2)

or, which is equivalent, to the aesthetic formula :

Ö2=
Õ
k ³ 0
æ
è 1+ 1
4k+1
ö
ø æ
è 1- 1
4k+3
ö
ø = æ
è 1+ 1
1
ö
ø æ
è 1- 1
3
ö
ø æ
è 1+ 1
5
ö
ø æ
è 1- 1
7
ö
ø ...
(3)

The convergence is extremely slow as we may observe with some iterates :

ê
ê
ê
ê
ê
ê
ê

x₁=1.(333...),

x₁₀=1.4(054...),

x₁₀₀=1.41(332...),

x₁₀₀₀=1.414(125...).

Euler gave the interesting product (2) in 1748 [2], and it's very similar to John Wallis formula for p:

p
4
= æ
è 2.4
3.3
ö
ø æ
è 4.6
5.5
ö
ø æ
è 6.8
7.7
ö
ø æ
è 8.10
9.9
ö
ø ...,

while the product (3) can be compared to the celebrated series

p
4
=1- 1
3
+ 1
5
- 1
7
+ 1
9
-...

With x=p/8 in the infinite product we extract the also slow converging infinite product

2+Ö2=4. æ
è 3.5
4.4
ö
ø 2

æ
è 11.13
12.12
ö
ø 2

æ
è 19.21
20.20
ö
ø 2

æ
è 27.29
28.28
ö
ø 2

...

5 Taylor's expansions

From the Taylor expansion of (1+x)^1/2 or (1-x)^-1/2, it's possible to find another class of algorithms based on series computation.

Theorem 7 (Newton 1665). Let - 1 < x < 1, then

Ö

1+x

=1+ 1
2
x- 1
2.4
x²+ 1.3
2.4.6
x³-¼

1/
Ö

1-x

=1+ 1
2
x+ 1.3
2.4
x²+ 1.3.5
2.4.6
x³+¼

(4)

This theorem was first stated by Isaac Newton (1643-1727) and a correct proof appears later and is due to Leonhard Euler (1707-1783).

Applying the first expansion in (4) which is still valid for x=1 produces the very slowly convergent and alternating series

Ö2=1+ 1
2
- 1
2.4
+ 1.3
2.4.6
- 1.3.5
2.4.6.8
+...,

which first terms are :

ê
ê
ê
ê
ê
ê
ê

x₁=1.(500...),

x₂=1.(375...),

x₃=1.4(375...),

x₄=1.(398...).

A nice improvement is to use the previous results on continuous fractions. We can write :

Ö2= p_k
q_k
æ
Ö

2q_k²
p_k²
,

and for each value of k, this will provide a formula where the number inside the square root of the right hand side of the formula is near 1. When k increases this number tends to 1. Using this remark leads to a sequence of formulae for Ö2, which are more and more efficient :

Ö2= æ
ç
è 3
2
  æ
Ö

8
9

, 7
5
  æ
Ö

50
49

, 17
12
  æ
Ö

288
289

, 41
29
  æ
Ö

1682
1681

, 99
70
  æ
Ö

9800
9801

, 239
169
  æ
Ö

57122
57121

,¼ ö
÷
ø .

Example 8 Using this last formula jointly with Newton's series expansion gives the very fast converging and easy to implement sequence

Ö 2= 239
169
æ
è 1+ 1
2
1
57121
- 1
8
1
57121²
+ 3
48
1
57121³
-¼ ö
ø

for which the first terms are :

ê
ê
ê
ê
ê
ê
ê

x₁=1.4142(011...),

x₂=1.414213562(427...),

x₃=1.41421356237309(457...),

x₄=1.41421356237309504880(687...).

With a very little calculation we have computed 20 digits of Ö 2.

Example 9 (Euler 1755, [4], [8]). The relation Ö 2=(7/5)/Ö (49/50) produces the nice and easy to compute by hand series expansion :

Ö 2= 7
5
æ
è 1+ 1
100
+ 1.3
100.200
+ 1.3.5
100.200.300
+... ö
ø .

The main advantage of using Taylor's formulae is to require only basic multi-precision operations between numbers.

You can download a small clear C program which uses this type of formula with a classical algorithm to compute Ö2 : see the Easy programs for constants computation page at [3].

6 Newton's iteration

A very efficient way to compute square roots is to use the Newton iteration [9] on the function f(x)=x²-2. It gives the following sequence :

x₀=1,

x_n+1=x_n- f(x_n)
f^¢(x_n)
= 1
2
æ
è x_n+ 2
x_n
ö
ø = x_n
2
+ 1
x_n
,

x_n+1 can also be interpreted as the simple mean between the approximation x_n and 2/x_n which is also another approximation.

The first terms of this sequence are :

ê
ê
ê
ê
ê
ê
ê

x₁=1.(500...),

x₂=1.41(666...),

x₃=1.41421(568...),

x₄=1.41421356237(468...).

Number D of correct digits after n iterations with the elementary Newton iteration :

n
1
2
3
4
5
6
7
8
9
10

D
0
2
5
11
24
48
97
195
391
783

It's a well-known result that the rate of convergence of Newton's method and therefore of this sequence is quadratic (the number of digits is doubling at each iteration) for a starting point x₀ sufficiently close to the root of the equation.

Computing Ö2 with this algorithm is convenient if one can compute easily the inverse of x_n. In fact, it is simpler to use only multiplications. This can be reached by using a modified sequence which converges to 1/Ö2 : it consists in using the Newton iteration but this time with the function f(x)=1/x²-2, yielding the sequence

x₀= 1
2
, x_n+1=x_n æ
è 3
2
-x_n² ö
ø =x_n+x_n æ
è 1
2
-x_n² ö
ø .

Observe that in the right hand side relation the increment tends to zero. We give the first iterates :

ê
ê
ê
ê
ê
ê
ê
ê
ê

x₁=0.(625...),

x₂=0.(693...),

x₃=0.70(670...),

x₄=0.707106(444...),

x₅=0.707106781186(307...),

again the convergence is quadratic. A final multiplication by 2 will give the value of Ö2. The advantage of this method is to avoid a multi-precision division at each step and replace it by two multi-precision multiplications. This algorithm is extremely efficient and may be used to compute Ö2 up to billion's of digits.

7 Cubic iteration

7.1 Halley's iteration

Using the second derivative of f(x)=x²-2, it's possible to find a sequence with cubic convergence (the number of digits is multiplied by 3 at each step). The general formula is given by Halley's iteration (the original form was introduced by the astronomer Edmund Halley (1656-1742) in 1694) :

x_n+1=x_n- f(x_n)
f^¢(x_n)
æ
è 1- f(x_n)f^¢¢(x_n)
2f^¢(x_n)²
ö
ø -1

.

Applying this formula with function f(x) gives :

x₀=1, x_n+1=x_n x_n²+6
3x_n²+2
=x_n æ
è 1+2 (2-x_n²)
3x_n²+2
ö
ø .

The first terms of this sequence are :

ê
ê
ê
ê
ê

x₁=1.4(000...),

x₂=1.414213(197...),

x₃=1.414213562373095048(795...).

Number D of correct digits after n iterations with Halley's iteration :

n
1
2
3
4
5
6
7
8
9
10

D
1
6
20
61
185
557
1673
5022
15067
45204

7.2 Another cubic iteration

In [1], an interesting cubic iteration is given which converge to 1/ÖA

x_n+1= x_n
8
(15-10Ax_n²+3A²x_n⁴).

It may be deduced from the general Householder's iteration [6]

x_n+1=x_n- f(x_n)
f^¢(x_n)
æ
è 1+ f(x_n)f^¢¢(x_n)
2f^¢(x_n)²
ö
ø ,

which has cubical convergence for a close enough starting point x₀ (observe the similarity with Halley's iteration).

By applying this general pattern for f(x)=1/x²-A we obtain :

x_n+1=x_n+ x_n
8
( 7-10Ax_n²+3A²x_n⁴) = x_n+ 3A²x_n
8
æ
è x_n²- 1
A
ö
ø æ
è x_n²- 7
3A
ö
ø .

If we explicit this formula with A=2, and starting with x₀=1/2,

x_n+1=x_n+ x_n
8
(7-20x_n²+12x_n⁴)=x_n+ x_n
8
(2x_n²-1)(6x_n²-7),

and, in this iteration, no multiprecision division is required. The first iterates are :

ê
ê
ê
ê
ê

x₁=0.(671...),

x₂=0.70(689...),

x₃=0.7071067811(398...).

Number D of correct digits after n iterations with Householder's cubic algorithm :

n
1
2
3
4
5
6
7
8
9
10

D
0
2
10
30
90
269
808
2425
7276
21829

This method may also be very efficient for high precision computation.

8 Quartic and high order iterations

The direct application of the modified iterations (see the Newton's iteration pages at [3]) on the function f(x)=1/x²-1/2 produces a set of high order algorithms. It's interesting to note that relatively easy algorithms of any order may be derived.

In the following lines we will use the notation

h_n=1- x_n²
2
.

8.1 Quartic iteration

The quartic modified iteration is

x_n+1=x_n+ x_n
16
( 8h_n+6h_n²+5h_n³) .

For example, if we take the initial value x₀=3/2, the first two iterations are

ê
ê
ê

x₁=1.414(123...),

x₂=1.41421356237309(494...).

Number D of correct digits after n iterations with the quartic algorithm :

n
1
2
3
4
5
6
7

D
3
14
63
254
1019
4078
16312

8.2 Quintic iteration

The same method also produces the quintic (fifth-order) modified iteration

x_n+1=x_n+ x_n
128
( 64h_n+h_n²( 48+40h_n+35h_n²) ) ,

and again with the initial value x₀=3/2, the first two iterations are :

ê
ê
ê

x₁=1.4142(236...),

x₂=1.414213562373095048801688(932...).

Number D of correct digits after n iterations with the quintic algorithm :

n
1
2
3
4
5
6

D
4
24
123
615
3076
15380

Algorithms with any order of convergence may also be generated easily (for example : sextic, septic, octic, ... iterations are available !) and we end this section with the octic (eighth-order) iteration :

ì
ï
í
ï
î

d_n=1024h_n+h_n²( 768+640h_n+560h_n²+h_n²(504h_n+h_n²( 462+429h_n) )) ,

x_n+1=x_n+ 1
2048
x_nd_n.

9 Double Iteration

9.1 Quadratic double iterative procedure

During the first years of computer time (EDSAC group at Cambridge University 1951), the following double sequence (easily deduced from Newton's iteration on the inverse of the square root) was proposed to compute square roots :

x₀=A,h₀=A-1,

and

ì
ï
ï
í
ï
ï
î

x_n+1=x_n æ
è 1- 1
2
h_n ö
ø

h_n+1= 1
4
h_n²( h_n-3)

then the sequence h_n tends to 0 and the sequence x_n tends to ÖA (A < 3).

Example 10 Let A=2, therefore

ê
ê
ê
ê
ê
ê
ê
ê
ê

x₁=1.(000...),h₁=-0.5

x₂=1.(250...),h₂=-0.21875...

x₃=1.(386...),h₃=-0.03850...

x₄=1.41(341...),h₄=-0.001126...

x₅=1.41421(288...),h₅=-0.000000951...

and then the convergence becomes quadratic. The number of non quadratic cycles is reduced when A tends to 1. For example, using relation Ö 2=7/5Ö{50/49} so that A=50/49, will lead to a quadratic converge at once.

9.2 Other rates of convergence

Small modifications in the previous procedure may increase the order of convergence, for example the following algorithm has cubic convergence :

x₀=A,h₀=A-1,

and

ì
ï
ï
í
ï
ï
î

x_n+1=x_n æ
è 1- 1
2
h_n+ 3
8
h_n² ö
ø

h_n+1= 1
64
h_n³( 40-15h_n+9h_n²) .

Algorithms at any order of convergence may also be generated.

References

[1]: J.M. Borwein and P.B. Borwein, Pi and the AGM - A study in Analytic Number Theory and Computational Complexity, A Wiley-Interscience Publication, New York, (1987)
[2]: L. Euler, Introduction à l'analyse infinitésimale (french traduction by Labey), Barrois, ainé, Librairie, (original 1748, traduction 1796), vol. 1, p. 89-90
[3]: X. Gourdon and P. Sebah, Numbers, Constants and Computation, Internet site at http://numbers.computation.free.fr/Constants/constants.html.
[4]: E. Hairer and G. Wanner, L'analyse au fil de l'histoire, Bibliothèque Scopos, Springer, (2000)
[5]: G.H. Hardy and E.M. Wright, An Introduction to the Theory of Numbers, Oxford Science Publications, (1979)
[6]: A.S. Householder, The Numerical Treatment of a Single Nonlinear Equation, McGraw-Hill, New York, (1970)
[7]: V.J. Katz, A History of Mathematics-An Introduction, Addison-Wesley, (1998)
[8]: K. Knopp, Theory and application of infinite series, Blackie & Son, London, (1951)
[9]: I. Newton, Methodus fluxionum et serierum infinitarum, (1664-1671)
[10]: P. Ribenboim, The new Book of Prime Number Records, Springer, (1996)

Back to Numbers, Constants and Computation

File translated from T_EX by T_TH, version 3.01.
On 15 Nov 2001, 10:43.