Pure Mathematics. Volume 1 - Parsonson S.L.

8. Finite series and the binomial theorem
$18. Polynomials and partial fractions$
21. Quadratic equations and quadratic functions
22. The parabola and rectangular hyperbola
24. Vector products and their applications
25. Continuous probability distributions
Author: Parsonson S.L.
Tags: mathematics
ISBN: 0-521-07683-8
Year: 1970
Similar
Engineering Mathematics
Handbook of Mathematics
Mathematical Techniques: An Introduction for the Engineering, Physical, and Mathematical Sciences
Text
                    LUME 1
•
CAMBRIDGE UNIVERSITY PRESS

PURE
MATHEMATICS
1
S. L. PARSONSON
CAMBRIDGE UNIVERSITY PRESS
CAMBRIDGE
LONDON · NEW YORK · MELBOURNE

Published by the Syndics of the Cambridge University Press
The Pitt Building, Trumpington Street, Cambridge CB2 1RP
Bentley House, 200 Euston Road, London NW1 2DB
32 East 57th Street, New York, NY 10022, USA
296 Beaconsfield Parade, Middle Park, Melbourne 3206, Australia
© Cambridge University Press 1970
Library of Congress catalogue card number: 70-100026
ISBN: 0 521 07683 8
First published 1970
Reprinted 1972 1974 1975 1977
PRINTED IN GREAT BRITAIN

Contents
Preface page ν
1 Numbers and inequalities
2 Vectors and vector geometry
3 Coordinates
4 Polynomials
5 Functions and inequalities
6 The trigonometric functions
7 Probability in finite outcome spaces
8 Finite series and the binomial theorem
Revision exercise A
9 Mathematical induction
10 Expectation
11 Further vectors
12 Further trigonometry
13 Matrices 1
14 Matrices 2
15 Linear equations
16 Discrete probability distributions
Revision exercise В
Bibliography
Answers
Index
1
24
45
68
78
87
103
130
151
155
169
189
215
238
258
281
301
328
339
343
375

г?

Preface
The present book is the first of a two volume course covering those parts
of modern Ά' level pure mathematics syllabuses not normally included
in standard calculus texts. The reason for omitting calculus is simply to
keep the book within manageable proportions and it is felt that there are a
number of excellent modern calculus books available. Although no formal
calculus is done here, it is expected that a student will be studying calculus
concurrently with this text; thus, for example, it is assumed in Chapter 5
that the reader can differentiate simple algebraic expressions and sketch
the graphs of rational functions. In Chapter 16, the exponential function
is used.
It is hoped that the order of presentation of topics in this book will offer
an effective teaching programme, but variations can be made at the
discretion of the teacher. Thus, for example, the chapters on probability can
be deferred while the two chapters on trigonometry could be taken in
conjunction. Again certain chapters contain work which might be deemed
suitable for a second reading; for example, Sections 4 and 5 of Chapter 10.
In some ways Chapter 1 offers the most difficult problem of presentation
in the whole book: it is necessary later to be able to refer to rational and
irrational numbers, and yet to devote too much time in the early stages
to such topics may not be desirable. The author hopes that he has found
an acceptable compromise but some might yet find parts of Chapter 1 too
formal, in which case they are strongly advised to leave the detailed study
of Sections 2, 3 and 4 for a second reading.
To learn mathematics, constant practice is necessary, some of it
repetitive. The book is therefore liberally supplied with exercises for the student.
Most of the questions represent fairly straightforward applications of the
bookwork, although in the Miscellaneous Exercises, included at the end
of all but two of the chapters, will be found some rather more searching
questions. Furthermore, questions marked Ex. occur in the text; it is
hoped that most, if not all, of these will be attempted by the pupil as he
covers the associated bookwork—certainly those marked with an asterisk
should be regarded as obligatory.
The syllabuses for M.E.I. Ά' level, the Joint Matriculation Board
syllabus in Mathematics (Advanced) and Further Mathematics (Advanced)
and the University of London Revised Syllabus in Mathematics (Advanced)

PREFACE
have been particularly kept in mind in the writing of this book and the
planning of the next. Other boards are in the process of devising new
syllabuses and draft copies published by them indicate that this book will
probably cover the necessary work. The School Mathematics Project have
produced their own text-books but the present book may be used to
supplement these if desired.
It is a pleasure to record my thanks to Mr M. J. Rawlinson who read
part of the text and made a number of valuable suggestions; to Mr A. J.
Moakes who read the entire book and whose detailed criticisms have done
much to remove obscurities and improve the presentation; and to my wife,
who also read the entire book and who lent invaluable assistance in
checking answers. I am grateful for permission to reprint examination questions
from the following Boards: Oxford and Cambridge (O&C, M.E.I.,S.M.P.),
the Joint Matriculation Board (J.M.B.), the Cambridge Local Examination
Syndicate (Cambridge), University of London (London), and the Oxford
Delegacy of Local Examinations (Oxford); also to the Clarendon Press
for permission to use Oxford Scholarship questions (O.S.) and to the
Cambridge University Press and University Registry for the use of
Cambridge Scholarship (C.S.) and Mathematical Tripos (M.T.) questions.
S.L.P.

ι. Numbers and inequalities
1. THE INTEGERS AND THEIR
REPRESENTATION
The process of counting is deeply rooted in the history of mankind and its
development is obscure. Many systems have been evolved but the Arabic
(originally Hindu) which we now employ is almost universal.
The entities used for counting are the whole numbers, от positive integers,
and zero. Ten ciphers are used to represent the whole numbers: 0, 1,2, 3,4,
5, 6, 7, 8, 9. The introduction of the cipher zero was an advance of
considerable significance and it gave to the Arabic system its great flexibility
and versatility by making possible a system of place value recording;
place value calculation (by the abacus) had already been in use for a long
time. A base of ten is most frequently used for continued counting; thus,
the one hundred and forty second integer is written 142, which stands for
Ixl02+4xl0 + 2xl.
The base, 10, of this method of enumeration has no special significance
and any other number would do as well. Indeed, were ten not so deeply
rooted in us for physical reasons, other bases would no doubt be
preferable; for example, eight (23) or twelve (22 χ 3). Had primitive man ignored
his thumbs we could well have inherited a more efficient system.
Other bases have been used. A base which has acquired special
significance recently is the number two. Counting from 1, with 2 as base, the
first eight integers are written: 1,10,11,100, 101, 110, 111, 1000. The value
of this, the binary system, is clear: it requires only two ciphers, 0 and 1.
It is thus eminently suitable for recording numbers in two-state systems as,
for example, in electronic digital computers. Conversion from one base
to another is easily effected. For example
75 (written in the scale of ten, or the denary scale)
= ^гчохгнохгн^гчохгч^г^!
= 1001011 (written in the binary scale).
The same result may be arrived at more quickly by continued

NUMBERS AND INEQUALITIES [1
division by 2, the remainders recorded being the required digits (in
reverse order)
75
37
18
9
4
2
1
0
1
1
0
1
0
0
1
Ex. 1. Explain why the method of
tinued division works.
le scale to another by con-
Ex. 2. Evaluate 11011 χ 1011 in the binary scale. Perform several other
multiplications of this type and check your answers by conversion to the denary scale.
Ex. 3. Evaluate 1110011ч-11011 in the binary scale. Perform several other
divisions of this type and check your answers by conversion to the denary scale.
Exercise 1 (a)
Solve the following equation for x, working in the binary scale throughout:
(i) 101x+ll = 1101;
(ii) llx-11111 = 1011;
10x + lt
(iii)
(iv) ll(10x-101)
111;
= 10x+101.
2. Express the denary number 275 in the scales of 2, 3 and 12. (For the last
part, you will need to supplement the digits 0,1,..., 9 by t = ten and e = eleven.
Why?)
3. With the notation of Question 2, evaluate 4te χ19ί in the scale of 12.
4. With the notation of Question 2, evaluate e89t + 2ee in the scale of 12.
η from the binary form 1100110111
5. Explain the following method of
to denary form 823
1
1
100
4
110
6
111
7
512 + 256 + 48 + 7 =
Octal
Denary
6. Show that 1331 is a perfect cube whatever the base *, provided that* is greater
than3.

1] THE INTEGERS
7. The number χ lies between 100 and 999 (denary scale) and the number у is
formed by writing the digits of χ in the reverse order. Prove that χ ~ у is divisible
by 99, where χ ~ у means the difference between χ and y.
8. Show that the difference between any number with four digits (in the denary
scale) and the number with these digits reversed is divisible by 9, and that, if the
two middle digits are the same, the difference is also divisible by 37.
2. THE INTEGERS (CONTINUED)
We have already tacitly assumed that the positive integers obey certain
laws of combination, which are summarized below. These laws are seen
to hold irrespective of the base employed for representing the numbers:
the integer seven remains the integer seven, whether it be written 7 or 111
(binary) or 21 (ternary); consistency is all that is demanded.
(i) If a = b, then a + c = b + c and ac = be.
(ii) The commutative laws: a + b = b+a; ab = ba.
(iii) The associative laws: a + (b + c) = (a+b) + c;a(bc) = (ab) с
(iv) The distributive law: a(b + c) = ab + ac.
(v) The additive and multiplicative identity laws:
a + 0 = a; a\ = a.
Considerably later historically, zero and the positive integers were
augmented by the negative integers; the word 'integer' will in future be
taken to mean a positive or negative integer, together with zero. To enable
negative integers to be combined, a further rule is required:
(vi) The additive inverse law: a + (-a) = 0.
Example 1. Prove, using only the laws (i)—(vi) above, that ( — a)(—b) = ab.
First observe that
aa + aO = a(a + 0) by (iv)
= aa by (v)
= aa + 0 by (v),
a0 = 0
on adding - aa to both sides and using (ii) and (i). Thus, any integer
multiplied by zero gives the answer zero.
Next, consider
ab + [a(-b) + (-a)(-b)] = a[b + (-b)] + (-a) (-b) by (iii), (iv)
= a0 + (-a)(-b) by(vi);
= 0 + (-a)(-b) by result
proved above;
= (-a)(-b) by(v).
3

NUMBERS AND INEQUALITIES 1]
But
ab + [a(-b) + (-a)(-b)] = ab + [a + (-a)](-b)
= ab + 0(-b)
= ab + O
= ab
Thus ab = (-a)(-b).
Laws (i)—(vi) above supply almost all the apparatus required for the
manipulation of integers. However, as Example 2 below shows, they need
to be supplemented by one more law:
(vii) The cancellation law: ab = ac^b = c, provided α φ 0. (The
sign => is read as' implies'; <= means' is implied by' and о means' implies
and is implied by'.)
In this section we have given only a brief survey of the logical structure
of the integers. The reader interested in acquiring a deeper understanding
of this topic and, indeed, of the other topics mentioned in this chapter
should consult one of the books mentioned in the Bibliography at the end
of the book.
Example 2. Arithmetic modulo 12 is defined as follows: any two of the integers
0, 1, 2, ..., 11 are added or multiplied together and the answer is taken to be
the remainder on dividing the sum or product by 12. Thus
8 + 6 ξ 2 (mod 12); 4x11=8 (mod 12);
and we also write 4-6 = 10 (mod 12).
Show that arithmetic modulo 12 satisfies laws (i)—(vi) above but that the
equation 4χ ^ 4 ^ u)
does not have a unique solution (and so law (vii) does not hold).
If a + b = I2c + r, then b + a = 12c+ r and the commutative rule holds.
All the other laws may similarly be verified.
But χ = 1, 4, 7, 10 all satisfy the equation Ax = 4 (mod 12).
Ex. 4. Prove that -(a-b) =-a + b. (a-bmeans a + (-b).)
Ex. 5. Prove that, if ax = a, then χ = 1, provided я Ф 0. (Notice carefully
what you have to prove; it is not sufficient merely to verify that χ = 1 satisfies
ax = a.)
Ex. 6. Show that arithmetic modulo 11 does not suffer from the same defect as
that exhibited by arithmetic modulo 12 in Example 2. Suggest why this is so.
4
by (ii) and
(iv);
by (vi);
by result
proved above;
by (v).

3] RATIONAL NUMBERS
3. RATIONAL NUMBERS
As soon as numbers began to be applied to problems more complex than
the mere counting of objects, the value of subdividing the interval between
two integers must have become apparent. For example, when a unit of
length was defined, lengths must have been met that were not an exact
integral number of units. Thus the concept of a fraction or, as we shall
prefer to call it, a rational number, was evolved.
In order to emphasize the fact that we are now dealing with a new type
of number, we shall avoid the familiar fractional form for rational numbers
at first, and instead we shall define a rational number as a pair of integers
written in a definite order thus: (p, q), where ?φ0. (We may call this an
ordered pair of integers: (2, 3) and (3, 2) represent different rational
numbers. See Chapter 4.)
(In what follows it may help the reader to see what is happening if he
bears in mind that our aim is to demonstrate that the rational number
(p, q) is what he would call p/q.)
Two rational numbers (ръ qj and (p2, q2) are said to be equal if
Р1Я2-Р2Я1 = 0; if Pil2-P2<li + 0> they are said to be unequal.
Ex. 7. Is it true that (я, *) Φ (*, a)?
Ex. 8. Prove that (t, 2), (2,4), (13, 26) are all equal. Prove more generally that
O, q) = (kp, kg), provided HO.
Since rational numbers are newly defined objects, rules for adding,
subtracting, multiplying and dividing them must be given, for these
operations have so far only been applied to integers. The rules are
Ol,<?l) + <>2,<?z) = Ol<?2+/>2<?l,<?l<?2); Ol,<?l)-0>2,<?2) = Ol<?2 ~ Pi4l, 4l4^\
Oi, <?i) x O2, <?2) = (.PiPz, <?i<?J; (Pi, 1i) + O2, <?2) = (PilbPrfi),
provided p2 φ 0, otherwise the left-hand side is undefined.
Example 3. Verify that the distributive law holds for rational numbers.
We have to show that
Oi> ?i) x [O2, Яд + Оз> 4s)] = Οι, 1i) x O2, Ч2) + (Pi, ?i) x (РаЯа),
L.H.S. =(A,?i)x(ft?8+/W2,fc?a)
5

NUMBERS AND INEQUALITIES [1
R.H.S. = (ft, fi) x (p2, q2) + (p1, qt) x (рв, ft)
= (AA,iift) + (Aft,iiia)
= {PiP24i4i+PiPi4i42,4\4i4i)
= (Р1Р2Ч3 +Р1Р3Ч2, ЧгЧгЧз), by Ex. 8, since
ϊι*0.
In a similar way, all the rules for combining integers may be verified to
hold also for rational numbers.
Ex. 9. Verify the laws (i)-(vii) for rational numbers.
Ex. 10. Verify that, iftherulefor division is ignored, rational numbers of theform
(p, 1) have properties identical to those possessed by the integers.
Ex. 11. Verify that, if (a, 1) χ (χ, 1) = (b, 1), then (*, 1) = (b, a), provided α φ 0.
Ex. 10 shows us that we may identify the integers with rational
numbers of the form (p, 1); Ex. 11 then shows us that the rational number
(b, a), when multiplied by the integer (a, 1) gives the integer (b, 1). In
more familiar language (b, a) has just the properties we associate with
the 'number' bja. We may thus regard a rational number as 'the quotient
of two integers'.
It thus follows that the statement 'л: is a rational number' is equivalent
to the statement 'integersρ and q may be found such that χ = pjq'.
The reader will be familiar with the process of expressing a rational
number as a decimal. For example,
^ = 0.024 = 0x1 + 2 + ^ + 4x^3.
Conversely, a terminating decimal can always be expressed as a rational
number in the form pjq; for example
ο·ΐ75 = -jWo- = -:-δ.
However, the terms 'rational number' and 'terminating decimal'
are not synonymous. Thus, f is certainly a rational number, but
f = 1-142857142857142857...
and the process of division cannot be brought to an exact conclusion.
However, this expression does recur; the reader will probably be familiar
with the notation. & = ,.{49057
In fact, any rational number may be expressed either as a terminating or
a recurring decimal, and conversely, any terminating or recurring decimal
represents a rational number. The proof of the italicized part of this state-
6

3] RATIONAL NUMBERS
ment depends upon the notion of a limit (see Chapter 8; geometric
sequences) but good grounds for believing it are given in the following
particular example.
Example 4. Express 2-378 as a rational number in standard form.
Let* = 2-378; then 100ο* = 2378-378
χ = 2-378
999л; = 2376 by subtraction
Ex. 12. Express the following recurring decimals as fractions:
(i) 0-7, (ii) 0-402, (iii) 6-28.
Ex. 13. Using only the digits 0 and 1 we may express any rational number as a
bicimal. For example
1101101 = lx2+l+0x^+lxi + lxi + 0xi + lxi.
Express as bicimals the decimals 3-75, 0-703125, 4-6, 0-82.
[Hint: see Ex. 1 and substitute repeated multiplication for repeated division.]
Ex. 14. Compare 0-9 (decimal) with 0-1 (bicimal).
Ex. 15. Express 0-01 and l-lOl (both bicimals) as decimals.
4. IRRATIONAL NUMBERS
It is useful to depict the integers and rational numbers as points on a
straight line. In Figure 1.1, an origin О is taken, representing the number
zero, and equal intervals are measured to the right, the end-points
representing 1, 2, 3, ... and to the left representing - 1, -2, -3, .... The
position of a point representing a rational number may be defined quite simply;
Fig. 1.1
for example, the point representing 3-28 is obtained by sub-dividing the
interval between 3 and 4 equally into 100 divisions and marking the end-
point of the twenty-eighth division. We may thus associate with the rational
numbers a definite ordering: л:is greater than у (written χ > у) if the point
representing χ lies to the right of that representing y.

NUMBERS AND INEQUALITIES [1
All rational numbers may thus be represented by points on the line and,
furthermore, however close together two points representing rational
numbers may be there will always be another point representing a rational
number lying between them. For example, between the points representing
3-286 and 3-287 lies the point representing 3-2865.
Ex. 16. Show that Цх+у) always lies between χ and y.
Our last remark shows that, however close together we choose two
rational numbers, we can always fit another rational number in between
them; surprisingly, however, we can never succeed in 'filling the line up'
with points representing rational numbers. In Figure 1.2 an isosceles
right-angled triangle OPQ has been drawn in which OQ = QP = 1.
The theorem of Pythagoras tells us that, if OP represents the number x,
then x2 = 2; our next example shows that Ρ does not represent a rational
number.
Q
Fig. 1.2
Example 5. Prove that J2 is not a rational number.
We have to prove that ^2 cannot be expressed in the form pjq where
p, q are integers. Our method of proof will consist in assuming that
72 can be expressed in the form pjq and showing that this leads to a
contradiction.
Suppose that <J2 = pjq, where p, q have no common factors.
Then p2 = 2q2; but, if the square of an integer is even, the integer itself
must be even, and so ρ = 2r where r is an integer
.*. 4r2 = 2q2 and so 2r2 = q2.
By the same token q must be even.
Thus, p, q must both be even, contradicting our initial assumption.
Numbers such as J2 which are not rational are called irrational. Since
rational numbers are represented by terminating or recurring decimals,
the decimal representation {if it exists) of an irrational number is non-
terminating and non-recurring.
Irrational numbers obey the same laws of combination as rational
numbers. The proof of this lies beyond the scope of this book, as it requires
a systematic definition of irrational numbers in terms of the rationale;

41
IRRATIONAL NUMBERS
we shall be content to assume this result, which is intuitively fairly obvious
if we consider rational (terminating decimal) approximations to irrational
numbers.
Exercise 1 (b)
1. Where does the proof of Example 5 break down if you try to prove λ/4 is
irrational?
2. By assuming that $2 = p/q, where p, q are integers cancelled down into their
lowest form, show that ξ/2 is irrational.
3. Prove that V3 is irrational.
4. Express as (i) a decimal, (ii) a bicimal, the fraction i (denary notation). If a
decimal recurs, does it bicimal representation necessarily recur ?
5. Prove that, if a rational number of the form p/q (cancelled down into its
lowest form) is expanded as a recurring decimal, the length of the block of digits
that recurs is less than q.
6. Which of the following statements are always true and which are not? If a
statement is true, prove it; if it may be false, give a counter-example (that is, an
example that illustrates its falsity):
(i) rational + rational = rational;
(ii) rational + irrational = irrational;
(iii) irrational+irrational = irrational;
(iv) rational χ rational = rational;
(v) rational χ irrational = irrational;
(vi) irrational χ irrational = irrational.
7. If a and * are integers and V* is irrational, prove that (a + V*)3 is irrational.
(The results of Question 6 may be used in this question.)
8. Prove that it is always possible to find an irrational number that lies between
two given rational numbers a and b.
9. lipi,p2,p3 are three unequal prime numbers, prove that
is a rational number, but not an integer.
Can you generalize this result?
10. You are given a ruler with only integral units of length marked on it. Show
how to construct, geometrically, the following lengths:
(i) V3; (ii) V7; Ου) VC + V2).
11. Prove that, if к is an integer and *jk is rational, then Jk is an integer. Show
further that, for к > 1, *j(k2-\) is irrational and deduce that, for к > 1,
V(fc- 1) + V№ +1) is irrational.
9

NUMBERS AND INEQUALITIES [1
5. SURDS
Although the existence of irrational numbers is mathematically significant,
from the point of view of practical arithmetic all numbers may be regarded
as rational, since any irrational number may be approximated to by a
terminating decimal. Indeed, wherever measurements are concerned, the
answers obtained must necessarily be in the form of rational numbers,
since every measuring device must eventually reach the limits of its possible
accuracy. However, just as i is simpler to handle than its decimal
approximation 0-3333, so V2 is often easier to deal with than its approximation
1-414.
An expression involving only rational numbers and their roots (not
necessarily square roots) is called a surd; thus surds form a class of
irrational numbers, though there are irrational numbers, such as π, which
cannot be expressed as surds.
Examples of surds are:
^ ft- {^2+lt\)·' ^3+V2·
Note: if χ is positive, *tyx = xllm means the positive wth root of x; by
convention, the 2 is omitted for square roots, e.g. J9 = 3. Again, 2гЦ(—х)
has no meaning, while
2m+\l(-x) = (-x)1/<2m+« = -{xfi^+v.
A great number of complicated surds may be simplified using the three
results: , ,, ,4
&№) = №·. <®β=ψ·>
(iii) Ца + ф) (Va~V&) = a~b-
Example 6. Simplify the expression V128-V32-V8 + V2·
V128-V32-V8 + V2 = V(26.2)-V(24.2)-V(22.2) + V2
= 2V2-2V2-2V2 + V2
= 8V2-4V2-2V2 + V2
= 3V2.
The process of removing surds from the denominator of surd
expressions is known as rationalizing the denominator.
Conventionally, a surd is usually expressed with its denominator
rationalized.
10

41
Example 7. Rationalize the denominators of the following surds:
... /3 .... 1 ..... J2 + 1
'л IV (11) V5W2: (111) узТ72"
УЗ_УЗУ5 = У15
= У5~У5У5~ 5 ;
__1_ 1 /V5 + V2\ V5 + V2.
u' V5-V2 V5-V2W5 + V2/ 3 '
V2+1 / V2 + l\ Л/3-У2У = (V2+1)(V3-V2)
MV V3+V2 W3 + V2/W3-V2/ 3-2
= V6 + V3-V2-2.
Ex. 17. V45 = 3V5. Simplify in a similar fashion the surds: (i) У8; (ii) У18;
(iii) V54; (iv) V250; (v) V5292.
Ex. 18. Simplify the following surd expressions,
(i) (V3+2V2)2; (ii) V216-V150 + V24; (iii) V75 + V147-V300;
(iv) (V32+V50-V98)2.
Ex. 19. Rationalize the denominators of the following expressions:
... 1 .... V2+1 ..... 1 ,· ч V3 , ч 2-V3
0)7з; Οΰ^-; (ш)2=72: Ov)^; W^.
We conclude this section with some further examples of manipulation
of surd quantities and the use of the у sign.
ExampleS. Simplify:
1 = 1-(V3-V2)
w 1+V3-V2 1-(V3-V2)2
1-V3 + V2
l-(5-2V6)
_1+V2-V3
2(V6-2)
_(l+V2-V3)(V6+2)
2(6-4)
_ У6+2+У12+2У2-У18-2УЗ
4
= 2 + уб-У2
4
(ii) Suppose 14-4У6 = {^a-l^jbf.
Then 14-4У6 = (a + 4b)-4J(ab).

NUMBERS AND INEQUALITIES [1
These two expressions will be equal if
a + Ab = 14,
ab = 6.
By inspection (or by solution of the simultaneous quadratic equations)
a = 2, b = 3.
ThuS V(!4-4V6) = 2V3-V2-
(Notice that we must choose the positive square root.)
Example 9. Solve the equation
V(3x + 4)-V(* + 2) = 2.
(i) Squaring both sides,
(3x+4) + (x + 2)-2V(3x2 + 10x + 8) = 4.
(ii) Collecting terms and dividing both sides by 2,
2x +1 = V(3x2 + l(h: + 8).
(iii) Squaring both sides and collecting terms,
x2- 6x- 7 = 0.
This gives χ = 7 or χ = — 1. However, we must check both of these
solutions in the original equation since, after step (iii), we could equally
well have been solving the equation
V(3x+4) + V(* + 2) = 2. (Why?)
Inspection shows us that only χ = 7 is a valid root of the original
equation.
Exercise 1 (c)
1. Express in the form a^b, where b has no perfect squares as factors
(i) V50; (ii) V363; (iii) V2400; (iv) V192; (v) V1452.
2. Simplify the following surds:
(i)(V5-l)2; (ii) (2V3-V2)2; (iii) (V3 + 1)3; (iv) (V3-V2)3;
(v) (V3 -1)4·
3. Simplify the following expressions:
(i) V18-V2; (ii) V80-V5; (Ш) V108-V75 + V48;
(iv)Vm (y) 714^/3.
12

4. Rationalize the denominators of the following expressions:
*Ъ *>M- «^ ^Ш *>£&
5. Simplify:
(i) V(6-2V5); (ii) V(10-V96); (iii) VO0+12V6); (iv) V(47-6V60);
(v) 3/V(7-V40).
6. Solve for л: the following equations:
(i) 3V*-V(*+5) = 3,
(ii) V(2* + 5)-V(* + 2) = 1,
(iii) V(* + 6) + V(4-*) = Vd-3x).
7. Rationalize the denominators of the expressions:
8. Theequation V(x + 4-7) + V(* + 4-9) = V33
was solved to give an answer χ = 31 (to one decimal point). Without using
tables, state why this is clearly wrong.
9. Given that V5 » 2-23607, J2 χ 1-41421, evaluate:
(i)75; (ii)V^72'
giving your answers to the greatest accuracy that you can guarantee.
10. Given that, to six significant figures, V6 = 2-44949, V3 = 1-73205, evaluate
as accurately as possible 1/(2^3 —V6), justifying the accuracy you give.
11. Rationalize the denominator of the expression #3/(^3 + 1), by using the
factorization a3+bs = ia + b){a2-ab+b2).
12. Rationalize the denominators, and simplify as far as possible the following
expressions:
6. SETS OF NUMBERS
Suppose we have a collection of numbers, the whole collection being
denoted by the letter It. Suppose, too, we ask a question which may be
answered unambiguously 'Yes' or 'No' for each number (or element)
of U. Then the collection of those elements of 11 for which the answer is
'Yes' is said to form a set S, which is a subset of the universal set It,
13

NUMBERS AND INEQUALITIES [1
written Sill. The collection of those elements of 11 for which the answer
is 'No' is called the complement of S, written S'.
It may happen that the answer to the question is 'No' for each
element of 11; in this case we say S is empty and write this as S = 0.
(0 is usually referred to as the null set, or empty set.)
As an example, suppose that 11 is the set of integers from 1 to 10. We
may write this as Ц = {1, 2, 3, .... 10}.
The question may be posed: 'Is the element χ of 11 divisible by 3?'
Those elements for which the answer is 'Yes' form a set A where
A = {3, 6, 9}.
The complement of A is given by
A' = {1, 2, 4, 5, 7, 8, 10}.
Again, the alternative question 'Is the element χ of 11 irrational?' may be
asked and unambiguously answered for each element of 11. The answer is
'No' in every case; the set defined is thus 0 and its complement is U.
In this example, the universal set 11 contains only a finite number of
elements. It may well happen, however, that 11 contains an infinite number
of elements; for example, 11 might be the set of all positive integers. The
proper subset Ε defined by the question 'Is the element χ of 11 even?'
also contains an infinite number of elements. {P is a proper subset of Q
if Ρ is contained in Q but is not the whole of Q, written Ρ c Q. Ρ я Q
means that Ρ is a subset of Q, but may be the whole of Q.)
It is convenient to develop a shorthand notation for the somewhat
cumbersome method we have used so far in defining our sets. The method
adopted is to use braces {:} with a colon (:) or vertical line (|) (read as
'such that') separating the two necessary pieces of information:
(i) within the bracket and to the left of the colon is stated the universal
set from which the elements are drawn;
(ii) within the bracket and to the right of the colon is the statement
defining which particular elements of the universal set are to be chosen.
For instance, for the sets 11 and A mentioned above we may write
A = {x e U:x is divisible by 3}.
The sign e means 'is a member of' or 'is an element of '.
Ex. 20. If U is the set of positive integers {1,2,3,..., 9} write down the elements of
the following sets:
(i) {x e VL: χ+4 e U}; (ii) {x e VL: J χ is rational}.

6]
SETS OF NUMBERS
Ex. 21. If U is the set of all positive integers {1, 2, 3, ...} describe in words the
following sets:
::i(x+l)eU}; (ii) {xeU: x-1 eU};
: &XeЩ; (iv) {xeU: x*-5x-6 = 0};
(0 {«U
(iii) {xeU
(v) {*eU:
x*-bx-l =
Ex. 22. Two sets A and Л which are both subsets of the same universal set U are
said to be equal (A = B) if they contain precisely the same elements.
Show that А = ВоАяВ and В Я А.
We may combine two subsets А я VL and В я VL according to the two
rules of union (u) and intersection (n) defined as follows:
A U В is the set of all elements of 11 that are members of either A or В
or both.
Α Π Β is the set of all elements of 11 that are members of both A and B.
From these definitions it follows at once that the operations of union
and intersection are commutative; that is,
A\J В = B\J A and A (] В = В f] A.
The operations of union, intersection and complementation may be
exhibited pictorially using a Venn diagram. In a Venn diagram, the uni-,
versal set 11 is represented by a rectangle; any subset A £ 11 is depicted
by a closed region lying within the rectangle; see Figure 1.3.
In Figures 1.4, 1.5, 1.6, A U В, А П В and A' are shown shaded:
Ex. 23. Verify, using Venn diagrams, that the operations of union and
intersection are associative; that is
A U (В U Q = (A U В) U С; Α η (Β η С) = (А П В) η С.

NUMBERS AND INEQUALITIES [1
Ex. 24. Verify, using Venn diagrams, that the operation of union is distributive
over intersection, and that intersection is distributive over union; that is
A U (В П Q = (A U В) П (A U С); Α η (В U Q = (Α η Л) U (Α η С).
Ex. 25. Verify rfe Morgan's Laws, using a Venn diagram:
(i) 04 U Л)' = / Л i'; (ii) (А П 5)' = Λ' U Л'.
(The reader may be able to identify these laws with the logic of and (n),
or (U) and not (').)
Ex. 26. If U is the set of all positive integers and
A= {xeU-.ixeU}, В = {xeU: ixeU}
describe An B, using the {:} notation.
Ex. 27. If U = {a, b, c, d, e} and A = {a, c, e} write down the subsets:
(i) A U A'; (ii) Α η A'; (iii) AuU; (iv) A n U; (v) A U 0; (vi) A n 0;
(vii) U'; (viii) 0'.
Ex. 28. If, with the notation of Ex. 27, Л = {я, 6, e), write down the subsets
(i) AnB; (ii) Λ' U B; (iii) Λ η Л'.
Verify de Morgan's Laws (see Ex. 25) in this particular case.
Three particular sets, the set of all integers, the set of all rational numbers
and the set of all real numbers occur throughout mathematics with such
unfailing regularity that it is convenient to introduce a notation by which
to refer to them.
The set of all integers is denoted by Z.
The set of all rational numbers is denoted by Q.
The set of all real numbers (that is, the union of the set of all rational
numbers and the set of all irrational numbers) is denoted by R.
With a suitable definition of each of the terms integer, rational and real
we have the following relation between them
Ζ С Q С R.
The sets of all positive integers is denoted by Z+, of all positive rational
numbers by Q+ and of all positive real numbers by R+.
Ex. 29. Enumerate the elements of the following sets:
(i) {xeZ:(x-2)(2x+l)(x*-2) = 0};
(ii) {xe Q+:(x-2)(2x+l)(*-2) = 0};
(iii) {xe β:(*-2)(2χ + 1)(χ·-2) = о};
(iv) {xeR+:(x-2)(2x+l)(x*-2) = 0};
(v) {xe R:(x-2) (2*+l) (x2-2) = 0}.
16

η INEQUALITIES
7. INEQUALITIES
We have already asserted that the integers and the rational numbers may
be ordered; that is, given two integers (or rational numbers) χ and y,
we may answer the question 'Is χ greater thany?' The same is true for the
set of all irrational numbers, or, indeed, for the union of the sets of rational
and irrational numbers (the set of real numbers).
If x, у are two real numbers, χ > у (read 'χ is greater than y') means
that the point representing the number χ lies to the right of the point
representing the number y. Similarly for χ < у ('х is less than y'). When
comparing two positive or negative numbers χ and у in this way, we see
that . ,. . _ .
χ > yox—y > 0 (i.e. x—y is positive),
x<yox—y<0 (i.e. x—y is negative),
χ = yox—y = 0 (i.e. x—y is zero).
The number zero separates the positive and negative numbers; it is itself
neither positive nor negative.
By continued use of these results we see that we may add or subtract
any number from both sides of an inequality without altering the validity
of the inequality sign.
Inequalities are of value in defining sets. Thus, for example, to take a
rather trivial case, {xeZ: 0 < x < 3) - {1,2).
A more substantial example would be
{xeR:x2 < 4}.
Solving an inequality means redefining the set of elements which satisfy
the given inequality in as simple a manner as possible.
Ex. 30. What does 'solving an equation' mean?
Example 10. If χ is a real number, solve the inequality 3-х > 2.
The question asks us to redefine the set
{xeR:3-x> 2}
as simply as possible.
SinCe 3-*>2
=> 3 > 2+x (by adding χ to both sides)
=> 1 > χ (by subtracting 2 from both sides)
the solution set may be rewritten
{x&R:x < 1}.

NUMBERS AND INEQUALITIES [1
The rule for redefining a set by multiplying or dividing both sides of an
inequality by a number is slightly more complicated. If x—y and ζ are
both positive, then {x-y) ζ is positive, thus
χ > у and ζ > 0 => xz > yz.
If, on the other hand, x-y is positive but ζ is negative, then (x-y) ζ is
Thus, if a set is defined by an inequality χ > у, the same set is defined by
xz > у ζ if ζ is positive,
or by xz < yz if ζ is negative.
A similar result holds for division.
We now append an alternative solution to Example 10:
3-х > 2
=> -3+x < -2, multiplying both sides by -1,
=> χ < 1, adding 3 to both sides,
and we have, as before, the solution set
{x&R:x < 1}.
Ex.31. Solve the inequalities
(i) 3 + x < l;(ii) 2-3* > -l;(iii)3-4x> 1.
The reader should have observed that in, for example, the result
χ > у and ζ > 0 => xz > yz
the two-way implication was not used: it is not valid to deduce from
xz > yz that χ > у and ζ > 0. However, inequalities involving products
may be solved by observing that, if the product is positive, then the factors
are either both positive or both negative. Another useful observation is
that a squared number is always positive or zero.
Example 11. Solve the inequality (x + 2) (x-1) > 0 (x real).
If(x + 2)(;t-l) > Othen
either (i) (x + 2) > 0 and (x-1) > 0,
or (ii) (x + 2) < 0 and (x-1) < 0.
The first pair of inequalities has as solution set
{xeR:x > -2} η {xeR:x > 1} = {xeR:x > 1}.

7] INEQUALITIES
The second pair of inequalities has as solution set
{xeR:x<-2}[){xeR:x< 1} = {xe R: χ < -2}.
The complete solution is thus the union of these two sets
{x e R: either χ < -2 or χ > 1}.
Alternatively, the argument may be presented very clearly in the
following tabular form. The critical values of χ for which (x + 2) {x— 1) changes
sign are χ = —2 and χ = 1. We divide the possible values of χ into the
three ranges χ < — 2, — 2 < л: < 1 and χ > 1 and consider the signs of
(x + 2), (x-l) and hence of (x + 2) (x-l) in each interval.
(x+2Kx-\)
x<-2
~
+
-2<x<\
+
-
x> 1
+
+
+
Example 12. Prove that the expression у = x2 + x +1 is positive for all real
values of x, and find its minimum value.
It will be useful in the solution of this example to employ the notation
> : χ it у means 'x is greater than, or equal to, y\
N°W 2, ,1
у = x2 + x+l
= (* + i)2+l,
.·. y-ϊ = (*+i)2,
.·. y-l > 0 or у > %.
Thus у is always positive and has a minimum value off (when χ = -|).
Given two positive numbers χ and у we define
(i) the arithmetic mean {a.m.) as A = \{x+y);
(ii) the geometric mean (G.M.) as G = -J{xy).
We conclude this chapter by proving a famous theorem (which may be
extended, see Miscellaneous Exercise 1, Question 13).
Theorem 1.1. The a.m. of two positive numbers is at least as great as their
G.M.
Proof. Since the two numbers are positive, we may write them as
u2 and v2.
Let A = (u2 + v2)/2, G = uv; we have to prove that
A- G > 0,
19

NUMBERS AND INEQUALITIES [1
thatis u2 + v2 > 0
L.H.S. = \{u2 + v2-2uv)
= ¥u-vf
> 0, unless и = v.
We have thus proved that A > G unless u2 = v2 in which case A = G.
Example 13. 200 m of wattle fencing are to be bent to form three sides of a
rectangular enclosure, the fourth side being a straight hedge. Find the length
of the rectangle if the area to be enclosed is to be a maximum.
Let xmbe the length, у m the breadth, A m2 the area of the enclosure.
Then we have ... „ ... .... .
(ι) x+2y = 200, (n) xy = A.
Both χ and 2y are positive; thus, by the theorem just proved, their a.m.
is at least as great as their G.M. That is
\{x + 2y) > J(2xy).
Using (i) and (ii), this gives m ^ ^(^
i.e. A ^ 5000.
Equality occurs only if χ = 2y, i.e. χ = 100, у = 50. Thus the maximum
area occurs when the length is 100 m.
Exercise 1 (d)
1. Verify, using a Venn diagram, the results:
(ί) ΑΌ(Α'Ό BY = An (B' U A),
(ii) (AU B)n(A'nC)' = AU(Bn O,
(iii) [A' U {Β' η СОГ = {А П В) и {А П С).
2. If R is the set of real numbers and
A = {xeR:x> 3}, B={xeR;x<4}
describe, using the {:} notation
(i) AD B; (ii) Α η Β; (iii) A' U B.
3. Taking the universal set, U as the set of all integers, describe using the {:}
notation
(i) the set of all negative integers;
(ii) the set of all positive integers divisible by 6;
(iii) the set of all integers, excluding 0, ±1, ±2.
4. A = {xeR: -1 < χ < 3}, В = {xe R: 2 < χ < 4}, C= {xeR:x>3}.
Find expressions for the following sets, using the {:} notation:
(i) A n B;(ii) A' n С; (iii) (A U Β) η C';(iv) iU(fin C);(v) (A' U Β') η С;
(vi) Α' υ (Β' η Q; (vii) /ninC.
20

7] INEQUALITIES
5. A = {xeR:x> 0}, В = {xe R: χ < 1}, С = {xe R: -1 < χ < 2},
D = {xeR: -2 < χ < 1}.
Find expressions for the following sets, using the {:} notation:
(i) Α η Β Π С η D; (ii) Α' η Л П С Π D';
(iii) 04 U Β') η (С η DO; (iv) (Λ' П В') и (С П Ζ>).
6. Solve the following inequalities:
(i) x + 2 < -3; (ii) 3*-l > 5; (iii) 4x-3 < 3x + 4; (iv) 2x > x;
(v) x2 > x.
7. Solve the following inequalities:
(i) (* + 3)(*-l)> 0; (ii) (2x-l)(3x + l)> 0; (iii) (x-3) (2x + 3) < 0;
(iv) x2-6x + 9 < 0;(v) x2-4x-5 < 0.
8. Solve the inequality
(2x + l)(x-2)(x + 3) > 0.
9. Prove that the following expressions are positive, for all real values of x,
and find their minimum values:
(i) x2 + 2x + 2; (ii) x2-6x + 12; (iii) 2x2-2x + l.
10. Find the signs, for all values of x, of:
(i) 2x-x2-2; (ii) 2x2-x+l.
11. For what values of χ is the expression
(*-l)(* + 2)(*-3)(* + 4)
positive?
12. If xy = 25, find the least possible value for χ+у, if both χ and у are positive.
13. If xy = 18, find the least possible value for 2x+y, if both χ and у are positive.
14. If x+y = 2, find the maximum value of xy.
15. If 2x + Ъу = 120, find the maximum value of xy.
16. Some netting is required to make three sides of ^ ρ Β
a rectangular chicken-run, the fourth side being an ι 1 1
existing wall. Find the least length of netting needed
for an area of 50 m2.
17. Figure 1.7 shows part of the framework of a kite. | | |
ABCD is a rectangle; Ρ and Q are the mid-points of О Q с
AB, CD respectively. Find the maximum area of
ABCD that can be made if 4 m of wood are available. FiS· J ·7
What are the dimensions of the kite necessary to attain this area?
Miscellaneous Exercise 1
1. Express 120 as a binary number.
Find the least number of stamps required so that any value from ip to 60p
(in steps of £p) may be selected, and give their values. (You may assume stamps
of any value are available if required.)
21

NUMBERS AND INEQUALITIES [1
2. Show that it is impossible to choose values of a and b so that the number
written as ab in the scale of ten is equal to the number written as ba in the scale
of twelve.
Show, however, that ab (scale of ten) = ba (scale of seven) is possible, and
give an example.
3. Prove the well known result that the remainder on dividing a number by
9 is the same as the remainder on dividing the sum of its digits by 9.
Show that this result may be generalized as follows: if a number is divided by s
then the remainder is the same as the remainder on dividing by s the sum of its
digits, when it is expressed to the base s+1.
4. Prove that, if a number is divided by 11, the remainder is either the same as
the remainder on dividing the difference between the sums of the digits in the even
and odd places, or else the sum of the two remainders is 11.
Generalize this result along the lines of Question 3.
5. Prove that, if the digits of an integer (expressed to the base ten) are
rearranged in any way to form another integer, the difference between the two
integers is divisible by 9.
6. If abc+abc = cba in the scale of five (when abc here means 52a + 5b + c) find
values for a, b, с
Show that this question is always soluble provided a base of Зи-l is used,
where и is a positive integer.
7. Prove that V3-V2 is irrational.
8. Solve the equation
V(4x-2)+V(*+l)-V(7-5x) = 0. (O&C)
9. Verify that the expression
a2 + b2 + c2-2bc-2ca-2ab
is equal to (a + b - c)2 - 4ab. Hence, or otherwise, prove that the expression is
6qUal l° (α+Α + у) (μ-β-Ύ) ("-Α + 7) (*+β-Λ
where a. = *ja, β - 4Ь, У = Vе·
Hence, or otherwise, find one solution of each of the equations
(i) V(*-6)+V(*-D = V(3*-5);
(ii) V(6-x)-Vd -*) = V(5-3x). (O & C)
10. If a > b and с > d, prove that ac + bd > bc + da.
What happens to the third inequality if a < b, с < dl
11. If the universal set is taken as the set R of all real numbers, the three sets
А, В and С are defined as follows:
A = {xeR:x> 2}; В = {xe R: 1 < χ < 4}; С = {xe R: χ < 3}.
Express, in terms of any or all of А, В, С (and using the notation of union,
intersection and complement) the following sets:
(i) {xeR:2 < χ < 3}; (ii) {xe R: χ < 1};
(iii) {xe R: 3 < χ < 4}; (iv) {xe R: 1 < χ < 2 or 3 < χ < 4}.
22

7] MISCELLANEOUS EXERCISE 1
12. Show that the expression x2 + 8xy-5y2-k(x2+y2) can be put in the form
a(x + by)2 when к has either one or other of two values. Find these values and the
values of a and b corresponding to each value of k.
Prove that when the variables χ and у are restricted by the relation
x2 + y2 = 1,
but are otherwise free, then
-7 < x2 + 8xy-5y2 < 3. (O&C)
13. a, b, c, d are four unequal positive numbers. By using Theorem 1.1 and
considering first the pair of numbers i(a + b) and Цс + d), and then, separately,
the pairs a, b and c, d, prove that
Ua + b+c+d)> (abed)*.
Deduce, by considering the four unequal numbers a, b, с and j(a + b + c),
that &a + b + c)> (abc)K
What happens to the last inequality (i) if a = b = c; (ii) a = b Φ с?
Suggest a generalization of the results proved above.
14. If x + 2y+3z = 1, where x, y, ζ are positive numbers, find the maximum
value of xyz.
If u+v = 1, where u, ν are positive numbers, find the maximum value of u2v.
15. A cylindrical vessel, with one end open, is made from a given piece of
material. Show that its volume is greatest if the height and radius are equal.
16. a, b, с are non-zero rational numbers. Show that, if
and if none of a*, 6*, c* is rational, then a*, 6*, c* are each rational multiples
of the same irrational number.
17. Show that, if p/q is a good approximation to V2, then (p2 + 2q2)/(2pq) is a
better one. Starting with ρ = q = 1, show that V2~ 577/408, and estimate the
accuracy of this approximation.
23

2. Vectors and vector geometry
1. SCALAR AND VECTOR QUANTITIES
Many physical quantities are completely specified by their magnitude
alone. For example, if the mass of the box is m kg and we are told that
m = 8, we know the mass of the box precisely. Not all physical quantities
are so easily described as this; for example, if the position of the point Ρ
relative to the fixed point О is denoted by r, then the statement 'r = 8 m'
is not sufficient for us to locate P: it may be anywhere on a sphere, centre
О and radius 8 m. In order to determine the position of Ρ we need to be
given its direction, specified in any suitable way.
The position of Ρ relative to О is an example of a displacement; that is, a
line segment whose length, direction and sense are given. Experimental
evidence reveals that many physical quantities obey the same mathematical
laws as do displacements; such quantities are called vector quantities. We
shall give a precise definition of a vector quantity in the next section,
observing here merely that vector quantities require a direction and sense,
as well as a magnitude, for their specification. Physical quantities which
require only a magnitude (that is, a pure number) to describe them are
called scalar quantities. Here are a few examples of each type:
Vector quantities: displacement, velocity, acceleration, force,
momentum, electric intensity.
Scalar quantities: mass, time, temperature, energy, electric charge,
electrostatic potential.
2. VECTORS AND THE TRIANGLE RULE
Since all vector quantities obey the same mathematical laws as
displacements we shall concern ourselves exclusively with displacements in building
up the relevant mathematics. It should be borne in mind, however, that
the theory being developed is applicable to a wide range of physical
quantities.
Before embarking upon a study of operations involving displacements
we must define what we mean by the equality of two such quantities. A
displacement is typified by its magnitude, sense and direction and so it
seems reasonable to regard two displacements of the same magnitude which
are parallel in the same sense to be equal. Thus, in Figure 2.1, we have
24

2] VECTORS AND THE TRIANGLE RULE
AB = CD = EF = GH = .... We shall use a single symbol to represent
any member of the class of such equal displacements
a = AB = CD = EF = ....
Such a representative of a whole class of displacements is called a
vector. In other words, a vector is a mathematical entity, to which a whole
class of geometrical objects correspond. Vectors are usually printed in bold
face type: a, b, c, .... In manuscript they should be indicated by a wavy
line beneath the letters, a, b, c.
The equality of two vectors a = b, means that a and bare both
representative of the same class of parallel and equal displacements.
Fig. 2.1 Fig. 2.2
The magnitude of a vector, a, written |a|, is the magnitude of any one of
the displacements of which it is representative. It is most important to
remember that |a| = |b| does not imply that a = b; put another way,
two unequal vectors may have the same magnitude.
Two displacements AB and ВС may be combined together, or added,
according to the triangle rule, AB + BC = AC (see Figure 2.2).
Two vectors may similarly be added: to find a + b we draw any
displacement AB represented by a; we then select the unique displacement ВС,
represented by b which has as its initial point the point В and the vector
a + b will be representative of that class of displacements of which AC is a
typical member.
We have said that vector quantities are physical quantities that may be
completely represented by a displacement. Since displacements obey the
triangle rule of combination, so must vector quantities generally. Indeed,
we are now in the position to make a formal definition:
A physical quantity is a vector quantity if
(i) it has magnitude, direction and sense;
(ii) it obeys the triangle law of addition.^
t The reader should observe that certain vector quantities must have their position
specified too. For example, to describe a force we need to know not only its magnitude
and direction, but also its line of application.
25

VECTORS AND VECTOR GEOMETRY [2
The observation that any particular physical quantity is indeed a vector
quantity is the result of experimental evidence, which must include a
verification of the triangle rule.
Ex. 1. Can you suggest any addition to the list of vector and scalar quantities
given in Section 1 ?
Ex. 2. Can three displacements be added together? Does it matter in what
order it is done?
Ex. 3. A rotation may be given a magnitude (size of angle turned through),
direction (axis of rotation) and sense (positive being direction for which rotation
is anticlockwise). Show that rotations are not vector quantities. (This example
shows that physical quantities exist that are neither scalar nor vector quantities.)
3. OPERATIONS WITH VECTORS
Since all parallel and equal line segments represent the same vector a
we may, when verifying the various rules of vector algebra geometrically,
choose as representative those line segments which are most convenient.
Rule 1. Vector addition is commutative, a+b = b + a.
If AB = DC = a and ВС = AD = b, ABCD is a parallelogram and
a+b = AC = b+a.
Rule 2. Vector addition is associative. (a+b)+c = a + (b + c).
If AB = а, ВС = b, CD = c,
then (a +b) + с = AC + CD = AD,
and a + (b + c) = AB+BD = AD.
We may thus drop the brackets when adding three vectors together, and
write a+b+c without ambiguity.
If AB represents the vector a then BA will be said to represent the vector
26

3]
OPERATIONS WITH VECTORS
-a. We shall write a + (-a) = 0 for all a and call 0 the zero vector.
(Note that the zero vector, unlike all others, has no direction.)
If AB = a and AC = b then, since CA+AB = CB, the line segment
CB represents a-b (see Figure 2.5). (Notice that we have used the com-
mutativity of vector addition here.)
To develop the algebra of vectors further, we need a new definition, the
multiplication of a vector by a number {scalar).
If A: is any positive number, кл is the vector of magnitude к |а | in the same
direction as a and with the same sense.
Fig. 2.5
Fig. 2.6
If к is any negative number, кл is the vector of magnitude -к\л\ in
the same direction as a and with the opposite sense.
Thus, multiplying a vector by a number simply stretches or contracts
the vector along its length. Notice that -a = - la.
Rule 3. Multiplication by numbers is associative.
k(h) = (kl) a.
This is immediately apparent, since a, /a and (к!)я all have the same
direction, and the magnitude and senses of Щл) and (кГ)л are both (kl) | a |.
Rule 4. Multiplication by numbers is distributive over vector addition.
к(я + Ъ) = кя+кЪ.
Representing a by AB and b by ВС, кл by AB' and kb by B'C, ACC is
a straight line by similar triangles and the result follows, since the
corresponding sides are in proportion (Figure 2.6).
Rule 5. Multiplication by numbers is also distributive in the following
sense. ,. ,
(к + Т)л = кл + 1л.
The vectors on either side of the above identity have the same magnitude,
sense and direction, as can be seen immediately from a diagram.
*Ex. 4. Prove that O.a =
zero vector?
0. What does this tell us about the magnitude of the

VECTORS AND VECTOR GEOMETRY [2
Ex. 5. If a is a vector of magnitude 4 units due north and b is a vector of
magnitude 1 unit due east, describe the vectors:
(i) 2a; (ii) -3b; (Hi) a-4b; (iv) 2(a+b); (v) 2a-6b.
Ex. 6. If a = 3x+ y, b = x— 2y, find x, у in terms of a, b, justifying your
argument in terms of Rules 1-5.
Ex. 7. If a is a vector of magnitude 1 unit due north and b is a vector of
magnitude 1 unit in the direction N 60° E, describe the following vectors (using a scale
drawing or trigonometry, if necessary):
(i) 2a+b; (ii) -(2a-b); (iii) b-a; (iv) -a-2b.
Ex. 8. If a is a vector of 2 units due north in a horizontal plane, b is a vector of
3 units due east in a horizontal plane and с is a vector of 1 unit vertically up out
of the plane, describe the vectors:
(i) a-2c; (ii) -b-3c; (iii) 3a+2b+6c; (iv) 3a-2b-6c.
Ex. 9. ABCD is a parallelogram with А В 2 units and ВС 1 unit. If a is a vector of
1 unit in the direction AB and b is a vector of 1 unit in the direction of ВС, and
if Ε is the mid-point of CD, find, in terms of a and b, the vectors AC, DB, AE,
EB.
4. COMPONENTS OF A VECTOR
It is often useful to express a given vector г as the sum of a number of
vectors in specified directions
г = я1 + а2+... + я„.
If this is done, аь л^, ..., ая are called components of r.
Given a vector г and specified directions two questions immediately
arise:
(i) Can г be split into components in these directions?
(ii) If the answer to (i) is 'Yes', in how many alternative ways can this
be done ?
We shall answer these questions for a vector г and three non-parallel,
non-coplanar directions in three dimensions by proving two theorems of
types that constantly recur in mathematics. The first theorem is an example
of an Existence Theorem: it will answer question (i) above, telling us that a
solution does exist; the second theorem is a Uniqueness Theorem: it will
answer question (ii) above, telling us that the solution which we know
exists is the only possible one, that is, that it is unique. The significance
of these two theorems will become apparent as we proceed.
Theorem 2.1 (The Existence Theorem). Given a non-zero vector г and three
non-coplanarf, non-parallel vectors a, b, c, there exist numbers Α, μ, ν, such
that ■> , u ,
г = ΛΛ + μο + νΐ.
t That is, displacements represented by a, b, c, cannot all be chosen to lie in one plane.
28

4] COMPONENTS OF A VECTOR
Proof. Represent the vectors a, b, c, r, by line segments OA, OB, ОС,
OP respectively. Through Ρ draw a plane parallel to the plane OBC to
cut the line OA at A' (this may always be done, by the data). Let P' be the
point in the plane OBC such that OP' = A'P.ThroughP'draw a line parallel
to CO to cut OB at B' (this again may be done using the data). The
constructions are illustrated in Figure 2.7.
r = OA'+A'P
= Aa + OP'
= Aa + OB' + B'P'
= λя + μЪ + ι>c.
Fig. 2.7
We have thus proved our Existence Theorem by showing that, with the
given data, a set of components may be constructed. To demonstrate the
truth of the Uniqueness Theorem we employ a common device in
mathematical arguments: we assume that it is false and show that this leads to a
contradiction (see Chapter 9).
Theorem 2.2 (The Uniqueness Theorem). The solution shown to exist in
Theorem 2.1 is unique.
Proof. By Theorem 2.1 a solution exists, say
r = Л^ + ^Ь + ^с.
Assume that a different solution also exists
г = A2a + /t2b + i>2c
where, say, Aj Φ Α2. Then
Aja-b/tib + ^c = A2a+/i2b + i>2C·
and so (A2 - Aj) a = (μ1 -μ2)Ъ + (vx -v2)c.
But (μ1 — μ2)Ъ + (v1-v2) с is a vector in the plane determined by b and с
2-2 29

VECTORS AND VECTOR GEOMETRY [2
and any non-zero multiple of a cannot lie in this plane, by the data. Thus
Aj = A2, which contradicts our initial assumption.
Theorems 2.1 and 2.2 have been proved for three dimensions; they hold
equally well in two dimensions, for a vector г lying in the plane
determined by a and b, where а ф kb.
An expression such as λл + μЬ + vc is called a linear combination of the
vectors a, b, с The theorems we have just proved may be restated in the
form 'Given three non-parallel, non-coplanar vectors a, b, с any non-zero
vector г may be expressed uniquely as a linear combination of a, b, c'.
If numbers Α, μ, ν (not all zero) exist such that
λя+μЪ + vc = 0
the vectors a, b, с are said to be linearly dependent; if no such numbers
exist, a, b, с are said to be linearly independent.
*Ex. 10. Prove that, if О, А, В, С are coplanar, then the position vectors a, b,
с are linearly dependent and, conversely, if a, b, с are linearly dependent, then
0. А, В, С are coplanar.
*Ex. 11. Prove the Existence and Uniqueness Theorems for components in two
dimensions.
Ex. 12. ABCD is a parallelogram, and Ε is the mid-point of CD. AB = a,
AD = b, AE = x, BE = y. Express χ and у in terms of components in the
directions a and b and also express a and b in terms of components in the directions χ
and y.
Ex. 13. Do either the Existence or the Uniqueness Theorems hold in three
dimensions if four directions a, b, c, d are given ?
Exercise 2 a
1. If ν = a + 2b, w = 2a—b, express the following vectors in the form
Aa+/ib.
(i) v + w; (ii) 2v + 3w; (iii) v-3w; (iv) 2(v-w).
2. If u = a + 3b, ν = 2a — b, express a and b in terms of u and v.
3. If u = a + b + c, ν = a + 2b + c, w = a—b—2c, find, in terms of a, b, c:
(i) u + v + w; (ii) u-v-w; (iii) 2u + v-w; (iv) u + 2v + 3w.
4. If u = a + b-c, ν = 2a-b + c, w = 3a+2b + c, find a, b, с in terms of
5. If i and j are perpendicular vectors of unit magnitude, i pointing due east and
j due north, find the magnitudes and directions of the following vectors:
(i) 2i; (ii) -3j; (iii) 3i + 4j; (iv) i-j; (v) -3i+4j.
30

4] COMPONENTS OF A VECTOR
6. If ρ is a vector of magnitude 2 pointing due east and q is a vector of magnitude
5 pointing due south, find the magnitudes and directions of the following vectors:
(i) 6p + q; (ii) 6p-q; (iii) -p-q.
7. If ρ is a vector of magnitude 1 pointing due east and q is a vector of
magnitude 2 pointing north-west find (by accurate drawing, if you wish) the vectors
(i) 2p + q; (ii) p-q; (iii) -p-q; (iv) -4p + 2q.
8. χ is a vector of magnitude 1 pointing due east; у is a vector of magnitude 2
pointing south-west; ζ is a vector of magnitude 20 pointing due north. Express
ζ in the form Ax+/iy, giving the explicit numerical values of λ and μ.
9. If a is a vector of magnitude 1 pointing due north, b is a vector of magnitude
1 pointing N 20° Ε and с is a vector of magnitude 2 pointing N 70° W, express с
in the form с = Ла+/Л.
If d is a vector of magnitude 2 pointing N 10° E, find approximate values for
λ', μ' where d = A'a+/i'b.
10. If i, j, к are vectors of unit magnitude pointing respectively due east, due
north and vertically upwards out of the plane containing i, j, find the magnitude
of the vector u = 4i+ j + 8k.
If ν = 4i—j + 8k and w = 4i—j —8k, find the angles between the vectors
(i) u, v; (ii) v, w.
11. Figure 2.8 represents a lattice of congruent parallelograms; OA = a,
OB = b, AB = с Express OP in terms of
(i) a, b; (ii) a, c; (iii) b, c;
Fig. 2.8
and do the same for the vectors OQ and OR. Express PQ in terms of a, b, and
RQ in terms of b, с Find two different expressions of the form oa+^b+yc for
the vector OQ, with none of α-, β, γ zero. Show that, if one of these expressions
is subtracted from the other, the known relation b = c + a is obtained.
12. In Figure 2.8 ОС = u, OD = v, QP = w. Obtain expressions for OP, OQ,
OR in terms of u, ν and hence obtain w in terms of u and v.
Why is it not possible to express CD in terms of the two vectors PQ, AR?
13. ABCDEF is a regular hexagon, and AB = а, ВС = b. Find CD, DE, EF,
FA in terms of a and b.
14. ABCDA'B'C'D' is a cuboid whose base ABCD is a square of side 2 units.
The sides AA', etc., are vertical and of magnitude 4 units. Ε is the mid-point
31

VECTORS AND VECTOR GEOMETRY [2
of AB, G is the mid-point of B'C and F is the mid-point of CC a, b, с are
three vectors, each of magnitude 1 unit in the directions AB, AD, AA' respectively.
Find, in terms of a, b, с the vectors DE, AF, EF, GF, GE.
15. With the data of Question 14, express ED', EG and EF in terms of
components in the directions a, b and с and also express a, b, с in terms of components
in the directions ED', EG and EF.
16. OABCO'A'B'C is a rectangular box with square ends О ABC and O'A'B'C.
OO', AA, BB\ CC are parallel edges. If АО = ОС = 1 unit, OO' = 2 units,
and if i, j, к are unit vectors (that is, vectors of unit magnitude) along О А, ОС,
OO' respectively, find, in terms of i, j, k:
(i) ОС; (ii) OB'; (iii) C'A; (iv) OD,
where D is the point on CB' produced such that C'B' = B'D;
(v) OM, where Μ is the mid-point of AA'; (vi) MD.
17. О ABC is a regular tetrahedron of side a; p, q, г are unit vectors (see Question
16) along О A, OB, ОС respectively. Find in terms of p, q, r:
(i) AB; (ii) OD, where ABCD is a rhombus.
18. ABCD is a square andP, Q are the mid-points of ВС, CD respectively. Find,
in terms of u = AP and ν = AQ,
(i) AB; (ii) AD; (iii) BD.
19. With the notation of Question 16, OB' = u, AC' = ν, ΟΉ = w; find, in
terms of u, v, w:
(i) OA'; (ii) ВС; (iii) ΟΉ'; (iv) ВС.
20. With the notation of Question 19, if CA' = x, find two distinct expressions
for CB' in terms of u, v, w, x.
21. ABCDA'B'CD' is a cube, with faces ABCD, A'B'CD' and edges AA', etc
Find, in terms of AD', AC, AC, the vectors D'B and A'C.
22. ABCDEFGH is a regular octagon. If AB = а, ВС = b, find, in terms of a, b,
the displacements CD, DE, EF, FG, GH, HA.
5. APPLICATIONS TO GEOMETRY
So far we have developed the algebra of vectors using geometrical
arguments ; we now reverse the process and show that the algebra of vectors
may usefully be employed to deduce geometrical results. The vector
treatment of geometrical problems has the great advantage that it is equally
applicable to two or three dimensions.
To describe a geometrical configuration consisting of a number of points
А, В, С, ... we must be able to locate each point. This may be done by
taking a fixed point О (called the origin) and referring to a point A by
the line segment OA. If a is the vector representative of all the line
segments equal and parallel to OA, then a is called the position vector of the
point A with respect to the origin O. Thus, given an origin O, we may
32

5]
APPLICATIONS TO GEOMETRY
refer to all points in the plane by their position vectors relative to this
origin O.
First, we establish a result known as the Section Formula. This theorem
enables us to write down the position vector of any point on a given line
and so is of importance in setting up a vectorial description of a geometrical
configuration.
Theorem 2.3 {The Section Formula). IfAPB is a straight line, with
APjPB = λ/μ,
and if the position vectors of Α, Ρ, B, relative to any origin O, are a, p, b
respectively, then ,,
Proof. We have (Figure 2.9)
AB = b-a
and AP = p-a.
But AP=Y^-AB
λ+μ
and so (Α +μ) (ρ - a) = A(b - a),
or (λ+μ) ρ = Ab + /<a
and the result follows.
Notice that the proof holds equally well for positive or negative values
of the ratio Α/μ.
Notice also the important special case in which A = μ: the mid-point
of AB has position vector |(a + b).
Fig. 2.9
Fig. 2.10
Example 1. Show that the medians of a triangle ABC are concurrent and
find the position vector of the centroid (meet of the medians) in terms of the
position vectors a, b, c, referred to some origin O.
Let the mid-points of ВС, С A, AB be L, Μ, Ν respectively and call their
position vectors 1, m, η (Figure 2.10).
33

VECTORS AND VECTOR GEOMETRY [2
Then, by the Section Formula
l = Kb + c), m = i(c + a); η = K*+b).
Any point on AL has a position vector
λΐ+μα
λ+μ '
jA(b + c) + /<a
be· λ + μ *
For varying values of A and μ we obtain different points of the line AL;
in particular, if we choose A = 2, μ = 1, we obtain a point whose position
vector is symmetrical in a, b, с and so lies equally well on BM or CN.
Thus, the medians of a triangle are concurrent at the centroid, G, whose
position vector is given by
Notice, incidentally, that we have shown that AGjGL = 2, etc.
Example 2. Show that the mid-points of the sides of a skew quadrilateral
form the vertices of a parallelogram.
Let ABCD be the skew quadrilateral and the mid-points of AB, ВС, CD,
DA be P, Q, R, S respectively.
Take any point О as origin, let a denote the position vector of A with
respect to О and similarly for the other points.
Then we have, on using the Section Formula,
p=i(a+b), q=i(b + c), г = i(c + d), s = i(d + a)
and so p-q = Ця-с)
and s-r = Ka_c)
Thus QP = RS and so PQRS is a parallelogram (one pair of sides equal
and parallel).
Ex. 14. Show that the joins of mid-points of opposite edges of the tetrahedron
ABCD are concurrent at a point G.
Ex. 15. Show that, in a tetrahedron ABCD, the joins of vertices to the centroids
of opposite faces are concurrent at the same point G as that obtained in Ex. 14.
*Ёх. 16. What condition must the numbers α, β possess for the points with
position vectors a, b and aa+/?b to be collinear?
34

5] APPLICATIONS TO GEOMETRY
The Section Formula may be restated in a form that gives a test for the
collinearity of three points whose position vectors are known.
Theorem 2.4 {condition for three distinct points to lie in a line).
Three points А, В, С have position vectors a, b, c. Then (i) if А, В, С lie
on a straight line, there exist numbers cc, β, γ, not all zero, such that
aa + ^b + yc = 0
and cc + β + γ = 0.
Conversely, (ii) if there exist numbers α, β, у not all zero such that
aa+^b + yc = 0
cc + β + γ = 0
С are collinear, suppose ABjBC = λ/μ. Then
. /<a+Ac
i.e. μа-(μ + λ)Ъ + λc = 0.
Take α = μ, β =-{μ + λ),γ = A and the result follows,
(ii) Given that numbers α, β, у exist such that
aa+^b + yc = 0
and α+β + γ = 0
we have aa+/?b = (cc + β) c, by substitution.
Now we know that α, β, γ are not all zero and we may assume, without any
loss of generality, that у φ 0. It follows that α + β Φ 0 and so
_ aa + lb
° ~ α+β '
i.e. с is the position vector of the point dividing AB in the ratio β:χ,
from which it follows, that А, В, С are collinear.
Example 3 (Desargues's Theorem). Two triangles ABC, A'B'C' (not
necessarily in the same plane) are so positioned that AA', BB', С С all pass
through a point V. ВС, В1 С meet at L; CA, С A' at M\ AB, A'B' at N.
Prove that L, Μ, Ν are collinear.
35
and
then А, В, С are collinear.
Proof, (i) Given that A, B,
by the Section Formula

VECTORS AND VECTOR GEOMETRY [2
Denoting the position vectors of the various points in the usual way,
we have, using Theorem 2.4:
ίν + aa + a'a' = 0,
1 1+α + α' = 0;
jv+^b+^'b' =0,
1 l+/?+/?' = 0;
Fig. 2.11
From the second and third pairs of equations
/gb-yc _ /ff'b'-y'c'
β-Ύ ~ β'-Ύ'
and, by the Section Formula, these must both represent 1. Thus
(/?-7)l = fib-yc.
Similarly, (γ —a)m = yc —aa and (a —/3)n = aa —/3b.
i(/S-r)l + (r-a)m + (a-/S)n = 0,
Thus ]
I (/3-r)+(r-«)+(«-/3) = о
and so L, Μ, Ν are collinear.
Ex. 17. If ABC, A'B'C are two triangles not in the same plane and such that
AA\ BB', CC all pass through a point V, prove Desargues's Theorem by showing
that the meets of corresponding sides of the two triangles must all lie on the line
of intersection of the planes ABC, A'B'C
Ex. 18. Prove the Converse of Desargues's Theorem.
36

6] EQUATIONS OF LINES AND PLANES
6. THE VECTOR EQUATIONS OF LINES AND
PLANES
Throughout the present section we shall adhere to the generally accepted
custom that Ρ denotes a variable point whose position vector is r.
A locus is a set of points (in a plane or in space) subject to some
condition; examples are a straight line, a plane, a sphere, the interior of a
sphere, etc. In this section we shall confine our attention to lines and planes.
The equation of a line is the algebraic condition that is satisfied by the
positions vector г of a general point Ρ lying on the line. Similarly for the
equation of a plane. Later we shall meet loci that are defined by inequalities;
the interior of a sphere would be such a locus.
A straight line is completely specified if two points A, В of the line are
given. Suppose that an origin О is taken and that the position vectors of
A and В relative to О are a and b respectively. The equation of the line
AB is the condition satisfied by the position vector г of a general point
on AB. (Of course, if a different origin O' were chosen, a different equation
would be obtained: the equation of any locus depends upon the choice of
origin.) Now, since Ρ lies on AB, or AB produced (Fig. 2.12), AP = AAB,
where A is a number. For different points on the line, different values of A
are taken. This equation may be rewritten in terms of position vectors as
r-a = A(b-a)
which represents the equation of the line AB.
Fig. 2.12 Fig. 2.13
A plane is completely specified by three (non-collinear) points А, В, С
(see Chapter 3, Section 5). For a general point Ρ of the plane, AP is a vector
lying in the plane of AB and AC and so we may split AP into components
in these two directions (remember that А, В, С were non-collinear and so
two directions are defined). Thus АР = ААВ+/гАС (Fig. 2.13) and so,
in terms of position vectors relative to some origin O,
r-a = A(b-a)+/<(c-a)
37

VECTORS AND VECTOR GEOMETRY [2
which may be rewritten as
г = -(λ + μ-1) я + λЪ+μc.
Our final theorem of this chapter is essentially a re-phrasing of the
result above. It is structurally very similar to Theorem 2.4 and the reader
is advised to refer back to that theorem before continuing.
Theorem 2.5 {condition for four points to lie on a plane). Four points A, B,
C, D have position vectors a, b, c, d.
Then
(i) If А, В, C, D lie on a plane, there exist numbers oc, β, γ, δ, not all
zero, such that . „. . , »■,
aa+^b + yc + tfd = 0,
cc + β + γ+δ = 0.
Conversely,
(ii) If there exist numbers oc, β, у, δ, not all zero, such that
aa+^b + yc + tfd = 0,
cc + β + γ + δ = 0
then A, B, C, D are coplanar.
Proof, (i) Given that А, В, С, D are coplanar, d satisfies an equation of
the form ,, , ,. , ,, ,
г =-(λ + μ-l)а + λЪ+μc,
i.e. -(A+/i-l)a + Ab+/<c-d = 0
and the first result follows if we set
α=-(Λ + /ί-1), β = λ, γ = μ, 8 =-Ι.
(ii) Given that numbers ас, β, γ, S exist such that
aa+^b + yc + tfd = 0,
cc + β + γ+δ = 0.
Wehave ~(β + Ύ+δ) a+^b + yc + id = 0,
i.e. β(}> - a)+y(c - a) = £(a - d),
or MB + у AC = δΌΑ.
But AB, AC both lie in the plane ABC and so therefore does δΌΑ. It
follows that D lies in the plane ABC and the proof is complete.
*Ex. 19. Show that the equation of the line through A parallel to the direction
defined by the vector u is г = a + Au.

6] EQUATIONS OF LINES AND PLANES
*Ex. 20. Show that the equation of the plane through A parallel to the directions
denned by the two vectors m, η is г = а + Ат+/ш.
Ex. 21. Show that the mid-points of two pairs of opposite edges of a tetrahedron
ABCD are coplanar.
Example 4. ABCD is the base of a cube whose vertical edges are AA', BB',
CC, DD'. X is the point of trisection of BB' nearer Β, Υ is the point of
trisect ion ofCC nearer С and Μ is the mid-point of ВС.
If D'M cuts the plane AXY at Z, find the ratio in which Ζ divides D'M.
Take side of cube as 6 units and call vectors of unit magnitude in the
directions AB, AD, AA' respectively i, j, k.
Fig. 2.14
Working with A as origin, and denoting the position vector of X by x,
etc., we have
χ = 6i + 2k,
у = 6i + 6j+4k
and so the position vector of any point on the plane AXY is given by
г = λχ+μγ
= 6(Λ +μ) i +6μ\ + 2(Λ +2μ) к.
Again, m = 6i + 3j,
d' = 6j + 6k
and so D'M = 6i-3j-6k.
39

VECTORS AND VECTOR GEOMETRY [2
The position vector of any point on D'M is given by
г = AD'+D'P
= <1' + Л>'М
= 6j + 6k + v(6i-3j-6k)
= 6w + (6-3i>)j + (6-6i0k.
Thus, the point of intersection is given by
6ri + (6-3i0 j + (6-6i0k = 6(A+/<)i + 6/<j+2(A+2/<)k.
But i, j, к are non-coplanar vectors so, by the Uniqueness Theorem,
(ν = λ+μ,
3-3ί> = λ+2μ
and thus v = f.
Thus D'Z:ZM = 4:3.
Exercise 2 (ft)
1. ABC is a triangle. U lies on AB produced so that AB = \AU, V lies on AC
so that AV = 2VC. Find, in terms of the position vectors a, b, с of А, В, С,
the position vectors of U, V and the mid-point of UV.
2. Draw a diagram showing the relative positions of the points whose position
vectors are a, b and 3a-2b.
Prove that these three points are collinear.
3. If G is the centroid of the triangle ABC, prove that
GA+GB + GC = 0.
Suggest a generalization of this result.
4. ABC is an equilateral triangle of side 3 cm. P, Q lie on ВС, С A respectively
and are such that AQ = CP = 2 cm. R lies on AB produced so that BR = 1 cm.
Prove that PQR is a straight line.
5. О ABC is a parallelogram and the position vectors of А, В, С relative to О
are respectively a, b, с Mis the mid-point of ВС. Write down, in terms of b and
c, and hence in terms of a and c, the position vector of a general point X on the
line OM. Deduce that, if X lies on AC, then X is the point of trisection of AC
nearer С and also the point of trisection of OM nearer M.
6. OAB is a triangle; Μ is the mid-point of AB and Τ is the point of trisection
of OB nearer B. TM produced meets О A at X. If OA = a and OB = b, write
down the position vectors of Τ and M, and hence of a general point Ρ of TM.
Deduce the position vector of X and find a relation between the points О, А
andX
40

6] EQUATIONS OF LINES AND PLANES
7. The position vectors of the vertices А, В, С of a triangle are respectively a,
b, с Μ is the point of trisection of AC nearer A, N is the point of trisection of AB
nearer B. Write down the position vectors of Μ and N and hence of general
points of BM, CN. Deduce the position vector of X, the intersection of BM and
CN.
8. ABCDA'B'CD' is a parallelepiped, with parallel edges AA', BB\ CC, DD1'.
U is the point of trisection of AA' nearer A, V is the point of trisection of С1У
nearer C" and W is the mid-point of B'C. With A as origin, the position vectors
of B, D, A' are respectively b, d, a'. Write down the position vectors of U, V, W
and a general point of DD'. Locate the point at which the plane UVW cuts DD'.
9. О UVW is a tetrahedron, the position vectors of U, V, W being u, v, w. A is
a vertex of the parallelogram О UA V, В is the mid-point of VW, С is the reflection
in the origin of the point of trisection of UV nearer U. Find, in terms of u, v, w,
the position vectors of А, В, С and hence find where the plane ABC cuts О W.
10. ABCD is a tetrahedron, Ρ is the mid-point of AB, Q is the mid-point of AD
and R is the point of trisection of AC nearer C. Taking A as the origin, write
down the position vector of the general point X of the plane PQR in terms of
the position vectors of В, С and D. Write down also, in terms of the same frame
of reference, the position vector of the general point Υ of the line DM, where
Μ is the mid-point of ВС
The plane PQR cuts the line DM at Z; find the ratio ZM/MD.
11. Four points P, Q, R, S in a plane through the origin О have position vectors
OP, OQ, OR, OS given by
2i + 3j, 3i + 2j, 4i + 6j, 9i + 6j
respectively, where i and j are given non-parallel vectors. Express the vectors
PR and QS in terms of i and j.
Show that the position vectors OA and OB of the points A and В on PQ and
RS respectively, and such that PA/PQ = a and RBjRS = b, are
(2 + a)i+(3-a)j and (4 + 56) i + 6j,
respectively. Hence determine the position vector with respect to О of the point
of intersection of the lines PQ and RS. (J.M.B.)
12. Two vectors are represented by OP, OQ and R divides PQ in the ratio
*:W. Show that mOP + nOQ = (m + n)OR.
The points D, E, F divide the sides ВС, СА, AB of a triangle in the ratios 1:4,
3:2, 3:7 respectively. Show that the sum of the vectors AD, BE, CF is parallel to
CX, where Xdivides AB in the ratio 1:3. (J.M.B.)
13. ABCD is a skew quadrilateral and a plane cuts AB, ВС, CD, DA at W, Χ, Υ, Ζ
respectively. Prove that AW BX CY DZ =
WB'XC'YDZA
Suggest a generalization of this result.

VECTORS AND VECTOR GEOMETRY [2
14. ABCD is a parallelogram and О is a point in the same plane. OD cuts AB
at Ρ and ВС at Q; OB cuts CD at R and Ζ>Λ at 5. Prove that PS and βΛ are
parallel.
15. О ABC is a square of side 2a. i, j are unit vectors along О А, ОС. The mid-point
of AB is L; the mid-point of ВС is M; OL, AM meet at P; ЯР meets О A at JV.
Show that the segment OP can be measured by the vector A(2ai + aj) and also
by the vector 2αΐ+μ(2α} — αϊ). Hence determine Λ and μ. Prove that ON = $OA.
(O&C)
Miscellaneous Exercise 2
1. ABCDE is a regular pentagon. AB = aandAE = b. Show that
CD = (b-a)/(l+2cos72°)
and express ВС and ED in terms of a and b.
2. ABC is a triangle. L divides ВС in the ratio 1:2, Μ divides С A in the ratio
1:2 and N divides AB in the ratio 1:2. Prove that the triangles ABC and LMN
have the same centroid.
Suggest a generalization of this result and prove your assertion.
3. Referred to some origin O, the position vectors of A, B, C, D are a, b, c, d
respectively. Express in terms of a, b, c, d the displacements BD, AC. What can
be said about the quadrilateral ABCD:
(i) Ifa + c = b + d;
(ii) if |a-c| = |b-d|;
(iii) if both (i) and (ii) hold?
4. The triangles ABC, A'B'C have centroids respectively at G and G'. Prove
that AA' + BB' + CC = 3GG'.
5. P, Q are the mid-points of the sides ВС and CD respectively of the
parallelogram ABCD. Prove that:
AB + AC + AD =f (AP + AQ).
6. ABCD is a tetrahedron and B', C, D' lie on AB, AC, AD produced. The
centroids of the triangles BCD', B'CD', B'C'D are Gu G2, G3 respectively and the
centroids of the triangles B'CD, BCD, BCD' are Hu H2, H3 respectively. If
the centroids of the triangles BCD, G&Gs, НгЩН3 are F, G, Η respectively,
prove that FGH is a straight line.
7. ABCD is a parallelogram and points X, У are taken on the diagonal BD such
that BX = YD. Prove vectorially that AXCYis a parallelogram.
8. ABC is a triangle and Υ, Ζ are points on AC, AB respectively such that ZY
is parallel to ВС. В Υ and CZ meet at P. Prove vectorially that AP produced
bisects ВС.
9. ABC, A'B'C are two skew lines (that is, no plane contains both lines). If
AB:BC= A'B'.B'C prove that the mid-points of the lines AA', BB', CC
are collinear. Is the converse true?

6] MISCELLANEOUS EXERCISE 2
10. Prove that, if ABC is a plane through the origin О from which position
vectors are measured, and if А, В, С have position vectors a, b, c, then there
necessarily exits a relation of the form
pst+qb+rc = 0 (/ι,ί,ΓφΟ)
where 0 is the zero vector.
The lines АО, BO, CO meet ВС, CA, AB in L, Μ, Ν respectively. Prove that
the position vector of L is
-f b + ^-c.
q + r q+r
where the magnitude and sense of each line segment is taken into account. (M.E.I.)
(The result proved in Question 10 is known as Ceva's Theorem: it and its
converse are very useful for proving concurrency theorems.)
11. If a transversal cuts the sides ВС, С A, AB of a triangle ABC at L, Μ, Ν
respectively, prove that
BL.CM.AN
magnitudes and senses of each line segment being taken into account.
(This result is known as Menelaus's Theorem: with its converse it is useful for
proving collinearity properties.)
12. ABCD is a plane quadrilateral. AB and DC meet at P; ВС and AD meet at Q.
Prove that the mid-points of AC, BD and PQ are collinear.
13. P, Q are variable points on two skew lines. Find the locus of the mid-point
ofPQ. Can you generalize this result?
14. ABCD A'B'CD' is a parallelepiped, with ABCD, A'B'C'U congruent
parallelograms and AA', etc., edges. The tetrahedron ACB'D' is inscribed in the
parallelepiped. Prove that AC passes through the centroid X of B'CD' and deduce
that the joins of the vertices of the tetrahedron to the centroids of opposite faces
are concurrent at a point G. Determine the ratios AG: GX: XC.
15. Two vectors a and b, such that b is not a multiple of a, are given in a plane.
Two other vectors с and d are defined by the equations
с = 7ia + y2b,
d = ^a + ^b.
Prove that any vector aa+/Sb can be expressed in terms of с and d provided
УгК-УгК Ф 0. Find the coefficients of с and d.
Explain the geometrical significance of the condition yx S2 - γ2 8г Ф 0. (О & С)

VECTORS AND VECTOR GEOMETRY [2
16. Let О, А, В, С be four distinct points in three-dimensional space, no three
of which are collinear. The position vectors of А, В, С with respect to О are
a, b, с respectively and Xis the point given by χ = Aa+/A> + vc. Prove that:
(i) If X is a point of the line AB, then Α +μ = 1, ν = 0.
(ii) If X is a point of the plane ABC, then λ + μ + ν = 1.
(in) X is in the interior of triangle ABC if and only if λ+μ+ν = 1 and
λ>0, μ > 0, ν > 0 and indicate on a sketch the regions in the plane ABC in
which X lies for other combinations of the signs λ, μ, v. Obtain an expression for
the position vector χ of a general point X in the interior of the tetrahedron
О ABC and find the values of λ, μ, ν corresponding to the centroid. (M.E.I.)

3· Coordinates
1. UNIT VECTORS
A unit vector is a vector of unit magnitude. Thus, a unit vector is a sort
of 'signpost' giving a direction and sense; any vector may be split into
the product of a number (its magnitude) and a unit vector with the required
sense and direction. Thus we may write
г = rt,
where r = |r| and ί is a unit vector in the direction of r. (The notation
employed here is useful and should be adopted by the reader: given any
vector x, its magnitude may be written simply as χ while a unit vector
with the same sense and direction as χ is written x.) Unit vectors are useful
in that they enable us to deal separately with the magnitude and direction
of a given vector.
Now suppose we choose two directions in a plane and specify them by
unit vectors i and j. By the Existence and Uniqueness Theorem for a
plane (see Chapter 2) any displacement AB in the plane, represented by
d may be split into components in the i and j directions
d = pi + qi
where ρ and q are uniquely determined. The vector d, which is
representative of a whole class of displacements, of which AB is a typical member,
may be written in the alternative form
■Ю-
Q the vector i and l_\
Thus, (I represents the vector 2i-j, the vector i and I _ I the
vector - j. Of course, the representation of a vector in this new notation
depends upon our original choice of base vectors i and j. Again, there is
no necessity for the base vectors to be unit vectors, though they will almost
invariably be so. To illustrate these last two remarks, consider two unit
vectors i and j and the vector a where
a = 3i-j.
Now a = (i+j)+2(i-j)
and so we have .
a = I _ I with i, j as base vectors,
45

COORDINATES
with (i + j), (i - j) as base vectors.
■®
Notice that (i + j) and (i-j) are not unit vectors. (For example, if i and j
are perpendicular, neither (i+j) nor (i-j) is a unit vector.)
Similarly, in three dimensions, if non-coplanar base vectors i, j, к
are chosen, any vector d may be split into three components in the i, j, к
directions . ...
d = pi+q} + rk
and d may be represented in the alternative notation as
■Θ
relative to the base vectors i, j, k.
We shall now adopt the convention that i, j, к represent unit vectors
each one of which is perpendicular to the other two. Furthermore, positive
senses are determined by the following rule: if the thumb, first and second
fingers of the right hand are splayed out at right angles to one another, the
thumb points in the positive j direction, the first finger in the positive j
direction and the second finger in the positive к direction, i, j, к are then
said to form a right-handed orthogonal triple of unit vectors. (Orthogonal
means 'at right-angles to one another'.) Similarly, but more simply, one
may define a right-handed orthogonal pair of unit vectors i, j for a plane.
Example 1. If л = i + 2j + 3k, express л as a column vector
(i) with i, j, к as base vectors;
(ii) with i + j, i — к, к — 2j as base vectors.
(i) With i, j, к as base vectors,
|2|.
■(!)■
(ii) Write u = i + j, ν = i-k, w = k-2j.
Solving for i, j, к we have
i = i(2u+v+w); j = |(u-v-w); к = §(2u-2v+w).
Thus, a = i+2j + 3k
= (|u+iv+iw) + (fu-fv-|w) + (2u-2v + w)
= ^u-Jv + fw
46

1] UNIT VECTORS
and we have, with u, v, w as base vectors,
I
Ex. 1. Rewrite 3i + 4j-k and (i + j) - 2(j - 2k) in the column vector notation,
with i, j, к as base vectors.
Ex. 2. Express 3i-j-k as a column vector with (j + k), (k + i), (i + j) as base
vectors. Where do you use the Uniqueness Theorem in your solution?
Ex. 3. Evaluate a + b, 2a-3b and 3(a-2b) where
•(ΐ)
(·)■ ·■{:
3-Q
what are the values of χ and yl Explain how you make your deductions.
2. COORDINATES
Suppose we now have some geometrical configuration in a plane which we
want to describe algebraically. Let us choose any point О of the plane as
origin; then all the points of the plane may be specified by their position
vectors with respect to O. To give more detailed information, these position
vectors may be split into their components in the direction defined by the
right-handed pair of orthogonal unit vectors i, j.
Thus, if r = op
we may write г = xi+yj.
χ and у are called, respectively, the χ and у
coordinates of the point Ρ and are usually
written as (x, y). The lines through О in the i
and j directions are called the χ and у axes j ψ |
respectively, and may be denoted by Ox and Oy.
In Figure 3.1 the following points have been '
plotted:
A(l, 3); 5(3,-1); C(-l, -2); Z>(-3,0).
Note that the χ coordinate is always written first.

COORDINATES [3
The reader should note the distinction between (1,3) and I I: the
first gives the χ and у coordinates of the unique point A, while the second
denotes the vector (referred to i, j as base) representative of the class of
displacements of which OA is a typical member.
Coordinates may similarly be defined in three dimensions, though now
we require a right-handed triplet of orthogonal unit vectors i, j, к to
define the directions of the coordinate axes OX, ΟΥ, ΟΖ through the
origin O. Thus, if Ρ is the point (2,-1, - 3), the position vector of Ρ is
2i —j —3k. Note again that the coordinates are given in the order x, y, z.
Ex. 5. Draw a sketch to denote the approximate positions of the points:
/4(1,0), B(-2, -1), C(l, -3), Z>(-2,0), E(-l, -2).
Ex. 6. Write down the position vectors of the points
A(l, -2), 5(3,4).
Deduce the coordinates of the mid-point of AB. Can you state a general rule for
finding the coordinates of the mid-point of a line ?
Ex. 7. Write down the position vectors of the points
Ail, -1), «5,-5).
Use the Section Formula to deduce the coordinates of the two points of trisection
of AB.
Ex. 8. What are the coordinates of the mid-point of the line joining
ДО,-1,2) and 5(2,3,2)?
Ex. 9. If ABCD is a parallelogram, what relation must hold between the position
vectors a, b, c, d? Find the coordinates of the vertex D of the parallelogram
ABCD whose other vertices are given by
/4(1,2), 5(4,3), C(3,5).
Ex. 10. The coordinates of the points А, В, С are as follows:
/4(-l, 1, 2), 5(1, 0, -3), C(0, 2, 4).
Find the coordinates of the fourth vertex D of the parallelogram ABCD and
of the fourth vertex Ε of the parallelogram ACBE.
Ex. 11. Find the coordinates of the centroid of the triangle ABC of Ex. 9.
Ex. 12. A rectangular box ABCD A'B'C'D' has base ABCD and the edges AA',
BB', CC, DD' are all vertical. Μ, Ν, Ρ are the mid-points of ВС, СС and
A'B' respectively. AB has magnitude 3 units, ВС 2 units and CC 1 unit. If
A is taken as origin and unit vectors i, j, к are taken along AB, AD, AA'
respectively, find the position vector of the centroid of triangle MNP and deduce its
coordinates.
48

3] DISTANCES IN TERMS OF COORDINATES
3. DISTANCES IN TERMS OF COORDINATES
Before proceeding with our study of coordinates we shall develop an
economic notation whose value should be readily apparent to the reader.
Tt would be convenient to refer to all points by using, say, the letter P, to
all χ coordinates by x, to all у coordinates by у and to all ζ coordinates
by ζ but to do so without further clarification would clearly lead to
appalling confusion. We may, however, differentiate between the various points
and their coordinates by the use of suffixes : thus we may call two points Рг
and P2 and take as their coordinates (хг, у1г гг) and (x2, y2, z2) respectively.
One advantage of such a notation is obvious: without further explanation
we know that Pn stands for a point, that x„ is its χ coordinate and so on.
(The reader must guard against confusion between suffixes and indices:
x2 simply means the χ coordinate of the point P2, whereas x2 represents
the result of multiplying the number χ by itself.) The choice of a good
notation is often more than half the battle in the solution of a mathematical
problem; as he gains experience in its use, the reader will come to
appreciate the deeper significance of the suffix notation. One evident further
advantage may, however, be noted here: the use of suffixes effects a
considerable economy in notation if we have to deal with a large (or
indeterminate) number of points. (It should be noted that not all sets may be
enumerated, that is, counted in the form 1, 2, 3, .... In such cases, the
suffix notation as we have presented it breaks down. Consider, for example,
the problem of naming all the points on the line segment joining /4(0, 0)
and 5(1,0).)
Now consider a plane and two points Рг and P2 lying in it whose
coordinates relative to perpendicular axes through some origin О are (хъ уг)
and(x2,y2).
\'P^ dfi
o\
Fig. 3.2
From Figure 3.2 we have
P^ = P2Q + QP1
= (*!-**) i+(ft - Л) J·
Thus, applying Pythagoras's Theorem to the right-angled triangle Р2Рг Q,
ΛΑ = ν[(*ι-*2)2+ϋΊ-Λ)2]·
49

COORDINATES [3
For three-dimensional coordinates the same argument applies but
Pythagoras's Theorem must be applied twice. In Figure 3.3, the feet of the
perpendicular from Р1г P2 to the plane Oxy are Q1} Q2, and RQ1; Q2R
are respectively parallel to Oy and Ox. Thus we have
Ρ2ΡΧ = P2Q + QP1
= QaR + RQi + QPi
= (^ι-^ί + ϋΊ-^) J + Czi-^k
and so PXP2 = J[(Xl-x2y + (уг-у^Нъ~*№
on applying Pythagoras's Theorem to triangles PXP2Q and Q1Q2R.
Fig. 3.3
Example 2. (i) The distance between /4(3, — 1) and B(— 1, 2) is
V[(3- -l)2 + (-l -2)2] = 5.
(ii) The distance between /4(1, 0, -2) and B(-l, 2, -3) is
VK1- -l)3 + (0-2)2 + (-2--3)*] = 3.
Ex. 13. A, B, C, D, E, Fhave coordinates as follows:
Д-1, -2), Я(-3, 2); C(-l, -2, 4), D(0, -4, 3);
E(a, -a, 0), Д-А:, -A:, A:).
Find the lengths of AB, CD and EF.
Ex. 14. Show that ABCD is a rectangle, А, В, С, D having coordinates:
/4(1,2), Ж5,-1), C(8,3), Z>(4,6).
Ex. 15. What is the fourth vertex of the parallelogram ABCD where
A is (1,1, 2), Л(2, 0, -1), C(3, 3, 0) ?
Ex. 16. Find the circumcentre of the triangle
/4(0, -1, 1), B(\, 0, -1), C(-1,1, 0).
What is the circumradius ? (The circumcentre of a triangle is the centre of the
circle which passes through the three vertices.)
50

DISTANCES IN TERMS OF COORDINATES
Exercise 3(a)
. Evaluate as single column vectors:
(i)u)ii); (">2(i+3(J)^
Ч:)+4В; _ <w(i)+3(:i)+2(;);
MiHH-i)- _
ίπ -(Э'ЧЗ-е-В·
find, in column vector form, the vectors:
(i) 2a-b; (ii) a+b-3c; (iii) (3a + b)-2(b+3c).
3. If a, b, с are defined as in Question 2, solve for χ the equations:
(i) 2x = a-b; (ii) 3(x + a) = b-2c
giving your answers in column vector form.
4. Again using the notation of Question 2, solve for x, y, ζ the following systems
of linear equations, giving your answers in column vector form:
= 2a + 2b,
® Γ~, η к + 2Ь' ® {2x+y+z = a + 3b + 2c,
x + 2y = b-c;
l»-«-i·» - 4a-4c.
f x + y-z =
) Ьх+y+z =
lx-y-3z =
5. Two vectors p, q (where рФ4д) are given in a plane. Referred to p, q as
base, the vectors a, b, c, d are given by:
Express a, b, c, d as column vectors, taking p—2q, p + 2q as base.
6. Three non-coplanar vectors u, v, w are given. Referred to these vectors
as base vectors, the vectors, e, f, g, h are given by:
-0-г-0--0-к-4П-
Express e, f, g, h as column vectors taking ν+w, w + u, u + v as base vectors.
7. Write down the coordinates of the mid-points of the lines joining the following
pairs of points:
(i) (2, 4), (4, 6); (ii) ( - 2, 4), (4, 2); (iii) (-1, - 3), (2, - 4);
(iv) (3, 1, -5), (2, -4, 0); (v) (a + b, a, a-b), (a-b, -a, -a-b).
51

COORDINATES [3
8. Write down the coordinates of the two points of trisection of the lines joining
each of the following pairs of points:
(i) (1, 4), (4, 10); (ii) (-2, 1), (1, 8); (iii) (a+2b, 6-2a), (a-b, a + b);
(iv) (1, 2, -1), (2, 1, -5); (v) (a, 2a, b), (a-b, b-2a, a+b).
9. Find the lengths of each of the line segments of Question 7. (Leave your
answers in the form Jm.)
10. А(Ъ, 0), B(4, -1), C(6, 2) are vertices of a parallelogram ABCD. Find the
coordinates of D. Find also the coordinates of Ε if ACBE is a parallelogram.
11. Prove that the triangle whose vertices are A(-\, 2), B(3, 5), C(-4, 6) is
isosceles. What is its area?
12. А(Ъ, 1, -1), B(\, 2, -2), C(0, 0, 2) are vertices of a parallelogram ЛЯС£>.
Find the coordinates of D. What are the coordinates of the meet of the diagonals ?
13. Find the area of the triangle whose vertices are A(\, -3), B(2, 5), C(-4, -3).
14. Show that the line joining A(-1, 4, -3) and B(5, -8, 6) meets the χ axis.
If the point of intersection is C, find the ratio AC/CB.
4. COORDINATE GEOMETRY IN A PLANE;
STRAIGHT LINES AND THEIR GRADIENTS
Throughout this section we shall confine our attention to a plane, so that
only two coordinates are required to specify a point.
The reader will recall that the vector equation of a straight line is the
algebraic condition that must be satisfied by the position vector г of a
point Ρ if Ρ is to lie on the line. In the same way, the Cartesian equation
of a straight line is the algebraic condition that must be satisfied by the
coordinates x, y, of a point P(x, y) of the line (and, conversely, no point
Ρ not on the line has coordinates satisfying the equation).
The equations of lines parallel to the coordinate axes may readily be
obtained. For example, the reader should have no difficulty in seeing that
the line through (-3, 1) parallel to the χ axis is у = 1; similarly, the line
through this point parallel to the у axis is x + 3 = 0. In particular, the
equations of the χ and у axes are respectively у = 0 and χ = 0.
A straight line is determined completely by two points on it. Suppose
now that we wish to find the Cartesian equation of the straight line
through P-Sx-l, л) and P2(x2, y^, where xx =t= x2 and уг Ф y2 and so the
line is not parallel to the axes. Let the position vectors of Рг and P2 be тг
and r2 respectively. Then (see page 37) the vector equation of the line
where rx = Xii+Jij and r2 = х21+У21·
52

И COORDINATE GEOMETRY IN A PLANE
This may be rewritten in column vector notation (with i, j as base)
■©♦
\У2-У1/
By the Uniqueness Theorem in two dimensions this gives the equations
jx-x1 = λ(χ2-χ1),
Ь-У1 = КУл-уд-
Eliminating A between these two equations we obtain the following result.
The Cartesian equation of the line joining the points Рг{хъ уг) P2(x2, y2) is
^xx = Lzy1_
*2~*i У2-У1
(The reader must note carefully the distinction in this equation between
x, у on the one hand and xlt ylt x2, y2 on the other: x, у are the coordinates
of a general point Ρ on the line; xlt ylt x2, y2 are the coordinates of two
specified fixed points of the line. If we take U as the set of all points
(x, y) in the plane, the equation (1) is the defining condition for the set of
points comprising the line; that is, the set
I "' У-Уг yt-yii
The same remarks hold for the vector equation г = тх + А(г2 —rj), in which
г15 r2 are the position vectors of two given points.)
Equation (1) may be rewritten in the form
where η
The number m is called the gradient of the line. The sign of the gradient
tells us which way the line is sloping (see Figures 3.4 and 3.5). In case (i),
хг — х2 and уг—y2 have the same sign and m > 0; in case (ii) χΎ — χ2 and
У1—У2nave opposite signs and m < 0. In both cases, the positive value of
m is equal to tan a, where a is the acute angle made by the line with the
χ axis. Equation (2) gives the form for the straight line when a point on the
line and its gradient are known.f
t The reader who has already met trigonometric ratios of obtuse angles will realize
that if the straight line AB meets the xaxis at P, then the gradient of AB is tan ΔχΡΑ in
all cases, where ΔχΡΑ is measured in the positive (anticlockwise) sense from the χ axis
(see Chapter 6).

COORDINATES
о\ ^
О] _\
Equation (2) may be rewritten as
= mx + c.
(3)
Here the constant с represents the intercept cut off on the у axis, as may
easily be seen by setting χ = 0.
Having derived these various forms for the equation of a straight line,
we may easily deduce the result that any equation of degree 1 in the two
variables x, у represent a straight line (hence the word linear usually
applied to such equations).
Consider the general linear equation in the two variables x, y:
ax + by + c = 0.
If a = 0 or b = 0 this represents a line parallel to one of the axes. If
b Φ 0, the equation may be rewritten in the form
and comparison with (3) shows that this is the equation of a line with
gradient — ajb and making an intercept —cjb on the у axis.
Example 3. Find the equation of the line L^ through (2, — 1) with gradient
— \ and the equation of the line L2 joining the points (0, 7) and (-1, 4).
What are the coordinates of the intersection ofL^ andL^ ?
Draw a sketch showing the relative positions ofl^ andL2 and the
coordinate axes.
Lj has equation y+1 = —Цх-2),
i.e. x + 2y = 0,
54

4] COORDINATE GEOMETRY IN A PLANE
L2 has equation . _fl = TZPj'
i.e. 3x-j> + 7 = 0.
Since all points lying on A satisfy the x + 2y = 0 and all points lying on
L2 satisfy Ъх-у + Ί = 0, the point of intersection of the two lines must
satisfy both equations simultaneously.
Solving
we have
fx + 2y =
Ux-y =
χ = -2,
= 1
and so the point of intersection of Lj and
£,is(-2,l).
(In this case the sketching of the lines Lj
and Z^ is straightforward: we know two points
on L2, and Lj clearly passes through the origin.
In general, to sketch the position of a straight (—2,1)"
line whose equation is given, first find the ~~'
coordinates of the points where it cuts through ц
the coordinate axes.)
Ex. 17. Draw a sketch to show the positions of the straight lines:
(i) 3x-2y-6 = 0; (ii) 2x+y + 3 = 0; (in) 2x-5y = 0.
Ex. 18. Find the equations of the lines through (2, —3) with gradients:
(i) 2; (ii) -i.
Ex. 19. Find the equations of the lines joining the pairs of points:
(i) (2,-3), (3,2); (ii) (0,1), (2,-1);
(iii) (-3, -1), (-1, 2); (iv) (1, 2), (-3, 2).
Ex. 20. What are the gradients of the following lines:
(i) 2x-y-3 = 0; (ii) x+y+l = 0; (iii) 3x + 4y+2 = 0?
Suppose now we have two lines, L^ and L2, the gradients of which are
respectively m1 and m2.
(i) If Li and L2 are parallel, the angles that they make with the χ axis
are equal and
m1 = m2.
(ii) If Lj and L2 are perpendicular then either they are parallel to the
coordinate axes or they have gradients of opposite sign: in the second case,
55

COORDINATES [3
if m1 = + tan <xj and m2 = + tan oc^ then mxm2 = —tan ax tan a2 = — 1,
i.e.
WjWa = —1.
*Ex 21. With the notation above, prove that, if Wi = гщ, Lx and L2 are parallel.
*£x. 22. Again with the notation above, prove that, if т^пц. ——\,Ь^ and L^ are
perpendicular.
Example 4. iW ί/ге equations of the lines Ly, through (1, -1) parallel to
Ъх-у + Ί = 0 and L2, through (2, -3) perpendicular to 2x + 5y+\ = 0.
Find also the equation of the line L^ joining the origin to the intersection
ofLx andL2.
The gradient of 3x-y + 7 = 0 is +3 and so Lj has equations
y+l =3(x-l),
i.e. 3x-y-4 = 0.
The gradient of 2x + 5y+1 = 0 is —| and so the gradient of a
perpendicular line is +f. Thus, the equation of L2 is
y + 3 = i(x-2),
i.e. 5x-2j-16 = 0.
(With practice, the reader will soon be able to derive such equations
more rapidly than is done here. For example, in the second case, any line
perpendicular to 2x + 5y+l = 0 is clearly of the form 5x-2y = к and,
since the required line passes through (2, -3), к = 5 (2)-2 (-3) = 16.)
Now consider the equation
(3x-y-4) + A(5x-2y-l6) = 0,
where A is any number. This is a linear equation and so represents a
straight line; furthermore, the point common to Lj and L2 clearly satisfies
this equation. Thus
(3x-y-4) + A(5x-2y-l6) = 0
56

4] COORDINATE GEOMETRY IN A PLANE
represents a straight line through the intersection of Lj and L2. If it passes
through the origin, (0, 0),
-4 + A(-16) = 0,
A=-i
and the equation simplifies down to
lx-2y = 0
which is the required equation of L3.
Ex. 23. Find the equations of the lines Lu through (4, —10) parallel to Ъх—у = 0
and L2, through (1, 2) perpendicular to2x—y—\ = 0. What are the coordinates
of the intersection of Lx and L^ ?
Ex. 24. Find the line joining the point (—1, 2) to the meet of the two lines
x+3y-l = 0,x-4y + 2 = 0.
Ex. 25. Find the equation of the line through the meet of the two lines
3x-y+l = 0, 4x-3.y+2 = 0
perpendicular to the line Ъх-у-1 =0.
Ex. 26. Find the intercept cut off on the transversal 5x—2y+ 3 = 0 by the two
parallel lines 3x-y + 1 = 0 and 3x-y+4 = 0.
Exercise 3(b)
1. Find the equations of the lines through the stated points with the given
gradients:
(i) (0, -2), 2; (ii) (-1, -1), -2; (iii) (3, -1), -}; (iv) (a, b), -alb.
2. Find the equations of the lines joining the following pairs of points:
(i) (1, -2), (2, 1); (ii) (-2, -3), (-1, 4); (iii) (2, -1), (-1, -1);
(iv) (a, b), (2a, -Щ.
3. Write down the gradients of the following lines:
(i) 2x-3.y+l = 0; (ii) Зх+б.у+2 = 0; (iii) 5x-y + l = 0;
(iv) (a + b) x+(a-b) y+ab = 0.
4. Find the equations of the lines through (3, — 1) (a) parallel and (b)
perpendicular to:
(i) 2x-y+3 = 0; (ii) 5x+4.y + 3 = 0; (iii) x+7 = 0.
In case (i), what is the distance between the two parallel lines ?
5. Find the points of intersection of the following pairs of lines:
!4x+3y-6 = 0, lax+by + a2 = 0,
(i) I (ii) I
U-.y-5 = 0; (bx-ay+b2 = 0.
6. The two lines 3x- 5y-7 = 0, 4x+ 4y+5 = 0 meet at the point A. Find the
equations of the line through A
(i) which passes through the origin;
(ii) is parallel to 7χ - у + 2 = 0;
(iii) is perpendicular to 3x+5.y-l = 0.
57

COORDINATES [3
7. Find the orthocentre of the triangle whose vertices are (0, 1), (1, 2), (4, 3). (The
orthocentre is the meet of the altitudes of a triangle.)
8. The mid-points L, Μ, Ν of the sides, ВС, С А, АВ of a triangle ABC have
coordinates (2, 1), (3, -3), (4, -5). Find the coordinates of А, В, С
9. The coordinates of the vertices of a triangle are (6, 0), (-1,1) and (5, -7).
Find the coordinates of the centre of the circumcircle.
10. The coordinates of the vertices of a triangle are (1, -4), (3, -2) and
( — 11, 12). Find the coordinates of the centre of the circumcircle and
determine the coordinates of the points where this circle cuts the axes.
11. Show that, for all values of λ, the line whose equation is λχ+y = 1—2A
always passes through the point (-2, 1). What is the equation (in terms of A)
of the perpendicular line through (1, 2)? Show that, whatever the value of λ,
the intersection of these two lines always satisfies the equation
x2 + f + x-3y = 0.
What locus does this last equation represent?
12. О ABC is a rectangle in which О А = ЪОС. Μ is the mid-point of ОС and L
is the point of trisection of CB nearer C. OL, AM meet at X. By setting up axes
along О A and ОС and assigning suitable coordinates to the various points,
determine the ratio in which X divides OL.
13. Prove that, for all values of A, the line
(1-A)x + Ay = 3-7A
passes through a fixed point. What are the coordinates of the point?
14. Find the coordinates of the centre of the circumcircle of the triangle ABC,
where А, В, С have coordinates (-1,-3), (-2,-2), (5,5) respectively.
Prove that the point P(6, 4) lies on the circumcircle and prove further that the
feet of the perpendiculars from Ρ on to ВС, CA, AB are collinear.
15. Find the equation of the line joining A(a, 0) and B(0, b). A', B' are the feet
of the perpendiculars from A, B to a variable line through the origin. If A'P, B'P
are respectively parallel to the у and χ axes, what can be said about the position
ofP?
16. What are the equations of the reflections of the line 2x - Ъу+6 = 0 in
(i) the χ axis; (ii) the у axis;
(iii) the line lx-Ъу = 0; (iv) the line 2x-3y = 3?
5. COORDINATE GEOMETRY IN SPACE:
THE PLANE AND STRAIGHT LINE
The effective use of Cartesian coordinates in three dimensions requires
rather more vector technique than we have at present at our disposal and
so a more detailed study will be deferred until a later chapter (Chapter 11).
58

5] COORDINATE GEOMETRY IN SPACE
We shall content ourselves for the moment with demonstrating the general
form of the Cartesian equations of lines and planes in space.
We may define a plane as a set of points in space which
(i) contains at least three non-collinear points;
(ii) has the property that, given any two points Rx and R2 of the set,
all points of the line РгР2 belong to the set.
From this definition, three non-collinear points
clearly define a plane. If тг, г2, r3 are respectively the position vectors of
Pj, P2, P3, we have seen that the plane РгР2Р3 is (see Chapter 2.6)
r=r1+A(r2-r1)+H'3-'i)·
In column vector form this may be written
lX\ (Xl\ (Х*~ХА (χ3-χΑ
which, by the Uniqueness Theorem, yields three equations:
χ = Xl +λ(χ2 - Xl) +μ(χ3 - χ-,),
У = Ух + ЧУг ~ Уд +/*0>з - Уд,
ζ = ζί + λ{ζ2-ζί) + μ{ζ3-ζ1).
Solving the first two equations simultaneously we obtain A and μ as linear
expressions in χ and y. Hence, substituting these values of A and μ in the
third equation, we see that the Cartesian equation of a plane is linear in
x, y, z.
Now let us prove the converse result; a linear equation in x, y, ζ
represents the equation of a plane.
Consider the equation ax+by + cz+d = 0 where we shall suppose,
for simplicity, that α φ 0, b φ 0, с φ 0, d Φ 0. (The fact that one or several
of them may be zero requires a modification of the proof given below;
see Ex. 30.) The three points {-dja, 0, 0), (0, -djb.O), (0,0, -djc)
certainly all satisfy the equation and are non-collinear (since the first two
have zero ζ coordinate, for example). Thus condition (i) for a plane is
satisfied.
To demonstrate condition (ii), suppose Рг and P2 satisfy the given
equation. Then
(■t+by-i+cz-L + d = 0,
z2 + d = 0
(ax^by^cz-L + d
lax2 + by2 + cz2 + d

COORDINATES
[3
and thus, for any values of к and /
But, by the Section Formula,
_ lr1 + kr2 _ (Ιχ,+кхЛ (ly1+ky2\ (lz1 + kz2\
r- k+i -[ k+i )1+[ k+i )i+[ k+i ;k
is the position vector of a point on the line joining P1 and P2. Thus, the
coordinates of anypointon the line joiningPjPa satisfies the given equation.
In three dimensions two intersecting (that is, non-parallel) planes define
a straight line and we should therefore expect a line to be represented by
two linear equations in x, y, z.
The vector equation of the line through P, parallel to the vector
u = /i + wj + лк
is (see Chapter 2) г = тг + Au.
In column vector form this reads
(Η3+ΛΘ
and so, by the Uniqueness Theorem, we have three equations:
χ = Xj+Λ/, у = y-L + Xm, ζ = Zj + Ли
giving the equation of the line in the form
*Zii=Zza = ?Zli = A. (4)
I m η ν '
We use this form even if one or more of /, m, η are zero. For example,
the line through the point (1, -1, 2) in the direction of the vector 2i -3k
may be written as
x-l y+l z-2
x-l z-2 , , „
Τ" = Гз-' '+1=0·
Notice that the Cartesian coordinates of a second general point of the
line (4) may be expressed in terms of one variable as
(*! + A/, y1 + Am,z1 + An).
A is called a parameter for points of the line.
60

5]
COORDINATE GEOMETRY IN SPACE
The vector u defines the direction of the line and may be termed a
direction vector. Since u = /i + mj+ик, /, m, η are proportional to the
cosines of the angles that u makes with the directions i, j, k, that is, with the
directions of the coordinate axes. /, m, η are usually called direction
ratios for the line; if u is a unit vector, the constant of proportionality is
one and /, m, η are called direction cosines (see Figure 3.8, where / = cos дъ
m = cos θ2, η = cos θ3).
Fig. 3.
Example 5. Find the equation of the plane through /4(1,1,1), 5(0, 1, -2),
С (0,0, -1) and the point of intersection of this plane with the line
x-3 =y-2 = z-2
1 -2 3"
Let the equation of the required plane be
ax+by + cz + d = 0.
Since the coordinates of А, В, С all satisfy this equation,
(a + b + c =-d,
b-2c =-d,
which gives a = —3d, b = d, с = d.
The equation of the plane ABC is thus
Ъх-y-z-l = 0.
Any point of the given line has coordinates
(3+A, 2-2A, 2 + 3A).
This lies on the plane if
3(3+A)-(2-2A)-(2 + 3A)-l = 0
giving A = - 2
and so the point of intersection has coordinates
(1, 6, -4).

COORDINATES [3
Example 6. Two straight lines are given parametrically by
L1:x=l-4A, у = l+λ, ζ = l+λ
and L2: x = 2μ, у = Ι —μ, ζ = 2 + μ.
Prove that L± andL2 intersect, and find the equation of the plane containing
the two lines.
If we can choose Α, μ such that the three equations
1-4Λ = 2μ,
1+Λ = l-μ,
l+λ=2+μ
are simultaneously satisfied, then L·^ and L2 intersect. By inspection,
^ = i> /* = — 2 fulfils the required condition (giving the point of intersec-
tion(-l, if)).
Eliminating λ (a) between the χ and у coordinates of a general point of
Li, and (b) between the у and ζ coordinates, we see that the two planes
x + 4y = 5, y-z = 0
both contain Ly. It follows that, for all values of k, the equation
(x + 4y-5) + k(y-z) = 0 (1)
represents a plane containing L·^.
By setting μ = 0, we see that the point (0, 1, 2) lies on L2 (but not on
Ц). This lies on the plane (1) if
(0 + 4-5)+A:(l-2) = 0,
i.e. k=-l.
The equation of the plane containing L·^ and L2 is thus seen to be
x + 4y-5-l(y-z) = 0,
i.e. x + 3y + z-5 = 0.
Ex. 27. What is the equation of the plane
(i) containing Ox and Oy\
(ii) through (1, -1, 2) perpendicular to Ozl
Ex. 28. Describe the position of the plane ax+by = 0.
Ex. 29. Why was condition (i) included in the definition of a plane?
*Ex. 30. Show that three non-collinear points may be found on
ax + by + cz + d = 0
62

5] COORDINATE GEOMETRY IN SPACE
in the following cases:
(i) d = 0, α φ 0, b φ 0, с Φ 0; (ii) d φ 0, a = b = 0, с Ф 0;
(iii) rf = α = 0, b φ 0, с φ 0; (iv) a = 0, b Φ 0, с φ 0, d ΦΟ.
Ex.31. Find the equation of the plane through the line of intersection of the
planes x-y + 2z+l =0, 2x+y-z + 2 = 0 which contains the origin.
Ex. 32. What are the coordinates of a general point of the line
x-3 _y + 2 _ζ + 5Ί
2 -1 ~ -3'
Where does this line cut the plane x+y + z + 2 = 0?
Ex. 33. What are the equations of the line joining (-3, 1, 1) and (2, 2, -1)?
Ex. 34. Find the equation of the plane containing the origin and the line
x-2 = ^-3 = z+4
2 3 -Γ
Exercise 3(c)
1. Find (a) direction ratios, (6) the direction cosines, for the lines joining the
following pairs of points:
(i) (1,2,3), (2, 3,4); (ii) (2,-1, 3), (1, 1, 2);
(iii) (3,1, 2), (5, -1, 1); (iv) (1, 3, 5), (5, 3, 1);
(v) (2, -3,-4), (-3,2,1);
(vi) (a + λ, 2a + 2λ, За + 3λ), (я-μ, 2α-2μ, 3α-3/ί);
(νϋ) (α2, αί>; δ2), (α2, ас, с2); (viii) (1, я, α2), (α2, α, 1).
2. Find the equations of the planes through the following sets of points:
(i) (1,0,0), (0,1,0), (0,0, 1);
(ii) (3,2,0),(0,3, -1), (1,0, -2);
(iii) (2,0,0),(1,1, -1), (6,-5,3);
(iv) (2,2,0),(-l, 1,-4), (1, 1,-1);
(v) (-1,3,1),(1,-3,-3), (3,-1,5);
(vi) (1,2,3), (2, -2, 8), (-1,2, -7).
3. Find the equations of the lines joining the following pairs of points:
(i) (0, 0, 0), (2, 1, - 3); (ii) (1,2,-1), (2, 3, - 3);
(iii) (1, 1, -1), (-1, 4, 2); (iv) (-2, 1, 3), (2, 1, -3);
(v) (1, 4, 2), (1, -1, 2); (vi) (a, 2a, 3a), (-a, a, 2a).
4. Find the equation of the plane through the line of intersection of the planes
x+y-3z = 2,2x-y-z = 1
(i) containing the origin;
(ii) containing the point (1, 1, —1);
(iii) parallel to the x axis.
5. Find the equation of the plane through the line of intersection of the planes
2x + 2y + 3z-l = 0, 3x-y-z+2 = 0
(i) containing the origin;
(ii) containing the point (1, 1, 1);
(iii) parallel to the ζ axis.
63

COORDINATES [3
о the χ axis which contains
x-l = y-3 = z+l
-1 2 ~ -3"
Find the equation of the plane containing this line and the origin.
7. Find the equation of the plane through the point (1, 1, - 5) containing the
ИШ x+l_y + 5_ ζ
8. Write down, in terms of a parameter λ, the coordinates of a general point of
the line
Where does this line cut the plane x-2y-3z-2 = 0?
9. Find the coordinates of the point where the line joining (2, 3, 1) and (4, 7, 3)
cuts the plane 2x+y-z-3 = 0.
10. Find the equations of the line joining (2,1, 1) and (1, -1, 2). Where does this
line meet the plane x—y + az = 0?
Is your result true for all a? Explain.
11. Determine the direction cosines of the line of intersection of the planes
2x-y + z-9 = 0, 4x+y + 2z-6 = 0.
Where does this line meet the plane x+y + z = 0?
12. Prove that the two lines
x-l _y+l _z + 2
1 _ 2 _ 3 '
x + l ^y z-l
1 1 1 '
are skew.
13. Prove that the two lines
x-2 = y + 4 = z-l
3 ~ 1 ~ -1
. x + l y + 4 z + 2
and ^=-Γ=^2
meet. What are the coordinates of their point of intersection? Find the equation
of the plane containing them.
14. Prove that the fr
meet. What are the coordinates of their point of intersection? Find the equation
of the plane containing them.

5] COORDINATE GEOMETRY IN SPACE
15. Find the coordinates of the point common to the three planes:
x-3y + z-6 = 0,
2x-y + 2z-2 = 0,
3x + 2y + z+2 = 0.
16. Find the coordinates of the point common to the three planes:
x-2y + z-7 = 0,
2x + 3y-4z + 26 = 0,
3x+y + 3z+7 = 0.
17. Find the coordinates of the point where the line determined by the two planes
x-3y-z+8 = 0,
x-y+z-2=0
cuts the plane containing the χ and у axes.
Miscellaneous Exercise 3
1. Find the area of the trapezium whose vertices are:
(х2,Уд, (*з,.Уз), (*2,0), (x3,0).
Deduce that the area of the triangle Pi(*b уд, А(*2, Уг), А(*з> Уз) is
ЦХ2У2-Х2У2 + Х3У1- Х1У2 + Х1У2- Х2У1)·
What is the condition that the three points РгР2Рз should be collinear ?
2. ABCD is a rectangle with AB = 2AD. Ε is the mid-point of AB and F is
the mid-point of ВС. СЕ, AF meet at X. By setting up a suitable coordinate
system, prove that DXB is a straight line and determine the ratio DX/XB.
What is the area of the quadrilateral AXCD1
3. ABCD is a rectangle and points P, Q are taken on AB, AD respectively.
The rectangle APRQ is completed. If the lines BQ, DP meet at X, prove that
X,.R, С lie On a straight line.
4. Find, in terms of a parameter w, representing the gradient, the equation of
a variable straight line through the fixed point (a, b).
If this line cuts the χ and у axes at A and В respectively, and the parallelogram
OABP is completed, prove that, whatever the value of m, the point Ρ lies on the
curve
xy = xb — ay.
Sketch this curve for the case a = b = 1 for values of χ > — 1.
5. Show that, by a suitable choice of coordinate system, the equations of two
coplanar non-parallel lines may be taken in the form

COORDINATES [3
Two fixed lines О A, OB are drawn and a variable line, passing through the
fixed point C, cuts О A, OB at Ρ and Q respectively. If the feet of the
perpendiculars from P, Q to OB, О A are U, V, prove that t/Kpasses through a fixed
point.
6. Two fixed straight lines, Lx and L2, meet at O. Through a fixed point A two
lines АРгР2 and AQXQ2 axe. drawn to cut Lx at Рг and Qu and L2 at P2 and Q,_.
Prove that whatever the position of the two lines drawn through A, the point of
intersection of P1Q2 and Ρ2βι lies on a fixed straight line through O.
7. Points Pu P2, P3 are taken on the χ axis, and Qu Q2, β3 on the у axis. Lj
is the point of intersection of P2 β3 and P3 β2,1,г of P3 Qj and Рг β3, L3 of Pj β2
and P2 Qj. Prove that L1; L2, L3 lie on a straight line.
(This is a particular case of Pappus's Theorem, which holds more generally
for two sets of three collinear points on any two straight lines.)
8. Prove that the lines χ+1 j z_3
A x-\ у-ъ z-г
are skew.
By considering the family of planes through one of the lines, prove that
there is just one common transversal to the two lines which passes through the
origin and find its equation.
9. Prove that no three of the points (1, 1, 2), (-2, - 6, 3), (- 1, 1, 5), (2,4, 2) are
collinear but that all four points lie on a plane.
10. Explain why the three planes
x-z+l = 0,
x-y-z+5 = 0,
x + 2y-z + 2 = 0
have no common point.
What can you say about the intersections of the planes:
11. ABCDA'B'C'D' is a cube, with square faces ABCD, A'B'C'D' and vertical
edges AA' etc. Μ is the mid-point of CD'. The plane AB'M cuts BD' at X
and CC produced at Y. Find the ratios BX: XD' and CC'.C'Y.
12. Prove that the three lines χ ν ζ
Li:I = 2 = 3'
Li' 2 -3-I'
r x+l у ζ

5] MISCELLANEOUS EXERCISE 3
If general points of Lt and L2 have parameters λ and μ respectively, find the
condition that the line joining the point On Lt with parameter λ to the point
on L2 with parameter μ should intersect L3.
Deduce that a unique common transversal to the three lines may be drawn
through a given point on Lv
13. In the tetrahedron ABCD, each of the faces ABC, ABD, ACD has a right
angle at A, and AB = AC = AD. X is the point on AD such that AX = 2XD
and Υ is the point on ВС such that С Υ = 2YB. The mid-point of A Y is Z. Prove
that the mid-point of DZ lies on the plane BCX.
14. A is the fixed point (a,0,0) and variable points β (Ο, λ, 0), R(0, Ο,μ)
(λ, μ positive) are taken on the у and ζ axes such that the plane AQR passes
through the fixed point Β (α, β, γ). Prove that
(α—α) λμ + αβμ + αγλ = 0.
Find the value of λ that makes the triangle OQR isosceles.
15. Explain why the equation
x2+y2 + z2 = 1
represents the surface of a sphere, centre the origin.
Find the equations of the planes through the point (2, 0, 0) which are tangential
to the sphere and parallel to the у axis.
67

4· Polynomials
1. POLYNOMIALS
Expression of the form xs_3x2 + 4x+2
and x?-x+l
are called polynomials. More generally, a polynomial of degree η is an
expression of the form
a0xn + a1xn-1 + a2xn-i + ...+an_1x+an (a0 φ 0)
involving only multiples of positive integral powers of χ and a constant
term, an. A polynomial of degree η is completely determined if its (n +1)
coefficients (including the constant term or coefficient of x°) are given and
two polynomials of degree η are said to be identically equal if the coefficient
of xr in each polynomial is the same for all values of r, and conversely.
For example, writing = to mean 'identically equal to'
ax3-x-2 = 2x3-bx2-x + c
о a = 2, 6 = 0, с =-2.
Ex. 1. If ax2 + bx + c ξ Ъх2 + lax + b, find a, b and с
A convenient shorthand is to write a polynomial as P(x); for example,
if we are considering the polynomial x3 — Ъх— 2 we could write 'Let
P(x) = х^-Ъх-Т
and subsequently refer to this polynomial simply as P(x). By P(l) we
mean the value of the polynomial when χ = 1; that is, the numerical value
attained by the polynomial when 1 is substituted for x. In the case just
quoted, P(l) = 1-3-2 = -4andP(-2) = -8 + 6-2= -4.
Ex. 2. If P(x) = xi-3x3 + 4x2-x-1, show that Д1) = 0 and find P(-1), P(0)
and P(2).
Ex. 3. Show that, if P(x) ξ Q(x), then P(k) = Q(k) for all values of k. The
converse of this proposition is also true: if P{k) = Q{k) for all values of k,
then the coefficients of xr in P(x) and Q(x) are the same for all r (see Ex. 7).
Either statement may be taken as the definition of 'identically equal to'.

1] POLYNOMIALS
A non-zero number may be regarded as a polynomial of degree zero,
or a constant polynomial. The number zero may be regarded as the zero
polynomial.^
No ambiguity will arise if we write the zero polynomial as 0. A
polynomial is identically equal to the zero polynomial if all its coefficients are
zero, and conversely.
Ex. 4. If (я-1) л^ + Сд+б) x+(a + b + c) = 0, find the values of a, b, с
Example 1. IfP{x) = ax2 + bx + c, distinguish between the identity P(x) = 0
and the equation P(x) = 0.
P(x) = 0 means that P(x) is identically equal to the zero polynomial and
so we deduce that a = b = с = 0. Thus, whatever value χ may have, say
χ = k, then P(k) = 0.
P(x) = 0 is a statement which holds only for certain values of x. In fact,
provided b" > 4ac and я Ф 0,
pr-b + J(b*-4ac)l = pr-b-J(b>-4ac)l = Q
and P(x) φ 0 for any other value of x.
Exercise 4(a)
1. If P(x) = x3 + 5x2 + 3x +1; find P(l), P(-1), Ρφ, Ρ(3).
2. If (x-2) (x-3) (x-4) = axt + bxz + cx + d, find a, b, c, d.
3. Prove that (x-af = х3-3ах2 + 3а>х-а3. For what real values of χ does
(х-аУ=-а31
4. If а(х-2У + Ь(х-2) + с = 3x2-8x-l, find a, b, с
5. Express 7x3-;c2 + 3x-4 in the form a(x- l)s + b(x- l)2 + c(x- l) + d.
6. Express 4x3 + 12л:2 + 6x in the form a(x+1)3 + b(x+ l)2 + c(x+ 1) + d.
7. Express 3λ-3 + 2λ^- Их- 10 in the form a(x+ 1)3 + Z>(x+ l)2 + c(x+ l) + rfand
also in the form а(л--2)3+/0(л--2)2 + 7(л--2) + 5.
Find the three roots of the cubic equation 3x3 + 2x2 - 1 Ix-10 = 0.
8. Express x3 + 4kx2 + 3k2x - ks in the form a(x + k)s + b(x+k)2 + c(x+k) + d.
9. If (ax2+bx + c) (x+l) = 0, prove that ax2+bx+c = 0.
t We shall not associate a degree with the zero polynomial. The reader will appreciate
that, by elementary algebra, the product of a polynomial of degree m by a polynomial
of degree и is a polynomial of degree m+n. This result holds good even if one or both
of the polynomials is a constant polynomial, but fails if one of the polynomials is the
zero polynomial if a finite degree is associated with this polynomial.
69

POLYNOMIALS [4
10. If (ax+b) (ex+ d) = (ax + c) (bx + d) prove that:
(i) if α φ 0, then Ъ = с; (ii) if b φ с, then α = d = 0.
11. If (х-с)3 = x3 + ax2 + bx-27, find α, b, с.
12. If (x-a) (x-b) (х-с) ξ (x + oi) (χ+β) (χ+γ), prove that
{a,b,c}= {-α,-β,-y}.
2. THE FACTOR AND REMAINDER THEOREMS
If P(x) = {x-a) Q(x), where P(x) is a polynomial of degree η and Q(x) is a
polynomial of degree л— 1, P(x) is said to have a linear factor. Conversely,
if P(x) has a linear factor, then P(x) may be expressed in the form
P(x) = (x-a)Q(x).
For example, 2χ3 + 5χ2_3χ_ ш s (jc + 2) (2ж» + * - 5)
and so the cubic polynomial 2x3 + 5x2-3x-10 has a linear factor (x + 2).
(It also has a quadratic factor (2x2 + x-5).)
It is important to realise at the outset that possession of a factor is not
an absolute property of a polynomial but depends on the restrictions we
place upon the coefficients. For example, if we restrict our polynomials
to have only rational numbers as coefficients, the polynomial x2-2
has no linear factors; on the other hand, if we allow our coefficients to be
real numbers, x2 — 2 = (x-J2)(x + J2). (This is often expressed by the
statement: 'x2-2 is irreducible over the rational field but reducible over
the real field'. The word 'field' has a technical meaning and will be defined
in Volume 2; all that is necessary at the moment is to realize that ' over
the rational field' means that all polynomials under discussion must have
rational numbers as coefficients.)
The theorems we are about to prove do not depend upon whether we
choose the rational field or the real field for our coefficients provided, of
course, that we are consistent. We shall not, therefore, allude to the field
from which the coefficients are drawn, but the reader may, if he wishes
interpret all the coefficients as, say, rational numbers.
It is easy to see that, if P(x) has a factor (x-a), then P(a) = 0. For
P(x) has a factor (x-a)
ο Ρ(χ) = (χ-*) Q(x)
* P(a) = 0.Q(a)
* P(<x) = 0.
A converse of this result is known as the Factor Theorem.
70

2] FACTOR AND REMAINDER THEOREMS
Theorem 4.1 (The Factor Theorem). IfP(x) is a polynomial and ifP(a) = 0,
thenP(x) has a linear factor x—a.
Proof. First observe that
xr - ar = (χ - α) (χ*-1 + αχτ~2 + ofixT~* +...+ α'-1).
Suppose P(x) = b0xn + Ьг х71-1 + b2 xn~2 +...+ bn_^ χ + bn,
Then Ρ(α) = Ь^ + Ь^-^ + Ь^а.71-2^- ...+bn_1a. + bn
and so
P(x) - P(cc) = b0(x" - a») + b^x"-1 - a""1) +... + b^x - <x)
= (x-<x)[b0(xn-1 + ...+acn-1) + b1(xn-i + ...+ccn-i)+...+bn„1].
But P(a) = 0 thus we have
P(x) s (x_a) Q{x)
and therefore P(x) has a linear factor (x-cc).
Consider now the following division process:
х*-2х-Ъ
x-2)~x3-4x2 + x+2
x3-2x2
-2x2 + x
-2x* + 4x
-Ъх+2
-Ъх+6
-4
In elementary algebra the result of this process would be described by
saying that, if x?-4x2 + x+2 is divided by x—2, the quotient is х2 — 2х — Ъ
and the remainder is — 4. There are advantages, however, in restating the
process as follows: χ3 — 4χ2+ χ + 2 is identically equal to the product of
(x-2) by the quotient (x2-2x-3), plus the remainder -4. In symbols
xs-4x2+x + 2 = (x-2)(x2-2x-3)-4. (1)
This identity holds for all values of x, whereas the division process given
above is true for all values of χ other than χ = 2.
Ex. 5. Check that χ = 2 does indeed satisfy (1).
We now prove our second result, the Remainder Theorem, which
generalizes the ideas outlined above.
71

POLYNOMIALS
Theorem 4.2 (The Remainder Theorem). IfP{x) is any polynomial oft
η ^ 1, and x — a is any linear polynomial, then P(x) may be expressed in the
form
P{x) = {x-a)Q{x)+R
where Q(x) is a polynomial of degree n— 1 and R = P{o).
Proof. As in Theorem 1, we have
P(x)-P(<x) = (x-ai) Q(x)
and the result follows immediately.
Note (i). The Remainder Theorem is often stated in the following
form: if a polynomial P(x) is divided by x-a, then the remainder is P{a).
(ii) Strictly speaking, we should prove that the expression given above
for P{x) is unique in the sense that, if P(x) = {x-a) Qi{x) + Ri, then
Qi{x) = Q{x) an^ Ri = R. The proof of this result should be supplied by
the reader.
Example 2. Find the remainder when x3 - 5x2 - χ + 2 is divided by (i) лг— 1;
{и) x + 2; (iii)2x+l.
Write P{x) = x*-5x*-x + 2.
{i)P{x)^{x-l)Q1{x) + R1;
putx=l: J?! = P(1) = 1-5-1+2 = -3;
(ii) P{x) = {x + 2) Q2{x)+R2;
put л; =-2: R2 = P{-2) = -8-20 + 2 + 2 ξ -24;
(iii) P{x) = {2x+l)Q3{x) + R3;
putx=-l: R3 = P{-\) =-i-i + i + 2 = |.
Exercise 4{b)
1. Find the remainder when:
(i) x3-x + 2 is divided by χ-4;
(ii) 3x3-5x2 + x + 2is divided by x-2;
(iii) 4x4-2x2 + x-3 is divided by x+1;
(iv) x5 — x— 1 is divided by χ + 3;
(ν) 4x3-5x2 + 2isdividedby2x-l;
(vi) 4x3 - 8x + lis divided by 2x+l.
2. Show that χ — 2 is a factor of the polynomial x3—4x2 + χ+6 and hence factor-
ize the expression completely.

2] FACTOR AND REMAINDER THEOREMS
3. Factorize the following polynomials over the rational field as far as possible:
(i) 2x3 + 7x2-5x-4; (ii) 12x3 + 5x2-19x-12;
(iii) 2x3 + 7x2-17x-10; (iv) x3 + 3x2 - 2x - 6;
(v) x3-x2-x-2; (vi) x3-7x2 + 7x + 15;
(vii) x3 + 2x2-7x-2; (viii) 8x3 + 12x2-2x-3.
4. Factorize the polynomials given in Question 3 over the real field as far as
possible.
5. If Xs - 5x2 + 7x - a has a factor χ - 2, find a.
6. If 2x3+ax2-5x-1 is divisible by 2x+1, find a.
7. If xs + Ъх2 + ax-l leaves a remainder of 3 on division by (2x +1), find a.
8. If 3λγ> + ολ:2+Ζ>λ:-2 is divisible by both x + 2 and 3x+1, find α and 6.
9. If αχ3 + 3x2 + 6x - 3 is divisible by both x-1 and 2x + 3, find a and 6.
10. By expressing x4 +1 in the form x4 + 2x2 +1 - 2x2, show that x4 +1 is
reducible over the real field.
11. Factorize x4 + 3x3 -15x2 + 9x + 2: (i) over the rational field; (ii) over the real
field.
12. ax4 + 2x3-4x2-2x + Z> has factors (x-1) and (x-2), find a and b and
factorize the expression completely.
3. THE FACTOR THEOREM (CONTINUED)
We now show how the Factor Theorem may be extended for cubic
polynomials and, in particular, how this extended result may be used to derive
certain algebraic identities. The results we prove are valid for the general
polynomial (substituting η for 3 in their enunciation) but proofs for this
require the use of mathematical induction (see Chapter 9).
Theorem 4.3. If P(x) is a cubic polynomial with leading term a0x3, and
ifP{a) = Ρ(β) = Ρ(γ) = 0 for 3 unequal numbers α, β, γ, then
Ρ{χ) = αϋ{χ-α){χ-β){χ-γ).
Proof. Since P(a) = 0, P(x) = (x-a.) Q^x), where Q^x) is a polynomial
of degree 2.
Thus, since Ρ(β) = 0, 0 = (/?-«) &(/?). But/?-α Φ 0 and so &(/?) = 0.
6i(x) therefore has a factor (χ-β).
gj(x) = (χ-β) Q2(x), where Q2(x) is a polynomial of degree 1.
By the same argument,
Q2(x) = (x — y) бз(х), where Q3(x) is a polynomial of degree 0, that is,
Q3(x) = k, where к is some number. Substituting,
Ρ(*)ΞΑ:(χ-α)(χ-/?)(χ-7).
73

POLYNOMIALS [4
Since these two polynomials are identically equal, the coefficient of x3
must be the same in each, and so к = a0 and the proof is complete.
Corollary (The Identity Theorem). If P(x) is a cubic polynomial and if
4 unequal numbers α, β, γ, δ can be found such that
P(<x) = Ρ(β) = Ρ(γ) = P(S) = 0, then P(x) = 0.
Proof By Theorem 4.3,
Ρ(χ) = αϋ(χ-χ)(χ-β)(χ-Ύ).
But P(S) = 0 and so
0 = α0(δ-*)(δ-β)(δ-Ύ).
Now δ Φ α, δ Φ β, δ Φ γ and it follows that α0 = 0. P(x) thus reduces to
a quadratic polynomial axx2 +a2x + a3 which vanishes for 4 distinct values
of x. Successive repetitions of the argument show that аг = α2 = ^з = 0
and the corollary is proved.
Ex. 6. Prove that, if P(x) is a quadratic polynomial and if 3 unequal numbers a,
A, 7 can be found such that P(a) = Ρ(β) = P{y) = 0, then Дх) = О.
Ex. 7. If P(x) and β(χ) are cubic polynomials, and if unequal numbers α, β, γ, 5
сап be found such that P(a) = Q(a), ...,Ρ(β) = Q(S), prove that P(x) = Q(x).
Example 3. Establish the identity
x2-a2 , x2-b2 , x2-c2
(a-b) (a-cy (b-c) (b-a) (c-a) (c-b)
Write „2 „2 „2 L2
P(*)
■1=0 (α Φ ft Φ c).
x2-c2
(a-b) (a-c) (b-c) (Ъ-аУ(с-а) (с-Ъу
Ъ.а±
b+b-
P(a) = 0+^h+^ + l
-b + c
b-c +
Similarly P(b) = P(c) = 0.
But P(x) is a quadratic polynomial and so must vanish identically.
Exercise 4(c)
1. Prove that
(*-6) (x-c) (x-c) (x_-a) (x-a) (x-b)
\a-b)(a-c) (b-c)\b-a) (c-a) (c-b)

3] THE FACTOR THEOREM
2. Prove that
(x-b)(x-c)(x-d) (x-c)(x-d)(x-a) (x-d)(x-a)(x-b)
(a-b)(a-c)(a-d)+(b-c) (b-d)(b-a) + (c-d)(c-a) (c-b)
(x-a)(x-b)(x-c) =
(d-a)(d-b)(d-c) l-V-
3. Prove that
a(x-b) (x-c) b(x-c) (x-a) c(x-a) (x-b) _
7fl=Wla-c) (b-c)(b-a) + (c-e)(c-6) ~* =
4. Prove that
xjx-a) xjx-b) x(x-c) x_ =
bcic-a) (a-b) ca(a-b)(b-c) + ab(b- с) (с - я) abc ~
5. By considering the quadratic polynomial ax* + bx + c, show that, if я ф 0,
" + land a + b + c=0,
an2 + bn + c = 0,
a + bn + cn2 = 0,
then a = b = с = 0.
6. Prove that the cubic polynomial P(x) defined by
= ajx-β) (*-y) (x-δ) b(x-a) (χ-γ) jx-S) c(x-a) (χ-β) (χ-δ)
K ' - (α-β) (α-γ) (α-δ) (β-α) (β-γ) (β-δ) + (γ-α) (y-β) (γ-δ)
rf(x-g)(*-fl(*-r)
"*" (δ-α)(δ-β)(δ-γ)
is the unique cubic polynomial such that Ρ(μ) = α, Ρ(β) = b, Ρ(γ) = c, Ρ(δ) = d.
Find the cubic polynomial P(x) such that
P(-l) =-з, p(0) = 2, P(l) = 5, P(2) = 12.
7. Find the cubic polynomial P(x) such that
ДО) = 1, Д1) = - 1, P(2) = -1, P(3) = 13.
8. Find the cubic polynomial P(x) which takes the value - 5 when χ has the
values - 1, 0, 1 and is such that P(2) = 1.
9. State and prove the identity theorem for quartic polynomials.
Miscellaneous Exercise 4
1. We shall say that a polynomial P(x) 'has the property R' if the remainder on
division of P(x) by the fixed factor x-a is it. Prove that
(i) if Рг(х) and P2(x) both have the property R, so also has the polynomial
P(x) where P(x) = P^+P^x);
(ii) if P3O) has the property R and P4(x) has the property S, show that the
polynomial Q(x) = P3(x) P4(x) has the property RS.
What can be said about the polynomial P(x) if it has the property 0?
75

POLYNOMIALS
2. Prove that, if P(x) is any polynomial of degree η > 2, then P(x) may be
expressed in the form „, . , . , ,. л, . „
P(x) = (x-a) (x-Z>) Q(x) + Rx + S,
where it, 5 are numbers and Q(x) is a polynomial of degree n — 2 (all numbers
and coefficients taken from the same field—say the rationals).
Express 5x3 + 9x2-8x-ll in the form
(x-l) (x + 2) Q(x) + Rx + S
and show you can check your values for R and S by putting χ = 1 and χ = - 2.
3. Find the remainder on dividing the following polynomials by x2 — x — 6:
(i) 7x3-x2-30x-30; (ii) x*-l.
4. The remainder when the polynomial P(x) is divided by (x — d) (x-b), where
α φ 6, is expressed in the form A(x — a)+B(x—b). Find explicit values for A
and Я.
Prove that, if identically equal remainders are obtained on division by
(x-a) (x-b) and (x-a) (x-c),
then (b - с) P(a) + (c-a) P(b) + (a-b) P(c) = 0.
What can you deduce about the remainder when P(x) is divided by
(*-6)(*-c)?
5. By using the Remainder Theorem or otherwise, find the factors of
α2φ + с) + Ъ\с + α) + с\а + b) + labc.
Hence or otherwise show that, if a + b + c = 0, then
я3 + 63 + с3 = Здбс.
Find (i) the factors of
(/> + 2<? - 3/·)3 + (-3;?+<? + 2/·)3 + (2/7-3<? +/-)3,
(ii) the prime factors of „„„ „„, „„ ,_ , ., ч
733-473-263. (Cambridge)
6. If ox2 + 2bx + с can be written in the form
Α(.χ-<χ)* + Β(χ-β)\
prove that ааР + Ь(ас+Р) +с = 0.
Is it true that, if numbers <x, β can be found such that
α<χβ + Ηα+β) + ο = 0
then ax2 + 2bx + ccan be written in the form Α(χ-ά)2+Β{χ-β)21
7. Prove that, if lx2 + mx + n is equal to zero for three distinct values of x,
then/ = m = η = 0.
Hence, or otherwise, prove the identity
a2(x-b) (x-c) b*(x-c) (x-a) c\x-a) (x-b) _ 2
(a-b)(a-c) + (b-c)(b-a) + (c-a)(c-b) ~ '

3] MISCELLANEOUS EXERCISE 4
Prove also that
(x + af (x + ЬУ (x + c)3 _ „
(а-Ъ)(а-сУ(Ъ-с)ф-а)+(с-а)(с-Ъ) = 3* + й+6 + с·
(O&Q
8. Factorize:
(i) bc(b-c) + ca(c-a) + ab(a-b);
(ii) a{b-c)2 + b(c-a)2 + c{a-bY + %abc;
(iii) а(6-с)3 + 6(с-а)3 + ф-6)3-
9. Given that p, q, r are distinct values of χ which satisfy the equation
— + Л + — =L
χ—α χ—/3 χ—γ
prove that, for all values of χ other than χ = α, χ = ^, χ = γ
_£_ + _*_+_£_ = ι _ (*~Ж*-д) (*->-)
*_α+*-/9 *-r~ (χ-α)(χ-/?)(χ-7)·
By considering the value ofP(x) ξ (χ—ρ) (x—q) (x—/·) for different values of x,
prove that, if a, b, с are all positive and a. < β < y, then/?, q, r lie one in each of
thesets {;c еД: α < * </?}, {χ e R-,β < χ < γ}, {χ e R: χ > γ}.
Prove alsi
id that
э that . w w .
a (α-#(α-γ)
β δ с (p-q)(p-r)
{ρ-αγ (ρ-β)* (p-r)1 (ρ-α)(ρ-β)(ρ-γ)'
(Ο & С adapted)
10. When divided by (x2 + x+l), a cubic polynomial F(x) leaves a remainder
(2x+3). When F(x) is divided by x(x + 3), the remainder is 5(x + 1). Find F(x).
(Cambridge)
11. /(x), g(x) and G(x) are cubic polynomials in x. The polynomials h(x) and
#(x) are respectively the remainders left when Дх) is divided by g(x) and by
G(x). Prove that, if αχ3 and Ax3 are respectively the terms of highest degree in
g(x) and G(x), then
[aG(x)-Ag(x)]f(x) ξ aG(x) A(x) - Ag(x) tf(x). (Cambridge)

5· Functions and inequalities
1. FUNCTIONS
The concept of an association between the elements of two sets is of prime
importance in mathematics. For example, the table of sines in a book of
mathematical tables exhibits an association between the set A of angles
between 0° and 90° and the set В of real numhers between 0 and 1. We
could exhibit the fact that sin 30° = 0-5 by writing the ordered pair
(30°, 0-5). The word 'ordered' used here reminds us that the order in
which we write the elements down within each pair must be taken into
account; that is, the ordered pair {a, b)is not the same as the ordered pair
(b, a).
We can put this concept of an association between the elements of two
sets in a more precise way if we introduce the idea of the Cartesian product
of two sets A and B. The Cartesian product of A and B, written Α χ Β is
the set of all ordered pairs {a, b) where a e A and be B. For example, if
A = {1, 2, 3}, В = {3, 4, 5},
then
A xB = {(1, 3), (1, 4), (1, 5), (2, 3), (2, 4), (2, 5), (3, 3), (3, 4), (3, 5)}.
Ex. 1. If the set A has m elements and the set В has η elements, how many
elements has the set Α χ Β ?
A relation between two sets A and В is a proper subset of Α χ Β. Thus,
a relation is a set of ordered pairs {a, b) containing at least one pair, but
not all the pairs, of Α χ Β. Of course, relation is a very wide term as defined
here. For example, both {(1, 3)} and {(1, 3), (1, 4), (1, 5), (2, 3), (2, 4), (2, 5),
(3, 3), (3, 4)} define perfectly respectable relations between A and B. Of
particular significance are the (1-1) relations (read as 'one-to-one'
relations) which associate with each element of A just one element of В in
such a way that no element of В arises from more than one element of A;
in our example above, such a relation could be {(1, 3), (2, 4), (3, 5)}.
In this chapter we shall be concerned with perhaps the most important
of all types of relation, the function. A function f from A to B, written
f:A^-B, is a subset of Α χ Β with the property that each element of A
appears just once. Another word for function is mapping: A is mapped byf
into B. A function may be looked upon as something which associates
78

1] FUNCTIONS
with each element of A a unique element of B. In Figure 5.1 a function is
pictured diagrammatically; notice that the arrows, indicating the paths
by which elements of A are to find their assigned destination in B, start
from each point of A. Notice too that there is no restriction on the number
of elements of A for which be Bis the image. (The image of the element
a e A is the element which a is mapped into by/.)
Fig. 5.1
If we have a function/: A^-B the set A is called the domain of/and the
set of image points (a subset of В in general, but possibly the whole of B)
is called the range of / The set В is called the codomain of /. The image
of the element a e A is written Да), thus, Да) is an element of В or, in
other words, (a,f(a)) is an element of Α χ Β.
Example 1. The function f: R+ -> R+ is defined by
Ax) = ih (xeR+)·
Find:
(i) the range off;
(ii) Д2) andfU2);
(iii) the value of χ which maps into \.
(i) For 0 < x1 < x2,
1 > Г^>Г7— >0·
1+*! l+x2
Thus, since 1/(1 +x) can take any positive value less than 1 by a suitable
choice of positive x, the range of/is the set {y e R+: 0 < у < 1}.
(ii) Д2) = 1/(1+2) = ^;/(V2) = 1/(1+V2) = V2~ 1 · (Notice that both
2 and V2 belong to Л+ and sofl2),fQ2) are defined.)
(iii) If l/(l+x) = i, x = 3.
If there is no doubt as to the domain, a particular function is often
defined by phrases such as 'the functionf{x) = хг — Ъх-Т or 'у is a
function of χ such that у = хг-Ъх-2ь or even 'the function χ*-3χ-2'.
From the context we are presumed to appreciate that χ is, say, a real
79

FUNCTIONS AND INEQUALITIES [5
number. Even so, the phraseology is loose: the notation f(x) (or y) stands
for the image of χ under /, not for the function itself.
*Ex. 2. Comment upon the statement
represent the same function'.
Ex. 3.f:R^-Ris defined byf(x) = x3- 3x2- 5x + 2. Evaluate ДО), Д- 1), Д2),
ДЗ). Which elements of R map into the number 2?
.Ex. 4. Answer the same question as in Ex. 3 for the function g: Q^-R defined
byg(x) = x3-3x2-5x + 2.
A useful pictorial representation of certain functions /: R-> R (or
subsets of these sets) may be obtained by taking the χ axis to represent the
domain and the у axis to contain the range. If у is the image of the number
χ under/, that is, if у =/(x),we represent the element (x,y) of/as the point
with Cartesian coordinates (x, y). The set of all such points is called the
graph of the function. For example, suppose f: R^ R is defined by the
equation Дх) = x2-5x + 4. The graph of this function for the part of the
domain -1 < χ < 6 is shown in Figure 5.2. This graph may alternatively
be referred to as 'the curve у = x2-5x + 4\
Certain relations which are not functions may also be represented
graphically; for example, consider the subset of RxR consisting of all
ordered pairs of the form (x, y) where xe R, ye R such that y2 = x.
That part of the graph for χ < 6 is shown in Figure 5.3.
Fig. 5.2 Fig. 5.3
Ex. 5. Explain why the relation depicted in Figure 5.3 is not a function.
Suppose now we have a function /: A-+ B; under what circumstances
will the set of all ordered pairs (f(a), a), aeR, constitute a function
F: В -> A ?

1]
FUNCTIONS
Viewed pictorially, the question may be rephrased: if all the arrows in a
diagram such as Figure 5.4 were reversed, under what circumstances would
we still have a diagram depicting a function?
Fig. 5.4
The question is easily answered if we recall what is demanded of such
a function F: it must associate each point of В with a unique point of A.
Thus, the range of/ must be the whole of В (since each point of В has an
image under F) and/must be (1-1) (since each point of В has a unique
image under F). If/satisfies both of these conditions F is defined as a
function: it is called the inverse function off: A^ В and is written
f-^.B^A.
Ex. 6. Show that no inverse function exists for the function /: Л -> R where
f(x) = x2, but that an inverse function does exist for the function g: R+ -> R+,
where g(x) = x2.
Exercise 5(d)
1. The function f:R^ R is defined by f(x) = v*—*- 1. Find ДО), Д1), /(2),
Л3),л-1)·
2. The function #:Д+ -> Я is defined by
Findtf(l),*(2),tf(3).
3. The function /: R -> Я is defined by
1-х if * < 0,
0 if χ = 0,
* if * > 0.
(This is normally written/(x) = |*|; read '/(*) equals the modulus of *'.) What is
the range of/? Sketch the graph of/.
A second function g: R -> it is defined by #(x) = x— |*|. What is the range
of gl Sketch the graph of g.
81

FUNCTIONS AND INEQUALITIES [5
4. The function/: R -> R is defined by
[θ, χ = 0.
Write down the values of/(-1) and/(l). Sketch the graph of/
5. A £ R and В Я R. The set / of all ordered pairs (x, y) is formed, where
χ e А, у е В and у = $x. Show that /is «oi a function if
(i) A = R+, Β = β; (ii) Λ = β, Л = R.
Suggest sets A and В for which /is a function.
6. A = {xeR:l ^ χ ^2} and/: Λ -> R is defined by/(x) = 2/x. The function
#: /4 -> Λ is denned by #0) = 1 +.y2. A new function h: A^> R is defined by
h(x) = #(/(*)] (this is usually written h = g of). Find
(i) A(f); (ii) the range of A.
7. Sketch the graph of the function/: R -> R where/(x) = |*+ 11.
8. Sketch the graph of the function g: R^- R where
*(*)= |*+l| + |* + 2| + |x + 3|.
9. A = {xeR.O < χ <3} and/: A -> R is defined by/(x) = x2-3x+2. Find
(i) /(1),Λ2);
(ii) the range of/;
(iii) the subset of A whose elements have the image 1 under/;
(iv) the subset of A whose elements have the image 2 under/
10. A = {x e R: - V6 =£ χ =£ V6) and /: A -> R is defined by /(*) = x*-6x.
Find the range and sketch the graph of/.
11. If /: R -> Λ is defined by /(x) = (*- l)3, show that f-^.R^-R exists and
determine its form explicitly.
12. Determine the numbers which are invariantt under the mapping f. R^- R
where /is denned by
»л _/(*' + *+Dfr+3)-1 (* + -3),
/W_\ 0 (*=-3).
Show how to illustrate invariance graphically for a general function g: R^- R.
13. /: R -> Λ is a given function, A <= R and the set of images of all the elements
in /4 is denoted by B. If С is the set of numbers whose images belong to B,
prove that A <= С and that the inclusion may not be strict.
14. /: R -> R is a given function, 21 <= Λ and the set of images of all the elements
in JSTis denoted byf(X)· If A <= R, В <= Risit necessarily true that
(i) AA П Β) = ΛΑ) η f(B); (ii) ΛΑ υ Β) = ΛΑ) Uf(B) ?
15. Sketch the graph of the function /: R+ -> Λ defined by Дх) = χ2. Show that
the function,?: R+ -> Λ, where ^(x) = **, is the inverse of/. Sketch the graph of g.
What connection exists between the graphs of/and g?
f An element α is invariant under the mapping/if/(a) = a.
82

1] FUNCTIONS
16. If χ is a real number, [x] denotes the greatest integer less than or equal to x.
(For example, [-2*] = -3, [Ц] = 4, [-1] = -1.) The function /: R -> Ζ is
denned by/(χ) = [χ]; sketch its graph.
A function/is called^ez-zW/c if there exists a number к such that f(x+k) = f(x),
for all χ in the domain; к is then called the period off. Show that the function
denned by f(x) = x— [x] is periodic and find its period.
What feature does the graph of any periodic function possess?
2. INEQUALITIES
The function, /, defined by
has for its domain the set R with the single point χ = \ deleted; if we call
this set D, we may write D = R-{f}. The inequality
defines a set А с д where
IxgD:
Ъ-2х
<i;
that is, A is the subset of elements of D which map into elements less
than 1. By 'the solution' of the inequality (1) is meant the determination
of the simplest expression for the set A.
It is always good practice, in solving an inequality, to express one side
of the inequality as zero. Thus
IxeD:
1
-1 ■
IxeD-
.2(*-l) ;
3-2*
" 3-2x
To determine the values of χ satisfying the inequality
: 0
2(*-l) i:
3-2ΛΓ
it is best to express the working in tabular form (see Chapter 1.7). The
critical values of χ e R are χ = 1 and χ = f.
3-2*
x-l
7,-lx
χ < 1
-
+
-
x = 1
0
+
0
.«<i
+
+
+
,>\
+
-
-

FUNCTIONS AND INEQUALITIES [5
Thus A = {xeD:x < l}u{xeD:x > %}.
Figure 5.5 shows a graph of the function/defined above; the domain of
/is denoted by the whole of the χ axis with the point χ = f deleted, the
range off is represented by the у axis, with у = 0 deleted. The subset A
of D is denoted by that part of the χ axis drawn in a thick line. (Note
carefully that l$A,i$A; indeed, \ φ D.)
Fig. 5.5
Example 2. Solve the inequality
* < 2
(*-3)(*+lp 5
and illustrate your answer by means of a sketch of the curve
y=(x~^'3)(x+iy
The function/defined by
/(X) = (x-3)(x+l)
has as its domain the set D where
D = j?-{-i,3};
that is, the set of real numbers, less the two numbers -1 and 3. (D may
be written alternatively as the complement of the set
{xeR:x=-l or χ = 3}.)
84

INEQUALITIES
_ 5х + 2х2-4л:-6
(x-3)(x + l)'5~ 5(x-3)(x + l)
2x2+x-6
_5(х-3)(х+1)
_(2х-3)(х + 2)
5(дс-3)(дс + 1)·
We thus seek a simpler description of the set A, where
| (2Х-ЗИХ + 2) |
The critical values of χ e R are χ = - 2, χ = -1, χ
As above, we set out the work in tabular form.
= 1, χ = 3.
χ <-2
-
-
-
-
+
*=-2
0
-
-
-
0
-2 < χ <-l
+
-
-
-
-
_,<*<!
+
-
-
+
+
3
x~ 2
+
0
-
+
0
\<x <3
+
+
-
+
-
x> 3
+
+
+
+
+
Thus A = {xe D: -2 ζ χ < -1} U {xe D: \ ^ χ < 3}.
Figure 5.6 shows the graph of/, the set A is represented by that part of
the χ axis drawn in thick lines.

FUNCTIONS AND INEQUALITIES [5
Ex. 7. Verify the solution of Example 2 by evaluating /(-3), Д- Ц), ДО), Д2),
Д4). Explain why these are appropriate values to use for checking.
*Ex. 8. Show that the two expressions
^-3)(χ+1) and (*-3)(*-*)(*+l)(*+2)
have the same sign for all values of χ φ — 1 or 3. Hence solve the inequality of
Example 2.
Exercise 5 (ft)
Solve the following inequalities, illustrating your solutions by means of a sketch.
x-4 6(3*+1)
x2-4 ; * (x-l)(x-2)(x+3) ;
86

6. The trigonometric functions
1. RADIANS
We shall assume in this chapter that the reader is familiar with the usual
measurement of angles in degrees and also with the definitions of the sines,
tangents, etc., of acute angles in terms of the sides of a right-angled
triangle. We shall also assume that arc-lengths in circles may be satisfactorily
defined and measured and, in particular, that the circumference and area of
a circle of radius r are respectively 2m and яг2.
An alternative measure of angle may be defined as
follows. Figure 6.1 shows a circle of radius r, centre O.
The magnitude of the angle Δ.ΑΟΡ may be defined as
the ratio of the arc-length, s, of AP to the radius, r.
In the particular case in which s = r, the angle Δ.ΑΟΡ
is called a radian. Thus, an arc-length of 2r subtends an
angle of 2 radians at the centre of the circle. (We may
write this ' Δ.ΑΟΡ = 2rad', or, since a ratio of two lengths is a pure
number, simply' Δ.ΑΟΡ = 2' provided no confusion is likely to arise.)
*Ex. 1. Show that a right-angle is \тт rad. Convert 180° to radians and use your
result to express in radians the following angles: 30°, 60°, 45°, 75°, 135°, 225°,
330°.
*Ex. 2. Express in degrees the following angles: \π rad, |тг rad, \n rad, \π rad,
■Α-π rad, %n rad, -&т rad.
Fig. 6.1
Fig. 6.2
We shall adopt the usual convention that angles are measured as positive
in the anticlockwise sense, the sense being denoted, when necessary, by an
arrow. Figure 6.2(i) shows an angle of -§я, Figure 6.2(ii) an angle of
V-я, both measured from О A.
One advantage of measuring angles in radians is that the expressions for
87

THE TRIGONOMETRIC FUNCTIONS [6
arc-lengths and the area of a sector take on a particularly simple form.
Indeed, if in Figure 6.1 L A OP = θ rad, we have
s = rd;
also, assuming that the area of a sector is proportional to the angle
subtended at the centre,
area of sector AOP = m\d\2n)
= irW.
(A further advantage of using radians will become apparent later when the
reader learns to differentiate the trigonometric functions: if the angles are
measured in radians, the derivatives of these functions take on a
particularly simple form.)
Exercise 5(a)
1. Express in radians the following angles: 20°, 45°, 105°, 150°, 210°, 270°, 315°,
330°, 450°, 570°.
2. Express in degrees the following angles: i?rrad, $π rad, |7rrad, -fen rad,
in rad, f π rad, -far rad, 5π rad, γ^ττ rad, ^π rad.
3. Express in radians, to 3 d.p.: 25° 30', 71° 15', 146° 12'.
4. Express in degrees, to 1 d.p. : 0-48 rad, 0-65 rad, 2-18 rad.
5. Taking π= 3141593, show that 1 rad « 57-296°. (Use a calculating machine
if you have one available.)
6. If the earth is taken to be a sphere of radius 3960 miles, find the length in
feet of arc which subtends an angle of 1 minute at the centre of the earth.
7. A circle has radius 10 cm. Calculate the length of the smaller arc cut off by
a chord of length 16 cm.
8. The chord PQ subtends an angle jtt at the centre, O, of a circle of radius
10cm. Find:
(i) the length of arc PQ;
(ii) the area of the triangle OPQ;
(Hi) the area of the minor segment cut off by PQ.
9. A chord of a circle of radius r cm subtends an angle of 2Θ degrees at its centre.
Obtain expressions for the length of the chord and the length of the minor arc
which it cuts off.
If the arc has length 3 cm and the chord 2-7 cm, prove that
7T0 = 200sin 0. (O&C'O')
10. The section of a tunnel consists of the major segment of a circle standing
on a chord of length 2 m. If the greatest height of the tunnel is 3 m, calculate the
radius of the circle.
Prove that the angle subtended by the chord at the centre of the circle is
approximately 1-287 rad, and hence find the area of the cross-section of the
tunnel, correct to the nearest square metre. (O&C'O' adapted)
88

1] RADIANS
11. Two circles of radii 6 cm and 8 cm respectively have their centres 10 cm
apart. Calculate the length of their common chord, and the area common to the
two circles. (O & С Ό')
12. Thevertexofahollowconeis^andfiCisadiameterofthebase;ВС = 6cm
and AB = AC = 5 cm. The cone is unrolled into a sector of a circle. Find :
(i) the angle of the sector;
(ii) the area of the curved surface of the cone;
(iii) the shortest distance on the surface of the cone from С to the mid-point
of AB. (O&C'O')
2. THE TRIGONOMETRIC FUNCTIONS
Figure 6.3 shows a circle of unit radius; OA = i, OB = j
(where as usual, i and j are perpendicular). The angle Θ,
measured in radians from OA as initial line, defines a
unique unit vector OP. Now OP may be expressed
uniquely in terms of the unit vectors i, j (see Chapter 2)
OP = xi+y\. Fig. 6.3
Thus, given any θ e R, we determine uniquely two numbers χ e R, у е R,
which may be taken as the images of two functions R^> R, called
respectively the cosine and sine functions. We write χ = cos Θ, and у = sin Θ;
the reader will have no difficulty in appreciating that our definitions lead
to the familiar sine and cosine of acute angles if 0 < θ < %n. We have to
be a little careful if we measure θ in degrees: strictly speaking, the sine and
cosine functions would then be different functions R^y R. Of course,
given an angle, the image of the appropriate sine function in either case
would be the same number: for example with angles measured in radians,
sin^Tr = 0-5; with angles measured in degrees, sin 30° = 0-5. We adopt
the same name for the two functions, as not to do so would lead to
confusion; if extra clarity is required, we may always write 'sin^rad' or
'sin0°'. (In this chapter, 'sin Θ' will mean 'sin Θ rad' and similarly for the
other trigonometric functions; if we wish to measure θ in degrees, we
write' sin θ°\)
Since -1 < χ < 1 and -1 < у < 1, the range of both the sine and
cosine functions is the set {x e R: -1 < χ < 1}.
The reader will no doubt suspect that a tangent function will now be
defined by tan θ = yjx. However, we must be careful here, for yjx is not
defined if χ = 0; this occurs whenever θ is an odd number of right-angles,
or, to be more precise, when θ = (2fc + l) \ττ, where к is any integer.
We must therefore restrict our domain to R η A', where
A = {0eR,keZ: Θ = (2Α: + 1)^}.
89

THE TRIGONOMETRIC FUNCTIONS
With this restriction, the tangent function R Π A
tan θ = yjx.
R is defined by
η θ = yjx.
Similarly, if В = {θ e R, к e Ζ: θ = for}, the cotangent function
Rf] B' -+R is defined by cot θ = xjy.
If the reader finds the notation of the previous two paragraphs difficult
to digest, he may simply remember that
(i) tan θ = yjx provided χ φ 0; tan θ is undefined if χ = 0.
(ii) cot θ = xjy, provided у φ 0; cot θ is undefined if у = 0.
In similar fashion, the secant and cosecant functions are defined by
(iii) sec0 = \jx, provided χ φ 0; sec θ is undefined if χ = 0.
(iv) cosec θ = Xjy, provided у φ 0; cosec θ is undefined if у = 0.
In terms of the sets A and В defined above, the domain of the secant
function is R η A'; the domain of the cosecant function is R η Β'.
*Ex. 3. Prove that the range of both the tangent and cotangent function is the
set R.
*Ex. 4. Prove that the range of the secant and cosecant function is the complement
of the set {x e R: -1 < χ < 1}.
The following table of the images of the sine, cosine and tangent
functions for some commonly occurring angles is worth memorizing:
sin
COS
tan
0
0
1
0
W6
1/2
V3/2
1/V3
W4
I A/2
1/V2
1
ir/3
V3/2
1/2
,/3
τ,β
1
0
Undefined
The corresponding images of the secant, cosecant and cotangent
functions may be obtained from the following important identities, which are
valid for all angles θ at which the functions are defined:
(Proofs follow immediately from the definitions. The sign = emphasizes
that these are identities, true for all values of θ for which the functions are
defined.)
Example 1. Find the value of:
(i) sin§7r; (ii) cosf^; (iii) tan^; (iv) с
(vi) cot (—ίπ).
(v) sec (-1-я-);

>] THE TRIGONOMETRIC FUNCTIONS
In each case we find:
(a) the correct sign to attach (by considering signs of χ and y),
(b) the acute angle which defines the same numerical values of χ and y.
(i) For θ = f π, у > 0 and so sin f π > 0.
Also the acute angle which defines the same numerical values of χ and у
s \ττ (see Figure 6.4(i)).
Thus, sin |тг=+ sin \n =+i.
Fig. 6.4
(ii) θ = \n, χ < 0 and the corresponding acute angle is \n (Figure
6.4 (ii)).
Thus, cos f π = - cos \tt =-\.
(iii) For θ = Ιπ, χ > 0, у < 0 and the corresponding acute angle is
£ττ (Figure 6.4 (iii)).
Thus, tan Ιπ = - tan \π = -1.
(iv) For θ = %π, у > 0 and the corresponding acute angle is |тг
(Figure 6.4 (iv)).
Thus, cosecfrr = +cosec^7r = +^/2.
(v) For θ = - J π, χ < 0 and the corresponding acute angle is \π
(Figure 6.4 (v)).
Thus, sec (-|л-) = -sec \n =—yj2.
(vi) For θ = -\π, χ < 0, j > 0 and the
corresponding acute angle is %π (Figure 6.4 (vi)).
Thus, cot (-in) = -cotin = -V3.
Figure 6.5 gives a useful method for seeing which
sign to attach according to the quadrant in which the
angle falls. A stands for all, S for sine, Γ for tangent
and С for cosine. In the A quadrant, all
trigonometric ratios are positive; in the S quadrant, only Fig. 6.5
4 PPM 91

THE TRIGONOMETRIC FUNCTIONS [6
sines (and cosecants) are positive; in the Τ quadrant, only tangents (and
cotangents) are positive; in the С quadrant, only cosines (and secants) are
positive. Notice, as an aid to remembering the diagram, that the letters
(read anticlockwise) spell the word 'cast'.
Example 2. Solve for θ the equation
2sin20 + sin0-l = 0
giving all solutions in the interval 0 < θ < 2π.
The left-hand-side factorizes: (2 sin 0-1) (sin 0+1) = 0 and so is
satisfied by values of θ such that either
(i) sin θ = +i, or (ii) sin θ =-1.
The relevant solutions of (i) are θ = \ττ, θ = \π; and of (ii) θ = \π.
Thus, the required set of solutions of the given equation is
Since OP is a unit vector, x2+y2 = 1 for all Θ. Thus,
sin20 + cos20 = 1, for all0.
Division by cos2 θ gives
1 + tan2 θ = sec2 Θ, for all θ Φ (2k +1) π/2.
Division by sin2 θ gives
1 +cot2 θ = cosec2 Θ, for all θ φ for.
These three identities are true for all values of Θ, provided the functions
mentioned are defined.
Π
•vj
Pi
Ό
Fig. 6.6
Suppose OP is the unit vector defined by the angle θ and OQ the unit
vector defined by the angle \π — θ (see Figure 6.6). If OP = x\+y\, then
OQ = y\ + x\. It follows that
sin (i?r-0) ξ cos Θ, cos (in-θ) = sin Θ, tan ϋπ-θ) ξ cot θ,
cosec Цл-в) = sec θ, sec ($π- θ) ξ cosec θ, cot ϋπ — θ) = tan θ
92

2] THE TRIGONOMETRIC FUNCTIONS
(provided, of course, that θ lies in domains for which both sides are
defined).
*Ex. 5. Find similar simplifications for the values of the trigonometric functions
for the angles
(i) in+θ; (ii) π-θ; (iii) π+θ.
N.B. Be careful about the signs!
Example 3. Solve the equation sin Θ + cos 2Θ = 0, giving all values of θ
between 0 and 2π.
Since cos (%π + Θ) = - sin Θ, the equation may be rewritten
cos 2Θ = cos(i?r + 0).
Now
cos θ = cos φ if θ = ±φ or θ = 2π±φ or θ = 4π±φ ....
Thus the original equation is satisfied for values of θ given by
20 = ±βπ + 0); 2θ = 2π±{\π + &);
2Θ = 4π±(^τ + θ); 20 = 6π±(%π + θ); ....
Trial of these solutions shows that the values of θ lying in the given range
аГе f1 7 1 1 1
Example 4. Prove that the identity
1 tanfl + cotfl
sin θ + cos θ ~ sec θ + cosec θ
holds, provided θ Φ \кл, θ Φ ηπ — \π.
The restrictions on 0 ensure
(i) {θ Φ ^Атг) that tan Θ, cot β, sec Θ, cosec β are defined;
(ii) {θ Φ wr- \π) that sin (9 + cos θ φ 0 and sec θ + cosec 6» Φ 0.
With these restrictions
_ /sing cosfl\ // 1 1 \
R,HS- Icos θ+ sin 0//lcos0 sing/
ξ (sin2 0 + cos2 0)/(sin 0 + cos Θ)
= l/(sm0 + cos0)
= L.H.S.
Example 5. Eliminate θ between the equation
χ = a cos Θ, (1)
j = й sec 6» +с tan 6», (2)
4-2 93

THE TRIGONOMETRIC FUNCTIONS
χ = a cos θ and у = b secd + c tan#
=> xy = a cos θ (b sec 0 + с tan Θ),
=> xy = a£ + acsin#
<s> acsin0 = ху-ой.
But ac cos θ = ex.
Squaring and adding , , , , , . ,.,
м ^ ь a2c2 - c2x2 + (xy-ab)2.
1. Write down the values of cos §π, tan |π, cosec £π, cot jtt, sin Jw, sec jtt,
cos fw, tan %π, cosec ( - ^π), sec ( - |π).
2. Write down the values of cos 315°, tan 135°. cosec 330°, sin(-135°),
cot(-120°), sec 240°, sin 480°, sec (-210°), cot (-60°), sin 1020°.
3. Use your tables to find the values of sin 215°, cos 128°, tan (-40°), cosec 161°,
sin (-200°).
4. Solve the following equations for 0, giving all values lying in the interval
0 < θ < 2π:
(i) sin0=-i; (ii) tan 0+1=0; (iii) sec 0 + 2 = 0;
(iv) cot 0 = V3; (v) 4 sin2 0 = 3; (vi) 2 cos (0-i?r) + V3 = 0;
(vii) sin 0 + cos 0 = 0; (viii) 2 sin 30 = 1; (ix) sec (0+K> = 1;
(x) 3 sec2 ((?+» = 4; (xi) cosec2 (0 + Ю = 1; (xii) tan2 20 = 3.
5. Solve the following equations for 0, giving all values lying in the interval
-я- s: 0 < я-:
(i) 2 sin2 0 + sin Θ = 0; (ii) 2 cos2 0 + 3 sin 0 = 0;
(iii) cos20 = sin0; (iv)2tan0 + si "
(v) sin(i7r-0) + cos0
(vii) tan0 + cotG-7r-0) =
(ix) cot20 + cosec0+l =
(vi) cos 0 = 2 cot 0;
(viii) tan 0+cot 0 +sec 0 =
(x) sec 30 + cosec 0 = 0.
6. Find the maximum and minimum values of:
(i) 3/(2 +sin 0); (ii) (l+cos20)2; (iii) sin2 0 + 2 sin 0+2.
7. No tables to be used in this question.
(i) If \ή < χ < π and sin x = i, find cos χ and tan x.
(ii) Ιί\π<χ<π and tanx=-i, find sin χ and cos*,
(iii) If я- < χ < \n and sec χ = - 3, find cos χ and tan x.
(iv) If in < χ < π and sin χ = i, find tan χ and secx;
(v) If \π < χ < 2π and sec χ = 4, find cosec χ and cot x.
8. Prove that, provided sin 0 φ 0.
l-cos0 = 1
sin 0 — cosec 0+cot 0'
94

2] THE TRIGONOMETRIC FUNCTIONS
9. Prove that, provided sin θ Φ 0, cos θ Φ 0,
tan θ + cot θ = sec θ cosec θ.
10. Prove that, provided all the values of the functions are defined,
cosec2 θ = 1 + cos θ cot θ cosec Θ.
11. If sin θ = f find, without using tables, the possible values of sec (9 +cosec Θ.
12. If tan = i find, without using tables, the possible values of 2 cos Θ + cot Θ.
13. Use your tables to find all values of Θ, lying in the interval 0° < θ° < 360°,
which satisfy the following equations:
(i) 2 sec2 Θ0 = 5-tan 0°; (ii) 3 cos2 Θ" = 7 cos 0°-2;
(iii) 2 sin2 (9°-sin (9° cos (9° -cos2 (9° = 0;
(iv) cot2 θ° = cosec 0°; (v) 16tan20° = 9;
(vi) 3 sec2 Θ0 = 2 cosec 0°; (vii) tan 0° = 2(sec θ° + cos 0°);
(viii) sec20° = l + 2tan0°.
14. Eliminate θ between the following pairs of equations:
(i) χ = 2cos0, y= 3(l+sin0);
(ii) χ = cosΘ — sinΘ, у = cos0+sin0;
(iii) χ = 2 cos 0 — sin в,у = cos# + sin0;
(iv) χ = 3 tan 0, у = 4 sin 0;
(ν) χ = cosec 0-1,.y = cos 0+1.
15. A particle oscillates along the χ axis in such a manner that its coordinates at
time t seconds after the start of the motion are (sin ifot+e), 0).
(i) Where is the particle at the start of the motion?
(ii) Between what two points does the particle oscillate?
(iii) How long does the particle take to move from one extreme point to the
other?
16. If a particle is projected with a velocity К cos ai+ К sin aj under gravity
then its position vector at time t is given by
г = (Kicosa) i + (Kisin a,-\gt2)].
If г is written in the form г = xi+я,
prove that у = χ tan a.
2\K2/
Deduce that, if x, y, g, V are known, there are in general two values of a in
the interval 0 «S a < %π which satisfy this equation. Under what circumstances is
there only one such angle?
17. Prove that the equation „ , ,
x2+.y2 + z2 = a2
represents the surface of a sphere.
Show that the point Ρ whose position vector is
г = a cos θ cos φϊ + α sin θ cos φ} + a sin фк
lies on the sphere, for all values of θ and φ.

THE TRIGONOMETRIC FUNCTIONS [6
Is the following converse result true? Values of θ and φ can be found such that
any point on the surface has a position vector of the form
г = acos0cos0i + asin0cos0j + asin0k.
18. Discuss the possibility of solving the following equations for Θ:
(i) sin θ = a+ I/a; (ii) 2ab cos θ = я2 + Z>2;
(iii) cos2 0+4 = sin 0 + 4 cos 0.
19. What are the maximum and minimum values of the expression 4/(2 + sin x) ?
Without attempting to solve the equations exactly, state how many values of χ
in the range 0 < χ < 2π satisfy
(i) the equation cos x(2 + sin x) = 4;
(ii) the equation tan x(2+ sin x) = 4.
3. THE GRAPHS OF THE TRIGONOMETRIC
FUNCTIONS
Since revolutions through angles 2π, Απ,... about О brings the unit vector
OP into coincidence with its original position, it follows that, for any
integer к and any of the six trigonometric functions/: R-+ R
A2kn + x) = /(*).
Thus, a trigonometric function is a periodic function, period 2π (see
Exercise 5(a), Question 16).
Figure 6.7 shows the graphs of the sine function (continuous line) and
cosine function (dotted line), for values of the domain —2π < χ < 2π.
Fig. 6.7
Ex. 6. How do the graphs of sin χ and cos χ illustrate the identities
sin (%π—χ) = cos x, sin (%π + χ) = cos x,
sin &т-х) = -cos x, sin (jtt + x) ξ -cos x?
Ex. 7. Sketch roughly the graphs of 2 sin x, sin 2x, [sin x\.
Figure 6.8 shows the graph of the tangent function (continuous line)
and cotangent function (dotted line) for values of the domain
— 2π<χ<2π. Recall that the tangent function is undefined at values
of χ which are odd multiples of %π but, by taking χ sufficiently close to such
96

3] GRAPHS OF TRIGONOMETRIC FUNCTIONS
values, we make |tanx| arbitrarily large. A similar remark holds for the
cotangent* function at values of χ which are even multiples of \n. (A line
such as χ = \n is called an asymptote for the curve у = tan x.)
Fig. 6.8
Ex. 8. How does Figure 6.8 illustrate the identities ϊ&η.(\π-χ) = cot x,
tan(i7r+x) ξ —cot x, tan(f-77- — x) = cot x, tan(j7r+x) = —cot xl
Ex. 9. Sketch roughly the graphs of tan 3x and |tan x\.
Figure 6.9 shows the secant function (continuous line) and cosecant
function (dotted line).
Ex. 10, How does Figure 6,9 illustrate the identities
sec (in- x) = cosec x, sec (\n + χ) = -cosec x,
sec (Ιπ— χ) = —cosec χ, sec Цп+х) = cosec χΊ
Ex. 11, Sketch roughly the graphs of sec 2x and |cosec x\.
97

THE TRIGONOMETRIC FUNCTIONS
[6
4. INVERSE TRIGONOMETRIC FUNCTIONS
The sine function has no inverse, because if we take any number ζ in the
range — 1 < ζ < 1, we can find any number of values χ such that
sin χ = z. However, if we restrict the domain of the sine function to
-\ττ < χ < \tt (the range remaining -1 < у < 1)
then the inverse function does exist, for the mapping becomes one to one,
The inverse function so defined is called the arcsine function, Thus, arc-
sine is a function from
{xeR: -I ^ χ ^ 1} to {yeR: -%п < у < \ή)
with the property that arcsin χ = у if sin у = χ, and conversely, (Notice
that we retain χ as an element of the domain.) Another notation for arcsin χ
is sin-1 x; the slight drawback to this notation is that it suggests that the
unrestricted sine function has an inverse, which is, of course, false,
The graph of the arcsine function may be obtained directly from the
graph of the sine function for the restricted domain — \ή < χ < \π by
interchanging the χ and у axes. If we retain χ as an element of the domain,
so that у = arcsin x, the graph is as shown in Figure 6.10.
•\-
2
-1
Fig. 6.10 Fig. 6.11
The arccosine function may be similarly defined, with domain
{xeR: -1 < χ < 1} and range {0 < у ^ π}.
If у = arccos x, then cos у = χ. The graph is shown in Figure 6,11,
Again, the arctangent may be defined, with domain R and range
{-\π < у < %π} (notice the strict inequality), If у = arctan x, then
tan у = χ. The graph is shown in Figure 6,12,
Ex. 12. Suggest appropriate domains and ranges for arccotangent, arcsecant and
arccosecant.
Ex. 13, Write down the values of arcsin \, arccos 1/^2 and arctan (-1).

INVERSE TRIGONOMETRIC FUNCTIONS
Example 6. Simplify cos (arcsin x).
Let arcsin χ = у; then —\π < у < \ή and sin у = χ
cos2 у = 1 — sin2 у = 1-х2.
Thus cos у = ± V(l -x2); but — |-7г < j < \ή and so cos у > 0; it follows
that cos (arcsin x) = V(l -*2)·
Exercise 6{c)
1. Show that the function/denned by
f(x) = cos л: + sin χ
is periodic and sketch its graph.
2. Draw the graph of the function/defined by
f(x) = sin χ + sin 2x
for values of χ in the interval 0 < χ < 2π. Solve the equation
sinx + sin2x = 0,
giving values of χ for which 0 < χ < 2π.
3. Draw the graph of у = tan x°, from χ = - 20 to χ = 70, plotting points at
intervals of 10°.
Using the same axes and intervals, draw also the graph of
у = cos (x+10)° from χ = 20 to χ = 50.
Read from your graph the value of χ when tan x° = cos (x+10)°.
(O&C'O')
4. It is required to find an angle χ such that
sin л:
= τ*
where χ is measured in radians.
Draw on the same diagram the graphs of у = sin л: and у = f χ for values of
χ from 0 to \ή, taking 1 in, (or 2 cm) to represent τιπ on the scale for x, and
0-2 on the scale for y.
99

THE TRIGONOMETRIC FUNCTIONS [6
Estimate from your graphs the required angle, giving your answer in degrees
to the nearest degree. (O & С Ό')
5. Solve graphically the equation tan 2x° = 2 cot x°, giving all solutions χ such
that 0° < x° < 180°.
6. Draw the graph of у = 1 — 2 sin 60x° for values of χ from 0 to 6, using 2 cm
as unit on both axes. By drawing a suitable straight line on the same diagram,
read the solutions, within the given range of values of x, of the equation
2(1 - 2 sin 60x°) = x. (O & С Ό')
7. Solve graphically the equation
sin χ = cos2;t+l
for values of χ for which 0 < χ < п.
8. Using the same axes sketch the graphs of: (i) sin x, (ii) 2 sin x, (iii) 1 + 2 sin x.
9. Using the same axes, sketch the graphs of (i) sin x, (ii) -sin x, (iii) 2-sin x.
10. Sketch the graph of 2(1 -cos x).
11. Sketch the graph of 1 -tan x.
12. Sketch the graph of 1 + |sin x\.
13. Sketch the graph of sin (х + &т).
14. Sketch the graph of 1 +2 cos (χ-^π).
15. Find the values of:
(i) sin (arccos i); (ii) cos (arcsin -£);
(iii) tan(arccot V3); (iv) sin(arctan—1);
(v) cos(arccot — 1/V3); (vi) sec (arccos y);
(vii) cosec (arcsin i); (viii) cot (arcsin -iV3)·
16. Simplify the expressions (i) tan (arcsin x); (ii) sin (arctan x).
17. Sove for χ the equation tan (arccot x) = 2 sin (arccos x).
18. Simplify the expression sec {arccos [tan (arccot *)]}.
What restriction must be placed upon the value of χΊ
19. If arcsin χ = я·-arccos у, find a relation between χ and y.
20. Prove that:
(i) arcsin χ + arccos χ = %n;
(ii) arcsin [cos (arcsin x)] + arccos [sin (arccos x)] = in.
(O & С modified)
Miscellaneous Exercise 6
1. A chord PQ of a circle of radius r subtends an angle 2Θ rad at the centre of
the circle and 2Θ < n. Find an expression for the area of the smaller of the two
portions into which PQ divides the circle.

4] MISCELLANEOUS EXERCISE 6
If this area is a fraction 1 /2n of the area of the circle, prove that sin 2Θ = 20 — 1,
Obtain graphically an approximate solution in radians of this equation.
(O&C'O')
2. If a circle of radius a is drawn with its centre on the circumference of another
circle of radius a, find the area common to both circles,
3. Find the maximum and minimum values of the expression
sin2 0 + 6 sin 0 + 4.
4. Solve the equation
12 cos2 θ = 6 + sin0
giving all values of Θ, in radians, lying between 0 and 2π,
5. A globe representing the earth consists of a sphere of radius 10 cm, Find the
length of the circumference of the small circle representing the 30° parallel of
latitude,
Two points A, B on the 30° parallel of latitude differ in longitude by 90°,
Calculate:
(i) the length of the straight line joining A and B;
(ii) the angle subtended by AB at the centre of the globe;
(iii) the great circle distance between A and B, in cm, correct to 3 s.F.
(O&C'O')
6. Show, by sketching the appropriate graphs, that the equation
x+cosx = 1
has only one root, What is this root ?
7. Prove that the identity
sin2 χ cos2 y—cos2 χ sin2 у = sin2 x—sin2 у
is true for all values of x, y. Is the identity
sin2 χ cos2 y—cos2 χ sin2 у _
sin2 x— sin2 у ~
also true for all values of χ and yl Give reasons for your answer.
8. Solve the simultaneous equations
sin(x + .y) = УЗ,
cos(2x-x) = -У2
giving all pairs of values of x, y, lying between 0 and n.
9. Eliminate θ between the two equations
χ = tan Θ—sin Θ,
у = tan<9 + sin<9.
10. Sketch the graphs of у = x and у = cos χ and hence sketch the graph of
у = x + cosx.
Sketch the graphs of:
(i) у = |x+cos x\; (ii) у = \x\ +cos x; (iii) у = x+ |cos x\.
101

THE TRIGONOMETRIC FUNCTIONS [6
11. Draw an accurate graph of the function у = \n sin χ between χ = 0 and
χ = π. Draw in the same diagram the lines у = mx for m = 1, τ, τ, f, 1 ·
Determine the values of χ where these lines cut the graph of у = \ττ sin χ
giving your answers in the form kn, where к is correct to two decimal places.
Use the values obtained to draw a separate graph of
, sin* ,
у = \π between χ = \π and χ = π.
12. Show by means of a rough graph that the equation cos χ = х/2тт has
2n +1 positive roots when л is a positive integer. Show also that, if r is a positive
integer less than n, the root nearest to 2rn is 2m+a, where a, is the smallest
positive root of the equation
cos* = - + -—.
η 2nn
Draw a careful graph of у = cos χ for - \ή < χ < \ή and from it find the five
positive roots of the equation
4π cos χ = x.
Give your answers in the form kn, where к is correct to two decimal places.
(O&C)
13. Show that θ = 2mr + a, and θ = (2n +1) π-a,, where η is an integer, both
satisfy the equation sin θ = sin a. Prove, furthermore, that any solution of the
equation must have one or other of these forms.
θ = 2m + a or θ = (2n + 1) π- a is called the general solution of the equation
sin θ = sin a,. Find forms for the general solution of
cos θ = cos a and tan θ = tan a.
14. Find the general solutions of the equations
(i) tan λ: = cot(x + in); (ii) 2 cos2 χ -cos χ - 1 = 0.
15. AB is a chord of a given circle, centre О and radius a, subtending an angle
θ at 0 (θ < π). Prove (i) that the area of the triangle OAB is \аг sin Θ, and (ii)
that the area of the minor segment of the circle of which AB is the chord is
\α\θ-ύηθ).
Prove also that the area of the part of the circumcircle of the triangle АО В
lying outside the given circle is
16. In a circle centre O, two radii OP, OQ contain an angle θ radians (θ < π).
If the area of the sector OPQ is A and the length of the chord PQ is 2c show that
1 — cos θ = сЮ\А. Draw on the same diagram on squared paper the graphs of
the functions у — 1 — cos θ and у — сЮ/А in the particular case where
A = 7rcm2, с — 1-3 cm
for values of θ between 0 and ή. (Take 1 cm to represent -&п on the χ axis and
4 cm to represent one unit on the у axis.) Read off from your graphs the value
of θ between 0 and π which satisfies the equation 1 - cos θ = c29/A in this
particular case. Use your value of θ to find the radius of the circle in this case.
(Cambridge)
17. The function/: R+ -> R is defined ЬуДх) = χ sin \/x. Sketch the graph of the
function.
102

7· Probability infinite outcome spaces
1. ARRANGEMENTS AND SELECTIONS:
COUNTING LARGE NUMBERS
Before discussing the question of probability, we shall introduce a few
techniques whereby the procedure for counting up large numbers may be
made more efficient. Our reason for doing so is that, in probability, we are
frequently faced with the problem of deciding in how many ways an event
can occur; since the numbers involved might be large, it is desirable to
arrive at them by the simplest method available.
Consider first the following problem: in how many ways may the four
aces from a pack of cards be arranged in a row on a table ? By direct counting,
it is not difficult to arrive at the correct answer, which is twenty-four.
However, it is simpler to argue thus: consider four spaces on the table; the first
space may be filled in any one of four ways: for each of these four ways,
the next space may be filled in three ways and so on, The total number of
ways is thus 4x3x2x1 = 24.
The result of multiplying together all the integers from 1 to л is written
η! (read 'factorial η'). Thus, the number of ways of arranging the four aces
may be written 4!.
Ex. 1. Verify that 2! = 2,3! = 6,4! = 24. Write down the values of 5!, 6!, 7!.
The result above may be generalized in an obvious way: the number of
ways of arranging η unlike objects in a row is η!,
Now suppose we alter our question: how many different rows of four cards
may be made, given a standard pack of fifty-two cards! Direct counting
is now out of the question—it would take far too long, But an identical
argument to that employed above shows us that the number of ways is
52x51 χ 50x49 = 6497400. We may write this in the alternative form
52!
48!'
Again, our argument may be generalized in an obvious way: the number
of ways of arranging r objects chosen from η unlike objects is
nx(n-l)x(n-2)x ... x(n-r+l) =(-~ry,·
103

PROBABILITY IN FINITE OUTCOME SPACES [7
This is often referred to as the number οι permutations of η unlike objects
r at a time and is written nPr. Thus
(Note that 0! is defined to have the value 1 and so this formula remains
true when r = n.)
Ex. 2. Evaluate 4P2, SPS> 6P6.
Ex. 3. How many different' words' can be made using the five letters of the word
'after' ? How many of these begin with A and end with R?
We may modify our question again and ask: 'how many different hands
of four cards may be dealt from a standard pack Τ The difference here is
that we are not concerned with the order in which the cards are dealt:
we only want to know the number of selections (sometimes called
combinations) of four cards that can be made from fifty-two cards. Suppose for the
moment that this number is N. Then each of these N different hands may
be rearranged in 4! = 24 ways. There are then 24N different arrangements
of four cards. But we already know that the number of different
arrangements is 52 χ 51 χ 50 χ 49. Thus
= 270725.
24
Once more, our argument may be generalized: since there are nPr
possible arrangements of η objects taken r at a time, the number of
selections, or combinations, of η unlike objects, taken r at a time, is
nPr = П\
r\ r\{n-r)V
The number of combinations of η unlike objects, taken r at a time, is
written either as nCr or I I. The latter notation is used almost universally
now, although the former has its merits: we can read „C, as 'n choose r',
reminding us that we are choosing, or selecting, rather than arranging.
We shall retain „Cr in this chapter, but revert to ( I in later chapters,
when the need arises. 7_s
Щ^г)У
(Notice that again, since 0! has been defined as 1, „C„ = 1 by the formula,
which accords with the commonsense result that there is just one way of
selecting η objects from n.)
104
0-*-

1] ARRANGEMENTS AND SELECTIONS
Ex. 4. Evaluate SC2,6C3.
*Ex. 5. Evaluate nC0 by the formula, and interpret your result.
*Ex. 6. Evaluate nCr and nCn_r by the formula and interpret your result.
Ex. 7. In how many ways may a cricket team of eleven boys be chosen from
fourteen boys available? If three particular boys are certain to be chosen, how many
ways are there of completing the team?
Example 1. A committee of three is to be chosen from four men and three
women. If at least one man is to be included, how many possible selections are
therel
First solution. There are three possibilities:
(i) Two women included: number of ways of selecting the women
= 3C2 = 3; for each of these choices there are 4Cj = 4 ways of selecting
the remaining committee member, who must be a man.
Total (i): 3x4 = 12.
(ii) One woman included: number of ways of selecting the woman
= 3Q = 3; for each of these choices there are
г 4х3 ή
ways of selecting the remaining committee members.
Total (ii): 3x6 = 18.
(iii) No woman included: the committee is selected entirely from men,
4C3 = 4 ways.
Total (iii): 4.
The total of possible committtees is thus
12 + 18+4 = 34.
Second solution. There are
_7x6x5
,Сз"1х2хЗ ^
possible committees in all. Just one of the committees consists entirely of
women, and so 35- 1 =34 contain at least one man.
It is often necessary in probability questions to know in how many
ways it is possible to arrange η objects in a row, given that r are alike of
one kind, s are alike of a second kind and so on. We start by distinguishing
between the like objects by attaching suffixes to them: for example, if we
have r letters a, s letters b, ... then we write our η letters as
<h, Ун аз> ···> ar, K, b2, b3, ..., bs, ....
There are η J ways of arranging these η letters. But, in any one of these
105

PROBABILITY IN FINITE OUTCOME SPACES [7
arrangements, the letter a^, α2, a3, ..., ar may be rearranged amongst
themselves in r! ways; similarly the letter b1; b2, b3 bs may be arranged in si
ways and so on. Thus, if we drop the suffixes, the number of
distinguishable arrangements becomes
rlsl...'
Hence we have the following important rule.
Given η objects, r alike of one kind, s alike of another kind and so on, the
number of arrangements of the η objects in a row is
For example, the number of different arrangements of the letters of the
word SELECTIONS IS
2^j = 907200.
Ex. 8. Tn how many ways may four letters Ρ and six letters Q be arranged in a
row?
We complete this section by mentioning one further technique for
counting which is frequently useful in probability: to find the number of ways
of arranging η objects, r at a time, if repetitions are allowed. We shall
illustrate the method adopted in Example 2.
Example 2. In a simplified football coupon there are ten matches whose
results (home win, 1; away win, 2; draw, χ) are to be forecast. In how many
ways may the coupon be completed!
The first result may be forecast in three different ways (1, 2, x); for
each of these choices there are three different choices for the second
match (1, 2, χ) and so on. The total number of ways of completing the
coupon is thus 310 = 59049.
Ex. 9. A multiple choice paper consists of ten questions, to each of which is
attached five possible answers, labelled A, B, C, D, E. A candidate selects one
of these answersfor each question; in how many ways may he complete the paper?
Exercise 7(a)
1. Five boats are entered for a race. Assuming that they all finish and that there
are no dead heats, in how many possible orders can they pass the finishing line?
2. In a form of thirty boys, a first and a second prize are to be awarded; in how
many ways can this be done?
106

1] ARRANGEMENTS AND SELECTIONS
3. You have a form on which you have to give your first six choices for
university, in order of preference. If you have a list of twelve universities offering the
course you want, in how many ways can you complete the form?
4. A fruit machine has three windows, in each of which appears independently
one of six pictures. How many different arrangements (taking the order into
account) are possible?
5. How many five-digit numbers can be formed from the digits 1, 2, 3, 4, 5,
using each digit once? How many of them are even? How many are even and
greater than 30000?
6. A committee of four is to be selected from six Labour and six Conservative
M.P.s. How many possible committees are there? In how many will the
members of the Labour Party have a majority?
7. In how many ways may a tennis team of six members be selected from fifteen
available players? In how many ways may a first and second team be chosen?
8. Twelve people are to divide up into three sets of four players for a whist drive.
In how many ways can this be done?
9. Of ten electric light bulbs, three are faulty but it is not known which. In how
many ways may three bulbs be selected? How many of these selections will
include at least one faulty bulb?
10. How many different bridge hands (thirteen cards) are there which contain
(i) all four aces, (ii) three aces and one king? (Leave your answer in factorial
form.)
11. Criticize the following attempted solution of Example 1:
Ά man must sit on the committee, and he can be chosen in four ways. The
remaining two members may now be chosen arbitrarily from among the six
remaining people: this may be done in 6C2 = 15 ways. Thus, the total number of
possible committees is 4 χ 15 = 60.'
12. Prove that the numbers of distinguishable arrangements of η objects in a
row, if two are alike and the rest different, is \n\.
What is the number of distinguishable arrangements if three are alike and the
rest different ?
How many different telephone numbers can be made using all the digits of
the number 4225267?
13. How many different arrangements are there of the letters of the word
queue? In how many of these arrangements do the letters QU appear together,
in that order?
14. Find in how many ways a batting order (eleven men) may be made if Smith
is to bat before Brown.
15. A diagonal of an и-sided polygon is a line joining two non-adjacent vertices.
How many diagonals does an и-sided polygon possess?
16. In how many ways can two Is, two 2s and two 3s be thrown with six dice?
107

PROBABILITY IN FINITE OUTCOME SPACES [7
17. Given η unlike objects, find the number of ways of dividing them into three
unequal groups of sizes p, q, r where p+q + r = n.
In how many ways can they be divided into two groups of size ρ and one of
size n—2p,if no attention is paid to the order of the groups?
18. A pair of integers is selected from the set of positive integers 1, 2, 3, ..., n.
In how many ways may this be done? [In each pair the order of the integers is
immaterial, e.g. (2, 3) and (3, 2) count as one pair only.]
If the integers in each pair are multiplied together show that, in the case when
η is odd, the number of products which will be odd integers is \{n2 — 1). If
η is large show that this number is approximately one-quarter of the total number
of products. (Cambridge)
2. RANDOM EXPERIMENTS AND
OUTCOME SPACES
The subject of probability deals with 'experiments' which may have a
number of possible outcomes; more specifically, it seeks to assign a
numerical measure to the likelihood of obtaining various possible results
if such an experiment is conducted, and thereby enable us to analyse the
situation mathematically. Experiments of this nature, in which the results
obtained depend upon chance, may be called random experiments.
We may take, as three typical examples of random experiments:
(i) A coin is tossed and the result (head or tail) noted.
(ii) From the very large output of a machine producing electrical
components, a sample of twenty components is drawn and each component in
the sample is tested in turn to decide whether or not it is faulty.
(iii) A person is tested for blood group {A, B, AB or O).
Each of the above experiments has the property that its outcome may
be one of a (finite) number of possibilities. A set whose elements represent
all the various distinct possible outcomes of a random experiment is called an
outcome space for that experiment and its elements are called elementary
events. Notice particularly the words 'all' and 'distinct' in this definition:
all the possible outcomes must be represented in the set and no elementary
event can correspond to more than one possible outcome of the experiment.
(The word 'space' is used because the outcomes of an experiment are
frequently represented as points in geometrical space;' outcome set' would
perhaps be a preferable term but we shall follow the customary usage.
Alternative terms in use are 'sample space' and 'possibility space'.)
For example, denoting by r the numbers of defectives found in a sample
in experiment (ii) above, a possible outcome space would be
Si ={reZ:0 < r < 20}.
It should be noticed that we have talked of ал outcome space rather than
108

2] RANDOM EXPERIMENTS AND OUTCOME SPACES
the outcome space: any other set whose elements represent distinct
possible outcomes and which exhausts all the possibilities will do, e.g.
S2 = {G, F, P},
where G denotes a good result (no defectives);
F denotes a fair result (one or two defectives);
Ρ denotes a poor result (more than two defectives).
Generally speaking, it is best to choose as an outcome space one that
gives as much detail as possible about the result obtained, but there are
exceptions to this and other criteria may be adopted.
Ex. 10. Two coins are tossed and the result (in terms of heads and tails) noted.
Suggest three possible outcome spaces.
Ex. 11. A card is drawn from a pack and its value and suit noted. Criticize the
following outcome space:
S = {card is an ace, card is a heart, card is neither a heart nor an ace}.
Ex. 12. Suggest an outcome space for Ex. 11 if the experiment is concerned only
with the drawing of a heart or an ace from a pack of cards.
Ex. 13. A count is made of the numbers of girls and boys in a family. Under
what circumstances would the set
S = {there are more girls than boys, there are more boys than girls}
constitute an outcome space?
3. PROBABILITY DISTRIBUTIONS
We now assign to each elementary event st of our outcome space S a
positive fraction^ (i.e. 0 < pt < 1) called the probability that the outcome
of our experiment will be s{. Furthermore, for consistency, we shall make
the sum of all the probabilities/?» over the entire outcome space 1. Thus,
the probabilities associated with an outcome space S = {sly s2 s„} are
real numbers such that
0 < Pi < 1, (1)
Pi+P2+Ps+-+Pn = 1. (2)
Such a set of probabilities is said to constitute a probability distribution
for the given outcome space.
It will be observed that so far we have attached no meaning to the
numbers p{—we have only placed certain restrictions upon their possible
values. Provided we observe these restrictions we have a mathematically
meaningful system. However, it is desirable that a mathematical system
should have some relevance to the physical world. If a mathematical
system describes some physical situation, we are said to have created a
109

PROBABILITY IN FINITE OUTCOME SPACES [7
mathematical model of that situation. In using a probability distribution
as a mathematical model of a situation we shall demand that the probability
Pi shall be, in some sense, a numerical measure of our degree of belief
that the experiment will result in the outcome j».
It should be carefully noted that the assigning of a probability
distribution to an outcome space S constitutes an assumption about the
experiment under consideration. Certain' natural' ways of assigning probabilities
are discussed below. In many cases, a 'natural' way of assigning
probabilities will appear so obvious that the tacit assumptions made may be
overlooked. It is a sound point of self-discipline to pause to consider the
assumptions made when embarking upon any question in probability.
We now give two examples of ways in which probabilities may be
assigned to the elementary events of typical outcome spaces. The final
justification for such probability distributions is that calculations based
upon them are supported by empirical evidence.
(i) Symmetry among the possible outcomes may make it reasonable
to assume that all the elementary events have equal probability. For
example, in the case of a die, it may reasonably be assumed that no one
face is more or less likely to appear uppermost if the die is thrown than
any other. Then sinceрг +p2 +... +pe = 1,
Pi = Pi = Рз = Pi = Ps = Pe = 6-
If the same probability is assigned to each elementary event of an outcome
space S, the resulting probability distribution is said to be uniform and the
elementary events are said to be equiprobable.
(ii) Previous repetitions of an experiment (conducted under constant
conditions) show that outcome A occurs a per cent of the time, В occurs
b per cent of the time, etc., where A, B, ... are quite distinct. Then our
accumulated experience suggests that the probability of securing outcome
A is aj 100, that of securing В is й/100, etc. As an example, suppose that a
large number of samples of size 20 are taken from the output of a machine
manufacturing electrical components and that, on average, 0-94 per
sample turn out to be defective. Then we may reasonably take the
probability of any components being defective as 0-94/20 = 0-047. Notice that
such an assignment of probabilities satisfies the requirements (1) and (2)
on p. 109.
Ex. 14. Of the outcome spaces you suggested for Ex. 10 which do you think may
reasonably be supposed to be uniform?
Ex. 15. Two dice are thrown and the total score is noted. Criticize the assigning
of a uniform distribution to the outcome space
S = {2, 3,4, ...,12}.
110

3] PROBABILITY DISTRIBUTIONS
Ex. 16. Suggest an outcome space for the experiment of throwing two dice
that may be given a uniform distribution.
Ex. 17. From the weather records of a certain town taken over the past thirty
years, on the average five days in November have been recorded as foggy. Is
it justifiable to assume that the probability of a November day in that town
being foggy is i?
Ex. 18. The number of boys and girls in a family of five children are noted.
Justify the assumption that the outcome space
S = {there are more girls than boys, there are more boys than girls}
has a uniform distribution (i.e. ρ = \ for each of the two elementary events).
4. PROBABILITIES OF EVENTS
Any subset Ε of the outcome space S of an experiment is called an event.
(Notice that the use of the word in 'elementary events' is consistent with
this definition, since {s} с S.) If we have assigned a probability
distribution to our outcome space it becomes meaningful to consider the
probability of the event Ε (by which we would mean, in a practical example, a
measure of our degree of belief that the experiment will result in one of the
elementary events belonging to E). A moment's consideration should
show that the following definition is plausible, at least in simple cases.
Given a sample space S with associated probability distribution, and an
event Ε с S, then the probability of the event E, written Pr (E\S) (read: 'the
probability of event Ε given S'—more explicitly, 'the probability of Ε
given the probability distribution of the outcome space 5") is defined as
the sum of the probabilities of all the elementary events belonging to E.
More formally, if S = {sly s2, ..-, s„} andp{ is the probability associated
with s{, and if Ε = {sri, sr2, ..., srm} then
Pr (E\S) = pn +pn +p„ + — +prm-
Example 3. If a card is drawn from a well-shuffled pack, what is the
probability that it will be an acel
A suitable outcome space, S, is the set of 52 elementary outcomes
corresponding to the fifty-two different cards in the pack.
To continue, we must make an assumption about the probability
distribution. Since the pack is well shuffled, we shall consider this as a case of
fifty-two equiprobable events and so associate with each element of S the
probability -fe. The event Ε is the subset of S consisting of the drawing of
the ace of spades, hearts, diamonds or clubs.
Then Pr (E\S) = -h +W + A +-s\- = -h-
Ill

PROBABILITY IN FINITE OUTCOME SPACES [7
Example 4. What is the probability of securing a hand of thirteen cards all
of one suit in a game of bridge!
We take as our outcome space the set consisting of all possible distinct
bridge hands, i.e. a set with 62C13 elements.
If the cards have been well shuffled we may assume that these hands
are equiprobable and so the probability of getting a specified hand is
13! 391/52!.
Of these hands, four consist of one suit only. The required probability is
thus 13! 39! 4/52!, which works out to be roughly 1-6 x KH1.
*Ex. 19. How would you interpret the probabilities obtained in Examples 3 and
4?
Ex. 20. What is the probability of obtaining one head and two tails if three coins
are tossed?
Ex. 21. What is the probability of obtaining a total of more than 10 from the
throw of two dice?
Ex. 22. There are one thousand tickets issued in a lottery and prizes are awarded
for twenty of them. What is the probability of any specified ticket securing a
prize?
Ex. 23. What is the probability that a bridge hand contains (i) just one heart
(a singleton heart); (ii) a singleton in just one suit? (Leave your answers in
factorial form.)
Probability is occasionally formulated in terms of odds rather than as
a fraction. If the probability that a horse will win a race is estimated as $,
then the probability that it will not win is £ and the odds against its winning
are 3 to 1 (sometimes written as '3 to 1 against')· Again, if the probability
that another horse will win a race is §, then the odds infavour of its winning
are 2 to 1 (or '2 to 1 on').
Ex. 24. The odds on horse A to win a race are 3 to 1 on and on horse Л 4 to 1
against. Write down the probabilities
(i) that A wins;
(ii) that Л wins;
(iii) that A does not win.
Exercise 7(b)
1. Consider families consisting of six children, all of different ages. The 'type'
of family is defined by an ordered set of the form {B, B, G, B, G, B} the elements
of which represent the sex of each child, starting with the eldest. How many
different types are there? In how many different types are there three boys
and three girls? What is the probability that a family of six children will consist
of three boys and three girls?
112

4]
PROBABILITIES OF EVENTS
2. In question 1 an outcome space
S = {0, 1,2, ...,6}
is given, where the elements represent each of the possible number of boys in a
family. Assign a probability distribution to this space.
3. A number is chosen at random from among the integers 1, 2, 3,..., 20. What is
the probability that
(i) it is a multiple of 3 or 7?
(ii) it is a multiple of 3 or 5 ?
4. If a committee of four is chosen at random from ten women and ten men,
what is the probability that there will be two women and two men serving?
5. If, in Question 4, the male chairman is certain to be re-elected, what is now
the probability of equal numbers of men and women ?
6. Two numbers are selected at random from the integers 1 to 10. (You may
assume that all numbers are equally likely to be selected and that the same
number may be selected twice.)
The elements 0, 1,2, ..., 9 of the outcome space represent the magnitude of the
difference between the two numbers. Assign a probability distribution to this
space which you feel represents a suitable mathematical model for the
experiment.
7. If two numbers are selected at random from the numbers 1 to 10, what is
the probability that the larger number will be greater than 8 ?
8. If two cards are drawn from a pack of fifty-two cards, what is the probability
that
(i) they will both be spades,
(ii) they will be of the same suit ?
9. Two people are asked independently to write down an integer between 1 and
10 (inclusive). The sum, s, of the two numbers is then calculated. Write out an
outcome space for the possible values of s and assign a probability distribution,
on the assumption that each person is equally likely to select any one of the ten
numbers.
What is the probability that s will be prime?
10. In a mixed bag of screws there are twice as many large as small. 10 % of the
large screws are defective and so are 5 % of the small screws. Assuming that every
screw has an equal probability of being selected, what is the chance of picking
a defective screw ?
Comment upon the assumption made.
5. THE ADDITION LAWS
We are now in a position to prove some simple theorems concerning the
probabilities of compound events. Throughout we shall suppose that the
outcome space S and the probability distribution are given.
113

PROBABILITY IN FINITE OUTCOME SPACES [7
Theorem 7.1. If' ΕΎ, E2 are two events, then
Pr (Ег U E2\S) = Pr (^|5) + Рг СВД-Рг (Ег η E2\S).
Proof. The sum of the probabilities of the elementary events in E1 и E2
is equal to the sum of the probabilities in Ег and E2, less the sum of the
probabilities in Ex Π Ε2, since this has been included twice. The result now
follows from the definition of the probability of an event.
Theoreml .2. Ifthe events Ег,Е2 are mutually exclusive {i.e. ifЕгГ\ Е2 = 0),
then
Pr (Ег U £2|S) = Pr СВД + Рг (E2\S).
Proof: Pr(E1 П E2\S) = Pr(0 \S) = 0, by the definition of the
probability of an event. The result now follows immediately from Theorem 7.1.
Theorem 7.3. If Ε is any event
Pr(E'\S) = l-Pr(£|S).
Proof. Since Ε U E' = S and Ε Π Ε' = 0, this is a special case of
Theorem 2.
Theorem 7.4. If Elt E2, ..., En are mutually exclusive events
{Et[\E,= 0, idpj)
then Pr(U^,|s) =Pr(JE,1|S) + Pr(JE,2|S) + ...+PrCE,n|S).f
A formal proof of this theorem may be had by employing mathematical
induction (see Chapter 9). However, its truth is intuitively fairly obvious
if we consider a Venn diagram and note that, since no two of the Et
intersect, the sum of the probabilities in UE{ is obtained by adding up the
probabilities for Ex then for E2 and so on.
Example 5. Two dice are thrown. What is the probability of scoring either a
double, or a sum greater than 9?
We take as our outcome space the set of 36 pairs (i,j), where i,j run
independently from 1 to 6. We make the assumption that the events are
equiprobable and so attach a probability to each elementary event of -Зд6-.
Wrlte E, = {(1, 1), (2, 2) (3, 3) (4, 4) (5, 5) (6, 6)},
E2 = {(4, 6) (5, 5) (6, 4) (5, 6) (6, 5) (6, 6)},
t The notation U means the union of the η sets Eu E2, ...,E„; that is.
Εχ U Ег U ... U Е„.
114

51 THE ADDITION LAWS
and so Ε1{\ΕΛ = {(5, 5), (6, 6)}.
We thus have
Pr (£, |S) = Pr (E, \S) = -h; Pr (Ε, η Ε,\S) = &
and so Pr (£j и E2 \S) = i + έ - "A. by Theorem 7.1,
= A-
Example 6. Two cftce are thrown. What is the probability of not getting a
double ?
Making the same assumptions as in Example 5 and using the same nota-
ti0n' Pi(Ei\S)=l-Pr(E1\S), by Theorem 7.3,
= 1-*
= i
Ex. 25. In a class of boys, one-third have black hair and one-quarter have brown
eyes. What deductions can you make?
Ex. 26. If a card is drawn from a well-shuffled pack what is the probability thai
it is either an ace or a king?
Ex. 27. If a card is drawn from a well-shuffled pack, what is the probability that
it is an ace or a heart?
Ex. 28. If a number is selected at random from the integers 1,2,..., 30 what is the
probability that it is
(i) divisible by 2;
(ii) divisible by 3;
(iii) divisible by 6;
(iv) not a multiple of 2 or 3 ?
Ex. 29. If one thousand tickets are issued in a lottery in which there are two
first prizes, eight second prizes and ten third prizes, what is the probability of not
securing a prize with one ticket ?
6. THE MULTIPLICATION LAWS
We shall now consider how we must modify our estimate of the
probability of an events if we are given information in addition to our outcome
space and its allied probability distribution. Suppose that Ε and F are
two events (with Pr (F\S) φ 0) and we require to estimate the probability of
Ε knowing that F occurs. Essentially, we have a new outcome space F
and we have to determine Pr (E\F). (We define Pr (^l-F) to mean
Pr (Ε η F\F).)
115

PROBABILITY IN FINITE OUTCOME SPACES [7
Now since F is the new outcome space, the new probability distribution
for F must have the property that the probabilities of the elementary
events contained in F sum to 1. Making the plausible assumption that the
relative proportion of the weights attached to the elementary events of F
remain unaltered, this is equivalent to scaling up the probabilities in .Fby
a factor 1/Pr (F\S). The probability that Ε will occur is thus Pr (Ε η F\S),
scaled up by this factor ljp(F\S). We are thus led to make the following
definition:
Fv(E\F)
Fr(Ef)F\S)
?r(F\S) ·
Example 7. A bag contains twenty balls, ten of which are red, eight white
and two blue. The balls are indistinguishable apart from the colour. Two balls
are drawn in succession, without replacement. What is the probability that
they will both be red?
We take as our outcome space S the set of 20 χ 19 = 380 possible
selections of two balls. Since the balls are indistinguishable apart from
colour, we may impose a uniform probability distribution upon S.
Now define Кг as the event 'the first ball picked is red', R2 as the event
'the second ball picked is red' and similarly Wlt W2, Въ B2.
We require to find Pr (R-, η R2\S).
Now Pr (R^S) = ^ and Pr (R2\Rj)—the probability that a red ball is
selected from a bag now containing nine red, eight white and two blue
balls is -j%.
Thus, pr № n ^ |S) = pr №|jy pr ^ |S)
= &·**
= &·
Example 8. With the data of Example 7, what is the probability that we obtain
a blue and a white ball (in either order) ?
Adopting the notation of Example 7, we are required to calculate the
probability of obtaining one or other of the two mutually exclusive events
Βτ η W2 and W1 П B2; that is Pr {(B1 η W2) U (й^ П B2)\S).
Now
Pr {(B1 η W2) U (Wx η BJIS} = Pr (Bx η *ВД + Рг (й^ П B2\S),
since the events are mutually exclusive using Theorem 2;
= Pr (H^J.Pr (Я^ + Рг (52|й^).Рг (W^S)
= -h.
116

6] THE MULTIPLICATION LAWS
The arguments employed for the solution of the preceding examples may
be represented diagrammatically by a probability tree (see Figure 7.1).
Starting at О on the extreme left, the end-points of the first three branches
represent all the possible (mutually exclusive) results of the first draw.
The end-points of the next three sets of branches represent all the possible
Fig. 7.1
outcomes of the second draw, the various sets corresponding to the
different draws on the first round. The various probabilities are then attached
to the branches as shown and the probability of any desired path from
О to some end-point on the right may be found by multiplying the
successive branch probabilities together.
For example
Pr^n ад = ■&*# = ft,
Pr (B2\S) = Pr (R, η B2\S) + Pr (^ П B2\S)+Vr (Btf B2\S)
(events mutually exclusive)
= Pr СВД.Pr №|jy + Pr (^|S).Pr (B2\ Wy
+рг(ад.Рг(ад)
= «.ft+A.ft+A-iV
= ΤΓο·
(Can you see a simpler method of arriving at this last probability?)
117

PROBABILITY IN FINITE OUTCOME SPACES [7
Probability trees are particularly helpful if a number of probabilities
have to be read off. If only one probability is required, a simplified tree may
often help. For example, in Example 7 we are essentially concerned with
two alternatives: red and not red. Figure 2 shows a suitable tree for this
example.
<20
,0
<z
Pr (R, η R2\S) = Pr №|5).Рг (R^R,)
A particularly important case of the multiplication law arises in the case
of independent events. Intuitively, two events are independent if neither
has an effect upon the other. To make this notion mathematically precise
we define two events A and В to be independent if
Pr {A n B\S) = Pr (A\S).Pv (B\S).
Comparison of this definition with that given above for conditional
probabilities shows that the assumption of independence is equivalent to
the assumption that
Fv (A\S) = Fr (A\B).
In words, the occurrence of В does not lead us to review our estimate of
the probability of A.
Ex. 30. Prove that, if Pr (A n B\ S) = Pr (A \ S) Pr (B\ S), then Pr (A \ S) = Pr (A \ B).
Example 9. A coin is tossed and a die thrown. What is the probability of
securing a head and a six!
On the assumption that the two events are independent, the required
probability is clearly \ χ \ = τι.
118

61 THE MULTIPLICATION LAWS
Ex. 31. A bag contains two white balls and six red balls; a second bag contains
four white balls and four red balls. The balls are indistinguishable apart from
colour. What is the probability of obtaining two white balls if
(i) one ball is drawn from each bag;
(ii) two balls are drawn from the first bag;
(iii) two balls are drawn from the second bag?
Ex. 32. Criticize the following argument:
' A ring of anti-ballistic missiles is estimated to give a probability of 0-2 of
destroying any incoming missile. A set of five such rings will therefore render
a city immune to missile attack.'
Ex. 33. 'In a form of twenty-four boys, six failed their English examination and
four failed their Mathematics. The probability that a boy, selected at random
from the form, failed in both English and Mathematics is thus & χ -& = ■&.'
Do you think this argument is essentially correct?
Ex. 34. A coin is spun twice and a die is thrown twice. Find the probability of
obtaining at least one head or at least one six (or possibly both).
Exercise 7(c)
1. What is the probability of not throwing a six with four throws of an
unbiased die? What is the probability of throwing at least one six?
2. On average, 2 % of the electric light bulbs of a certain type prove to be
faulty: what is the probability that a batch of twelve such bulbs will be free
of defectives?
3. If four people are chosen at random, find the probability that no two of them
were born on the same day of the week. (M.E.I. Ό')
4. A bag contains a dozen apples, of which three are bad. If two apples are
withdrawn at random, find the probability that
(i) both are good ;
(ii) both are bad;
(iii) one is good and one bad. (M.E.I. Ό')
5. Show that there is a better chance of throwing at least one 6 with four throws
of a single die than there is of throwing at least one double 6 with twenty-five
throws of a pair of dice. (This is a modification of a famous problem in the
history of probability, first proposed by a gambler, the Chevalier de Mere and
transmitted by him to the French mathematician Fermat, who managed to solve
it.)
6. A bag contains two red, three white and four blue balls. If two balls are drawn
in succession without replacement, what is the probability of drawing a red and a
blue ball, in either order? What is the corresponding probability if the first ball
is replaced before the second is drawn ?
7. A bridge hand consisting of thirteen playing cards contains two aces. If
five cards are drawn at random from the hand, find the probability that the five
will contain neither of the aces. (M.E.I. 'C)
119

PROBABILITY IN FINITE OUTCOME SPACES [7
8. A teacher has twelve pupils in his form, eight boys and four girls. On a school
outing he can provide transport in his car for three pupils and decides to draw
lots for the seats. Construct a tree to show on its branches the probabilities as to
the sex of the winning pupils in the different stages of the draw. Hence, or
otherwise, find the probability that those winning will contain
(i) two boys and one girl,
(ii) at least two girls. (M.E.I. Ό')
9. Given five different letters and their respective envelopes, in how many ways
can one letter be placed in each envelope if this is done at random?
Find in how many cases only two of the letters will be in their correct
envelopes.
What is the probability that at least three of the letters are in their correct
envelopes? (M.E.I. Ό')
10. What is the probability that, after a pack of cards has been well shuffled,
two aces will lie at the top ?
11. A bag contains three red, four white and five blue balls, indistinguishable
apart from their colour. If two balls are drawn successively, without
replacement, estimate the probability of obtaining two balls of different colours.
12. A machine has N components, each of which has a probability of 2/3 of
breaking down. The machine will function provided at least one of its
components is functioning. What value should be chosen for N if the machine is to be
99% efficient?
13. A pack of fifty-two ordinary playing cards is shuffled and a card is
withdrawn. D denotes the event that the card is a diamond, К that it is a king, R
that it is red. Prove by calculating the appropriate probabilities that D and К
are independent, that К and R are independent, but that D and R are not
independent.
What is the value of Pr [(Z)> nK)U(Dn #')]? (M.E.I.)
14. Three cards are drawn from a pack of fifty-two cards and, when they have
been replaced and the pack shuffled, a second set of three cards is drawn. Find
the chance that the six cards drawn should include at least one ace. (M.E.I.)
15. A bag contains five white and three red balls. Balls are drawn in succession
and are not replaced. Show that the chance that the first red ball will appear
at the fifth draw is 3/56. (M.E.I.)
16. Three bags, А, В, С, contain respectively three white and two red balls, four
white and four red balls, five white and two red balls. A ball is drawn unseen
from A and placed in B; then a ball is drawn from В and placed in C. Find the
chance that if a ball is now drawn from С it will be red. (M.E.I.)
17. How many times should an unbiased die be thrown if the probability that
a six should appear at least once is to be greater than 9/10? (M.E.I.)
18. A batch of fifty articles contains three which are defective. The particles are
drawn in succession (without replacement) from the batch and tested. Show that
the chance that the first defective met will be the /-th article drawn is
(50-/·) (49-/0/39200. (M.E.I.)
120

6] THE MULTIPLICATION LAWS
19. Two men A and В play a game in which A should win eight games to every
seven won by B. If they play three games, show that the probability that A will
win at least two games is approximately 0-55. (Cambridge)
20. The chance of any one engine of a four-engined aeroplane failing on a long
journey is 5 %. If only one engine fails the chance of the aeroplane completing
the journey is 80 %; if two engines on opposite wings fail, its chance of
completing the journey is 50 %. It cannot fly with two engines out of action on the
same wing. Find the chance that the aeroplane will complete the journey.
(O&C)
21. (i) From a bag containing five red, four white and three green balls, three
are drawn together at random. Find the chance of their being
(a) all of different colours,
(b) all of the same colour.
(ii) Two six-faced dice whose faces bear the numbers 1 to 6 respectively are
thrown together. Find the chance of
(a) the total score being exactly 8,
φ) the total score being greater than 8. (O & C)
7. EXTENDED USE OF THE MULTIPLICATION
LAWS
Note. To gain precision we have so far always specified the conditioning
set, even if it is the entire sample space (whose associated probability
distribution is known). In this latter case, however, little harm is done by
dropping the reference to S; thus, if the outcome space is S and an event
Ε is considered as a subset of S, it is customary to refer to Pr (E\S) simply
as Pr (E). Provided no ambiguity arises we shall adopt this simplified
notation.
For example, if A, В are independent, we shall write
Pr(4 η 5) = Рг(Л).Рг(5).
More generally, if А, В are any two events in an outcome space S
Pr (Α η Β) = Pr (A\B).l?r (В).
In pursuing applications of the multiplication law, we first notice that,
since the operation of intersection is commutative, that is, since
Α η Β = Bf) A
we have, for any two events A, B
Pr (Α\Β).Ρτ (B) = Vr(A(\B) = Pr (Β η A) = Pr (B\A).Pr (A).
This form of the multiplication law is often useful, for example, when it
is desired to calculate Pr (A\B) and the three probabilities Pr (A), Pr (B),
Pr (B | A) are known or easily found.
121

PROBABILITY IN FINITE OUTCOME SPACES ['
Example 10. Two bags contain coloured balls as shown in the table below.
BagI
Bag II
Red
3
0
White
2
4
Blue
1
2
A die is thrown; if a I or 6 appears, then bag I is chosen, otherwise bag II.
A ball is then drawn from the selected bag. If the result of the throw of the die
is unknown, but merely that a white ball is drawn, what is the probability that
it came from bag II
We begin by making the assumption that the balls are indistinguishable
apart from colour and that we have an equiprobable outcome space.
Let I denote the event 'bag I is selected',
II denote the event 'bag II is selected',
W denote the event' a white ball is selected'.
ThenPr(I) = i;Pr(II) = f.
W may occur in either of two mutually exclusive ways: bag I is selected
and a white ball is drawn; bag II is selected and a white ball is drawn.
Thus we have
7r{W) = Pr(^nl) + Pr(^n II)
= Pr (W|I).Pr (I) + Pr (W|lT).Pr (II)
Pr(I|PT)
_ Pr (I n W)
~Pr(0O
Pr(^nl)
Pr(0O
Pr(^|l).Pr(I)
Pr (W)
which is the required probability.
The technique mentioned above may be generalized to yield a result
commonly known as Bayes's Theorem. (The Reverend Thomas Bayes
(1702-61) was one of the earlier writers on the subject of probability. The
theorem which bears his name was published posthumously in 1764.)
122

7] EXTENDED USE OF THE MULTIPLICATION LAWS
Before considering the general theorem we shall solve an example for
a special case (n = 3).
Example 11. Three machines produce the same type of electrical component,
20 % of the total output coming from machine A, 50 %from machine В and
30 %from machine C. Tests conducted in the past show that 5 % of the
components from A and 1 %from each of В and С prove faulty. A component
selected at random from the total output is proved to be faulty. What is the
probability that it came from machine ΑΊ
Call A the event 'component comes from machine A',
В the event 'component comes from machine B',
С the event' component comes from machine C,
F the event 'component found faulty'.
We have to determine Pr (A\F).
Now, from the data, we may assign the following probabilities:
Pr (F\A) = xj^; Pr (F\B) = y^; Pr (F\C) = χ^;
Pr (A) = -fts; Pr (B) = yVo; Pr (C) = -f^-.
Now Pr (A\F) = Pr (A n F)/Pr (F) and so the problem reduces to
determining Pr (A n F) and Pr (F), given the numerical values above. But
Fr(Af\F)= Vr(F(\A)
= Pr(F\A).J>r(A)
= τοπ·
Again F = (F() A) U (Ff] B) U (F(] C) and these three events are
mutually exclusive (no component can come from more than one machine).
ThUS Pr (F) = Pr (F(] A) + ?r (F(] 5) + Pr (F(] C)
= Pr (F\A). Pr (A) + Pr (F\B). Pr (B) + Pr (F\ C). Pr (C)
-Th + iU + rAo
Thus 50°" PrC^^f
= f
Ex. 35. Of two pennies, one is double-headed, the other normal. If one of the
pennies is selected at random and tossed twice, what is the probability of
obtaining two heads? If one of the coins is selected at random, tossed twice and gives
two heads, what is the probability that it is double-headed?
Ex. 36. Find the probabilities that the faulty article of Example 11 came from
(i) machine B, (ii) machine C.
123

PROBABILITY IN FINITE OUTCOME SPACES [7
Ex. 37. With the data of Example 11, find the probability that a component
selected at random and shown to be not faulty comes from machine A.
Ex. 38. Two articles come from one of the machines of Example 11, but it is
not known which one. On testing, both are found to be faulty. Find the
respective probabilities that they come from machines А, В and C.
Notice that in Example 11 we had an outcome space partitioned into
three mutually exclusive and exhaustive events А, В, С, and were required to
find the probability of A conditional upon the occurrence of some event F.
More generally, we define a partition of a set S as a set of subsets
{НЪН2,Щ, ...,Hn)
such that the intersection of any two of the H's is empty and the union of
the H's is S, that is
Given a set S, the set {Hlt Я2, ..., #„} of subsets forms a partition of
Sif
(i) Hif) H} = 0,1 «ξ i <j «ξ n;
(ii) ΰ# = S.
Theorem 7.5 (Bayes's Theorem). Given apartition Hu Я2, ..., Hn of an
outcome space S and an event Ея. S,
Vx{H}\E)
Рг(ЭД)Рг(Яг·)
Рг(^|Я1).Рг(Я1) + Рг(^|Я2).Рг(Я2)+...+Рг(^|Яп).Рг(Яп)·
Proof. Since the H{ are exhaustive,
Ε = {Ε η Щ U {Ε П Я^ U ... U {Ε η Яп).
But the Hi are mutually exclusive and it follows that the Ε η Щ are also
mutually exclusive. Thus, by Theorem 7.4
Pr(£)= Pr(£n Я^ + Рг^п Я2) + ...+Рг(£п Нп)
= Pr (ВДЭ.Рг (Я^ + Рг (£|Я2).Рг (Я2)+ ... +Рг (£|Яп).Рг (Яп),
but
Рг(Я,п£)
Рг(Я3-|£)- Fr(E)
_Рг(Е[)Щ
Рг(£)
_Рг(£|Я^.Рг(Я,)
Рг(£)
Рг(ОД).Рг(Я,)
= Рг(£|Я1).Рг(Я1) + Рг(£|Я2).Рг(Я2) + ... +Рг(£|Яп).Рг(Яп)·
124

7] EXTENDED USE OF THE MULTIPLICATION LAWS
The value of this theorem lies largely in its application to inferential
problems in statistics. A number of hypotheses H{ are made (hence the
conventional use of the letter Η in the theorem) and the probability
of one hypothesis Д, given the occurrence of some event Ε is determined in
terms of the probabilities of the Щ without the prior knowledge of the
event E. (This is sometimes referred to as the determination of the a
posteriori probability of the event H} given the apriori probabilities of the
events H{, 1 < i < n.) The major practical difficulty in its application lies
in the estimation of the apriori probabilities Pr (Щ.
In solving problems involving Bayes's Theorem it is advisable always
to work from first principles as in Example 11, and not to quote the
theorem in its general form.
Exercise 7(d)
1. One of four dice is known to be biased, showing a 6, on average, three times
as often as each of the other scores. A die is chosen at random and thrown three
times and three 6s appear. What is the probability that it is one of the unbiased
dice?
2. A factory manufactures three different qualities of light bulb, А, В and С in
the ratio 1:2:3. The bulbs are indistinguishable in external appearance but
extensive tests indicate that, on the average, 1 % of type A, 4 % of type В and
5 % of type С are below the advertised standard. A batch of six, all of the same
type is sent in error without a distinguishing label. If none of these turns out to be
defective, estimate the probability that they were of type A.
3. Of a large sample of items, 40 % were produced by machine A and 30 % by
each of machines В and С The machinery was very unreliable, machine A
producing 10% defective items, machine В 20% defective items and machine С
30 %. If an item, selected at random from the sample, proves to be defective,
what is the probability that it came from machine A ?
4. A card is missing from a pack of 52 cards. If this is the only information you
have, what is the probability that the missing card is a spade? The pack is well
shuffled and the first card is removed and proves to be a spade. What would your
assessment of the probability that the missing card is a spade be now? The card
removed is now replaced and the pack shuffled. The top card again proves to be
a spade. What is your assessment of the probability now?
5. Four bags, I, II, III, IV, contain white and black balls as shown in the
following table.
Bag I II III IV
Number of white balls 12 3 4
Number of black balls 9 8 7 6
A die is thrown; if a one appears, bag I is chosen, if a two or three, bag II,
if a four or five, bag III and if a six, bag IV. A ball is then drawn at random from
the bag selected.
5-2
125

PROBABILITY IN FINITE OUTCOME SPACES [7
If you are told that a black ball has been drawn, what should your
assessment be of the probability that it came from bag I ?
6. A certain rare disease from which one in ten thousand of the population suffers
is diagnosed by a test which reveals the presence of the disease in 95 % of the
cases of those tested who actually have the disease. However, it also incorrectly
yields a positive reaction in 1 % of the cases of those who are not suffering from
the disease. If a person selected at random from the population shows a positive
reaction, what is the probability that he is actually suffering from the disease?
7. One of four pennies is known to be double-headed, the other three being
normal. One of the pennies is selected at random and tossed three times. If the
result is three heads, what is the probability that the coin tossed is the double-
headed penny?
8. An engineer is responsible for servicing three small computer installations
А, В, С The probability that А, В, С will break down on any day are 2го, го, too
respectively. The probabilities that the three computers will not be in operation
after 7 p.m. are ^, §·, $ respectively. On one day the engineer receives a message
that one of the computer installations has broken down at 8 p.m. but the
message does not specify which computer. Calculate the probabilities that the
breakdown is at А, В, С and so decide the first installation which should be
checked by the engineer. (M.E.I, adapted)
9. The probability that a radio message sent from a ship is not picked up by a
land-station is p. The land-station replies to all messages received and its
transmission back has probability q of not being picked up by the ship. Find the
probability that the land-station failed to receive the message if the ship gets
no reply.
Miscellaneous Exercise 7
1. A number is drawn at random from the set of thirty-seven numbers
0, 1,2, ...,36,
and is then replaced.
(i) What is the chance that the number drawn will be odd?
(ii) What is the chance that in two consecutive draws the numbers drawn will
be the same ?
(iii) What is the chance that in one hundred consecutive draws the number 0
will not be drawn ?
(iv) What is the chance that in five consecutive draws none of the numbers
will be odd?
Give the answers to (i) and (ii) as vulgar fractions, and evaluate the answers
to (iii) and (iv) approximately as decimals. (Cambridge)
2. An encyclopaedia consisting of n(>4) similar volumes is kept on a shelf
with the volumes in correct numerical order; that is, with volume one on the
left, volume two next and so on. The volumes are taken down for cleaning and
replaced on the shelf in random order. What are the probabilities of finding
exactly я, (я—1), (я—2), (я—3) volumes in their correct order on the shelf?
(Cambridge)
126

7] MISCELLANEOUS EXERCISE 7
3. You have available two pairs of dice. The dice of one pair are both true but,
in the other pair, one is true and the other is known to be biased in some way.
Which pair would you choose if you were playing a game in which to win you
had to throw (a) a score of 7; (6) a double?
4. Two persons X and У play a game in which X deals from a standard pack of
fifty-two cards. Each player receives three cards and the game is won by X if he
holds more red cards than there are black cards held by Y.
By considering the number of black cards in play or otherwise, calculate the
probability that the dealer wins a game. What odds should be fixed by the dealer
in order that he may expect just to show a profit? (M. E.I. Ά')
5. Four cards, the aces of hearts, diamonds, spades and clubs are well shuffled
and then dealt two to player A, the other two to player B. A is then asked whether
at least one of his two cards is red. He replies in the affirmative. In the light of this
information we wish to calculate the probability that he holds both the red aces.
Consider the argument:
'We know that he has one red ace; without loss of generality we may suppose
that it is the heart ace. Among the other three cards there is no reason why one
more than another should be the diamond ace: one only out of three equally
likely possibilities gives A both the red cards: the required chance is thus $.'
Criticize this argument and produce a correct argument and answer. (C.S.)
6. A bag contains twelve coloured balls: 4 yellow, 3 white, 2 black, 2 red and
1 green. A prize is offered for obtaining 2 red balls in two draws, there being an
option of either replacing or not replacing the first ball. Which would you choose ?
Generalize your result to the case of a bag containing N balls of s different
colours where there are m, balls of colour /, and prove your conjecture.
7. Raffles of two types, A and B, are held. In raffles of type A there are 2N
tickets, of which In carry prizes; in those of type В there are N tickets of which
η carry prizes. Which would you prefer: two tickets in a raffle of type A, or two
tickets in separate raffles of type ΒΊ
8. A six-digit telephone number (i.e. any number between 100000 and 999999)
is dialled at random. Show that the probability that one or more digits are dialled
in succession (as in 322916 or 322216 or 322911 but not as in 329216) is about
41 %. (Cambridge)
9. An experiment consists in drawing balls one at a time from a bag containing
eight white and two black balls initially. Balls drawn from the bag are not
replaced. A note is kept of the number r (1 =ζ r =ζ 9) of the draw which first yields
a black ball. If the experiment is repeated many times what is (i) the most
probable value of /-; (ii) the probability that r exceeds 5 ? (Cambridge)
10. Articles are distributed among three boxes in such a way that each article
has the same chance of being placed in any one of the boxes. Find the probability
that at least one of the boxes remains empty when r articles have been
distributed.
Deduce that, if r articles are similarly distributed among five boxes, the
probability that exactly two boxes remain empty is бР'"1 - 2r + V/S*-1. (Oxford)
127

PROBABILITY IN FINITE OUTCOME SPACES [7
11. A bag contains m oranges and η lemons: sampling is at random without
replacement. Obtain the probabilities of the following events:
(i) the first draw is an orange;
(ii) the first two draws are oranges;
(iii) the second draw is a lemon, given that the first was an orange;
(iv) the third fruit is an orange. (Cambridge)
12. In a game of bridge each of four players is dealt thirteen cards after the pack
has been shuffled. One of the players has four diamond cards and he can see four
more diamonds in the exposed dummy hand, which has been put down on the
table by one of the players. What is the chance that the five remaining diamond
cards are split three in one hand and two in the other?
If, in the first three rounds of play, each player has used exactly one diamond
and two other cards, what is the chance that one of the players has no diamond
cards remaining in his hand? (M.E.I.)
13. Half the population of the city of Ekron are Philistines and the other half
are Canaanites. A Philistine never tells the truth but a Canaanite speaks
truthfully with probability f and falsely with a probability f. What is the probability
that a citizen encountered at random will give a correct answer to a question?
Tabulate (as fractions, decimals or percentages) the probabilities that 0,1,2, 3
men out of a sample of three citizens taken at random from Ekron will affirm a
proposition (i) when it is true, (ii) when it is false.
The proposition has a prior probability of $ of being true. What is the
(posterior) probability of its being true conditional on its being affirmed by only one
out of three ? (Cambridge)
14. A farmer keeps two breeds (A, B) of chicken; 70% of the egg production is
from birds of breed A. Of the eggs laid by the A hens, 30 % are large, 50 %
standard and the remainder small; for the В hens the corresponding proportions
are 40 %, 30 % and 30 %. Egg colour (brown or white) is manifested
independently of size in each breed: 30 % of A eggs and 40 % of Л eggs are brown. Find
(i) the probability that an egg laid by an A hen is large and brown;
(ii) the probability that an egg is large and brown;
(iii) the probability that a brown egg is large;
(iv) which size grade contains the largest proportion of brown eggs;
(v) whether colour and size are manifested independently in the total egg
production. (Cambridge)
15. A and В are two events and B' is the complementary event to B. Show that
Pr (A) is between Pr (A\B) and Pr (A\B'). (M.E.I.)
16. Prove that, if A and В are independent events, so also are A and B'. If
Alt A2, As are three mutually independent events such that
Pr (At nA'jnA'1)=pi (i φ у ф к),
then Pr04i) = ptKpi + λ),
where λ is a root of a certain cubic equation, which should be found.
17. A motorist driving along the main street of a town encounters two sets of
traffic lights 660 feet apart. Both sets of lights change every 30 seconds and the
second set are timed to change to 'Go' 10 seconds after the first set. The motorist
128

η MISCELLANEOUS EXERCISE 7
drives at 30 m.p.h. but if forced to stop at the lights, he accelerates uniformly from
rest at 4 ft/s2 until he attains that speed. Estimate the probabilities of his finding
(i) both sets of lights in his favour;
(ii) both sets of lights against him;
(iii) only one set of lights against him.
Suggest two ways in which the relative timing of the two sets of lights could
be adjusted to ensure that the probabilities would be the same if travelling in
either direction and calculate the probabilities.
(Assume throughout that his progress is not interfered with by other vehicles,
and also, that he drives at 30 m.p.h. right up to the lights and then stops
instantaneously if they are red at that instant.) (SMP)
18. Three players, Χ, Υ, Ζ, play a game and at each round of the game they have
an equal probability of winning that round. The player who wins each round
scores one point and the game is won by the first person to score a total of
3 points. X wins the first round; calculate the probability that Twins the game.
(M.E.I.)
19. The basic premium under a certain motor insurance policy is £A p.a. The
scale of No Claims Bonus provides 25 % discount after the first year without
claim, 33J% after two consecutive years free of claim and 50% after three or
more consecutive years free of claim. In addition a preferred policy holder's
discount of 20 % is given on the basic premium (which may already be discounted
by the No Claims Bonus) for four consecutive years without claim. If a preferred
policy holder's discount has been granted it is not lost by reason of a claim.
Suppose that for a given driver there is a constant probability q of no claim in
a given year. Evaluate the probabilities for the possible premiums in the sixth
year of the policy, in terms of q. (M.E.I.)
20. n2 balls, of which η are black and the rest white, are distributed at random
into η bags, so that each bag contains η balls. Determine the probability that at
least one bag contains no black ball.
129

8. Finite series and the Binomial Theorem
1. SEQUENCES, SERIES AND THE
Σ NOTATION
In Chapter 4, we saw that a function maps each element of the domain
into an element of the range. A function whose domain is the set of positive
integers, Z+ = {1, 2, 3, 4, ...}, is called a sequence and the elements of the
range are called terms of the sequence, f
For example, we could have as the terms of a sequence the numbers
1, 3, 5, 7, 9, ...,
where 1 (the first term) is the image of the integer 1; 3 (the second term) is
the image of the integer 2; 5 (the third term) is the image of the integer 3
and so on (see Figure 8.1).
12 3 4 r
13 5 7 2r-l
Fig. 8.1
It is common usage to refer to the range of a sequence simply as ' a
sequence'; thus we should say that the numbers 1, 3, 5, 7, 9, ... 'form a
sequence'. Other examples of sequences are
1, 2, 4, 8, 16, ...,
1, 4, 9, 16, 25, ...,
1, Ц, 2, 21, 3, ...,
1,V2, 2, 2V2, 4, ....
If we call the first term of a sequence щ, the second term щ and so on,
the rth term (general term) of the sequence will be denoted by ur. A sequence
itself is often written {ur} (provided this notation causes no confusion with
the set consisting of the single element ur). For example, the first few terms
t It is possible to extend the definition of a sequence to include Ζ as the domain, but
we shall restrict ourselves to Z+ in this book, or, occasionaly to Z+ U {0}, that is, the set
of natural numbers with zero.
130

1] SEQUENCES AND SERIES
of the sequence {r3-l} are 0, 7, 26, 80, 124, .... Again, the sequence
1, 3, 5, 7, 9, ... could be written {2r- 1}—but see Ex. 5, which illustrates
the danger of assuming that the form of the /th term is determined by the
first few terms of a sequence.
Sequences are sometimes defined by giving, say, the first term and a
relation between the (r- l)th and rth terms. For example, the sequence
defined by the relation _ , ι (i\
and the first term щ = 1
has, for its second term 2.1 + 1 = 3, for its third term 3.3 + 1 = 10,
for its fourth term 4.10+ 1 = 41 and so on.
*Ex. 1. What are the first four terms of the sequence given by
и, = /■«,_!+1,
(i) if«! = 0; (ii) ifUl = 2?
A relation such as (1), connecting general terms of a sequence, is called a
recurrence relation. As another example of a recurrence relation we might
have , ™
щ = иг_х + иг_г. (2)
To define this sequence {ur} we need two initial terms (why?). Thus, if
щ = 1, щ = 1
then (2) defines the sequence
1, 1,2,3,5,8, 13,....
Ex. 2. Write down the first four terms of the sequences defined by the following
recurrence relations and initial terms:
(i) ur = 2иг_ъ u1 = \; (ii) ur = «r_!+1, «i=l;
(iii) ur = rur_x + r\ Ul = 0; (iv) щ - Ur-г = i-O'r-i - и,-.),
иг = 1, и2 = 2;
Ex. 3. Write down the first six terms of the following sequences:
(i) {Ъг}; (ii) {3r-5}; (iii) {2-'}; (iv) {r(r*-1)};
(v) {l+(-l)'}; (vi) {l/Wr+l)]}; (vii) {f-»};
(viii) {r+(- 1)4/·+1)}.
Ex. 4. Suggest possible fifth and sixth terms for the following sequences, whose
first four terms are given, indicating how you arrived at them:
(i) {1,4,7,10,...}; (ii) {1, -hi, -*,·..};
(iii) {1,2,4,7,...}; (iv) {0,3,8, 15, ...};
(v) {1,3,6,11,...}.
131

FINITE SERIES AND THE BINOMIAL THEOREM [8
*Ex. 5. Why was the word 'suggest' used in Exercise 4?
Work out the first four terms of the sequence {/-4 - 10/·3 + 35/·2 - 50/·+25}.
Work out the fifth term of this sequence.
Can you see an easy way of constructing the general term of a sequence which
has the property that its first five terms are zero, but its sixth term is non-zero?
Given a sequence it is often necessary to obtain the sum of its first η
terms. If the first η terms of a sequence are written down and connected by
+ signs, we are said to have a series. For example
1+3 + 5+7 + ...+(2и-1),
l+i+i+i + ... + 1/л
are the series obtained respectively from the sequences {2r— 1} and
{iW·
If η is known, such sums may be found simply by obtaining the
necessary terms and adding them one by one, but the process is naturally
laborious. In this chapter we shall explore more sophisticated methods for
summing certain simple series, but, before doing so, it is desirable to
introduce a shorthand notation for sums, the Σ notation (read 'sigma
notation').
We write щ + щ + щ +... + un as Σ щ. Thus, for example,
r=l
Σ/·= 1+2 + 3 + 4= 10;
Σ (3r-l) = 2 + 5 + 8 + 11+14 = 40.
r=l
It is possible to assume that uT is defined by values of r φ Ζ+:
Σ0·2 + 1)= 1+2 + 5 + 10= 18;
Σ (r2+r + l)= 1+1+3+7 = 12.
r=-l
The general rule for finding Σ uT is to determine the value of uT for r = m
and all subsequent r, in steps of 1, until we arrive at un and then to sum all
the terms so obtained.
(0 If!;
(iv) Σ(-1)'ι·;
r=l
(vii)Jo/^.
132
(ii) ΣΟ-»-,);
(ν) Σ (ή + 2);
(iii) Σ/·(2'-1);
(νί) Σ (/·2-1);
г=-2

1] SEQUENCES AND SERIES
Ex. 7. Evaluate:
(ί) Σ&-+2); (ii) L·*; (iii) Σ ή;
(iv) Σ(ι--1)»; (ν) Σ ^; (vi) Σ(-1)·>»;
/1 1 \
(vii) Σ 2'; (viii) Σ
ЙГ
Ex. 8. Write in Σ notation:
(i) 1 + 2+3 + 4; (ii) 1-2 + 3-4;
(iii) 2+4 + 6+8; (iv) 1+2 + 4 + 8 + 16;
(v) l+i + i+i+i + i; (vi) 1.2 + 2.3 + 3.4 + 4.5 + 5.6 + 6.7;
(vii) 1-1 + 1-1 + 1-1 + 1-1; (viii) 1 +2 + 3+4+... + n.
*Ex. 9. Prove that, if {«,} and {vr} are two sequences, then
Σ («, + »,) = Σ uT+ Σ vT.
*Ex. 10. Prove that, if {«,} is any sequence and a a fixed number, then
Σ auT = α Σ иг.
♦Ex. 11. Interpret and evaluate Σ с, where с is fixed. (That is, find the sum of the
first η terms of the constant sequence defined by ur = c, all r.)
2. ARITHMETIC SEQUENCES
Sequences such as {1, 3, 5,7, ...},{\\,2\, 3|, 5, ...},{4, 1, -2, -5, ...}are
called arithmetic sequences, their distinguishing feature being that the
difference between any pair of successive terms is constant. If we call this
common difference d, and the first term a, then the arithmetic sequence
takes the form {a + (r- 1) d).
Ex. 12. Write down the /-th terms of each of the following arithmetic sequences:
{1,4,7,...}, {1, 2£, 4, ...}, {2,0,-2,...}.
Ex. 13. Find to which integer r the following terms correspond in the given
arithmetic sequences:
(i) 91 in {1, 3, 5, ...}; (ii) 139 in {3, 7, 11, ...);
(iii) -253 in {2, -1, -4, ...}; (iv) 61-7 in {3-2, 4-7, 6-2, ...};
(v) -67in{7i,6i, 5|,...}.
133

FINITE SERIES AND THE BINOMIAL THEOREM [8
Ex. 14. Taking what you consider to be the simplest continuation in each case,
write down, in Σ form, the sum of the first η terms of each of the following
series:
(i) 1.4.7 + 4.7.10+7.10.13+10.13.16+...;
(ii) 2.3.5 + 4.6.9 + 6.9.13 + 8.12.17+...;
(iii) 21 + 44 + 6' + 810+...;
,., 1.3 3.7 5.11 7.15^ .
0ν)ΊΓ + —+-r + —+ ...;
1 2 3 4
W 1.2.3 3.5.10 5.8.17 7.11.24'
The sum to η terms of the arithmetic series (sometimes called the
arithmetic progression and abbreviated A.P.) is easily obtained by the
following argument.
Consider first the sum of the first η positive integers, that is Σ r- Writing
this sum as S we have
S = 1+2 + 3 + .. .+n
and S=n + (n-l) + (n-2) + ... + l.
Adding: 2S = Oz + l) + Oz + l) + 07 + i) + ...+()7 + i)
= n(n+l).
Thus 5 = 1я(и + 1).
Now, for the general arithmetic series,
| [a + (r- 1) d] = Σα+άΣ r- | d (see Ex. 9 and Ex. 10)
= na + idn(n + l)-nd
= M2a + (n-l)d].
This method is easily applicable to any arithmetic series and depends only
upon memorizing the single important result
Σ^ = Μη + 1).
Example 1. The tenth term of a certain arithmetic series is -29 and the
twentieth term is -69. Find the sum of the first twenty terms.
Let the first term be a and the common difference d. Then
a + 9d=-2% a + l9d = -69,
giving a = 1, d = - 4. The rth term of the arithmetic series is thus
134

2] ARITHMETIC SEQUENCES
7 + (/·— 1) ( — 4) = 11—4/- and the sum of twenty terms is given by
20
Σ (11-4/-)= 220-2.20.21
= -620.
Exercise 8(a)
1. Evaluate:
(i) Z(r* + r); (ii) Σ -; (iii) Σ (l+r + r*).
r=l j=l Г r=-2
2. Write in Σ notation:
(i) 1.3 + 4.6 + 7.9+... (to я terms);
(ii) 1.2.3-2.3.4+3.4.5-4.5.6+... (to>zterms);
(iii) l2-32 + 52-72+ ... (to η terms).
3. Find the first four terms of each of the sequences defined by the following
recurrence relations and initial terms:
(i) ur = 3η^-ι+1, «i= 1;
(ii) «, + 2«Γ_!-3«Γ_2 = 0, «! = 1, «2 = 1;
(iii) щ = «r2_1; Ul = 2;
(iv) 2игиг_1-и2_1 = 3, «ι = 1.
4. If {иг} is an arithmetic sequence with common difference d and first term a,
show that «r satisfies the recurrence relation
uT = iir^ + d.
If sn = Σ и,, determine a recurrence relation involving only sn, sn_u j„_,
and </. What isij?
5. Find the rth term of the following arithmetic sequences:
(i) {3, 12, 21, ...}; (ii) {i, i, i, ...}; (iii) {12, 4, -4, ...}.
6. Find the number of terms in the following arithmetic series:
(i) 1+3 + 5 + ... + 101; (ii) 2+9 + 16+... +485;
(iii) 5 + 2 + (-l) + ...+(-82).
7. Sum each of the series in Question 6.
8. The fourth term of an arithmetic series is -14 and the tenth term is —44.
Find an expression for the rth term and hence obtain the sum of the first twenty
terms.
9. The first term of an arithmetic series is 9 and the sum of the first ten terms
is 450. Find an expression for the rth term and deduce an expression for the sum
to η terms. Check your final result by setting (i) η = 1; (ii) η = 10.
10. The ninth term of an arithmetic series is 24 and the sum of the first nine terms
is 126. Find an expression for the rth term and deduce an expression for the sum
to η terms.
135

FINITE SERIES AND THE BINOMIAL THEOREM [8
11. How many terms of the series 1 + 8+15+ ... are required to give a sum of
5500?
12. How many terms of the series 12 + 6 + 0+... are required to give a sum of
-3348?
13. Show that n n η
Σ(/·+1)2 = Σ/·2 + 2 Ъг + п
and deduce that Σ/· = \η(η+1).
14. If Σ sT = n2 + n, prove that {sT} is an arithmetic sequence, and find the first
three terms.
15. Prove that the sum of η terms of any arithmetic series may be written in
the form an2 + bn+c. Prove the converse of this result: that if
Σί, = an2 + bn+c,
then {sT} is an arithmetic sequence.
16. If a2, b2, c2 are the first three terms of an arithmetic sequence, prove that
(6 + c)"1, (c + a)"1, (a + b)-1 are the first three terms of another arithmetic
sequence (provided b + c + 0, c + αφθ, a + b * 0).
17. If 3 + x+y+29 is an arithmetic series, find χ and y.
If a+x1 + x2+... + xn + b are the (я+ 2) terms of an arithmetic series, find xT
in terms of a, b, r and n.
18. An arithmetic sequence whose first term is 8 has a common difference of i.
Find N, given that the sum of the first 2JV terms is equal to the sum of the next
N terms.
3. GEOMETRIC SEQUENCES
Sequences such as {1,2,4, 8, ...}, {3, - 6,12, - 24, ...}, {1-5, 0-75, 0-375...},
in which each term is a fixed multiple of the preceding term are called
geometric sequences. If we call the first term of the sequence a, and the fixed
multiple, or common ratio, ρ then the geometric sequence takes the form
Ex. 15. Which of the following sets of numbers could be the first three terms of
geometric sequences?
(i) {1,-5,25,...}; (ii) {2,2i,2tf,...} (iii) {1, 4, 9,...};
(iv) {1, -1-1,1-21, ...}; (v) {7, -1, -\,...}; (vi) {1, -1,1, ...}.
Ex. 16. Find to which value of r the following terms correspond in the given
geometric sequences:
(i) 256 in {2, 4, 8, ...}; (ii) 1458 in {2, 6, 18,...};
(iii) 10-30 in {10, 1, 01, ...}.
136

3] GEOMETRIC SEQUENCES
A geometric series {geometric progression, G.P.) is a series obtained from
a geometric sequence. The sum of the first η terms of such a series may be
obtained as follows:
Let S = Σ ар7-1.
If ρ = l.thenS = па.
If Ρ φ l.then c , , . , , n ,
S= a + ap + ap^ + .-.+ap71-1
pS = αρ+αρ2 + ...+αρη-1+αρη
.'. (l-p) S = a —apn.
Since 1 —ρ Φ 0 we may divide by (1 —p) to obtain the following important
reSullt: » /i „ч
l-p
Σ^-^^Ь^ O0*1)·
Since, for \p\ < 1, limp™ = 0 (consult a calculus text-book for the
definition of limits and the proof of this result), we may take
Дор^-а/а-р)
less than any positive number, however small, by taking a sufficiently
large n, provided \p\ < 1. Thus, if we write
lim Σ ар*'1 as Σ αΡτ~χ
we have Σ αΡτ~
(The sum to infinity of a geometric series.)
Example 2. Find the sum of money that must be invested to yield an
income of £50 at the end of that year, and at the end of each of the
subsequent nine years, reckoning on a return of 5% compound interest on money
remaining invested. {Assume that interest is payable once a year.)
For every £1 invested, its value at the end of one year is £1-05, its value
at the end of two years is £1·052 and so on. Thus, to obtain £50 at the end
of the first year, we must invest £РЪ where
>! x 105 = 50.
Again, to obtain £50 at the end of the second year, we must invest £P2,
Whefe P2xl05' = 50
and so on.
137

FINITE SERIES AND THE BINOMIAL THEOREM f8
The total sum to be invested is given by
P1+Pa + ... +P10 = 50[1 05-1 +1-05-2 +... +1-05-10]
= 1000[1-1/10510]
= 1000[1-1/1-6289]
= 386-2.
Ex. 17. Comment upon the number of decimal points used in the calculations in
Example 2 and on the accuracy achieved.
Exercise 8(b)
1. Find the sum of the following geometric series, simplifying your answers as
far as possible, but without evaluating terms of the form />":
(i) 1+4+16+... (12 terms); (ii) 1-2 + 4-8 + ... (20terms);
(iii) V2 + 2 + 2V2 + ... (16 terms); (iv) l+i + i + ... (50 terms)
2. The third term of a geometric sequence is 5 and the eighth is 160. Find the
first term and the common ratio.
3. Find two possible geometric sequences whose fourth term is 9 and whose
eighth term is 144.
4. Which is the first term of the geometric sequence 1, j, i, ... which is less than
io-8?
5. How many terms of the geometric series 1 +i+i+... must be taken to give
a sum greater than 2-10-6?
6. How many terms of the geometric series 1+% + tg + .-- must be taken to
give a sum greater than 4—10-3?
7. If x— 4, x, x+6 are the first three terms of a geometric sequence, find χ and
the fourth term.
8. How many terms of the series 1 +5 + 25 + ... must be taken to give a sum
greater than 1010?
9. A piece of paper is 01 mm thick (so that when it is folded over, the resulting
thickness is 0-2 mm). If a very large sheet of paper could be folded fifty times,
what would be the resulting thickness? (This is a variant of a famous problem;
another form it can take is to find how much corn is required if one grain is to be
placed on the first square of a chess board, two on the second, four on the third
and so on. The answers, when first met, are surprising.)
10. The sum of the first η terms of the sequence {sT} is i(3" -1). Prove that the
sequence is geometric, and write down the first three terms.
138

GEOMETRIC SEQUENCES
11. If £250 is invested at 4 % compound interest, reckoned annually, what will
be its value after six years ?
12. What sum of money must be invested to give an annuity, starting ten years
after investment, of £100 p.a. for five years, reckoning on 4 % compound interest
(calculated annually) on money remaining invested ?
13. Sum to infinity the following geometric series:
(i) 1+-ιττ + τππ+·..; (η) 3 + H + J + ...;
(iii) 1-i+J—....
14. A recurring decimal may be regarded as the sum of an infinite geometric
series; for example _i -i -i
021 = ϊδο+ιοοοο+ιοοοοοο+""
Use this approach to express 0315 as a fractionp/q.
15. If the population of a country increases by 3 % each year, how many years
will it take for the population to double?
16. If in = Σ ар'-1, evaluate Σ sr.
17. If {«,} is a geometric sequence with common ratio p, find a recurrence
relation involving ur and ιιτ_λ. If sn = Σ ur, show that sT satisfies the recurrence
relation
Sn-psn-i = a.
What is i,?
18. If au д2, д3, я4,... are terms of a geometric sequence with common ratio/-,find
the sum of η terms of the series
(aj - Да)2 + (да - α3)2 + (α3 - β^)2 + ....
4. FURTHER SUMMATION; THE USE OF
DIFFERENCES
By factorizing the algebraic expression r{r+1) (r + 2)-(r- 1) r{r+\) we
seethat r(r+l) s £[,(,+1} (r + 2)_(r_i)r(r+i)].
Now suppose we let r take successively the values 1, 2, 3, ..., η in this
identity. Writing the resulting equalities one below the other we have
1.2 = HI.2.3-0.1.2],
2.3 =i[2.3.4-1.2.3],
3.4 = i[3.4.5.-2.3.4],
{n-\).n = \[{n-\)n{n+\)-{n-2){n-\)ri\,
n.(n+l) = i[n(n + 1) (n + 2) - (n-I) n(n +1)].

FINITE SERIES AND THE BINOMIAL THEOREM [8
If all the terms on the left-hand side are added together we have precisely
Σ /■(/" +1); if all the terms on the right-hand side are added together, we are
left with -i 0.1.2 at the beginning and %n(n+l) (n + 2) at the end, all
the others having cancelled out in pairs. Thus
jtr(r+l) = in(n+l)(n + 2).
Ex. 18. Show that
1 1_
~~r r+1
and deduce the result Σ ——— = 1 -.
r=ii-Cr+l) л+1
More generally, if the rth term of the sequence {sr} can be expressed as
the difference between the (r+ l)th term and the rth term of a new
sequence {ur}, that is, if we can find a new sequence {ur} with the property
that, for all r,
then Σ sr = Σ («r+1 - «г) = "и+1 - «ι·
In the example given above, sr = r(r+1) and uT = i(r — l) r{r+1);
in Ex. 18,
sr = λ and ur = -Ijr.
This technique for summing series is called the method of differences; its
use depends, of course, upon our ability to discover the new sequence {ur}.
*Ex. 19. Show that
r(r+Y)(r + 2) = MKM-l)(M-2)(r + 3)-(r-l)r<r+l)(r + 2)]
and deduce that
Σ/·(/·+1)0-+ 2) = Ыя+1)(я + 2)(я+3).
*Ex. 20. Show that
rjr
-1 = !Г_! L__1
+ l)(r +2) 2L/C- + 1) (r+l)(r + 2)J
and deduce that
Ά 1 1 1
r=1K»-+l)(i4-2)"4 2(л+1)(л + 2)·
♦Ex. 21. Prove that
Σ/-0-+1) (/- + 2) (/- + 3) = Ы«+1) (л+2) (л+3) (л+ 4)
and evaluate L(r+l)(r + 2)(r+ 3) (r + 4).
140

4] FURTHER SUMMATION
*Ex. 22. Prove that
Ά 1 = J_ 1
r_i r(r+ l)(r + 2) (/- + 3) 18 3(л+1)(л+2)(л + 3)
It is sometimes worthwhile to manipulate sr into a slightly different form
before applying the method of differences.
Example 3. Evaluate n 1
r{r + 2) r(r + l)(r + 2) (/■+l)(/4-2)"rr(/- + l)(/4-2)·
Then
v^__ = f; 1 +v J
r-ir(r + 2) r=i{r+\){r + 2yr^ir{r+\){r + 2)
= Si lr + Ί ~7+2} + iSx \J{r + V)~(r + l)(r + 2)j
= Ь"^+2]+2Ь"(й+1)(й + 2)]
4 2(л + 1)(л + 2)·
(Note the check by putting л = 1.)
The method of difference is thus seen to constitute a powerful technique
for summing series. The reader should commit the method to memory,
especially the particular approach used for the series given at the
beginning of this section and in Ex. 19 and Ex. 20. We shall complete this
section by using the results already proved to deduce two more important
sums:
(i) j>2 = K"+ О (2л +1); (π) if = \n\n + lf.
Proof of (i): Since
r2 = r(r + l)-r,
Д'2=|К' + 1)-Д>-
= Mn + l)(n + 2)-in(n + l)
= £л(л+1)(2л + 1).

FINITE SERIES AND THE BINOMIAL THEOREM
Proof of (ii):
r(r+l)(r + 2) = r* + 3r* + 2r
and so r3 = r(r + l)(r + 2)-(3r2 + 2r)
= г{г+\){г + 2)-Ъг(г+\) + г.
Thus Σ rs = | r(r+l) (r + 2)-3 Σ /■(/■ + !)+ | '
= *л(л + 1)(л + 2)(л + 3) -л(л + 1)(л + 2) + *л(л + 1)
= £л2(л + 1)2, on simplification.
(Observe that Σ /-3 = ί Σ Α Λ
Example 4. Evaluate n
γΣ (/■+!)(«+/■).
Д(/■ +1) (л + г) = Σ/'2 + г(л +1) + л)
= Σή + (η+1)Σ? + η\ byExs. 9, 10, 11
= £л(л + 1)(2л + 1) + -£л(л + 1)2 + л2
= |л(5л2 + 15л + 4).
1. Sum the following series:
(i) l2 + 22 + 32 + ... + 122; (ii) 13 +23 + 33+... +123;
(Ш) 32 + 42+52 + ... + 202; (iv) l2+32 + 52 + 72+... + (2/z-l)2.
2. By expressing r(r+1) (/-+ 3) in the form /■(/■+1) (/·+ 2) +r(r+1), sum the series
Σ/·(/·+1)0-+ 3).
Find the value of
3. Find
4. Find
5. Find
142
Σι-(τ+1)(τ + 4).
Σ(ι* + ι- + Ζ).
ДКл-г)1.

4] FURTHER SUMMATION
6. Find Σ(/·2 + 2τ)
(i) by expressing it as Σ r2+ Σ lr\
(Η) by expressing it as Σ (/· +1)2- Σ 1.
7. Show that, for;· > 0,
___r = 1 1_
(r+l)(r+2)(r+3) (r + 2)(r + 3) fr+l)(r + 2)(r+3)·
Hence evaluate £__£__.
8. Evaluate ^5 __^__.
9. Evaluate Д___^_.
10. Find the sums to я terms of the following series:
(i) 1.4 + 4.7 + 7.10+10.13 + ...;
(ii) l2.2 + 22.3 + 32.4 + 42.5 + ...;
(iii) 1.2.3 + 4.5.6 + 7.8.9 + 10.11.12+...;
(iv) l2.2.3 + 22.3.4 + 32.4.5+42.5.6+....
11. Show that
x(x+ 1) (2x+1) ξ Ax{x+ 1) (x + 2) (χ + 3) + β(χ- 2) (χ-1) x(x+1)
for certain constant values of A and B, and find these values. Hence find the sum
of the first η terms of the series
1.2.3 + 2.3.5 + 3.4.7 + 4.5.9 + .... (O&Q
12. Prove that
(2я+1)(2и + 3) _ (2я-1)(2я+1)_ 2(2я+1)
(n+D&i+Z) nin+\) ~n(n + l)(n + 2Y
Find the sum of the first η terms of the series
ГЬ + 2Х4 + 3-Ь+- (0&C>
13. Deduce Σ τ·2 from the identity (2r+l)3-(2r-l)3 = 24r2 + 2.
14. Evaluate Σ - „ ■■
15. Prove that the sum of the squares of the first η even numbers exceeds the
sum of the squares of the first η odd numbers by n(2n + 1). Hence, or otherwise,
find the sum of the squares of the first η odd numbers. (O & C)

FINITE SERIES AND THE BINOMIAL THEOREM [8
5. THE BINOMIAL THEOREM
By ordinary multiplication of algebraic expressions
(x + a)1 = x + a
(x + af = x2 + 2xa + a2
(x + af = xs + 3x2a + 3xa2 + as
(x + af = х* + 4х3а + 6х2а2 + 4ха3 + а*
{x + af = x5 + 5xia+10x3a2 + l0x2a3 + 5xai + a5
Notice the following features possessed by these expressions:
(i) the number of terms in the expansion of (x + a)n is η +1;
(ii) the degree of each term in the expansion of (x + a)n is n;
(iii) The coefficients of the terms in the expansion of(x+a)n are given by
the nth line of Pascal's triangle (Blaise Pascal, 1623-62); see Figure 8.2.
1 1
1 2 1
13 3 1
14 6 4 1
1 5 10 10 5 1
Fig. 8.2
Each line of Pascal's triangle starts and finishes with a 1; terms in
between are obtained by adding together the two terms on either side of it in
the row above.
*Ex. 23. Supply the sixth, seventh and eighth lines of Pascal's triangle.
Ex. 24. Check that the expansion of (x + a)6 satisfies conditions (i), (ii) and (iii)
above.
Ex. 25. Show that, if we add the 'zeroth' line, 1, to Pascal's triangle, the
expansion of (x + a)° satisfies conditions (i), (ii) and (iii).
By direct verification as above, we may show that conditions (i), (ii) and
(iii) hold for all suitably small values of η; the Binomial Theorem can be used
to show that they are true for all positive integers n. However, before
proceeding to the general theorem, we shall solve an example by means
of Pascal's triangle.
144

5] THE BINOMIAL THEOREM
Example 5. Obtain the expansion of(2x—3y)s. Hence obtain an approximate
value for 199-75, and comment upon the accuracy of the result.
From the fifth line of Pascal's triangle
(2x-3yf = (2хУ + 5(2хУ(-Зу) + Ю(2хУ(-Зу)* + 10(2х)*(-ЗуУ
+ 5(2x)(-3yy + (-3yf
= 32x5-240xiy + 120xY-1080x2ys + 8Wxyi-243f.
If we put χ = 102, у = 10-\ then (2x-3yf = 199-75. Hence,
199-75 = 3 ·2 χ 1011 - 2-4 χ 109 + 7-2 χ 106 ...
and the last three terms will not affect the first six significant figures.
Direct calculation of the line above shows us that
199-75 = 3-17607 χ 1011, correct to six significant figures.
Ex. 26. Write down the expansions of
(i) (Зх-уУ; (ii) (.2х + уУ; (ίίί) (2χ-1)'; (ίν) (3x + 2yf.
The Binomial Theorem states that, for any positive integer n,
(x + a)n = xn + ("\ x^a + Q x^a2 + ··· + (") *η~ΤαΤ + ■ · · + β"
= Σ (") xn~rar. (Recall that (?\ = l\
Proof. (x + a)n = (x + a) (x + a) (x + a)... (x + a). [n brackets in all]
Since, in forming the product on the right-hand side, one term is chosen
from each bracket, each term in the expansion must be of the form
xn~TaT (choose (n — r) terms x, and r terms a). Now the term xn~TaT may be
obtained by selecting any r brackets and choosing the a term and then
choosing an χ from each of the remaining (n — r) brackets. But there are
I I different ways of doing this and so the coefficient of xn-rar is I 1. The
proof of the Binomial Theorem is now complete.f
We make the following observations:
(i) Since r runs through all integral values from 0 to n, there are (n +1)
terms in the expansion.
(ii) The degree of each term is (n-r) + r = n.
t An alternative proof of the Binomial Theorem is given in Chapter 9.3.
145

FINITE SERIES AND THE BINOMIAL THEOREM
"(/■■
("-1)! [J-+r\
-l)l(n-r-l)l\_n-r+r\
= (")■
the coefficients in the expansion of (x + a)n are connected to those of the
expansion of (* + a)™-1 in precisely the same way as was previously
observed for Pascal's triangle.
M (":;h;'K)
is of some importance and will be used later; it is generally known as
Vandermonde's Theorem.
Example 6. Obtain the first three terms in the expansion of (1 - 2*)20 in
ascending powers of x.
(1-2*)» = (l)» + (^) (1)»(-2χ)4-(*) (l)18(-2*)2
= l-40* + 760*2....
Exercise 8(d)
1. Write down the expansions of
(i) (x-yf; (ϋ) (2χ+№;
(iii) (1+2*)'; (iv) (2x + 6yf.
2. Find the coefficient of *2 in the expansions of each of the following expres-
(i)'(3*+l)4; (ii)(2-*)'; (iii)(l+2*)12; (iv) (3-4*)°.
3. Write down the first three terms in the expansions of each of the following
expressions in ascending powers of *:
(i)(l+2xf; (ii)(l+!*)"; (iii) (2-*)'»; (iv) (3*-*)'.
4. Write down the general term in the expansion of

5] THE BINOMIAL THEOREM
Hence determine the constant term in this expansion (that is, the term not
involving x).
5. Determine the constant term in the expansion of
/ , M6
6. Find correct to five significant figures, the values of
(i) (1-01)6; (ii) (0-998)8; (iii) (9-99)s; (iv) (2-98)s.
In each case, justify the accuracy of your result.
7. Find, without using tables, the value of
(2 + V5)4 + (2-V5)4.
8. Write down the coefficient of a3x7 in the expansion of (a + x)10. Find the
coefficient of а3Ь*с3 in the expansion of (a + b + c)10.
9. Expand in ascending powers of x, as far as terms in x2, each of the following
expressions:
(i) (1+x + x2)4; (ii) (1-x+x2)6; (iii) (1 +2x-x2f; (iv) (2-x-2x2)6.
10. If a device measures linear dimensions with an error of less than \%, find
the possible percentage margin of error if it is used to measure the volume of a
cube.
11. Find the sum of the coefficients of the powers of χ (including x°) in the
expansion of (l+x)n.
If(l+x + x2)4 = ao + аг χ + α2χ2+... + αβ Xе, evaluate
(0 Σα,; (Η) Σ(-Ι)'*.
12. Prove Vandermonde's Theorem by considering the identity
(1+x)"*1 = (l+x)(l+*)".
Prove that
, +2
-1/
and suggest a generalization of this result.
Miscellaneous Exercise 8
1. Find (i) the sum, (ii) the product, of the first η terms of the geometric sequence
{1,3,9,...}.
2. The first term of a geometric series is 18 and the sum to infinity is 20. Find
the common ratio and the sum of the first six terms. Find also in its simplest form
the ratio of the /Jth term to the sum of all the subsequent terms of the infinite
series. (J.M.B.)
147

FINITE SERIES AND THE BINOMIAL THEOREM [8
3. In the expansion in powers of χ of the function
(1+х)(я-6х)12
the coefficient of x8 is zero. Find in its simplest form the value of the ratio a/b.
(J.M.B.)
4. Write down the sum of the geometric series
l+x + x2 + x3+...+x".
Deduce the sum of the series
l+lx + Sx' + .-.+nx"-1
by differentiating, with respect to x, the expression for the sum of the original
series.
Find the sum of the series
\2 + 22х + Ъ2хг + ...+п2х"-\
5. Use Vandermonde's Theorem to evaluate
6. Numbers C1; C2, C3,... are defined as follows:
C, = 2, C„+1 = 2(C, + C2+... + C„)
for η = 1, 2,.... Find C2, C3 and C4 and prove that the numbers C2, C3, C4,...
form a geometric progression.
Find the sum of η terms of the series
C1+C2+C3 + .... (O&C)
7. The numbers я0, аъ ..., a„; c0, съ ..., οη_λ are the coefficients in the two
expansions ,, , . , , , , ,
(l+x)" = a0 + a1x + a2x2 + ...+anxn,
(1 + x)"-1 = с0 + с1л: + с2х2 + ... + с,!_1л:'!-1.
Prove that:
(a) a0 = c0 and a„ = c„_j;
(b) Сг-г + Сг = Or for 1 m r *ζ n-1;
(c) rcir = «cr_j for I *Z r *ζ n.
Use these results to show that
л. α- '"+1
^-^_!+...±йо = Or+1
ior 0 < r < n-\. (O&Q
8. Integers a, b, d are connected by the relation a = b + d. By using the binomial
expansion of (6+d)n, where я is a positive integer, show that a" — Ь^ф + nd) is
exactly divisible by d2.
Replace b by a-d in this result, and hence show that if a is the first term,
d the common difference and / the nth term of an arithmetic progression, then
an — (a—d)n~11 is exactly divisible by d2.
Show that 510—214 is exactly divisible by 9. (Cambridge)
148

5] MISCELLANEOUS EXERCISE 8
9.Provethat χ +2 + 3 + ...+я = Ып + 1).
Prove also that
12-22 + 32-42 + ...+(2л + 1)2 = (я+1)(2я+1).
Show that, when m is any positive integer, odd or even, the sum of the first m
terms of the series
ist-.ir-HwOw + l). '" (O&Q
11. Show that
04-,)·
' considering the identity
(1+χ)2™(1-χ)2™ξ (1-х2)2™,
ethat limy I2m\2 /2w\2 /2w\2 ,
(θ)-(ΐ)+(2)-· + (2,)=(-
l} (W!)2
12. Prove that, when я is a positive integer and α φ 1,
1+(1+я) + (1+я + я2)+...+(1+я + я2
Show that the sum may also be written
л + (л-1)я+(л-2)я2 + ... + яя-1
and deduce from this the sum to η terms of the series
l+2Z> + 3Z>2 + ...+(>z-l)Z>"-2 + >zZ>"-1. (O&Q
13. Find, in binary form, the square of the number whose expression in binary
notation consists of the r digits 1.
By putting m = n, deduce that the sum of the product in pairs of the first η
integers is , , . ,. ,„
A«(«2-l)(3« + 2).
15. Verify that r\r +1)2 - (/· -1)2 r2 = 4r3
and deduce that 13 + 23 +... + rfi = \n\n +1)2.
Prove that l3 + 33 + 53+...+(2n-l)3 = я2(2л2-1)
and find the sum of the cubes of all odd numbers less than 50 that are not
multiples of 5.

FINITE SERIES AND THE BINOMIAL THEOREM [8
16. Prove that 12 + 22 + 32+...+л2 = \η(η + \)(2η + ΐ).
A non-degenerate triangle is to be made from three rods chosen from 2n
straight rods whose lengths are 1, 2, 3, ..., 2n units respectively. If the length of
the longest side is 2r units, show that (/- — l)2 such triangles can be made.
Deduce that the total number of possible triangles whose longest side is an
even number of units is ^_ ц ^ _ J} (Jmr)
17. The salary scale for certain employees begins at £630 a year and rises to a
maximum of £855 by fifteen annual increments of £15 each. After the maximum
has been reached the annual salary remains constant. Find expressions for the
total amount received by an employee in η years (i) when η < 16; (ii) when
n> 16.
Find the value of the second expression when η = 24.
It is proposed to revise the salary scale so that it begins at £720 a year and
rises to a maximum by 10 equal annual increments. If each annual increment is
£d find an expression for the total amount to be received in salary by an employee
in η years when η > 11.
Find the integral value of d which would make the total amount to be received
in twenty years on the new scale as nearly as possible equal to the total amount
received in twenty-four years on the old scale. (Cambridge)
18. The series of natural numbers is grouped as follows:
(1), (2,3,4), (5,6,7,8,9), ...
(i.e. each bracket contains two integers more than the preceding bracket).
(i) Find the total number of integers in the first (n— 1) brackets.
(ii) Show that the first number in the nth bracket is n2-In + 2.
(iii) Show that the sum of the numbers in the nth bracket is n3 + (n -1)3.
(iv) If the first number in the nth bracket is denoted by a and the first number
in the (n+ l)th bracket is denoted by b show that the sum of the numbers in the
nth bracket is exactly divisible by (b - a) and that the quotient is an odd number.
(Cambridge)

Revision exercise A
1. If the expressions
x3-(a + 2)x+2b and 2x3 + ax* - 4x - b
have a common factor x + 3, find the values of a and b, and find a second
common factor. (O&C'O')
2. Find the mid-point of the line joining the points (-4, 2) and (2, -6), and
show that the equation of the perpendicular bisector of this line is 3x-4y = 5.
Prove that the point (3, 1) is one vertex of a square of which the points (—4, 2)
and (2, — 6) are opposite vertices, and find the coordinates of the fourth vertex.
(O&C'O')
3. You are given that the equation
Зх*-9* + 5 = 0
has a root approximately equal to 1. By substituting χ = 1+A, and neglecting
A2 and higher powers show that χ = lg is a closer approximation to this root.
4. (i) The fourth term of an arithmetic series is 55, and the tenth term is 45.
Calculate the sum of all the positive terms.
(ii) Find, correct to one decimal point, the sum of the first fifteen terms of the
geometric series χ +(1.02) + (1.02)2 + (1.02)3 + .... (0 & c <0>)
5. In how many distinct ways may the letters of the word syzygy be arranged?
In how many of these do the three Ys appear together ?
(What does the word syzygy mean ?)
6. Without using tables, prove that tan 60° = V3.
A rectangle ABCD lies in a horizontal plane and DE is a line of length h
drawn vertically upwards from D; EA, EB, EC are joined. If the lines AE, CE
make angles of 60°, 45° respectively with the horizontal, express AD and CD in
terms of h, and hence calculate the angle which BE makes with the horizontal.
(O&C'O')
7. Find the equation of the line of gradient — 1 which passes through the point
of intersection of the lines x + 3y- 5 = 0 and 2x-y-7 = 0.
8. If x2 + 2x+2 is a factor of x4 + ax2 + b, find a and b. Express x*+ 16 as the
product of two quadratic factors whose coefficients are real numbers.
9. ABC is a triangle: points Q, R are taken on AC, AB respectively so that
AQ = 2QC, 2AR = RB.
Express AB, AC in terms of RQ, ВС.
10. If \π < χ < π and sin χ = §-, find (i) cos x, (ii) tan x, (Hi) cosec x.
151

REVISION EXERCISE A
11. Write down the expansion of (1 + xf in ascending powers of x. If
(l+x + x2)s = l+ax+bx*+...,
find a and b.
12. Given that (2x+1) is a factor of the expression
6лг> + ял:2 + 6л:+1
find a, and hence find the other two linear factors.
13. Three integers (not necessarily distinct) are chosen at random from the set of
integers {1, 2, 3, ..., 10}. What is the probability that their sum is 10? What is
the probability that their sum is 10 or less?
14. By considering a suitable rhombus of side 1 unit, show that, if sin θ = a,
then sin 20 = 2oV(l-a2)·
Deduce that _
Sinl5° = ^V2"·
15. Show that 4r3 = [r(r+l)Y-[r(r-l)f.
Hence evaluate Σ r3.
16. How many terms are there in the expansion of (α + 6 + c)6?
Can you generalize your result for (a+ b + c)nl
17. Show that it is not possible to find a number which is written abc in the
scale of 4 and bca in the scale of 5.
18. Two points, Ρ and Q, on the circumference of a circle of radius 10 cm are
such that PQ subtends an angle of 1 radian (57° 180 at its centre. State the length
of the minor arc PQ of this circle, and calculate the length of the chord PQ.
If the above circle is drawn on the surface of a sphere of radius 15 cm, find
the angle which the chord PQ subtends at the centre of the sphere, and hence
find the length of the minor arc Ρ β of the great circle of the sphere which passes
through Ρ and β. (O&C'O')
19. I take my wife out to buy her a new hat. There is a probability of те that
I shall approve of the first hat that we are shown, but if I approve, there is only a
probability of jo that my wife will agree with my choice. If we both like the hat,
I purchase it. If I dislike the first hat that we are shown, there is a probability of
τ that my wife will like it. If she does like it she naturally overrules my choice
but, when I hear the price, there is a f probability that I shall veto the purchase.
What is the probability that I buy her the first hat we are shown ? If I do buy her
the hat, what is the probability that we both like it ?
20. Prove that, if я is an integer, none of the numbers 7я + 3, 7я + 5, 7я + 6 can
be a perfect square.
21. Find the equation of the plane through the point (1, 1,1) containing the line
x-3 _y^X_ J_
5 ~ 2 ~ -4'
Find also the direction cosines of the line of intersection of this plane with the
plane Oxy.
152

REVISION EXERCISE A
22. The set S consists of all points in a plane whose coordinates relative to a
given pair of perpendicular axes are both integers (in the units used). If А, В, С
are members of S, prove that the area of the triangle ABC is a rational number
of square units.
23. Eliminate θ between the pair of equations
a sec θ + b cosec θ = c,
a' sec Θ — Ъ' cosec θ = с'.
24. Solve the inequality
and illustrate your answer by sketching the graph of
- *-l
У ~ x2-3x-4'
25. Evaluate Σγ(γ+1) (2/- + 3).
26. Л = {xeR:-3 < χ ^ 2}, В = {xe R: -2 ^ χ < 3},
С={д;ей:1<4
Express, in the form {:}, the sets
A(]B, Aft C, A'(\B, A ft В ft C, A\)(Bft C").
27. Show, by drawing a rough sketch, that the equation
χ = 2 cos л:
has just one positive solution.
By drawing an accurate graph, estimate this value of χ as accurately as you
can.
28. A five digit number (that is, a number lying between 10000 and 99999
inclusive) is selected at random. What is the probability that it contains either the
digit 8 or the digit 9 (or both)?
29. Show that the plane containing the points with position vectors
i-2j, i + j+k, 2i-j + 2k
has vector equation
г = i-2j + A(3j + k)+/<i + j + 2k).
Find the position vector of the point of intersection of this plane with the line
г = 3i + 5j-5k + K2i + 3j-k).
30. a, b are rational approximations to the irrational numbers -JA, ^B. An
approximation to j(AB) is calculated by using the formulae:
(i) JiAB) » ab; (ii) J(AB) » %(a + bf -(A + B)];
(iii) J(AB) и ЩА + В)-(а-ЬП
The errors in the three cases are respectively Elt Ег, Е3. Find a relation connecting
Ei, E2, E3.
153

REVISION EXERCISE A
31. A bag contains three white and four red balls, another bag contains four
white and three red balls. One of the bags is selected at random and two balls
are drawn from it. If both balls are red, what is the probability that, of the five
balls remaining in the selected bag, two are red?
32. A and В are two points with position vectors a, b relative to a given origin O.
Prove that the equation of the internal bisector of the angle АО В is
r = A(a + 6).
(Recall that a is a unit vector in the direction of a.)
What is the equation of the external angle bisector ?
Find the position vector of the incentre of the triangle ABO in terms of
a = |a|, 6 = |b|, с = |AB| and the unit vectors a, b. (The incentre of a triangle
is the point of intersection of the bisectors of the angles of the triangle.)
33. Evaluate the expressions
(0 Vfl+VU+VO + ···)}]; (n) Vf2 + V(2+V(2+ ·.·)}]·
If η is an integer, when is an expression of the form
V[k+V{«+V(«+-)}]
a rational number ?
34. Two cards are missing from a pack. If three cards are drawn at random from
the remaining fifty (without replacement) and all are found to be aces, what is the
probability that, if a fourth card is drawn, it will also be an ace ?
154

9· Mathematical induction
1. A NOTE ON MATHEMATICAL PROOFS
In a mathematical proof we proceed from one step to the next by processes
of deductive logic; that is, each step is logically implied by the previous
step or steps of the proof. In other words, a mathematical argument has
a direction. The first step must depend upon certain premises or known
theorems which may or may not be stated explicitly. For example, in the
traditional form of proof for a geometrical problem, the premises (data)
are usually stated before the proof commences and the argument leads
from this statement to the final deduction, using previously proved results
of elementary geometry. If, however, we are asked to prove that the
probability of a double with a throw of two unbiased dice is |, we must commence
by asserting that the set consisting of the thirty-six pairs of numbers
(1,1), (1,2), ...,(1,6), (2,1),(2,2), ...,(6,6) constitutes a possible outcome
space and that we assume that each of these elementary events has a
probability of -£$. ('The dice are unbiased.') We are then able to deduce
the required probability.
Since a mathematical argument has a direction, we shall frequently
meet the situation in which a sequence of statements p, q, r are strung
together in the form ' statement ρ implies the statement q which, in turn,
implies the statement r and so on'. The notation
ρ => q (read 'p implies q')
has already been used; its use in mathematical proofs can lead to
conciseness of expression.
Example 1. Prove that x2+y2 + z2-yz-zx-xy ^ Ofor real x, y, z.
Ε = x2+y2+z2-yz-zx-xy
^2E = (y-zf + iz-xy + ix-y)2
=> IE > 0, for real x, y, z,
=> Ε ^ 0, for real x, y, z.
The reader must always be on his guard, when writing ρ => q, that this
is indeed what he means: a genuine deduction from the immediately
preceding statement is implied. One of the principle sources of error among
б PPM 155

MATHEMATICAL INDUCTION [9
beginners attempting to prove mathematical results is to confuse the
implication ρ => q with the implication q => p. ρ => q means that, whenever
ρ is true, q is true; it does not tell us anything about the truth of the
statement ρ if q is true. (We shall not discuss the meaning of the
statement/7 => q in the case when ρ is false; the bibliography refers to books in
which problems concerning the use of implication are discussed at length.)
Ex. 1. 'If -1 > 0, then (-1)2 > 0, which is true, and so -1 > 0.' Comment.
Ex. 2. A class is asked to prove that, if a/b = c/d, then
(a + b)j(a - b) = (c + d)/(c - d) (a, b, c, d unequal and non-zero).
Criticize the following proof:
=> ac + bc-ad-bd = ac+ad-bc-bd
=>2bc = 2ad
=> a/b = c/d
which is true and so the original proposition is true.'
Ex. 3. Criticize the proof of the identity cosecM tanM -1 = tanM given below:
'If cosec2 A tan2 A-1 ξ tan2 A, then
1 sinM
-r-rz —τ—. = 1 + tan2 A = sec2 A
sin2 A cos2 A
which is manifestly true, since
= sec A.'
cos A
If the implication ρ => q is reversible, that is, if ρ => q and q => ρ both
hold, then we may write
poq (read 'p implies and is implied by q').
Example 2. Solve the equation J(2x +1) + Jx = 5.
V(2x+1) + V*=5
=>2x+l+x + 2VM2x+l)] = 25
о 2VM2x+l)]=-(3x-24)
=> 4(2x2 + x) = 9x2 - 144л; + 576
о х2-148л:+576 = 0
о (х-4)(х-Ы4) = 0.
156

1] A NOTE ON MATHEMATICAL PROOFS
Thus V(2x+1) + V* = 5 => (x-144) (x-4) = 0, but two of the steps of
the argument are not reversible, and so it is not possible to infer that
(х-144) (x-4) = 0 => V(2*+l)+V* = 5· To express it another way, if
χ = a is a root of V(2x + 1) + V* = 5, then it is certainly a root of
(x-144) (x-4) = 0,
but, if χ = α is a root of (x-144) (x-4) = 0 it may not be a root of
V(2x +1) + y/x = 5. To complete the solution of the given equation we must
substitute back the two possible solutions χ = 4 and χ = 144. It is then
seen that χ = 4 is a root of the original equation, but that χ = 144 is not.
Ex. 4. In Example 2, how do you know that χ = 4 is the only root of the equation
V(2x+1) + V* = 5?
Ex. 5. Solve Question 6, Exercise 1 c, using the implication signs, and explaining
carefully which steps are not reversible.
If a result is stated in the form ρ => q, the converse result (if it holds)
is q => p. For example, if ρ and q are defined as follows:
ρ is the statement 'the triangle ABC has AB = AC,
q is the statement 'the triangle ABC has LB = LC\
then a well-known theorem of elementary geometry asserts that ρ => q.
The converse theorem, q => p, is also true and the two theorems may be
combined together in the single two-way implication ρ ο q.
It is by no means always the case that, if a theorem is true, then its
converse is also true. Indeed, if a theorem takes the form
ρ and # => r
a converse is not clearly defined. In the case in which a theorem ρ => q
and its converse q => ρ both hold, we may use the two-way implication
-o- in its formulation; alternatively, we may use the phrase 'if and only if
—sometimes abbreviated to 'iff'.
In proving the truth of a two-way implication it is vital to remember
that two separate proofs are needed. For example, referring again to the
proposition about the triangle ABC mentioned above, we
(i) assume AB = AC and deduce that
LB = LC (AB = AC => LB = LC
or alternatively, AB = AC only if LB = ZC);f and
t From the purely linguistic points of view, the words 'only if are somewhat
ambiguous; mathematically, however, no ambiguity can arise if we define 'p only if q' to
mean lp => q\
6-2 157

MATHEMATICAL INDUCTION [9
(ii) assume ΔΒ = AC and deduce that
AB=AC {Z-B = LC => AB = AC
or, alternatively, AB = AC if LB = Z-Q.f
Ex. 6. The triangle ABC is right-angled at A if and only if ВС2 = СА2+АВ*.
What would you assume if asked to prove the Only if part of this proposition?
Ex. 7. Correct the following statement: "The integer N (expressed in the denary
scale) is divisible by 5 if and only if the units digit of N is 5.'
Ex. 8. Correct the following statement: 'Two vectors a and b are equal if and
only if |a| = |b|.'
Ex. 9. Is it true to say that, if χ > 0 then ax > x2 only if a > χΊ
It is often necessary to disprove an implication; that is, to show that
the truth of statement ρ does not imply the truth of statement q. In this
situation we write ρ φ> q (read 'p does not imply q'). Since ρ => # means that,
in all cases in which ρ holds, q holds too, to show that ρ ή> q we have
merely to exhibit one case in which ρ holds and q does not hold (a counter
example). For example, if we have
ρ is the statement 'л: is of the form (6n + 1) π/3, η integral',
q is the statement 'sin л: = sin 2x'
it is a fairly straightforward matter to prove that, if ρ is true, then q is
true; that is, p=> q. To disprove the converse, q => p, we have simply
to find a counter example, e.g. χ = 0, which certainly satisfies q, but is not
of the form (6n ± 1) тг/3.
Ex. 10. Prove that χ = (6« ± 1) π/З => sin χ = sin 2x.
Ex. 11. If ρ is the statement '« is an odd number' and q is the statement 'an
integer к can be found so that η = 4k +1' prove that q => ρ and disprove the
converse result ρ => q.
To complete this section we mention one final method of proving that
the implication ρ => q holds. If a statement ρ is modified by the addition
of the word 'not', a new statement 'not p', written p' (sometimes ~ p)
is obtained. For example, if ρ is the statement 'the integer η is divisible
by 3' p' is the statement' the integer η is not divisible by 3'. p' is called the
negation of p.
t Another way of expressing implications is by using the phrases 'necessary condition'
and 'sufficient condition'. A necessary condition for ρ is q means that.? => q; a sufficient
condition for ρ is q means that q => p. A necessary and sufficient condition for ρ is q means
that ρ oq.
158

11 A NOTE ON MATHEMATICAL PROOFS
The implication ρ => q is equivalent to saying that we cannot have ρ
true without q being true or, using negatives, if q' is true, then p' is true;
in terms of implication, q' =*■ p'. The argument also works in reverse and
thus the two implications ρ => q and q' => p' are equivalent. Expressed
more succinctly . . . , ,.
(p=>g)o (я => p)-
The mathematical value of this equivalence is that it is often easier to
prove the implication q' => p' rather than the implication ρ => q.
Ex. 12. In Ex. 11, prove that p'=>q'.
Example 3. To prove that there is an unlimited number of primes.
Let q be the statement Ί has no prime factor' and r be the statement
'there is an unlimited number of primes'. The statement # is true, since the
smallest prime number is 2; we shall show that q => r, and so r is true.
Now the negation of r is the statement /■': 'there is a finite number of
primes'.
r' => there is a greatest prime number, ρ say;
=> (k+l) is not prime, where к is the product of all the primes,
к = 2.3.5 ρ;
=> (k + l) has some member of the set {2, 3, 5, ...,p} as a factor;
=> both к and (k+l) have some member of the set {2, 3, 5, ..., p) as a
common factor;
=> 1 has some number of the set {2, 3, 5, ...} as a factor;
Thus r' => q' and so q => r.
A number of important points have been glossed over in the discussion
of implication in this section. The reader should consult one of the books
mentioned in the bibliography for a more thorough and rigorous treatment
of the subject.
Exercise 9(a)
1. Disprove the converse of the result ' if a quadrilateral is a rectangle then its
diagonals are equal'.
2. All angles θ of the form θ = кп (к an integer) have the property that
sin θ = tan Θ. Is the converse result true?
3. If a, b e Z+ and if the operation о is defined by
t ab-a-b
йо6 = ^ТГ'
prove that aob = boaoa = b.

MATHEMATICAL INDUCTION [9
4. Solve the equation J(x-4) + 2 = V(*+12)> stating which steps in your
solution are reversible and which are not.
5. Solve the equation J(2x+1)-Jx = 1, stating which steps in your solution
are reversible and which are not.
6. Solve the equation J(2x-l) + J(x + 3) = 9, stating which steps in your
solution are reversible and which are not.
7. Explain the fallacy in the following argument:
'n2 +1 has no real factors. Put η = 3: thus 10 is a prime number.'
8. Prove that the difference of the squares of two odd numbers is divisible by 8.
State and prove a converse theorem.
9. Prove that (x + h) is a factor of ax2 + bx + с if and only if ah2-bh + c = 0.
10. Prove that, if α φ — 1, then the simultaneous equations in x, y:
(ax+y^b,
(x + ay = b
can be solved for χ and y.
What happens if a = 1 ?
Is it true that the equations cannot be solved if α = — 1 ?
11. Two well-known theorems of number theory are:
(i) any odd prime of the form 4n +1 is expressible as the sum of the squares
of two integers;
(ii) any odd prime of the form An + 3 is not expressible as the sum of the
squares of two integers.
Combine these two theorems into a single theorem by using the phrase 'if and
only if.
12. a, b, c, d are four unequal, positive numbers. Prove that
a-c Ца2 + сг\ ., , , .. а с
b^d=J[v^d2) rf«"d only if ъ = ,
13. If P(x) is a polynomial with a repeated linear factor, that is, if
P(x) = (x-hYQ(x),
where Q(x) is a polynomial, prove that (x— h) is also a factor of P'(x). (jP'(x) is
the derivative of P(x).)
14. Prove that, if 0 < χ < 1,
then (X-xf <^f^.
4-х
15. ABC is any triangle and Μ is the mid-point of ВС. Prove that
AB2 + AC2 = 2AM2 + BM2 + CM2 (Apollonius's Theorem).
Is it true that, if Μ is a point on ВС such that
AB2 + AC2 = 2AM2 + BM2+CM2
then Μ is the mid-point of BC1
160

U A NOTE ON MATHEMATICAL PROOFS
16. If ρ is a prime number greater than 3, prove that either /7+1 or p—\ is
divisible by 6.
Show that the statement 'if χ is a number such that x +1 or x— 1 is divisible
by 6, then л: is a prime' is false.
Show further that the statement 'if л: is divisible by 6, then at least one of
x+1 or x— 1 is a prime number' is also false.
17. Prove that a sufficient condition for the integer χ to be divisible by 17 is that
there exists an integer η such that
x= 35я-18".
Prove further that this is not a necessary condition for χ to be divisible by 17.
2. MATHEMATICAL INDUCTION;
AN INTRODUCTORY EXAMPLE
Consider the series
Suppose we define the sequence {sn} in such a way that sn represents the
sum of the first η terms of this series. Then by direct computation,
■Sl =2. S2 = f, S3 = %, S4 = $.
The form taken by the first few values of sn suggests that
"и + 1
(*eZ+).
Let us define a new sequence {/„} in such a way that
f = —
Then we wish to prove that the two sequences {sn} and {/„} are the same,
in the sense that sn = fn for all η e Z+.
As a first step, we observe that
fn+1 = ^-±i , by the definition of/„
= ^+ϊ+(7Γ+Τ)"(^+Τ)
О)
, by the definition of sn. (2)
161
> + l)(« + 2)

MATHEMATICAL INDUCTION [9
(1) and (2) define recurrence relations (see chapter 8.1); with the initial
terms they enable us to determine uniquely any term of either of the
sequences {sn}, {/„}. But we further observe, in this case, that
(i) the two recurrence relations are the same;
(ii) Sl = i =U
Since the recurrence relations (1) and (2) develop successive terms of the
sequences {sn} and {/„} in the same way, and since the two sequences
have the same starting point, we deduce that sn = /„ for all n.
The observation that (1) and (2) represent the same recurrence relation
enables us to deduce that if sn = fn, then j„+1 = fn+1, or, expressed
alternatively, - .
Sn=fn^ Sn+1 = fn+1.
The fact that jj = /i then enables us to infer the equality of sn and/„ for
allneZ+;for г г г
Si = fi=>S2=f2=>S3 =f3... ·
Ex. 13. Define sequences sn and/„ as follows
s„ = 1.2+2.3 + 3.4+...+л(л+1),
Λ = Κ»+1)(Ί+ί-
Show that (i) st = /i, (ii) s„ and /„ satisfy the same recurrence relation. Deduce
thatjn =/„.
3. THE METHOD OF MATHEMATICAL
INDUCTION
Suppose that pn is some statement about the positive integer n. For
example (see example 5 below), we might have
pn: 'the expression 13й — 6™~2 is divisible by 7'.
Then, if we can show
(i) that pm is true for some specific integer m (in our example, m = 2
would do); and
(ii) that pk =>Pic+i for all к ^ т (in our example, рк^ is the statement
*13*+1-6*-1') we have
Pm => Pm+l => Pm+2 => ■■·
and the truth of the statement pn may be asserted for all η > m. Such a
process is called proof by mathematical induction.
The method of mathematical induction is used to prove a conjecture
about the positive integer n. The conjecture itself is usually arrived at by
inductive argument from a few simple cases—hence the use of the word
'induction', though the method of proof is deductive.
162

31 METHOD OF MATHEMATICAL INDUCTION
To summarize: given the statement p„ about the positive integer л,
to prove the truth of pn we
(i) verify that the induction starts; that is, we find an integer m for
whichpm is true;
(ii) prove the implication/^ => pk+1 for a general positive integer к *s m.
*Ex. 14. Show that n2 + n+41 is a prime number if η = 1, 2, 3. Prove that the
statement 'n2 + n + 41 is a prime number for all positive integers я' is false.
(n2 + n + 41 is, in fact, prime for η = 1, 2, 3, ..., 39 but it is false when η = 40
and whenever я is a multiple of 41: pk φ> /%+ι·)
*Ex. 15. If /?„ is the statement '5л2— 5я + 2 is divisible by 10', prove that
Pk =*■ Л+1- Prove also that /?„ is false for all positive integers n. (The induction has
no starting point.) Hint. To prove the falsity of pn, observe that Pk^Pk+i and
ihusp'k=> p'k+1.
Example 4. Prove that
1.11 + 2.21 + 3.3! + ...+л.л! = (л + 1)!-1.
Write sn = 1.1!+2.2! + 3.3! + ...+л.л!.
We have to prove that sn = (n +1)! -1; let pn be the statement
Ч, = (л + 1)1-Г.
(i) px is true, for sx = 1.1! = 1 and (1 + 1)!-1 = 1: the induction
starts,
(ii) To prove pk => pk+1:
Sk+i = sk + (k+l) (k+1)!, by the definition of sk;
**l+i = (*+l)!-l+(*+l)(*+l)!, Ъурк;
=>ί*+ι = (*+1)![1+*+1]-1
=>Ъ^ = (* + 2)!-1
and thus pk => pk+1 and the result is true for all л > 1, by mathematical
induction.
Example 5. Prove that 13й— 6й-2 is divisible by 1 for all positive integers
greater than m, where m is a certain integer, to be specified.
Write un = 13й- 6й-2; we have to prove that un is divisible by 7 for all
л ^ m, where m is some integer, to be found.
(i) щ is a fraction, and the concept of divisibility does not apply.
u2= 132-6° = 168 = 7.24 and щ is therefore divisible by 7: the induction
starts.
Let/7„ be the statement 'и„ is divisible by 7'.
163

MATHEMATICAL INDUCTION
[9
(ii) Toprove/»»=>A+i:
Ujc+1 = ΐ3*+ι_6*-ι and uk = 13*-6*-2
^uk+1-6uk = (13.13*-6.6*-2)-(6.13*-6.6*-2)
=> uk+1-6uk = 7.13*
=> "fc+i = 7.13* + 6ks
=> Mfc+i is divisible by 7, by pk, since the sum of two integers
divisible by 7 is itself divisible by 7.
Thus, 13n-6"-2 is divisible by 7 for all η > 2.
Example 6 {the Binomial Theorem). Prove that
(х + аГ = (J) *«+ф x^a+Q *-V + ... + ^) ж»-*+... + £) ««
/or positive integral n.
(i) (x + α)1 = x+a = II x¥+ xV and the induction starts.
(ii) Consider the product
(x+a) [*"+(") xn~ia+W\ xn-2a2+... + r) xn~r ar + ...+an~\.
Each term in the product is of degree (n+1), the first is xn+1, the last is
an+1 and the term containing xn+1-rar is
and the proof is complete, by mathematical induction.
164

31 METHOD OF MATHEMATICAL INDUCTION
It should be noted in passing that the proof of the Binomial Theorem
given in Chapter 8.5 strictly speaking requires induction, to justify the
process of selection from each bracket. The student is recommended to
supply the necessary details.
In our next example, we make an assumption (sometimes called
inductive hypothesis) of a slightly different form to those of the preceding
examples. The reader should observe that, to start the induction, we have
to verify that the result holds for two successive values of n.
Example l.Ifa sequence un is defined by the recurrence relation
un+1 = 3un-2un_1 (η Ζ 2)
and the initial values иг = 0, щ = 2, prove that
un = 2"-2.
(i) Since 2J-2 = 0 and 22-2 = 2, the result holds for η = 1 and
л = 2.
(ii) Suppose
uk = 2*-2 and Mfc_! = 2*-!-2 (к 3* 2).
Then,
Ub+i = 3ks —2ks_i, by the given recurrence relation;
= 3(2*-2)-2(2*_1-2), by the assumption made above;
= 6.2*-1-6-2.2*-1 + 4
= 4.2*-1-2
= 2*+1-2.
Thus uk = 2*-2 and u^ = 2к~г-2 ^ uk+1 = 2k+1-2.
But the result is true for η = 1 and η = 2, hence it is true for η = 3 and so
on.
*Ex. 16. If {«„} is a sequence defined by the relation un = Σ u„ η ^ 2 and the
initial value ux = 4, prove that un = 2" (η ^ 2). (Make the assumption that, for
к > 2, us = 2s for all s in the interval 2 < s < k.)
Example 8. Prove Theorem 7.4, using mathematical induction.
Theorem 7.4 states that, if Elt E2, ...,En are mutually exclusive events,
(EtnE1= φ, i φ./), then
PriG^ls) =_|;Pr№|S).
165

MATHEMATICAL INDUCTION [9
(i) The result is certainly true if η = 2, by Theorem 7.2, and the
induction starts.
(ii) Write F = "\3\ and suppose that Pr(F|S) ^^PrCE^S).
Now F П Ek = φ, since the events are mutually exclusive. Thus
Pr(F U Ek\S) = Pr(F|S) + Pr№t|S), ЬУ Theorem 7.2
= Σ ?r(Ei\S) + ?T(Ek\S), by the assumption made above
= ДРг(ВД
and the theorem is proved, by mathematical induction.
Exercise 9(b)
Throughout this exercise, η represents a positive integer. Prove the results 1-14.
using the principle of mathematical induction.
1. 1+2 + 3+4 + ...+л = $л(л + 1).
2. 1+3 + 5 + .. .+(2л-1) =я2.
3. 12 + 22 + 32 + 42 + ...+я2 = *и(л+1)(2и + 1).
4. 13 + 23 + 33 + ...+я3 = 1η\η + ΐ)\
5. 1.1+3.2+5.4+...+(2и-1). 2я"1 = 3+2я(2л-3).
6. 3.1! + 7.2! + 13.3! + ...+(я2 + я + 1)я! = (я + 1)2 я!-1.
9. я6 - η is divisible by 30.
10. 12" + 2.5""1 is divisible by 7.
11. 2to + 32"-2 is divisible by 5.
12. 52n + 22n-2.3"-i is divisible by 13.
13. З.Л + З'-'.И"-1 is divisible by 8.
14. (1 +x)n > 1 +nx, provided χ > -1, η > 1.
15. The terms of the sequence {uT} are all positive and s„ = Σ иг. Prove that
(1 +«,) (1 +и2) (1 +«з)... (1 +йп) > 1 +ίη for 77 > 2.
166

3] METHOD OF MATHEMATICAL INDUCTION
16. Prove that 5" < n\ for all sufficiently large n. Add such precision as you can
to the phrase 'sufficiently large n\
17. Prove by mathematical induction that the sum of the angles of a convex
я-sided polygon is (2л—4) right-angles.
18. Prove that an я-sided convex polygon has \η(ιι— 3) diagonals.
19. A straight line separates a plane into 2 regions; two straight lines separate
the plane into 4 regions and so on. If я straight lines (no three concurrent, no two
parallel) separate the plane into un regions, prove that
«n = i(«2 + «+2).
20. If я is a positive integer, prove that
52п+2-24л-25
is divisible by 576. (Cambridge)
21. If Λ„) = ι + ^ + Ιι+... + ^
prove that Σ [(3/·2 + 3/-+ 1)Д/·)] = (n + 1УДп)-Ып +1). (Cambridge)
22. Prove that
jbfr+l) (r+3 ... (r + ft-1) = «"+Ц-<" + *>.
23. If дъ д2, д3,..., д„ are all positive, prove that
(д! + д2+...+д„) (- + -+... + -) ^ л2.
24. Prove that, for any positive integer n,
2n 2n(2n-2) 2n(2n~2)(2n-4) , ч п
2^Ί+(2^ϊ)-(2^3-) + (2.-1)(2.-3)(2.-Τ) + - (t° Я t6rms) = 7л-
(O&Q
25. A motorist estimates that, by travelling along a main road at a certain steady
speed, the probability that the next set of traffic lights will be green if the last
set was green is \ and that the probability that the next set of lights will be
green if the last set was red is i. He sets out one day to test his theory. Prove that,
if the first lights he meets are green and if
pn = Pr (the Hth set of lights is green when he reaches them)
then ^n = !+i(i)"-1 (n> 1).
26. A man repeatedly tosses an unbiased coin, scoring 1 for each head and 2
for each tail. If pn is the probability that his score will ever be n, prove that
27. η bags, numbered 1 to n, each contain one white and one black ball. A
ball is taken at random from bag one and placed in bag two; a ball is then taken

MATHEMATICAL INDUCTION [9
from bag two and placed in bag three and so on, until finally a ball is taken from
bag n. Prove that the probability that this ball is white is
!(·♦£)
given that the first ball drawn was white.
28. The sequence {«„} is defined by the recurrence relation
u„-5un_1 + 6un_2 = О (n > 3)
and the initial values иг = 7, и2 = 17. Prove that
un = 2-+Ч-3".
29. The sequence {«„} is defined by the recurrence relation
un+2 + un+1-2un = 0 (Ol)
and the initial values иг = 4, и2 = — 2. Prove that
и* = 2-(-2)".
30. The sequence {«„} is defined by the recurrence relation
un+3 + 2un+2-un+1~2u„ = 0 (я ^ 1)
and the initial values иг = -1, и2 = 7, и3 = -7. Prove that
ип = 2 + (-1)«(1+2").
31. (The Fibonacci Series.) The sequence {«„} is defined by the equations
«i = «2=l, «„«=«„ + «„_! (n>2);
prove that
«η = ±{*·-Ρ) where a = !±^5 and β = ±=£
are the roots of the quadratic equation x2 = x + 1.
32. Prove that, for positive integral n, 32n —5" is divisible by 7 if and only if η
is even.
168

ίο. Expectation
1. RANDOM VARIABLES
In Chapter 7 we considered random experiments described by outcome
spaces. With each elementary event we associated a number, its
probability, which denoted our degree of belief that the experiment would result
in that particular events occurring. The elementary events themselves
may be described in various ways: in some cases, it is natural to denote
them by a number, e.g. for the fall of a die we could denote our elementary
events by the numbers 1,2,3,...,6; but sometimes no such natural
numerical description exists. For example, for a single toss of a coin, the two
element set {heads, tails} is the natural choice of outcome space, the elements
being labelled by the descriptions 'heads' and 'tails'. In order to make a
mathematical analysis of random experiments it is helpful to describe the
possible outcomes numerically even in those cases where no such 'natural'
description exists. Thus, in the case of coin spinning, the event' tails' could
be denoted by the number 0, the event 'heads' by the number 1. Such
values are called values of a random variable.
Let us pause here to recapitulate. Suppose we have a random
experiment whose possible outcomes are the η elementary events s-y, s2, ..., sn.
Then we may take as our outcome space the set
S = {s-y, s2, ..., sn}.
We now attach two numerical 'labels' to each element sf of S:
(i) the probability, pt, of that event occurring;
(ii) the corresponding value xt of the random variable (see Figure 10.1).
To help fix ideas, consider the following examples:
1. An unbiased die is thrown and the score noted. The outcome space
with the associated probabilities and values of a possible choice of random
variable are shown in Figure 10.2. If the experiment is repeated a number
of times, then the sum of the values of the random variable obtained gives
us our aggregate score.
2. Two unbiased dice are thrown in an attempt to score a double.
Since we are interested in just two possible outcomes, we may take our
outcome space, S, as shown in Figure 10.3. Associated probabilities together
with a possible choice for values of a random variable are as indicated.
In this case, if the experiment is repeated, the sum of the values of the
random variables obtained gives us the number of doubles thrown.
169

EXPECTATION
Л = 6 Л==6
{Тр
Fig. 10.3
3. A simple coin-tossing game is played as follows: A pays 5 pence for
the privilege of tossing two unbiased coins. If he gets two heads, В pays
him 5 pence and his stake money is returned; if he gets a head and a tail
(in either order), his stake money is returned; if he gets two tails, В keeps
the stake. A possible choice of outcome space, with associated probabilities
attached, is shown in Figure 10.4. In this experiment we are primarily con-
Л-4
= 4
1 J »2 Ι «Ι ί *4 !
Ш\ IhtI [та] ϊττ]
i A A A

1] RANDOM VARIABLES
cerned with the financial outcome and our choice of random variable
reflects this concern. If the experiment is repeated a number of times,
the sum of the values of the random variable obtained denotes A's net
gain in pence.
With these examples behind us we now make the following definition:
a variable whose value is a number determined by the outcome of a random
experiment is called a random variable.
The discerning reader will have observed that a random variable is a
function from a given sample space into a set of numbers, the values of
the random variable being the images under this function. The words
'random variable', however, are often loosely used to denote the image—
a vice to which we ourselves shall succumb from time to time for the sake
of brevity.
In the same way that a function is denoted by / and the image of ζ
under/by/(ζ), so a random variable may be denoted by Zand its
associated value for the z'th element of the outcome space by x{.
Ex. 1. A man pays 10 pence to throw two dice. For any double he receives his
stake momey back, together with a prize: one pound for a double six and 20
pence for any other double. Suggest a suitable outcome space, probability
distribution and random variable for this experiment.
Ex. 2. A man insures his life for £1000, paying a premium of £X. Suggest a
suitable choice of random variable to describe the situation.
2. EXPECTATION
A mass of numerical data often has an indigestible appearance and conveys
very little, unless subjected to considerable analysis. A device commonly
used to give an overall impression of the data is to determine their average
value. For example, if the heights of a hundred boys are measured, the
complete set of results may usefully be characterized by giving their average
height.
We shall now investigate what meaning can be attached to the phrase
'the average value of the random variable X associated with a given
probability distribution'. If an experiment is repeated N times, where N
is a large number, our interpretation of the probability p, as a measure
of our confidence in securing the outcome s^ leads us to expect roughly
ρλ Ν occurrences of s1} with similar results for s2, s3, etc. Thus we should
anticipate a score of xx on/^iVoccasions, x2 on/?2iVoccasions and so on,
giving an approximate average value of our random variable X for all N
171

EXPECTATION [10
repetitions as
p1Nx1+p2Nx2+...+pnNxn
-— N ——- = p1x1+p,x,+...+pn Xn
= ΣΡίΧί-
The larger the value of N, the more confidence we should place in the value
of Σ ρ{ X{ as an estimate of the average. We are thus led to formulate the
following definition:
The expectation ${X) of the random variable X is defined by the
equation n
S{X)= ^PiXi-
i=l
Example 1. An unbiased die is thrown; what is the expected score!
Here/?» = \ for each elementary event, and the associated values of the
random variable X are the integers 1, 2, 3, 4, 5, 6. Thus
£{X) = Σ £
= 3-5.
It is perhaps superfluous to remark that no one sufficiently familiar
with an unbiased die would expect a score of 3-5 on any one throw. 3-5
simply represents the best estimate available, prior to the actual experiment,
that we can make of the final average score if the die is thrown a number of
times; the larger the number of throws, the better we expect our estimate
to be. (The reader is strongly recommended to test the accuracy of this
forecast if ever he finds time lying heavily on his hands by throwing a die,
say a hundred times, and computing his average score.)
Example 2. Two players, A and B, play the following game with three coins:
A pays a stake of 10 pence and tosses the three coins in turn. If he obtains
three heads, his stake is returned, together with a prize of '30 pence; for two
consecutive heads, his stake money is returned, together with a prize of
10 pence. In all other cases, В wins the stake money. Is the game fair!
We must first consider what is meant by the question 'is the game fair?'
Intuitively it seems reasonable to label a game as 'fair' if, in the long-run
neither side anticipates any considerable financial gain. Mathematically,
a game between two players is fair if the expectation of gain for either
player is zero.
172

2] EXPECTATION
To continue with the solution of the problem, we choose as our outcome
space the four-element set
S = {(HHH), (HHT), (THH), (anything else)}.
On the assumption that the coins are unbiased and that the results of the
tosses are independent, Pr (HHH) = (£)3 = -|; similarly,
Pr (HHT) = Pr(THH) = % and Pr (anything else) = 1 -| = ·§.
As our random variable we take the net gain, in pence, for A on each
particular elementary event: хг = +30, x2 = +10, x3 = +10, xt = -10.
Summarizing in tabular form, we have:
HHH
i
30
HHT
έ
10
THH
i
10
Anything
1
-10
£{X) = £.30+i.l0+|.10 + !(-10)
= 0.
The game is therefore, according to our definition proposed above, fair.
Ex. 3. An experiment with three possible outcomes su s2, s3, has probability
distribution {j, f, ^} and associated random variables 3, 2, 1. Determine the
expectation.
Ex. 4. A man pays 1 penny to throw three unbiased dice. If at least one six
appears he receives back his stake money together with a prize consisting of the
number of pennies equal to the number of sixes thrown. Does he expect to win
or lose?
Ex. 5. An experiment can result in three possible outcomes, whose probabilities
are k, i, i. A random variable is assigned whose values are respectively x2, -x,\.
Show that, if the experiment is repeated a number of times, the player may
possibly finish with a negative score but can anticipate an aggregate score which
is positive. Can his expectation be zero ?
173

EXPECTATION
Exercise 10(a)
1. The values of a random variable, together with their associated probabilities
for four different experiments, are given in the tables below. Calculate &(X)m.
the four cases.
(i)
m
(iii)
*i
P;
*/
Pi
*/
P;
0
A
-2
A
1
A
1
A
-1
!
2
A
2
i
0
A
3
i
3
i
1
A
4
!
4
A
2
A
5
A
5
A
6
A
*/
£,·
1
0
2
1
3
0
4
5
0
6
7
0
8
i
9
0
10
i
2. In Question 1, a new random variable У is constructed so that Υ = 2X— 1.
If the probability distributions remain the same, calculate S(Y) in the four cases.
Can you generalize your result in any way ?
3. A player pays a certain sum of money to spin two coins. For two heads
he receives back Юр, for two tails he receives 2p, for a head and a tail he receives
nothing. In all four cases he forfeits his stake money. What should the stake
money be for the game to be fair?
4. Two dice are thrown; find the expectation of the higher score showing (or
the score of one of them, if they fall alike).
5. If the probability that a man aged sixty will survive another year is 0-9,
what premium should he be charged for a life insurance policy of £1000? (If he
survives the year, he receives no money back.)
6. ΑΊ and X2 are two random variables, each with values 0, 1, 2, 3, ..., 9, and
each possessing a uniform probability distribution. Evaluate
(OrfUi-.*·); (ii) ЩХг-XS.
7. Two bags each contain ten coloured discs as shown.
Red
4
5
Green
3
3
Blue
3
2

2]
EXPECTATION
A player stakes a certain sum of money for the privilege of drawing two discs,
one from each bag. For two discs of the same colour his stake is returned and, in
addition, he is awarded a prize of Юр for two reds, 20p for two greens and 25p
for two blues. For two discs of different colours he loses his stake.
Show that, if the stake money is 8p, he can anticipate gaining in the long run,
but that with the stake at 9p he should expect to lose.
8. The game of Question 7 is repeated, but the player now tosses a coin to decide
which bag he must choose from: if he tosses a head, he chooses bag I, if a tail,
bag II; he then draws a disc at random from the chosen bag, notes its colour and
replaces the disc. He repeats the process again and is paid prizes as in the
previous question. Determine the minimum stake (to the nearest penny) required to
ensure that the player will show a loss in the long run.
9. The game of Question 8 is repeated but the discs are not replaced between
draws. Determine the minimum stake (to the nearest penny) required to ensure
that the player will show a loss in the long run.
10. The game of Question 7 is repeated, but the player is now required to place
his stake after the result of the first draw from bag I is known. If he draws a red
disc first time he pays 2p, if a green, 9p, if a blue, 12p. Show that, in the long run,
the player expects to win.
Show further that the above stakes are the fairest available that still give an
advantage to the player, in the sense that, if any one of the stakes were increased
by lp, the bank would then expect to win.
11. A man pays a stake to throw two dice. If he scores a total of 3 or 11, he
receives 40p. For a total of 5 or 9 he receives 20p, and for a total of 7, Юр (in
each case the stake money being returned too). He loses his stake money for any
even score. Show that he expects to win if the stake money is 21 p, but that, if the
prizes for scoring 5 or 9 and 7 are reversed, he would then expect to lose.
12. Two identical bags contain respectively (i) four fivepenny pieces and twelve
tenpenny pieces, (ii) nine fivepenny and seven tenpenny pieces. You are allowed to
select a bag and draw a coin at random from it. If the coin you draw is a five-
penny piece, what would be a fair price for you to offer for the bag you did not
select?
13. The path in Figure 10.5 represents a simple maze along which a rat is made
to run. It starts at S and has to finish at F. If it makes a mistake at A by turning
along AA' it will return to A and be forced by the construction of the maze, to
turn towards F, and similarly at each of the other junctions. The probability of
taking either of the two paths available at each junction is \. Find the expected
number of mistakes the rat will make in running from S to F.
А' В' С D'
ι //// g
«♦А В С D _M ,
Start Finish
Fig. 10.5
14. What is the expected number of moves that can be made by (i) a bishop,
(ii) a knight, placed at random on an empty chess board?
175

EXPECTATION [10
15. Two dice are thrown in One turn', each turn costing 5p. If a prize of 40p is
given for a double six and a prize of 20p for any other double (together, in both
cases, with the stake money), determine the loss to a person playing the game
One hundred times.
16. A man puts three coins into a bag, deciding at random for each coin
separately whether it is to be a fivepenny or a tenpenny piece. Calculate the expected
total value of the coins he puts in his pocket.
17. A man puts three £5 notes into one envelope and three £1 notes into a
similar envelope. Each year at Christimas, he chooses one envelope at random and
gives his nephew a note from it. As soon as either envelope is emptied by his
taking the last note from it, the process ends.
(i) State the different totals which the nephew may have received when the
process ends;
(ii) for each of these totals calculate the chance of its occurrence;
(iii) deduce that the nephew's expectation of gain is £12-375.
(Cambridge adapted)
3. STANDARD DEVIATION; EXPECTATION OF
FUNCTIONS OF RANDOM VARIABLES
The expectation £(X) of the random variable X is often called the mean
of X and is frequently denoted by the letter μ (or, occasionally, when it is
necessary to avoid ambiguity by referring to the random variable under
consideration, μχ). μ gives us a forecast of the average value obtained for
the random variable if the experiment is repeated a great number of times;
it does not, however, give us any indication of how the individual results
will be spread out.
By way of illustration, consider two unbiased dice, one of which has its
faces printed in the usual way, but the other has three faces showing one
dot and three faces showing six dots. It is readily verified that the
expected score for the second die is 3-5, just as it is for the first but clearly,
if each die is thrown one hundred times, the separate scores in the two
cases will present a very different appearance: the spreads of the
distributions are quite different. To measure the spreads of these distributions it
might seem plausible to consider a new random variable Υ whose value for
a particular face equals the difference between the score showing and the
mean (commonly called the deviation from the mean) and to calculate the
expectation of Y. However, a calculation in the above example shows that
such a quantity is quite unsuitable.
For the first die, Υ takes the values {-2-5, -1-5, -0-5, 0-5, 1-5, 2-5}
with corresponding probabilities {\, \, \, \, \, £}. For the second die, the
values of Гаге {-2-5, 2-5} with corresponding probabilities {\, £}.
176

31 STANDARD DEVIATION
For die I, £{Y) = i(-2-5-1-5-0-5+0-5 +1-5 + 2-5) = 0.
For die II, £(Y) = i(-2-5 + 2-5) = 0.
A moment's consideration should show that our obtaining the answer
zero in both cases is no coincidence. Indeed, for any distribution, the
expectation of the deviation from the mean is zero. Why?
To avoid the cancelling out of the positive and negative scores a plausible
precaution would be to choose as our new random variable not Y, the
deviation from the mean, but Z, the squared deviation from the mean and
this does, indeed, prove a very satisfactory measure of spread.
Let us see how such a new choice of random variable works in our
example of the two dice. For die I, Z takes the values {6-25, 2-25, 0-25}
each with associated probability J, while for die Π, Ζ takes the value
{6-25}, with associated probability 1.
For die I, S{Z) = K625 + 2-25 + 0-25) = 2-92.
For die II, S{Z) = 6-25.
The precise meaning of these two numbers in probabilistic terms will be
more fully explained in the next section (see Chebyshev's Theorem); for
the moment it suffices to point out that the larger answer obtained for the
second die corresponds to the fact that the scores obtained from this die
are, on the average, farther from the mean than they are for the normal die.
Summarizing the results, we make the following definition: given a
random variable X, with mean μ, the variance, σ|, of X is the expectation
of the squared deviations from the mean. That is, if a new choice of
random variable, Z, is made, where zt = (хг-/<х)2, then
σ% = S{Z)
The positive square root of the variance, σχ, is called the standard
deviation of X; it has the advantage of being measured in the same units
as X. The standard deviation (or, equivalently, the variance) has been
shown, in an intuitive sense, to give a measure of spread; its deeper
significance will be appreciated by the reader only as he gains a fuller grasp of the
subject, when it will be seen that standard deviation plays a central part in
the development of probability theory.
There is no difficulty in extending the concept of expectation to other
functions of a random variable. Indeed, given a function/that maps the
random variable X into another random variable Y, where y{ = f{xv),
177

EXPECTATION [10
we may define the expectation off(X), £[f(X)], by
with this notation σ| = <?[(Χ-μχ)2].
Ex. 6. Find the mean and variance of the random variable X whose probability
distribution is shown in the table below:
X
Ρ
0
1
1
I
2
k
3
i
4
έ
Ex. 7. Find the variance for the number of heads showing if three unbiased coins
are tossed.
*Ex. 8. A random variable has mean μ and variance σ2. If a constant с is added
to each value of the random variable, describe (without embarking upon any
detailed working) what difference this will make to (i) the mean, (ii) the variance.
4. SOME THEOREMS CONCERNING THE MEAN
AND STANDARD DEVIATION OF A RANDOM
VARIABLE
Throughout this section we shall assume that, for some random
experiment, an outcome space S with known probability distribution is given
and a random variable, X, assigned, where £(X) = μ and «?[(X-/<)2] = σ2.
Our first theorem enables us to determine μ and σ in terms of the
expectation of Χ- α and {X- a)2. Its value lies in the fact that these latter
quantities are frequently a great deal less troublesome to calculate than
are μ and σ2 directly from the definition.
Theorem 10.1. If a is any number, then
(i) £[{X~a)] = μ-α; (ii) £\{X~af\ = σ2 + (/*-α)2.
Proof, (i) £[(X-a)] = ΣΡί(χ-α)
= %PiXi~%Pia
= μ~αΣΡί
= μ— a, since Σ A = 1 ■
178

41 SOME THEOREMS
г = 1
= Σ л(*< - μ?+Άμ - α) Σ λ(*< - /*)
+(/ι-β)!.Σί(,
since 2{μ—ά) and {μ—a)2 are common factors and thus may be taken
outside the summation; thus
<?[(X-af] = σ2 + (/ί-α)2, since |й = 1
and ΣιΡί(Χί—μ) = 0 (from Theorem 10.1 (i) putting a = /<).
Example 3. Find the mean and variance of the numbers 1, 2, ..., 20, assuming
that they are uniformly distributed.
We use Theorem 10.1, with a = 10,
^[(Ζ-10)] = -/0-[-(9 + 8 + ... + 1) + (1+2+...+9)+10]
= 0-5,
/ί = 10-5;
^[(Z-10)2] = -2J0-[2(l2 + 22 + ...+92)+102]
= -Ай.9.10.19+100]
= 33-5,
σ2 == 33-5-(0-5)2
= 33-25.
The next theorem determines the mean and standard deviation of a
linear function of the random variable. Intuitively, the results are easy to
understand: multiplication of every value of Ζ by A clearly results in a
proportionate increase in the measures of the average (mean) and spread
(standard deviation), while a shift of origin shifts the average by the same
amount but has no effect upon the spread.
Theorem 10.2. If λ, a are any numbers:
(i) the mean of the random variable Υ = λΧ+α is λμ + α,
(ii) the standard deviation of the random variable Υ — λΧ + α isAa.
Proof, (i) S[XX+ a)] = Σ л(Лх< + a)
= ΑΣίΛ + ι.ΣΑ
= λμ + α.
179

EXPECTATION [10
(ii) The mean of Υ is λμ + α, by (i) and the variance of Υ is given by
^{[γ-ίλμ + α)]*} = <?{[(λΧ+α)-(λμ + α)γ}
= Σ/^Α^-Α/*]2
г = 1
i = l
= AV2.
Our final theorem of this section is a result due to the great Russian
mathematician P. L. Chebyshev (1821-94). Its importance lies in the fact
that it gives us an interpretation for standard deviation in terms of
probabilities ; that is, it gives us, in terms of A, an upper limit for the probability
that a value of the random variable lies further away from the mean than
A standard deviations, whatever the associated probability distribution.
(Of course, if we are given some information about the probability
distribution we can usefully refine the inequality, but this does not detract
from Chebyshev's result viewed as a general theorem.)
Theorem 10.3 {Chebyshev's Theorem). If λ is any positive number,
Ρτ(|*,-/ί| > Ασ) < 1/A2.
Proof. Define the subset A of the outcome space S by
Α = {χί:\χί-μ\ > λσ} = {χ{:(χ^μγ > Α2σ2}.
Now S = A U A' and A n A' = 0. Denoting by Σ summations over
A
those values of i for which *,■ e A, and similarly Σ and Σ, we have,
A' S
'! = Σ^-/·)!
= Σ/ΜΛ-/*)2+ Σρ£χ{-μ?
A A'
>ΣΡί(Χί-μ)2, since Σ/>*(**~/*)2 > 0,
A A'
> Σ Α·λ2σ2, by the definition of A,
A
= AVUft,
Since AV2 > 0 we may divide both sides of the inequality by AV2 :
}2> Σλ = Ργ(*,:|χ*-αΙ >*°-)·
180

4] SOME THEOREMS
Ex. 9. A probability distribution has mean 3 and variance <r2 = 1 -2. Give an
approximate limit for the probability
Pr(|*-3| > 2σ).
The probability distribution for the random variable X(X = 1, 2, ..., 5) is
given in the following table:
* 1 2 3 4 5
Ρ U> i I U \i о
Find the mean and standard deviation. What is Pr (|*-3| > 2cr)? Comment
upon the results obtained and any discrepancy you observe.
Ex. 10. Find an expression for £[(Χ-μ)3] in terms of £(X3), μ and <r.
5. PROBABILITY GENERATING FUNCTIONS
We now introduce a technique of considerable value in dealing with
probability distributions of an integral variable. Given a random experiment the
outcome space for which has associated integral random variable X, the
expectation of the function tx is called the probability generating function
(p.g.f.), G(t), for the distribution. Thus
G(t) = £ψ\ =(|д<".
For example, suppose a card is drawn at random from a well-shuffled
pack and that we score the face value of the card drawn (an ace scoring 1
and a picture card 10). The probabilities of scoring 1, 2, 3, ..., 9 are each
тз, while the probability of scoring 10 is 4/13. Thus
G(t) = ^(t1+t2 + t3 + ... + ts) +-& г10.
The quantity t has no particular significance: it is simply used as a
'carrier' for the values x{ of the random variable. Since the coefficient of
tx<- is р^ the probability of each value of the random variable may be read
off if we know the form of the p.g.f. Thus, if we are able to deduce the form
of the p.g.f. we have a concise summary of the probability distribution.
Theorem 10.4 gives a method of building up the p.g.f. for a complicated
distribution but, before we embark upon its enunciation and proof, we must
make the following definition:
Two random variables X and Υ defined on the same sample space are
said to be independently distributed if
Pr (X = Xi and Υ = уд = ?τ(Χ = χ,) Pr (Υ = у,).
Our next theorem shows that, if X and Υ are independently distributed,
then the new random variable Ζ = X+ Υ has as its generating function
181

EXPECTATION
[10
the product of the generating functions of X and Y. For simplicity, we
shall consider the particular case in which X can take the η +1 values
0, 1, 2, ..., η with Pr (X = г) = Pi and that Υ can take the (n + 1) values
0, 1, 2, ..., и, with Pr (Y =j) = q,-. Then Ζ can take the 2n +1 values 0, 1,
2, ..., In, with Pr (Z = k) = гъ say.
Theorem 10.4. У/" X and Υ are independently distributed random variables
taking the values 0, 1, 2, ..., η and ifthep.g.f.s of X and Υ are Gx(t), Gv(t)
respectively, then the random variable Ζ = X+Y has p.g.f. Gz(t) where
Gz(t)=GJ.t)Gy(t).
Proof. rk = Pr(Z = k)
= Pr(Ar=0andr= A:) + Pr(Z= 1 and Υ = Jk-l) + ...
+ Pr(Ar= Arand Г= 0)
but Ga,(0 G„(0 = Oo +Л '+Λ'2+- +/>» tn) (ς0 + ς^ + ς2^ + ...+ςη tn)
= РоЯо + Oo ?i +Л ?o) ' + (Λ ft +A ft + A ft) t2 + ■ ■ ■
= Gz(t).
Before giving an example of the application of this theorem, we prove
one further result to illustrate the value of developing the theory of p.g.f .s:
we show that they may be used to calculate the mean and standard
deviation of a distribution.
Theorem 10.5. If the random variable X, whose p.g.f. is G(t), has mean μ
and standard deviation σ, then,
(i) G(l)=l;
(ii) σ(1) = μ;
(iii) G"(l) = σ2+/ί2-/ί.
Proof, (i) С?(0=Д^Г
.·. G(l)=ipr
= 1;
(ii) G'(t)
182
= grPrt™
= КГРг
= «?(*)
= μι

5] PROBABILITY GENERATING FUNCTIONS
(iii) G>(t)= ±r(r-\)pTtT-*
■'■ с?"(1)=Д^-1)л
= £(X2)-£{X)
= (σ2+μ2)-μ, by Theorem 10.1, with a = 0.
Example 4. Two unbiased dice are thrown. Find the expectation of the total
score, and its variance.
Let the random variable X denote the score from the first die; the
random variable Υ the score from the second die. Then we have to calculate
the expectations and variance of the random variable Ζ = X+ Y. Now
Gx(0 = «'1 + '2 + -+'6),
Gy(t) = Ut1 + t2 + ...+t6),
л Gz(t)=-h(t1+t2+-+t*)\
assuming that the dice fall independently. Thus
G'z(t) = -hit1+t2 + ...+t6)(l+2t + 3t2 + 4t3 + 5г4+6г6),
G"z(t) = -1h-(t1+t2 + -+t6)(2+6t+l2t2+20г3+30г4)
+is(l+2t + 3t2 + 4t3+5ti + 6t&y,
Л G'z(l) = 7,
G'z{\) = 47f.
By Theorem 10.5, μ = 7
μ2 + ο^—μ = 47f,
μ = 1, σ2 = 5f.
Ex. 11. Three coins are tossed and the number of heads noted. Find the p.g.f.
for the resulting distribution and determine μ and σ2.
6. EXPERIMENTS WITH INFINITE
OUTCOME SPACES
Experiments with an infinite number of possible discrete outcomes are
easy to visualize. For example, suppose a coin is spun until a head appears.
Take as random variable X, the number of tails obtained before the first
183

EXPECTATION [10
head is tossed; then Ζ has possible values 0, 1, 2, 3, ... and
Рг(Х = г) = ^.
Notice that £ Pr (X = r) = ^T = 1»
which is as it should be. Indeed, all the results of Chapter 7 hold for such
outcome spaces; detailed proofs will be omitted, because they depend upon
the manipulation of convergent series. However, assuming the obvious
generalizations of the definitions and theorems, the calculations involved
are not difficult. For instance, in the example above
But fi0*r=(l~*)"1 (W <i}
and so Σ гхг~г = (1 -x)~2,
assuming that the infinite series may be differentiated term by term (a
result always true, in fact, for convergent power series). Thus
Σ rxr+1 = x2(l-x)~2
and, putting χ = \, S{X) = 1.
Example 5. A and В alternately throw a die, the game terminating in a win
for A if he throws a I or a 6, or in a win for В if he throws a 2, 3, 4, 5. Find
the probability that A wins and, if he wins, find the average number of throws
he takes, given that A commences the play.
Pr {A throws 1 or 6 on a particular toss) = },
Pr (B throws 2, 3, 4 or 5 on a particular toss) = -|,
Pr (A wins on (2r+ \)th. play)
= Pr {A and В fail alternately on first 2r plays)
χ Pr {A throws 1 or 6 on (2r+ \)th. play)
= ®r (*)"(*).
Л Pr {A wins) = ΣβΥ Ш = ]4| = *·
We now have to find the expected number of throws taken by A, given
that A wins. We take as our random variable X where
X = r if A wins on his rth turn.
184

61 INFINITE OUTCOME SPACES
Since the game can go on indefinitely, X may take any positive integral
value 1, 2, 3, ... (i ly-ii
Pr (X=r\A wins) = У-3^Ц—^
= my-1
and ^(1) = Σ ir©'"1
(on using the result proved above: Σ rx7'"1 = (1 -x)-2).
Ex. 12. A man tosses a coin until a head appears and is paid a number of pence
equal to the number of tosses he makes. What is his expectation?
Exercise 10(b)
1. X is a random variable with mean μ and standard deviation σ. Write down
the means and standard deviations of the following random variables:
(i) -X; (ii) X+l; (iii) ЪХ-\; (iv) (Χ-μ)/σ.
2. Calculate the mean and variance for each of the following distributions:
(i)
(ϋ)
X
Ρ
X
Ρ
0
i
1
ft
1
i
2
A
2
A
3
ft
3
ft
4
ft
4
*
5
ft
5
1
6
ft
X
^
-3
ft
X
Ρ
50
ft
-2
ft
100
i
-1
i
150
1
0
A
200
i
1
ft
250
ft
2
ft
3
ft
4
ft
3. Calculate the η
s and variances for X2 in each part of Question 2.
4. A cubical die is so weighted that the probability of obtaining any face is
proportional to the score showing on that face. Find the mean and variance of
the score obtained.
5. If the random variable X сгп take the values 1, 2, 3 n, all values being
equally likely, calculate the mean and variance for X.

EXPECTATION [10
6. А, В and С repeatedly throw a die, A starting, then B, then C, then A again
and so on. The winner is the first to throw a six. What are their respective chances
of success ?
7. A throws a pair of unbiased dice, В a pair of dice of which one is unbiased
and the other is such that the probability of a six is p. If they throw in turn and
the winner is the first to throw a double six, find p, given that, when A has the
first throw, the game is fair.
8. An unbiased coin is tossed η times. If a head appears, + 1 is scored, if a tail,
-1 is scored. Prove that the p.g.f. for the random variable X, where x, = total
score after / turns, is given by
G(i) = (2i)-n(i2+l)n.
9. A die is thrown η times. An odd face makes the score showing, an even face
scores zero. Determine the p.g.f. and hence the mean score for η throws.
10. Two dice are thrown together and the scores added. What is the chance that
the total score exceeds 8 ?
Find the mean and standard deviation of the total score. What is the standard
deviation of the score for a single die? (Cambridge)
11. A card is drawn at random from a standard pack and scores the face value
of the card (with ace one and picture cards 10 each). Find the mean and variance
of the score.
If the card is replaced, the pack well shuffled and a second card drawn, find
the probability that the total score for both draws is 12.
12. Two unbiased dice are given, one of which has the faces numbered in the usual
way 1, 2,..., 6, but the other has two faces numbered 1, two numbered 3 and
two numbered 5. Both dice are thrown and the total score, X, is recorded. Find
the mean μ and standard deviation, σ, of X.
Find also
(i) Ρτ(\Χ-μ\ > or); (ii) Pr(|JT-H > 2σ).
Miscellaneous Exercise 10
1. Two bags contain red and white discs as shown in the table below:
Red White
5
10
15
10
One of the bags is selected at random and a disc drawn from it proves to be red.
If the red discs are now valued at £1 each and the white discs are valueless, what
would be a fair price to pay for the remaining discs in the selected bag ?
2. (The St Petersburg Paradox.) A coin is spun. If a head is obtained first time
you are paid lp; if you get a tail followed by a head you receive 2p; for two tails
followed by a head 4p, the next prize being 8p and so on. Show that, however
much you are prepared to pay to play the game, your expected profit will be
positive.
186

6J MISCELLANEOUS EXERCISE 10
Criticise any assumptions you have made and indicate what further
knowledge you would require before offering a more realistic 'fair price' for the game.
If the banker against whom you are playing starts with a capital of lOOp, what
would be a fair price for you to offer him before playing the game ?
3. A and В alternately throw an unbiased die, A having the first throw. The game
terminates when a six is thrown. В agrees to pay A £1 if A throws a six on his
first throw, £3 for a six on his second throw, £5 for a six on his third throw, and
so on. What would be a fair price for A to offer to play the game?
4. Two men play a game with two dice. A has a true die, whereas В has a die
which is biased so that each of the even faces is twice as likely to occur as each
of the odd faces. The two players throw their own die and A wins from В the
sum of the numbers thrown when the sum is even and the numbers are unequal.
Л wins the sum from A when it is odd; and the game is drawn when the two
numbers are equal.
Calculate the expectation of the game to A.
5. X and У are two random variables defined over the same probability
distribution. A new random variable Ζ is constructed, where Ζ = X+ У Prove
that the mean of Ζ is the sum of the means of X and Y.
Is it also true that the variance of Ζ is the sum of the variances of X and У?
6. If X is a random variable with mean μ and standard deviation σ prove that
the standard deviation of the random variable {Χ-μΥ is
{*[(*-/04-«*}*■
7. η points are marked on a line at 1 cm intervals. If two of the points are chosen
at random, find the probability that they will be r cm apart. Deduce that the
expected distance apart of the two points is j(n+1).
8. A and В play a game in which A's chance of winning is p, while B's is q, where
p+q = 1. They have a contest, the winner being the first to score two consecutive
successes. Prove that the expected number of games is
{2+pq)0—pqr\
9. X\s a random variable such that Pr (X < 0) = 0 and the mean and standard
deviation of X are respectively μ and σ. Prove that, for any к 5= 1,
Pr (X < A/t) > ^ .
Deduce Chebyshev's result, that
Ρτ(μ-λσ < X < μ+λσ) 5= 1--^·
10. The random variable У has a distribution such that Pr (У = r) is given by the
coefficient of θτ~' in p\\ — dq)-' for r > t and zero otherwise, where 0 < ρ < 1,
q = 1 —p. Find the expectation of У (M.E.I.)
11. Xis a random variable that can take all positive integral values 0, 1, 2, 3, ....
If
pr = Pr {X = r) and qr = Σ ρ,
s=r+l
7 "μ 187

EXPECTATION [10
and if the p.g.f. for X is G{t) and
H{t)= Zqrt\
prove that H{t) = {1~^°}.
Prove, furthermore, that H{\) = μ where μ is the mean of X, and determine
the value of #'(1).
12. A die is thrown η times. If S denotes the total sum obtained, prove that
£{&) = 7л(21л+5)/12.
13. η identical cards are numbered 1, 2, 3, ..., n, and a random sample (with
replacement) of size к is drawn. If the number on the highest card is taken as the
value of the random variable X, prove that
and deduce that, for large n, the mean value of X is approximately
nk
k + V
14. A and В play a game of golf. At each hole A's chance of winning is a and
B's is b. Find the chance, h, that the hole is halved, in terms of α and b.
Show that the chance of A being r up after η holes is the coefficient of χ to the
power (n+r) in the expansion of (ax2 + hx + b)n.
lfa=b = h, find the chance that A wins 5 and 4, i.e. that he was either 5 up
with 5 to play and then halved the fourteenth hole, or was 4 up with 5 to play
and won it.
Leave your answer in terms of powers of 3 and binomial coefficients.
(M.E.I.)
15. A man consistently stakes a fixed proportion of his available capital on a
fair bet, that is, a bet with zero expectation of gain. If he wins, his existing
capital is increased by a %, but, if he loses, he decreases it by b %. Show that he
must expect to lose in the long run, whatever values a and b might take.
16. Two numbers -Sfand У between 1 and 100 (inclusive) are selected at random,
all possible pairs {X, Y) having equal probabilities. Let Ζ denote the maximum
of X and Y. What is the probability that Ζ < 50? By use of the formulae
Ег = 1л(и+1)
and Σ/-2 = %n(n+l)(2n+l)
or otherwise, show that the mean of Ζ is just over 67. Find a median of Z.
(A median of Ζ is any number ζ such that Pr (Z < 0 5= i and Pr (Z 5= 0 > i-)
(C.S.)
17. Is it possible to devise a coin tossing experiment with a single unbiased coin
to give a probability of τ for some event ?
If you are given a coin which is biased in some unknown way, show how to
devise an experiment in which the probability of success, to be defined, is $.
188

ii. Further vectors
1. THE DOT (SCALAR, INNER) PRODUCT
OF TWO VECTORS
We have defined the combination of two vectors by the triangle rule, an
operation which we called addition and for which we used the sign +. The
choice of the word 'addition' was not entirely arbitrary, for it transpired
that the addition of vectors bore resemblances to the addition of numbers,
in the sense that certain algebraic properties of addition in the two cases
were strikingly similar.
We shall now define a new way of combining two vectors, which we
shall call dot multiplication (otherwise variously called scalar
multiplication, or the formation of the inner product) and denoted by .. The use of
a familiar name and notation is again dictated by the fact that dot
multiplication and the ordinary multiplication of two numbers display a strong
resemblance to one another. However, the reader should again be on his
guard and realize that the two operations are quite different as they
operate on different types of quantities. Having issued this caveat, we now make
our definition of the dot product of two vectors a and b:
a.b = |a||b| cos 6»,
where θ is the angle between the vectors a and b. Note that the
multiplication on the right-hand side of this equation is the ordinary multiplication
of three real numbers, |a|,|b|, and cos Θ, of which the first two are positive
(or zero).
Before proceeding we must clarify one apparent ambiguity in the
definition given above: the angle θ between two vectors is chosen as that angle
Fig. 11.1
189

FURTHER VECTORS [11
(between 0° and 180°) through which one of the vectors must be rotated to
bring it into coincidence in the same sense as the other vector (see Figure
11.1). It follows that the sign of the dot product is positive if the angle thus
defined between the two vectors is acute and negative if the angle is obtuse.
We have said that the word 'multiplication' is used in this context
because of the similarity shown between dot products and ordinary
products. This similarity we now endeavour to display, but to give added force
to our warning against pressing the analogy too far, we start by observing
two laws of multiplication of ordinary numbers that do not hold for
dot products.
Dot multiplication does not give closure; that is, a.b is not a vector.
Dot multiplication is not associative; indeed, (a. b). с is not even defined,
for (a.b) is a number, not a vector. (On the other hand, we can talk about
(a.b) c, which represents a vector in the direction of с and whose
magnitude is |a| |b| cos θ times that of c.)
However, there is a sort of associative rule that holds: if a, b are two
vectors and к is a number,
Ar(a.b) = (A:a).b = a.(jfcb).
The proof of this follows immediately from the definitions and may be
left as an exercise for the diligent reader.
Another reassuring note may be struck by observing that the
commutative law holds for dot products
a.b = b.a.
Again, the proof of this is immediate from the definition and may, with
even more confidence, be left as an exercise for the reader.
The final rule for dot products, that of distributivity over vector addi-
is more subtle and the proof is not immediately obvious. However, the
effort expended in demonstrating this result is amply rewarded, for with
it we shall have set up all the apparatus necessary for an effective algebra
of vectors.
We begin by defining the projection of a vector a in a direction specified
by a unit vector u. Suppose the displacement OA represents a; draw the
plane π through О perpendicular to u. Then, if D is the foot of the
perpendicular from A to π,
a = OD+DA
= OD+/7U
(see Figure 11.2). Further, the expression of a as the sum of two vec-
190

tors, one in the plane л
that
THE DOT PRODUCT
e in the direction u is unique. For suppose
= OO+pu
= OD'+^'u,
Fig. 11.2
where D, D' both lie in π. Then, by subtraction
0 = OD-OD'+/>u-p'u
= O'O+(p-p')u
and so DD' = (p-p') u.
But this is impossible unless D and D' coincide and ρ = ρ', for DD' and
(p-p') u lie in different planes and so cannot possibly be equal, unless
each is the zero vector.
The number ρ that arises in this construction is called the projection of a
in the direction u. We now prove the fundamental result for projections:
if the projection of яг in the direction u is^ and the projection of a2 in the
direction of u is p2, then the projection of aj + a2 in the direction u is
Pl+P2-
Suppose, with the obvious notation, that
aj = ODi+^u,
a2 = OO2+p2u;
then aj + a2 = (ODj + OD2) + (px +p2) u.
But ODj + OD2 is certainly a vector in the plane π, and so, by the
uniqueness of the expression for aj + a2 as the sum of two vectors, one in π and
the other in the direction of u, the projection of aj+a2in the direction of u
isp-L+Pz.
Having defined the projection of a vector in a given direction we may
obtain a simple geometrical interpretation for the dot product of two
vectors a and b. Represent a, b by the displacements OA, OB; let С be the
foot of the perpendicular from A to OB, and D the foot of the
perpendicular from В to О A. Then ОС is the projection of a in the direction of b, and
OD is the projection of b in the direction of a (see Figure 11.3).
191

FURTHER VECTORS
Now a.b = OA(OB cos Θ)
= OA.OD
and a.b = OB(OA cos Θ)
= OB. ОС, Fis· n·3
and thus the value of a. b is the product of the magnitude of either of the
vectors with the projection of the other in its direction.
Our proof of distributivity for dot multiplication over vector addition
has been long delayed but our result, which we shall state as a theorem,
now follows very readily and the results we have proved concerning
projections have an interest and importance in their own right and were
doubly worth pursuing.
Theorem 11.1. For any vectors a, b, c,
a.(b+c) = a.b+a.c.
Proof. Let the projection of b and с in the direction a be px and p2
respectively. Then
a.(b + c) = \я\(р1+р2)= |«|a+I«Ia
= a.b + a.c
(by distributive law for ordinary numbers).
Ex. 1. А, В have position vectors a, b referred to some origin O. If О A is
perpendicular to OB, show that a. b = 0. Is the converse result true ?
*Ex. 2. If a is the position vector of the point A, interpret the scalar product a. a.
*Ex. 3. Show that (a-b) .(a-b) = a.a + b.b-2a.b.
If a, b are the position vectors of the points A, B, interpret this equation (i) when
О A and OB are perpendicular; (ii) when О A, OB are not perpendicular.
Example 1. Prove that, if the sum of the squares of two opposite edges of a
tetrahedron is equal to the sum of the squares of
another pair of opposite edges, then the remaining /1\
pair of opposite edges are perpendicular. /\ \^
We are given that BZ_ I _ _ _\ D
AB2 + CD2 = ВС2 + DA2 XL^^^'^
and have to prove that С
ACLDB. Fig" UA
Take any origin О and let the position vectors of A, B, C, D be a, b, c, d.
Then, in terms of these position vectors, we are given that
(b-a).(b-a) + (d-c).(d-c) = (c-b).(c-b) + (d-a).(d-a).
192

1] THE DOT PRODUCT
Now
(b-a).(b-a) + (d-c).(d-c) = (c-b).(c-b) + (d-a).(d-a)
ob.b-2a.b + a.a + d.d-2c.d + c.c = c.c-2b.c + b.b + d.d-2d.a + a.a
ob.c + d.a = a.b + c.d
o(c-a).(b-d) = 0.
But c-a Φ 0, b-d Φ 0,
.·. AC1.BD.
Exercise 11 (a)
1. Showthata.a-b.b = (a-b).(a+b).
Interpret this equation geometrically and discuss the particular case |a | = |b|.
2. If a, b are non-zero vectors such that |a + b| = |a — b|, show that a and b
are perpendicular.
3. A is a fixed point with position vector a and с is a constant vector. If the
variable point Ρ has position vector r where г satisfies the equation
(r-a).(r-a) = c.c
what can be said about the position of P?
4. Prove that b + c = 0 and |a| = |b| o(a-b).(a-c) = 0.
Interpret this result as a geometrical theorem.
5. OXYZ is a rhombus of side α; Ρ is any point of О У (or О У produced). Prove
that OP. YP = XP2-a2.
6. ACBD is a straight line and О is a point not on the line. ΔΑΟΒ, /-COD are
both right-angles. If OA = a, OB = b, AC = x, CB = y, BD = z, find the
position vectors (relative to O) of С and D.
V*™**»1 CB.BD _ OB-
ACAD OA2'
7. ABCD is a tetrahedron in which ABLCD and ACL BD. Prove that ADA. ВС
and that AB' + CD2 = Ασ+BD2 = Αΰ2 + Βσ.
8. If each edge of a tetrahedron is equal to the opposite edge, prove that the
line joining the mid-points of any two opposite edges is at right-angles to each
of these edges.
9. Prove that, if two of the joins of mid-points of opposite edges of a
tetrahedron are at right-angles, the remaining edges are equal.
10. ABCD is a skew quadrilateral (that is, the vertices А, В, С, D do not all lie
in the same plane). The mid-points of AC, BD are P, Q respectively. Prove that
AB2 + BC2 + CD2 + DAi = AC2 + BD2 + 4PQ2.
193

FURTHER VECTORS [11
11. ABCD is a tetrahedron; X, Y,Z are the mid-points of AB, AC, AD and
P, Q, R are the mid-points of CD, BD, ВС. Prove that
AB^+DC + IPX2 = AC2 + BD2 + 2QY2= AD2 + BC2 + 2RZ\
12. ABCD is a tetrahedron and G is the centroid of the base BCD. Prove that
AB* + Aa + AD* = GB2 + Ga + GD2 + 3GA2.
2. THE DOT PRODUCT (CONTINUED);
COMPONENT FORM
If i, j, к form a right-handed set of orthogonal unit vectors, the following
equalities immediately follow from the definition of the dot product:
i.i = j.j=k.k = l, (1)
j.k = k.i = i.j = 0. (2)
Theorem 11.2. If л = ai\ + a2\+a3k and b = b1i+b2j+b3k,
then a.b = сцЪ^Л-аф2 +a3b3.
Proof: a.b = (a1i + a2i + a3k).(b1i + b2j+b3k)
= a1b1i.i+a2b2}.} + a3b3k.'k+a1bii. j+o^i.k
+а2Ь-1']Л + а2Ь3\.Ъ. + а3Ь1Ъ.Л + а3Ь21к..\
= a1b1+a2b2 + a3b3 by virtue of equations (1) and (2).
From Theorem 11.2 it follows that
|a|2 = a.a = al+a\+a\
, A. A . a.b aJ)-,+αφ2 +αφ3
and that ™°-]j[\W = M+dl + d!b№+l*+l*>·
Example 2. А, В, С have Cartesian coordinates (2, 3, 4), (-2, 1, 0), (4, 0, 2)
respectively. Find:
(i) cos ABAC;
(ii) the unit vector perpendicular to the plane ABC.
χ = AB =-4i-2j-4k,
у = AC = 2i-3j-2k
giving x.у =-8 + 6 + 8 = 6.
But x.y = |x||y| cos Z-ЯЛС
= V36V17 cos ABAC.
194

Now suppose u = li + mj + nk is a unit vector perpendicular to the plane
ABC. Then
2] THE DOT PRODUCT
Thus 6 = 6^17 cos ABAC,
.·. cos ABAC = 1/V17.
zk is a unit vei
Ju.x = 0,
lu.y = 0,
f-4l-2m-4n=0,
·· 1 2l-3m-2n = 0,
— 4m — 4n = 0
and so /:т:и = 1:2: —2, on solving for the ratios l:m:n. Thus
u = Ai + 2Aj-2Ak
where A must be so chosen that |u| = 1,
1,e' A2 + 4A2 + 4A2 = 1,
i.e. A = |
and so u = ii + fj-fk.
Ex. 4. If a = i-3j + k,b = 2i + 2j-k, find a .b.
Ex. 5. Show that the two skew lines
(x-l _y+l _ z+1
e perpendicular.
\x+l
I 1
Ex. 6. Find the unit vector perpendicular to the plane A (3, 1, 1), В (2, 5, 0),
C(-l, -1, 15).
Ex. 7. Deduce the two distance formulae of Chapter 3 using dot products.
Ex. 8. Prove vectorially that the angle between two diagonals of a cube is
arccosi.
3. FURTHER COORDINATE GEOMETRY OF
TWO DIMENSIONS
Throughout this section we shall suppose a right-handed set of axes Ox,
Oy is given, their directions being defined by unit vectors i, j. Л(Х> У-д,
Р&ъУъ) ··■ are given points with position vectors rl5 r2...; P(x, y) is a
general point with position vector r.
195

FURTHER VECTORS
[И
A line is specified given a point Рг lying upon it and its direction; let
us suppose its direction is defined by the unit vector u = /i+mj. Here
/ = cos 0J and m = cos θ2, where θχ and θ2 are the angles made by the
vector u with the axes Ox and Oy respectively (senses being taken into
account) (see Figure 11.5).
The line through Ρχ(χΎ, yj in the direction u thus has the equation
(see Figure 11.5, where ΘΎ φ \ή). Since cos θ2 = sin θ1} this equation may
be rearranged in the form
У-У1 =
ri(x~xi)
or у—У\ = tan Θ^χ—Xj)
agreeing with the form given in Chapter 3. (Notice, however, that the letter
m was used differently on that occasion.)
Given two lines, one through Рг in the direction щ = /ji + mj, the other
in the direction u2 = /2i+w2j, the angle φ between the two lines may be
found by computing the dot product Uj.u2
«l-«2 = l«ll |U2|C0S ?5
= cos φ (since щ, v^ are unit vectors)
and thus cos φ = l-J2+mxm2. (3)
For example, the lines 2x-y+l = 0, 3x + 2.y-3 = 0 may be put in the
χ _ y—\ x—l _ y_
1 " 2 ' -2 "3
or, equivalently, by making the denominators components of unit vectors
(P+m2 = 1)
χ _ y—\
l/V5-2/V5;
-2/V13 3/V13'

3]
FURTHER TWO-DIMENSIONAL GEOMETRY
The angle between the lines is thus given by
1 2 , 2 3
cos^=-V5-Vi3+75-Vi3
4
"V65'
Notice in particular that, from equation (3) the two lines are perpendicular
ol1l2 + m1m2 = 0.
Provided m1 φ 0, m2 Φ 0 this second condition may be written
But, as was noted above, /j/wj and l2jm2 are the gradients of the two lines;
we have thus regained, by an alternative method, the result of Chapter 3.4.
Fig. 11.6
It is frequently required to find the perpendicular distance from a point
to a line. Suppose we are given a line through P1(x1, уг) in the direction
u = /i + mj and a point P2(x2, y2). Let Q be the foot of the perpendicular
from P2 to the given line. Then the unit vector in the direction of QP2 is
- mi + /j (consider the dot product with u). Thus
P2Q = P^.C-mi + Zj)
= ([χ*-χι]ΐ+\η-3>Δϊ).(-ή+ΐί)
= -m(x2-x1) + l(y2-y1).
Although this gives the required perpendicular distance, it may not be a
very convenient form to handle: the equation of a line is generally given in
the form ax+by + c = 0 and we shall now deduce, as a theorem, the
perpendicular distance from the point P2(x2, y2) to this line.
197

FURTHER VECTORS [11
Theorem 11.3. The perpendicular distance from the point Ρ^х2, у2) to the line
ax + by + c = 0 is
ax2 + by2 + c
V(a2 + 62) '
Proof. The given line may be rewritten in the form
x+cja _ у
b -a
or, making the denominators components of a unit vector, in the
equivalent form , ,
x+cja у
blJ(a*+b*) -a/V(a2+62)*
Thus the unit vector defining the direction of the line is
b a .
U V(a2 + 62)! J(a* + b2)h
and a perpendicular unit vector is
,_ a . b .
U V(«2 + *2) V(a2+*2)J'
IfPifa, yx) is some (arbitrary) point of the given line, the perpendicular
distance, d, from P2 is given by
d = PjPa.u'
= a(*i-x2) . Ky-i-Уг)
V02 + Z>2) +V(a2 + *>2)'
But axx + byx =—с, since Рг lies on the line and hence
, = ax2+by2 + c
V(«2 + ^2) *
For example, the perpendicular distance from the point (1, 2) to the line
2*-5J+l=0is 2-10 + 1 _ 7
V29 V29'
(The significance of the negative sign is explained below; the magnitude of
the perpendicular distance is 7/^/29.)
Suppose now that the line joining OP2 cuts the line
ax+by+c = 0
at Pi (see Figure 11.7). Then, if OP1:P1P2 = 1 :A, the coordinates of Рг are
/ *2 _У»_\
\1+λ· 1+A/·

FURTHER TWO-DIMENSIONAL GEOMETRY
Fig. 11.7
But Pj lies on the line ax + by + c = 0 and thus
._^,
:= 0
1+AT1+AT
or ax2 + by2 + c = -Ac.
From this equation it follows at once that, if ax2 + by2+c and с have
the same sign, A is negative and, if they have opposite signs, A is positive.
Since с may be written as a.0 + 6.0 + c, we may summarize the result as
follows.
If, in the expression ax + by + c, the substitution of the quantities
(i) χ = 0, у = 0; (ii) χ = x2, у = y2 gives two values of the same sign, then
P2 and О lie on the same side of the line (for, in this case, Рг divides OP2
externally and so A < 0); if on the other hand, it yields values of opposite
signs, P2 and О lie on opposite sides of the line.
Ex. 9. Find the perpendicular distances of the point (2, — 1) from the lines
3x + 4y + 3 - 0 and 4x-3y-6 = 0. What can you deduce about the position of
(2, -1) relative to the two given lines?
Ex. 10. Sketch the two lines 3x+y-3 = 0,3x+y-6 = 0 and shade in the region
determined by the four inequalities л: > 0, у > 0, Зх+у-3 > 0, Зх+у-6 < 0.
Ex. 11. Find the equation of the bisector of the acute angle between the lines
χ - 0 and 5x+12j>-60 = 0. Find the incentre of the triangle formed by the
lines χ = 0, у = 0, 5x+ 12j>-60 = 0.
Ex. 12. a, b, care constants. Show that, as θ varies, the straight line
xcos#+.ysin0+acos0 + Z>sin0 + c = 0
touches a certain circle. Find the centre and radius of the circle.
Ex. 13. Find the equations of the lines parallel to Sx + 6y + 3 = 0, and distant
1 unit from it.
Exercise 11 {b)
1. Determine the projection upon (a) the χ axis, (6) the line 3x—4y+l = 0 of
the line segments joining the following pairs of points:
(i) (2,3), (4, 4); (ii) (-1,2), (1,5);
(iii) (-1, 2), (2, -3); (iv) {a,b\ {a + b,a-b).
199

FURTHER VECTORS [11
2. Find the cosines of the acute angles between the following pairs of lines:
(i) x-y + l = 0, 3x-y + 2 = 0; (ii) 3x + 2y+l = 0, x-2y-3 = 0;
(iii) 4x-y+l = 0, 3x + 4y = 0; (iv) ax + by = 0, ax-by = 0.
3. Determine the perpendicular distances between the following points and lines.
(Leave your answers in surd form.)
(i) (0, 0), x+4y-1 = 0; (ii) (-1, 2), 2x-3y-4 = 0;
(iii) (-2, -3), x-3y + 5 = 0; (iv) (h, -k), hx-ky + 2hk = 0.
4. Find the distances between the following pairs of parallel lines:
(i) x-2j>+1 = 0,x-2j> + 3 = 0; (ii) 2x+y-3 = 0, 4x+2y+3 = 0;
(iii) 5x-y-l = 0, 5x-y + 3 = 0; (iv) ax + by + a = 0, ax+by-b = 0.
5. Find the equations of the pairs of lines, parallel to the following lines, and
distant three units from them:
(i) 3x-4j>+1 = 0; (ii) 8x+6y + 5 = 0;
(iii) 5x+12j> + 2 = 0; (iv) 3x+2y+l = 0.
6. Draw rough sketches and shade in the areas determined by the following
inequalities:
(i) x> 0,y >0,y <2,2x+3y-8 <0;
(ii) χ > 0,y > 0, x-y-l < 0, Зх+у-7 < 0;
(iii) 2x-y > 0,x-2y < 0,2x-y + 2 > 0, x-2y+4 > 0;
(iv) \x+y\ < u\x-y\ < i;
(v) \x+y-l\ < 1; ^χ-^-ΙΙ < 1.
7. Find the equations of the bisectors of the acute angles between the following
pairs of lines:
(i) χ = 0,3x-4y = 0; (ii) 3x-4y-l = 0,4x-3y-2 = 0;
(iii) 5x+12j> + 3 = 0,6x-8y+3 = 0.
8. Find the equation of the image by reflection of the line x-y = 4 in the line
2x+y = 1. (London)
9. ABC is a triangle; A is the point (0, 3) and В is the point (-5, -2). The
orthocentre (meet of the altitudes) of the triangle is the point (- 1, 1). Find
(i) the coordinates of C; (ii) the area of the triangle;
(iii) the tangent of the angle ACB.
10. Show that the reflection of the point (α, β) in the line у = тх is the point
/(l-ffl2)a + 2W/g 2wa-(l-QA
\ 1 + w2 ' 1+w2 /'
A is the point (j>, q), В is the reflection of A in the line у = χ and С is the
reflection of Л in the line у = - χ. Find the coordinates of C.
Show that С is the reflection of A in the Xmzpx + qy = 0. (London)
200

3]
FURTHER TWO-DIMENSIONAL GEOMETRY
11. A triangle ABC lies wholly within the first quadrant and has an area of
Ц sq. units. The equation of one side is 2x- 5y + 23 = 0 and the vertices A and
В are (1, 5) and (3, 4) respectively. Find the equations of the other two sides, the
angle ABC, and the coordinates of the orthocentre of the triangle. (London)
12. Hf(x,y) = ax + by + c, discuss the changes in value and sign of f(X, Y),
when the point Ρ with coordinates (X, Y) moves in the x, у plane, in relation to
the line whose equation is/(χ, у) = 0.
Determine the smallest value attained by the expression
when χ and у vary subject
22a: + 11?-21,
э the simultaneous restrictions
3x + 4y> 12,
2x-y > 2.
4. FURTHER COORDINATE GEOMETRY
OF THREE DIMENSIONS
As a straight line in two dimensions may be completely defined by a point
and a gradient, so a plane may be defined, given a point lying on it and
the direction of its normal. Suppose that a plane passes through the point
A (position vector a) and is perpendicular to the directions defined by the
unit normal n. Then, if Ρ (position vector r) is any point of the plane,
AP = r-a
(Figure 11.8). But
AP.n = 0,
Λ (r-a).n = 0,
i.e. r.n = a.n. (4)
Equation (4) represents the vector equation of
the plane through A with unit normal vector n.
If Cartesian axes are taken in the directions i, j, к
and η = /ji + y + ^k, a = а^ + а^ + азк, г = xi+yj + zk, then (4) takes
the Cartesian form , , , , , , , , , , ,,4
1гх + 12у + 132 = 1^ + 12(12 +l3a3. (5)
Thus, equation (5) represents the Cartesian equation of the plane
through А{аъ а2, a3) with normal whose direction cosines are 1Ъ /2, /3.
Conversely,
r.n =p (^constant) (6)
represents a plane with unit normal vector n. For r.n is the length of the
projection of OP in the direction n, and, if this is constant for all Ρ, Ρ
clearly lies on a plane perpendicular to n. Thus, (using components),
ax + by + cz = d
201
Fig. 11.8

FURTHER VECTORS [11
is the equation of a plane. The unit normal vector is
(л/(а2+й2 + с2)) *+ Ца2 + й2 + с2))J+ W(a2 + ft2 + c2)) k
and it follows from equation (6), that the perpendicular distance from the
origin to this plane is the numerical value of
d
V(a2 + Z>2 + cT
Ex. 14. Write down the unit vectors perpendicular to
(i) x-2y+2z = 0; (ii) Ъх-y+z = 6; (iii) 3x + 2y+z = 2.
Ex. 15. Write down the equation of the planes through the given points and
perpendicular to the given vectors:
(i) (1,2,0), i-3j + k; (ii) (2, -1, -l),i-j + k;
(iii) (3,-4,2),2i+3j-5k.
Theorem 11.4. The perpendicular distance from the point Рг{хъ ylt Zj) to the
plane ax+by + cz + d — 0 is
ax^ + by-L + cz-L + d
V(a2 + 62 + c2) *
Fig. 11.9
Proof. Let P2{x2, Уъ zz) be any point of the plane ax + by + cz + d = 0 and
let N be the foot of the perpendicular from Рг to the plane (Figure 11.9).
If η is the unit vector perpendicular to the plane, then
Ρ1Ν=Ρ2Ρ1.τι
= [(χι-χύ'ι+ Οί-λ) i+ {ζι-ζύ Ч
' LV^ + ^ + c2){+ V(a2 + *2 + c2)J + 7(а2 + й2 + с2)кJ
_ а{хг - x2) + Ну,. -y2) + c(zx - z2)
V(a2 + ft2 + c2)
202

4] FURTHER THREE-DIMENSIONAL GEOMETRY
But ax2 + by2 + cz2 = —d, since P2 lies on the plane. Thus
PXN
ax1 + by1 + cz1 + d
Again, ax1 + by1 + cz1+d may be positive or negative, depending upon
which side of the plane Рг lies. The result is completely analogous to that
for the straight line.
We conclude this chapter by giving some worked examples of coordinate
geometry in two and three dimensions.
Example 3. Find the complete locus of a point which moves so that its
perpendicular distance from 8x —y + 18 = 0 is twice its perpendicular distance
from 7x + 4y + 6 = 0. (O & C)
LetP(x, y) be any point of the plane. If dlt d2 represent the perpendicular
distances from Ρ to the two lines
8£-jH-18 _ 7x + 4y + 6
dl_± V65 ' 2 V65 ■
The required locus is the totality of points Ρ which satisfy dx = 2c4; that is
if л 8х-.у + 18 Ux + Sy+12\
This is equivalent to
\(х'у)' V65 ~ V65 /
= {(х,у):2х + Зу-2 = 0} U {(дс, у) :22х + 7у + 30 = 0}.
The locus is thus the pair of straight lines
2x + 3y-2 = 0, 22x + 7y + 30 = 0.
4. Prove that the lines
x-3=y-2
1 1
ϊ - ο - ι
are skew, and find the perpendicular distance between them.

FURTHER VECTORS [11
The given lines may be written
x-3 y-2 z-4 ,
~T~ - ~Γ - ~T~ = л'
χ v+1 z-1
and so a general point Ρ of the first line has coordinates (A + 3, A + 2, ЗА + 4)
and a general point Q of the second line has coordinates (/*, - 1, μ +1).
The two lines are clearly not parallel, since their direction cosines are
different. Thus they are either skew or else they meet in a point. If they
meet in a point, we must have
A + 3 = μ,
λ+2 =-1,
ЗА + 4 = μ + l,
for some Α, μ. But solution of the first two equations gives A = — 3, μ = 0
and these values clearly do not satisfy the third equation. The lines are
therefore skew. Now
OP = (A + 3)i + (A + 2) j + (3A + 4)k,
OQ = /ti—j+O + 1) k,
.·. QP = (A-/*+3)i + (A+3) j + (3A-/* + 3)k.
But the directions of the two given lines are given by the vectors
i+j + 3k
and i + k.f
Thus QP is perpendicular to both lines
o(A-/i + 3)+(A + 3)+3(3A-/< + 3) = 0
and (A-/t + 3) + (3A-/t + 3) = 0.
Solving these two equations gives A = -1, μ = 1 and so the coordinates of
the end-points of the common perpendicular to the two lines are (2, 1, 1)
and (1, -1, 2) and the length of the common perpendicular
= Vt(2-l)2 + (l + l)2 + (l-2)2]
= V6.
t Since we are going to use the condition for perpendicularity, in which the right-
hand side of the dot product is zero, there is no need to make these vectors unit vectors
since this would only introduce a factor that would divide out.
204

4] FURTHER THREE-DIMENSIONAL GEOMETRY
Note: A more elegant method for finding the direction of the common
perpendicular to two skew lines will be described when the vector product
is introduced.
Example 5. Find the image by reflection of the point (11, -13, 8) in the plane
2x-3y+z+l = 0.
The vector 2i-3j+k is perpendicular to the given plane and so the line
through (11, -13, 8) perpendicular to the plane has equations
x-U =y+13 = z-8_
The coordinates of a general point Ρ of this line are
(2Λ+11, -3Λ-13, A + 8)
and Ρ lies in the plane
o2(2A+ll)-3(-3A-13) + (A + 8)+l = 0
о 14А + 70 = 0
о А = -5.
Thus, the reflection of (11, - 13, 8) in the plane is given by A = -10, i.e.
(-9, 17, -2).
Exercise 11 (c)
1. Prove that the vectors:
a = i+2j + 4k, b = 2i-3j + k
are perpendicular.
Find the angle between the vectors
c = 3i + j + 2k, d = i-2j-3k.
2. Find the equations of the planes through the given points which are normal
to the given vectors:
(i) (0,0,0), i-j + k; (ii) (1, 2, -3), 2i+j-3k;
(iii) (-1, 2, -4), 3i-k; (iv) (2,4, -1), 6i-3j-2k;
(v) (2, 3, 4), 4i + j - 3k; (vi) (a, b, c), ai + b\ + ck.
3. Find the cosines of the angles between the following pairs of lines. (Leave
your answers in surd form.)
W 2 -3 -1' 1 0 -2'
205

FURTHER VECTORS
W 1 _1 4 ·
(Ш>вг=
"»ί-ί-^τ· --'--δ·
4. Find the cosines of the angles between the following pairs of planes. (Leave
your answers in surd form.)
(i) 3x-y + z = 0, χ = 0; (ii) 2x-y = 0, x+y + z = 0;
(iii) 2x-j»-z+3 = 0, 3x-y + z-l = 0;
(iv) x+j>-4z+l = 0, 2x-3y + 4z+5 = 0;
(v) ax+by = 0, aj>-fe = 0;
(vi) ax + by-cz = 0, bx-cy + ax = 0.
5. Find the cosines of the acute angles between the following lines and planes.
(Leave your answers in surd form.)
© v-v-v-
w -1 0 2 '
x-2 j>-1 ζ
W 2 2 ~-Г
.. . jc-3 y-4 r+3
w 1 0 -1'
, -4 x~a _ У-Ь _ z-c
x+y + z = 0;
lx-y-z = 0;
3x-2y + 2z+4 = 0;
4x-3y-4z = 0;
y-z=0;
px+qy+rz = 0.
6. Find the unit vectors normal to the planes through the following sets of points,
and deduce the equations of the planes:
(i) (1,1, -1), (2, 3,1), (5, -1,-13);
(ii) (2, 6,1), (0, 3,1), (4, 0, -2);
(iii) (1,4,1), (2, 7, 2), (-3,0,-1);
(iv) (1,0,-1), (0, -4, -1), (3, 2, 2);
(v) (1,1, 2), (2, 2,3), (-2,1,11);
(vi) (0, ф, b/c), (с/а, 0, -а/с), ф/a, a/b, 0).
206

4] FURTHER THREE-DIMENSIONAL GEOMETRY
7. Find the coordinates of points where the line
x-2 _ y-3 _ z+1
1 2 ~ -2
meets the planes
(i) x-y-z = 0; (ii) 2x-y-z-6 = 0;
(iii) 3x+4y+2z-9 = 0; (iv) 2x+3y + z = 0.
8. Find the reflections of the following points in the given planes:
(i) (-l,7,S),Sx-y-z-10 = 0;
(ii) (5, -9, -6),2x-y + 3z-8 = 0;
(iii) (6,13, -3),4x-y-3z+l9 - 0;
(iv) (3,5,8),3x-j;-z+26 = 0;
(v) (-5, 11, 6), 5x-6y-2z-27 = 0;
(vi) (0,0,0), ax + by + cz + d= 0.
9. Find the feet of the perpendiculars from the origin to the lines
U 2 1 -1' K} 1 -2 1 '
(ϋο=-τ = ^ = νί αν)
x-3 _ j>-8_ z-3
10. Find the foot of the perpendicular from (7, -
11. Where does the line
x-2 = y-J z-2
1 ~ 3 " -1
cut the plane x—^+z = 0?
Find the image by reflection of the line in the given plane.
12. Find the image by reflection of the line
x-10 = y-4 = z-2
9 -1 ~ 0
intheplane3x-j»+2z-2 = 0.
13. Find the image by reflection of the line

FURTHER VECTORS [11
14. Prove that the following pairs of lines are skew:
χ _ у-Ъ _ _z_
ϊ ~ ~T~ ~ ~i':
x-5 _ j>-8 _z-1
3 ~ 7 ~ -1"
Find the direction ratios of the common perpendicular and determine the
shortest distance between the two lines.
15. ABCD is a rectangle and О is a point on the normal at С to the plane of
this rectangle. AB = a, AD = b, and CO = h. Ρ is a point on АО and the line
through Ρ in the plane AOB which is perpendicular to АО meets AB at M.
If AP = x, show that
PM = xj(b*+h*)/a; AM = xj(a2 + b2 + h*)la.
Prove that the cosine of the acute angle between the planes OAB and О AD is
a6/V[(e2 + A2) (b2 + A2)]. (London)
16. Define the scalar product of two three-dimensional Euclidean vectors u, v.
Deduce an expression for the angle between the vectors
u = «1i+«2j+M3k,
ν = Cii+cjj+сзк,
in terms of ult u2, u3, vlt v2, v3.
A regular tetrahedron has vertices O, A, B, C, where О is the origin, and A,
В, С have position vectors with respect to О given by
OA=-i+j,
OB = ai+ b],
ОС = pi+q] + rk.
Find the numerical values of a, b, p, q, r given that a > 0 and /· > 0. (SMP)
17. Show that the line L given by
x+l = y-1 = z+l
5 ~ 3 ~ 2
is the intersection of the planes
3x-5j>+8 = 0 and 2y-3z-5 = 0.
Show that every plane containing the line L can be expressed in the form
λ(3χ-5γ+8) + μ(2γ-3ζ-5) = 0.
How should Α, μ be chosen in order to ensure that the plane is perpendicular
totheplane 5x-y+2z=-21

4] FURTHER THREE-DIMENSIONAL GEOMETRY
Hence or otherwise obtain the equation of the orthogonal projection of the
line L on the plane
y 5x-y+2z =-2,
expressing your answer in vector form, χ = ia+ b. (M.E.I.)
18. Prove that the equation of the plane which cuts off intercepts a, b, с on the
axes of x, y, ζ respectively is
The foot of the perpendicular from the origin to a plane isP(2, -1, 2). Find
the equation of the plane. If the plane meets the axes of x, y, ζ at А, В, С
respectively, prove that AP is perpendicular to ВС, and find the angle between AP and
CP. (J.M.B.)
19. Two planes, itx and щ, have equations
2a:+j>+z=1 and Ъх+y-z = 2,
respectively. Prove that the plane π3, which is perpendicular to nlt and contains
the line of intersection of щ and щ has the equation
x-lz = 1.
Points Ρ and Q lie on the planes щ and π3 respectively and the line PQ is
perpendicular to Щ. If the coordinates of Ρ are (-2, 4, 1), find the coordinates
of Q. Determine the angle between the line PQ and the perpendicular from Ρ to
the line of intersection of the three planes. (J.M.B.)
20. The straight line whose equations are
-2 1 2
meets the plane x+ 2y-2z = 8 at B, and A is the point (2, 0, - 1) on the line.
The foot of the perpendicular from A to the plane is С Find
(i) the coordinates of В and C;
(ii) the length of AC.
Show that the sine of the acute angle between В A and ВС is %.
The line А С is produced to D so that А С = 2CD. Find the coordinates of D.
(J.M.B.)
Miscellaneous Exercise 11
1. The perpendicular distance from the origin to a straight line / is of length
ρ and makes an angle a, with the positive χ axis. Prove that the equation of the
line may be taken in the form
χ cos a.+y sin a = p.
What is the equation of the parallel line through the point iVX, y^ ? Deduce
the perpendicular distance from Px to the given line.
2. Find the equations of the straight lines which bisect the angles between the
straightlines 3x_4j;_11 = o, 12*+5,-2 = 0. (O&C)

FURTHER VECTORS [11
3. Find the equation of the bisector of the acute angle between the lines
4x + 3y-12 = 0, 12* + 5y- 60 = 0. (O & C)
4. Sketch the triangle formed by the lines
3x-4y-4 = 0, 12x-5j> + 6 = 0, 7x + 24j;-56 = 0,
and verify by calculation and reference to your sketch that the point (1,1) is the
centre of the inscribed circle. (O & C)
5. A straight line is drawn through AQi, k) and P(x, y) so that AP makes an
angle θ with the positive direction of the χ axis, and AP = r. Prove that
Three vertices of a square are E(2, 2), F(-2, 2) and G(-2, -2); a straight
line of gradient J is drawn from A{- 3, - 1) to meet FG at Ρ and EF at Q. Use
the formulae in the first part of the equation to find the length of Ρ β.
Find also the radius of the circle with centre at the origin to which APQ is a
tangent. (O & C)
6. Points A and В lie on the same side of a line /, С is the optical image of A in
/, and ВС meets / at P. Prove that / is a bisector of ΔΑΡΒ. Given that / has
the equation x + 3y = 5, and that the coordinates of A and Л are (1, -2) and
(—11, 2) respectively, verify that these points lie on the same side of /. Prove
that the coordinates of С are (3, 4), and find the coordinates of P. (O & C)
7. Prove that the equation of the circle, centre (12, 13), radius 7, is
x2+y2-24x-26y + 264 = 0.
Two sides of a triangle are
5x-12j;+5 = 0, 12x-5j>+12 = 0
and the incentre is the point (12, 13). Prove that the third side touches the circle
x2+y2-24x-26y+264 = 0. (O&C)
8. Prove that the perpendicular bisectors of the sides of a triangle are
concurrent (at the circumcentre) and that the altitudes of a triangle are concurrent (at
the orthocentre).
9. Prove that the circumcentre S, the centroid G and the orthocentre Я of a
triangle ABC have position vectors (referred to any origin O) which satisfy the
relation
h + 2s-3g = 0.
Deduce that SGH is a straight line and determine the ratio SG: GH.
Prove further that the mid-point, N, of the line SH is the centre of the circle
through the mid-points of the triangle ABC. What is the radius of this circle?
10. О ABC is a tetrahredron in which О A is perpendicular to ВС and OB is
perpendicular to AC PQR is a triangle such that A is the mid-point of QR, В
is the mid-point of RP and С is the mid-point of PQ.

4] MISCELLANEOUS EXERCISE 11
Taking О as the origin, express the position vectors of P, Q, R in terms of those
of А, В, С Hence prove that OP = OQ = OR.
If D is the foot of the perpendicular from О to the plane ABC, prove that D
is the circumcentre of the triangle PQR.
11. ABCDA'B'C'D' is a cube with edges AA', BB\ CC, DD' and diagonal AC.
Show that B'C is perpendicular to the plane ABC. Find a line in the figure
perpendicular to the plane ACC. What is the angle between the planes ABC
and ACC"]
12. In a tetrahedron PQRS, the edges PQ, RS are perpendicular to the faces
PRS, PQS respectively; L is the mid-point of PS and Μ is the mid-point of QR.
Prove that PQ} + RS* = QR2-PS\
PM= SM= iQR,
4LM2 = PQ2 + RS2. (O & Q
13. Prove that there is one and only one line which joins two given skew lines
and is perpendicular to each of them.
Two fixed skew lines AL, BM have a common perpendicular AB, and the angle
between them is Θ. Prove that
LM2 = AL2 + BM*-2AL.BMcose.
Prove that, if θ = in and the points L and Μ vary on the fixed skew lines so
that LM is constant, the locus of the mid-point of LM is a circle. (O & C)
14. The common perpendicular of two skew lines / and /' meets them at A and A'
respectively. Points Ρ and P' are taken on / and /' respectively so that AP+ A'P'
is constant, where the sense of AP and A'P" is taken into account.
Show that the locus of the mid-point Μ of PP' is a straight line m, and
describe its relation to / and /'.
If AP— A'P' is constant, show that the locus of Μ is another straight line m',
and describe the relation of m and m'. (O & C)
15. Find the reflection of the line
3 1 ~ -1
in the plane x-y-z + 2 = 0.
16. Prove that, given two skew lines and a point О not lying on either of them,
that just one transversal may be drawn through О to cut each of the lines.
Prove that the lines _
x-4 y + 3 z + 2
are skew and find the equation of the common transversal that passes through
the origin.
17. Obtain the equation of a plane in the vector form
n.r = ρ (η a unit vector),
explaining precisely what you mean by each of the three symbols n, r, p.
211

FURTHER VECTORS [11
Prove that the length of the perpendicular to the plane from the point S with
position vector s is |n.s—p|; and find the position vector t of the mirror image
Τ of S in the plane—that is, of the point Τ such that TS is perpendicular to the
plane and bisected by it. (M.E.I.)
18. Referred to a given system of rectangular coordinates in space of three-
dimensions, the points А, В have coordinates (1, 0,0), (-1, 0,0) respectively.
A variable point Ρ has coordinates (x, y, z). Write down the direction-cosines
of PA, PB and prove that, if θ is the angle between them, then
2 θ = (x2+y2 + z2-l)2
[(x2 + y* + z* +1) - 2x]. [(x2 + y* + z* +1) + 2x]'
Deduce that, if Ρ is restricted to lie in the plane ζ = 0, then
C^+y-l)2 = [(x2+j>2-1)2 + 4j>2]cos20
and find the equations of the two circles on which Ρ must lie if the angle θ is kept
constant. (M.E.I.)
19. Two planes x-3y + 2z = 2,
2x-y-z= 9,
meet in the line /. Find the equations of
(i) the plane through the origin which contains /,
(ii) the plane through the origin which is perpendicular to /.
Find also the coordinates of the reflection of the origin in /. (C.S.)
20. Find the value oik such that the line joining the points (- 2, k, - 9), (2,1, 7)
intersects the line joining (- 2, -4, 4), (7, 2,1).
What are the coordinates of the point of intersection ?
21. A regular tetrahedron ABCD has the face ABC in the xy plane, the origin
is the centre of that face, the vertex A is at the point (1, 0, 0) and the vertex D
is on the positive half of the ζ axis. Find the coordinates of В, C, D and of the
centre of the tetrahedron, and the direction ratios of the normals to the four
faces.
Hence or otherwise show that, if any line makes angles α, β, γ, δ with the faces
of a regular tetrahedron, then
sin2 a + sin2 /3+ sin2 γ+ sin2 δ = f.
22. Prove that the equation of the straight line through the given point A,
position vector a, and in the direction of the unit vector b is
г = a + ib,
where t is the distance from A of the variable point Ρ of the line whose position
vector is r.
Prove also that the equation of the plane through the point С (position
vector c) whose normal is in the direction given by the unit vector d can be
expressed in the form
r.d = c.d.
212

4] MISCELLANEOUS EXERCISE 11
The plane through the point C(l, 2, 4) has normal in the direction
d = Ai + ^j + Hk.
Find the length of the shortest distance from the point A (3, 4, 5) to the plane.
(M.E.I.)
23. To find the position of an underground rock layer a number of vertical
borings are made at points on horizontal ground which form a coordinate grid.
Results are as follows, the unit of distance both horizontally and vertically
being 300 m:
Grid-point (0,0) (2,0) (0,2) (2,2)
Depth 0-270 0-225 0162 0117
Show that these results are consistent with this part of the rock layer being a
plane. Find the (x, y) equation of the line in which this plane when produced
would meet the ground, and show that the plane would be inclined at about 3i°
to the ground.
What would you conclude if a boring at (1,1) gave a depth of about 0-18?
(M.E.I.)
24. Calculate the shortest distance between the line of intersection of the planes
x-8y + 2z+9 = 0, x-2y-z+6 = 0
and the line of intersection of the planes
2x+y + 8z-l2 = 0, x-y + z-6 = 0;
and show that the line which cuts both these lines at right-angles passes through
the point (-4,12,8).
25. Find the coordinates of the mirror image of the point (p, q, r) in the plane
ax + by + cz + d = 0.
A ray from the origin is reflected successively in the planes
x+y-z+l = 0
and x-y + 2z-l = 0,
and then passes again through the origin. Find the points at which it meets the
two planes.
26. Prove that the lines
^±1 _ l±l - £±3 *-l_J, + 3_ ζ
2 ~ 3 ~ 1 ' 3 ~ 1 ~ -2
do not intersect.
Find the equation of the plane through the origin which contains the first
line, and find also the direction cosines of the line through the origin which meets
both lines. (Oxford Mod.)
27. The line x\l = y/m = zjn is reflected in the plane ax + by + cz + d — 0. Show
that the equation of the resulting line is
(fl2 + 62 + c2)x + 2arf {a2 + b2 + (f)y + 2bd
(cP + b + c2)l-2a(al+bm + cri) ~ {ai + bi + <F)m-2b(al+bm + cri)
= (fl2 + 62 + c2)z + 2crf
~ (а2 + Ъ2 + с2)п-2с(а1+Ът + спУ
213

FURTHER VECTORS [11
Hence, or otherwise, find the equation of the plane such that the angle between
it and the plane lx + my+nz = 0 is bisected by the plane ax + by+cz = 0.
(Oxford Mod.)
28. Write down the equations of the axes Ox, Oy, Oz. What is the equation of the
plane containing Ox and the point (a, b, c) ?
The roof of a rectangular house consists of four inclined planes, each sloping
upwards at an angle of 45° to the horizontal. What is the angle between two
adjacent faces of the roof?
29. ABCDA'B'C'D' is a cubical box, with faces ABCD, A'B'C'D' and edges AA',
BB', CC, DD'. Ε is the point on BB' such that BE = \EB'\ F is the mid-point
of CC. Find the angles between
(i) the line А'В and the plane AEF;
(ii) the plane А'ВС and the plane AEF.
30. Two lines through the origin have unit direction vectors /ji + wj + ^k and
/2i + w2j+H2k. Prove that the locus of points equidistant from the two lines is a
pair of planes and deduce the equations of the straight lines which bisect the
angles between the given lines.
31. Two straight paths on a plane hillside are at right-angles, and make angles θ
and φ respectively with the horizontal. If the hillside itself makes an angle a, with
the horizontal, prove that .,„,.,.
sin2 θ + sin2 φ = sin2 a.
Prove also that the acute angle between the projections of the paths on a
horizontal plane is . ^ ,4
arccos (tan θ tan φ).
32. A line of slope, in an easterly direction, of a plane hillside is inclined at an
angle a to the horizontal. A line of slope of the hillside in a southerly direction
is inclined at an angle β to the horizontal. Prove that the actual inclination, Θ,
of the hillside to the horizontal is
θ = arctan (tan2 a, + tan2 #)*.
A vertical pole of height h is placed on top of the hill. Show that the angle φ
subtended by it at a point distant a down the line of greatest slope through the
foot of the pole is given by
Y ~ asec0+Atan0"
Find φ, if h = 16, a = 36, a = 30° and β = 45°. (London)
33. Let G be the centroid of the acute-angled triangle ABC of circumradius R.
Showthat AG°+B(?+CG>>2R>.
34. If О is the point on AB such that АО = 20B, and if Ρ is any point, prove that
APt+lBPt-SOF2
is independent of the position of P.
What is the locus of a point which moves so that the sum of the squares of its
distances from the vertices of an equilateral triangle is constant ?

12. Further trigonometry
1. FORMULAE FOR COMPOUND AND
MULTIPLE ANGLES
In many applications of trigonometry it is essential to be able to deal with
expressions such as sin(^+5) or sin2^ in terms of the trigonometric
functions of the simpler angles A and B. Angles such as A + B are called
compound angles; in particular, those like 2A are called multiple angles.
Ex. 1. By taking A = В = 30°, show that sin (A + Β) φ sin A + sin В in general.
Is it possible to find values of A and В such that sin A + sin В cannot be expressed
as the sine of a single angle?
We shall now establish the fundamental compound angle formulae for
sin (θ + φ), cos (θ + φ). The proof depends upon the fact
that we may express a unit vector OP in the form —
cos θ\ + sin Θ],
where i, j are two perpendicular unit vectors and θ is the
angle between i and OP measured anticlockwise from i;
see Figure 12.1 which represents a circle of unit radius, „
with Fl& Ш
OA = i, OB = j, OP = p, where ρ = cos 0i + sin 0j.
Similarly, if OQ makes an angle 90°+ 0 with i (that is, if LPOQ = 90°,
measured anticlockwise), then
OQ = q = cos(9O° + 0)i + sin(9O° + 0)j
= —sin <9i + cos θ\.
These expressions for p, q hold for angles of any size and either sense.
Theorem 12.1. For any two angles:
(i) cos (<? + 0) = cos <9cos0-sin<9sin0;
(ii) sin (θ + φ) = sin θ cos φ + cos θ sin φ;
(iii). cos ((9-0) ξ cos θ cos φ + sin θ sin φ;
(iv) sin {θ — φ) = sin θ cos φ-cos θ sin φ.
Proof. Take points APRBQ as a unit circle, centre O. OA = i, OB = j;
OP = p, OQ = q; OR = r. ΔΑΟΡ = Θ, LPOR = φ (see Figure 12.2).
215

FURTHER TRIGONOMETRY [12
The proof of (i) and (ii) is effected by expressing the unit vector г (a) in
terms of i and j directly and φ) in terms of ρ and q and hence in terms of i
and j. пи
Thus (а) г = cos {θ + φ) i + sin {θ + φ) j
and ф) г = cos φγ> + sin 0q.
But ρ = cos <9i + sin Θ]
and q = — sin #i + cos Θ]
(see remarks preceding this theorem).
Substituting for ρ and q in φ) this gives
(с) г = cos 0(cos 0i + sin <9j) + sin 0(-sin (9i + cos Θ])
= (cos θ cos φ — sin θ sin 0) i + (sin θ cos φ + cos 0 sin φ) j.
But, in a plane, the expressions for the components of a vector г in the
two directions i and j are unique (see Chapter 2) and so, comparing (a)
and (c),
(i) cos {θ + φ) ξ cos θ cos φ — sin θ sin 0;
(ii) sin (θ + φ) = sin θ cos 0 + cos θ sin 0.
Formulae (iii) and (iv) now follow immediately, on writing — φ for φ and
recalling that sin (-φ) = -sin φ, cos {—φ) = cos 0,
(iii) cos {θ — φ) = cos 0 cos φ + ύηθ sin 0;
(iv) sin {θ — φ) = sin 0 cos φ — cos б sin 0.
Ex. 2. Express sin 15° and cos 15° in surd form by writing 15° = 45° -30°.
Ex. 3. Express cos 165° and sin 105° in surd form.
Ex. 4. If sin χ = tt and sin у = f find sin (x+y) and cos (x+y)
(i) when χ and j> are both acute angles;
(ii) when* is acute and j> is obtuse;
(iii) when χ and у are both obtuse angles.
Ex. 5. Express cos(^ + 5)-cos(^-5)in terms of sin A and sin Л.
The results of Theorem 12.1 are important and should be committed
to memory. They give rise immediately to the following results which we
shall also state in the form of a theorem:
Theorem 12.2. For any angles θ, φ for which all of the expressions appearing
(vi) tan (0-0) =
216

I] COMPOUND AND MULTIPLE ANGLES
(vii) sin 20 ξ 2 sin 0 cos 0;
(viii) cos 2Θ = cos2 0-sin2 θ = 2 cos2 »-1ξ1-2 sin2 Θ;
2tan0
(ix) tan 2
l-tan2(
Proof, (v) The right-hand side is not defined if θ or φ = (2k + \)%n\
neither side is defined if θ + φ = (2k +1) \π. For any other values of θ and φ
/a , л\ sin θ cos φ + cos θ sin φ , . . ,.λ , ....
tan (θ + φ) = !-, ■ a ■ · by formulae (ι) and (n);
v T' cos 0cos0-sin0sm0 J ww»
divide top and bottom of the fraction on the right-hand side by cos θ cos φ
(a non-zero expression, by the restrictions on θ and φ).
(vi) As for (v) by using (iii) and (iv);
(vii) set θ = φ in(ii);
(viii) set θ = φ in (i), and recall that sin2 0 + cos2 Θ = 1;
(ix) set θ = φ in (ν).
*Ex. 6. What values of θ and φ must be excluded in (a) formula (vi); and (b)
formula (ix) ?
Example 1. Find an expression for cos 75° in surd form.
Method (i). „„ . ,... ,,„.
w cos 75° = cos (30°+ 45°)
= cos 30° cos 45°-sin 30° sin 45°
Method (ii).
Thus
(We reject the negative square root since cos 75° > 0.)
Example 2. Prove the identity
V3 ι
" 2 V2"
V3-i
2V2 '
;os 150° :
-iV3
4-2V3
8
cos 75°
1 1
"2V2
= 2 cos2 75°-1,
= 2 cos2 75°-1,
= cos2 75°.
V3-i
2V2 ■

FURTHER TRIGONOMETRY
The r.h.s.
L.H.S. =
=
is defined,
1 si
by the restriction
η 2Θ
cos 20 cos 20
1-sin 20
cos 2 0 '
(-4^й> (sin'* + ,
cos2 0 -sin2 (9 v
cos2 0 + sii
1 -sin 2Θ
ι2 0-2 sin 0 cos 0
cos 20
(sin2 0 φ cos2 0, since 0 φ (2л +1) £тг)
The formulae we have proved are also useful in dealing with
combinations of inverse trigonometric functions, as is shown in the next example.
Example 3. Find the value of arctan 2 +arctan 3.
Write χ = arctan 2; у = arctan 3, then
\π < χ < %π, \π < у < \π and so \π < χ+y < π,
f tan j
Since
Thus
fr<
x+y < π,
" 1-tanx
2 + 3
1-6
= _i.
this gives
x+y
arctan
= π + arctan (
= π-1π.
2 + arctan 3 =
tan у
-i)
fr.
Example 4. Find angles 0 between 0° and 360° satisfying the equation
5 cos 0-6 sin 0 = 3.
Observe that 5 and 6 are two sides of a right-angled triangle whose
hypotenuse is V61. Thus, dividing both sides of the equation by ,/61,
cos α cos 0—sin α sin 0 = -^y,
where tan α = f, i.e. α = 50° 12'. Also, using tables,
vfi = cos67°25'

COMPOUND AND MULTIPLE ANGLES
No.
3
61
v'61
cos67° 25'
Log.
0-4771
1-7853
0-8926
T-5845
0-sin 50° 12' sin Θ = cos 67° 25',
cos (50° 12' + 6) = cos 67° 25',
50° 12'+ 0= 67° 25' or 292° 35'
0 = 17° 13' or 242° 23'.
Examples. Find maximum and minimum values for the expression
у = sin 20 + 4 cos 20.
This may be solved by a process similar to that
employed in Example 4. Thus
, = V17(^sin2tf + Acos2,)
= V17 sb (20 +a), Fig. 12.4
where tan a = 4. But — 1 ^ sin φ ^ 1,
.·. -V17 <J < V17·
Observe that, since tan 76° x 4, у attains its maximum when
20 + 76° χ 90°,
i.e. θ χ 7°, and its minimum when 20 + 76° χ 270°, i.e. θ χ 97°.
Ex. 7. Find the maximum and minimum values of the following expressions:
(i) sinx-3cosx; (ii) 2 cos χ + sin χ; (iii) 5 + 3 sin χ + 4 cos χ;
(iv) 1 — sinx-cosx; (v) 1/(2— sin * + cos x).
Ex. 8. Solve the following equations, giving all solutions lying between 0° and
360° inclusive:
(i) cosx — sin л: = 1; (ii) 3 cos x+4 sin χ = 5;
(iii) sin χ + 2 cos χ = 1; (iv) 12 sinx — 5 cos χ = — 4.
Ex. 9. If tan Л = i, and tan В = |, find tan (A + B) and cot (Л-Я).
£*. 10. If x+y = in and tan χ = i find tan y.

FURTHER TRIGONOMETRY [12
Ex. 11. If cos χ = i, find cos 2x. If, furthermore, 0 < χ < π, find sin 2x.
Ex. 12. Find the maximum and minimum values of 1 + sin χ cos x.
Ex. 13. Find the values of χ in the interval 0 < χ < 2π which satisfy the
equation cos 2x = 3 sin x— 1.
Ex. 14. Find the value of tan &r without using your tables.
Ex. 15. Find sin (2 arcsin x) and cos (2 arccosy).
Ex. 16. By writing χ = arctan \ andj> = arctan f and considering tan (x+y),
show that arctan i+arctan ^ = arctan f.
Ex. 17. Express arcsin f + arcsin A in the form arcsin x.
The multiple angle formulae can be used to derive three very useful
expressions, for sin 0, cos 0 and tan 0, in terms of tan \θ. Their value
derives from the fact that, with their help, a trigonometric expression may
be written as a rational function^ of a single variable t = tan \θ. The reader
will find that they have applications to problems in calculus as well as
ordinary trigonometry.
Theorem 12.3. If tan \θ = t, then
(i) άηθ**;
00
(iii)
Proi
cos θ
tan0
>/· 0)
fiO
(iii)
t A
~l+t2'
-Д-, (M*i).
sin θ = 2 sin £0 cos £0
_ 2 tan \θ
~ sec2i#
2i
1 +12''
cos 0 = cos2 -^0 —sin2 \θ
_ l-tan2i0
sec2 ^0
1-i2
l+i2;
tan 0 = T372. by dividing (i) by (ii).
rational function is defined as the rai
tio of two polynomials

COMPOUND AND MULTIPLE ANGLES
Example 6. Prove the identity
l+sin0
cos θ
1 +
l+t2
_(!±0!
tan@ir+№ (?*(2n+l)№,
1 _ - , ^n multiplying numerator and denominator by (1 +t2),
= rzv 1 + '*° (^*(2«+i)i^),
tanjg + tanjg
l-tan|7rtani^
" 1-f*
etani?r Φ 1.
-.= 3>
Example 7. Sofoe ί/ze equation of Example 4, tuzng ί/ге substitution
tan £0 = i.
5 cos 0-6 sin 0 = 3,
. 5(1 -f2) 12i_
·' l+f
5 —5i2—12i = З + Зг2,
4t2+6t~l = 0,
, _ -6+v(36+i6)
8
= K-3±Vi3)
= 0-151 or -1-651,
.·. \θ = 8° 35' or 12Г 12',
0 = 17° 10' or 242° 24'.
Note. Some discrepancy arises between the results obtained here and in
Example 4. Rounding-off explains the error in the larger answer; the
smaller answer is seriously affected in Example 7, since the subtraction
— 3 + V13 loses a significant figure.
Ex. 18. Express sin x + cos χ in terms of t, where t = tan \x.

FURTHER TRIGONOMETRY [12
2. SUM AND PRODUCT FORMULAE
The expansions derived for cos (A ± B) and sin {A ± B) lead to a further
series of identities between trigonometric functions which prove of great
value in the manipulation of trigonometric expressions. The identities
which we are about to deduce may be divided into two groups:
1. The expression of products of trigonometric functions as sums.
2. Conversely, the expression of sums of trigonometric functions as
products.
We shall state and prove these two sets of identities as two theorems.
Theorem 12 A.
(i) sin A sin В з i[cos (A - B) - cos {A + B)];
(ii) cos A cos В з £[cos (A + B) + cos (A - B)];
(iii) sin A cos В з i[sin (A +B) + sin (A-B)];
(iv) cos A sin В з £[sin (A +5)-sin (A-B)].
Proof, (i) R.H.S. = i[(cos A cos 5+sin A sin B)
— (cos A cos B— sin A sin B)]
= £.2 sin A sin -S
= L.H.S.;
(ii), (iii), (iv) are all proved in a similar fashion.
Theorem 12.5.
rs ■ . χ, ^ . A+B A-B
(ι) sin Λ+sin 5 3 2 sin—=—cos—=—;
,..ч · , · D „ Л+Я . A-B
(n) sin Л — sin В 3 2 cos—=—sin—=—;
Proof.
... . ΓΑ+Β Α-ΒΊ . ΓΑ+Β Α-ΒΊ
(ι) R.H.S. = sin j^—2 '—2—J "*"Sln \_~~2 2~~J*
= sin A+sin В
= L.H.S.
(ii), (iii), (iv) are all proved in similar fashion.
222
by Theorem 12.4(iii)

SUM AND PRODUCT FORMULAE
Example 8.IfA + B+C= 180°,prove that
cos A +cos 5+cos С — 1 +4 sin у sin j si
L.H.S. = cos A + [cos 5+cos C]
=
=
=
=
1-2
1-2
1-4
R.H.S.
sin у sin у—cos
. АГ B+C
sin-y cos—2
sin у sin
5 . С
2S1H2
5-СП
2 "J
B-
-cos—2
Example 9. Sofoe i/ze equation
sin Зх + sin л: = cos л:
gz'wng a// roots in the range 0° < χ < 360°.
sin 3x + sin л: = cos л:
о- 2 sin 2x cos χ = cos χ
ocosx(2sin2x-l) = 0
=> either cos χ = 0 or sin 2x = £.
(1) cos χ = 0 gives χ = 90° or 270°.
(2) sin 2x = i gives 2л; = 30° or 150° or 390° or 510°
and χ = 15° or 75° or 195° or 255°.
Thus χ = 15° or 75° or 90° or 195° or 225° or 270°.
Note. A very common error made in equations of the type in Example 8
is to forget the solution cos χ = 0.
Ex. 19. Express in factor form:
(i) sinx + sin3x; (ii) cos 4x+cos 2x;
(iii) cos 3x — cos x; (iv) sin x+cos x;
(v) COS {χΤΤ— Χ) — COS (ϊ7Γ + Χ).

FURTHER TRIGONOMETRY [12
Ex. 20. Prove the identity sin χ + sin 2x + sin 3x = sin 2x(2cosx+l). Hence
find all values of χ in the interval 0 =g χ < 2π satisfying the equation
sinx + sin 2x + sin Ъх = О.
*Ex. 21. Solve the equation sin χ + sin 2x = 0 by expressing the left-hand side in
factor form, giving all solutions in the interval 0 =g χ =g 2π. Solve the equation
again in the following alternative ways:
(i) by rewriting it as sin 2x = sin (-*), etc.;
(ii) by rewriting it as sin χ + 2 sin χ cos χ = 0.
Exercise 12(a)
1. Evaluate in surd form:
(i) sin 75°; (ii) cos 105°; (iii) tan 105°; (iv) cot 75°.
2. Prove that sin ЗА = 3 sin A-4 sin3 A and that cos ЗА = 4 cos3 A- 3 cos A.
By considering the equation sin 2A = cos ЗА, express in surd form:
(i) sin 18°; (ii)cosl8°; (iii) sin 54°; (iv) tan 108°.
3. Prove that the following identities hold for all angles A, B for which the
expressions appearing are defined:
(i) sin (A + В) sin (A - Β) ξ cos2 В - cos2 A;
(ii) cos (A - B) - sin (A + Β) ξ (cos A - sin A) (cos В - sin B);
..... cos (Л + B) + cos (Л - B)
{П1) sin(A + B)-sb(A-B)=COtB;
(iv) sin04+.B)sin Л ξ cos A-cos (A + В) cos В;
, л „. sec Λ sec Л cosec Л cosec В
(ν) sec (Λ + В) = ;
cosec A cosec В—sec A sec Л
(vi) tan Z4(cot A -tan A) = 2;
(vii) cos 4Λ ξ 4(cos4 Л + sin4 A) — 3;
(viii) cot A -cot 2Λ = cosec 2Λ;
sin A + sin В
cos Λ + cos В
sin Λ + sin 2.
cos Λ +cos 2Λ + cos ЗА ~
4. If А, В, С are the angles of a triangle, prove the following identities:
(i) sin Λ + sinjB + sin С = 4cos — cos — cos —;
(ii) cos2v4 + cos2fi+cos2C+4coSv4cos.Bcos C+l ξ 0;
224

2] SUM AND PRODUCT FORMULAE
(iii) cos2 A +cos2 В + cos2 С ξ 1 - 2 cos A cos Л cos C;
(iv) tan Λ + tan В + tan С = tan Л tan Л tan С (provided that the triangle is
not right-angled).
, giving values between 0° and 360° in-
(i) 2 sin χ = sin (χ+ 45°); (ii) 2 cos χ = cos (x + 60°);
(iii) sin(30° + x) = cos (15° -x);
(iv) cos (x- 30°) + sin (x- 20°) = cos x.
6. Solve the following equations for x, giving values of χ between 0° and 360°
inclusive:
(i) cosx + sinx + V2 = 0; (ii) sinx + V3 cosx-1 = 0;
(iii) 3 sin x-4 cos x + 2 = 0; (iv) 3 sin x + 2 cos x-3 = 0.
7. Solve the following equations, giving all solutions between 0° and 360°:
(i) sin 2x + sin2 χ = 0;
(ii) cos2x—3cosx = 4;
(iii) cosx + cos2x + cos3x = 0;
(iv) cos x— sin χ = cos 2x.
8. (i) Prove that, when cos φ = — \, the value of the expression
sin θ + sin {Θ + ф) + sin {θ + 2ф)
is zero, whatever the value of Θ.
(ii) Prove that if
ιη(χ + α),
= \ sin a,
stating what restrictions you impose on the value of sin α for a solution to be
possible. Hence find the values of χ between 0° and 360° which satisfy the equation
5 tan χ = tan (x + 30°). (O & C)
9. Prove the identities:
... sec2fl + 2tanfl_
(0 (cosTTiin^r2 = SCC °;
(ii) 4 cos θ cos (Θ +120°) cos (0 -120°) ξ cos 30. (O & C)
10. Prove that cos [(n + 2)6} = 2 cos θ cos [(л +1) 0] - cos И0.
Hence express cos 30 and cos 40 in terms of cos 0, and prove that
cos 50 ξ 16 cos6 0-20 cos3 0 + 5 cos 0. (O & Q
,, „ , sin 50+ 2 sin 30 +sin 0
11. Prove that ъ ^ = cot 0
cos 0—cos 50

FURTHER TRIGONOMETRY [12
and hence, using the formula sin 30 = 3 sin 0 — 4 sin3 0, show that cos 36° is
a root of the equation 8χ*_8χ2 + χ+ί = 0. (0&C)
and, without using tables, deduce that
tan22i° = V2-1
(ii) Prove that cos 30 = 4cos30 — 3 cos0 and deduce that
cos3 θ + cos3 (Θ + 60°) + cos3 (0 +120°) = f cos (0 + 60°) + i cos 30. (O & C)
13. (i) If a = sin 0+ cos ф and b = cos 0+ sin 0, prove that
cos (θ-φ) = 2αΖ>/(α2 + Ζ>2)
and hence, or otherwise, find tan (θ—φ) in terms of a and b.
(ii) Express 11 sin2 χ + 12 sin χ cos χ +6 cos2 χ in the form a + b sin (2x-0)
where a, 6 and 0 are constants to be determined.
Hence prove that
2 =g 11 sin2 x+ 12 sin χ cos x + 6 cos2 χ =S 15. (Cambridge)
14. Prove that sec χ + tan χ = tan(i?r+ix)
and express in a similar way (i.e. as the tangent of an angle)
(i) sec χ -tan χ; (ii) cosecx-cot x. (Cambridge)
15. Prove
2 sin 0(cos 2Θ + cos 40+cos 60) = sin 70 - sin 0.
Deduce that cos ^π + cos fπ + cos fw = — \.
Show also that cos ^π+cos ^π + cos %π = + \. (Cambridge)
16. Express the function cos χ+ 2 sin χ in the form R sin (x + a) where it is
positive and 0° < α < 360°. State the values of R and a. Hence, or otherwise,
find the values of χ in the range 0° to 360° inclusive which satisfy the equations
(i) 2cosx + 4sinx= 1; (ii) cos x(cos χ + 2 sin x) = 1. (Cambridge)
17. If и - cos 0+ sin 0, ν = cos 0- sin 0, prove that
cos 20 = uv, sin 20 = u2-1, и2 +1>2 = 2.
Prove that, if 0 is a root lying between —180° and 180° of the equation
cos20+asin20 = cos0-sin0,
then it is either 0° or 90° or it is a root of the equation
(a + l)cos0+(a-l)sin0 + a = 0.
Solve the equation
cos 20 + 7 sin 20 = cos 0 - sin 0
for values of 0 between -180° and 180°. (O & C)
226

2] SUM AND PRODUCT FORMULAE
18. Find all the pairs of values (θ, ψ) lying between 0 and 2π that satisfy the
equations cos θ + cos φ = cos ^
sin θ + sin φ = sin in. (O & C)
19. Prove, by induction or otherwise, that
cosa+cos (a+/?) + cos (μ + 2β) + ... +cos [oc+(n- 1) β]
= cos [α+Кя-1) β] sin £л/в cosec \β,
provided that β is not a multiple of 2π.
A regular polygon has η sides of length a; the vertices of the polygon are
Vu V2,..., Vn. Show that
(V1V2)" + (V1V3y+... + (V1 VnY = ina*cosec2 (π/ri). (Cambridge)
3. THE SOLUTION OF TRIANGLES
One of the most important applications of elementary trigonometry is to
the solution of triangles; that is, the determination of the remaining sides
and angles of a triangle some of whose sides and/or angles are given. For
consistency, we employ the following notation: ABC is a triangle with sides
ВС = а, С A = b, AB = с and the radius of the circumcircle of Δ ABC is R.
A triangle may be solved uniquely in the following three cases:
(i) given the three sides;
(ii) given two sides and the included angle;
(iii) given two angles and a side.
A triangle may also be solved (but not necessarily uniquely) in the
following case:
(iv) given two sides and a non-included angle.
The Cosine and Sine Rules of elementary trigonometry are employed in
the solution, the Cosine Rule in cases (i) and (ii), the Sine Rule in cases (iii)
and (iv). Before illustrating their use, we give the proofs for the benefit of
the reader unfamiliar with them.
The Cosine Rule.
In the Δ ABC, a2
b2
c2
227
Fig. 12.6
b2 + c2~2bc cos A,
c2 + a2~2cacosB,
a2 + b2~2abcos С

FURTHER TRIGONOMETRY
[12
Proof. Take A as origin and let the position vectors of В, С be r1; r2
respectively. (Thus \тг\ = с, |г2| = b and 1^ —r2| = a.) We have
ki-r2|2 = (^-г^.^-Гг)
= Га.Гг + г^^-г^.Гг
= #4 с2-26c cos A.
The other two forms are proved similarly, using a new origin.
Ex. 22. Prove the vector identity
2Γι.(Γ2-Γι) = Γϋ.Γϋ-Γχ.Γι-Ο-ϋ-Γύ.Ο·,!-!·!)
and show that, with the notation used above, this reduces to the second form of
the Cosine Rule.
The Sine Rule.
In the AABC a = b = с =
sin A sin В sin С
Proof. Draw the diameter BD.
\iLA is acute (Figure 12.7(i)), LBDC = LA.
If LA is obtuse (Figure 12.7 (ii)), LBDC = 180°- Ζ A.
In both cases
^- = 2*
sin A
and the result follows by symmetry.
(i) (ϋ)
Fig. 12.7
Ex. 23. Show that the area, Δ, of the Δ ABC is given by the formula
Δ = \bc sin A.
Write down the two similar expressions for Δ and deduce the Sine Rule in the
form
a _ b _ с
sin A sin В sin С
Example 10. In Δ ABC, ВС = 6 cm, CA = 4 cm, AB = 5 cm. Find the
angles of the triangle.
228

3] SOLUTION OF TRIANGLES
Clearly we must use the Cosine Rule to begin with. We select the angle B,
since this is the smallest angle, and the tables of cosines are marginally
more accurate for smaller than larger angles.
16 = 25 + 36-60 cos LB,
cos LB = 0-75,
LB = 41° 24'.
By the Sine Rule
sin A
~6~~
sin LA =
LA =
sin 46° 24'
2
= 0-9919(5),
= 82° 42'.
By the angle sum property of a triangle
LC-
= 55° 54'.
Example 11. Discuss the solution of a triangle in which А, а, с are given.
(This is often called the ambiguous case, since it is sometimes possible
to draw two triangles.)
в В
A X A Cx X C2
(0 Oi)
Fig. 12.8
Construct the point X such that LAXB = 90°, LB AX = LA.
if c sin A
с sin A > a, sin С = > 1,
a
and no triangle can be drawn (Figure 12.8(i)).
If с sin A = a, LC = 90° a unique right-angled triangle can be drawn.
If с sin A < a < c, sin С = с sin A/a < 1 and two angles С may be
found, one acute and one obtuse (see Figure 12.8 (ii)); two triangles can be
drawn.
Finally, if a > c, then A > С and the obtuse value for С must be
rejected (see Figure 12.8(ii) again).
*Ex. 24. Interpret the four congruency conditions for triangles in terms of the
solution of triangles by trigonometry.

FURTHER TRIGONOMETRY [12
4. SOLUTION OF PROBLEMS, PARTICULARLY
IN THREE DIMENSIONS
We recall that the angle between two planes is equal to the angle between
the normals to the two planes and that the angle between a line and a plane
is equal to the angle between the line and its projection in the plane. It is
important to remember, too, that bearings are always given in a horizontal
plane.
Many problems are best done by setting up coordinate systems and
applying the methods of Chapters 2 and 11. The reader must learn to
develop a flexible attitude towards problem solving and be prepared to try
several techniques in a search for the simplest approach.
Example 12. A hillside faces due north and is inclined at 20° to the horizontal.
A path up the hill has a bearing of 120°. Find the angle the path makes with
the horizontal.
Fig. 12.9
ABCD is a vertical rectangle; ADEFa horizontal rectangle. The hillside
is represented by BCEF; AF points due north, FE due east. FC represents
the path.
We have LAFB = 20°, LAFD = 60°, and we have to calculate LDFC.
Let LDFC = θ, ΑΒ = h units.
In AAFB,
In AAFD,
230
F D
Fig. 12.10
AF = h cot 20°.
FD = AF sec 60° = h cot 20° sec 60°.

41 SOLUTION OF PROBLEMS
In ADFC, tan θ = hjDF = tan 20° cos 60°
= 0-1820,
θ = 10° 19'.
Example 13. Prove the theorem of Apollonius that, in any triangle ABC with
median AM, Авг + AC2 = 2AAP + IBM2.
An aircraft flying on a constant course and at constant height with speed
V is observed from a station on the ground at times 0,t,2t to have elevations
α, β, γ respectively. Prove that
V = AV(cot2 a-2 cot2 /? + cot2 γ)Ιφ.
If the bearings of the aircraft from the station at times 0 and 2t are θχ
and θ2 respectively, determine the course of the aircraft.
В Μ Α
Fig. 12.11 Fig. 12.12
B' VI Μ' VI A1
Fig. 12.13
Let LAMB = θ (Figure 12.11).
In AAMB, AB2 = ^M2+5M2-2^M.5Mcos Θ.
In AAMC, AC2 = AM2 + MC2-2AM.MC cos (18O°-0).
But BM = MC and cos θ = -cos (180°-0) whence the theorem of
Apollonius, by addition.
AB represents the course of the aircraft, Α, Μ, Β being its position at
times 0, t, It. The projections of Α, Μ, Β on the ground are Α', Μ', Β',
231

FURTHER TRIGONOMETRY
and О is the observation point. We are given that AM — MB =
that ΔΑΟΑ' = <x, ΔΜΟΜ' = β, ΔΒΟΒ' = у.
In ΔΟΑΆ, OA' = h cot a.
Similarly OB' = h cot у; ОМ' = /г cot /?.
In Δ A O'B', h2 cot2 α + /г2 cot2 у = 2/г2 cot2 β + 2¥*ί2
by Apollonius's Theorem, whence
F = /zV(cot2 a-2 cot2 /? + cot2 у)/?л/2.
Furthermore, if 0 is the bearing of the course of the aircraft
LA'OB' = вх-вг; LB'A'O = φ~θ1
and we have, by applying the Sine Rule in ΔΟΑ'Β'
2Vt _ h cot у
sin(^-^2) ~ sin (0-0)'
/г cot у sin (#! — 02)
sin (0-00 =
A,/2V(cot2 a-2 cot2 /? + cot2 y)'
cotysin^!-^) Ί
,V(2 cot2 a - 4 cot2 /? + 2 cot2 y) J
■D
Exercise 12(b)
1. Two points Л and С on the bank of a straight river are 120 metres apart.
It is observed that a point A on the opposite bank is such that the angle ABC
is 72° 15' and the angle ACB is 38° 30'. Find the width of the river, correct to the
nearest metre. (O & С Ό')
2. In AABC, s = Ца + Ь+с); deduce from the Cosine Rule that
If the area of the triangle is Δ, use this result to prove
(i) sin I = Ур'У"0] ; GO Δ = V№"a) (s- b) (s- c)].
3. In the AABC, a = 14, b = 15, с = 13. Calculate
(i) the area of the triangle;
(ii) sin A;
(iii) the radius of the circumcircle. (O & С Ό')
4. Prove that,
0)
(ii)
232
til
. A~
tan —
in any triangle ABC,
sin A — sin В
sin A + sin B'
В a-b С
- = a^bCOt2-

41 SOLUTION OF PROBLEMS
In a triangle ABC, a = 13-41 cm, b = 9-63 cm and LC = 34°. Find the size
of the angles LA and LB.
If in a triangle a = 5, b = 4 and cos (A — B) = ii, prove that cos С = i
andthatc=6. (O & C)
5. A straight river is 80 m wide. A man on one bank observed that the angle of
elevation of the top of a tree directly opposite him is 18° 20'. He walks 60 m
along the bank. Find the angle of elevation of the top of the tree from his new
position. (O&C'O')
6. In a triangle ABC, a = 20 m, b = 28 m, с = 32 m. Prove that LB = 60°.
From the points А, В, С which are on level ground, the top of a flagstaff has
the same angle of elevation, 30°. Calculate the height of the flagstaff.
(O&C'O')
7. A snow-slope is a plane inclined to the horizontal at an angle of a. A man on
skis traverses this slope in a straight line which makes an angle β with the
horizontal. Show that the angle θ which his path makes with the line of greatest
slope of the plane is given by the equation
cos θ = sin β cosec a.
If a = 25° and β = 20°, calculate the size of the an gle Θ. (O&C'O')
8. Two adjacent sides of a roof, whose horizontal bases meet at right-angles,
slope at 30° and 45° to the horizontal. At what angle do the roofs intersect?
9. A right pyramid, vertex O, stands on a square horizontal base ABCD;
AB = 2a and the height of the pyramid is h. Express the sines of the following
angles in terms of A and a:
(i) the inclination of OB to the horizontal;
(ii) the inclination of a slant face to the horizontal;
(iii) the angle between the faces OAB, ODC;
(iv) one-half of the angle between the faces OAB, OBC. (O & C)
10. ABCD is a regular tetrahedron and Mis the mid-point of the edge CD. Find
the angle between the plane ABC and
(i) the plane ABM; (ii) the line AM.
11. From the top of a cliff the angle of depression of a ship, steaming on a
constant course at 12 km per hour, is 15° and its true bearing is 75° north of west.
Two minutes later, the angle of depression is 12° and the true bearing is due west.
Find
(i) the height of the cliff (to the nearest metre);
(ii) the ship's course (to the nearest half-degree).
12. A and В are two points on one bank of a straight stretch of a river, Ρ is a
chimney exactly opposite A and 20 m from the other bank. The angles of
elevation of the top of the chimney from A and В are α and β respectively. Calculate
the width of the river to the nearest metre, given that
a = 45°, β = 30° and AB = 200 m.
233

FURTHER TRIGONOMETRY [12
Prove that, if the angle of elevation of the top of the chimney from a point C,
midway between A and B, is γ, then
4 cot2 7 = 3 cot2 a+cot2 β
whatever the height of the chimney, the width of the river, and the distance
AB may be. (O & Q
13. In the quadrilateral ABCD, AB = 13 cm, ВС = 20 cm, CD = 48 cm,
LBCD is 90° and ABAC = ADBC. Without using tables
(i) prove that cos ABAC = ts;
(ii) prove that cos LACB = f;
(iii) find the area of the quadrilateral by adding the areas of the triangles
ABC and ACD. (Cambridge)
14. A right pyramid has vertex V and rectangular base ABCD. AB = 4 cm,
ВС = 6 cm and the height of the pyramid is 8 cm. Find
(i) the angle a sloping edge makes with the base;
(ii) the angle the face VAB makes with the base;
(iii) the angle between two adjacent sloping faces.
15. From a point Ρ in a horizontal plane a man observes the summit S of a
mountain to be due north at an elevation Θ. When the man has walked a
distance la on a bearing α east of north to a point Q in the horizontal plane, he
observes that the elevation of S from Q is again Θ. If h is the height of S above
the horizontal plane containing Ρ and Q, show that h = a tan θ sec <x. When
the man has walked a further distance a in the same plane and in the same
direction to a point R he observes that the elevations of S from R is φ. Show that
cot2 φ = (3cos2a+l)cot20
and that the distance RS is
a(sec2 α sec2 θ + 3)}. (Cambridge)
16. An observer situated at a point О in a horizontal plane observes two other
points Ρ and Q. The point Ρ is in the horizontal plane containing О on a bearing
α west of north from O. The point Q is situated due north of О at an angle of
elevation β as observed from O. If OP = OQ = r, show that the length / of
the straight line PQ is given by
/2 = 2r2(l-cosacos/?).
Hence, or otherwise, show that cos LPOQ = cos α cos β.
If α = 60°, β = 30° find the length of the arc PQ of the circle which passes
through Ρ and Q and has its centre at O. Give the answer in terms of r correct
to two decimal places. (Cambridge)
17. ABC is an equilateral triangle of side 1 m marked out on level ground.
Three vertical posts are driven in at the vertices; AP is of height a m; ВQ is of
height b m; CR is of height с m (a > b; с > b). The line PQ meets the ground at
U; the line RQ meets the ground at V. Prove that
BU = JL·, BV= *

41 SOLUTION OF PROBLEMS
If / = 2, a = 6, b = 3, с = 4, calculate the length of the perpendicular from
Л to the line UVand hence find the inclination of the planePQR to the horizontal.
(O&C)
18. A tower stands on a level plane. The inclinations of its top from three points
А, В, С in the plane are oc, β, γ. The points А, В, С are in a line which does not
go through the foot of the tower and AB = p, ВС = q. Prove that the height h
of the tower is given by
/z2[p(cot2 a-cot2/0)+9(cot2 α-cot2β)] = pq(p+q). (O & C)
19. In the tetrahedron О ABC, ВС = а, СА = b and AB = c;
LBOC = LCOA = LAOB = 90°.
Find the angle between the planes OAB and ABC.
Miscellaneous Exercise 12
1. Prove that arctan | + arctan { = }π and that
arctan i + arctan £ + arctan | = \π.
2. Solve the equations:
(i) arctan i+ arctan χ = arctan f;
(ii) 2 arcsinOc-x) = 3 arccos (x+y) = π.
3. Two circles with centres О and С meet in Ρ and Q. The radii of the circles are
a and b and the angle CPO is a. Prove that the angle between the common
tangents of the circles is Θ, where
(a-b)2 cot2 iff = 4ab sin2 ice. (O & C)
4. Find the set of values of χ which satisfy the inequality
2 sin x+sin 2x > 0. (SMP)
5. Under what circumstances is it true that
arctan χ + arctan ν = arctan; ?
1-xy
Find values of x, у for which the relationship above is (i) true, (ii) untrue.
6. Prove that, in any triangle ABC,
_a 6__ с
sin A sin В sin С '
In the triangle ABC the angle Л is a right-angle and О is a point inside the
triangle at which all the sides subtend the angle 120°. If θ is the angle CBO, prove
that .„
e_ c+*J3
If the angle С = 30°, show that CO = 2AO. (O & C)
7. Show that 4 arctan |— arctan гт^ = ίπ (a result commonly known as
Machiris Formula).

FURTHER TRIGONOMETRY [12
8. Find all pairs of angles x, у such that 0 < χ < π, Ο < у < π which satisfy
the simultaneous equations
sin χ sin у = i(V3— 1); cos χ cos у = i(V3 + 1).
9. A man stands facing the rectangular front of a building and is in the same
plane as one of the ends of the building. The elevation of the nearer top corner
A is α and of the further top corner В is β. The man walks towards the vertical
edge through В until the elevation of В is also a, and he finds that he has walked
a distance a. Show that the height of the face of the building is
a sin a sin /3/sin (α—β)·
Show also that its length is afsin (pc+fi)/sin (а-Д)]*. (O & Q
10. Find the maximum and minimum values of the expression α sin (9+6 cos Θ.
By making the substitution t — tan \θ, deduce the condition for the existence
of real roots of the quadratic equation
(b+c)t*-2at + (c-b) = 0.
11. An aircraft is observed flying on a constant course γ east of north at a
constant height. When its true bearing is θ west of north, the angle of elevation
is a, and when its true bearing is φ east of north, the angle of elevation is β. Prove
that γ is given by . , ,.„*/,
_ sin0 tan «+sing tan/g
~ cos φ tan a-cos θ tan β'
Prove that, if θ = φ, the angle of elevation δ, when the true bearing is north,
is given by Ш1 s = i(tan а + шд sec A (O & Q
3 tan x — tan3 χ
12. Prove that tan 3x
1-3 tan2л:
provided both sides of the identity are defined.
Deduce that the three roots of the cubic equation
ί3-3ί2-3ί+1 = 0
are tan -fen, tan fen, tan |π.
13. Sketch the graph of the function defined by
f(x) = |sinx + cos*|.
For what values of χ in the interval 0 =g χ =g π does |sin x+ cos
14. Find the maximum and minimum values of the expression
sinx(sinx+cosx).
15. Eliminate χ and у between the equations
sinx+sin^ = a,
cos x+cos у = b,
x+y = a,.
236

41 MISCELLANEOUS EXERCISE 12
16. The wave-train well away from a ship is modelled by the equation
z = asin[(x+.y-tf)/p]
where ζ is the height of the sea's surface above the mean horizontal plane in
which (x, y) are Cartesian coordinates, a is the maximum height of the waves, с
is the fixed speed, and ρ a fixed length. Sketch a diagram showing an airman's
view of the waves, indicating the lines of the crests and troughs and the direction
in which the waves are travelling; and prove that the wave velocity and distance
between successive waves are сД/2 and ττρ^Ι respectively.
The wave train of another ship, given by
ζ = α sin [(χ—у— ct)/p]
is superimposed on the other one. Prove that on the lines у = (N+ i) πρ, where
N is an integer, the sea is undisturbed. (SMP)
17. Find the range or ranges of values of с such that the simultaneous equa-
sec<9 + cosec$S = с
are satisfied by real values of θ and φ.
Obtain the general solutions of these equations when с = б£.
237

13. Matrices 1
1. INTRODUCTION
Any pair of simultaneous equations in two unknowns, χ and y,
{ax + by = c,
dx + ey=f
is completely specified if we know
(i) the coefficients on the left-hand side, which we may write as the
rectangular array
СЭ
and (ii) the two numbers on the right-hand side, which we may write as
the rectangular array
Ex. 1. Solve the simultaneous equations whose coefficients are given by the
rectangular array .
and whose right-hand sides are given by the rectangular array
The answer is the pair of values 4 (for x) and - 3 (for y), which can be expressed
as an array
l-з)'
as we saw in Chapter 2.
Such rectangular arrays of numbers are called matrices; in the example
above they exhibit the known quantities in a pair of simultaneous
equations but they may be used in other contexts too, to display information.
Suppose 23 boys take examinations in Mathematics (M), English (E)
and French (F). In each examination there are five grades A, B, C, D, E.
Then the fate of the boys may be summarized in the following 3x5 matrix
238

(read 'three by five matrix'; that is,
columns): . „
INTRODUCTION
matrix with three rows and five
С D Ε
7 6 3\
8 6 2.
8 7 0/
Ex. 2. In the example just quoted, describe what information is represented by
each of the following :
(i) the 3 χ 1 matrix (or column-vector)
(iii) the 1 x 5 matrix (or row-vector)
(3587 0).
More generally, any rectangular array of m rows and η columns of ele-
ments is called an r
nxn matrix:
Mi <ha
/ <h\ <ha
^13
<hb
... aln
... din
\aml am2 атЪ
■ amn
We may, if we so wish, denote the whole matrix by a single capital letter A
(printed in bold-face type; in script, write a capital A and underline). The
element а^ in the z'th row and yth column, is called the i, jth element or
entry of the matrix. A real matrix is a matrix all of whose elements are
real numbers.
2. LINEAR TRANSFORMATIONS
If we are given a pair of axes in a plane, any point Ρ may be located by
its coordinates (x, y), so that OP = xi+y}. Suppose we now associate
with each point P(x, y) a unique point P'(x', y') such that
For example, the point P(l, 2) gives rise to P'(5, -1); the point Q{-2, 3)
gives rise to g'(4, - 5). Such an association of points is a mapping, or func-

MATRICES 1 [13
tion, of the set of points in the plane into itself and is often referred to as a
transformation of the plane into itself. We may specify our mapping by the
2x2 matrix
(! -Э
which we call the matrix of the given transformation. For the present,
such a matrix is to be regarded simply as an inert array of coefficients
defining a transformation. In Section 3 we shall consider combinations of
transformations and the resulting matrices; in Section 4, rules for combining
matrices will be developed and the inert arrays which we have at present
will come to life. Finally, in Section 5 matrices, with their new-found
vitality, will be used to illuminate the concept of a geometrical
transformation.
We may also regard the transformation as mapping the vector OP into
the vector OF and the vector OQ into the vector OQ'. If we call the
transformation Γ, then we write
Γ(ΟΡ) = OP'
and T(OQ) = OQ'.
Notice carefully that Γ(ΟΡ) is a vector: the position vector ofP'. Г(ОР) is
called the image of OP; P' is the image of Ρ and may be written T(P).
The transformation defined above has an important property: it maps
the vector OP+OQ into the vector sum of Γ(ΟΡ) and T(OQ). Let us
demonstrate this property first with the points Ρ and Q above :
OP = i + 2j, OQ=-2i + 3j.
Let OR = OP+OQ
so that R is the point (-1, 5) and OR = -i + 5j. By the definition of T, R'
has coordinates (-1 +2.5, -1-5), i.e. (9, -6) and so
OR' = 9i-6j.
But OP' + OQ' = (5i-j) + (4i-5j)
= (9i-6j)
= OR'.
Now let Ρ be any point (Als kj, Q any point (/z2, k2) and let R^, k3) be
the point such that 0R = OP + OQ,
or h3i + k3\ = (A1 + A2)i + (fc1 + A:2)j.

2] LINEAR TRANSFORMATIONS
then T(OR) = T(h3i + k3j)
= (h3 + 2k3) i + {h3 - k3) j, by the definition of T,
= (h1 + h2+2k1 + 2k2)i + (h1+h2-k1-k2)i
= Pi + 2kd i + (Ai - kd j] + P2 + 2*2) i + (A2 - *2) j]
= Γ(ΟΡ) + T(OQ), again using the definition of T.
The result we have just proved for the transformation Τ can be
generalized. If S(/z4, fc4) is the point whose position vector is given by
OS = AOP+/<OQ,
where Α, μ are any numbers, so that /z4 = λ^+μη^ kt = λ^+μΐ^, then
r(os> = T{Ki+Ki)
= (/z4 + 2jfc4) i + (A4 - Jt4) j, by the definition of T,
= (ЛЙ!+/</г2 + 2A^ + 2/4*2) i + (Λ/zj +/i/z2 - A/q -/<fc2) j
= λ[(η1 + 2^) i + ihi-кг) j]+/*[(A2 + 2*2) i+ (*,-**) j]
= λΤ(ΟΡ)+μΤ(00).
The property which we have just demonstrated for the transformation
holds for a wide class of transformations, called linear transformations.
A linear transformation Γ is a transformation such that the image of the
vector Ax +/*y is the vector sum of A times the image of χ and μ times the
image of y. In symbols, Γ is a linear transformation if, for any vectors x, у
and any numbers Λ,„ τ{λχ+μ^ = λτ{χ)+μΤ^
*Ex. 3. By writing χ in the form x+ 0, show that, for any linear transformation
T, ДО) = 0.
An example of a mapping, T, that is
particularly easy to visualize geometrically is that in
which the image P' of Ρ is obtained by a half-
turn about the origin (see Figure 13.1). The
coordinates of P'(x', /) in terms of P(x,y) are given
by
ПАУ)
*> Fig. 13.1
and thus the matrix of this transformation is

MATRICES 1
[13
To see that Γ is a linear transformation, consider the points P(//1; kj
and QQh, k2) and suppose that
OR = AOP+/<OQ,
where OR = h3 i + k3 j = (A^ + /i/z2) i + (Akx+/tfc2) j,
then T(OR) = lX.hsi+k3j)
= -h3i-k3\, by the definition of T,
= - (Λ/zj+A) i - (A&!+M2) j
= A(-A1i-fc1j)+H-A2i-/c2j)
= A7T[OP)+/<r(OQ).
Thus, Γ is a linear transformation.
*Ex. 4. Write down the matrix for the transformation
x' = kx,
у = ky.
This transformation is called the enlargement transformation, or dilatation
transformation. Draw a diagram and consider the effect on the points (2, 3),
(-1,2) when к is (i) 2, (ii) \. Can you suggest a reason for the name?
*Ex. 5. Prove that the enlargement transformation is linear.
*Ex. 6. The function/:R -* R defined by f{x) = ax+b is a linear function (that
is, its graph is a straight line). Show that, regarded as a mapping of points of the
χ axis into points of the χ axis, it is not a linear transformation.
Ex. 7. The function /: R -+ R defined by f(x) = x2 maps the χ axis into itself.
Prove that/is neither a linear function nor a linear transformation.
Example 1. The transformation Τ maps the vector OP into the vector OP'
where
|OP| = |OP'| and LPOP' = <x,
a fixed angle measured in the anticlockwise sense. Find the matrix for Τ
and verify that Τ is a linear transformation.
Suppose OP makes an angle β with the χ axis
(see Figure 13.2). Writing |OP| = |OP'| = r we
have
x' = r cos (α +β) = (r cos β) cos α - (r sin β) sin a,
y' = r sin (ρί + β) = {r cos/?) sin a + (r sin/?) cos α С ^
and the transformation is given by Fig. 13.2
x' = л: cos α-y since,
y' = χ sin α + j cos a.
242
■ϊ(χ,ν)

2] LINEAR TRANSFORMATIONS
The matrix of the transformation is therefore
(cos α — sin a\
sin α cos a/
To prove that Τ is linear, consider P(/z1; kj, Q(h2, k2), R(h3, k3) where
OR = AOP+/<OQ,
Г((Ж) = T{h3\ + k3\)
= Qi3 cos <x-k3 sin a) i + (h3 sin a + ka cos a) j, by the equations
for Τ defined above,
= [(λ/jj +/</z2) cos α _ (Mi +M2) sm °0 >
+ [(Mi +Мг) sin α + (Afcj +/<fc2) cos a] j
= A [(/zj cos a - fcj sin a) i + (/zj sin a + кг cos a) j]
+/*[(/z2 cos a - fc2 sin a) i + (h2 sin a + k2 cos a) j]
= λΤ(ρΡ)+μΤ(00)
and the result is complete.
3. LINEAR TRANSFORMATIONS
AND THEIR MATRICES
Suppose we have a linear transformation Τ whose matrix A is given by
Since i is the position vector of the point 7(1, 0) and j is the position vector
of the point /(0, 1), we have
r(i) = fli + 4 T(j) = bi+dj.
Recalling that vectors may be written in column form, this result shows us
that the first column of A, regarded as a vector, is the position vector of the
image of I and the second column of A, regarded as a vector, is the position
vector of /.
Ex. 8. Prove the converse of the result above that, if
T(i) = ai+cj and T(j) = bi + dj,
then the matrix of Γ is A where
243

MATRICES 1 [13
(The results above show us that we are entitled to talk of A as the
matrix of a given linear transformation Τ only if we take i, j as our base
vectors: to put it another way, if we change our coordinate system, we
change the equations which define the linear transformation. We shall
continue in this chapter, unless specifically stated otherwise, to assume that
our base vectors are i and j.)
Since any vector χ may be expressed in the form
χ = Ai+/ij,
knowledge of the effect of Γ upon i and j enables us to predict the effect of
Τ upon x; for _, ч _,v , „
T(x) = Τ(λι+μι)
= λΤφ+μΤφ.
Now suppose we have two linear transformations S, Τ whose matrices
are respectively A and W where
If we define a new transformation U, called the sum of S and Τ by the
equation ^ = 5(χ) + Γ(χ)
it may be shown that
(i) U is a linear transformation;
(ii) U has matrix , , , , .
v ' /a + w b+x\
[c+y d+z)·
For we have
(i) i/(Ax + /ty) = S(Ax+/ty) + Τ(λ\ + /ty), by definition of U,
= λΞ(\)+μΞ(γ) + λΤ(χ)+μΤ(γ), since S, Τ are linear,
= А(5(х) + Г(х))+К5(у) + Г(у))
= λυ(χ)+μυ(γ), by definition of U,
and U is a linear transformation.
Again, (ii) to find the matrix of U, we have simply to find i/(i) and C/(j).
U® = S(\) + T®, by definition of U,
= (ai + c]) + (wi+y'})
= (a+w)'\ + (c+y) j.
Similarly Щ) = φ + χ) i + {d + z) j
and the result (ii) follows.
244

MATRICES OF LINEAR TRANSFORMATIONS
and Ρ is the point (1, 1) sketch in a diagram the effects of S, Τ and U upon the
point P.
*Ex. 10. The transformation L is defined by
L(x) = cT(x).
(L is sometimes written as cT.) Prove that L is a linear transformation and that
its matrix is
*Ex. 11. The transformation Mis defined by
M(x) = S[T(x)],
that is, χ is first transformed under Τ into 7Щ, and Γ(χ) is then transformed
under S into M(x). (M is called the product of S and Τ and may be written as
ST.) Prove that Mis a linear transformation and that its matrix is
(aw+by ax+bz\
cw + dy cx+dzj'
+ dy
All the transformations mentioned so far have transformed points in a
plane into points in the same plane. It is not difficult to generalize the
concept of transformations of points in space to points in space. For example,
the equations , _
define a transformation whose matrix is
Ί 0 0\
0 1 0 .
*0 0-1/
Ex. 12. Give a geometrical interpretation for the transformation Τ whose
matrix is
/1 0 0\
0 1 0.
\0 0 -1/
Exercise 13 {a)
1. The transformation Τ is defined by the equations

MATRICES 1
[13
What is the image of the point P(-1, 3)? What point Q gives rise to the point
G'(3,5)?
2. A transformation is represented by the matrix
(A :S)·
Find the image of the point (-1, 2) and also the point whose image is (14,9).
What points are invariant under this transformation? (A point is invariant
under a transformation if it maps into itself.)
3. What points are invariant under the transformation whose matrix is
(? :?)'
4. A transformation whose matrix is
(if)
is called a shear. Consider its effect upon the square with vertices (0, 0), (1, 0),
(1,1), (0, 1) and suggest a reason for the name.
5. Prove that the shear transformation defined in Question 4 is linear.
6. Describe the following transformation in geometrical terms:
x' = y,
/ = 0.
Draw a sketch to illustrate the transformation; mark in a number of points and
their images.
Prove that the transformation is linear.
7. Answer the same questions as in Question 6 for the transformation
x' = 2y,
y' = 2x.
8. Sketch a diagram to illustrate the effect of the transformation which has matrix
(ίί)
on the four points P(0, 0), β(1, 0), R(l, 1), 5(0, 1).
Determine the area of the image quadrilateral P'Q'R'S'.
What can you say about the particular transformation in which ad-be = 0?
9. Prove that the translation transformation defined by
*' = x,
y' = y+l
is not a linear transformation.
246

3] MATRICES OF LINEAR TRANSFORMATIONS
10. Find the matrix of the linear transformation which maps the points U, J
whose position vectors are given by
-©·-©
into the points U', J' whose position vectors are given by
■■-G-'-C)·
11. Write down the matrix for the shear, S, given by
x' = x + ky,
У' = У,
and the matrix for the reflection R, given by
*' = x,
V = -У-
Two new transformations Γι and T% are defined as follows. To find Тг(Р),
find R(P) = Q\ say, and then find S(Q'); to find Тг(Р), find S(P) = Q", say,
and then find R(Q"). Prove that 7Ί and T2 represent different transformations.
4. THE ALGEBRA OF MATRICES
The results we proved in the last section for the sums, numerical multiples
and products of linear transformations and their associated matrices
suggest that definitions should be given for the sums, numerical multiples
and products of matrices so that, for example, the sum of two matrices
A and W is the matrix of the sum of the two transformations whose matrices
are respectively A and W. In this section we shall give some definitions for
combining and manipulating matrices which are indeed motivated by the
corresponding rules for linear transformations (proved only in the plane,
but holding more generally). In Section 5 we shall show how the definitions
of this section tie up with our previous work on linear transformations.
We begin by defining equality of matrices. Notice that such a definition
is required: it is intuitively fairly clear that
G ?) - Π Э
are different matrices, but are
/0 2\ ./0 2 0\
(5 1) and (5 1 0)
different?
247

MATRICES 1 [13
(i) Equality of matrices.
Since column vectors are 3 χ 1 matrices (in three dimensions) we must
ensure that our rule for the equality of matrices includes as a special case
the rule for equality of vectors.
Two matrices A and В are equal if and only if
(a) they have the same number of rows and the same number of columns;
(b) the i,jth element of A equals the i, jth element of B, for all possible
values of i and).
Example 2. If
\1
then a = -2, b = 4, с = 2, d = 1, e = 0. The two matrices
/2 a 3\/c -2 3\
\l Ob) \d e 4/'
a /0 2 0\
and (5 1 0)
are not equal.
(ii) Addition of matrices.
Two matrices are said to be conformable for addition if they have the
same number of rows and the same number of columns.
By analogy with the corresponding rule for column vectors, we make the
following definition: if two matrices are conformable for addition, A + B
is the matrix whose i, jth element is the sum of the i, jth elements of A andB.
If A and В are not conformable for addition, A+Bis not defined.
Example 3. If /1-3
A = (4 0 3
m
/1+2 -3 + 2 2-3\ /3 -1 -1\
4 + 1 0-3 3 + 4 = 5 -3 7 1.
\l+3 -1-3 2+1/ \4 -4 3/
•B
then A + С is not defined.
(iii) Multiplication of matrices by numbers.
Again working by analogy with column vectors, if A is any matrix and к
is any number, kA is a matrix, conformable for addition with A, whose i,jth
elements is к times the i, jth element of A.
248

4] THE ALGEBRA OF MATRICES
Example 4. If
ЧЫЬЧВ -2Ч:Л)·
We may now define the subtraction of two matrices A and В which are
conformable for addition by the equation A-B = A + (-l)B.
Example 5. With the matrices A and В of Example 3
/-1 -5 5\
A-B =33 -1 .
\-2 2 1/
(iv) Zero matrices.
A zero matrix is a matrix all of whose elements are zero. Provided no
confusion is likely to arise, a zero matrix may be written 0 but note that
there are many different zero matrices; for example,
/0\ /0 0\ /0 0 0\
[θ/ \0 0/ \0 0 0/
are all zero matrices, and are all different,
(v) Multiplication of matrices.
Two matrices A and В are said to be conformable for the product AB if the
number of columns in A is the same as the number of rows in B. We now
define the product AB of two matrices A and В which are conformable for
the product AB as follows: the i,Jth element of AB is obtained from the ith
row of A and the jth column of В by multiplying together the corresponding
elements and adding. If А, В are not conformable for the product AB, then
AB is not defined.
Example 6. If
/1 3 2\ /-1 1 -1\
A = 5 -1 3 and В = 2 0 2 1,
\l 1 2/ \ 0 3 1/
then
/(1) ( -1) + (3) (2) + (2) (0) (1) (1) + (3) (0) + (2) (3)
AB= (5)(-l) + (-l)(2) + (3)(0) (5)(l) + (-l)(0) + (3)(3)
\(1) (-1) + (1) (2)+(2) (0) (1) (1)+(1) (0) + (2)(3)
(l)(-l) + (3)(2) + (2)(l) \
(5)(-l) + (-l)(2) + (3)(l)
(1)(-1)+(1)(2) + (2)(1) /
/ 5 7 7\
= -7 14 -4 .
\ 1 7 3/
249

MATRICES 1
but CA is not defined.
1 0 then AC
-(■! "3
Ex. 13. If
find
(i) A + B;
Ex. 14. If
find
(i) AB;
*Ex. 15. If
verify that
(i) A + B =
(iii) A(BQ =
Hi
(ii) 2A-
A=|
(ii) BA.
*-G-
B + A;
= (AB)C;
Show also that AB φ ΒΑ.
2
0
2
-3B;
I
t
"?
1
В =
^-δ i-
(iii) AB;
■-G 4
-e ?)- --G
(ii) (A + B) + C =
-3
(iv) BA.
i)
?)·
= A + (B+C);
(iv) A(B+C) = AB+AC.
In Ex. 15 above, certain of the basic laws of algebra have been shown to
hold good for the particular matrices А, В, С With the definitions of
equality, addition and multiplication we have given, the following laws
may be shown to hold good for all matrices А, В, С, provided all the sums
and products are defined:
(i) Matrix addition is
(a) Commutative: A + B = B + A;
(b) Associative: A + (B+C) = (A + B) + C.
(ii) Matrix multiplication is
(a) Associative: A(BC) = (AB) C;
φ) Distributive over addition: {*^ Ζ fc++f^
However, matrix multiplication is non-commutative; indeed, two
matrices А, В may be conformable for the product AB and yet BA is
not defined. This is not to say that matrix multiplication is never
commutative.
&..6.,f a-(J J) and ,-(-· J).
verify that AB = BA.
250

4] THE ALGEBRA OF MATRICES
A unit or identity matrix is a square matrix in which the elements of the
leading diagonal (top left to bottom right) are all unity and every other
element is zero. Thus, the 2x2 identity matrix is
U
while the 3 χ 3 identity matrix is
f. ϊ Λ
\0 0 1/
Provided that no confusion is likely to arise and the correct size (2 χ 2,
3x3, ...) is obvious from the context, an identity matrix is denoted by the
letter I.
Ex. 17. Show that, if A is any 2 χ 2 matrix, В is any 3 χ 2 matrix and I is the 2 χ 2
identity matrix, (i) AI = IA = A; (ii) BI = B.
Ex. 18. Show that, if I is the 3 χ 3 identity matrix and A is any matrix with three
columns, AI = A.
5. LINEAR TRANSFORMATIONS
AND THEIR MATRICES (CONTINUED)
The rules for matrix addition and multiplication given above enable us
to obtain a deeper insight into the machinery of linear transformations.
If Ρ has coordinates (x, y) and its image, P', has coordinates (x\ y') then
the position vectors (in column vector form) of Ρ and P' are respectively
If we have a transformation Ρ to P' defined by the matrix
then x, у, х', у' are related by the equations
x' = ax + by,
У = cx + dy
and these may be written, on using the definition of multiplication of
matrices, in the form of the matrix equation
or, more briefly, p' = Ap. (1)
9 ppM 251

MATRICES 1 [13
Furthermore, A always represents a linear transformation, for
A(Ax+/*y) = AAx+/<Ay.
A second linear transformation p' to p" with matrix В has the form
p" = Bp' (2)
and so, combining (1) and (2),
p" = B(Ap)
= (BA)p.
Thus, if the linear transformation whose matrix is A is followed by the
linear transformation whose matrix is B, the combined effect is a linear
transformation whose matrix is BA.
In three-dimensional space, precisely similar results hold. Indeed, if
p, p' are the position vectors (in column vector form) of the point Ρ and its
image P', and if A is a 3 χ 3 matrix, then the linear transformation defined
by A takes the matrix form p' _ др
which is precisely equation (1), although ρ', Α, ρ have different meanings.
Example 7. The linear transformation Ρ to P' of points in three-dimensional
space is defined by the matrix:
-V, i)
Show that, under this transformation,
(i) any point Ρ is mapped into a point lying in a certain plane;
(ii) all points of the line
x-l _y+l _ z-1
1 1 -2
are mapped into the same point.
(iii) Find the set of points that are mapped into the origin.
(i) The transformation is given by the equations
x' = 2y + z,
y' = 3x+y + 2z,
ζ = Ъх-y + z.
But y'-z = x' and so all points map into the plane x-y + z = 0.
(ii) A general point Ρ of the given line has coordinates (Λ + 1, Λ-1,
-2A+2). Thus, the position vector of the image of Pis given by
ii!) (Ш
and so all points of the given line map onto the point (0, 6, 6).
252

5] LINEAR TRANSFORMATIONS (CONTINUED)
(iii) If the image of the point P(x, y, z) is the origin, then x, y, ζ satisfy
the equations , . „ , „
M (a) 2y+ ζ = 0,
(b) 3x+ y + 2z = 0,
(c) 3x- j> + ζ = 0.
Planes (a) and (ft) meet in the line χ = j> = -\z which clearly lies also in
plane (c). Thus, all points with coordinates of the form (A, A, -2A) map
into the origin.
1. If
evaluate
0) A + B;
2. If
evaluate
(i) A+B;
3. If
evaluate
Exercise 13(b)
*-(-Μί)--(-1-5 2).
(ii) A-B; (iii) 2A-B; (iv) 2A+3B.
1-Х 2 1\ /12 1\
A=3 0 3 and В = 1 1 3 ,
\ 1 2-1/ \2 -1 2/
(ii) 2A-3B; (iii) 3A-B; (iv) 4A-2B.
-М^-И'-П-
(i) A-3B + 2C; (ii) 2A(B-Q; (iii) A2-AB + AC.
Verify that (A + B)2 = A2 + AB + BA+B2; can this be put in the simpler form
A2 + 2AB + B2?
4. А, В, С are all 3 x 3 matrices. Remove brackets in the following expressions:
0) A(B-
5. If
-2C); (ii) (A + B)(A-Q; (iii) (A-B-C) (A + B + C).
-H-e-i »)·
find AB and BA.
6. If
find
(i) AB;
(iv) B2;
7. If
/1 0 2\ /3 1 -2\
A= 0 -1 -1 , Β = 1 -1 -2 ,
\2 11/ \l 1 1/
(ii) BA; (iii) A2;
(v) (2A-B)2; (vi) (A + 2B) (A-B).
/2 1 -1\ /0 2 1\
/ 2 1 -1\ / 0 2 1\
= ( 1 1 2 , B= ( 1 3 0 ,
\-l 2 1/ \-2 -1 2/

MATRICES 1
find
(i) AB; (ii) BA; (iii) A2;
(iv) B2; (v) A2-4B2; (vi) (A-2B) (A+2B).
evaluate
(i) AC; (ii) ВС;
Verify that (A+2B) С = АС+2ВС.
3 4\ /-1 0 1\ /
1 -2 , B= 2 1 0 , C =
2 3/ \ 0 2 3/ \-
-Hi
find
(i) AB; (ii) BA; (iii) ABA; (iv) BAB.
10. A matrix Μ is said to be transposed into the matrix M' if the first row of Μ
becomes the first column of M', the second row of Μ becomes the second column
of M', and so on. Write down the transposes of the matrices:
M:
■&т-Ш-
Calculate the matrix products M'M and TM; show also that (TM)' = M'T".
If the elements of Μ are the Cartesian coordinates of a point P, what
information is provided by the element of M'M?
If the matrix Τ describes a transformation of the point Ρ of three-dimensional
space, interpret geometrically the equation:
(TM)' (TM) = M'M,
and find all appropriate values of a, b and с (SMP)
11. Show that under the linear transformation defined by the matrix
= (2 2 l)
\8 0 -1/
0 -1/
any point Ρ in space is mapped into a certain plane, and find the equation of this
plane.
Show further that all points of the line
x-l y-\ _£=3
1 -5 _ 8
are mapped into a certain point, and find the coordinates of this point.
12. Show that, under the linear transformation defined by the matrix
4πί)'
any point Ρ in space is mapped into a certain plane, and find the equation of this
plane.

5]
LINEAR TRANSFORMATIONS (CONTINUED)
Show also that all points of the line
x-1 _y-2=z-l
1 -1 -1
map into a certain point and find the coordinates of this point.
13. The linear transformation Τ of three-dimensional space into itself maps
the point Ρ into the point F and the point Q into the point Q'. The points Ρ and
Q are distinct. Show that
(i) if P' and Q' are distinct, then all points of the line PQ map into points
of the line P'Q'.
(ii) if P' and Q coincide, then all points of the line PQ map into P'.
14. The linear transformation Τ of the plane into itself has matrix
и: -D-
Show that Τ maps the interior of the triangle 0(0, 0), P(2, 0), g(l, 1) into the
interior of another triangle O'FQ' and find the coordinates of the vertices
O', F, Q'.
Prove also that the image of the centroid of the triangle OPQ is the centroid
of the triangle O'F Q'.
Miscellaneous Exercise 13
map a set of parallel lines into another set of parallel lines when ad φ be?
What happens to the transformation defined by A when ad = bcl Is it still
one-one? (A transformation is said to be one-one if each image F arises from a
unique P.)
2. What are the position vectors of the images of the points whose position
vectors are i, j, к under the linear transformation whose matrix, A, is given by
(a-i. аг д3
δι h из
Ci C2 C3
Describe geometrically the linear transformation whose matrix is
(ο ι o).
\0 0 l/
3. Describe the effect of the linear transformations whose matrices are:
<» (! J), <«> U "'); CD (? ?)■
For each transformation determine the set of vectors which are transformed into
the zero vector.
255

MATRICES 1 [13
4. A is the matrix of the transformation that rotates all points in the plane
through an angle α; Β is the matrix of the transformation that rotates all points
in the plane through an angle β. Calculate BA and simplify the resulting matrix.
What result does this illustrate?
5. Can you find a point Ρ with position vector χ which is mapped into itself by
the tranformation whose matrix is A, where
Can you determine any real values of λ which would enable you to find non-zero
vectors χ such that Ax = λχ? If you can, find both λ and the corresponding vectors.
6. Answer the same questions as in Question 5 for the linear transformations
with matrices
ЧП): «(-3 1
7. The transpose of the matrix A, denoted by A', is defined as the matrix whose
z'th row is the same as the /th column of A. For example, if
чм ?)■-(< J i)·
then / 1 1\ /2 4 3\
*'-(-, 4) -»-(, 2 -5).
If С and D are both 2x2 matrices, prove that (CD)' = D'C.
8. A symmetric matrix A is a matrix such that A' = A. If A and В are two
symmetric matrices, find a further condition that A and В must satisfy to ensure
that AB is a symmetric matrix.
9. A square matrix is said to be diagonal if its only non-zero elements lie on the
leading diagonal (that is, top left to bottom right). For example, the following
two matrices are diagonal:
-?)· 6 i ?)■
Prove that, if Db D2 are two 3x3 diagonal matrices, then
(i) DjDa is a diagonal matrix; (ii) D^ = D^;
(iii) Di = Di.
If A is a 3 χ 3 matrix and D any diagonal 3x3 matrix, what с
about A if AD = DA?
10. A linear transformation is given by the matrix
(111)·

5] MISCELLANEOUS EXERCISES 1
Show that
(i) all points are mapped into points on a certain plane;
(ii) all points on the line
x-1 y-\ z-1
1 ~ 3 ~ -5
are mapped into the same point.
11. Let А, В, С be real 2x2 matrices and write
[A,B] = AB-BA, etc.
Prove that
(i) [A,A] = 0;
(ii) [[А, В], С]+ [[В, С], А]+ [[С, А], В] = 0;
(iii) [A, B] = I => [A, B™] = wB™-1 for all positive integers m.
At each step you should state clearly any properties of matrices which you use.
The trace 7>(A), of a matrix
A = К °A
is defined by 7>(A) =
Яц+approve that:
(iv) 7>(A+B) = 7>(A)+7>(B);
(v) ЖАВ) = 7>(BA);
(vi) 7>(I) = 2.
Deduce that there are no matrices satisfying [A, B] = I. Does this in any way
invalidate the statement in (iii)? (M.E.I.)
12. If Μ denotes the matrix /1 0\
and I denotes the matrix /j q\
\0 1/
prove that Μ2 = 3Μ-2Ι.
Prove further that, if η is any positive integer,
M" = (2я-1) М-2(2"^- 1) I. (M.E.I, adapted)
13. Prove that a linear transformation Τ of three-dimensional space into itself
maps the interior of a tetrahedron into the interior of the image tetrahedron.
Prove further that the image of the centroid of the given tetrahedron is the cen-
troid of the image tetrahedron.
257

ι φ Matrices 2
1. THE INVERSE OF A MATRIX
The reader has seen that a definition of multiplication of matrices may be
formulated which has some of the properties of multiplication of real
numbers. Notably, if the matrices are square and of the same order (say
three, for the sake of argument), then the matrix I, where
/1 0 ov
1=01 0 ,
\o 0 1/
has some of the properties of the number 1.
Ex. 1. The linear transformation Γ maps points of the x-axis according to the
rule ,
Ί\χ) = kx.
Interpret Tin the case where к = 1.
Ex. 2. The linear transformation Tmaps points in a plane into points in the same
plane according to the rule
Г(Х) = Ax where A is a 2 χ 2 matrix.
Interpret Τ in the case where
*-n·
Ex. 3. Interpret the matrix /1 0 0\
1=0 1 Ol
\0 0 1/
as the matrix of a linear transformation.
The question thus naturally arises: 'Is there a matrix analogue to the
reciprocal of a number ?'; that is,' given a square matrix A, does there exist
a matrix В such that BA = I?'
We begin our investigation by actually constructing such a matrix В
for a given 2x2 matrix A. Suppose
Hb3)·
We effect the construction in two stages: we first form a matrix Cj such that
258

1] THE INVERSE OF A MATRIX
the product QA has as its first column ι ι,
that CaCQA) = I. By associativity, it follows that CaQ = B.
Since the element au of A is already 1, we have only to assume that QA
makes the element of the second row and first column of the product zero.
Choose
(J?)·
Q =
Next choose
Q =
Thus if we take
= (_* J); then QA =
(J _J); then C2(QA)
В = C,Q = (_2 1) we have BA =
We write В = A-1 and call A-1 the inverse of A.
(Notice that we have not yet justified the use of the definite article' the':
in fact, В is unique, although the intermediate matrices Cj and C2 are not.
Notice also that we have shown that BA = I but not that AB = I although
this latter equation does indeed hold, as we shall soon show.)
In this example, the choice of Q and C2 was determined by trial and
error. Whilst this is easy for 2x2 matrices, in more complicated cases
we shall need a more general method. This is given by considering the so-
called elementary row operations on a matrix.
We shall consider the 3 χ 3 matrix
-- a2l
w
«22 «23 )
«32
although our results will easily be seen to generalize to the nxn case (and,
in particular, to cover the 2 χ 2 case).
(i) The first elementary row operation: the interchange of two rows of Α..
Consider the matrix
0 0\
0 1 ,
1 0/
«12 «1
EjA = \a31 a32 a3
«22 «23/
and the effect of pre-multiplication by Ej has been to interchange row 2 and
row 3 of A. A can be any 3x3 matrix; taking A = I, since Ejl = Ej, Ej is
formed by interchanging row 2 and row 3 of I. If r„ r't represent the z'th
259

MATRICES 2 [14
rows of A and EjA respectively, the effect may be symbolically expressed
by К = rlt r2 = r3, r3 = r2.
Similarly /0 0 1\
E2 = 10 1 0
\l 0 0/
has the effect of interchanging the first and third rows of
A {r[ = r3> r2 = r2> r3 = rv)
Ex. 4. Show that Ei is the matrix of the linear transformation which reflects
points in the plane у = ζ.
Ex. 5. Interpret E2 as the matrix of a linear transformation.
(ii) The second elementary row operation: the multiplication of a row of A
by a non-zero constant с
Consider I с 0 0\
Q = |0 1 0),
\0 0 1/
(ca11 ca^ ca13\
a21 a22 Ой J
a3i a32 a3J
and the effect of pre-multiplication by Q has been to multiply the first row
of A by с (ri = crlt r'2 = r2, r3 = r3). Notice again that Q has been
obtained by performing the required elementary row operation on I. (The
reader may wonder why the restriction с Ф 0 has been imposed; so far as
the result is concerned, this is of course unnecessary, but multiplication of a
row by zero must be specifically excluded when we use elementary row
operations to determine inverses.)
Ex. 6. Interpret Q as the matrix of a linear transformation.
(iii) The third elementary row operation: addition to any one row a
constant multiple с of another row.
Consider /1 0 0\
M! = (0 1 0),
\0 с 1/
/ On an a13 \
MjA = Ι α21 α22 α23 I
\a31 + ca21 a32 + ca22 а^ + са^/
and we have a matrix whose first two rows are those of A, but whose third
row is obtained by adding to each element of the third row of А, с times
the corresponding element of the second row {r[ = rlt r2 = r2,r3 = r3 + cr2).
260

1] THE INVERSE OF A MATRIX
Ex. 7. Interpret Mj as the matrix of a linear transformation.
Matrices such as Ε, Μ and C, which effect elementary row operations
are called elementary matrices. Let us now reconsider the reduction of
to a unit matrix in terms of pre-multiplications by elementary matrices.
We wish to effect the transformations
и; и; ι
The first transformation may be obtained by leaving the first row as it is
and subtracting from the second row twice the first row
0i = rltri = r2-2/j)
that is, by multiplying by the elementary matrix
Operating thus on A we obtain
(J?)·
We now leave the second row as it is and subtract from the first row three
times the second row {r[ = гг - 3r2, r2 = r2); that is, we multiply by
and the reduction is complete.
If we attempt a reduction of any square matrix A to unit form by a
succession of row operations represented by the elementary matrices
Xj... X„, then γ γ α τ
ХИХИ_1 ··. X2XjA = I
and X„X„_!... XuXJ = XnXn.1...X2X1 = A"1.
Thus, in a practical reduction we perform the successive row operations on
two series of matrices, one side starting with A and finishing with I, the
other side starting with I and finishing with A-1 (see Example 1).
Ex. 8. If A is the matrix of a linear transformation T, and if A-1 exists, interpret
A-1 as the matrix of a linear transformation.
Example 1. If A= (-1 2)*
find В such that BA = I.

MATRICES 2
-1 2\
3 4}
-5 10\
0 10/
/ о
г2 + 3Г1 \ 1
( ο ίο)
r2
r2
Φ0
(1
(Ϊ
и
n
1)
s
Thus B= [\ з5 .
\lT7 To/
Notice that, in the third transformation, we multiplied the first row by 5
to avoid introducing fractions for a little longer—an artifice worth
remembering. With practice, the reader will learn to contract some of the
working, though it is best to write it out in full at first. It is always advisible
to check one's working by computing BA.
Ex. 9. Find, using elementary row operations, matrices В such that BA = I in
each of the following cases:
Example 2. If
/1 2 5\
A =
find В such that BA = L
262
m

THE INVERSE OF A MATRIX
/1 2 5\ / 1 0 0\
(2 3 4 I 0 1 0
\l 1 2/ \ 0 0 1/
/1 2 5\
0-1-6 r't = r2-2
\l 1 2/
/1 2 5\ / 1 0
(O -1 -6) -2 1
\0 -1 -3/ ri = rt-rx \-l 0
/1 2 5\ / 1 0 0\
(0 -1 -6) (-2 1 0)
\0 0 3/ r3' = r3-r2 \ 1 -1 1/
/1 0 -7\ ri = r! + 2r2 /-3 2 0\
(θ -1 -6) (-2 1 0)
\0 0 3/ \ 1—1 I/
/1 0 -7\ /-3 2 0\
(O -1 0 r,' = r2 + 2r3 0-1 2
\0 0 3/ \ 1—1 1/
/ι ο o\ ri = ri+ira /-f -i f\
0 -10 0 -1 2
\0 0 3/ \ 1 — 1 1/
/1 0 OV Z-f-i }V
0 1 0) ri =-r2 0 1 -2
\0 0 3/ \ 1 -1 1/
о o\ /-!- -i i\
10 0 1-2
0 l/ ri = ir3 \ i -* i/
■Г:.
1 -2
Ex. 10. Find, using elementary row operations, matrices В such that BA = I in
each of the following cases:
/6 8 5\ /4 7 3^
(i) A= 3 5 3 ; (ii) A= 2 5 2;
\2 3 2/ \5 13 5/

MATRICES 2
(iii) A
/4 3 5V /4 1 IV
= 4 2 3 ; (iv) A= 3 2 3 .
\6 3 5/ \4 3 4/
Having determined the inverses of some specific matrices, we next
consider whether all square matrices A have an inverse. The perceptive
reader may have observed that the process outlined above breaks down if
a complete row of zeros is obtained. In fact, if this occurs at any stage, A has
no inverse. Matrices for which no inverse can be found are called singular
matrices; matrices which have an inverse are called non-singular matrices.
The proof of a necessary and sufficient condition for singularity is deferred
to the next section.
Before leaving the consideration of elementary row operations it should
be pointed out that entirely analogous elementary column operations exist
and post-inverses (that is, matrices С such that AC = I) may be found by
post-multiplication by elementary matrices.
Exercise 14(a)
1. Find matrices В such that BA = I in each of the following cases:
■G3'
\5 4/'
(iii) A =
Ov) A= L _ ; (v) A= '
VA-1 h V
2. Find matrices С such that AC = I for each of the matrices A of Question 1.
3. Show that the matrix /3 2\
1б 4/
is singular.
4. Find matrices В such that BA = I in each of the following cases:
\4 1 4/
/3 4 1\ /3 4 5\ /3 6 2\
) 2 3 1 ; (v) 4 3 11 ; (vi) 1 4 2 .
\3 7 2/ \l 0 3/ \2 4 2/
Verify in each case that AB =
5. Show that the matrix
is singular.
264

1] THE INVERSE OF A MATRIX
6. A matrix of the form
\o ο ι/
leading diagc
is non-singu
■0 ··(■)■
with zeros in every entry below the leading diagonal and 1 as each element of that
diagonal is called an echelon matrix.
Prove that every echelon matrix is non-singular and that, if A is echelon and
BA = I, then В is also echelon.
7. If
find В such that BA = I.
show that the system of equations
(6x + 9y + 8z = 1,
3x + 5y+4z = 3,
2x + 4y + 3z = 2
may be written in the matrix form
Av = w.
Pre-multiply both sides of this equation by В and hence solve the equations.
8. Solve the equations (3x + 2y + 3z = 17,
\lx-3y + 4z = -7,
[2x- y+3z= 1
by the method of Question 7.
9. For what value of a is the matrix
-3 -1
singular?
10. If AX =
i

«ι2\
«22/
e
α
о}
wj
and
к,
u12
z + a21H
Z+iZ22W
o,
1,
MATRICES 2 [14
2. DETERMINANTS
Recall that a square matrix A is non-singular if a (pre- or post-) inverse
may be found; otherwise it is singular. We now establish a simple criterion
for singularity for 2 χ 2 matrices.
Theorem 14.1. If A is the matrix
/«n
W
then A is non-singular о a11a22-a^a^ φ 0.
Proof, (i) Suppose A is non-singular.
A is non-singular => a matrix В =
exists such that BA =
^ {^x + a^y =
[a^x + a^y =
!x(a11a22-a12a21) = a^,
у(а11а22-а12а2г) = -a12,
z(a11a22-a12a2i) = -a21,
w(a11a22-a12a21)= au.
But the matrix A is non-singular and thus Α φ 0, from which it follows
that at least one of the elements of A is non-zero and hence that
«11 «22-«12 «21 * 0·
(ii) Suppose ДцОгг-«i2«2i + 0·
Consider
/ α22 -α12\ /оц α12\ = /aua22-a21a12 0 \
\-«2l «ll/ \«21 «22/ I 0 «11 «22- «21 «12/
then
«11«22-«12«21 * 0 ^ l ( ^ "Μ («" aA = I
ii"22 12 21-r a^a^-a^a2X\-a^ au) \a21 a22J
/ «22 -«i2\
«11 «22-«12 «21 I-«21 «ll/
=> A is non-singular.
The quantity a11a22—a12a21 is called the determinant of A and is written
Ι α" αΐ2 I or, more shortly, det A (or |A|).
where В =

2] DETERMINANTS
Thus the result of Theorem 14.1 may be stated as:
Ά is a non-singular 2x2 matrix -o- det Α φ Ο'.
If the elements of A are real numbers then det A is a real number and
we have a mapping A -> det A from the set of all 2 χ 2 matrices with real
elements to the set of real numbers. Similarly, we have mappings A -> det A
from the set of all 2 χ 2 matrices with rational/integral elements to the set
of all rationals/integers.
The rale for determining the value of det A must be carefully memorized:
products of the elements are formed diagonally (the leading diagonal first)
and then subtracted. Thus
Il 21 =_2. I 2 _11 =_7. I cosa sina| = ι
I 3 4 I I - 3 - 2 I ' I - sin a cos a|
Ex. 11. Evaluate:
I 3 1 I I 1 -2 I I 1 -2 Ι
(i)|o i|; co|2 t|; cd|3 ,[.
The construction of the determinant of a 2 χ 2 matrix has thus given us a
test for singularity. We now define determinants of higher orders; although
the definition will no doubt appear complicated, the motivation for their
introduction should be clear—we shall use them to extend the result of
Theorem 14.1.
If A is the 3 χ 3 matrix /au a12 a13\
I «21 «22 «23).
\«31 «32
then the determinant of A, written
I «11 «12
«21 «22 «23 |
I «31 «32
is defined by
. . A I «22 «23 I | «21 «23 L I «21 «22 |
det A = an\ —«12 +«13
I «32 «33 I I «31 «33 I I «31 «32 I
= Дц «22 «33 - «11 «23 «32 - «12 «21 «33 + «12 «23 «31 + «13 «21 «32 - «13 «22 «31 ·
It is simply verified by direct calculation that det A may be expressed in
the two alternative forms
, . A I «12 «13 I . I «11 «13 I „ I «11 «12 I
det A = —α2ι Ι+α22 —«23 >
I «32 «33 I I «31 «33 I ! «31 «32 I
or, more briefly, det A (or |A|)

MATRICES 2
«12 «13
I «22 «23 I
«11 «12
I «21 «22 I
These three forms give the expansion of the determinant by rows. The
determinant may also be expanded by columns; for example, expanding
by the first column
det A = Оц
«22 «23
-«21
«12 «13 L «12 «13
+ «31
«32 «33 I «22 «23 I
I «32 «33
The term I а& а2з Ι
I «32 «33 I
is called the minor of au; similarly, the minor of a12 is
I «21 «23 I
I «31 «33 I
(see expansion by first row) while the minor of a32 is
I «11 «13 I
I «21 «23 I
(from the expansion by the third row). To find the minor of any term,
write down the determinant and cross-out the row and column containing
the term under consideration; the required minor is the 2 χ 2 determinant
that remains. See Figure 14.1, where the minor of a23 is being sought: it is
«11 «12
ОГ («11^32-«12«3l)·
«u a13 a13
• —аа_—ашг-.а^-
To find the sign to be attached to each minor in the expansion of a
determinant, draw the chess-board pattern, as shown in Figure 14.2,
starting with + along the leading diagonal. Thus, if we were expanding by

2] DETERMINANTS
the second row, or by the third column, we should include the term
| «11 «12 I
~«23
I «31 «32 I
A minor, together with its correct sign as given in Figure 14.2, is called a
cofactor, the cofactor of the term ai} being written At}. Thus
«11 «13
= «ii«33-«i3«3i;
«11 «13
= «13 «21 "«11 «23·
I «21 «23 I
In terms of cofactors, the expansion of the determinant may be written
det A = a11A11+a12A12 + a13A13 (first row),
or det A = α12Λ12+«22Λ2+ «32^32 (second column), etc.
Determinants of orders 4, 5, 6, ... may be defined similarly. Thus, for
the 4 χ 4 determinant ,
«11 «12 «13 «14
«21 «22 «23 «24
«31 «32 «33 «34
«41
«43 aM
each cofactor is a 3 χ 3 determinant, obtained by rejecting the row and
column in which the corresponding element stands and attaching the
required sign, obtained from the chess-board pattern, to the remaining
determinant. Thus
I «11 «13 «14 I I «11 «12 «14 I
A22 = α31 α33 a34 ; Лз = - «21 «22 «24 ·
I «41 «43 «44 I I «31 «32 «34 I
The expansion of this determinant by the third column is
«13^13+ «23 Лз + «33^33+ «43^43·
Ex. 12. Evaluate the determinants:
-1 1
1 1

MATRICES 2
3. PROPERTIES OF DETERMINANTS
We shall now develop some simple properties of determinants, restricting
ourselves to the 3 χ 3 case for simplicity, although the results and proofs
all extend to η χ η determinants.
First observe that the interchange of rows and columns does not affect
the value of a determinant: for we may expand in the first case by the first
row and in the second case by the first column and, in either case, the
corresponding cofactors remain the same. Thus, any property possessed
by the rows of a determinant is possessed equally well by the columns.
For brevity, we shall denote the determinant
I «ii «12 «13 I
«21 «22 «23 Ьу Δ.
I «31 «32 «33 I
Property 1.
I Оц + x (h2+y a13 + z
«21
«22
«23
For
«11 «12 «13
«21 «22 «23
+ «21 «22 «23
«31 «32 «33 I I «31 «32
= (an + x) An + (a12 + y) A12 + (a13 + z) A13 = R.H.S.
Property 2. If two rows (or columns) are identical, det A = 0.
For, if two rows are the same, expanding by the third row the
corresponding cofactors are all zero.
Property 3. If an elementary row operation is performed on A, the
determinant of the new matrix is equal to the product of det A and the determinant
of the corresponding elementary matrix.
Case (i): the interchange of two rows (columns).
If, for example, we interchange rows 2 and 3, on expansion by row 1 the
cofactors all change sign but remain the same in magnitude; but
II 0 0 1
0 0 1 =-1.
Ι ο ι о I
The result may similarly be verified for any other interchange of two rows
(columns).
Case (ii): the multiplication of a row (column) by a non-zero constant.
сОц ca12 ca13
«21 «22 «23
= с det A,
by expanding by the first row.
270

PROPERTIES OF DETERMINANTS
But the corresponding elementary matrix is
Ic 0 0\
0 1 0
\o ο ι/
с
0
0
0
1
0
0
0
1
Case (iii): the addition to any one row (column) a constant multiple of
another row (column).
Suppose, for example, that we add to row 1 с times row 2
On
(къ + саы a^ + ca^
<h\ (hs, <hi
= det A + c a21 <къ «23
I a31 a32 a33 I
by properties 1 and 3 (case (ii))
= det A, by property 2.
But the corresponding elementary matrix is
\o 0 1/
0 10
0 0 1
= 1,
on expanding by the first column.
Other possibilities are dealt with in a precisely similar manner.
Property 3 may clearly be extended to a sequence of elementary row
(column) operations on A.
Property 4. det (AB) = det A det B.
Write
then
detAB
I OilKl+ 0·\Φ·ί\ + ^13^3
= аяЬц + α22 b21 + а2з b3.
Ι а31йи + а32*21 + азз*г
= ^ώ,
ап b12 + a12 έ22+α13 b32 an b13 + a^b^ + α13 b33
02! b12 + α22 b22 + α23 b32 α21 b13 + α^2 Ъ23 + %, b33
α31 b12 + α32 έ22 + Озз b32 a^ b13 + ^ b23 + a33 b33
But this may be expressed, by an extension of property 1, as the sum of
twenty-seven determinants, twenty-one of which are zero. (For example,
one of the vanishing determinants would be
I <hibu (hib-a, (hbK
<hsb33
-- bnb12b3i
<h\ an <ha
<hl «21 ^23
= 0 by property 2.)
a21bu a21bn
I ^31*11 a31b12
We are left with six determinants which do not have two identical columns.
However, using properties 1 and 3 we get
detAB = det A(bnb22b33 — b11b23b32 + b12b23b31 — b12b21b33 + b13b21b32
-b13b22b31) = det A det B.
271

MATRICES 2
[14
One simple but useful result that follows immediately from property 2
can now be derived. It will be recalled that a11A11 + a12A12 + a13A13 = det A
and similarly for other rows and columns. Now suppose we multiply each
element of some row (or column) by the cofactors of the corresponding
elements of another row (or column); for example
Now
a11A21 + a12A22 + α^Αχ,
anA21 + a12A22 + a13A23
«ιι <hs an
«31 «32 «33
I «11 «12 «13 I
A similar result holds for any other combination of rows (or columns).
Such an expansion is called an expansion by alien cofactors and we have
demonstrated that expansion by alien cofactors gives the value zero.
Example 3. Evaluate the determinant
18 4 12
9 3 3
I 5 15 10
12 13
= 4.3.5 3 1 1
I 1 3 2
18 4 12
9 3 3
5 15 10
(removing factors)
1
= (-60)
= 1020.
3 4 |
(-17)
(expanding by second
row)
Example 4. Express the determinant
as the product of factors.
272

3] PROPERTIES OF DETERMINANTS
If we set b = c, the given determinant is zero (two columns identical) and
so (b-c) must be a factor. By symmetry, (c-a) and (a-b) must also be
factors. But the expansion of the determinant clearly gives an expression
of degree 4, in a, b, с and so the remaining factor must be linear and
symmetrical, i.e. (a + b + c).
-- X(b-c) (c-a) (a-b) (a + b + c).
By comparing the term in ab3 on both sides, the value of λ is clearly 1.
2 1 -
1
4 2 1
-5 -1 -2
8 13 3
1 14 8
19 55 22
Ex. 13. Evaluate the following determinants:
16 15
With the properties of determinants that we have just proved we are
almost in a position to extend our theorem on the necessary and sufficient
condition for a matrix to be non-singular to matrices of order higher than
two. However, before we do so, we shall introduce a further concept, the
adjugate (or adjoint) of a matrix, since there is an intimate connection
between the adjugate and inverse of a non-singular matrix.
Given any square matrix A, where
(an a12
Chi «22
«31 «32
the adjugate (or adjoint) of A, written adj A, is defined by
adj A =
- u,
Us
that is, the adjugate is obtained by substituting for each element its co-
factor and transposing (see Miscellaneous Exercise 13, Question 7).
Theorem 14.2. For any square matrix A
A (adj A) = (adj A) A = (det A) I.

MATRICES 2
[14
Proof. (3x3 matrices)
Ml Ou Ou\ /Λΐ Λΐ Λΐ\ /Δ 0 0\
A(adjA)= α21 α22 αΔ Iai2 Λ22 AsA = 0 Δ 0 I,
\α31 α32 α33/ \Α13 Λ23 A J \0 0 Δ/
since each term in the product is either an expansion of the determinant,
or an expansion by alien cofactors;
= ΔΙ, where Δ = det A.
A precisely similar result holds for (adj A) A.
Corollary. If det Α φ 0, det (adj A) = (det A)2 (3x3 matrix).
Proof. Take determinants of both sides of A (adj A) = (det A) I and
divide both sides by det A.
It can be shown that, in fact,
det A = 0 => det (adj A) = 0
and so the restriction that det A must be non-zero may be dropped.
If det Α φ 0 we have now obtained an inverse for A, namely
adj A . adj A T
-j-^-г, since A.-j-^-r = I.
det A det A
We note further that multiplication of this inverse with A is commutative.
To obtain a complete picture about inverses we need to clear-up two final
points:
(i) Can a matrix A have an inverse even if det A = 0?
(ii) If det Α φ 0, can we find an inverse of A other than adj A/det A?
Theorem 14.3. If A is any 3x3 matrix, we have:
the matrix A is non-singular о det Α φ 0.
Proof (ϊ). To prove the implication => we show that det A = 0 => A is
singular (see Chapter 9).
det A = 0 ^ for any matrix B, det (AB) = det A det В = 0
*AB + I, since det I = 1
=> A is singular;
(ii) det Α Φ 0 ^ §^4 exists
det A
=> A is non-singular, by Theorem 14.2.
274

3] PROPERTIES OF DETERMINANTS
Theorem 14.4.
A-i _ adJ A
A ~ det A
is the unique inverse of the non-singular matrix A.
Proof. Suppose L is a left inverse other than A-1; that is, suppose that
LA = I, L Φ A"1. Then
L = LI = ЦАА-1) = (LA) A"1 = IA"1 = A"1,
which contradicts the assumption thatL Φ A-1. Similarly for right inverses.
Exercise 14(b)
1. Evaluate the following determinants:
«I!! I; oo I _f -T N
Oii)
Ι 6 5 Ι
(iv)
2. Evaluate the following determinants:
(·)
(iv)
(νϋ)
(x)
-4 7
3 1
5 -9 -
4 3 1
3 4 2
10 8 3
1 3 5
2 7 1
5 16 19
112 129 1
67 78
99 114
4
2
-4
04
62
92
7 17 16
13 33 30
10 21 23
3. Determine which of the following matrices are non-singular:
(9 6 10\
(iii) 17 9 29
/10 7
(ii) 23 14
\4 1
/-1 ί
(iv) 3 1
\ 11 -1

MATRICES 2 [14
4. Find the adjugate of each of the following matrices and, in each case, evaluate
the product of the matrix with its adjugate. Write down the determinant of each
matrix.
f ' 5):
\5 13 9/
/1 3 IV
) 3 10 5 ;
\l 4 5/
'4 11 5\
1 4 21.
1 2 1/
5. Find the inverses of the following matrices by the adjugate-determinant
method:
(п>Ш
6. Find the values of a which make the following matrices singular:
/1 3 7\ / 7 13 1\ / 1 -3
(i) 4 2 5 ; (ii) -1 a 3 ; (iii) -2 6
\5 5 β/ \ 3 7 2/ \ 4-11
7. Factorize the following determinants:
( ' - l}
\-l 4 0/
) (2 6
\l 4
/1 2 7\ /2 0 0\
= (0 1 2j, B= jl 1 01.
\l 2 8/ \0 2-1/
Solve for X the matrix equation
4 4\
8 8/
/0 2
B= 1 1
\3 -2

PROPERTIES OF DETERMINANTS
Solve for X the matrix equation
AX-B =
/2 1 -4\ / -4 2 1\
= (1 1 3), B= 11 -5 -3l.
\0 2 -1/ \-16 7 5/
Miscellaneous Exercise 14
-1 if \ 0 1/ \0 1/ \1 1/
Κι)·
Deduce that the rows of a 2x2 matrix can be interchanged by operations
which add multiples of one row of the matrix to the other, together with
operations changing the sign of a row.
Find a 2 χ 2 matrix X such that
la b\ v (b a\
2. Do 2 χ 2 matrices А, В with integer entries, exist such that
(а) AB = 0, ΒΑ Φ 0?
(б) AB = ΒΑ = 0, Α Φ 0, Β φ 0, Α Φ Β?
(с) AB = ΒΑ Φ 0, Α, Β φ Ι, ΑΦ Β?
In each case, if your answer is 'yes', justify it by giving examples of suitable
Α, Β. (Μ.Ε.Ι.)
so that the transpose a' is given by
·(:)
write down the 3 χ 3 matrix aa' and prove that the determinant of this matrix
has value zero.
Obtain the analogous result when
(a 0\
(SMP)
■B
4. If A is any 3x3 matrix and Pisa non-singular 3x3 matrix, prove that
det(A-Al) = det(B-AI), where Β = Ρ1 ΑΡ.
277

MATRICES 2
[14
5. S is a skew-symmetric matrix if S + S' = 0; A is an orthogonal matrix if
AA' = I. If S is a given skew-symmetric matrix and A, B given orthogonal
matrices, prove the following results:
(i) det A = ± 1;
(ii) AB is orthogonal;
(iii) (I-S) α + S)-1 is orthogonal.
6. If
prove by induction that
Prove also that the nth power of the matrix
»_/i ь\ ., /i
Do these results hold if η = -
7. Prove that
I b2c2 + a2d2 bc+ad 1 I
c2a2 + b2d2 ca+bd 1
I a2b2+c?d2 ab + cd 1 |
Also prove that
= (b-c) (c-β) (fl-b) (.a-d)(b-d)(c-d).
= (aj-x) (я2-*) (Дз-л) (я4-*) (l + * Σ -
■(Hi)
and det МФО, prove that, if λ is a root of the equation
Ι Α + λ Η
where A is the cofactor of a, etc., then det Μ/λ is a root of the equation
(x+a)(x+b) = h2. (O&C)
9. Factorize the determinant
Ι 1 + я2 + я4 1 + αί> + α2ί>2 l + ac+aV I
\l + ab + a2b2 l+62 + 64 1 + Z>c+Z>2c2 . (О & Q
I l + ac+a*c2 l + bc + b2c2 l + crHc4 |
278

10. Prove that
yz
(y+z-x
11. If
MISCELLANEOUS EXERCISE 14
zx xy = - 16xyz(y-z) (z-x) (x-y).
(z + x-yY (x+y-z)1 I (O&C)
0 0 rf \
/0 6 0
1 c / ,
/
where a, b, ...,/arerealanda > b > c, prove that the equation
det(A-*I) = 0
has real roots. (O&C adapted)
12. If A and В are square matrices such that AB = 0, prove that either A = 0
or В = 0 or det A = det В = 0.
13. Two matrices A and В are said to commute if AB = BA. Let
prove that every matrix В that commutes with A can be expressed in the form
В = AA+/tI, where A and μ are scalars and I is the unit matrix. Obtain
expressions of this form for A2 and A"1. (M.T.)
14. Prove that, if a is a 3 χ 3 matrix, with adj a = A and det a = a, that
aA = Aa = αϊ.
Assuming that α Φ 0, prove that:
(i) det A = a2;
(ii) X = A is the only solution of the matrix equation aX = αϊ;
Oii) adj A = aa.
Which, if any, of the above results, are still valid if the restriction on α is
removed ?
15. If A is a 3 χ 3 matrix with αη φ 0, and if the cofactor of every element of
A is zero (that is, if adj A = 0), prove that A has the form
(an а1г а13 \
kan ka12 ka13 J.
lau 1агг laj
16. What is meant by the statement that multiplication of real numbers is
associative and is distributive over addition?
Prove that, if я,,·, b{j cw (/,/' = 1, 2, 3) are real numbers, then
3/3 \ 3 /3
Σ Σ aiabap\ cPj = Σ aia Ι Σ *α/,<
β = \ \α = 1 / α = 1 V = l

MATRICES 2 [14
Deduce that multiplication of 3 χ 3 real matrices is associative and that, if A
is a 3x3 real matrix, then the notation A3, A4, As,... may be interpreted
unambiguously.
If the 3 χ 3 real matrices В and A2 +A+I are non-singular (I is the identity
matrix) why would the notation A/B be ambiguous, but
17. The non-singular matrix В has the property BB' = B'B, where B' is the
transpose of B. Prove that B'B"1 = B-'B'. Prove also that, if С = B-'B', then
CC is the identity matrix.
■H i i)

15. Linear equations
1. LINEAR EQUATIONS IN TWO UNKNOWNS;
INTRODUCTION
Given perpendicular coordinate axes Ox, Oy in a plane, any linear
equation connecting the two variables χ and у is the equation of some straight
line in the plane; that is, the linear equation defines the set of points
(x, y) comprising the line. Two such sets of points will have as their
intersection a single point, in general, whose coordinates may be obtained by
solving the equations simultaneously, or, in other words, by requiring the
defining properties of the sets of points comprising the two lines to hold
simultaneously.
For example, the pair of straight lines
(2x + 5y= 1,
1 x-3y = 6
intersect at the point (3, -1), a result derived by solving the simultaneous
equations in the usual way.
Consider now the pair of straight lines
(2x + 3y = 1,
(2x + 3y = 2.
It is immediately obvious algebraically that these equations have no
solutions. Geometrically, they represent parallel straight lines, with no
finite point of intersection.
Again, consider the 'pair' of straight lines
| x + 2y = 1,
(2x + 4y = 2.
The second equation is a thinly disguised re-write of the first so any point of
the single line represented has coordinates satisfying the equation. In fact,
setting у = A and solving the first equation for x, we see that (1-2A, A)
is a solution, for all values of A. Since we may assign values of the single
parameter A arbitrarily, our equations are said to have one degree of
freedom (or to be a one parameter system). Geometrically, we have a pair of
coincident straight lines.
The reader may feel this last case somewhat frivolous, and its extensive
281

LINEAR EQUATIONS [15
discussion pedantic. However, if the coefficients are unknown, this
possibility must be remembered.
To summarize; given two equations in two unknowns, one of three
possibilities occurs:
(i) The equations have a unique solution. (The two lines are distinct
and intersect.)
(ii) The equations have no solution. (The two lines are distinct and
parallel.)
(iii) The equations have one degree of freedom. (The two lines coincide.)
Example 1. Solve the equations
[х+У = 2,
\ax + 2y = b.
Eliminating v, we have ,„ „ ,
b·^ (2-a)x = 4-b.
Case (i). If α Φ 2, we have the unique solution
4-b b-2a
X = 2^a' У = ^а-·
Case (ii). If a = 2, we have 0.x = 4-b.
Sub-case (iia). If b Φ 4, no solution for χ exists.
Sub-case (ii*). If b = 4, χ may take any value, у then being determined
from the equation x+y = 2.
Thus, the complete solution of the equations takes the form
4-b b-2a
αΦ2:χ = ϊ-α, У=^-а,
a = 2, b Φ 4: no solution;
a = 2, b = 4: one-parameter solutions (λ, 2 —A).
The reader would be wise to consider the meaning of this solution
geometrically.
The reader is probably so accustomed to solving pairs of simultaneous
equations that it may not have occurred to him to ask what happens if
he is given three (or more) equations in two unknowns. From geometrical
considerations it should be fairly clear that, in general, no values of χ
and у satisfy all three equations simultaneously. However, consider the
system ( x + 2y = 5,
3x- y=l,
{ x-5y= -9.
Solving the first two equations we get χ = 1, у = 2 and these values cer-
282

11 TWO UNKNOWNS; INTRODUCTION
tainly do satisfy the third equation. Geometrically, these three equations
represent three straight lines through the point (1, 2) (see Figure 15.1).
Ъх-у = 1 *
Fig. 15.1
It will be recalled that, if
^x+bjy + C! = 0, 02X + b2y + c2 = 0
are two non-parallel straight lines, then any line through their point of
intersection has the form
μ(α1χ + ό1γ + α1) + λ(α2χ+^γ + €2) = 0.
Thus, in the particular case we are considering, any straight line through
the point (1, 2) has an equation of the form
/4х + 2у-5)+ЦЗх-у-1) = 0.
Setting λ/μ = - \ we obtain the third equation. This example illustrates the
general result, that, if three equations in two unknowns have a solution,
then any one equation is a linear combination of the other two (see
Chapter 2.4).
2. LINEAR EQUATIONS IN TWO UNKNOWNS;
CONTINUED
We consider pairs of equations of the form
ioax + аау = Ьъ)
{a^x + a^y = bj
Writing
the equations (1) may be rewritten in the matrix form
Ax = b. (2)
IO PPM 283

LINEAR EQUATIONS [15
If we regard A as the matrix of a linear transformation T, equation (2)
tells us that the image of the point P, with position vector I I, is the
of simultaneous equations may be reinterpreted as the problem of finding
the point Ρ whose image, under a given linear transformation, is a given
point B.
If A is non-singular, that is, if det Α φ 0, we have, for any point B,
Ax = b
ο Α1 Αχ = A-Jb
ο χ = A-Jb
and we see that the unique point P, whose position vector is A_1b, has
image В under the linear transformation T.
Ex. 1. Show that A-1 is the matrix of a linear transformation S.
[You must show that, for every point Q with position vector y, the image S(y)
exists and is unique and that 5(Ay+μζ) = A5(y) +μδ(ζ).] The linear
transformation S so defined is called the inverse of the transformation T; S and Γ satisfy the
relations S[T(x)] = x; T[S(y)l = у for all χ, у.
As an example of the solution of equations whose matrix, A, is non-
singular, consider
Write
then det A =
and we have
\3x
19 Φ 0 and
A-1 =■
G
-Ю-
~0-
-e>-
-2y =
+ 5y-
D·
■0
■H
Η
= 4,
= 1.
b =
5 2
2 3
5
-2
-D-
0
)
I)
The solution may be stated in the form χ = ff, у = -γ-9- or, in terms of the
linear transformation Twhose matrix is A, the unique point (y§, -Д) maps
into (4, 1).

2]
TWO UNKNOWNS; CONTINUED
Ex. 2. Solve the equations Ъх-2у = 4,
2х + Ъу = 1
by using the inverse matrix method.
Suppose now that det A = 0. Before discussing the existence of a
point Ρ whose image is В under the linear transformation T, consider the
effect of Τ upon the unit square OIMJ. For a general transformation
(0 00
Fig. 15.2
(det Αφ 0) ΟΓ Μ'J'is a parallelogram whose area is det A (see Chapter 13)—
depicted in Figure 15.2, but, if det A = 0, the parallelogram collapses and
O, I', J', M' are collinear as shown in Figure 15.3.
*Ex. 3. If det A = 0, show that every point Ρ of the plane maps into a point of
the line ΟΓ.
Since, in this latter case, Τ maps every point Ρ into a point of the line
ΟΓ, a point В not lying on the line cannot be the image of any point under
T: the equations Ax = b are inconsistent.
However, if В does lie on ОГ, we may write the position vectors of/',
/', В as r, тт, ст respectively; that is
Aj = r, Ai = тт, b = ст.
Then, P(x, y) is a point which maps into В
о ст = A(xi+yj)
о ст = x(Ai)+XAj)
о ст = (mx+y)i
о Ρ lies in the straight line y + mx = c,
and infinitely many points map into B, by the reversibility of the argument.
Ю-2 285

LINEAR EQUATIONS [15
To summarize for the case det A = 0:
(i) the equations Ax = b have no solution (are inconsistent; equations
with one or more solutions are said to be consistent).
о there is no point which maps into В under Τ
о В does not lie on the line ОГ;
(ii) the equations Ax = b have infinitely many solutions
о there are infinitely many points Ρ which map into В under Τ
о В does lies on the line ОГ.
Ex. 4. Show that the equation ax = b is uniquely soluble only if α φ 0, in which
case, the unique solution is χ = ar^b.
Discuss the case a = 0, showing that failure to obtain a unique solution can
occur in two different ways.
Ex. 5. What can you say about the equation Ax = b if T(i) = T(J) = 0? (Γ is
the linear transformation with matrix A.)
Exercise 15(a)
1. Solve, using the inverse matrix, the following pairs of simultaneous equations:
\Ъх- у = 1, \Ъх + 2у = 1, j70x+105y = 12,
(0 1 (и) 1 (ш) 1
\2х + Ъу = 19; l2x + 7y = 29; I 5x- 7y = 5.
(i) ax + a2 = Ьг-Ьх; (ii) ax + b = bx + c.
3. Solve for x, у the following pairs of equations, commenting upon any special
cases that arise for particular values of the coefficients a:
. (Ъх+ау =-2, ί x + 2y = 4, .... (3x+ ay = 1,
w \3x + 2y = 3; K } \2x + ay = 2a; ^ } W+12.y = 2.
4. Solve for x, у the following pairs of equations, commenting upon any special
cases that arise for particular values of the coefficients a, b;
( ax + by = 2, ίαχ + Зу = 6; ... iax + Zy = 1,
1 \l2x-4y = a + Z>; W \ x-by = 2; wy \bx + ay =-1.
5. Discuss the solution of the following simultaneous equations for various
values of α: ,
x+ y = h
\2x + ay = 5,
U*+ у =-3.
6. Discuss the solution of the following simultaneous equations for various
values of the coefficient a:
(ax + 3y =
fr
6,
2y = a.

2] TWO UNKNOWNS; CONTINUED
7. Discuss fully the solution of the following simultaneous equations for the
unknownsx,y: , , .
J ( ax+ by = с,
\агх+Ьгу = с2.
8. Under what conditions do the homogeneous equations
(a^x+b^y = 0,
\a2x+b2y = 0
have a non-trivial solution (that is, a solution other than x, у = 0) ? Solve the
equations fully in the case where non-trivial solutions exist.
9. Solve for x, у the equations:
f χ cos θ +y cos φ = cos a,
\ χ sin θ+y sin 0 = sin a.
10. Sketch the set, E, of points in the plane whose coordinates satisfy the
inequality x+y > 0. The linear transformation, T, has matrix A where
Sketch the image set T(E).
11. Γ is a linear transformation whose matrix is
■6 :!)·
If Ε is the set of points whose coordinates satisfy the inequality у > 0, sketch the
image set T{E).
3. SYSTEMS OF LINEAR EQUATIONS IN THREE
UNKNOWNS; INTRODUCTION
Given perpendicular axes Ox, Oy, Oz in space, a linear equation
connecting the three variables x, y, ζ is the equation of some plane. Two linear
equations represent two planes which may be parallel (same unit normal
vector) but otherwise intersect in a straight line. In general,
three planes intersect in a unique point and thus, in general,
three equations in three unknowns possess a unique solution;
see Figure 15.4.
For example, the three planes
x+ y+ ζ = 0,
x + 2y + 3z =-3,
x+ y- z = 4, Fig· 15.4
have in common the single point (1, 1, —2)—a result readily derived by
successively eliminating two of the variables.

LINEAR EQUATIONS [15
However, just as in the case of two lines, complications may occur.
Consider the equations , ,
[ x+ У + Зг = 1,
| x-2y+ z= 1,
I2x- y + 4z = 2.
If we write
Lx = x+y+3z-l, L2 = x-2y + z-l, L2 ξ 2x-y + 4z-2,
then it is readily seen that L3 = Lj+Z^; thus, these three equations
represent three planes through a common line (see page 62) and so
possess infinitely many solutions (one degree of
freedom); see Figure 15.5.
Again, the system
(2x+y + 3z = 6,
x+y + 4z = 1,
x- z= 1,
has no solution, for by subtracting the first two equations we obtain
x-z = 5, which is inconsistent with the third equation. Geometrically,
x — z = 5, which is a plane through the line of intersection of the two
planes 2x+y+3z = 6 and x+y + 4z = 1,
is parallel to the third plane x-z = 1 and thus the line of intersection of
the first two planes is parallel to the third plane. A similar result would
hold whatever line of intersection was chosen and so the three planes form
a triangular prism of uniform cross-section; see Figure 15.6.
Fig. 15.5
Fig. 15.6 Fig. 15.7
Another possibility is illustrated by the system
f x+ y+ ζ = 1,
\2x + 2y + 2z= 3,
I x+ y + 3z= 1,
which clearly has no solution, the first pair of equations being inconsistent.
Geometrically, the first two planes are parallel to one another but not to
the third plane; see Figure 15.7.

3] THREE UNKNOWNS; INTRODUCTION
Again, we may have equations in the form:
(x+y + z = 1,
jx+j + z = 2,
Ух+y + z = 3.
The equations are manifestly inconsistent and thus have no solution.
Geometrically, they represent three parallel planes.
Finally, we may have two or three equations which reduce to the same
equation, in which case we have one or two degrees of
freedom—geometrically, two or three coincident planes. (Or, possibly, two coincident
planes and the other plane parallel—giving no solution.)
To summarize: given three equations in three unknowns one of four
possibilities may occur:
(i) The equations have a unique solution (no two of the planes are
parallel and the planes do not have a common line of intersection).
(ii) The equations have no solution (either at least two of the planes are
parallel or the three lines of intersection are parallel but not coincident or
two of the planes coincide and the third is parallel).
(iii) The equations have a one-parameter solution (the planes have a
common line of intersection).
(iv) The equations have a two-parameter solution (the planes all
coincide).
The situation is more complicated than in two dimensions, as was to be
expected but, if the reader keeps the various geometrical possibilities in
his mind, he should avoid confusion.
Example 2. Solve
(1)-(2) gives
(1) - (3) gives ■
If b Φ 5, z =
and substitution
χ =
1 completely the equations
(Ъх+у+ ζ = a, (1)
\2x + y- ζ = 4, (2)
[5x+y + bz = 1. (3)
x +2z = a-4,
-2x + (l-b)z = a-l,
(5-b)z= 3(a-3).
3(a-3)
5-6
back gives
-a-2-ab + 4b _ l5 + 5a-12b + 2ab
5-
corresponding to the case of unique solution.
If b = 5, α φ 3 there is no finite solution for ζ—we have a triangular
prism of planes (clearly no two of the planes are parallel).

LINEAR EQUATIONS [15
If b = 5, a = 3 the equations are consistent: putting χ = λ in (1) and
(2) we have 2y = 7-5A, 2z = - l-λ and the line of intersection of the
first two planes is , ,
χ _y-j z + Jr _
2~ -5~ -\~ μ·
By substitution, the general point (2μ, \ — 5μ, -\-μ) of this line lies on
plane (3) for all μ. Since no pair of the planes coincide, we have three
planes with a common line of intersection and the general solution may be
taken as „ - , ,
χ = 2μ, у = ϊ-5μ, ζ = -\-μ
(or any comparable form).
If we are given four equations in three unknowns, we have, geometrically
four planes and these will not, in general, have a common point. However,
it is not impossible for four equations in three unknowns to be consistent;
consider, for example, the four planes given by the equations
x+ y+ ζ = 0,
2x- 3y+ ζ =-3,
x+ 3y+ ζ = 2,
3x+Uy+4z = 9,
all of which contain the point (1, 1, - 2). In practice, the best way of
tackling such a system is to solve three of the equations and check whether or
not the fourth equation is satisfied.
Ex. 6. Show that the system of equations
Г x+ y+ ζ = 2,
-5y+ z=-l,
j x+2y + 3z =-1,
[3x-3y + 2z = 2
is consistent.
4. SYSTEMS OF LINEAR EQUATIONS IN
THREE UNKNOWNS; CONTINUED
As in the case of two unknowns, we may write the system
саих + а12у+а1Ъг = Ьгл
la^x + ^y+a^z = ьЛ (З)
\аЪ1х+аъгу+аъъг = b3,l
in the matrix form Ax = b. (4)
290

4] THREE UNKNOWNS; CONTINUED
Regarding A as the matrix of a linear transformation T, equation (4)
tells us that the image of the point P, whose position vector is
(i)·
is the point B, whose position vector is
(i)
Again, the problem of solving a set of simultaneous equations may be
interpreted as the problem of finding the point Ρ whose image, under a
given linear transformation, is a given point B.
If det Α φ 0 we have, as in section 2,
Ax = b
о A-JAx = A-4>
ο χ = Α-4>;
the unique point P, whose position vector is A_1b, has image В under the
linear transformation T.
Ex. 7. Show that A-1 is the matrix of a linear transformation S—the inverse
of the transformation T.
As an example of the solution of equations whose matrix, A, is non-
singular consider
Oc+ y+z = 4,
\l 2
and we have
(by the usual inversion
process)
:|)'*-(H-3
(Э-ШН
4
The solution is χ = 1, у = 1, ζ = 2 or, in terms of the linear
transformation whose matrix is A, the point (1, 1, 2) maps into the point (4, -2, 1).
291

LINEAR EQUATIONS [15
The analysis of the case in which det A = 0 follows the same pattern as
the two-dimensional case and is pursued in the following series of exercises.
Ex. 8. Show that, under the linear transformation Τ whose matrix is given by
ia-ί a2 a3\
A = lbx b2 b3\
Vi c2 cj
the images of the points /, /, K, whose position vectors are i, j, k, are /', /', K',
whose position vectors are respectively the first, second and third columns of A.
Ex. 9. Show that, under T, the unit cube, three of whose edges are OI, OJ, OK,
transforms into a parallelepiped (a figure all of whose faces are parallelograms).
Ex. 10. Show that the volume of the parallelepiped obtained in Ex. 8. is det A.
(This result is necessary for the remaining exercises in this section, but its proof
is rather hard and may be omitted at a first reading. The best proof depends
upon the triple scalar product, which will be met in Book 2.)
Ex. 11. If det A = 0, show that О, Г, J', A"'are coplanar. What does this tell us
about the columns of A?
Ex. 12. If the point D, with position vector d, is the image of a point Ρ under
a linear transformation whose matrix is singular, what can be said about
(i) the point D; (ii) the vector d?
Ex. 13. Show that, if det A = 0, and if D lies in the plane OI'J'K', then all the
points of a certain line map into D.
Ex. 14. In the extreme case in which О, Г, J', K' are collinear, what can be said
about the columns of A? If Ax = d has a solution in this case, what can be said
about (i) the point D; (ii) the vector d?
In an actual example, det A should be calculated and the various cases in
which det A = 0 should be treated by reverting to the original equation.
Matrix methods are admirably suited to studying the structure of systems
of equations but rarely constitute an efficient procedure for their solution
in individual cases.
Example 3. Discuss the solution of the system of linear equations
x + ay+ ζ = b,
ax+3y+ ζ = 1,
5x + Sy+3z = 1
for various values of a and b.
/1 a 1\
A=la 3 II and so det A =-3α2+13α-14.
\5 8 3/
292

4] THREE UNKNOWNS; CONTINUED
Thus det A = 0 if a = 2 or J.
Case (i). α φ 2, α φ }.
The equations have a unique solution for all values of b.
Case (ii). a = 2.
The equations reduce to
x + 2y+ z = b, (1)
2x + 3y+ ζ = 1, (2)
5x + 8y+3z = 1. (3)
Eliminating ζ between (2) and (3), and between (1) and (2),
Г x+ у =2, (4)
\ x+ у = 1-6. (5)
Thus, the equations are consistent if and only if b = -1, in which case we
have a one-parameter solution л: = λ, у = 2-λ, ζ = λ-5 (on using
equations (4) and (1)).
Case (iii). a = y.
The equations reduce to
(3x + 7y+3z = 3b, (6)
\7x + 9y+3z = 3, (7)
[5x + 8y+3z= 1. (8)
(6) + (7)-2(8) gives 0 = 36 + 3-2.
Thus, the equations are consistent if and only if b = - $, in which case we
have a one-parameter solution χ = λ, у = 2(1 - λ), ζ = ^(11λ-15) (from
equations (7) and (8)).
5. HOMOGENEOUS EQUATIONS
We now consider the system of linear equations
(апх + а12у+а13г = 0,
a21x + a22y+a23z = 0,
a31x + 03^+033 ζ = 0,
or Ax = 0.
Such a system is said to be homogeneous. If A is regarded as the matrix of a
linear transformation Τ we seek these points which are mapped into the
origin under T. It is easy to see that the origin maps into the origin under
any linear transformation; that is, that χ = 0 is always a solution for a set
of homogeneous equations—we call this the trivial solution—but it is of
more interest to look for non-trivial solutions χ φ 0.
293

LINEAR EQUATIONS
Ex. 15. Show that the linear transformation Γ whose matrix is
'-G -! -!)
maps all points of the line
nto the origin.
What does this tell us about the system of homogeneous equations
[ x + 3y+ ζ = 0?
We conclude this chapter by proving a necessary and sufficient condition
for the existence of non-trivial solutions of the homogeneous equations
Ax = 0.
Theorem 15.1
Ax = 0 has a non-trivial solution o- det A = 0.
Proof, (i) If det Α φ 0, A-1 exists and χ = 0 is the only solution.
Thus, if Ax = 0 has a non-trivial solution, then det A = 0, and we
have proved that a necessary condition for the existence of a non-trivial
solution is det A = 0 (the implication =>).
(ii) If det A = 0 and all the cofactors are zero then, by the result proved
in Miscellaneous Exercise 14, Question 15, the rows of A are in proportion
and the three equations reduce to a single equation, which has a
two-parameter solution.
If det A = 0 and at least one cofactor, say А1Ъ is non-zero, then
a11A11+a12A12+a13A13 = 0 (detA = 0),
α21 Λΐ + α22 Ли + α23 Лз
α3ΐΛΐ +Я32Л2 +^33^13
We have thus constructed a solution
x = ^11» У = Лг, г = A13
which is non-trivial, since An φ 0.
The condition det A = 0 is sufficient to ensure the existence of a non-
trivial solution (the implication <=).
Exercise 15 (b)
1. Solve the equations (3x+ y+ z= -2,
\2x+2y + 3z = 8,
I x + 3y + 2z = 6
(i) by successive elimination; (ii) by using the inverse matrix.
294
= 0/1
= 0j
expansion by alien cofactors.

5] HOMOGENEOUS EQUATIONS
2. Calculate the inverse of the matrix
C-i-i-i)
and hence solve the equations
i-3x+Uy + 7z = a,
\ - x + 4y+3z = b,
[ 2x- 7y-5z = с
(i) when a = 4,b = l,c = -2; (ii) when β = 1, b = l, с = -1;
(iii) when α = 1, * = -6, с = 0.
3. Solve the equations ( x — 2y+ ζ = 1,
hx+2.y-3z = 7,
l.5x-2j,- ζ = 9
and interpret your result geometrically.
4. Show that the matrix /3 -1
A = (2 3
\l -15
is singular.
Discuss the solution of the equations
(3x- y+ 2z = 4, \Ъх- у+ 2z = 4,
(i) \2x+ Ъу- z=\, (ii) \2x+ Ъу- ζ = 1,
I x-l5y+10z = 8; [ x-l5y+l0z = 9,
and interpret your results geometrically.
5. Show that the equations (3x-y+ ζ = 3,
]αχ+^+ ζ = 4,
[8x-.y+3z = *
are consistent and have a unique solution, provided α φ 2.
Discuss the solution of the equations for the case a = 2.
6. Discuss the solution of the equations
Г x+j, + z=l,
i ox + ay+2z = 1,
U* + 3.y+(a+l)z = *
for various values of a, b.
Interpret your various results geometrically.
7. /3 -1 1\ /2 -1 -1\
A= 2 1 -1 , B= 3 -1 2 .
\1 -4 -l/ \5 -1 -2/
A maps the point Ρ into the point Q and В maps the point Q into the point R,
whose coordinates are (-5, 0, 0). Find the coordinates of P.
295

LINEAR EQUATIONS [15
8. Discuss the solution of the equations
(3x + 2y+az = 2a,
\ax + y + 6z = 15,
[ x + 2y + 3z = S
for various values of a.
Interpret your results geometrically.
9. Find the value of a for which the homogeneous equations
(6x + 2y- z = 0,
\Ъх + ау-2г = 0,
[3x-2y+ z = 0
have a non-trivial solution, and find solutions for this case.
10. Prove that, if det A = 0 and if Axj = b and Ax2 = 0, then Xi + Ax2 is a
solution for the equation Ax = b, for all values of λ.
Find a solution of the equations
x+ y+ z = a,
Ъх-5у- z = b,
2x-8y-3z = с
when a = b = с = 0 and hence write down the general solution of the equations
when a = 1, b = 3, с = 2.
11. A linear transformation Г has matrix
■6 J I)'
Show that Τ maps all points of the line
into the origin, and find the line of points that maps into the point (6, 6, 0).
What is the relation between these two lines?
Iax bx сЛ
= я2 h сг\.
\<h Ih cj
Ax is the cofactor of дь etc., and det A = Δ. If
a^x + b^y + dz = dlt
a2x + b2y + c2z = a\,
a3x + b3y + c3z = d3,
prove that Δχ = d1A1 + d2A2 + d3 A3.
If Δ = 0, what does the value of the expression dx Ax + d2A2 + d3 A3 tell you about
the solution of the given system of equations ?
296

5] HOMOGENEOUS EQUATIONS
13. Prove that, if
= [a2 b\ °cX d = IdX χ = (y)
\a» b3 c3J \dj \zj
and if det Α Φ 0, then the unique solution of the equation
Ax = d
is given by
<?i ii Cj αϊ i/i Ci αϊ Z>i dj.
\ d2 b2 c2\ = a2 d2 c2 \ = \ a2 b2 d2
| d3 b3 c3 I I a3 d3 c3 I I a3 b3 d3 I
14. The linear transformation, T, has matrix A where
■(-1J -5-
Ε is the set of points in space whose coordinates satisfy the inequalities χ > 0,
у > 0, ζ > 0, x+y + z < 1. Determine the image set T(E).
Miscellaneous Exercise 15
1. Find the inverse of the matrix
Given the equations Г 2*!—4x2+ x3
+ 3x3 ■■
(-1 i -!)·
(2лг!-4л:2Н
-X!+2x2-
find the solutions
(i) when a = 1, b = 1, с
2. Let A be a 3 χ 3 real matrix and b a three-rowed column vector. It is proposed
to solve the equation _
and a particular solution χ = Xo is noted. Prove that any other solution may be
written in the form
X = Xo + U,
where u is a solution of the equation Ax = 0. Prove conversely that any vector
Xo + u is a solution of the equation, where Xo is a fixed particular solution of the
equation and u is any vector such that Au = 0.
Interpret geometrically the equation and its solutions if A is singular (i) when
Xo exists, and (ii) when there is no such particular solution Xq. (M.E.I.)
297

LINEAR EQUATIONS
M-(j
form the product MM' and show that M' = 49M"1.
Without multiplying out, state the product M'M, giving reasons for your
answer. Hence, or otherwise, find the solution of the equations:
2*!+3x2+ 6x3 = 1,
6*! +2x2-3x3 = 1,
3*!-6x2+ 2*3 = 2. (M.E.I.)
4. The simultaneous equations
Зх + у = 5
may be written in matrix form as
(1 -Do-® °tAx=a
Carry out numerically the procedure of the following three steps:
(i) A'AX = A'B;
(ii) (A'A)-1 A'AX = (A'A)-1 A'B;
■0
(iii) IX = = (A'A)-1 A'B.
Verify that the values of x, у so found do not satisfy all the original three
equations. Suggest a reason for this.
Under what circumstances will the procedure given above, when applied to
a set of three simultaneous equations in two variables, result in values which
satisfy the equations? (SMP)
5. By systematic elimination, find values of А, В, С in terms of a, b, c, such that
( x+ y+ z = 1,
\ ax+ by+ cz = a + b + c,
[a2x + b2y+c2z = bc + ca + ab,
if and only if ix+y+ ζ = 1,
{ Ay + Bz=b+c
[ Cz = a2 + b2.
Solve the equations, assuming a, b, с are all different.
Describe geometrically the configuration of planes, in three-dimensional
Euclidean space, with equations:
x+ y+ z= 1,
x + 2y+2z = 5,
x + 4y + 4z = 8. (SMP)
298

5] MISCELLANEOUS EXERCISE 15
6. Solve the simultaneous equations
x+ y + ζ = 3,
x+2y + 3z = 6,
x+3y + kz = 4 + k
(i) when/: φ 5;
(ii) when к = 5, giving the general solution. (SMP)
7. If A is the matrix Iй I and X is the non-zero matrix I I, show
Iй I and X is the non-zero matrix I* I,
that, if AX = AX, where λ is a number, real or complex, then
X*-(a + d)X+(ad-bc) = 0.
Show that A itself satisfies this quadratic equation, in the sense that
A2-(a + d)A + (ad-bc)\ = 0,
/1 0\
s the matrix
(Itl-
8. By considering det (A - XT), extend the result of Question 7 to a 3 χ 3 matrix A.
9. Factorize the determinant I 1 1 1 I
I a2 b2 c2 I'
Show that, if no two of a, b, с are equal, the equations
x+ y+ ζ = a,
ax+ by+ cz = ab,
a2x+b2y+c'z = abc,
have unique solutions for x, y, z, and find them.
Discuss the special cases
(i) a = b Φ с; (ii) b = c + a; (iii) a = b = с. (М.Т.)
10. Prove that the only value of λ allowing a real non-trivial solution of the
simultaneous equations _ .
y + z = Xy,
4x+2y-z = Xz,
is λ = 1. (M.T.)
11. If А, В are 3 x 3 matrices and χ a three-rowed column vector such that there
exist numbers λ, μ such that
Ax = λχ, Bx = μχ,
prove that there exists a number ν such that
ABx = »x.
Prove further that, if x1; x2, x3 are three linearly independent column vectors
with this property, then A and В are commutative for multiplication.

LINEAR EQUATIONS [15
12. Write the two sets of three equations
Яа*1+Я*2*2 + Я>3*3 = С, ЬцУ!+bi2y2+biay3 = Xt (ί = 1, 2, 3)
in matrix form, and prove that they can be solved uniquely for х1г x2, х3, у1г у2, y3
if and only if det AB φ 0, where A and В are the 3 x 3 matrices of coefficients
in the two sets of equations.
Show that the equations
Χ-ίΛ-2χί = 2, Хг + Хг + Хз = 1, Зх2-Х3 = к,
Зу1+У2 + 4у3 = Χι, -y-ί + 2у2-Зу3 = хг, У!+5у2 -2у3 = х3
are inconsistent if к φ 7, and find the most general solution for х^.-.уз if
к = 7. (M.T.)
13. Prove that, if λ: φ 0, the system of equations
2x+ у = а,
x+ky- ζ = b,
y + 2z = с
has a unique solution (x, y, z) for every choice of (a, b, c).
Show also that, when к = 0, the system is consistent if and only if (a, b, c)
satisfy a certain linear relation, and find this relation.
Verify that the system is consistent when к = 0 and (a, b, c) = (1, 1, -1)
and find an expression for the general solution of the system in this case.
(M.T.)
14. Find, for all values of the parameter λ, the number of solutions of the
eqUati°nS x + 2y+Xz=0,
2x+3y-2z = λ,
λχ+ y+ ζ = 3. (M.T.)

16. Discrete probability distributions
1. INTRODUCTION:
THE UNIFORM DISTRIBUTION
In Chapter 7 we discussed the concept of an outcome space for a random
experiment. To each elementary event we ascribed a probability and the
entire set of probabilities was described as г. probability distribution for the
outcome space. In this chapter we shall enumerate various possible
probability distributions that find frequent use as mathematical models for
random experiments. We shall generally define our outcome space in terms
of an associated random variable Ζ (see Chapter 10), and we shall thus be
able to talk about the mean and variance of the distribution; however, it
must be remembered that, for example, the mean is defined as ${X) and
so, if a new random variable, Y, is chosen, the mean naturally changes, too.
Unless there is any possibility of confusion, we shall refer to the mean and
variance of X as μ and σ2 respectively.
Suppose we have an outcome space consisting of η elementary events,
with associated random variable, X, where
X= {1,2, 3,..., л}.
Perhaps the simplest assumption we can make about the η possible
outcomes is that each one is as likely to occur as any other. Thus we take
Pr(Z=r)=l/n. (1)
Equation (1) defines the uniform distribution for η possible outcomes.
The uniform distribution is, in a way, fundamental, for we may often
subdivide the elements of an outcome space in such a way that the new
outcome space may reasonably be given a uniform distribution. This is not
always possible, however; a simple counter-example might be the fall of a
biased coin.
The mean, μ, of the random variable X with a uniform distribution is
given by
μ = <?(Χ)
= !("+!)· (2)
301

DISCRETE PROBABILITY DISTRIBUTIONS [16
Again, the variance, σ2, is given by
= £[Χ*]-μ*
= Κ"+ΐ)(2«+ΐ)-Κ"+ΐ)2
= -h(n2-l). (3)
The reader will find a large number of examples on the uniform
distribution at the end of Chapter 7.
Ex. 1. A man spins a coin and throws a die. For a head he scores —1, for a
tail +1, and this he adds to the score showing on the uppermost face of the die.
The values of the random variable, X, where
X= {0, 1,2, ...,7}
represent his total score. Why do you think a uniform distribution would be
an unsuitable mathematical model for this eight-point space? Suggest a way of
subdividing the sample space so that a uniform distribution would be suitable.
2. THE BINOMIAL DISTRIBUTION
The simplest possible type of random experiment is one that has just two
possible outcomes, which we may conveniently designate success and
failure. An experiment of this type is called a Bernoulli trial. (J. Bernoulli,
1654-1705, one of a family of distinguished mathematicians, whose
famous work on probability, the Ars Conjectandi, was published
posthumously in 1713.). Some examples of Bernoulli trials are
(i) a coin is tossed : does it show heads or tails?;
(ii) a person is tested for disease: has he got the disease or not?;
(iii) the height of a person is measured: is it less than six feet or not?;
(iv) a marksman fires at a target: does he bit or miss the bull?
If the probabilities of the two possible outcomes of a Bernoulli trial are
ρ (success) and# (failure) (where p+q = 1) and η independent repetitions
of the trial are made, we may define the random variable X as the number
of successes obtained in the η trials and calculate p(r) = Pr (X = r),
r = 1,2,3, ...,n.
Method I. Suppose first we assign some specific order in which we should
obtain the r successes: for sake of argument, suppose the first r trials
result in success and the final (n — r) trials in failure. Then, by independence,
302

2] THE BINOMIAL DISTRIBUTION
the probability of obtaining this sequence is prqn~r. But there are Iй)
different sequences possible. Thus
p(r) = -Pr(X=r)=typ'q'". (4)
Method II. The p.g.f. for a single Bernoulli trial is given by
S{f) = p.t1+q.t0
= pt+q.
Thus, the probability generating function for η independent Bernoulli
trials is given by G(t) = (pt+qy.
Thus p(r) = coefficient of tT in the expansion of (pt+q)n
The probability distribution defined by equation (4) is called the
binomial distribution. It is easily verified that (4) does indeed define a valid
probability distribution. For Pr (X = r) > 0 and also
№
ргдп-r = (j)+qYj by the Binomial Theorem;
= 1, since p+q = 1.
It finds a wide field of application as a probability model but the reader
must always take care to remember the binomial distribution applies only
if we have η independent repetitions of a Bernoulli trial. A binomial
distribution is known completely if the two parameters η and ρ are given; the
phrase 'the binomial distribution arising from η repetitions of a Bernoulli
trial with probability of success/'' is often abbreviated to B(n,p).
Example 1. Six unbiased dice are thrown. What is the probability of securing
three or more sixes!
The dice may be assumed to fall independently and so the random
variable X (= number of sixes showing) has a binomial distribution with
probability \ of success on each of the six Bernoulli trials.
Thus
Pr(Z> 3) = l-p(X < 3)
* 006.

DISCRETE PROBABILITY DISTRIBUTIONS [16
We next derive expressions for the mean, μ, and variance, σ2, of the
number of successes in a binomial distribution. We shall give two methods,
the first one using the definition (4) directly, the second employing the
probability generating function (and using Theorem 10.5).
Method I. By definition
* /л-1\ r n r ■ /n\ n\ nln-\\
= np%l \rZi)pr'1^n~r
= npip+q)71'1, by the Binomial Theorem,
= np, since p+q = 1.
Again
σ2 = <?[(Χ-μ)2] = <?[Χ2]-μ\ by Theorem 10.1,
= Σ''2( I Prqn~T — n2p2, using the first part,
= Σ nr Г_ J prqn~r-n2p2
= ^n{n-\)p2^rZ^pr-2q^
+ ^np^~_l}jPr-1qn-r-n2p2,
on writing r = (r-l) + l,
= n{n—\)p2 + np — n2p2, since p+q = I,
= npq.
Thus we arrive at the important result: the mean and variance of the
number of successes in a binomial distribution B(n, p) are given by
μ = np, σ2 = npq. (5)
Method II. Since the p.g.f. for the binomial distribution is given by
G(t) = (pt+qY
we have
G'(t) = np{pt+qY-\ G\t) = n{n-l)p2(pt+qy-2 (n>2).
Putting t = 1, μ = np, σ2+μ2—μ = n(n—l)p2 and the results follow.
304

2] THE BINOMIAL DISTRIBUTION
Example 2. A very large number of balls are in a bag, one-eighth being black
and the rest white. Twelve balls are drawn at random. Find
(i) the probability of drawing three black balls and nine white balls;
(ii) the probability of drawing at least 3 black balls;
(iii) the expected number of black balls in the sample;
(iv) the most likely number of black balls in the sample.
Drawing a ball from the bag and noting its colour may be regarded as a
Bernoulli trial (a black ball drawn being a success, ρ = £). Since a ball is
not replaced before its successor is drawn, the twelve Bernoulli trials
constituting our sample are not strictly independent; however, since the
number of balls in the bag is very large, we may assume that Pr (black ball
drawn) remains sensibly constant at ^; that is, each trial is independent of
what has previously occurred. Thus the random variable
X = number of black balls contained in a sample of 12
has a binomial distribution 5(12, |).
к 0-129;
* 0-182;
(iii) S(X) = np
= 1-5.
(iv) Write p(r) = Pr (X = r) = (^) p^~r.
= 12! rl(12-r)! β\
(r+l)!(ll-r)! 12! \i)
Yl-r
7(r+l)·
Thus ^±1) < j if r > ι
P(r)
and so /7(1) = щщ p(0) = y^(O)
is the maximum of the p(r); that is, the most likely number of black balls

DISCRETE PROBABILITY DISTRIBUTIONS [16
[Notice that the ratio deduced in Example 2(iv) enables us to express
p(r) in terms of p0. Thus:
Α = ΨΡο,
a = Ha = Mpo,
Рз = \%Рг = ΤιΐΡο, etc.
This is a particularly useful device for calculating binomial probabilities
if a hand calculating machine is available.]
Ex. 2. Find the values of Pr (X = r) in terms of Pr (X = 0) for the binomial
distribution B(4, £).
Ex. 3. If fifteen dice are thrown what is
(i) the expected number of sixes showing;
(ii) the most likely number of sixes showing ?
Ex. 4. Calculate the mean and variance for the binomial distribution B(9, |).
Ex. 5. In a family of four children, what is the probability of there being two
boys and two girls ? What is the probability that the eldest two are boys and the
youngest two girls ? What are the odds against having all four children of the
same sex?
Exercise 16(a)
1. A coin is tossed four times. Find the probability that heads appear
(i) at the first two tosses, followed by two tails;
(ii) just twice in the four throws;
(iii) at least twice.
2. 10 % of the very large number of articles produced by a machine are faulty.
What is the probability that a random sample of ten articles will
(i) be free of faulty articles;
(ii) contain more than two faulty articles?
3. From a packet containing a large number of seeds, 40 % of which are
advertised to give red flowers and the others white, 10 plants are produced. What is the
probability
(i) that all the plants have red flowers;
(ii) that all the plants have white flowers;
(iii) that half the plants have red flowers and half white?
4. 10 % of the very large number of articles produced by a machine are faulty.
If articles are taken at random and tested, how many articles will be tested, on
average, before the first faulty article is found ? What is the probability that the
testing procedure will have to go on longer than this before the first faulty article
is found?
5. (i) In a trial, eight coins are tossed together. In one hundred such trials how
many times should one expect to obtain three heads and five tails ?
306

2] THE BINOMIAL DISTRIBUTION
(ii) If 8 % of articles in a large consignment are defective, what is the chance
that a sample of thirty articles will contain fewer than three defectives ?
(O&C)
6. A battery of four guns is firing on to an enemy emplacement. It is reckoned
that each gun should score on the average one direct hit in every five shots,
and that three direct hits are needed to destroy the emplacement. If each gun
fires one shell, calculate the probability that the emplacement will be destroyed.
With new gun crews it is reckoned that two of the guns should score one direct
hit in every three shots and that the other two guns should score one direct hit
in every four shots. If each gun now fires one shell, calculate the probability that
the emplacement will be destroyed. (Cambridge)
7. Nine unbiased dice are thrown. Find p(r), the probability that r sixes appear,
and hence determine the value oip(r+\)jp(r). Find
(i) the expected number of sixes;
(ii) the most likely number of sixes;
(iii) the probability of obtaining more than one six.
8. (i) In a binomial distribution where the probabilities of the occurrence of an
event are given by the terms of the expansion of (p + g)m, the mean μ and the
standard deviation σ of the distribution are given by the formulae
μ= mp; σ = J(mpq).
Prove these formulae in the case where m = 3.
(ii) Two men A and В play a game in which A should win eight games to
every seven won by B. If they play three games, show that the probability that
A will win at least two games is approximately 0-55. (Cambridge)
9. Playing a certain One-arm bandit', which is advertised to 'increase your
money tenfold', costs 5p a turn; the player is returned 50p if more than eight balls
out of a total of ten drop in a specified slot. The chance of any one ball dropping
is p. Determine the chance of winning in a given turn, and for ρ = 0-65, calculate
the mean profit made by the machine on five hundred turns. Evaluate the
proportion of losing turns in which the player comes within one or two balls of
winning (j> = 0-65). (Cambridge)
10. Prove that, in the binomial distribution B(2n, %), the probability of scoring
an even number of successes is \.
11. An experiment consists of tossing an unbiased coin twelve times and
counting the number of heads obtained (X). If the mean and variance of X are μ
and σ2 respectively, find the value of
(i) Ρτ(\Χ-μ\ > σ); (ii) Ρτ(\Χ-μ\ > 2σ).
3. THE GEOMETRIC AND NEGATIVE
BINOMIAL DISTRIBUTIONS
We must now consider two further probability distributions associated
with the independent repetitions of Bernoulli trials. However, instead of
repeating the trial a fixed number of times and asking how many successes
307

DISCRETE PROBABILITY DISTRIBUTIONS \\6
have been obtained, we reverse the process by asking how long we must go
on repeating the trial until a stipulated number of successes is obtained.
First, suppose we wish to calculate the probability that the first success
occurs at the rth repetition of a Bernoulli trial, the probability of success
in any one trial being p. (As in Section 2, we assume that each trial is
independent of what has previously occurred.) We take as our random
variable X, where
X = number of trials up to and including the trial which results in the
first success.
There is no upper limit to the value X may take: we can theoretically
continue obtaining failures for ever, although the probability of doing so
steadily decreases.
Now Pr (X = r) = Pr (initially (r — 1) failures, followed by a success),
Le· Pr(X=r) = <r-1P· (6)
Equation (6) defines the geometric distribution for the random variable
X= {1, 2, 3, ...}. It is easily seen that this is a valid probability distribution,
for
= 1.
The mean, μ, and variance, σ2, of the random variable Ζ for a geometric
distribution may be derived without undue difficulty: we give two possible
methods.
Method I. S(X) = Σ rpqr-\
qS(X) = ^rpq^ = ^(r-\)pq^.
r=l r=1
Subtract: pS( X) = ρ + Σ pq^
= 1, since p + q — 1,
■·· μ = W = \\p.
Again £{X*) =jtrW~1.
^(χ2) = Σ'2μγ = Σ('-1)2μ'--1·

3] BINOMIAL DISTRIBUTIONS
Subtract: P#(X*) = Ρ + 2 Σ rpqr~x - Σ РЧГ~Х
r=2 r=2
= ρ + 2(β-ρ)-^
= p + 2{(llp)-p}-(\-p)
= (2/p)-l,
= 1-1-1
/>2 /> />2
= ?/P2·
Thus, for the geometric distribution,
μ = -, σ2
/>
Method II. The p.g.f. for the geometric
G(t) = g(r)
=£pqr-1rr
= ptl(l-qt).
Thus G\t)=pl(\-qt)\ G"(t) = 2pql(l-qty
and results (7) follow on substituting t = 1 and using Theorem 10.5.
The geometric distribution is a suitable probability model in a number
of practical examples; for example, in inverse sampling, articles are tested
until a faulty article is found.
Example 3. 5 % of the output of a certain machine is faulty. Articles are
taken at random from the output and tested, the process stopping when the
first faulty article is obtained. Find
(i) the expected duration of this sampling process;
(ii) the probability that the process will terminate before the expected
value is reached.
We shall assume that the total output of the machine is large and so the
individual Bernoulli trials of testing articles are independent, each with
probability of success (that is, of finding a faulty article) equal to -fa.
Adopting the geometric distribution as a suitable mathematical model, we have
(i) mean length of sampling process = l/(-2ir) = 20;
309
distribution is given by

DISCRETE PROBABILITY DISTRIBUTIONS [16
(ii) Pr (X < 20) = 1 -Pr (X > 20)
= 1-Σ (НГЧА)
r=20
= 1-[(М)19Ш]/[1-Ш
к 0-623.
The concept underlying the geometric distribution may be generalized
by asking how many Bernoulli trials will be required up to and including
the Mi success. Taking as our random variable X = {1, 2, 3, ...} we
observe that, if X = r, the rth outcome must be the Mi success, probability
p, and the previous (fc— 1) successes are binomially distributed B(r— l,p).
ThuS f/r-l\ \
Pr (X = r) = p[\^_\) /,*-y-«-<*-«j (/>, r ^fc)
(8)
Equation (8) defines the negative binomial distribution; to appreciate the
reason for this name and also to facilitate the verification that this is indeed
a valid probability distribution, we first observe that
X= X^Xz+... + Xi+... + Хъ
where Xt is the number of trials after the (г— l)th success up to and
including the rth success. Thus, the p.g.f. for a negative binomial distribution
is the Mi power of the p.g.f. for a geometric distribution, i.e.
G{i) = P4X\-qty\
Writing G(t) = а^ + а^^ + а^ь+Ч...
we have for r ^ k, Pr (X = r) = ak+r,
Furthermore,
and it follows that
310
ДРг (ЛГ = r) =βΣ β*4*
= G(1)
G'(t) = Λρ*ί*-1/(1-ϊ0*+1»
* 2 kq
μ = -, σ = -τ·

3] BINOMIAL DISTRIBUTIONS
Ex. 6. Devise an experiment for which you feel the negative binomial
distribution would constitute a suitable mathematical model.
Ex, 7. Dice are thrown in succession until two sixes have appeared; find
(i) the probability that two dice in all are thrown;
(ii) the probability that more than four dice are needed;
(iii) the expected number of throws required.
4. THE POISSON DISTRIBUTION
Another distribution of common applicability associated with an infinite
outcome space and random variable
X= {0,1,2,3,...}
is the Poisson distribution, defined by the equation
Pr(Z=r) = e-a.^ (e>0).
Since Σ Pr (X = r) = e-a Σ -} (10)
= e~aea
= 1
and Pr (X = r) > 0, for all r, equation (10) does constitute a valid
probability distribution.
We shall show that the Poisson distribution arises as the limit of a
binomial distribution in which n^oo andp^O, in such a way that np = a
remains constant. Thus the Poisson distribution will form a suitable
probability model, not only to situations where the binomial distribution
applies directly and η is very large and ρ very small but also to situations
where the binomial distribution is not directly applicable. For example,
consider requests for trunk call connections made at a telephone exchange.
Suppose that, for the period 7.00 a.m. to 8.00 a.m. they average out at
three and suppose further that the request takes a negligible time to make.
Dividing the hour up into a large number of short time intervals (say 720
of 5 seconds each), the probability of there being a call in any one of the
intervals may be taken as 2b", on the assumption that the calls arrive
independently throughout the hour, and the probability of two calls in
the interval is negligible. Thus we have 720 independent repetitions of a
Bernoulli trial ('Is there a call or not in a given 5 second interval?') with
small probability of success ρ = j-Jj, and the Poisson distribution may
be regarded as a good mathematical model to employ.
We must now verify our original assertion that a binomial distribution
311

DISCRETE PROBABILITY DISTRIBUTIONS
[16
B{n, p) tends to a Poisson distribution as η -> oo and ρ -> 0 in such a way
that np = a, a > 0. First, for B(n,p)
Pr(Z=r)=(^V"-r
= ^i)(^bJ^±lV(1_^(1_,r
= Ι »M */ I—«l^j^V/»-.
since л/7 = a.
Since r is a fixed number, the product of the r factors in the numerator of
the first fraction tends to 1 as η -+ oo, and (1 — p)~r tends to 1 as ρ -+ 0.
Furthermore,
and we have Pr (jf = r) -> ^p
as required.
The Poisson distribution is completely described if we are given the
value of a; thus, a Poisson distribution is a one-parameter distribution and
a Poisson distribution with parameter a may be referred to as P(a).
The mean, μ, and the variance, σ2, of the Poisson distribution P(a) may
be found by the direct computation of $(X) and ${X*). Thus
μ = <?(Ζ)
= ^r eracf
= a.
Similarly, σ2 = «?(Х2)-/*2
= α as above.
For the Poisson distribution P(a),
μ = a, σ2 = a, (11)
i.e. the mean and variance of the Poisson distribution P(a) are both equal
to a.
t For the proof of this and other results concerning the exponential function, see one
of the books on calculus mentioned in the bibliography.
312

4] THE POISSON DISTRIBUTION
Ex. 8. Prove that, if the random variable X = {0, 1, 2, ...} has equal mean
and variance, this does not necessarily mean that X has a Poisson distribution.
{Hint: consider a suitable uniform distribution.)
Example 4. Traffic accidents are reported in a certain town at an average
rate of four in a week. Estimate the probabilities:
(i) of a given week being accident free;
(ii) of there being three or fewer accidents in a given week;
(iii) of there being more than four accidents in a given week.
To produce reasonable estimates of these probabilities we have first to
set up a mathematical model of the situation. The assumption that
accidents occur independently of one another is arguable but, without more
detailed data, it would be difficult to make a more plausible assumption.
With this hypothesis, a Poisson distribution for the number, X, of accidents
per week would appear to be a reasonable model, for the reason outlined
earlier. Thus we have
(i) Рг(Х=0) = е-*
к 0-018;
(ii) Рг(Х<3) = е-*(1+4 + ^+:£)
и 0-433;
(42 43 44\
1+4 + 2! + з, + 4|)
к 0-371.
Exercise 16(b)
1. The random variable X can take Values 0, 1, 2, 3 Given that X has a
Poisson distribution, mean 2, calculate the probabilities
Pr (X = 0), Pr (X = 1), Pr (X = 2), Pr (X = 3),
Pr (X = 4), Pr (X = 5), Pr (X > 5).
2. Samples of forty articles at a time are taken periodically from the continuous
production of a machine and the number of samples containing 0, 1, 2, ...
defective articles are recorded in the following table:
No, defective per sample 0 1 2 3 4 5 6 Total
No. of samples |~~^ ^ ^ ~ ~ \ 0 | 100
Find the mean number of defectives per sample.
313

DISCRETE PROBABILITY DISTRIBUTIONS [16
Assuming that this is the mean of the population and that the Poisson
distribution applies, find the chance of:
(a) a sample containing four or more defectives;
(b) two successive samples containing between them four or more defectives.
(O&C)
3. The following table shows the results of recording the telephone calls handled
at a village telephone exchange between 1.00 p.m. and 2.00 p.m. on each
of a hundred weekdays (e.g. on thirty-six days no such calls were made):
Calls 0 I 2 3 4 or more
Days |36 35 22 7 0
Assuming that calls arrive independently and at random, estimate
(i) the mean m of the corresponding Poisson probability distribution;
(ii) the probability that if the operator is absent for ten minutes no call will
be missed;
(iii) the probability that if the operator is absent for ten minutes two or more
calls will be missed. (Cambridge)
4. In an examination 60 % of the candidates pass but only 4 % obtain distinction.
Use the binomial distribution to calculate the chance that a random group of
ten candidates should contain at most two failures.
Use the Poisson distribution to calculate the chance that a random group of
fifty candidates should contain more than one distinction. (Cambridge)
5. The road accidents in a certain area occur at an average rate of one per two
days. Calculate the probability of 0,1, 2 6 accidents per week in the district.
What is the most likely number of accidents per week ?
How many days in a week are expected to be free of accidents ? (M.E.I.)
{Note. This question is best attempted using a hand calculating machine, if
one is available.)
6. Explain briefly what is meant by a Poisson distribution and show that for
such a distribution the mean is equal to the variance.
In a bakery 3600 cherries are added to a mixture which is later divided up
to make 1200 small cakes.
(i) Find the average number of cherries per cake.
(ii) Assuming that the number of cherries in each cake follows a Poisson
distribution, estimate the number of cakes which will be without a cherry and the
number with five or more cherries. (O & C)
7. 10 % of the output of screws from a machine are faulty. If screws are taken
at random from the output until two faulty screws are found, what will be the
average number of screws tested ? What is the probability that precisely this
number of screws will be needed ?
8. 20 % of the butterflies in a district are of type A. If a random sample of size
10 is taken, find the probability
(i) that there will be just two butterflies of type A in the sample;
(ii) that the tenth butterfly caught will be the second one of type A.
314

41 THE POISSON DISTRIBUTION
9. Mass-produced articles are taken at random from a batch and tested until a
faulty article is found. If the twenty-first article proves to be the first defective,
is this at variance with the assumption that 10 % of all the articles are faulty?
10. If X is distributed according to a Poisson distribution with mean A, write
down the probability that X = r.
How is this probability modified if the values X = 0 are unobservable ? Prove
that the mean is now λ/(1 -e~A), and find the second momentt about X = 0.
(Oxford)
11. Evaluate Pr(^= r+l)/Pr(X = r) for the Poisson distribution P(a). If
the numbers of misprints on pages of an uncorrected proof have a Poisson
distribution with mean 2-7, what is the most likely number of misprints to be
found on a given page ?
12. The average proportion of bad eggs in an egg packing station is one in
2000. The eggs are packed in boxes containing six eggs each.
(i) Evaluate to two significant figures the probability that a box contains
one bad egg.
(ii) A housewife complains if she obtains two or more boxes, with one bad
egg each per hundred boxes. What is the probability that she complains.?
(M.E.I.)
13. During World War II, 537 flying bombs fell on south London. The
distribution of the number of hits in 576 areas, each of 0-25 km2, is given in the
table. Compare the actual frequency with the theoretical frequency obtained by
assuming that the aim was effectively random and followed a Poisson distribution
with the same average number of hits.
No. of hits 0 1 2 3 4 5 6 or more
Frequency 229 211 93 35 7 1 0
(M.E.I, adapted)
5. SAMPLING INSPECTION SCHEMES
Suppose a machine produces a large number of articles and it is required to
keep a check upon the number of articles produced which do not attain
some prescribed standard; for convenience, we shall describe such articles
as 'faulty'. The most obvious method is to check each article individually,
but such a process suffers from several drawbacks; for example
(i) the process of testing might be very costly;
(ii) the process would almost certainly be time consuming;
(iii) by the very nature of the test, the article tested might be destroyed
(consider, for example, the testing of photographic flash bulbs).
A more economic approach is to take a random sample from the total
output, to test each article of the sample and to deduce, from a
probabilistic argument, the quality of the whole output. The process of selecting
a random sample is not as straightforward as it might appear; we shall,
t The second moment about X = 0 Is SiX*).
ττ PPM 315

DISCRETE PROBABILITY DISTRIBUTIONS
[16
however, here assume that such a sample has been drawn, and concentrate
upon the second part of the problem: inferringthequalityofthepopulation.
A simple approach would be to take a sample of size n, test each article
and reject the batch as sub-standard if, say, more than m of the articles
prove faulty. We begin by assuming that we have some percentage, lOOp
say, as an upper limit to the number of faulty articles which may be allowed
before the manufacturing process is stopped and corrected for any fault.
Example 5. Samples of size 20 are taken from a large batch of articles
produced by a machine; if more than two faulty articles are discovered the
batch is withdrawn, otherwise it is accepted. What is the probability that, if
the machine is producing 5 % of faulty articles, the batch will be accepted!
If the proportion of faulty articles rises to 10 % of the total output, what is the
probability the batch will now be rejected!
Since the batch is large, we may assume that the removal of twenty
articles does not sensibly alter the proportion of defectives and so the
binomial distribution 5(20, 0Ό5) may be taken as a suitable mathematical
model for the experiment, where our random variable is the number, X,
of defectives in a sample of size 20.
(i) If Pr {article faulty) = -2\, then
Pr (not more than 2 faulty articles in 20)
χ 0-925-
(ii) If Pr (article faulty) = -&, then
Pr (more than 2 faulty articles in 20)
a 0-649.
Thus, if only 5 % of the articles produced are faulty, the chance of
accepting the batch is about 92^%, whereas, if the number of faulty articles
increases to 10 %, there is a 65 % chance of rejecting the batch.
Ex. 9. Comment upon the efficiency of the test given in Example 5. Suggest
ways in which you feel it could be improved.
The testing procedure exhibited above is an example of a single sampling
inspection scheme. Any sampling inspection scheme is open to two types
of error:
(I) A batch with an acceptable number of faulty articles may be
rejected.
(II) A batch with aq unacceptable number of faulty articles may be
accepted.
316

51 SAMPLING INSPECTION SCHEMES
If α = Pr (error of Type Ι), β = Pr (error of type II), then, in Example 1
above, α и 0075, β χ 0-351, if we regard 5 % faulty as acceptable, but
10 % faulty as unacceptable.
It can be shown that, if the size of the sample is held fixed, but the
standard of rejection is altered, either α increases and β decreases or vice versa.
Thus, there is an unavoidable margin of error in any inspection scheme;
the decision whether to minimize α or β depends upon such external
arguments as whether it is economically more desirable to withdraw good
batches or allow sub-standard batches on to the market. Of course, if the
sample is increased, more accurate information about the population may
be obtained and both α and β may be decreased.
Ex. 10. What objections are there to increasing the sample size in order to reduce
aand/??
If the probability, p, that an individual article is faulty is now regarded
as a variable, and the testing procedure is defined (that is, the sample size
and number of faulty articles required for rejection are given), the
probability Ρ of accepting the batch is a function of p. The curve obtained by
plotting Ρ against ρ is called the operating characteristic for the testing
plan. For example, with the test outlined in Example 5, the operating
characteristic has the following shape:
"°K

DISCRETE PROBABILITY DISTRIBUTIONS [16
From this graph, the probability of accepting (or rejecting) a sample
given the proportion of defectives in the population may be read off.
An increase in the sample size causes the operating characteristics to fall
off more steeply, thereby improving the test.
Other schemes are possible, besides single sampling schemes. For
example one can devise double sampling schemes as outlined in the next
example.
Example 6. A sample of size 8 is taken from a large batch of articles
produced by a machine. If the sample contains no faulty article the batch is
accepted, if more than one, it is rejected; if, however, the sample contains
a single faulty article, a second sample of eight articles is taken. The batch is
now accepted if this second sample is free of faulty articles, otherwise it is
rejected. Find
(i) the probability of accepting a batch containing 1 % of faulty articles;
(ii) the probability of rejecting a batch containing 10 % of faulty articles;
(iii) the average size of sample tested in case (ii).
PI
1-0 L
0-8
0-6 l·
0-4 V
Q-2h \
0-2 04 0-6 0-8 1-0 ρ
Fig. 16.2
Since the batch is large, the binomial distribution constitutes a suitable
mathematical model. Thus, if ρ is the probability of a randomly chosen
article being faulty, the probability, P, of accepting the batch is given by
318

61 HYPOTHESIS TESTING
i>=(i-/08+[8(i-/0»(i-/08
= (i -/О8+8/41-/О15;
(i) ifp = 1/100, Ρ и 0-99;
(ii) if/7= 10/100, 1-Рй 0-55;
(iii) a first sample of size 8 is always taken; the probability that a second
will be required is 8(0·9)7(0-1) χ 0-382 and so the expected sample size is
8 + 8x0-382» 11.
As in the case of single sample inspection schemes, the operating
characteristics curve can be drawn in this case too; see Figure 16.2.
Ex. 11. What advantage does a double sampling scheme possess over a single
sampling scheme? Are there any disadvantages?
6. HYPOTHESIS TESTING
Probability theory finds a further application in the testing of statistical
hypotheses. Consider the following problem.
Example 7. A coin is tossed ten times and shows nine heads. Is this sufficient
evidence to support the claim that the coin is biased!
In order to attack this problem, we adopt the approach made in testing
scientific theories: we make a hypothesis and then consider whether or not
observational evidence supports this hypothesis. In problems of statistical
inference, our hypothesis is generally called the null hypothesis (because it
often takes the form of an assumption about the absence of bias in the
population): in this example, we shall take as our null hypothesis the
statement ' the coin is unbiased; that is, Pr (heads on any toss) = \.
Having set up our null hypothesis we may ask the question ' Is the
probability of obtaining a result as bad as or worse than, the observed result so
small that we are reluctant to ascribe it to chance?' If it is, we agree to
reject our null hypothesis in favour of some alternative hypothesis.
To complete the framework of our solution we must further decide
what numerical value to prescribe to the indefinite phrase 'so small';
following common practice, we shall define ρ = -2й0- as a small probability
(the so-called '5% significance level'); in other cases it might be more
reasonable to take ρ = -j-^ (' 1% significance level').
Returning to the given example, our null hypothesis, H0, is given by
Ho* = h
we may take as our alternative hypothesis
Η,-.ρ + i.
319

DISCRETE PROBABILITY DISTRIBUTIONS [16
Our question now takes the form ' What is the probability of obtaining
a result as bad as, or worse than, nine heads with an unbiased coin?' Since
we are comparing the hypothesis H0:p — \ with the alternative hypothesis
Η-ι'.ρ Φ i, we are simply testing for absence of bias and thus we have no
reason to distinguish between an apparently abnormal number of heads
or tails; it follows that a result 'as bad as, or worse than, nine heads' is to
be interpreted as the result 'nine or ten heads or tails'.
We adopt as our probability model for the situation the binomial
distribution. Thus we have
Pr (9 or 10 heads or taih/H^ = 2[φ10 + 10(i)9(i)]
Since this is less than yg·, we infer that the result is significant at the 5 %
level: we have reason to believe that the coin is biased.
Sometimes a test of a statistical hypothesis is constructed before the
observational evidence is obtained. For instance, in our last example, we
might have compared the null hypothesis:
H0:p = i
with the alternative hypothesis:
Η,-.ρ Φ i
and so decided what results we should require before accepting or
rejecting #0.
In Example 7, our null hypothesis took the form of a statement about the
probability of an event; more generally, a null hypothesis is a statement
about the probability distribution of a population. We conclude this chapter
with a further example, in which our null hypothesis is a statement about
the mean of a random variable.
Example 8. Batteries are sent out by manufacturers in batches of five
hundred. On average, 2-4 batteries per batch are faulty. How many batteries
would we need to find faulty in a particular batch before we could adduce
evidence that more faulty batteries were appearing than could be ascribed
to pure chancel
Since we are here concerned only with the question of finding more than
the expected number of faulty batteries, we take as our null hypothesis
#0: mean number of batteries per batch = 2-4,
and as our alternative hypothesis
Нг: mean number of batteries per batch > 2-4.
We adopt as a mathematical model of the situation the Poisson
distribution (there is a large number of batteries in every batch and the prob-
320

6] HYPOTHESIS TESTING
ability of any particular battery being faulty is small), with random variable
X = r when there are r faulty articles in a batch.
Thus we have Pr (X = r\ H0) = <r2 4 χ ^^
and we wish to find the least integer η such that
Pr(Z> n\H0) < 0-05.
Equivalently we have to find the least integer η such that
Pr(Z< n\H0) > 0-95.
By direct computation
(2-42 ?-43 2-44\
1+2-4 + ^- + ^- + ^-)
и 0-0907(1 + 2-4 + 2-88 + 2-304 + 1-382)
« 0-90,
Pr (X < 5|#0) = Pr (X < 4|Я0) + Рг (X = 5|#0)
и 0-96.
Thus, η = 5: if we were to find 5 or more faulty batteries in a batch, we
should have reason for rejecting our null hypothesis #0 in favour of our
alternative hypothesis Hv
Exercise 16(c)
1. A machine is believed, on average, to produce 0-1 % of faulty articles.
Estimate, to 2 d.p., the probability of finding a batch of five hundred articles free of
defectives
(i) by using a binomial distribution;
(ii) by using a suitable Poisson distribution.
A batch of five hundred is tested and found to contain two faulty articles.
Comment.
2. A large batch of manufactured articles is accepted if either of the following
conditions is satisfied:
(i) a random sample of 10 articles contains no defective article;
(ii) a random sample of 10 articles contains one defective article and a second
random sample of ten is then drawn which contains no defective articles.
Otherwise the batch is rejected.
If, in fact, 5 % of the articles in a batch to be examined are defective, find the
chance of the batch being accepted [(0-95)10 = 0-5987]. (O & C)
3. From a batch of manufactured articles a sample of ten is taken and each
article is examined. If two or more articles are found to be defective the batch is
321

DISCRETE PROBABILITY DISTRIBUTIONS [16
rejected; otherwise it is accepted. Show that, if ρ is the proportion defective in a
batch and Ρ its chance of being accepted,
Ρ = (1-/»)·(1+9/ή.
Find an expression for Ρ if it is now decided to modify the scheme so that when
one defective is found in the sample a second sample of ten is taken and the batch
rejected if this second sample contains any defectives.
In the second case, what will be the average number sampled per batch over a
large number of batches when ρ = 0-05 ? (M.E.I.)
4. In a certain inspection scheme a sample of fifty items is selected at random
from a very large batch and the number of defectives is recorded. If this
number is more than three the batch is rejected; if it is less than three the batch is
accepted. If the number of defectives is exactly three a further sample, this time
of twenty-five items, is taken and the batch is rejected if there is more than one
defective in the second sample but accepted otherwise.
If the proportion of defective items in the batch is 1 % determine the values of
the following probabilities;
(i) that the batch is accepted as a result of inspection of the first sample;
(ii) that a further sample has to be taken and the batch is accepted as a result
of inspection of that sample;
(iii) that the batch is rejected.
Take (0-99)48 as 0-6173. (M.E.I.)
5. State the formula for the probability that a variable following a Poisson
distribution of mean m takes the value r. Prove that the variance of r is m.
Past experience has shown that the number of defective items produced in a
shift by a certain machine is a Poisson variable of mean 4. A new employee in his
first shift produced six defectives. Is this clear evidence that he is operating the
machine inefficiently? (Cambridge)
6. The probability P(r) that there will be r damaged tomatoes in a crate can be
taken as _„ ,
PM = ^\
where m is the expectation of r. Over a large number of crates the value of m
has been found to be 10. In the first crate from a new supplier the value of r
was 4. Test whether this is significant evidence that the value of m for this
supplier is less than 10; explain carefully the logic of your argument.
(Cambridge)
7. A die is thrown six times. If a score of six is made on three of these occasions,
have you any reason for believing that the die is biased?
8. A cubical die with faces marked 1 to 6 is thrown η times. Show that on the
hypothesis that the die is unbiased the chance that the face marked 4 will appear
uppermost not more than once is p, where
322

6] HYPOTHESIS TESTING
If η = 40 and the face marked 4 comes uppermost exactly once, test whether
the hypothesis that the die is unbiased is contradicted
(я) at the 1 % significance level;
(b) at the 0-1 % significance level. (Cambridge)
9. A bag contains ten balls, each of which is either black or white, but otherwise
the balls are indistinguishable one from another. Three balls are drawn without
replacement, and all are found to be white. Test the hypothesis
H0: there is an equal number of black and white balls in the bag.
The three balls are now replaced and three more are drawn; these are found to
be two white and one black. Test the hypothesis Щ again.
10. A sampling inspection scheme is operated by taking a random sample of size
10 from each large batch of a product. The batch is rejected if more than one
defective is found and otherwise the batch is accepted. Plot the operating
characteristic of the given plan.
Explain the applications of operating characteristics when choosing a suitable
plan. (M.E.I.)
[You are advised to use a hand calculating machine for the first part, if you
have one available.]
11. A double sampling inspection scheme is devised as follows: a sample of size
15 is drawn from a batch of articles; if the sample contains none or one faulty
article the batch is accepted, if more than three, it is rejected. If it contains two
or three faulty articles a second sample is taken; the batch is now rejected if this
second sample contains more than one faulty article. If the batch contains
100/7 % faulty articles, find
(i) the probability that the batch will be accepted;
(ii) the value of ρ which gives the largest expected sample size;
(iii) the size of this largest expected sample.
12. An assembled instrument contains two critical components A and B, Sample
tests show that we may expect one in ten of A and one in eighteen of В to be
defective. Estimate and compare the costs of the following inspection plans,
per hundred fully tested instruments:
(i) to test every component before assembly at a cost of 2p for each A tested
and 3p for each В tested; or
(ii) to test every instrument after assembly, if this test adds nothing to the cost
but making good a defective instrument costs on average 24p. (M.E.I.)
Miscellaneous Exercise 16
1. A pack of cards is cut and the suit of the exposed card noted; the pack is then
well shuffled and the process repeated. Find
(i) the probability that a spade will appear for the first time on the fourth cut;
(ii) the average number of cuts required before the first spade appears;
(iii) the average number of cuts required to expose cards of all four suits.
2. Show that the probability of r successes in η independent trials is the coefficient
of tr in the expansion of (q+pt)n, where ρ is the chance of success in a single
trial and q = 1 —p. Prove that the mean number of successes is np.
323

DISCRETE PROBABILITY DISTRIBUTIONS [16
Samples, each of eight articles, are taken at random from a large consignment
in which 20 % of the articles are defective. Find the most likely number of
defective articles in a single sample and the chance of obtaining precisely this
number.
If a hundred samples of eight are to be examined, calculate the number of
samples in which you expect to find three or more defective articles. (O & C)
3. Two coins are identical in appearance, but one is unbiased while the other
gives, on the average, heads three times as often as it does tails. One of the coins
is taken at random and tossed four times. If two heads and two tails appear,
what is the probability that it is the unbiased coin?
4. Explain briefly what is meant by a Poisson distribution of rare events and its
relation to the binomial distribution. Prove that the mean of the distribution is
equal to its variance.
A shopkeeper's sales of washing machines are four per month on the average.
Assuming that the monthly sales fit a Poisson distribution, find to what number
he should make up his stock at the beginning of each month so that his chance
of running out of machines during the month will be less than 4 %. (O & C)
5. A machine depending for its energy upon four complexes of solar cells, each
complex functioning independently of the others, will work provided one of the
four complexes is working. Each complex has probability ρ of failing. The
machine is redesigned to have six complexes and will function if two of the six
complexes are working. Is the new design an improvement on the old?
6. A and В play N games, each of which must result in a definite win to one or
other player, ,4's chance of success in any one game is p. For his nh win, A
receives from В £r, for his ith loss, he gives В £s. Find A's expected gain.
7. A coin is tossed repeatedly an even number, In, times. Show that, whatever
the bias (provided Pr (Η) Φ 0 or 1) the probability of obtaining the same
number of heads as tails decreases as η increases.
8. In lawn tennis a set is won by the first player to win six games, except that
if the score reaches 5-5 the set is won by the first to lead by two games. Two
players have chances respectively ρ and q of winning in any game (p+q = 1);
games may be treated as independent. Find the chance that a set lasts exactly
2n + 2 games (n > 5). (Cambridge)
9. The number of eggs laid by an insect has Poisson distribution with mean μ.
If the probability that an individual egg survives is p, show that the number
of eggs surviving has Poisson distribution, and determine its mean value. (You
may assume that the survival of an egg is independent of the fate of the other
eggs laid.)
10. An experiment has probability^ of success. In л independent trials, pn is the
probability of an even number of successes (p„ = 1). Prove that
Pn-(l-2p)Pn_1=p.
If ДО = Σ^ηίη, prove that
/(0 = [1 - (1 -P) 0/{(l - 0 [1 - (1 - 2p) t]}.
Deduce that pn = Ш + (1 - 2p)n]-
324

61 MISCELLANEOUS EXERCISE 16
11. Two machines, A and B, produce large numbers of articles, with 10 % of
those from A and 20 % of those from В defective. Machine В produces 50 % more
articles than machine A. A batch of ten taken from one of the machines contains
two defective articles. What is the probability it came from A ? (Give to 1 d.p.)
12. In a simplified probability model of the service in a barber's shop, it is
supposed that all haircuts take exactly six minutes and that a fresh batch of
customers arrives at six-minute intervals. The number of customers in a batch is
described by a Poisson probability function, the mean number being three. Any
customer who cannot be served instantly goes away and has his hair cut elsewhere.
The shop is open for forty hours a week. Calculate the theoretical frequencies
with which batches of 0,1, 2, 3, 4, 5 and more than 5 customers will arrive.
The proprietor reckons that it costs him £25 a week to staff and maintain each
chair in his shop, and he charges 25p for each haircut. Calculate his expected
weekly profit if he has
(i) three; (ii) four; (iii) five chairs. (SMP)
13. A sample of η coins is drawn at random from a large collection in which a
fraction ρ are pennies. What is the probability that just r of the η coins are
pennies ?
If the probability that a penny is a Queen Elizabeth one is q, what is the
probability that there are exactly s Queen Elizabeth pennies among the r pennies of this
sample?
Write down the probability that a sample of η coins will contain s+k Юр
pieces, only s of which are Queen Elizabeth ones, and calculate the sum of these
probabilities for all possible values of k. (C.S.)
14. A bag contains a large number of red, white and blue dice in equal numbers.
If η are drawn at random, show that the probability Pin, r) of drawing exactly
r red dice is equal to the term containing ft)r (i)n~T in the expansion of (} + §)".
If r dice are thrown, find the probability Q{r, s) of throwing exactly s sixes.
If η dice are drawn from the bag and the red dice drawn are thrown, show
that the probability of throwing exactly s sixes is
nifp(n,s+t)Q(s+t,s)
and prove that this is equal to a term in a binomial expansion. Explain why a
binomial distribution is obtained. (C.S.)
15. A random sample of size 10 is taken from a batch of a thousand components
and one defective is found.
(i) What is the largest possible percentage of defectives in the batch ?
(ii) ρ is the smallest proportion of defectives in the batch such that the
probability of obtaining not more than one defective is not less than 95 %. Find
the value of ρ to two decimal places by trial of suitable values of p.
(iii) Find the most likely percentage of defectives in the batch (i.e. such that
the probability of obtaining one defective in the sample is a maximum).
(M.E.I.)
325

DISCRETE PROBABILITY DISTRIBUTIONS [16
16. In sampling inspections of batches of manufactured articles a random sample
of twenty is taken; if none or one defective occurs in the sample the batch is
accepted, if three or more defectives occur the batch is rejected. If two defectives
occur a second random sample of twenty is taken and if in the combined sample
of forty less than four defectives occur the batch is accepted; otherwise it is
rejected.
Assuming that the proportion/? of defectives in a batch is sufficiently small for
the Poisson distribution to apply, show that the chance Ρ of the batch being
accepted is given by
Ρ = е-20" (1 + 20p) (1 + 200p2 е-20*).
(i) Find the chance of a batch which is 2 % defective being rejected.
(ii) Find the chances of batches which are respectively 5 and 10 % defective
being accepted.
(iii) Sketch the operating characteristic curve for the inspection scheme.
Determine the average sample size per batch if ρ = 0-05. (О & С)
17. The number of a certain type of organism in a given volume of water has
Poisson distribution with mean 2. A test, applied to indicate absence of the
organism has a 90 % chance of success if the organism is in fact absent, but also
indicates absence in 10 % of those cases in which they are present. If the test is
applied and indicates absence, what is the probability that the water is free of
the organism?
18. The probability that a source emits r α-particles in a given time is
proportional to μτ\ϊ\, where μ is a constant. Obtain the constant of proportionality
and calculate the mean of r.
The probability of the same source emitting s ,5-particles in the same time is
proportional to v'/sl Assuming that the two types of particle are emitted
independently, write down the probability that r α-particles and s /3-particles are
given off in this time and show that the probability that a total of η particles of
the two types are emitted is
β-(μ+") Qi + v)n/nl (Cambridge)
19. The probability that any randomly chosen rat from the colony used in a
certain laboratory will show a certain undesirable characteristic is p; it is known
that the value of ρ for the colony is either 0-4 or 0-6 and it is desired to set up a
sampling scheme to decide which value is correct. The procedure is to take rats
one at a time at random and test for the presence of the characteristic; after η
rats have been tested, let r be the number with the characteristic.
Show how to determine limits Ln and U„ such that for known η
Pr (/· < Ln\p = 0-6) к a and Pr (/· > Un\p = 0-4) « β
where α and β are specified.
Use the table of partial binomial sums below to determine Ln and Un for
η = 5,7, 9, 11, 13, 15, 17 where α and β are less than 0-1 and as close to it as
possible. Mark these values on a plot of π — r against r, and explain how such
a diagram could be used in the decision procedure.
326

MISCELLANEOUS EXERCISE 16
J£
3
4
5
6
7
8
9
10
11
5
317
87
10
7
580
290
96
19
2
Partial biiu
9
768
517
267
99
25
4
0
,,
881
704
467
247
99
29
6
1
0
mial sums
13
942
831
647
426
229
98
32
8
1
15
973
909
783
597
390
213
95
34
9
17
988
954
874
736
552
359
199
92
35
Table of ΙΟ3 χ Σ I I (0-4)s (0-6)"-. (Cambridge)
20. At a certain seed testing station it is found that a proportion 0-4 of a certain
type of seed is fertile. By accident the remaining stock of this seed (whose total
amount is very large) is completely mixed with an equal quantity of a second
type of seed which is believed to be completely infertile. If this latter assumption
is true, what is the probability that a seed taken at random from the mixture
will germinate?
Each of seven pots is planted with two seeds taken at random from the mixture.
Six pots eventually produce one or more plants each. Is this result consistent, at
the 5 % level of significance, with the infertility postulate for the second type of
seed? (Cambridge)
21. Derive the probability p(r) of obtaining exactly r successes in η independent
trials, the probability of a success being ρ at each trial. Determine the mean and
variance of this distribution.
Find an expression for [p(r)]/[p(r— 1)] and hence or otherwise find the
conditions that rm must satisfy if p(rm) is such that no other value of p(r) is greater
than/KrJ.
Show that the mode of a binomial distribution differs from the mean by less
than unity. (M.E.I.)

Revision exercise В
1. Find the equation of the line Lt through the origin with gradient \, and
also the equation of the line L2 perpendicular to Lx and passing through the point
(4, -*).
2. In a large university, one-third of the men and one-quarter of the women
read science. If four men are selected at random, what is the probability that not
more than one reads science? If two men and two women are selected at random
what is the probability that not more than one of the four reads science?
What is the probability that, in a mixed group, one man reads science and the
other man and the two women read something else?
3. A square lamina ABCD of side a is held with the corner A on a horizontal
plane. The feet of the perpendiculars from B, C, D on to the plane are В', С", D'
and the angles B'AB, D'AD are <x, β. The angle B'AD' is Θ. Prove that
(i) cos0 = -tan α tan β;
(ii) the area of the triangle B'AD' is %a-(cos2 α cos2 β- sin2 α sin2 /?)*;
(iii) the inclination of the lamina to the horizontal can be expressed in the
ОГт arccos [cos (α+β) cos (α-β)]*. (О & С)
4. If α Φ i(4n+ 1)я, find x if sin (x+oc) = cos (x-oc).
What can you say about χ if α = \(4n + 1)я?
5. Find the sum to infinity, S, of the geometric series
1+7ϊδ+ϊδ+ιονιο+ "■
If SN denotes the sum of the first N terms of this series, find the least value of
N such that S and SN are the same correct to 3 d.p.
6. A tennis match usually consists of either three or five sets, and ends when one
side has won a majority of the sets. If the probability of a side winning a set is p,
and if the result of each set is independent of any previous results, show that the
probability of a match going its full legth is 2pq in the case of a three-set match
and 6/?V in the case of a five-set match (q = 1 —p).
Show that the first probability is always greater than the second, if;? Φ 0 or 1.
7. The function/: R -> R+ is defined by
Λ*)= |*+1| + |2χ-1|+|*-2|.
Sketch the graph off and determine the least value of A*).
8. Call a matrix of the form lx Л
328

REVISION EXERCISE В
which is symmetrical about both diagonals 'super-symmetrical'. If A, B, ... are
super-symmetrical 2x2 matrices and m, n, ... are non-negative integers, prove
that A™B" ... is super-symmetrical.
Does this result hold for 3 χ 3 matrices?
9. Factorize into linear factors the expression 2x2 + 5xy — 3y2—3x+5y—2 and
describe geometrically the set of points {(*, y): 2x2 + 5xy - Зу2 - 3x + 5y - 2 = 0}.
Describe geometrically the sets of points
(i) {(x,y): x*-2y* = 0}; (ii) {(*, у) : x2 + 2y2 = О}.
10. i,j, к are unequal positive integers and x, y, ζ are real numbers. Prove that
x(j-k)+y(k-i) + z(i-j) = 0ox,y,z are respectively the zth, y'th, kth terms of
an arithmetic sequence.
11. Prove by mathematical induction, or otherwise, that
л.13+(л-1).23 + (л-2).33+... + 1.и3 = -knin+V, (л+2) (Зи2 + 6и+1).
(Oxford)
12. Find the inverse of the matrix
(J \ I
Hence solve the simultaneous equations
Г 5x + 3y + lz = a,
\ 3x+4y + 6z = b;
[-x + 2y+ z = c;
(i) when a = -1, b = 2, с = -3; (ii) when a = b = 1, с = 2.
13. What are the first three terms in the expansion of (l-2x)24 in ascending
powers of x?
Find the value of 0-9824, correct to 4 d.p.
14. Show that, whatever value is chosen for k, the equation
(3 + 5/c) x + (2-7/c) y + (5-4k) = 0
represents a straight line through a fixed point A.
Find the particular line of the system
(i) which passes through the origin;
(ii) which is parallel to the у axis.
15. The polynomial P(x) leaves a remainder of x+1 on division by x2-2 and
the polynomial Q(x) leaves a remainder of 2x-f-3 on division by x2-2. New
polynomials R(x) = P(x)+Q(x) and Six) = P(x) Q(x) are defined. Find the
remainder
(i) when R(x) is divided by x2-2;
(ii) when S(x) is divided by x2-2.
329

REVISION EXERCISE В
16. If A is the point (a cos a, b sin a) and В is the point (a cos /δ, 6 sin β), show
that the equation of the line AB is
What is the connection between α and β if AB passes through the origin?
17. Prove that the number of spheres that can be drawn to pass through three
given points and touch a given plane is 2 or 1 or 0, explaining how the three cases
arise.
18. О A, OB, ОС are three concurrent straight lines lying in one plane. Ρ is a
point outside the plane such that the angles POA, РОВ, РОС are equal. Prove
that PO is perpendicular to the plane.
19. Find the matrix of the linear transformation which reflects all points in the
plane x+y+z = 0.
20. ABCDA'B'C'D' is a parallelepiped, with opposite faces ABCD, A'B'C'D'
and edges AA', etc., AB = BB' = B'C, and the mid-points of BB', A'D', A'B'
are respectively F, G, H. Find the ratio in which the plane AD'Η divides the line
FG.
21. Two boxes each contain one hundred cards, numbered 1 to 100. A card is
taken from each box, the numbers on the two cards noted, and they are then
returned to their respective boxes. If the process is repeated fifty times, find the
probability that at no stage a pair of cards bearing the same number will be
drawn. How many draws are needed for the probability to be > £?
22. Prove that lg 15 is irrational. (lg 15 means log1015.) Is it true that lg χ is
irrational <s> χ is not a power of 10?
23. Evaluate the following determinants:
I b-c c-a a-b I I (x-p)2 iy-p)2 (,ζ-ρ)2 I
(i) c-a a-b b-c \; (ii) ι (x-q)2 (y-q)2 (z-q)2 .
| a-b b-c c-a \ \ (x-r)2 (y-r)2 (z-rf \
24. Define the projection of a vector in a given direction. ABCD is a
quadrilateral in which LA = Ζ С = 90°. The feet of the perpendiculars from B, D
to AC are X, Г respectively. Prove that AX = CY.
25. In a large crate of oranges, 100p% are bad. A random sample of ten oranges
produces two bad ones. On the assumption that this is the most likely number
of bad oranges to find in the sample, what can you say about the possible values
ofp?
26. ABCD is a square lying in the plane 2x—y — 2z = 5. If A has coordinates
(1, 1, -2) and С has coordinates (5, 5, 0), find the coordinates of Л and D.
27. The function, /, defined by
f(x) = sin χ + cos χ
has for its domain the set of real numbers {x e R: 0 < χ < π}. Find the range
of/

REVISION EXERCISE В
28. Factorize the expression ax2 + (ac + b—c) x+c(b — c) into two linear factors.
Hence factorize the expressions
(i) a + b-c + bc + ca-c2;
00 a-b + c + bc-ca-c2;
On) asb2 + a2bc + ab2-abc+bc-c2.
29. Describe geometrically the transformation of three-dimensional space
represented by premultiplying the position vector of the point (x, y, z) by the matrix
i 0 -i\
0 1 0 .
-i о i)
Which points are left unaltered by this transformation ? (O.S.)
30. When A and В play chess the chance of either winning a game is always
i and the chance of the game being drawn is always \. Find the chance of A
winning at least three games out of five.
If A and В play a match to be decided as soon as either has won two games,
not necessarily consecutive games, find the chance of the match being finished
in ten games or less. (J.M.B)
31. Solve the equations
0) 2sin2x+sin2x = 0;
00 sin* = cosQ-я-*);
giving all solutions in the interval 0 =g χ «S 2я in each case.
32. Solve the inequality x2-\ < 1
x2-4 " 5
and illustrate your solutions by means of a sketch of the curve
33. The base AB of a triangle ABC is fixed and К is a fixed point on AB. The
vertex С of the triangle moves so that the perpendicular distances of К from С А
and CB are always equal in length. Prove that, in general, the locus of С is a
circle through K.
What is the exceptional case? (Cambridge)
34. A pitcher is taken daily to a well and back. Its chance of being broken on
an outward trip is рг. If it survives the outward journey it has a further chancer
of being broken on the return. Show that on any day its change of being broken
is P, where
Ρ = Pi+Pi-PiP*·
Show that the chance that the pitcher will survive for at least η days is Qn,
where Q = (1 -рг) (1 -p^), and find the chance it will survive (n-1) days but
be broken on the nth day.
If two such pitchers are each taken independently to the well and back daily,
prove that the chance that they survive for exactly the same number of days is
Pli\ + Q). (CS.)
331

REVISION EXERCISE В
35. Find the image by reflection of the point (4, 3,1) in the plane
3x+2y+3z+l = 0.
36. A and В are two towers, В being four miles due east of A. The true bearings
of a flagpole С from A and В are a° east of north and a° west of north
respectively; the true bearings of a second flagpole D from A and В are (a+/3)° east
of north and (μ-β)° west of north respectively. Draw a sketch-plane to indicate
the positions of А, В, С, D.
Assuming that А, В, С and D are on level ground, prove t hat D is 4 sin2 β cosec 2a
miles south of С and 2 sin 2β cosec 2a miles east of C. (O & C)
37. If axi + 3xs + bx2 + cx + 2 is exactly divisible by (χ+ 2) (x2-l), find the
values of a, b, с
38. Sketch the graph of the function defined by
f(x) = x+\x\ (xeR).
In a separate diagram, shade in the set of points satisfying simultaneously the
three inequalities
y>x+\x\, y>\x\-x, x-y+l>0.
What is the maximum value of the expression
E= x+y+l
if χ and у are subject to the three given inequalities?
39. Prove that, if А, В are two non-singular 3x3 matrices, then AB is a non-
singular matrix and (AB)-1 = B_1A_1.
If / 6 3 5\ /1 0 0\ /10 0\
A= 2 5 2 , E,= 0 0 1 , Et= 0 1 0
\-3 4 -2/ \0 10/ \l 1 1/
find Α-1, ΕΓ',Ε^1.
What are the inverses of the following matrices:
/ 6 3 5\
(ii) -3 4 -2 ?
\ 5 12 5/
40. Using the formula for tan (A + B) in terms of tan A and tan B, show that,
if А, В, С are the angles of a triangle, then
tan^ + tan Л + tan С = tan Л tan .В tan C.
Calculate, in degrees and minutes, the angles of a triangle ABC if
tan A: tan В: tan С = 1:2: - 6. (Cambridge)
41. Find the equation of the plane containing the origin, the point (1, 1, 1)
and the point (3,1, — 1). Find also the direction ratios of the line of intersection
of this plane with the plane x+y-z-4 = 0.
332

REVISION EXERCISE В
42. In a game of tennis one point is scored either by A or by his opponent B.
The winner of the game is the player who first scores four points, unless each
player has won three points, when 'deuce' is called and play proceeds until one
player is two points ahead of the other and so wins. If A's chance of winning any
point is | and B's chance is i calculate the chance of
(i) A winning the game without 'deuce' being called;
(ii) a similar win by B;
(iii) 'deuce' being called.
If' deuce' is called, prove that A's subsequent chance of winning the game is f.
Deduce that A's chance of winning the game is nearly six times that of B.
(Cambridge)
43. Prove that " tn\
ДАЛ-2·
If A is a square matrix such that A2 = A, express (A + I)" in the form
α,Α+βΙ.
44. Use the method of mathematical induction to prove that
(fc+l)l (fc+2)l (fc+w-l)l = (k+n)\
1! 2! (л-l)! (k+l) (л-1)Г
45. Show that the equations
Uk + 2)x + 2y + 3z = 7,
| x + (k+2)y + z = 0,
{5x+2y + (6-k)z = 13
have a unique solution for all but three values of k.
Discuss the solution of the equations in the three exceptional cases.
46. Prove the formula cos ЗА = 4 cos3 А-Ъ cos A. (Formulae for cos 2A
and sin 2A may be assumed.)
Substitute χ = i+cos θ in the equation 8x3- 12x2+ 1 = 0 and, with the aid
of the above formula, solve the resulting equation in Θ, giving values between 0°
and 180°.
Hence find the three roots of the cubic equation in x, correct to two decimal
places. (Cambridge)
47. A1A2... An is a regular polygon of side 1 unit. Two (distinct) vertices are
selected at random. Taking as random variable X, the shorter distance measured
along the perimeter between the two points, find S(X)
(i) when η is odd; (ii) when η is even.
48. When л is a positive integer the coefficient of an~rbr in the binomial
expansion of (a+ b)n is denoted by I I. Write down an expression for I J and prove
that . . ...
"- for r= 2.3, ...,*.
By comparing the binomial expansions, or otherwise, prove that, when
> °'b> °' (а+ЬУ- α" < nb{a+ Ь)"-1. (О & С)

REVISION EXERICSE В
49. Factorize the determinants
I x a b с |
(O&C)
I я2 (я+l)2 (я + 2)2 Ι ι χ a b с
(i) b2 φ + iy ф + 2У , (ii) \a χ с b
| c2 (c+1)2 (c+2)2 | \b с χ a
\ с b a x
50. Find the image by reflection of the plane χ = 0 in the plane x+y+z = 4.
51. Using the same axes draw the graphs of у = sin x° and у = 1 — --- for
values of χ from 0 to 360. Estimate from your diagram the roots of the equation
The equation sin x° = mx+c, where m and с are constants, has roots χ = 60,
χ = 330. With the aid of your diagram estimate
(i) the other root of the equation;
(ii) the values of m and с (Cambridge)
52. Prove that, when any polynomial Дх) is divided by (x-d), the remainder
i*№
The remainder, when/(x) is divided by (x— a) {x—b) is written in the form
A(x-a) + B(x-b); prove that
b—a a—b
When αχϊ+βχ^ + γχ + δ is divided by x2— 1, the remainder is Kx where К is
a constant; when it is divided by x2 — 4, the remainder is K. Prove that
a=-/?=-iy=S= -i*. (O&C)
53. Let A(0) and B(0) denote the matrices
/cos0 sin 0\ /cos0 -sin0\
\sin# -cos0/ \sin# cos Θ J
respectively.
(i) If
(ί)-»(ί)
show that the point (A", 1") is the mirror image of {X, Y) in the line у = xtan£0.
(ii)PrOVethat AWA№) = B(0),
where 0 is some angle (to be determined), and hence, or otherwise, explain the
relation between the points (X\ V) and (X, Y) when
(ί)-»(ί)-
(iii) Prove that A(0,) A(02) A(03) = A(03) A(0a) A(0,)
and interpret this result geometrically.
334

REVISION EXERCISE В
54. One side of a hill has the form of an inclined plane, the line of greatest
slope, AB, being due north. A man starts from A and walks to B, the distance
being dx km. At В he walks for a distance d2 km in a straight line, inclined at
an acute angle θ to AB, reaching a point C. The vertical heights of В and С
above the level of A are hx km and /z2 km respectively. Prove that
cos a = ———.
кгаг
The bearing of С from A is φ east or west of north. Prove that
хапф = Ш^ь- (0&0
55. Show that, for all but two specific values of k, the equations
Ix-ky + 2z = 3,
3x+ y-2z= 1,
kx-9y+2z = 7,
represent three planes with a single common point.
Interpret geometrically the two special cases.
56. In how many different ways may a red balls, b white balls and с black balls
(indistinguishable apart from colour) be arranged in a straight line?
Find the coefficient of xlyizi in the expansion of (x+y+2z)12.
57. Solve the inequality I 5x+l I
\х*-х-6\ <
and illustrate your solution by drawing a sketch of the curve
5x+l
58. Three spheres of radii 1, 2, 3 units, touch each other externally. Prove
that a plane which touches each sphere makes an angle arcsin (J VI3) with the
plane containing the centres of the spheres. (O & C)
59. Find all values of x, у and ζ which satisfy the equations
where v-2w = a,
-u +3w = b,
2u-3v = с (C.S.)
60. If I deal cards one by one from a well-shuffled pack, find the expectation of
the number of cards I shall have to deal
(i) to secure a spade; (ii) to secure all the spades.
335

REVISION EXERCISE В
61. ABCDA'B'C'D' is a rectangular box with faces ABCD, A'B'C'D' and edges
AA', etc. AA'D'D is a square of side 1 unit, AB = 2 units and Μ is the
midpoint of CD'. By setting up a suitable coordinate system, or otherwise, find the
acute angle between the planes BMA', AB'D', correct to the nearest degree.
62. Prove that I {a-xf {a-yf {a-zf {a-wf I
Ub-xf ф-уУ (b-ζγ (b-wf\ 0
{c-xf (c-y)2 {c-zf (c-wY \ '
I (d-xy (d-уУ (d-zy (d-wy |
63. If θ denotes the angle between two intersecting lines with direction cosines
/, m, η and /', m', ή respectively, show that
cos# = U'+mm' + nn'.
Prove that the lines χ = y+3 — z+\
and * = KH-4) = H*+3)
intersect. If Ρ is the point of intersection, find the equation of the line through
Ρ perpendicular to both given lines. (J.M.B.)
64. Two jars, one white and the other black, contain a + b balls each; in the white
jar there are a white and b black balls and in the black jar b white and a black.
Single draws are made as follows: at the /-th draw a ball is drawn from the white
or black jar according as the (/·- l)th ball drawn was white or black, the colour
of the ball noted and then returned. If pn is the probability that the nth draw is
white, show that , ,. , ,
{.a + b)pn=b + (a-b)pn_1.
By means of the substitution pn = i+qn,
or otherwise, determine pn, when the jar from which the first draw is made is
(i) chosen at random, (ii) white. What is the probability in the two cases that
both jars have been drawn after three draws? (O.S.)
65. The number η whose digits in the scale of 10 are a, b, c, d in that order is the
same as the number whose digits in the scale of 9 are d, b, c, a in that order; in
other words, we have
103a + 1026 + 10c + i/= 94+92b + 9c+a
and the digits a, b, c, d all lie between 0 and 8 inclusive. Prove that there is
exactly one number л (Ф 0) with this property, and find n. (C.S.)
66. Prove that, if /A 1 0\
иА"-1 Ми-1)А"-2\
А" лА"-1
where л is a positive integer.
If A0 is defined to be I, is the result still true for any integer л?

REVISION EXERCISE В
67. The points Pt and P2 on the surface of the earth (assumed to be a sphere of
radius r and centre O) are at the same (northern) latitude A, and their respective
longitudes are Lx and L2. Show that the length С of the route from Рг to P2 via a
circle centre О (called a great circle) satisfies the equation
cos I —J = sin2 A + cos2 A cos (I^—LJ.
If the separation in longitude is 90° show that either
_ „ . /cos A\ '™° ^
С = 2r arcsin I —— I c
68. If A is a real number, show that the equations
λχ+ y+ ζ = 2,
x+Xy + ζ =-l,
x+ y + λζ =-1,
have a unique solution (x, y, z) for all but two exceptional values of λ.
Show that, for one of the exceptional values of λ, the equations have no
solution and that, for the other exceptional value of λ they have infinitely
many solutions. (O.S.)
69. Determine θ so that the line
lx+ my+ η = θ (Ι'χ+ т'у+rf)
is perpendicular to the line λx+μy + v = 0.
Prove that the perpendiculars from the vertices Au A2, As of the triangle formed
by the lines , . ,. , „ „.
их+пцу+щ = 0 (г = 1,2,3)
on to the sides B2Bz, BZBU ВгВг respectively, of the triangle formed by the lines
λιΧ+μ{ν+νι= 0 (г = 1,2,3)
are concurrent if
(/A + wj/ia) (/2λ3 + »г2/*з) (/3λ! + wa/ti)
= (/jAa + Wl/i3) (/.Л + пцрд (/3A2 + Wa/ta).
Deduce that, in this case, the perpendiculars from the vertices of Вг Вг Bs
on to the respective sides of Αλ Α2 A3 are also concurrent. (C.S.)
70. A hill 0-3 km high is in the shape of a spherical cap, with a horizontal circular
rim, the radius of the sphere being 0-6 km. A man walks up from a point of the
rim to the peak at a steady speed of 1-6 km per hour but never ascending at a
gradient of more than arcsin (4). Find the minimum time the walk can take him,
and sketch roughly a possible minimum path (as seen from above); does it
necessarily have no sharp corners ? (C.S.)
71. In a population of identical elements, an element produces no or one
progeny with equal probabilities i. If the population consists of 0, 1, 2, ...

REVISION EXERCISE В
elements with probabilitiesp0,puP2, ■ ■ · show that the probability P, of r progeny
is given by the coefficient of tr in G(i), where
When Pn = -I*
find the probability that the number of progeny is greater than or equal to N.
By considering G(t) and its derivatives, or otherwise, find r, the expected value
of /·, the number of progeny, and the expected value of (/—F)2. (O.S.)
72. А, В and С are the three angles of a triangle. Show that
I sin Л sin Л sin С I
cos A cos В cos С = 0. (C.S.)
| sin3 A sin3 В sin3 С |
73. If л is a positive integer and ρ a prime number, а„(и) denotes the greatest
integer к such that pk divides л. If η is written in the form
N
η = Еад' (0 < ar <p-\),
show that η— Σ ar
^(пП=~~-. (CS.)
74. .4.BCD is a parallelogram and Ε a point not necessarily in the plane ABCD.
Show that a2 + c2+g2 + h2 = 62 + i/2 + e2, these being the lengths shown in the
figure, and find a relation involving only a, b, c, e, f. (You may use vector
geometry.) (C.S.)
338

Bibliography
GENERAL
Allen, R. G. D. Basic Mathematics (Macmillan). A broad survey of mathematical
ideas—useful as a reference book.
Backhouse, J. K. and Houldsworth, S. P. T. Pure Mathematics (Longmans,
2 volumes). Written some years ago, with traditional 'A' levels in mind—
consequently rather old-fashioned, but workmanlike in its approach and
contains a great number of examples for the student.
Kemeny, J. G., Murkill, H., Snell, J. L. and Thompson, G. L. Finite
Mathematical Structures (Prentice-Hall). An introductory survey of some modern
topics, containing a much more thorough account of the ideas sketched in
Ch. 9.1 and also including chapters on linear algebra and probability.
S.M.P. Advanced Mathematics. (Cambridge, 4 volumes). Specifically written
for the new S.M.P. 'A' level, these books are full of new ideas, presented in
a lively and stimulating manner.
ALGEBRA
Beckenbach, E. F. and Bellman, R. An Introduction to Inequalities (Random
House.) A readable introduction to the subject.
Birkhoff, G. D. and Maclane, S. A Survey of Modern Algebra (Macmillan,
New York). Really a university text book, but beautifully written and
containing much that can be read with profit by sixth-formers. Gives a much more
thorough account of the ideas outlined somewhat sketchily in Ch. 1.
Cohn, P. M. Linear Equations (Routledge and Kegan Paul). A sound
introduction at low cost.
Durell, С V. and Robson, A. Advanced Algebra (Bell, 3 volumes). Written a
long time ago, but these books wear well and still contain a great deal of
valuable reading; they also contain an excellent selection of problems, many of
which are stimulating and challenging.
Ferrar, W. L. Higher Algebra for Schools (Oxford). A good introductory course
in traditional sixth-form algebra.
Maxwell, E. A. Algebraic Structures and Matrices (Cambridge). Carries the work
beyond 'A' level standard; contains some good exercises.
Neill, H. and Moakes, A. J. Vectors, Matrices and Linear Equations (Oliver and
Boyd). Covers the course of linear algebra required for the M.E.I. Ά' level
examinations. Approaches the subject from a rather different standpoint to
that adopted in this book.
Niven, I. Numbers: Rational and Irrational (Random House). A comprehensive
account of the work of Ch. 1.
339

BIBLIOGRAPHY
CALCULUS
Hardy, G. H. Pure Mathematics (Cambridge). A very famous book. Not easy
reading, but repays careful study. A work for the specialist.
Matthews, G. Calculus (Murray). An excellent modern text.
Maxwell, Ε. Η. Calculus (Cambridge, 4 volumes). A comprehensive text, useful
as a work of reference.
Quadling, D. A. Mathematical Analysis (Oxford). An excellent introduction to
university work: presupposes some acquaintance with the calculus.
Scott, D. B. and Timms, S. R. Mathematical Analysis (Cambridge). A university
text, but contains much good reading for the serious sixth-form student.
Siddons, A. W., Snell, K. S. and Morgan, J. B. A New Calculus (Cambridge,
3 volumes). A standard text-book; volume 3 is a particularly good
introduction to advanced work.
Toeplitz, O. The Calculus, A Genetic Approach (Chicago). A stimulating
approach to calculus by tracing its historical development.
GEOMETRY AND TRIGONOMETRY
Barton, A. Introduction to Coordinate Geometry (London). A good account of
two-dimensional coordinate geometry.
Conn, P. M. Solid Geometry (Routledge and Kegan Paul). See Conn: Linear
Equations.
Forder, H. G. Higher Course Geometry (Cambridge). Full of excellent material—
fascinating for the keen geometer.
Kazarinoff, N. Geometric Inequalities (Random House). Stimulating reading:
contains a good selection of problems, with hints on solutions.
Macbeath, A. M. Elementary Vector Algebra (Oxford). Despite its title, this is
really an introduction to vector geometry. Very clearly written.
Maxwell, E. A. Coordinate Geometry with Vectors and Tensors (Oxford). A
good introduction to three-dimensional coordinate geometry—takes the
subject well beyond 'A' level.
Nobbs, С G. Trigonometry (Oxford). A good introduction to'A' level work.
Tuckey, С A. and Rollett, Α. Ρ Trigonometry (Murray). A sound introductory
text, with many interesting digressions.
PROBABILITY
Feller, W. An Introduction to Probability Theory and Its Applications, Volume 1
(Wiley). An advanced text, but highly recommended for the serious post-
Ά' level student.
Mosteller, F. Fifty Challenging Problems in Probability (Addison-Wesley). Some
good material here for the problem addict.
Mosteller, F., Rourke, R. E. K. and Thomas, G. B. Probability and Statistics
(Addison-Wesley). A very elementary introduction to probability—wordy at
times, but thorough.
Parzen, E. Modern Probability Theory and its Applications (Wiley). A very
thorough introduction, carrying the subject well beyond Ά' level.
340

BIBLIOGRAPHY
Whitworth, W. A. Choice and Chance (Hafner). Published seventy years ago,
the bookwork is distinctly old-fashioned, but it ends with a delightful collection
of problems, some distinctly tricky. Excellent bedside reading for the inveterate
problem-solver; a key is published to prevent insomnia.
BOOKS FOR GENERAL READING,
COLLECTIONS OF PROBLEMS, ETC.
Abramowitz, M. and Stegun, I. A. Handbook of Mathematical Functions (Dover).
Over a thousand pages of mathematical tables (most to sixteen or eighteen
significant figures), formulae and graphs, together with extensive notes.
A marvellous reference book and not at all expensive for what it contains.
Ball, W. W. R. Mathematical Recreations and Essays (Macmillan). First
published in 1892 but it still provides excellent reading material.
Bell, Ε. Τ. Men of Mathematics (Pelican, 2 volumes). A racy and amusing account
of the lives of great mathematicians.
Boyer, С. В. A History of the Calculus (Dover). A readable account of the
subject.
Burkill, J. С and Cundy, Η. Μ. Mathematical Scholarship Problems (Cambridge).
A good collection of problems at university scholarship level.
Cadwell, J. H. Topics in Recreational Mathematics (Cambridge). Digressions on
the underlying mathematics in fifteen different problems: it makes very good
reading.
Courant, R. and Robbins, H. What is Mathematics? (Oxford). A famous book,
which discusses a number of mathematical situations in some depth.
David, F. N. Games, Gods and Gambling (Griffin). An excellent book on the
early history of probability.
Dorrie, H. 100 Great Problems of Elementary Mathematics (Dover). Extended
discussion of 100 problems of considerable historical interest, together with
their solutions—excellent value.
Gardner, M. Mathematical Puzzles and Diversions (Pelican). Entertaining
reading in hexaflexagons, polyominoes, the games of nim and hex, etc.
Lockwood, Ε. Η. A Book of Curves (Cambridge). A delightful collection, with
historical notes and comments on the geometrical properties. A good book for
browsing.
Maxwell, E. A. Fallacies in Mathematics (Cambridge). Will be read with interest
and enjoyment by any mathematician.
Moakes, A. J. Core of Mathematics (Macmillan). An elementary account of
mathematical structure and methods of proof: easy to read but stimulating.
Newman, J. R. World of Mathematics (George Allen and Unwin). An extensive
anthology of mathematical writings—good browsing material.
Polya, G. How to Solve It (Doubleday). An analysis of the art of problem
solving
Polya, G. Mathematics and Plausible Reasoning (Princeton, 2 volumes). The
title is self-explanatory. Compulsory reading for the serious student, while the
more frivolous will be able to skip through with profit.
Salkind, С. Т. The Contest Problem Book (Random House). A collection of
multiple choice problems, not all of which are easy.
Sawyer, W. W. Prelude to Mathematics (Pelican). A book for the layman,
341

BIBLIOGRAPHY
but can be read by the specialist with profit. Strongly recommended to all
sixth-formers.
Smith, D. E. A Source Book in Mathematics (Dover, 2 volumes). Contains
excerpts from the writings of great mathematicians of the past—all within the
understanding of the sixth-former.
Weston, J. D. and Godwin, H. J. Some Exercises in Pure Mathematics
(Cambridge). A fine collection of problems, based on the Welsh Ά' level
(alternative syllabus). Contains detailed solutions.
342

Answers
CHAPTER 1
PAGE
2 Ex. 2. 100101001. Ex. 3. 100, remainder 111.
Exercise 1 (a)
1. (i) 10; (ii) 1110; (iii) 10000; (iv) 101.
2. 100010011, 101012, lie.
3. 8e242. 4. 3e, remainder 19.
5 Ex. 7. Yes, unless a2 = b2 φ 0.
7 Ex. 12. (i) 7/9; (ii) 134/333; (iii) 283/45.
Ex.13, (i) 11-11; (ii) 0-101101; (iii) 100-1001;
(iv) 0-li010001111010111000.
Ex.15, (i) 0-3; (ii) 1-714285.
Exercise 1 (ft)
9 4. (i) 0-3; (ii) 0-Oi.
6. (i) True; (ii) True; (iii) False; (iv) True; (v) True (provided rational
number is not zero); (vi) False.
11 Ex. 17. (i) 2V2; (ii) 3V2; (iii) 3V6; (iv) 5V10; (v) 42V3.
Ex. 18. (i) 11 + 4V6; (ii) 3V6; (iii) 2V3; (iv) 8.
Ex. 19. (i) iV3; GO i(2 + V2); (iii) «2 + V2); (iv) 3 + V6; (ν) 7-4Λ/3.
Exercise 1 (c)
12 1. (i) 5V2; (ii) 11V3; СЮ 20V6; (iv) 8V3; (v) 22V3.
2. (i) 6-2V5; (ii) 14-4V6; (iii) 10+6V3; (iv) 9V3-11V2;
(v) 28-16V3.
3. (i) 2V2; (ii) 3V5; (Hi) 5V3; (iv) Vl3; (v) (3V3)/2.
13 4. (i) V5/5; (ii) V21/3; (iii) 2(V7 + 2)/3; (iv) (13 + 4V3)/11;
(v) V6 + V3-V2-2.
343

ANSWERS
13 5. (i) V5-1; (ϋ) V6-2; (iii) 3V2+2V3; (iv) 3V3-2V5;
(v) V5 + V2-
6. (i) 4; (ii) ±2; (iii) -5.
7. (i) «2+V2-V6); (H) i(VlO+V5-V2-3).
9. (i) 0-44721 (5 d.p.); (ii) 0-27395 (5 d.p.).
10. 0-98560 (5 d.p.). 11. i(3 - ^9 + ^3).
12. (i) (V(*+ D+V(*-1)}/2; (ϋ) (*+2) V(*+ Ό/(*+1);
(iii) (V(2x-a)-V(2x+a)}7(4х2-Д2)/(4х2-я2);
(iv) W(*- D-(*-2) V(*+ D-2(x-l)}/(x2-2x).
14 Ex. 20. (i) {1, 2, 3, 4, 5}; (ii) {1, 4, 9}.
15 Ex. 21. (i) Odd integers; (ii) integers 5= 2; (iii) multiples of 10;
(iv) the integer 6; (v) 0.
16 Ex.26. jxeU: | e uj.
Ex.27. ©И; (ii) 0; (iii) U; (iv) Λ; (ν) Λ; (vi) 0; (vii) 0;
(viii) U.
EX. 28. (i) {a, e); (ii) {a, A, d, e); (iii) {C}.
Ex. 29. (i) {2}; (ii) {2}; (iii){-},2}; (iv) Ш, 2};
(v) {-V2,-i, V2, 2}.
18 Ex. 31. (i) χ < -2; (ii) * < 1; (iii) * < £.
£Ьотсда7(</)
20 2. (i) Λ; (ii) {* e R: 3 < χ < 4}; (iii) {x e R: χ < 4}.
3. (i) {x e Ζ: χ < 0}; (ii) {x e Z+: x/6 e Z+};
(iii) {xeZ:x^ -3 or x? 3}.
4. (Ϊ) {xeR:2si x< 3}; (ii) {xe Д.* >3};
(iii) {xeR: -1< x< 3}; (iv) {xeR: -1 =g x« 3};
(v) {xeR:x > 3}; (vi) {xe Д: л: < -1 or χ =ϊ 3};
(vii) {хеД:3 < л: < 4}.
21 5. (i) {хеД:0< л: < 1}; (ii)0; (iii) {xe R: 1 < x < 2};
(iv) {«i: -2< x< -1}.
6. (i) {* e R: χ < - 5}; (ii) {χ eR:x> 2}; (iii) {x e R: χ < 7};
(iv) {xeR:x > 0}; (ν){*εΛ:χ<0 or x > 1}.
7. (i) {xeR:x< -3 or χ > 1}; (ii) {xe R: x< -i or χ > i};
(iii) {хеД: -| < χ < 3}; (iv) 0; (v) {xeR: -1< x< 5}.

ANSWERS
21 8. {xeR: -3 < χ < -\ or χ > 2}.
9. (i) 1; (ii) 3; (iii) \. 10. (i) Negative; (ii) positive.
11. χ < -4; -2 < χ < 1; χ > 3.
12. 10. 13. 12. 14. 1. 15. 600.
16. 20m. 17. fm2; ^=lm, AD = im.
Miscellaneous Exercise 1
I. 1111000,· Seven: ip, lp, 2p, 4p, 8p, 16p, 32p.
22 2. a = 2, b = 3. 6. a = 1, * = 4, с = 3.
8. 1-1/V7. 9. (i) 10; (ii) f. 10. Unaltered.
II. (Expressions not unique) (i) An C; (ii) A' n Л'; (iii) С п Л;
(iv) ВП(А'\) С).
23 12. к = -7, α = 8, * = \; к = 3, а = -2, Ъ = -2.
13. (i) Expressions equal; (ii) strict inequality.
14. 1/162,4/27.
CHAPTER 2
28 Ex. 5. (i) 8 units N; (ii) 3 units W; (iii) 4V2 units NW;
(iv) 2V17 units N 14° 2' E; (v) 10units N 36° 52' W.
Ex. 6. χ = (2a + b)/7, у = (a-3b)/7.
Ex. 7. (i) V7 units N 19° 6Έ; (ii) V3 units S 30° E;
(iii) 1 unit S 60° E; (iv) 41 units S 40° 54 W.
Ex. 8. (i) 2V2 units at 45° with downward vertical in plane through SN
line;
(ii) 3V2 units at 45° with downward vertical in plane through EW line;
(iii) 6V3 units at 45° to Ν, Ε and upward vertical;
(iv) 6V3 units at 45° to N, W and downward vertical.
Ex. 9. (i) 2a+b; (ii) 2a-b; (iii) a + b; (iv) a-b.
30 Ex. 12. χ = b + ia, у = b-^a, a = x-y, b = i(x + y).
30 Ex. 13. Yes, no.
Exercise 2(a)
1. (i) За + b; (ii) 8a + b; (iii) -5a+5b, (iv) -2a+6b.
2. a= (u + 3v)/7, b = (2u-v)/7.
345

ANSWERS
30 3. (i) 3a + 2b; (ii) -a + 2c; (iii) 2a + 5b + 5c; (iv) 6a + 2b-3c.
4. a = K"+v), b = (-u-4v + 3w)/9, с = (-7u-v+3w)/9.
5. (i) 2 units E; (ii) 3 units S; (iii) 5 units N 36° 52' E;
(iv) V2 units SE; (v) 5 units N 36° 52' W.
31 6. (i) 13 units S 67° 23' E; (ii) 13 units N 67° 23' E;
(iii) V29 units N 21° 48' W.
7. (i) 1-53 units N 22° 30' E; (ii) 2-8 units S 59° 38' E;
(iii) 1 -47 units S 16° 20' E; (iv) 7-4 units N 67° 30' W.
8. -20х-10л/2у.
9. λ = 2 sec 70°, μ = -2 tan 70°; λ' = μ' κ, 1-02.
10. 9units:(i) 12° 45'; (ii) 125° 28'.
11. OP:(i) a + b; (ii) 2a + c; (iii) 2b-c.
OQ:(i)2a + 3b; (ii) 5a + 3c; (iii) 5b-2c
OR:(i)3a + 4b; (ii) 7a + 4c; (iii) 7b-3c.
PQ = a + 2b; RQ=-2b + c.
12. iu + ^v, fu + Jv, u + v, w=-Ju-^v.
13. b-a, -a, -b, a-b.
14. a-2b, 2a + 2b+2c, a + 2b+2c, b-2c, -a-b-4c.
32 15. -a + 2b + 4c, a + b + 4c, a+2b + 2c;
(-3ED' + 2EG+2EF)/7, (ED'-3EG+4EF)/7, (ED' + 4EG-3EF)/14.
16. (i) j + 2k; (ii) i + j + 2k; (iii) i-j-2k; (iv) 2i + j + 2k;
(v) i + k; (vi) i+j + k.
17. (i) aq-ap; (ii) ap-aq + ar.
18. (i) (4u-2v)/3; (ii) (4v-2u)/3; (iii) 2(v-u).
19. (i) u-iv-iw; (ii)i(v-W); (iii) i(ii + w), (iv)i(v-u).
21. AC-2AD', 2AC-AC.
22. bV2-a, b-aV2, -a,-b, a-bV2, aV2-b.
35 Ex. 16. α + β= 1.
Exercise 2{b)
40 1. 2b-a,K2c + a), K-a+3b+c). 5. A(b + c), A(2c + a).
6. lb, i(a + b), 2a, О A = AX.
41 7. |a + ic,ia+|b; χ = }(2a + 4b + c).
346

ANSWERS
PAGE
41 8. Ja', |b+d + a', b+-£d + a'; the point of trisection of DD' nearer D'.
9. u + v, \(y + w), - i(v + 2u); point A' on WO produced such that
ox = $wo.
10. 3:1. 11. 2i + 3j, 6i + 4j, 6j-i.
42 15. i |.
Miscellaneous Exercise 2
1. b+ 2 cos 72° a, a+ 2 cos 72° b.
3. (i) Parallelogram; (ii) equal diagonals; (iii) rectangle.
43 14. 3:1:2.
15. {αδ2 - A)/(7i ^ - У2 <U (/07, - <*72)/(7ι <?2 - 7s <*i)! c, d not parallel.
CHAPTER 3
Ex. 4. - 2, 6.
48 Ex. 6. (2,1). Ex. 7. β, - J), (¥, - i1). Ex. 8. (1, 1, 2).
Ex. 9. (0, 4) Ex. 10. (-2, 3, 9), (0, -1, -5). Ex. 11. (f, ψ).
Ex.12, ίΐ + j + ik, (|,1, i).
50 Ex. 13. 2V5, V6, V(2a2 + 3/t2). Ex. 15. (2, 4, 3).
Ex. 16. (0, 0, 0), V2.
Exercise 5(a)
H:!)HiH (J-
.«(Ii); CO |
4· <»»- (J· '-HiW-(J·'- ©·-(-;)·
» PPM 347

ANSWERS
» 5· (Ι)· (-1)· (-!)· Θ-
411 (:)· Μ)-
7. (i) (3, 5); (ii) (1, 3); (iii) (}, -*); (iv) (i -i -j);
(ν) (α,Ο, -*).
52 8. (i) (2, 6), (3, 8); (ii) (-1, Щ, (Ο, ψ); (iii) (e+ b, b-a), (a, *);
Gv) & 1, -i), (1, f-, -V); (v) (а-i*, fa+i*, ia+*),
(a-ib, -ia+ib,ia+b).
9. (i) V8; (ii) V40; (iii) VlO; (iv) V51; (v) J(8a2+4b*).
10. (5, 3), (1, -3). 11. V.
12. (2, -1, 3), (4, i, i). 13. 20. 14. 1:2.
55 Ex. 18. (i) lx-y-Ί = 0; (ii) 2x+3.y + 5 = 0.
Ex. 19. (i) 5x-y-13 = 0; (ii) x+y-1 = 0; (iii) 3x-2.y+7 = 0;
(iv) y-2 = 0.
Ex.20. (0 2; (ii) -1; (iii) -f.
57 Ex. 23. 3x-.y-22 = 0, x+2^-5 = 0, (7, -1).
Ex. 24. 11*+5^+1 = 0. Ex. 25. x+3y-l = 0.
Ex. 26. 3V29.
Exercise 3(b)
1. (i) 2x-y-2 = 0; (ii) 2x+.y+3 = 0; (iii) x+2y-l = 0;
(iv) ax+by-(a2+b*) = 0.
2. (i) 3x-y-5 = 0; (ii) 7x-y+ 11 = 0; (iii) y+1 = 0;
(iv) 3bx+2ay-5ab = 0.
3. (0 I; GO -i; (Ш) 5; (iv) (b+a)Kb-a).
4. (i) 2x-.y-7= 0, x+2^-1 = 0;
(ii) 5x + 4.y-ll = О, 4Х-5.У-17 = 0; (iii) x-3 = 0,y+l = 0. 2V5.
5. (i) (3, -2); (ii) (-(a3 + *W+n -ab(a-b)/(a*+b*)).
6. (i) 43x+3.y = 0; (ii) lx-y-2 = 0; (iii) 10x-6^-9 = 0.
58 7. (-3, 10). 8. (5, -9), (3, -1), (1, 3). 9. (2, -3).
10. (- 5, 4), (- 5 ± 2V21, 0), (0, 4 ± 5V3).
11. x-Xy = 1 - 2λ; a circle on (- 2, 1), (1, 2) as diameter.

ANSWERS
58 12.3:4. 13.(3,-4). 14.(2,1). 15. Lies on AB.
16. (i) 2x+3y+6 = 0; (ii) 2х+Ъу-6 = 0; (iii) 2х-Ъу-6 = О;
(iv) 2x-3j>-12 = 0.
62 Ex. 27. (i) ζ = 0; (ii) ζ = 2.
63 Ex. 31. 3.y-5z = 0. Ex. 32. (2λ + 3, -λ-2, -3A-5), (1, -1, -2).
Ex.33. Kx+3) = y-1 =-«z-l).
Exercise 3 (c)
1. (i) [1, 1, 1], [1/V3, 1Д/3, 1/V3]. The direction ratios only are given in
the remaining seven parts; (ii) [1, -2, 1]; (iii) [-2, 2, 1]; (iv) [1,0,-1];
(v) [-1, 1, 1]; (vi) [1, 2, 3], (vii) [0,a,b+c]; (viii) [1, 0, -1].
2. (i) x+y+z = 1; (ii) x + y-2z = 5; (iii) 2x+y-z = 4;
(iv) 3x-y-2z = 4; (v) 5x+3y-2z = 2; (vi) 5x-z = 2.
3. (i) ix = y = -iz- (H) x-1=y-2 = -i(z+iy,
(iii) }(*-!) = -40,-1) = -^+!); (iv) %x + 2) = -i(z-3),y=l;
(v)x=l, ζ =2; (vi) &x-a)=y-2a = z-3a.
4. (i) 3x-3y + z = 0; (ii) 5x-4y-l = 0; (iii) 3^-5z-3 = 0.
5. (i) 7x + 3.y+5z = 0; (ii) 4x-4.y-5z+ 5 = 0; (iii) llx-^+5 = 0.
64 6. 3.y+2z-7 = 0; 7x-4^-5z = 0.
7. 2x+.y+2z+7 = 0.
8. (3λ+8,λ+1,1), (5, 01). 9. (1,1,0).
10. x-2 = i(y-l)= -(z-1), (3,3,0).
11. [1/V5.0, -2/V5], (1,-4,3).
13. (-7, -7, 4), x-4.y-.z-17 = 0.
14. (l,l,3),3x-4y-2z+7 = 0.
65 15. (1, -2, -1). 16. (-3, -4, 2). 17. (7, 5, 0).
Miscellaneous Exercise 3
2. 2:1, 5y. 4. mx-y+ib-am) = 0.
66 8. χ = -y = \z.
10. (i) Planes form a prism;
(ii) planes all pass through the line χ — ζ + 1 = 0, у = 4.
11.2:1; 2:1. 12. 18λ/ί+2λ-5/* = 0.
67 14. α(^+ у)/(д - α). 15. χ + ζ V3 - 2 = 0.
ra-a 349

CHAPTER 4
PAGE
68 Ex. 1. 3,6,6. Ex.2. 8, -1,5.
69 Ex.4. 1, -1,0.
Exercise 4(a)
1. 10, 2, 3-J, 82. 2. 1, -9, 26, -24. 3. 0.
4. 3, 4, -5. 5. 7, 20, 22, 5. 6. 4, 0, -6, 2. 7. 3, -7, -6, 0;
3. 20, 33, 0; -If, -1, 1 8. 1, k, -Ik2, -k\
70 11. -9, 27,3.
Exercise 4(b)
72 1. (i) 62; (ii) 8; (iii) -2; (iv) -241; (v) Ц; (vi) 4*.
2. (x-2)(x-3)(x+l).
73 3. (i) (2x+l) (x+4)(x-l); (ii) (x+l)(3x-4) (4x + 3);
(iii) (x + 5) (x-2)(2x+l); (iv) (x+3) (^-2); (v) (x-2) (xHx+l);
(vi) (x+l)(x-3)(x-5); (vii) (x-2) (x2 + 4x + l);
(viii) (2x-l)(2x + l)(2x + 3).
4. As for Qu. 3 except (iv) (x + 3) (x - V2) (x + V2);
(vii) (x-2)(x + 2-V3)(x+2 + V3).
5. 2. 6. -5.
7. -6i 8. 4, -5. 9. 2, -2.
11. (i) (x-l)(x-2)(x* + 6x+l);
(ii) (x-l)(x-2)(x + 3 + 2V2)(x+3-2V2).
12.0,4; 2(x+l)(x-l)(x-2).
Exercise 4(c)
75 6. x3-x2 + 3x + 2. 7. 2x3-5x2 + x+l.
8. x3-x-5.
Miscellaneous Exercise 4
76 2. Я =-2, 5 = -3.
3. (i) l8x + 6; (ii) 1261X + 2777. 4. P(b)/(b-a), P(a)/(a-b).
5. (b + c)(c + a)(a+b). (i) 30 + 2tf-3/-)(-3/?+9+2r)(2;>-39 + /·);
(ii) 2, 3, 13, 47, 73.
350

ANSWERS
PAGE
77 8. (i) -(b-c)(c-a)(a-b); (ii) (b + c) (c + a) (a + b);
(iii) (a + b+c)(b-c)(c-a)(a-b).
10. x^ + Sx'+Sx+S.
CHAPTER 5
78 Ex. 1. wi.
80 Ex. 3. 2, 3, -12, - 13; 0, (3 ± V29)/2.
Ex.4. 2, 3, -12, -13; 0.
Exercise 5(a)
81 1. -1, -1, 5, 23, -1. 2. Ц, 2$, 3$.
3. {y e R: у > 0}, {у e R: у < 0}.
82 4.-1,1. 6. 2i, {zeR: 2=g z=g 5}.
9. (i) 0, 0; (ii) {yeR:-h^y<2}; (iii) {(3 - V5)/2, (3 + V5)/2};
(iv) 0.
10. {yeR: -4V2< *<; 4V2}. 11. f^iy) = 1 +jA
12. i. 14. (i) No; (ii) no. 15. Reflection in χ = у.
Exercise 5(b)
15. χ < -KV13+1) or χ > -KV13-1).
16. -V3 < x < -V2 or j2 < χ < V3.
17. л: < li or * > 3. 18. χ > 2|.
19. χ < -i or * > If. 20. * < -л/2 or χ > V2.
21. x<0 or li<x<l 22. χ < -l-\ or 34 < x < 5.
351

CHAPTER 6
PAGE
87 Ex. 1
■ π π 5π 3π 5π 11π
'3'4'Ϊ2'~4'~4'~6~'
Ex. 2. 45°, 150°, 120°, 40°, 105°, 36°, 48°.
Exercise 6{ά)
π π 7π 5π Ίπ 3π 7π \\π 5π 19π
' 9'4'Ϊ2'~6~'~6~'Τ'Τ'ΊΓ'Τ'ΊΓ'
2. 30°, 60°, 135°, 75°, 210°, 810°, 54°, 900°, 50°, 63°.
3. 0-445, 1-244, 2-552. 4. 27-5°, 37-2°, 124-9°.
6. 6080 feet—the 'nautical mile'. 7. 18-5 cm.
8. (i) 10-5 cm; (ii) 43-3 cm2; (iii) 9-1 cm2.
9. 1r sin θ, 7Γ/-0/9Ο. 10. 1-7 m, 8 m2.
Э 11. 9-6 cm, 26-5 cm2.
12.(i)216°; (ii) 47-1 cm2; (iii) 6-24 cm.
Exercise 6{b)
\ 1. -i, -1, 2, -V3/3, -V2/2, 2, 0, V3/3, -J2, -2V3/3.
2. V2/2, -1,-2, -V2/2, V3/3, -2, V3/2, -2V3/3, -V3/3, -V3/2.
3. -0-5736, -0-6157, -0-8391, 3-0716,0-3420.
. ... 7л- 11Я- 3π 7тг 2тг 4тг тг 7тг
4.(0Т,—; (п)т,т, (ш)т>у; (.ν)-,-
, я- 2π 4π 5π \3π Ππ 3π Ίπ
(ν)3-,τ,τ.τ; (νΟ^,^; (vu)T,T;
, ...ч π 5π 137Γ 177Γ 25π 29π ,. ч 5π , ч п . , „
«""'Τί-ίί-ΊΪ'ΤΓ-Ίί-Τί-· «Τ· W<M».».i».an
<·?■ «Π·?·?·
«ν)--,»,»; ,ν,-ϊΛ" W-J.Ji W0 -£-££''*
(viii) no solution; (ix) --; (χ)
12' 12' 12"
' 3π 7π

ANSWERS
94 6. (i) 3, 1; (ii) 4,0; (iii) 5,1.
7. (i) -2V2/3, -V2/4; (ii) V5/5, -2V5/5;
(iii) -i, 2V2; (iv) -V15/15,-4V15/15; (v) -4V15/15,-V15/15.
95 11. -A, H. 12. 3 + 6V10/10.
13. (i) 45,123-7, 225, 303-7; (ii) 70-5, 289-5; (iii) 45, 225,153-4, 333-4;
(iv) 38-2, 141-8; (v) 36-9, 143-1, 216-9, 323-1; (vi) 30, 150;
(vii) no solution; (viii) 0, 63-4, 180, 243-4, 360.
14. (i) 9*2 + 4(y-3)2 = 36; (ii) x2+y* = 2; (iii) 2*2-2*y + 5.y2 = 9;
(iv) *2(16-j>2) = 9.y2; (ν) yil-y) (1 + *)2 = 1.
15. (i) (sine,0); (ii) (±1,0); (iii) π/ω seconds. 16. a. = \n.
96 19. 4, li (i) 0; (ii) 2.
98 Ex. 12. arccot: R -> {y e R: 0 < у < π};
arcsec: {xeR: \x\ > l}^ {y eR: 0< y^ π,y+ \ττ).
Ex.13. &r,in, -\π.
Exercise 6{c)
99 2. 0, |7Г, тт. |тг, 2тг. 3. 35-2. 4. 65°.
100 5. 35-3, 144-7. 6. 0-4, 3-32, 5-14. 7. 0-90, 2-25.
15. (i) V3/2; (ii) V3/2; (iii) V3/3; (iv) -V2/2; (v) \; (vi) 3;
(vii) 4; (viii) -V3/3.
16. (i)x/V(l-x2)l GO */V0+*2)· 17. V2/2.
18. x, |*| > 1. 19. *2+/ = 1.
Miscellaneous Exercise 6
1. 0-967.
101 2. 2a2(>-V3/4). 3.11,-1. 4. 0-73, 2-41, 3-99, 5-43.
5. 54-4 cm. (i) 12-25 cm; (ii) 75-5°; (iii) 13-2 cm.
6. 0. 7. sin * Φ ± sin у
8. (j>,3» or (Ця.з». 9. (*2-/)2= I6*y.
102 11. 0-887Γ, 0-787Γ, 0-69тг, 0-60я, 0-50я.
12. 0-467Г, 1-бЗтг, 2-317Г, 3·95π, 4тг.
13. 2ηπ + α., ηπ+α..
14. (i) i(3n+l)n; (ii) §(3я+1)я- or 2кл. 16. 0-39тг, 2-3 cm.
353

ANSWERS
CHAPTER 7
PAGE
103 Ex. 1. 120, 720, 5040.
104 Ex. 2. 12, 60, 720. Ex. 3. 120, 6.
105 Ex. 4. 10, 20. Ex. 5. 1. Ex. 7. 364, 165.
106 Ex. 8. 210. Ex. 9. 510.
Exercise 7(d)
1. 120. 2. 870.
107 3. 665280. 4. 216. 5. 120, 48, 30. 6. 495, 135.
7. 5005, 420420. 8. 34650. 9. 120, 85.
10. (i) 48!/(9!39!); (ii) 48! 16/(9! 39!). 12. \n\, 840.
13. 30, 12. 14. 19958400. 15. i«(«-3). 16. 90.
108 17. n\/(plq\r\),n\/{3(n-2p)l(pm. 18. \n{n-1).
109 Ex. 10. {##, Я71, TH, TT}, {OH, 1#, 2#}, {coins fall the same, coins fall
differently}.
112 Ex. 20. i Ex. 21. A. Ex. 22. зтт-
Ex. 23. (i) 132(39!)2/(27! 52!); (ii) 13 (39!)2/(27! 51!).
Ex.24, (i) I; (ii) i; (iii) i.
Exercise 7(b)
1. 64, 20, A.
113 2. {■ά,^,ϋ,ϋ,ϋ,ή,ή}.
3. (i) i; (ii) A- 4. Ш. 5. Ш-
6. Pr (0) = A, Pr (/·) = (10- /·)/50, r = 1 to 9. 7. &-.
8. (i) τ^; (ii) A.
9. {2, 3, 4, ..., 20}, {тк, т^, jh, .·., tSit, т№, ifo, ■·■> ue}l Ϊ&.
ю. Α.
115 Ex. 26. A. Ex. 27. A- Ex. 28. (i) i; (ii) i; (iii) £; (iv) |.
Ex. 29. fg.
119 Ex.31, (i) i; (ii) A; (iii) A. Ex.34. Ш
354

ANSWERS
Exercise 7(c)
PAGE
119 1. &&, £ft. 2. 0-784. 3. Ш-
4. 0) Α; (ϋ) А; (Ш) A. 6. *, if. 7. if.
120 8. (i) H; (ii) if. 9. 5!, 20, nfe. 10. rir.
11. fi 12. At least 12. 13. Ц. 14. 0-22.
16. if. 17. 13 times.
121 20. 0-956. 21. (i) Α; Α; (ϋ) А, те-
123 Ex. 35. |,f. Ex.36. iV, i
124 Ex. 37. -4^r. Ex. 38. й, зтг, зтг.
Exercise 7(d)
125 1. #r. 2. 0-1997. 3. Д. 4. J, A, #T. 5. ,.
126 6. !^W; 7. A. 8. W, ff, A.
9. p/(p+q-pq).
Miscellaneous Exercise 7
1. (0 Η; (ϋ) aV; 0») 0-065; (iv) 0-036.
2·^!' °' 2(^2)!' ЗОРз)Т
127 3. No difference in either case. 4. Approx. y. 5. i.
6. Replacement yj; no replacement ^g.
7. Probability of securing no prize greater for B, but this is compensated
by greater probability of two prizes.
9. (i) 1; (ii) f. 10. (2'-l)/3-i.
128 11. (i) mlim + n)· (ii) m(m- 1)/{(т + л) (м + л- 1)}; (iii) n/(m + n- 1);
(iv) m/(m+ri).
12. т¥г, A.
13. i (i) AV, ift, i¥5, тЬ; GO rb, тй, A, t^V; Й.
14. (i) 0-09; (ii) 0-111; (iii) 0-336; (iv) small; (v) no.
16. (Λ + λ)(Α + Λ)(Λ + λ) = A2.
17. it, 0, TT; exactly and oppositely synchronized; i, 0, J; J, ^, J.
18. j&.
355

PAGE
128 19. λ, (1-9)(1-<74);!λ, (1-9)9*; |λ, (1-9)9; |λ, (1-<?)<72;
}λ,(1-9)<Ζ3;ΊΛ,94.
20. 1 - (л-1) Ι (л»- лг) Ι и—VCi" - 1)!-
CHAPTER 8
131 Ex. 1. (i) 0, 1, 4, 17; (ii) 2, 5, 16, 65.
Ex. 2. (i) 1,2,4,8; (Ю 1, 2, 3, 4; (iii) 0, 4, 21, 100; (iv) 1, 2, 5, 17;
(v) 1-5, 1-4, 1-41, 1-414, (approx.).
Ex. 3. (i) 3, 6, 9, 12, 15, 18; (ii) -2, 1, 4, 7, 10, 13;
(iii) 1, 2, 4, 8, 16, 32; (iv) 0, 6, 24, 60, 120, 210; (v) 0, 2, 0, 2, 0, 2;
(vi) i, i, А, А, то A-; (vii) 1, i, 1, 4, 25, 216;
(viii) -1, 5, -1,9, -1, 13.
132 Ex. 5. 1, 1, 1, 1, 25.
Ex.6, (i) 1 + 4+9; (ii) 0+2 + 6+12; (iii) 1 + 6+21 + 60;
(iv) -1 + 2-3 + 4-5; (v) 2+3 + 10+29; (vi) 3 + 0-1+0+3 + 8;
(vii) 0+i + i+i + l
133 Ex. 7.(i) 18; (ii) 14; (iii) 100; (iv) 6; (v) lA; (vi) -15;
(vii) 31; (viii) A;
Ex. 8. (i) Σ /·; (ii) Σ (-1)^1/·; (iii) Σ lr; (iv) Σ Τ-1;
(ν) Σι-1; (vi) Σζ-e+l); (νϋ) Σ(-1)'+ΐ; (viii) Σ л
Ex. 11. en. Ex. 12. Ъг- 2, Ц3г~ Ό, 4- 2/·.
Ex. 13. (i) 46; (ii) 35; (iii) 86; (iv) 40; (v) 100.
134 Ex. 14. (i) Σ(3/·-2)(3/·+1)(3/·+4); (ii) Σ 6/·2(4/·+1); (iii) Σ (2r)3r-2;
(iv) Σ(2/·-1)(4/·-1)(3/·-1)-1; (ν) Σ(- 1)'+V{(2/-1)(3/·- l)C7r-4)}-1.
Exercise 5(a)
135 1. (i) 112; (ii) 2&; (iii) 15.
2. (i) Σ3Κ3/—2);
(ii) Σ (-1ГМг+1)(г+2); (iioJU-D'+H^-l)2.
356

ANSWERS
135 3.(i)l,4,13,40; (ii) 1, 1, 1, 1; (iii) 2, 22, 24, 28; (iv) 1, 2, 1|, lfi-
4. sn-2sn^ + sn^ = d.
5. (i) 9/·-6; (ii) i(2r-l); (iii) 20-8/·.
6. (i) 51; (ii) 70; (iii) 30. 7. (i) 2601; (ii) 17045; (iii) -1155.
8. 6-5/·, -930. 9. 8/-+1,л(4л + 5). 10. i(5r+3), £л(5л+11).
136 11. 40. 12. 36. 14. 2,4,6. 17. llf, 20j.
18. 63.
Ex. 15. (i), (ii), (iv) and (vi). Ex. 16. (i) 8; (ii) 7; (iii) 32.
Exercise 8(b)
138 1. (i) K412-l); (ii) -K220-l); (iii) (2 + V2)(28-l);
(iv) i[3 -3-»].
2. Ц, 2.
3. li,2i,4i, ... or -li, 2i, -4i 4.18.
5. 21. 6. 29. 7. 27. 8. 16.
9. 1-126 xlO8 km. 10. j, 1, 3.
139 11. £316-1. 12. £313-07. 13. (i) 1^; (ii) 6; (iii) f.
14. AS-. 15. 23i years. 16. {pa(p"-l)-an(p-1)}(/>-1)-2.
17. «r = />«,._!. 18. a2(l - /·) (1 - /·2") (1 + ζ-)"1.
140 Ex. 21. in(n+ 1) (л + 2) (я + 3) (и + 4) (я+5).
141 Ex. 22. 5L- |{(я + 1) (η + 2) (я + 3) (я + 4)}"1.
Exercise 8(c)
142 1. (i) 650; (ii) 6084; (iii) 2865; (iv) iw(4«2-l).
2. Ая(я+1)(« + 2)(Зя + 13), Ал(я+1)(я + 2)(Зя + 17).
3. Ая(я+1)(« + 2)(Зя + 1).
4. i^"3 + 2я2 + Ъп + 10). 5. tVC*2 -1).
143 6. Ып +1) (2и + 7). 7. i - i(2« + 3) {(я + 2) (я + З)}"1.
8. з7-*(3я+1){(я+1)(я + 2)(я + 3)}-1.
9. AV- А(4я +13) {(п+1) (л+ 2) (л+ 3) (я + 4)}-1.
10. (i) я(3я2 + Зя - 2); (ii) hniri + 1) (я + 2) (Зя + 1).
(iii) Ып+ 1) (Зя+ 1) (Зя- 2); (iv) ^я(я +1) (я + 2) (я+ 3) (4я +1).
357

ANSWERS
143 11. l-i;i„(„+lf(„+2).
12. i(2«+D(2« + 3){(«+l)(«+2)}-1-i
14. i-iMfl+l)}-1. 15. i«(4^-l).
144 Ex. 23. 1, 6, 15, 20, 15, 6, 1; 1, 7, 21, 35, 35, 21, 7, 1;
1. 8, 28,56,70, 56, 28,8,1.
145 Ex.26, (i) 81x4-l08x3y + 54x2y!-12xy3 + ;»>4;
(ii) 64x6 +192x5y + 240л:V +160*V + 60*V +1Ixf + у*;
(iii) 128*' - 448x6 + 672x6 - 560x4 + 280x3 - 84x2 +14* -1;
(iv) 243x6+810xV+1080xV + 720xV+240x/ + 32/.
Exercise 8(d)
146 1. (i) x6-6x&y+15x1y2-20xsys+15x2yi-6xy5 + f;
(ii) 32x5 + 40x1y+20x3f+5x2y3 + ixyi+^2y;
(iii) 1 + 14* + 84л^ + 280х3+ 560x4 + 672x6 + 448x6+128*';
(iv) 64(x6 +18x6y + 135xV + 540xV +1215лУ +1458*/ + 72V).
2. (i) 54; (ii) 672; (iii) 264; (iv) 19440.
3. (i) 1 + 16X+112*2; (ii) 1 + Ч-х + Чкх2\ (ш) 256(4- 20x+45x2);
(iv) -y> + 2\y*x-№fx\
4. (8\ 2rx*r-*,nw.
147 5. A. 6.(i)l-0615; (ii) 0-98411; (iii) 0-99501; (iv) 235-01.
7. 322. 8. 120,4200.
9. (i) l+4x+10x2; (ii) l-6x+21x2; (iii) 1 + 10x + 35x2;
(iv) 16(4-12x-9x2).
10. ±li%. 11. 2". (i) 81; (ii) 1.
Miscellaneous Exercise 8
1. (i) i(3"-l); (ii) 3*"<«-i> 2. A, 19-99998,9.
148 3. f. 4. {nx"+1-(n+l)x"+l}(x-l)-\
{„2xn+2_ (2„2 + 2„- 1) X"+i + (n+lf X--X-1} (х- 1)-з.
5. (;+;). 6. 4, 12, 36; 2.3-
149 12. {(l-^-n^a-^Jd-*)-2.
13. (/·-1) digits 1, followed by r digits 0 and finally the digit 1.
358

ANSWERS
PAGE
149 15. 627500.
150 17. (i) 15л(л+83)/2; (ii) 45(19«-40). 18720, l0dn+720n-55d; 30.
18. („-If.
Revision Exercise A
151 1. 5,3;x-l. 2. (-5, -5). 4. (i) 1110; (ii) 17-3.
5. 120, 24. 6. 40-9°. 7. 7x+7.y-29 = 0.
8. 0, 4; (x2 + 2V2x + 4) (x2 - 2V2x + 4).
9. 3RQ-2BC, 3RQ-BC.
10. (i) -V5/3; (ii) -2V5/5; (iii) Ц.
152 11.5,15. 12. 11, (3x+l)(x+1).
13. 0-036, 0-12. 16. 28, Цп+1)(п + 2).
18. 10 cm, 9-6 cm, 37-25°, 9-75 cm. 19. two, H-
21. 2x+3.y + 4z-9 = 0; 3V13/13, -2713/13,0.
153 23. (be'+b'cf (ca' - c'af = (aU + a'bf [(be' + b'cf +(ca'- c'af].
24. i(5-V97) < x< -1 or i(5 + V97) < χ < 4.
25. b0i+l)(f+2)(3/! + 7).
26. {-2 < χ < 2},{-3 < χ < 1},{2 < χ < 3},{1 < * < 2},
{-3<x<2}.
27. 1-03. 28. MM. 29. -i-j-3k.
30. 2Et = E2+Ez.
154 31. J. 32. г =/<a-b), a*(a+b)/(a+* + c).
33. (i) i(V5+l), (ii) 2; 4n + 1 is a perfect square. 34.^.
CHAPTER 9
158 Ex. 6. ЛЯС is right-angled at A. Ex. 7. Not' only if.
Ex. 8. Not 'if. Ex. 9. Yes.
Exercise 9(d)
160 4. 13. 5. 0or4. 6. 13.
10. No solution if a = - 1 unless * = 0.
15. No: e.g. right-angle at В and Μ coincident with B.
359

ANSWERS
CHAPTER 10
PAGE
171 Ex. 2. lOOO-X -X.
173 Ex. 3. l£ Ex. 4. Lose.
Ex. 5. Yes, if*
Exercise 10(a)
174 1. (i) 2|; (ii) -A; (iii) 3^;
(iii) 6A; Gv) 10.
3. 3p. 4. W.
175 8. 9p. 9. 7p.
14. (i) 8-75; (ii) 5-25.
176 15. 27ip. 16. 22ip.
178 Ex. 6. 2, li. Ex. 7. i
(iv) 5i.
5. £100.
12. £1-32.
17. *i3 8
16p | 2 3
Ex. 8. (i) μ+c;
= 1.
2. (i) 4h 00-ii;
6. 0, 3-3.
13. 2.
13 15 16 17
3 2 3 3"
(ii) σ\
Ex. 9. i, 0. Ex. 10. <?(Χ)*-3μσ*-μ\
Ex. 11. |(1 + f)3, Iii
Ex. 12. 2p.
Exercise 10(b)
1. (i) -μ, σ; (ii) μ+1, σ; (iii) 3/t-1, 3σ; (iv) 0, 1.
2. (i) 2-81, 4-41; (ii) 3-7, 2-01; (iii) 0-15, 3-53; (iv) 150, 3000.
3. (i) 12-31, 121-1; (ii) 15-7, 102-8; (iii) 3-55, 16-75;
(iv) 2-55 xlO4, 2-83 xlO8.
4.4J-.2J. 5. Hn+l),Un2-D.
6. M, H, H. 7. A. 9. (i6 + i3 + ί + 3)"/6", |я.
10. A, 7, 2-415, 1-71. 11. ff, ^, ^.
12. 6-5, 2-36. (i) |; (ii) 0.
Miscellaneous Exercise 10
1. £7-33. 2. 4p favours player, Цр favours banker.
3. Just over £3. 4. — lj^. 5. Yes, if Χ, Υ independent.
7. 2(n-r)/{n(n-l)}. 10. 0p-i. 11. «σ2+/*2-
"■НШ'Н'5К)Ш1Ю

ANSWERS
CHAPTER 11
PAGE
192 Ex. 1. Yes, provided a Φ 0, b Φ 0. Ex. 2. OA2.
Ex.3, (i) Pythagoras; (ii) Extension of Pythagoras (Cosine Rule).
Exercise 11 (a)
193 3. Sphere, centre A, radius |c|.
195 Ex. 4. -5. Ex. 6. (3i + j + k)/Vll.
199 Ex. 9. 1, 1; lies on angle bisector.
Ex. 11. ЗХ + 2.У-10 = 0, (2, 2). Ex. 12. (-a, -*), \c\.
Ex. 13. 8x + 6y+13 = 0, 8x+6y-7 = 0.
Exercise 11 (ft)
1. (i) 2, ψ, (ii) 2, ¥; (iii) 3, |; (iv) Z>, (3e-26)/5.
200 2. (i) 2V5/5; (ii) V65/65; (iii) 8V17/85; (iv) |(я2-*2)/(я2+*2)|.
3. (i) V17/17; (ii) 12V13/13; (iii) 6V10/5; (iv) (h + lcf (/z2+/t2)-}.
4. (i) 2V5/5; (ii) 9V5/10; (iii) 2V26/13; (iv) (a+b) (a2 + *2H.
5. (i) Ъх-4у + \6 = 0, Ъх-Ьу-U = 0;
(ii) 8x+6y + 35 = 0, 8x + 6y-25 = 0; (iii) 5x+12^+41 = 0,
5x+12.y-37 = 0; (iv) 3x + 2.y+ 1 ±3Vl3 = 0.
7. (i) 2x-y = 0; (ii) 7x-7y-3 = 0; (iii) 128x+16y+69 = 0.
8. x-ly-18 = 0. 9. (i) (9, -9); (ii) 52i; (iii) +h
10. (-p, -q).
201 11. x+2y-U = 0, x-y+l = 0, arccos (8V145/145), (¥, 3).
12. 37.
202 Ex. 14. (i) i(I-2J + 2k); (ii) (3i- j + к)Д/11; (iii) (3i + 2j + к)Д/14.
Ex. 15. (i) x-3y+z = -5; (ii) x-^ + z = 2; (iii) 2x+3.y-5z = -16.
Exercise 11(c)
205 1. 110-9°. 2. (i) x-.y+z = 0; (ii) 2x+y-3z = 13;
(iii) Ъх-z = 1; (iv) 6x-3.y-2z = 2; (ν) 4x+.y-3z = -1;
(vi) ax+by+cz = a2 + b2 + c2.
3. (i) 2V70/35; (ii) V28/84; (iii) 2/29; (iv) 1/2.
361

ANSWERS
206 4. (i) 3VH/H; (ii)Vl5/15; (iii)V66/ll; (iv) 17V58/174;
(v) аЫаг + Л2)-1; (vi) \ab - be - ca \ (a2 + Ьг + с2)"1.
5. (i)V7/7; (ii)Vl05/15; (iii) 1; (iv) 40/41; (v) ,/3/2.
(vi) Vffi(<?/ -ртЩМР + m* + «3)(p2 + q2 + r2)}.
6. (i)K2i-2j+k),2x-2^ + z= -1;
(ii) i(3i-2j + 6k),3x-2.y + 6z = 0;
(iii)^(i + j-4k),x+^-4z = 1;
(iv)^(4i-j-2k), 4x-y-2z=6;
(v)^(3i-4j+k),3x-4.y + z =1;
(vi) (a2i - b*j + c2k)/V(a4 + b1 + c4), a*x - V-y + c2z = 0.
207 7. (i) (2, 3.-1); (ii) (4, 7,-5); (iii) (1, 1, 1); (iv) (0,-1, 3).
8. (i) (9, 5, 3); (ii) (7, -10, -3); (iii) (-6, 16, 6); (iv) (-9, 9, 12);
(v) (15, -13, -2); (vi) (- 2ad(a* + Ρ + c*)-\ - ΉκΚ,α* + & + <?Y\
-Icdtf+tf + c2)-1).
9. (i) (1, 1, 3); (ii) (2, 3, 4); (iii) (6, 2, 4); (iv) (6, 4, 1).
10. (8, 2, 0). 11. (1, 4, 3), K*-l) = y-4 = z-3.
12. Цх-1) = - Hr- 5) = fa- 2). 13. x = y- 3 = - (z+ 6).
208 14. 3: -1:2, Vl4.
16. «V3 -1), «V3 +1), V3(i - V3)/6, V3(i + V3)/6, iA
17. 5λ = 2μ, a = 5i + 29j + 2k, b = -(i + 5j + k).
209 18. 2x-.y + 2z = 9, arccos -f. 19. (1, 5,0), arccos V(rr)·
20. (i) (4, -ι, -3), (¥, f, -¥); (ii)!;(!,!, -|).
Miscellaneous Exercise 11
1. |/7-Xj cos a—^sinal.
2. 3x + ll.y+19 = 0, llx-3^-17 = 0.
210 3. 14x + 8.y-57 = 0. 5. 3-75, 1. 6. (-4,3).
9. 1:2.
211 11. 60°. 15. x + 2 = Ky+5) = z-5.
16. \x=-\y = -z. 17. s + 2|u.s-p|(±u).
362

ANSWERS
PAGE
212 18. x*+y* = l + 2.ycot0.
19. (i) x-5y + 4z = 0; (ii) x+y+z = 0, (6, -2, -4).
20. -11,(1, -2,3).
21. Ж-*, V3/2, 0), C(-i, - V3/2, 0), £(0, 0, V2), (0, 0, V2/4); -4:0:V2,
2:±2V3:V2,0:0:l.
22. 2.
213 23. 5x+12^-60 = 0. 24. 2V6.
25. χ = [p(b* + c2 - a2) - 2a(*9 + cr+d)] (a2 + b2 + с2)"1, etc.;
(A, -A,l),(-i -if).
26. 15x-9.y-3z = 0, 1:2: —1.
27. E[(a2+Z,2 + c2)/-2a(a/+*w+w)x] = 0.
214 28. у = ζ = 0, ζ = χ = Ο, χ = у = 0; cy-fe = 0. 120°.
29. arcsin 8Д/82, arccos 4Д/82.
30. Intersection of planes with 2,(m2ns—m3n2) χ = 0.
32. 14-2°. 33. Sphere, centre the centroid.
CHAPTER 12
215 Ex. 1. Yes, e.g. A = В = 60°.
216 Ex. 2. KV6-V2), KV6+V2)· Ex. 3. -i(V6+V2), i(V6+V2).
Ex.4. (i)ff,M; («) U, -U; (in) -ft, M-
Ex.5. -2 sin Л sin Я.
217 Ex. 6. (a) 0-0 Φ (2A:+1) in; φ) θ φ \т±\тт.
219 Ex. 7. (i) ± V10; (ii)±V5; (iii) Ю, 0; (iv)l±V2; (v)i(2±V2).
Ex. 8. (i) 0, 270, 360; (ii) 53-1; (iii) 90, 323-1; (iv) 4-9, 220-5.
Ex. 9. 1,7. Ex. 10. \.
220 Ex. 11. -J, fV2· Ex. 12. Ц, \. Ex. 13. &τ, £π.
Ex. 14. V2-1. Ex. 15. 2*VG ~x\ 2/-1. Ex. 17. arcsin Ц.
221 Ex. 18. (1 + 2i- f2) (1 + i2)-1.
223 Ex. 19. (i) 2 sin 2x cos χ; (ii) 2 cos 3x cos x; (iii) -2 sin 2x sin χ;
(iv) V2 sin Й7Г+ χ); (v) s/2 sin χ
224 Ex. 20. 0, \тт, |тг, тг, §тг, |тг, 2тг. Ex. 21. 0, ψ, π, %π, 2π.
363

Exercise 12(a)
224 1. (i) i(V6+V2); (ii) -i(V6-V2); (iii) -(2+V3); (iv) 2-V3.
2. (i) KV5-D; (ii) W(10+2V5); (iii) i(V5+l);
(iv) -KV5+DV(10+2V5).
225 5. (i) 28-6°, 208-6°; (ii) 120°, 300°; (iii) 37-5°, 217-5°; (iv) 18-3°, 198-3'
6.(0 225°; (ii) 90°, 330°; (iii) 29-5°, 256-7°; (iv) 22-6°, 90°.
7. (i) 0°, 116-6°, 180°, 296-6°, 360°; (ii) 180°; (iii) 45°, 120°, 135°,
225°, 240°, 315°; (iv) 0°, 45°, 90°, 225°, 360°.
8. |sina| < |; 9-3°, 50-7°, 189-3,° 230-7°.
10. 4 cos3 Θ- 3 cos Θ, 8 cos4 Θ- 8 cos2 0+1.
226 13. (i) ± (a2-*2) (2a*)-1; (ii) a = 8i, * = 6*; tan θ = Д.
16. R = V5, a = 26-6°. (i) 140-5°, 346-3°;
(ii) 0°, 63-4°, 180°, 243-4°, 360°.
17. -97-6°, 0°, 90°, 171-3°.
227 18. (Цп, Дтг), (Дтг, Цп).
Exercise 12(b)
232 1. 76 m. 3. (i) 84; (ii) Ц; (iii) 8i
233 4. 101-2°, 44-8°. 5. 14-8°. 6. 9-3 m. 7. 36° 8. 127-8°.
9. (i)A(A2 + 2a2)-*; (ii)/г^2 + a2)"*;
(iii) 2я/г(/г2 + α2)-1; (iv) V№2 + 2a2)/V(2A2 + 2a2).
10.(i)35-3°; (ii) 64°. 11. (i) 77 m; (ii) S 46° W. 12.121m.
234 13. 529 cm2. 14. (i) 64-9°; (ii) 69-4°; (iii) 31-6°.
16. 1-12/·. 17. 1-96, 56-8°.
235 19. exccos{(ci-ai-bi + 2aW)H2l?c:i+2(?a2 + 2a2l?--ai-bl-c1)-i}.
Miscellaneous Exercise 12
2. (i) i; (ii) h ~i- A. 2m < x < (2л +1) π, n e Z.
236 8. GK Aw), (А", *я), фя, in), б", Ня). 13. 0, *π, тт.
14. i(l ± V2). 15. tan ia = a/*.
237 17. о 4.
(i) (9 = 2n7T + arccosf, 0 = 2w?r+arcsin |;
(ii) (9 = 2n7r+arccosf, ψ = (2w+l) 7r-arcsini;
(iii) θ = 2n7T+arccos|, ψ = 2w7r+arcsinf;
(iv) θ = 2n7r+arccosi, ψ = (2w+l) w-arcsin f.

CHAPTER 13
PAGE
245 Ex. 9. S(P) = (3, 5), T(P) = (3, 0), U(P) = (6, 5).
Ex. 12. Reflection in Oxy plane.
Exercise 13(a)
1. (-5, -10), (i, -i).
246 2. (-5, -4), (3, -5); (0, 0). 3. The line x-2y = 0.
6. Reflect in χ = у and then project on to χ axis.
7. Reflect in χ = у and stretch by factor 2.
8. \ad — bc\; all points map into line ex — ay = 0.
11.
П-
250 Ex. 13. (i) 2 1
Exercise 13{b)
•(-"3=
4. (i) AB-2AC; (ii) A2-AC + BA-B(
(iii) A^AB-BA+AC-CA-iB+C)2.

(iv) О О - 2 ; (ν) 2 2 -6 ; (vi) -6-2
\5 1-3/ \-l -3 19/ \- 8 -4
/ 3 8 0\ / 1 4 5\ /6 1 -1\
7. (i) -3 3 5 ; (ii) 5 4 5; (iii) 1 6 3 ;
^«(-»
-27 -40 - 1
254 8. (i) 4 1
^{_1-·_ϋ··π-3=
/0 О й\
10. (χ j> ζ), * 0 0; Τ preserves lengths; each + 1.
\0 с 0/
11. 2x+y-z = 0, (-1, 7, 5).
12. x-y + z = 0, (5, 8, 3).
255 14. (0,0), (2, 2), (-1,0).
Miscellaneous Exercise 13
2. Shear; χ, ζ coordinates fixed.
3. (i) (0, 0) only; (ii) line 2x-y = 0; (iii) all points in the plane.
256 5. (0, 0) only; no.
6. (i) (0, 0) only; λ = 10: line 7x-5y=0,X=-2: line x+y = 0;
(ii) all points of line x-y = 0; λ = 6: line 4x + y = 0.

PAGE
256 8. AB = ΒΑ.
9. A diagonal, provided non-zero elements of D distinct.
257 11. No.
CHAPTER 14
l-i о i\ ι i i -¥
(iii) 1 5 -4 ; Ov) 0-4 3
\ 0 -3 2/ \-i ! -I
Exercise 14(d)
>(14);
/ f -3 V\ / i i -i\ I f -i ¥\
(iii) -f 2 -Ϊ ; (iv) i -i i ; (v) \-i i -ψ);
0 -1
i i
■u

ANSWERS
PAGE
267 Ex. 11. (i) 3; (ii) 5; (iii) 7.
Ex. 12. (i) 2; (ii) -3; (iii) 0;
(iv) -3; (v) abc+2fgh-ap-bg*-ch\
273 Ex. 13. (i) 12; (ii) -9; (iii) 0, (iv) 1.
Exercise 14(ft)
275 1. (i) -2; (ii) -1; (iii) -1; (iv) 22.
2. (i) -30; (ii) 0; (iii) 0; (iv) 1; (v) 24; (vi) -9;
(viii) -12; (ix) 8; (x) 6.
. (iii) Singular.
16 -16,0,0;
/30-11 5\ /0-1 2\
1-10 4 -2,21,2; (iv) 1 -1 -3,1
\ 2 ^1 1/ \-2 3 5/
i i -i ·
6. (i) 12; (ii) 1; (iii) -10. 7.(i)(b-c)(c-d)(a-b);
(ii) (*-c)(c-a)(a-*)(a + * + c); (iii) (b-c)(c-a)(a-b)(bc + cc
36\
-95.
136/
/-3 33 4\
.0-8 0 .
\ 2 - 2 -l/
Miscellaneous Exercise 14
2. (a) Yes; (*) yes; (c) yes.
ι ac\ /a* ab
be); [ba b2 +
c7 \ca cb+
1 bc+2 .
c2+4/
278 6. Yes. 9. (* - cf (c - a)2 (a - bf.
368

PAGE
279 13. A2 =
CHAPTER 15
285 Ex. 2. (Й.-А).
286 Ex. 5. True for any χ provided b = 0.
Exercise 15(d)
1. (i) (2,5); (ii) (-3,5); (iii) (f, -?).
2. (i) If α Φ -b, χ = b-a; if α = -*, true all x.
(ii) If αφ b,x= ic-b) (а-ЬУ1; if a = b φ с, no solution;
if a= b = c, true all x.
3. (i) χ = (4 +За) (За-б)-1, у = 5(2-a)-1, provided α φ 2.
If a = 2, equations inconsistent.
(ii) If α φ 4, * = 0, у = 2; if α = 4, * = 2(2-λ), j; = λ, all λ.
(iii) If α φ ±6,χ= 2(6 +аГ1, > = (6 +α)"1.
If α = 6, χ = λ, у = i(l - 3λ); if α = - 6, equations inconsistent.
4. (ί)ΚαΦ-3*,Λ:=(*2 + ύ*+8)(4α+12*)-1,^=(24-α*-α2)(4α+12*)-1.
If α = -ЪЪ, b2 φ 4, equations inconsistent; if b = 2, χ = λ, у = 3λ+ 1;
if* = -2, л: = λ, у = ЗА—1.
(ii) If ab φ -3, χ = 6(1 +*) (οΛ + 3)-1, j, = 2(3-α) (аб + 3)-*.
Ifa= Ъ,Ь = — 1,х = Л,у = 2-λ; if ab = -3,α φ 3 equations inconsistent,
(iii) If α2 φ Λ2, χ = (α—Λ)-1, ^ = (b- a)-1. If a = b, equations inconsistent;
if a + b = 0, χ = λ, у = (αλ-Ι)α-1.
5. Inconsistent unless α = 3 or -1;
if α = 3, χ = - 2, j> = 3; if β = - 1, * = 2, > = -1.
6. Inconsistent unless a = 1, 2 or - 3. If a = 1, χ = V, .У = t',
if a = 2, χ = 2, j> = 0; if a = - 3, χ = 1, .у = |.
287 7. (i) α Φ 0, * Φ 0, α φ *, * = c(b-c) сгЧЬ-аУ\у = c(c-a) b-\b- аУ1;
(ii) a = 0, b φ 0, с Ф 0, equations inconsistent unless
* = c, when χ = λ, у = 1; (iii) α = 0, b φ 0, с = 0, consistent,
л: = λ, у = 0; (iv) α φ 0, * = 0, с φ 0, equations inconsistent unless
369

PAGE
287 я = с, when solution is χ = 1, у = λ; (ν) я φ 0, * = 0, с = 0,
consistent, χ = 0, у = λ; (vi) a = b φ 0, с Φ 0, inconsistent unless
я = 6 = с, when solution is л: = λ, ^ = l-λ; (vii) α = b = 0, с Φ 0,
inconsistent; (viii) я = b = с = 0, consistent, χ = λ, у = μ.
8. яА-яА = 0, χ = АяГ1, .у = -Щ1.
9. If 0 φ hr+φ (к integral), x = sin (φ-a,) cosec($S-0),
j> = sin(0-a) cosec(0—φ); if 0 = ктт + ф, equations inconsistent unless
θ and ψ both differ from α by an integral multiple of π. If θ = α + 2w7r,
0 = α+2/OT, χ = X,y = l-λ; if0 = α. + 2ητπ,φ = а + (2я+1)тг,
л; = А, у = λ-l; if θ = a + (2w + l)7T, 0 = α, + Ίηπ, χ = λ, > = λ+1;
if# = α + (2ηι+1)π,ψ = а + 2(я+ 1) π, χ = λ, у =-1-λ.
292 Ex. 12. (i) Lies in plane OI'J'K'; (ii) can be expressed in terms of any
two of i', j', k'.
Ex. 14. Columns of A proportional; (i) lies on ΟΓ; (ii) Ai'.
Exercise 15(b)
294 1. (-2,0,4).
/ 1 6 5\
295 2. 1 1 2l. (i) (0, 1, -1); (ii) (2,0, 1); (iii) (-35, -5, -7).
3. Any point of line i(x-l) = Ну+1) = i(z+2).
4. (i) Line λ, Цб-Ίλ), i(13— 11λ); (ii) inconsistent: planes form
prism in direction 5i—7j —Ilk.
5. b = 10, line; b φ 10, inconsistent.
6. я = "2, inconsistent; я φ 2, consistent only if * = я + 2, giving line.
7. (2, 1,-3).
296 8. я = 5, inconsistent; я = — f, line; unique point otherwise.
9. 4, (0, λ, 2λ). 10. (λ, λ, - 2λ), (1 + λ, λ, - 2λ).
11. Цх+6) = у = -^-(ζ-12), parallel.
297 14. Tetrahedron 7x-6.y-5z > 0, χ-2y- ζ < 0, 9x-8.y-7z < 0,
3x-4.y-3z > -2.
Miscellaneous Exercise 15
1. Ii i l), (-2, -l, l), (4,4,4)-
\0 -2 -ι/
2. (i) Line or plane; (ii) inconsistent.
370

ANSWERS
PAGE
298 3. (f, -1,4).
5. A = b-a, В = c-a, С = (b-c) (c-a);
χ = (*2 + с2)(й-*)-) (с-я)-1, у = {с2Л-a2) {a-b)-1 (b-c)~\
z= (a* + b*)(b-c)-1(c-a)~1.
299 6. (1, 1, 1), (λ, 3-2λ, λ).
9. (*-с)(с-й)(й-*), л: = я*(*-с)(с-я)-1(я-*)-1;.У = оЧа-Ь)~\
ζ — аЫ^с—а)-1; а = b Φ с, inconsistent unless α = 0; а Φ Λ = с, line;
а = b = с, plane.
300 12. - 4, 3, 2, -11 (1 + λ)/7, 5(1 + λ)/7, λ.
13. a-2b-c = 0, (λ, 1-2λ, λ-1).
14. λ φ f, -1, unique; λ = ^-, inconsistent; λ = -1, line.
CHAPTER 16
306 Ex.2. TPo,TsPo,ihPo,6hsPo. Ex.3, (i) 2-5; (ii) 2.
Ex. 4. 1-8,1-44. Ex. 5. f, ^, 7 to 1.
Exercise 16(a)
1. ii^ 2.0-348,0-071. 3.0-000105, 0-00605, 0-201.
4. 9, 0-387. 5. (i) ^2U; (ii) 0-566.
307 6. <й-, Hi. 7. (9-/-K5/-+5)-1. (i) 1-5; (ii) 1; (iii) 0-46.
9. /79(10-9p), £3-52, 46-8%. 11. (Ί) 0-388; (ii) 0-0386.
311 Ex.7, (i) A; (ii) Ш; (ш) 12.
Exercise 16(b)
313 1. 0135, 0-271, 0-271, 0-180, 0090, 0-036, 0017.
2. 1-45; (a) 0-06; (*) 0-33.
314 3. (i) 1; (ii) 0-846; (iii) 0-013. 4.0-167,0-594.
5. 0-030, 0-106, 0-185, 0-216, 0-188, 0-132, 0-077; 3, 4-25.
6. 3; 60, 222. 7. 20, 0-029. 8. 0-302, 0-060.
315 9. No: Рт(Х^ 21) « 0-121. 10. (λ* + A)/(l -e"A).
11. afr+l)"1, 2. 12. 0-0030, 0-0379.
13. 226-8, 211-4, 98-6, 30-6, 7-1, 1-5.
371

ANSWERS
Exercise 16(c)
PAGE
321 1. (i) 0-63; (ii) 0-61. Pr (2 or more) « 0-09. 2. 0-79.
3. (1 -p)w {1 +10^1 -p)»}, 13-15.
322 4. 0-9860, 0-0119, 0-002. 5. No.
6. Pr (/· < 4|w = 10) = 0-029; yes. 7. Pr (3 or more sixes) = 0-062.
8. Yes; no {ρ π 1/163).
323 9. Not significant in either case.
11. (i) (1-ρ)14(1 + ΐν{1 + 105^2(1-,ρ)13 + 455^3(1-,ρ)12};
(ii) 0-164; (iii) 17-5.
12. £2-25, £1-50.
Miscellaneous Exercise 16
1. 0) Яг; (ii) 4; (iii) Si 2. 1, 0-335; 20.
324 3. M- 4. 8. 5. Yes, if ρ < i; no otherwise.
6. }iV(iV+1) (2p-1). 8. ¥ (2/И7)" <J?+q*)-
9. μρ.
325 11. 0-3.
12. 20, 60, 90, 90, 67, 40, 33. (i) £157-50;
(ii) £167-50; (iii) £160-75.
13. f^jjrXl-p)-',
Qo*)*(i-да)"--
14. β(',*)= Q(*)*(t)"
15. 0)99-1%; (ii) 0-03; (iii) 0-1.
326 16. 0-011, 0-870, 0-516; 23-7. 17. 0-585.
18. e-'4, μ, е-^+оу^гЫУ1.
327 20.0-2; no. 21. p(n-/·+!) (r-rp)-1, rm = [р(л+1)].
372

Revision Exercise В
PAGE
328 1. x-ly = 0,4x+2y-15 = 0. 2. Щ, %, \. 4. i(4/H
5. J(10+V10), 7. 7. 3. 8. No.
329 9. (2x-.y+l) 0+З.У-2), two lines; (i) two lines; (ii) origin.
/-8 11 -10\
12.^-9 12 -9. (i)(20,20,-23); (ii) (-V, -5,
\ 10 -13 11/
13. l-48x+1104x2, 0-6158. 14. (i) 37x-27.y = 0;
(ii) 31x+27 = 0.
15. (i) 3x + 4; (ii) 5x + 7.
/ 1 -2 -2\
330 16. a. ~ β= (2k+l)n. 19. И-2 1 -2.
\-2 -2 1/
20. 3:1. 21. About f, 69.
23. (i) 0; (ii) 2(9-0 (r-p)(p-q)(y-z)(z-x)(x-y).
25. A < ρ < A. 26. (2, 5, - 3), (4, 1, 1).
27. {yeR: -1 < у < J2}.
331 28. (x + c)(ax+*-c). (i) (1+ c) (a+*-c); (ii) (c-1) (*-<
(iii) (ab + c)(a2bc+b-c).
29. Plane л: + ζ = 0.
30. -fo, UU. 31. (i) 0, in, π, in, 2π; (ii) Д-тг, Ня.
32. -2 < χ < -i or i < л: < 2. 34. β-
332 35. (-2, -1, -5). 37. 2, -4, -3. 38. 4.
CO - 2 - 2 3 ; (ii) - 2 3 - 2 .
\ 23 24-33/ \ 43 -62 45/
40. 26° 34', 45°, 108° 26'. 41. x-2y+ ζ = 0, 1:2:3.
42. (i) fff; (ii) ih\ (Ш) Ш- 43. α = 2"-l, /S = 1.
45. A: = 1, line; A: = — 2 or A: = 3, inconsistent.
46. 2Q°, 100°, 140°; 1-44, 0-33, -0-27.
47. (i) i(«+l); (ii) in\n-iy\

ANSWERS
334 49. (i) - Mb - c) (c - a) (a - *);
(ii) (x + a+b+c)(x + a-b-c)(x-a+b-c)(.x-a-b+c).
50. x-2y-2z + S = 0.
51. 53°, 155°, 344°. (i) 158°; (ii) m = -0-005, с = 1-7.
53. φ = ег-вг.
335 55. к = 5, line; /с = -7, prism. 56. 554400.
57. л: < -5, -1 < χ < Ι, χ > 7.
59. u + v+w = 0 and Зя+2*+с = 0 for consistency;
line (λ, λ+Ub-a), X-i(2a+b)).
60. ЗЦ, 49-A-.
336 61. 47°. 63. x-1 = -i(y+2) = z.
64. (i) \\ (ii) i + \(a - b)n (a + b)~"; *(* + 2a) (a + *Г2 in both cases.
65. 5567. 66. Yes, provided λ Φ 0.
337 68. λ = -2, line; A = 1, inconsistent.
69. (λΙ'+μηΐ)<№+μη,)-\ 70. 23min.
338 71. 3-N, \, |. 74. 2a2 + 2c2 +P = 2b* + 2d2 + e2.

Index
a posteriori, 125
a priori, 125
addition of vectors, 25
additive identity, 3
additive inverse, 3
adjoint matrix, 273
adjugate matrix, 273
alien cofactors, 272
ambiguous case, 229
Apollonius's Theorem, 160
arcsine, 98
arithmetic mean, 19
arithmetic sequence, 133
arithmetic series (progression), 134
arrangements, 103
associative, 3
axes of coordinates, 47
base, 1
base vectors, 45
Bayes's Theorem, 124
bearings, 230
Bernoulli trial, 302
bicimal, 7
binary system, 1
binomial distribution, 302
Binomial Theorem, 145
Cartesian equation, 52
Cartesian product, 78
centroid, 33
Ceva's Theorem, 43
Chebyshev's Theorem, 180
circumcentre, 50
codomain, 79
cofactor, 269
cofactor alien, 272
column vector, 239
combinations, 104
common difference, 133
common ratio, 136
commutative, 3
complement, 14
component form of dot product, 194
component of vector, 28
compound angles, 215
consistent, 286
coordinates, 47
cosine, 89
Cosine Rule, 227
cosecant, 90
cotangent, 90
counterexample, 9, 158
degrees of freedom, 281
de Morgan's Laws, 16
denary system, 1
Desargues's Theorem, 35
determinant, 266
deviation from mean, 176
dilatation transformation, 242
direction cosines, 61
direction ratios, 61
direction vectors, 61
displacement, 24
distributions
binomial, 302; geometric. 308; negative
binomial, 310; Poisson, 311; uniform,
110, 301
domain, 79
dot product, 189
double sampling, 318
elementary events, 108
elementary row (column) operations,
259
empty set, 14
enlargement transformation, 242
entry (matrix), 239
equality sets, 15
equality vector quantities, 24
equation of line, 37, 53, 60
equation of plane, 37, 59
equiprobable, 110
event, 111
Existence Theorem, 27
expectation, 172
Factor Theorem, 70
factorial, 103
Fibonacci Series, 168
fraction, 5
function, 78
geometric distribution, 308
geometric mean, 19

TNDEX
geometric sequence, 136
geometric series (progression), 137
gradient, 53
graph, 80
homogeneous equations, 287, 293
hypothesis testing, 318
i, j, k, 46
identical polynomials, 68
identity matrix, 251
if and only if (iff), 157
image, 79
implication =>, 4, 155
incentre, 154
inconsistent, 286
independent events, 118
inductive hypothesis, 165
inequalities, 17, 83
infinite outcome spaces, 183
inner product, 189
integer, 3
intersection, 15
invariant, 82, 246
inverse function, 81
inverse matrix, 259
inverse sampling, 309
inverse trigonometric functions, 98
irrational numbers, 8
linear combination, 30
linear dependence, independence, 30
linear equations, 54, 281
linear transformations, 241
inverse, 284; matrix of, 244; one-one,
255; product of, 245; sum of, 244
locus, 37
magnitude (vector), 25
mapping, 78
mathematical induction, 162
matrix, 239
addition, 248; adjoint, 273; adjugate,
273; conformable, 248, 249; diagonal,
256; echelon, 265; element of, 239;
elementary, 261; equality of, 247;
inverse, 259; multiplication by number,
248; multiplication of, 249;
non-singular, 264; orthogonal, 278; singular, 264;
skew-symmetric, 278; symmetric, 256;
trace of, 257; transpose of, 254; unit,
251; zero, 249
mean, 176
median, 188
Menelaus's Theorem, 43
method of differences, 140
376
minor, 268
modular arithmetic, 4
modulus, 81
moment, second, 315
multiple angles, 215
multiplication of vectors by numbers, 27
multiplicative identity, 3
mutually exclusive, 114
necessary condition, 158
negation, 158
negative binomial distribution, 310
non-trivial solutions, 287, 294
null hypothesis, 319
null set, 14
odds, 112
one-one, 78
operating characteristic, 317
ordered pair, 78
orthocentre, 58
orthogonal, 46
outcome space, 108
infinite, 183
parameter, 60
one-parameter distribution, 312; one-
parameter solution, 281
partition, 124
Pascal's triangle, 144
periodic function, 83
permutations, 104
Poisson distribution, 311
polynomials, 68
identical, 68
position vector, 32
possibility space, 108
probability, 109
addition of, 114; of event, 111;
multiplication of, 116
probability distribution, 109, 301
probability generating function, 181
probability tree, 117
projection, 190
proper subset, 14
q, e+, i6
R, R+, 16
radian, 87
range, 79
random, experiments, 108
random variable, 171
expectation of, 172; mean of, 176;
standard deviation, 177; variance, 177
rational number, 5

INDEX
rationalizing the denominator, 10
real matrix, 239
recurrence relation, 131
recurring decimal, 6
relation, 78
Remainder Theorem, 72'
row vector, 239
suffix, 49
sum to infinity, 137
surd, 10
tangent, 89
terminating decimal, 6
terms of sequence, 160
transformation, 240
triangle rule (vectors), 25
trivial solutions, 287
uniform distribution, 110, 301
union, 15
Uniqueness Theorem, 28
unit vectors, 32
universal set, 13
Vandermonde's Theorem, 146
variance, 177
vector, 25
components of, 28; dot product, 189;
multiplication by number, 27; triangle
rule, 25; unit, 32
vector addition, 26
vector equations, 37
vector quantities, 24
Venn diagram, 15
Ζ, Ζ+, 16
zero matrix, 249
zero vector, 27
St Petersburg Paradox, 186
sample space, 108
scalar product of vectors, 189
scalar quantities, 24
section formula, 33
selections, 104
sequence, 130
series, 132
set, 12
complement of, 14; empty, 13; equality
of, 15; intersection, 14; null, 13; union,
14; universal, 12
shear, 246
sigma notation, 132
sine, 89
Sine-Rule, 228
single sampling, 316
skew lines, 42
spread, 176
solution of triangles, 227
standard deviation, 177
subset, 13
sufficient condition, 158

LU Ε
CAMBRIDGE UNIVERSIT PRESS

PURE
MATHEMATICS
2
S. L. PARSONSON
Senior Mathematics Master
Harrow School
CAMBRIDGE UNIVERSITY PRESS
CAMBRIDGE
LONDON · NEW YORK · MELBOURNE

Published by the Syndics of the Cambridge University Press
The Pitt Building, Trumpington Street, Cambridge CB2 1RP
Bentley House, 200 Euston Road, London NW1 2DB
32 East 57th Street, New York, NY 10022, USA
296 Beaconsfield Parade, Middle Park, Melbourne 3206, Australia
ISBN:0 521 08032 0
First published 1971
Reprinted 1973 1975
Printed in Great Britain
at the
University Printing House, Cambridge
(Euan Phillips, University Printer)

Contents
17
18
19
20
21
22
23
24
25
26
27
28
29
Pre/асе
Complex numbers (1)
Polynomials and partial fractions
Complex numbers (2)
Mappings in the Argand diagram
Quadratic equations and quadratic functions
The parabola and rectangular hyperbola
Polynomial equations
Vector products and their applications
Continuous probability distributions
Numerical solution of equations
The ellipse and hyperbola
Further matrices
Further coordinate geometry
Revision exercise С
Bibliography
Answers
Index
page ν
379
401
423
438
450
461
483
506
520
552
573
609
633
657
680
683
703

Preface
This book completes the course designed to cover the work required for
modern Ά' level pure mathematics syllabuses (including probability),
particularly the M.E.I, syllabus. As explained in the Preface to Volume 1,
the calculus has been excluded, since it is already adequately covered by
existing texts, but a knowledge of the subject is demanded throughout the
book, and particularly in Chapter 25, on continuous probability.
In places the subject is carried beyond the bare limits of 'A' level
requirements: most chapters contain a small proportion of work which
would probably best be left to a second reading, while Chapters 20, 24, 28,
29 contain a substantial proportion of * S' level work.
As in Volume 1, the book is liberally supplied with exercises for the
student. Most questions are straightforward applications of the bookwork,
though a few harder questions may be found in the Miscellaneous
Exercises. The questions marked Ex. occurring in the text illustrate the
associated bookwork: those marked with an asterisk should be regarded as
obligatory.
The same nomenclature as in Volume 1 has been used to indicate the
source of examination questions and I am grateful to the Examination
Board of the University of London and the Oxford and Cambridge Schools
Examination Board for permission to reproduce their questions. I should
also like to record my thanks to Dr N. A. Routledge and Mr A. J. Moakes
who read the book and made many valuable suggestions and to my wife
who once again lent invaluable assistance in checking the answers.
S.L.P.

^

\η. Complex numbers (1)
1. AN EXTENSION OF THE REAL NUMBER
SYSTEM
We have seen (Chapter 1, et seq.) that, by broadening the meaning of the
term 'number', we are able to ascribe solutions to an increasing range of
problems. To take a simple example, consider the solution of certain
equations. If we restrict 'number' to mean 'positive integer', the equation
*-2 = 0
has a root, but the equation
x+2 = 0
has not. The latter equation does have a root, however, if we postulate
the existence of negative integers. Thus, by augmenting the set Z+ of
positive integers into the set Ζ of integers we are able to solve a wider
variety of equations. Z+ is a subset of Z, and the familiar laws of algebra
governing the combination of elements of Z+, that is
(i) closure: if a, b, e Z+, then a + b, ctbeZ+;
(ii) commutativity: a + b = b + a, ab = ba;
(iii) associativity: (a + b) + с = a + (b + c), (ab) с = a(bc);
(iv) distributivity:a(6 + c) = ab + ac, (b+c)a = ba + ca;
(v) the existence of a multiplicative identity: \a = a
hold for the elements of Z. But we also have two new laws which hold for
Ζ but not for the subset Z+:
(vi) the existence of an additive identity: 0+a = a;
(vii) the existence of additive inverses: ( — a) + a = 0.
In a similar way, an equation such as
2x+l =0,
which has no solution in Z, has a solution if we augment Ζ into the set
Q of rational numbers. Again, Ζ is a subset of Q, and laws (i)—(vii) above
hold in Q, together with an additional law
(viii) the existence of multiplicative inverses: α_1α = 1 (α φ 0).
Laws (i)-(viii) define afield; that is, any set of numbers combined by the
operations of addition and multiplication and satisfying laws (i)-(viii)
above constitute a field.
379

COMPLEX NUMBERS (1) [17
Ex. 1. Show that the set
S = {x = a+b^/2: a, b rational}
constitute a field under the usual operations of addition and multiplication. (For
example, to verify (iv) you have to show that
(αι + ό1λ/2)(α2 + ό2λ/2)
is of the form a3 + ό3λ/2,
where аъ ...,b3 are all rational numbers. Check, in a similar way, the other seven
laws.)
The extension of the number system to the set Q vastly increases the
number of equations with a solution, but it is not difficult to formulate
equations in terms of the elements of Q which have no rational solution:
consider, for example, the equations
x2-2 = 0 or 2 sin χ = 1 or 10* = 5.
Many such equations acquire solutions if we augment the set Q into
the set R of all real numbers. Although it is possible to define R in terms
of the elements of Q, we have contented ourselves with the intuitive
geometrical concept of R as ' completing the number line' but, since real
numbers may be approximated arbitrarily closely by rational numbers, it
seems reasonable to assume that laws (i)-(viii) above hold in R as well.
Since we have 'filled in the gaps' of the number line it might seem
reasonable to suppose that our work is done and that any equation
involving elements of R as coefficients should have elements of R as solutions.
Such is not the case however; indeed, if we modify the three equations of
the previous paragraph only slightly to
x2 + 2 = 0 or sin л; = 2 or 10* =-5,
none of these equations has a solution in R. However, these equations have
solutions provided we extend the number systems still further. It will be
the purpose of the remainder of this chapter to introduce such an extension
in a reasonably informal manner. The resulting augmented set we shall call
the set of complex numbers and denote it by C. Since С will contain R as
a subset, it will be necessary to verify that the laws of combination (i)-(viii),
suitably redefined, hold in С also.
Up to this point, each extension of the number system has led inexorably
to a further extension. However, it has been shown (by С F. Gauss in the
Fundamental Theorem Algebra) that no further such extensions are needed,
in the sense that any polynomial equation with its coefficients in С will
have all its roots lying in С as well. But the real significance of the set С
goes deeper than simply enabling solutions to be found for any polynomial
380

2] MANIPULATION OF COMPLEX NUMBERS
equation: it reveals an underlying unity among mathematical concepts
that would otherwise remain unsuspected, for example between
trigonometric and exponential functions, suitably redefined.
2. COMPLEX NUMBERS AND THEIR
MANIPULATION
The extension of the real number system that was foreshadowed in Section
1 is effected as follows: we introduce a new symbol j and define a complex
number to be an expression of the form a+b] where a, b are any real
numbers. (The sign + is used at the moment simply to unite the two
components a and b] of the complex number: the operation + has only been
defined so far for combining real numbers.) For the moment, j is to be
regarded purely as a new symbol: an interpretation of j will appear as soon
as we have defined operations between complex numbers. Before we do
so we must define equality: two complex numbers a+b] and c + d] are equal
if and only if a = с and b = d.
Addition, subtraction and multiplication of complex numbers is defined
to proceed precisely as if each complex number were a polynomial in j,
where j is subject to the condition j2 = — 1.
Thus for example,
(3+j) + (5-2j) = 8-j
(3+j)(5-2j)=15-6j + 5j-2j2
= 17-j (since j»=-l).
Ex. 2. The result of dividing the polynomial P(x) by x2 +1 may be written in the
form P(x) = (x2 +1) Q(x) + ax + b.
Discuss the relation of this polynomial identity to the concept of a complex
number. (Put л: = j.)
Ex. 3. Simplify: (i) (2+j)-(3-j), (ii) (7-7j) + (4+3j), (iii) (2-j)(2+3j),
(iv)(l + 3j)2, (v)(l+3j)3.
In the same way that constant polynomials were identified with real
numbers, so a complex number of the form α + Oj can be identified with
the real number a. Thus we write
2+0j = 2 and 2(3-4j) = (2+0j)(3-4j) = 6-8j,
as might have been expected. 0+0j is written as 0 (the complex number
zero). Similarly, we abbreviate 0 + 6j to b].
The existence of a multiplicative inverse for any non-zero complex
number has already been postulated in Section 1. To find an explicit form for
such an inverse we use the fact that
(a + b])(a-b]) = a2-(6j)2 = a* + b\
381

COMPLEX NUMBERS (1)
1 2+] 2+j
\<-—i) —
more generally,
a + b] a—b]
Thus ^-^_ — — _--.
(я+ВД-1 = whz τΞπ = ά-iJ ^ b>not both zer°)·
Ex. 4. Show that the multiplicative inverse of (2—j) obtained above is unique;
that is, show that ,„ ... . . , ,
(Z-j)-1 = x+y]^x = f, y = i.
Division of complex numbers by complex numbers now follows:
ζ 4- w means zw~\ For example
4-j _ (4-j)(l~3j) _ l^j _ ! _is-
i+3j~(i+3j)(i-3j) ίο Λ loJ·
Ex. 5. If Zj = 2+ 3j, z2 = 3-4j, express in the form a+oj:
(i) 2г1-Зг2, (ii) (2Ζι-ζ2)2, (iii) (z1-2jz2)2.
Ex. 6. Express as complex numbers in the standard form a+by.
(i) 1/(2 + j), (ii) (2 + 3j)/(l + j), (iii) {(1 + j) (1 + 2j)}/(l + 3j).
*Ex. 7. Taking 0 = 0 + 0j as the additive identity and 1 = l+0j as the
multiplicative identity, verify that laws (i)-(viii) of Section 1 hold for complex
numbers.
The interpretation of α + Oj as a real number enables us to speak
meaningfully of the square roots of a negative real number. Thus, if a e R,
then the real number — a2, regarded as an element of C, is the square of the
complex number a] and also of the complex number — <zj.
Given a complex number ζ = a + b) we call a the real part of ζ and b
the imaginary part of z:
a = Re (z), b = Im (z).
(Note that the imaginary part of a complex number is raz/!)
If ζ is non-zero, we have seen that the reciprocal of z, \j{a + b]), is
obtained as follows:
_! = _[_ = _J_ а-Щ = fl-ij
a + b] a + b] a—b] a2 + b2'
The number ζ* = α—6j, obtained by writing — j for j in z, is called the
complex conjugate of z. The real number \z\ = V(fl2 + *2) is called the
modulus of z. Thus we have shown that, for any non-zero complex number z,

2] MANIPULATION OF COMPLEX NUMBERS
Ex. 8. Write down the real and imaginary parts of the following complex
numbers:
(i) 3-2j, (ii)(3-2j)2, (ШИЗ-г.])-1.
Ex. 9. Show that Re, Im may be interpreted as functions С -» R. Find
Re [Im (a + b})] and Im [Re (a + b})],
R being regarded as a subset of C.
Ex. 10. If zx = 2-j, z2 = 3 + 2j find:
(i) z*, (ii) z*, (iii) (Zl + Z2)*, (iv) (ζιΖ2)*, (ν) (1/zO*, (vi) (Zl + 3z2)*, (vii) (z« + z|)*.
Ex. 11. The conjugacy function/: С -» С is defined ЬуДг) = ζ*. For what subset
of С is /the identity function?
fit. 12. Find:
(i) |3+4j|, (ii) |2j|, (iii) |l/(l-j)|, (iv) |cos0+j sin0|, (v) |l-cos θ+] 5ΐηθ\.
Ex. 13. The word 'modulus' has, prior to its use in this chapter, been used in the
context of real numbers only, to mean ja2. Show that the two uses coincide if
R is regarded as a subset of C.
*Ex. 14. If zl5 z2 are two complex numbers, and z2 Φ 0, prove that:
(i) (Zl + z2)* = z* + z*, (ii) (Zlz2)* = z*zt, (iii) zlZl· = \ζλ\\
Ov) R* = ΐ = rii- (v) ζχ + zf = 2Re (Zl),
\z2/ z2 |z2|
(vi) z1-z* = 2jlm(z1).
The results proved in Ex. 14 are of great importance and should be
committed to memory.
The concept of modulus enables us, in a sense, to order complex
numbers, but this is not entirely analogous to the ordering of real numbers,
since different complex numbers can have the same modulus. It is not
possible to order the complex numbers in the same way as we order the
reals.
Exercise 17(a)
1. Ifzj = l + 2j,z2 = 2—j,z3 = 4 + 5j, express as complex numbers in the
standard form a+by.
(i) гг + гг + г3·, (ii) 3z!-z2 + 2z3; (iii) ζ^; (iv) z! + 2z3; (v) z2z3 + z3zx + z^;
(vQz^z,,; (vii)(z1-jz2)/(z2 + 2jz3); (viii) -+-; (ix) (z2 + z3)3;
(x) (z1-z2)(z2+jz3); (xi) j/tf+jzl); (xii) (г?-г2г3)/(г22-гзг1).
2. If Zj = cos <?! + j sin 0l5 z2 = cos 02 + j sin 02, show that
Zl z2 = cos (θ1 + θ'„) + j sin φχ + θ2),
and find zjz2. Evaluate z\ and (z1z2)2.
3. Solve the following equations:
(i) x2-Ax + 5 = 0; (ii) 2л:2-2л: + 1 = 0; (iii) x2-5x + l = 0;
(iv) jx2-2x-2j = 0.

COMPLEX NUMBERS (1) [17
4. If (5- 12j) = (a + b])2 where a > 0, find a, b. Find similarly complex numbers
which when squared give (i) j; (ii) 3 +4j.
5. Solve the equation x2-(4+j) лг + 5-j = 0, giving each root in the form
a + b].
6. Given that A(3 +2/) -5(1 -j)- (5 + 2j) = 0, find A and B: (i) if A, В are both
real; (ii) if A, B are conjugate complex numbers.
7. If ζ = 1 — cos Θ—j sin 0, write down the value of z*. Express z'1 as a complex
number in standard form.
8. If α is a real number and ζ is a complex number, prove that (1+az)* = 1 + az*.
Deduce that, in standard form, (1 + az)-1 = (1 + az*)/(l + 2a Re (z) + a2|z|2).
9. Given zeC, show that ζ is real oz = z*.
10. What is the conjugate of z*? By considering the product zxz* for any pair
of complex numbers zx and z2, prove that zx z* + z*z2 is real. What can you say
about the complex number ζ1ζ* — ζ^ζ2Ί
11. Show, by constructing an example, that non-real numbers a and b may be
found so that the quadratic equation
x2-ax + b = 0
has a real root.
Is it possible for the equation to have two real roots if a, b are non-real ?
12. Form the quadratic equation whose roots are:
(i) 2 + j, 2 - j; (ii) 2 - 3j, 2 + 3j; (iii) 2 - j,/3, 2 +jV3; (iv) 1 + 2j, 2 + j; (v) 4,1 + j.
Can you conjecture any general result about the coefficients of a quadratic
equation and its roots ?
13. By writing хг+у2 in the form хг — Qy)2, split x2+y2 into linear factors with
coefficients in C.
Factorize into linear factors with coefficients in C:
(i) л:2-2л:+2;00x44/·,(iii)x2+ 3y2;(iv)j(x2 + x+1)-1:(y)jx2-4xy-4]y2;
(vi)x3+l; (vii)jc*+l.
(Hint for (vii): the expression may be written (x2+l)2 — 2x2.)
14. By putting χ equal to j in the identity
χ*-3χ3-χ+2 ξ (л:2 + 1) Q(.x) + ax+b
determine the numerical values of a and b.
Find the remainder, on division by x2 +1, of (i) Xs +1; (ii) x9 +1.
15. Evaluate j" in the four cases η = Am, 4m+l, 4m+2, Am + 3 (m integral).
Find the sum of the series n
16. If (1 +j)" = a+b], prove that a2 + b2 = 2":
(i) by taking the complex conjugate of each side of the original expression;
(ii) by mathematical induction.
384

2] MANIPULATION OF COMPLEX NUMBERS
17. Evaluate the determinant
I 2 1+j 2 1
1-j -1 4-j ,
I 2 4+j 3 |
where j2 = — 1.
Explain how without evaluation, it could have been concluded that the value
of the determinant was real. (M.E.I.)
3. THE ARGAND DIAGRAM
As forecast in Section 1, we may set up a 1-1 correspondence between the
set С of complex numbers and the points in a plane by associating the
complex number x+yj with the point Ρ whose coordinates are (x, y)
referred to a given pair of rectangular coordinate axes. The χ axis
corresponds to the set R of real numbers, the у axis to the set of pure imaginary
numbers, while a general complex number a+ b], α ή= 0, 6 + 0 lies off the
axes in one of the four quadrants.
Ex. 15. Plot the points corresponding to the complex numbers l+2j, l-2j,
-j,+j. What is the geometrical relationship that exists between the points
representing ζ and z* ?
Demonstrate geometrically the result
ζ = ζ* ο zeR.
Such a geometrical representation of complex numbers is generally
referred to as the Argand diagram (J. R. Argand, 1768-1822)f or as the
complexplane.
As we saw in Chapter 3, there is a 1-1 correspondence between the set
of points P(x, y) and the set of position vectors OP = x\ + y\. Thus, the
complex number x+yj may be alternatively represented in the Argand
diagram by the point Ρ or by the position vector of P. Both representations
have their value, and we shall use both freely.
*Ex. 16. ΙΐΡ represents the complex number a + b] (P is the affix of the complex
number a + bj) show that |a + oj| = |OP|.
Consider now two complex numbers zx = at + 6ij and z2 = a2 + b2],
with affixes Рг andi>2. The sum гг + г2 = (a1+a2) + (b1 + b2)];b\it
OPi + OP2 = (β! + α2) i + fo + b2) j
t The first exposition of the geometrical treatment of complex numbers was in fact
published in 1797 by a Norwegian surveyor, Casper Wessel (1745-1818). For a
translation of his paper see The Treasury of Mathematics: 2 by Henrietta Medonick (Pelican).
385

COMPLEX NUMBERS (1) [17
and thus the position vector representing the sum of two complex numbers
is the sum of the position vectors which separately represent the two numbers
(see Figure 17.1).
Fig. 17.1 Fig. 17.2
*Ex. 17. Describe the vector representing the complex number z1 — z2.
Ex. 18. Mark in the Argand diagram the affixes of the complex numbers 2—j,
2(2—j) and |(2—j). Interpret multiplication of a complex number ζ by the
real number (i) a > 0, (ii) b < 0, in terms of an operation upon the vector OP
representing ζ in the Argand diagram.
Ex. 18 shows that multiplication of a complex number ζ by a positive
real number a is represented geometrically by an enlargement of OP
(possibly by a factor less than 1). Multiplication by a negative real number
— a both enlarges OP and rotates it through an angle π (see Figure 17.2).
*Ex. 19. Mark in the Argand diagram the affixes of the complex numbers 3 + 2j,
j(3 + 2j), — (3 + 2j), — j(3 + 2j). Interpret multiplication of a complex number ζ
by the pure imaginary number j in terms of an operation upon the vector OP
representing ζ in the Argand diagram. Show that multiplication of ζ by the pure
imaginary number bj is represented in the Argand diagram by an enlargement of
magnitude \b\ followed by (i) an anticlockwise rotation through \π if b > 0, or
(ii) a clockwise rotation through iniib < 0.
Ex. 20. Interpret geometrically the statement j2 = — 1.
To interpret geometrically the multiplication of two general complex
numbers, it is convenient to introduce a new concept, the argument of a
complex number z.
Given a complex number ζ = x+yj, represented by the vector OP, we
have χ = rcos Θ, у = r sin Θ, where r = \z\ = ■Jix'+y2) (see Figure
17.3).
The two real numbers, r and Θ, determine the complex number ζ
uniquely; conversely r is determined uniquely by z, but there is an infinite
386

3] THE ARGAND DIAGRAM
number of values of θ corresponding to z, differing from one another by
integral multiples of 2π. Each such value of θ is called an argument of z.
Just one of these values will lie in the interval —π<θζπ, and this value
is called the principal argument ofz, and is written arg z.
If arg ζ = φ, we may write
ζ = r(cos φ+) sin φ).
Fig. 17.3
A complex number may always be stated in this modulus-argument form.
For example
l+j = V2(^ + ^j) = V2(cosi77+jsini77),
l-j = V2(cos[-^]+jsin[-^]),
-l+j = V2(cos^+jsin^),
-1-j = V2(cos {-%*]+) sin [-%*]).
(Notice that we must always be careful to select the angle to lie in the range
— π < arg ζ ^ π.)
Example 1. Express in modulus-argument form the complex numbers
(i) -V3+j, (ii) 2-j, (iii) l+j cot α (-π < α < ^anda Φ 0).
(i) I — V3+j| = 2 and we have
= 2(cos|^+jsinfn').
(ii) |2-j| = V5 and we have
= 5(cos α+j sin a),
where a is the angle in the range -\n < α < 0 such that tan α = --£.
387

COMPLEX NUMBERS (1)
[17
(iii) First observe that
|l+jcota| = V(l+cot2a) = |coseca|.
If 0 < a < π, |cosec a| = cosec a and we write
1 +j cot α = cosec α (sin α+j cos a)
= cosec a (cos φ+] sin φ),
where φ is chosen so that . ^
r -η < φ «ϊ π
and sin φ = cos α
and cos φ = sin a.
Thus, for 0 < a < π
1 +j cot a = cosec a[cos (i^-a)+j sin (i?r-a)].
If — я < a < 0, |cosec a| = —cosec a and we write
1 +j cot α = |cosec a| (—sin a—j cos a)
= |coseca|(cos 0+j sin Θ),
where (9 is chosen so that
— π < θ ^ π
and sin θ= -cos α
and cos θ = sin a.
The reader should verify that this gives
1+j cot α = |coseca|[cos (-^-a)+j sin (-|я-а)]
for — я < α < 0.
£лг. 21. Find the modulus and principal argument of each of the following
complex numbers, and write each in modulus-argument form
(i) 1+V3J; (ii)2 + 2j; (iii) -1; (iv) j; (v) -j; (vi) -,/3-j; (vii) ,/3-j;
(viii) 1 + j tan a; (ix) tan α+j; (x) 1 -cos a-j sin a (use half angle formulae).
*Ex. 22. Show that arg ζ = -arg z* and hence find arg z"1 in terms of arg z.
Ex. 22. By writing ζ in the modulus-argument form r(cos θ+j sin #), discuss the
geometrical interpretations for the multiplication of the complex number ζ
by: (i) the real number a > 0; (ii) the real number a < 0; (iii) the pure imaginary
number bj, b > 0; (iv) the pure imaginary number bj, b < 0.
Returning now to the question of the geometrical interpretation of the
multiplication of two complex numbers ζ and w, let us write z, w in the
modulus-argument form:
ζ = r (cos θ + j sin Θ), w = s (cos φ + j sin 0).
388

3] THE ARGAND DIAGRAM
Then
wz = sr (cos Φ+) sin φ) (cos 0+j sin Θ)
= sr [(cos φ cos θ -sin φ sin 0)+j (cos 0 sin (9 + sin φ cos (9)]
= sr [cos (0 + 0)+j sin (0 + 0)],
and thus the product, wz, is a complex number with modulus |vc||z| and
an argument argw+argz. In words, the modulus of a product is the
product of the moduli and an argument of the product is the sum of the
arguments. Geometrically, if the affix of ζ is P, multiplication by w
(i) enlarges OP by a factor |vc|, and (ii) rotates OP anticlockwise through
an angle arg w.
Ex. 24. If ζ is any complex number and w = u+vj (и, ν real and positive), plot
in the Argand diagram, the affixes of z, jz, uz, jvz, uz+jvz. Deduce that
\wz\ = \w\\z\ and arg (wz) = arg w+ arg z.
Is it necessary to modify your demonstration in any way if either и or ν is
negative?
R(wz)
y\ / I
/ /^7m
o\ ДО
Fig. 17.4
In Figure 17.4, /, P, Q are the affixes of 1, z, w respectively. If each side
of triangle IOP is enlarged by a factor | w\ and the triangle is then rotated
anticlockwise through an angle arg w about O, I is brought into coincidence
with Q. Let Ρ be brought into coincidence with R (so that the triangles
OIP, OQR are similar). Then
\OR\ = \w\ \OP\ = \w\\z\
and 10R = arg w+arg z; thus R is the affix of the complex number
wz. (Notice that IP, which represents the complex number ζ -1, is rotated
into the displacement QR, which represents the complex number w(z — 1).)
389

COMPLEX NUMBERS (1)
Ex. 25. Form the product (1 +j) (д/3+j) and interpret your result geometrically.
What are the values of cos 75° and sin 75° ?
Now suppose that w, zx and z2 are any three complex numbers. Let
P, Q, be the affixes of z1 and zx + z2 respectively (Figure 17.5). Then OP
represents ζλ and PQ represents z2. Now enlarge triangle OPQ linearly
by a factor |vc| and rotate through an angle arg w to bring it into
coincidence with the triangle ORS. Then OR represents vezl5 RS represents wz2
and OS represents w(>i + z2). But OS = OR + RS and we have thus
verified the distributive law . .
w(zi + z2) = wz-l + wz*
.Slw^+z,)]
У.
0
lz-l= —
®-
Fig. 17.5
Ex. 26. If w φ 0, show that
and that arg
or differs from arg z—arg w by 2π.
*Ex. 27. Draw diagrams similar to that of Figure 17.4 to illustrate the
construction from the affix of the complex number z, of the affixes of the complex
numbers (i) z2; (ii) z3; (iii) z~\
Example 2. zx and z2 are arbitrary complex numbers. Give a geometrical
verification of the triangle inequalities
h-ζ,ΐ ζ \\Zl\-\z2\\.
Describe geometrically the set S = {ze C:\z-1— j| ^ 1} and show that
ifzeS, then
V2-1 ^ |z-2| ^ V2+1-

3]
THE ARGAND DIAGRAM
In Figure 17.6, OP represents zu PQ represents z2 and thus OQ
represents z-l + Zz and RP represents z1 — z2, where OPQR is a parallelogram.
B\it,inAOPQ, |OQ| ^ |OP| + |PQ| (with equality only if OPQ is a straight
line) and thus . , . „ . , , . ,
Again, in ΔίλΡΚ,
|OP| ^ |OR| + |RP| and |OR| ^ |OP| + |RP|;
thus, |Zl| ^ |г2| + |г!-г2| and \z2\ a |zi| +|zx-z2|,
which combine together to give
Ι^-ζ,ι > |N-|z2||.
Fig. 17.6
For the second part of the question, we know that \z — w\ gives the
distance between the affixes of z, w in the Argand diagram, and the set S
is thus represented by the circumference and interior of the circle, centre
(1+j) and radius 1.
We demonstrate two methods of deducing the final part of the question.
Method (i) {Analytical)
From the inequalities
\ъ-ъ\> 1Ы-Ы1, W+zt\^ Ы + Ы,
\z-1\ = |(z-l-j)-(l-j)l > ||z-l-j|-|l-j| ^ V2-1,
\z-2\ = |(z-l-j) + (-l+j)| ^ |z-l-j| + |-l+j| ^ V2+1-
Method (ii) (Geometrical)
Let С be the centre of the circle \z— 1— j| = 1 (which touches the real
axis at A, say) and let В be the affix of the number 2. Let ВС cut the circle
atP, Q (see Figure 17.7).
391
we have

COMPLEX NUMBERS (1)
[17
For given real a > 0, \z — 2| = a represents a circle, centre В and radius
a. Of all such circles, the smallest containing a point of S (that is, the
circumference and interior of the circle \z— 1 —j| = 1) has radius BP and the
largest a radius of BQ. But, by Pythagoras, ВС = •N/2andthus5i> = ^J2-l,
BQ = ^2+1 and the result follows.
Ex. 28. The triangle inequalities stated in the Example above may be deduced
without recourse to geometry. Show that
Zj z2* + zf z2 = 2 Re (zj z2*)
and deduce that |z! + z2|2 = |Zi|2 + 2 Re (z1z^)+ |z2|2.
Show also that Re (zx z2*) < | zx 11 z21
and deduce that |zi + z2| < |z!| + |z2|.
Prove the second triangle inequality in a similar fashion.
Consider now the effect of multiplying the complex numbers 0, 1,
l+j.j by a+by. we get
0, a + b), (a-b)-(a+b)], -b + aj
—the vertices of another square (Figure 17.8).
(fl-b,a+b)
(0,1)
(1
(0,0)
(1
1)
0)

3] THE ARGAND DIAGRAM
The result is strongly reminiscent of the linear transformation with
^^ /« -*\ /1 0\ JO -14
(* a)=a[0 l)+*(l θ)
= αϊ+6J, say.
The result suggests that it might be profitable to associate the number 1
with the matrix I, the number j with the matrix J and the general complex
number a + b] with the matrix al + b5.
Consider now two complex numbers a+b], c + d] and the associated
matrices , -
e d- с "3-
We have
(i) (a+bj) + (c+dj) = (a+c)+(b + d)j
(ii) (e+uj)(c + aS) = (ec-M) + @c + <fc)j
and (b aj[d cj-\bc + da ac-bd)'
The 1-1 correspondence between the set С of complex numbers on the
one hand and the set of all 2 χ 2 matrices of the form
on the other is thus seen to preserve the structures of addition and
multiplication.
Ex. 29. Confirm, by direct computation of (a + oj)-1 and the inverse of the matrix
I I that the structure of division is also preserved.
*Ex. 30. Interpret the correspondence between the complex number cos θ+j sin#
and the matrix I . . J in the light of their geometrical properties.
\ sin a cos 0)
Ex. 31. If C, R have their usual meanings and Μ is the set of all matrices of the
form I J, a, b e R, determine the nature of the function /: Μ -> R which
corresponds to the modulus function С -> R.
To summarize, we have the following representations of the complex
number a + b}:
(i) the point (a, b) in the Argand diagram;
393

COMPLEX NUMBERS (1) [17
(ii) the vector ax + b\;
(iii) the matrix al+bJ where
>-n-'-(?-i)·
Representation (i) is, as it were, a static representation: it is useful in that
it enables us to discuss sets of complex numbers in geometrical terms, but
by itself, it lacks the additive and multiplicative structure of complex
numbers.
Representation (ii) is, as it were, dynamic: addition of vectors and
addition of complex numbers have the same structure. However, multiplication
of complex numbers has to be interpreted by a new operation on vectors:
dot products of vectors on the one hand and products of complex numbers on
the other do NOT have the same structure.
Representation (iii) gives a. full structural representation of the complex
numbers in terms of matrices: addition, subtraction, multiplication and
division of complex numbers have their exact analogue in the
corresponding operations on the associated matrices.
Example 3. The affixes of the complex numbers a = 3+j and b = 1 +2j
are A and В respectively. Find complex numbers ρ, ρ', q, q' with affixes
P, P', Q, 0! such that ABQP andABQ'P' are squares.
Find also complex numbers r, r' with affixes R, R' such that ABR, ABR'
are equilateral triangles.
AB = b-a
= -2+j.
To obtain AP and AP' we must rotate AB through ± \v\ that is, we must
multiply {b-a) by ±j (see Ex. 19). Thus
AP = j(-2+j)
= -l-2j
and AP'=-j(-2+j)
= l + 2j.
Thus p = a+(-l-2])
= 3+j-l-2j
= 2-]
and p' = Ъ+] + \+2]
= 4 + 3j.
394

THE ARGAND DIAGRAM
Similarly q = b + (-l-2})
= l+2j-l-2j
= 0
and q = l+2j+l+2j
= 2 + 4j.
To rotate AB through an angle ± \π we seek complex numbers ζ such
that \z\ = 1 and arg ζ = +%π; thus
ζ = cosyn-lj sin \π
= i(i±JV3).
It follows that AR = Ц1 + j V3) (6 - a)
and thus /■ = a+ i(l +JV3) (*- a)
= 3+j+Kl+W3)(-2+j)
= 2-W3 + (f-V3)j.
Similarly AR' = $0 - JV3) (* - a)
and thus r' = 3+j + i(l-JV3)(-2+j)
= 2 + W3 + (f + V3)j.

COMPLEX NUMBERS (1)
Exercise 17 φ)
1. Determine the modulus and principal argument of each of the following
complex numbers, where necessary leaving the modulus in surd form and giving
the argument in degrees, correct to the nearest degree: (i) 3 —4j; (ii) 2 + 3j;
(iii)-l-2j;(iv)i+j; (v)-2 + 5j; (vi) -2-j; (vii) 12-5J; (viii) -9-40J.
2. Simplify the following complex numbers:
(i) (costf+jsintf)3; (ii) (cos θ+j sin 0)/(cos ф-j sin?5);
(iii) (cos β+j sin Θ) (sin Θ+j cos Θ); (iv) (cos ^π+j sin £π)2;
(ν) (cos tjtt+J sin -^tt) (cos t&t +j sin γ^π);
(vi) (cos xr?r + j sin jjtt) (cos -γψτ — j sin γ^π);
(vii) (cos^+j sin^Vicos^-J δίη^π);
(viii) (1 +cos Θ+j sin Θ) (1 - cos 0-j sin φ).
3. Determine the principal argument of the complex number
(sin β+j cos Θ) (cos φ+j sin 0)
(i) when —\n < 0 — β < |π; (ii) when |τ7 < φ — θ < |π.
4. If Ρ is the affix of the complex number z, show how to construct geometrically
the affixes of: (i)l+z; (ii)l-2z; (iii)2j + 3z; (iv) 1-jz; (v) (1-j) z; (vi)j + z2;
(vii) z + z2; (viii) (l+j)(l-z).
5. If Ρ is the affix of the complex number z, show how to construct the affixes
of: (i) z*; (ii) z-z*; (iii) 1+2/z; (iv) (1+z)*; (v) 1/(1 +z); (vi) j/(j-z);
(vii) z/(l-z);(viii)(l-z)-2.
6. If Λ Q are respectively the affixes of the complex numbers z, w show how to
construct the affixes of: (i) 2z+ve; (ii) ^(ζ + 2κ>); (iii) £(3z-ve); (iv) z+jw;
(v) (1 +j) (z + иО; (vi) (z+1) (и>+1); (vii) w/z; (viii) j(z-w).
7. Describe geometrically the set S = {z e C:|z-2-j| < 1} and prove that, for
allze5, V5-1 < |z| < V5+1.
8. Describe geometrically the set S = {z e C:|z-2| < 2} and prove that, for
allzeS, 3 =£ |z+2 + 3j| < 7.
9. Show that |z-6j| < 3^ 10 < |z-5 + 6j| < 16.
10. Show that |z-3-j| < 1^ V10-1 < \z\ < VlO+1.
11. Prove, using mathematical induction, the following results:
(i) |Zl + z,+ ...+z.| =S lzxl + Ιζ,Ι + ...+ Ιζ,Ι;
(ii) \ζΛζ,...ζη\ = |г1||г,||г1|...|г.|;
(iii) aig(z1z1zt...z^ = Σ argz*;
(iv) (z1+zt + z, + ...+z^* = Σ ζ*;
(ν) (zxz2z3...zn)* = ζ*ζ%ζ?...ζ%.
12. The function/: C-* С is defined ЬуДг) = 2z+j. Describe the effect of this
mapping geometrically and find the value of ζ which is invariant under the
mapping, again interpreting your result geometrically.

3] THE ARGAND DIAGRAM
13. Answer the same question as in Question 12 (substituting where necessary,
values for value) for the functions denned by: (i) f(z) = jz+1; (ii) f{z) = z2 + 1;
(iii) flz) = z2 +1 + j; (iv) f{z) = 2z* +1 + j.
14. Show that the points representing the complex numbers 3 + 4j, 7 + 2j, 5 + 8j
in the Argand diagram are three vertices of a square and find the fourth vertex.
Show that the interior of the square represents the set
{zeC.z = 3 + 4j+A(2-j)+/i(l+2j), real λ, μ, 0 < λ < 2, 0 < μ < 2}.
15. Prove that, if S= {zeC: \z-2\ < 1}, then, for all ζ e S, -%n < argz < £π.
16. Prove that the modulus of the quotient of two complex numbers is the
quotient of the two moduli, and that the difference of the two arguments is an
argument of the quotient.
(i) If ζ is a complex number, and = 2, prove that the locus of the
point which represents ζ in the Argand diagram is a circle, and find its centre and
radius.
(ϋ) If "86τϊ)=^·
find the locus of the point which represents z. (O & C)
17. If ζ is the complex number cos 0+j sin Θ, express 1/(1 +z) and 1/(1 -z) in
the form χ+yj.
If \z\ = 1, prove that the real part of 2z/(l -z2) is zero. (O & C)
18. If A, B, C, D are the affixes of the complex numbers a, b, c, d prove that
a + c = Ьл-do ABCD is a parallelogram.
What can you say about the parallelogram if a + bj = c + djl
19. Explain with the help of a sketch, why no complex number ζ can be found
such that argz = j* and |z-4-j| = |z-2-3j|.
20. a, b are real numbers; show that
is represented by a straight line in the Argand diagram.
Miscellaneous Exercises 17
1. By writing 1 =-j2, express ζ = (V3 + l)+j(V3-l) as the product of two
complex numbers and hence write down the values of \z\ and arg z.
2. If |z—2—j| < 2 and \w— 5 — 5j| < 1, find the maximum and minimum values
of|z-Hi
3. Describe geometrically the relation between the complex numbers 6j and
2 + 4j and the complex numbers (i) 1 + 5j; (ii) 3 + 3j.
If λ is an arbitrary real number, what can you say about the complex number
(l+A)+j(5 + A)?(over)
397

COMPLEX NUMBERS (1) [17
If B, Care the affixes of the complex numbers 6j, 2 + 4j, find the two complex
numbers whose affixes, A, A' are such that the triangles ABC, А'ВС are
equilateral.
4. zu z2, и>1, ve2 are four complex numbers and
By considering det (AB), prove that the product of two sums of four squares
is itself the sum of four squares.
5. Write down the product of the complex numbers хг+у^ and хг + уг].
Show that this product has modulus rxr2 and argument θ1 + θ2, where rlt r2 are
moduli and 0l5 #2 the arguments of the given numbers. Solve the equation
^ z-l = ^_
3 + 4] 5j 3 —4j
Show that, if (z-2)/(z-j) is real, the point corresponding to ζ = x+yj in the
Argand diagram lies on a straight line through the points 2 and j, and find the
equation relating χ and у when the ratio is purely imaginary. (O & C)
6. If the complex numbers zl5 z2, z3 are represented in the Argand diagram by
points ZltZ2,Z3, interpret geometrically (giving a justification) the modulus
and argument of the complex number (z3 — z^l{z2— zj.
If the complex numbers a, b, c, x, y, ζ are represented by points A, B, C, X, Y,
Zand if
x-c _ y-a _ z-b
b — c c—a a—b'
prove that the triangles ВСХ, С Α Υ, ΑΒΖ are similar.
Prove also that the triangles ABC and XYZ have the same centroid.
(O&C)
7. The cross-ratio of four complex numbers zl5 z2, z3, z4, written (гггг, ζ3ζύ, is
defined by
, , , \ _ (.Zi-z3Hz2-z4)
(ZlZ2'Z3Z4)-(!^J^7)·
Show that, Ί{(ζ1ζ2, ζ3ζ^ = λ, then the twenty-four possible permutations of
the numbers zl5 z2, z3, z4 give rise to six distinct cross-ratios with values λ, 1 — λ,
λ-ι, (1-λ)-1, λ(λ-1)-1, λ-\λ-1).
If a,b,c,d are four complex numbers, and the complex numbers Wi,Zi
(i = 1, 2, 3, 4) are connected by the relation
_ аг{+Ь
czi + a"
prove that (ζχζ2, ζ3ζ^ = (w^w^, w3w^).
8. A complex number zr is represented on the Argand diagram by a point Zr.
Prove that arg [{z3-ζ^Ι{ζ2-z-^j] is equal to the angle Z,,ZXZ3.
If zl5 z2, z3, z4 are distinct, and the cross-ratio {.z2-z^(Zi-z3)l{Zi~z^) (z2-z3)
(that is (z^Zi, ζχζ3)) is real, prove that, in general, Z1; Z2, Z3, Z4 lie on a circle.
In the exceptional case, what is the relation between Z1; Z2, Z3, Z4?
(O&Q
398

3] MISCELLANEOUS EXERCISE
9. The complex numbers a, b, c, x, y, ζ are represented in the Argand diagram by
points A, B, C, Χ, Υ, Z.
(i) If the triangles ABC, XYZ are directly similar (that is, corresponding
angles are equal in sense as well as magnitude) prove that
Ι ι ι ι I
(ii) If the triangles ABC, XYZ have equal a
| ι ι ι | | ι ι ι |
10. If β and у are complex numbers whose moduli are both equal to 1, prove
that (1+βγ)/(β+γ) is real.
Hence, or otherwise, prove that, if a, b, с are complex numbers whose moduli
are all equal, then (a2 + bc)/[a(b + c)] is real. (O & C)
11. The points А, В, С are the affixes of the complex numbers a, b, с in the
Argand diagram. If the circumcentre of the triangle ABC is at the origin, prove
that the orthocentre H, is the affix of the point a + b + c. Deduce that O, G, Η
are collinear, where G is the centroid of the triangle, and that OH = 30G.
12. Points A, B in the Argand diagram represent complex numbers a, b
respectively. О is the origin, and Ρ represents one of the values of j(ab). Prove that,
if О A = OB = r, then also OP = r, and OP is perpendicular to AB.
А, В, С lie on a circle with centre O, and represent complex numbers a, b, с
respectively. Prove that the point D which represents -bc/a also lies on the circle,
and that AD is perpendicular to ВС.
The perpendiculars from В, С to С A, AB meet the circle again at E, F
respectively. Prove that О A is perpendicular to EF. (O & C)
13. A complex number ζ is represented by a point Ζ on the Argand diagram.
Prove that multiplication of ζ by w is represented by taking the point Z' on OZ
(where О is the origin) such that OZ' = OZ\w\ and rotating OZ' through the
angle arg w.
If ζ is represented by a point on the circle of radius a which touches the ^-axis
at О and lies in the first and fourth quadrants, prove that
z-2a = jz tan (arg z). (O & Q
14. A point Ρ representing the complex number ζ moves in the Argand diagram
so that it lies always in the region defined by
|z-l|<|z-j| and |z-2-2j|<l.
Indicate on a sketch the region within which Ρ lies.
If Ρ describes the boundary of this region find: (i) the value of |z| when arg ζ
has its smallest value; (ii) the values of ζ in the form x+jy when arg (z— 1) = \π.
(London)
399

COMPLEX NUMBERS (1) [17
15. In the Argand diagram А, В, С and D are four points representing the
complex numbers zl5 za, z3 and z4 respectively. If О is the origin, prove that the
triangles OAB and OCD are similar if
£i_ f_3
z2 z4"
The complex number ζ is given by
= (Q±*Y
13J-4/ ■
Find, by drawing to scale, the position of the point representing ζ on the Argand
diagram. Check your result by calculation. (London)
16. The complex number ζ = x+jy = r (cos θ+j sin Θ) is represented in the
Argand diagram by the point (x, y). Prove that, if three variable points z1; z2, z3
are such that z3 = λζ2 + (1 — λ) zu where λ is a complex constant, then the triangle
with vertices zl5 z2 and z3 is similar to the triangle with vertices at the points 0,1
and λ.
ABC is a triangle. On the sides ВС, С A, AB triangles BCA', CAB', ABC'
are described similar to a given triangle DEF. Prove that the centroids of the
triangle ABC and A'B'С are coincident. (O&C)
400

18. Polynomials and partial fractions
1. THE EVALUATION OF POLYNOMIALS
(Unless specifically stated otherwise, we shall assume that all polynomials
have integral coefficients.)
We have seen in Chapter 4 that, given a polynomial P(x) the process of
dividing P(x) by x-a leads to the identity
P(x) = (x-a)Q(x)+R,
where Q(x) is a polynomial of degree one less than the degree of P(x).
The actual coefficients of Q(x) are obtained by the process of long division,
but the process may be shortened by the method of synthetic division
which consists, essentially, of comparing coefficients on both sides of the
above identity, as illustrated in the following example.
Suppose that P(x) = 2х* + Зл:8-5л: + 7 is to be divided by x+3. Then
the quotient, Q(x) will be a cubic polynomial, and we have
2л4+Зл;3 + 0л;2-5л; + 7 ξ (x + 3)(ax3 + bx2 + cx + d) + R.
Comparing coefficients
a = 2, 3a+b = 3, 3b + c = 0, 3c + d = -5, 3d+R = 7
and, working from the left, we have, by successive substitution
a = 2, b=-3, с = 9, d = -32, R = 103.
Observing that successive coefficients of Q(x) are obtained by multiplying
the preceding coefficient by — 3 and adding the result to the corresponding
coefficient of P(x), the work may be set out succinctly:
2 3 0-5 7(-3
-6 9 -27 96
2 -3 9 -32 103
giving Q(x) = 2*3-Эх2+ 9*-32 and R = 103.
Example 1. Find the quotient and remainder on dividing Ах* — 5хг + 1х + 2
byx-5.
4 0-5 7 2(5
20 100 475 2410
4 20 95 482 2412
Thus Q(x) = 4х3 + 20х3 + 95л;+482 and R = 2412.
401

POLYNOMIALS AND PARTIAL FRACTIONS [18
It will be seen that the above procedure constitutes an efficient method
for evaluating R = P(a), even if the coefficients of Q(x) are not required
explicitly.
Ex. 1. Explain how the method above may be modified for division by ax + b,
α Φ 1, by first dividing P(x) by a. Obtain Q(x) and R when
P(x) ξ 2xl-15x3-33xs-x+U
is divided by 2л: + 3. (In such cases Q(x) will not, of course, generally have integral
coefficients.)
Given a polynomial P(x) of say, degree four,
P(x) = a0xi+a1x3+a2x2 + a3x+ai,
the procedure outlined above to calculate P(b) amounts to finding suc-
a0, аф + а^, b{a0b+a^ + a2, Ь[Ь(а0Ь + а^)+а^+а3
and finally, P(b) = b{b[b(a0b + a^+aj+a3}+a4.
Thus the polynomial P(x) has been nested into the form
P(x) ξ x{x[x{a0x + ai) + a2]+a3}+al,
and P(b) calculated by substituting χ = b and working outwards from the
middle bracket. The process is often referred to as nested multiplication:
the reader familiar with the use of hand calculating machines will see that,
using the facility of back transfer, the whole computation may be effected
without having to record any numbers and clear the machine.
Ex. 2. Use a hand calculator to evaluate P(37) where
P(x) = 15χ*-61λ:3 + 46λ:2 + 18λ: + 56.
Details of the use of hand-calculating machines in evaluating
polynomials and, indeed, in a wide variety of other applications will be found
in Numerical Mathematics by A. J. Moakes.
It is often useful to express a polynomial P(x), of degree n, in the form
P(x) = A0(x-a)n + A1(x-a)n-1 + A2(x-a)n-2 + ...+An-1(x-a) + An
(A0, A1} A2, ..., An integral). Such a reduction may be effected by successive
applications of Horner's method of synthetic division. (See Example 1.)
402

U EVALUATION OF POLYNOMIALS
Example 2. Express 4л;4 - 5л;2 + 7л; + 2 in the form
A£x - 5)4 + A,(x -5)a+A2(x- 5)2+A^x - 5) + A,.
4 0-5 7 2(5
20 100 475 2410
4
4
4
20
20
40
20
60
20
95
200
295
300
595
482
1475
1957
2412
4 | 80
ThusP(x) = 4(л— 5у + Щх- 5)3 + 595(л— 5)2+1957(л— 5) + 2412.
Ex. 3. Show that, when χ χ 5,
P(x) ξ 4х4-5;с2 + 7л;+2 « 1957x-7373.
Find Д5-1) correct to 2 significant figures.
Ex. 4. With the notation of Ex. 3, what is the equation of the tangent to the
curve у = P(x) at the point (5, 2412)?
Ex. 5. With the notation of Ex. 3, evaluate P(5), P'(5), P"(5).
Ex. 6. Show how, in Example 2, the values of Ait A3, A2, Аъ А0 enable us to
calculate P(5), P\5), P"(5), P"'(5), i"v(5).
2. DIFFERENCES
Consider the quartic polynomial
/(л-) = л-4-5л;3 + л; + 2.
Tabulated overleaf are the values of f(x) for χ = -4(1)5 (that is, for all
values of χ in steps of 1 from χ = -4 to χ = 5). The first and second
columns give respectively the values of л; and f(x), while each subsequent
column gives the successive differences between entries in the preceding
column.
It is seen from this table that, although the first and second difference
columns follow no readily discernible pattern, the third difference column
contains the terms of an arithmetic sequence and the fourth differences are
constant. This is no coincidence: for any polynomial of degree n, the nth
differences are constant (see Ex. 11).
403

POLYNOMIALS AND PARTIAL FRACTIONS
X
-4
-3
-2
-1
0
1
2
3
4
/
574
215
56
7
2
-1
-20
-49
-58
A/
-359
-159
-49
-5
-3
-19
-29
-9
65
Δ2/
200
110
44
2
-16
-10
20
74
Δ3/
-90
-66
-42
-18
6
30
54
Δ4/
24
24
24
24
24
24
Δ5/
0
0
0
0
0
If we are content to assume for the moment that the fourth differences
continue constant, it is an easy matter to extend the table working
backwards from the constant fourth difference, to supply further values for
the polynomial:
x f Af Δ2/ Δ3/ Δγ

2] DIFFERENCES
Ex. 7. The cubic polynomial f(x) has the values/(0) = 2,Д1) = -8,/(2) =-20,
f(3) = —28. Form a difference table and deduce the values of Д4), Д5).
.Ex. 8. Explain the following check on a difference table : the sum of any column
of differences is equal to the difference between the first and last entries in the
preceding difference column.
Differences may be calculated for non-integral values. For example, in
the table below, differences are calculated for the polynomial
f(x) = x-x3
for values of χ = 0 (0-1) 0-6. (That is, for values of χ from χ = 0 to χ = 0-6
in steps of 01.) Since the values of f(x) are calculated to 3 decimal places
all the differences will also be to 3 decimal places and it is unnecessary to
enter the decimal point; for example, the constant difference (Δ3/), written
as - 6, is really - 0-006.
X
0
01
0-2
0-3
0-4
0-5
0-6
/
0
0-099
0-192
0-273
0-336
0-375
0-384
Δ/
99
93
81
63
39
9
Δ2/
-6
-12
-18
-24
-30
Ex. 9. Extend the table above, back from the constant difference in the Δ3/
column, to obtain Д-0-1) and/(0-7).
If we denote the successive values of the variable χ by л;0, х1г х2, ■■■,
where the step between successive values of χ is constant, the corresponding
values off(x) may be written f0,A,f2, .... Again, we write
Afr=/r+1-/r
and A% = Δ(Δ/,) = Afr+1-Afr, etc.
Similarly, if we work back from x0 through values ли, jc_2, X-a> ···to obtain
2 PPMn 405

POLYNOMIALS AND PARTIAL FRACTIONS [18
values /_i,/_2,/-3, ... of the polynomial, the successive differences are
written ΔΑ, Δ/_2, ···, and Δ2Α, Δ2Α, ···. etc.:
Δ/ Δ2/ Δ3/
Δ/-.
Δ2Α
ΔΑ Δ3/_2
Δ2Α
Δ/0 ΔΑ
Δ%
Δ/
Tf we wish to refer to an explicit value of x, we may write
4/t*) =Λ* + Α)-Λ*),
А2Дх) = Δ/(χ + /ζ)-Δ/(χ),
where /г is the difference between successive values of л; for which the
polynomial/^) is being tabulated.
*Ex. 10. If f{x) = xix-\) ix-2) ... ix-n+ 1) (and thus fix) is a polynomial
of degree ri) prove that, if /is tabulated for integral values of л:,
(i) ДДл:) = nxix-l)ix-2)...ix-n + 2),
(ii) Δ«Λ*) = и(и-1)*(х-1)(х-2)...(х-и + 3).
Suggest a form for Д'Дл:) and prove your conjecture by mathematical induction.
*Ex. 11. Prove that any polynomial fix) of degree η may be expressed in the
form
fix) = AlXix-l) ix-2) ...ix-n+l) + AzXix-l)...ix-n + 2) + ... + Anx+An+1,
where АиАг, ...,Ап+1 are numbers, and deduce, using Ex. 10, that the nth
differences of an nth degree polynomial are constant. Prove furthermore, that
Δ-/= Αχη\.
*Ex. 12. Prove, by mathematical induction, the Gregory-Newton formula
fin) =fi0) + (^Afi0) + ... + h Δ7·(0) + ... + Δ»/(0).
Example 3. The quadratic polynomial f(x) has values/(2) = 15,/(3) = 41,
/(4) = 81. Find an explicit form for f(x).
Since/(л;) is a quadratic polynomial, the second differences will be con-
406
/
*-2 /-2
x-i A
*o /o
Xl /l
*2 /2

2] DIFFERENCES
stant. Thus, we are able to build up the table of differences below to infer
the values of ДО) and Δ/(0).
Δ/ Δ«/
χ
0
1
2
3
4
/
5
3
15
41
81
Thus/(0) = 5, Δ/(0) = -2, Δ2/(0) = 14, and by the Gregory-Newton
formula (Ex. 12)
/(") = /(0) + ("J АЛО) + (у Δ*/(0)
= 5-2и + 7и(и-1)
= 7и2-9и + 5.
The Gregory-Newton formula was only proved for integral n, but
f(x) ξ 7χ2-9x +5
is the unique quadratic polynomial defined by the three values
Д2) = 15, /(3) = 41, /(4) = 81.
(We may thus be permitted to write formally
(l) = X> Q^^*-1^ (з)=*^-1)(^-2), ...
for all real x.)
Ex. 13. The quadratic polynomial f(x) has values Д1) = 0, f(2) = 1, ДЗ) = 6;
use the Gregory-Newton formula to find the explicit form of f(x).
Exercise 18(a)
1. Use the method of synthetic division to find the quotient and remainder:
(i) whenx3-5x2+x+16 is divided by x— 2;
(ii) when x4 + 3x3 - 9л:2 - 23л: +14 is divided by χ + 3;
(iii) when3x4 + x3-^2-l^-24is divided by л:+ 2;
(iv) when x1—5x— 5 is divided by л:—4;
(ν) when 7л:5 - Ъх—5 is divided by χ + 5.

POLYNOMIALS AND PARTIAL FRACTIONS [18
2. Use the method of synthetic division to find the quotient and remainder:
(i) when Ix3 - 5л:2 - 17л: is divided by 2л: + 1;
(ii) when Зх4 + 2x3 - Зх2 + Ίχ+6 is divided by Зл: + 2;
(iii) when 2л* - 9x* + 2л:3 + 7л:2 + Ίχ - 3 is divided by 2л: -1;
(iv) when Ax9· - 9л:3 - 9л:2 + χ + 2 is divided by 2л: + 1;
(ν) when 3χ?-χ*-21χ3+9χ2-1Αχ+12 is divided by Зл:-1.
3. Use a hand calculating machine to evaluate the following polynomials for
the given values of л:, giving your answers to 1 decimal place:
(i) 3·5λ:3 + 7·1λ:2 + 2·8λ:+5·9,λ:= 2-9;
(ii) 7·7λ:3-9·6λ:2-3·8λ:+17·2,λ: = 1-97;
(iii) 8-6jt*-9-2x8 + 7-3xa-5-8x+13-2,jt = 2-61.
4. Express the following polynomials in the form
a0(x-h)n + a1(x-h)n-1+...+an.1(x-h) + a„
for the given values of A
(i) x3 + x2 + 2x+l, A = 1;
(ii) 2л:3-17л:2 + 30л:, А = 2;
(iii) л:3 + 7л:2-6л:-4, А = -3;
(iv) je*-l, h= -2.
5. Use the method of synthetic division to find the quotient and remainder:
(i) when jc* + (1 + 4j) x2-2x + (\+)) is divided by (x+j);
(ii) when jc* + (1 -j) л:3 -jx2 + 3(1 + 2j) x+ (4- 3j) is divided by (x-2j).
6. By expressing P(x) = л:3- 11л:2 + 35л:- 30 in the form
а0(л:-3)3 + а1(л:- 3)2 + сфс- 3) + а3,
estimate (i) Ρ(3·1), (ii) Д301), both correct to 1 decimal place.
7. Express the polynomial P(x) ξ λ:4 + 16л:3 +100л:2 + 288л: +320 in the form
Σ аАх+4У.
What does your answer tell you about the shape of the curve
у = ^+16л:3+100л:2 + 288л:+320
at the point (-4,0)?
Solve the equation л:4+ 16л:3 + 100л:2 + 288л:+ 320 = 0.
8. Show how the method of synthetic division may be modified for division
by the polynomial л:2 + л:+2.
Find the quotient and remainder on dividing
λ:7 + λλ-λ5-3χ4-4λ:3+3λ:2+5λ:+3 by л:2+л:+2.
9. If л:2+1 is a factor o{ x'7 + Axs +x3 +Bx2 + 2x+l, find the values of A and B.
10. If л:2 + 4 is a factor oix*+Axi-x3 + Bx2 -Ax-A, find the values of A and B.
11. A cubic polynomial P(x) has values Д0) = -1, P(l) = 4, P(2) = 21,
A3) = 62. Use a table of differences to find P( -1) and P(A).
12. A quartic polynomial P(x) has values Д-2) = 11, Д-1) - 2, P(0) = 5,
P(l) = S,P(2) = 23, Find ДЗ) andP(-3).
408

2] DIFFERENCES
13. Find the quadratic polynomial f(x) defined by the values
Д1)=-15, Д2)=-20, Д3)=-23.
14. Find the cubic polynomial f(x) defined by the values
Д1) = 3, Д2) = 4, ДЗ) = 21, /(4) = 66.
15. Find an expression for Σ rl as a polynomial of degree five in n.
3. PARTIAL FRACTIONS
The process of adding together fractions with polynomials as
denominators will be familiar to the reader. For example
_2 3_
x-3 x-2
2(x-2)-3(x-3)
(x-3) (x-2)
_ 5~x
(*-3)(jc-2)·
The reverse process is called 'resolving a rational function into partial
fractions'. Such a resolution is of importance in differentiation and
integration and also in obtaining power series expansions of rational functions,
as we shall see later in this chapter.
Two questions arise—can the resolution always be effected, and, if it can,
are the partial fractions obtained unique? Detailed analysis of these
questions lies outside the scope of this book.f
13 7'
tion into arithmetical partial fractions is not unique.
Suppose we are given a rational function of the form
Q(x)
where P(x), Q(x) are polynomials in x. As a first step, if the degree of P(x)
t See, for example, Ferrar: Higher Algebra. We shall convince ourselves, in specific
examples, that the answer to both questions is a qualified 'yes'.

POLYNOMIALS AND PARTIAL FRACTIONS [18
is greater than, or the same as, the degree of Q(x), we may divide out until
this is no longer the case. For example, we should express
χ*-2χ3-3χ2 + χ + 4
We now suppose that this has been done; that is, that we have a rational
function of the form
Let us first consider the case in which Q(x) contains only non-repeated
linear factors. Consider, for example, the function
*+18
(*-3)(2x + l)·
If we can find numerical values A, B such that
x+18 ξ A(2x+l)+B(x-3) (1)
then, on division by (x-3) (2x+1), we have
(*-3)(2x+l) *-3 + 2*+l· K)
From (1), suitable values of A and В can be found by equating со-
efficients: 1=2A + B, 18 = A-3B,
and then solving the resulting simultaneous equations. A preferable method
is to put χ = 3 in (1) to find A, and then to put χ = —J to find B. The values
obtained are A = 3, В = -5 and
x+18 _J 5_
(*-3)(2x+l) x-3 2x + Y
Ex. 16. Students often object that, since (2) is meaningless for χ = - i or χ = 3,
it is invalid to substitute these values in (1) to obtain A and B. Expose the fallacy
in this argument.
Observe that, in our last example, A and В were also given by
A. 3 + 18 (-a + 18.
A "(2x3 + 1)' ΰ~ (-i-3)'
410

3] PARTIAL FRACTIONS
that is, A was found by covering up the term (s - 3) in
s+18
(jc-3)(2jc+1)
and then putting s = 3 in what was left. Similarly, В was found by
covering up the term (2л;+1) and putting χ = —\ in what was left. This
useful technique for obtaining a partial fraction expansion is usually
called the cover-up rule.
Example 4. Express in partial fractions
n 4s2- 19s + 7 ... x3 + x
W (s-l)(s-2)(s + 3)' {П) s2-4'
(i) By the cover-up rule
4s2- 19s + 7 _ 1 /4-19 + 7\ 1 /16-38 + 74
(s-l)(s-2)(s + 3) (s-l)l(-l)(4)j+(s-2)l (1) (5) )
1 /36 + 57 + 7\
+(s+3)l(-4)(-5)j
_2 3_ _5_
jc—1 s-2+s + 3"
(Note the check by putting χ = 0:
2=_2+§+£)
(ii) Since s3 + χ is a cubic polynomial and s2 - 4 is only a quadratic, we
must first divide out
x3+x s3-4s + 5s 5s
s2-4 ~ s2-4 ~ x s2-4'
,T 5s 5s
Now,
(s-2)(s + 2)
x~^2\a)+x~+2\^a)'
by the cover-up rule. Thus we have
x3 + x _ 5 5
s2-4 •X + 2(s-2) + 2(s + 2)·
Ex. 17. Express in partial fractions
... jc+11 л:+
(1) (x-l)(x + 2); (11) *=
('") 335 (fl + °); (iv> 3z

POLYNOMIALS AND PARTIAL FRACTIONS [18
*Ex. 18. A common application of partial fraction expansions is to the
evaluation of certain integrals. Evaluate, by first expressing the integral in partial
fractions:
*Ex. 19. Partial fraction expansions can also frequently be used to simplify the
work in differentiating. Differentiate
х'-Бх+б' К ' x^-Sx-b'
It is worthwhile considering the 'cover-up rule' from a rather different
standpoint. Let us suppose that we have to express
f(x) = 5Jc+l
ЛХ) (jc-3)(2jc-1)
in partial fractions.
We proceed by considering approximations to this rational function.
It has singularities at χ = \ and χ = 3; that is, f(x) is very large when χ
is near \ or 3 (but is not defined at χ = i or χ = 3). Near χ = 3, the
dominant term in f(x) is 1/(jc —3): this term is multiplied by a factor with
numerical value almost exactly equal to
5x3 + 5
2x3-1"
= 4
(the 'cover-up rule' again). We may thus say that f(x) is approximately
equal to .
j^3
near л; = 3.
The approximate shape of the graph of f(x) is shown in Figure 18.1
while the approximate shape of the graph of у = Aj{x— 3) is shown in
Figure 18.2.
Near the common asymptote χ = 3 the two curves are seen to be very
similar in shape and position.
Now consider the deviation off(x) from the approximation:
5s + 5 4
/(*)-
c-3~ (2x-l)(x-3) x-3
_5x+5-Sx+4
(2jc-1)(jc-3)
-3(x-3)
-(2*-l)(*-3)
3__
~ 2x-V

PARTIAL FRACTIONS
Thus we have finally
/(*) =
and our rational function has been expressed in partial fractions.
*Ex. 20. The fact that (x - 3) cancelled at the final stage in the example above was
no coincidence. Suppose that f(x), F(x) are polynomials and that F(x) has a
linear factor χ-a, that is,
F(x) s (x-a)g(x), ЙФО.
Show that, near χ = ex., the rational function f{x)/F(x) is approximately
№ ι
№*-«'
Я*)*(«)-*(*)Л«)
(x-a)g(x)g(a) ■
with deviation
Prove that the numerator of this fraction is a polynomial with a factor (х- а) and
deduce that cancelling is always possible.
The method of obtaining partial fractions outlined in Ex. 20 is clearly
applicable whenever the denominator contains a (strictly) linear factor.
We conclude this section with two worked examples.

POLYNOMIALS AND PARTIAL FRACTIONS
Example 5. Express
in partial fractions.
f(x) has singularities at χ = 1 and χ = -2. The results proved in Ex. 20
suggest that we consider the singularity χ = -2. Near χ =-2,/(л;) is
approximately /4(_2)2_3(_2) + 5 , J_ ^_
I (-2-1)2 / U + 2/ * + 2'
The deviation is
4s2-3;c + 5 3_ 4х2-Зл: + 5-3(л:2-2л:+1)
(*-1)2(* + 2) * + 2 (*-1)2(* + 2)
х2 + Зл: + 2
(х-1)2(* + 2)
(х+1)(х + 2)
(х-1)2(* + 2)'
and thus we have f(x) = -^ + (*_ 1)2.
The reduction can, if desired, be carried one stage further by writing
je+1 s(*-l)+2:
3 1 2
/(*) =
л+2т^-Г(л-1)2'
Example 6. Express 2,19
-(x-2)(x2 + 4)
w partial fractions.
f(x) has just one singularity, at χ = 2. Near jc = 2, f(x) approximates
t0 /22+12\ / 1 \ __2_
U2 + 4/ U-2/ jc-2"
The deviation is
л:2+12 2_ x2+12-2Q2 + 4)
(*-2)(*2 + 4) jc-2 (jc-2) (*2 + 4)
_ (jc-2)(jc + 2)
(*-2)(*2 + 4)
_ x + 2
~ *2+4'
2 x + 2
and thus we have f(x) = ——= —2~ΤΊ·
414

PARTIAL FRACTIONS
Ex. 21. Express in partial fractions
W (χ-2)*(χ+1)' (U) (х+1У(2х+1)'
Ex. 22. Express in partial fractions
,. x+2 x*+l
W (л:-1)(л:2 + 2); W л:3-!-
Ex. 23. By writing x2 + 4 = 0 + 2j) (>-2j), use the 'cover-up' rule to find the
partial fraction expansion for 2
-2)(^ + 4)·
л:2-л:-1
(i) as the sum of partial fractions with rational coefficients;
(ii) as the sum of partial fractions with real coefficients.
Exercise 18(b)
. Express in partial fractions
© ,-zfezbi №
3-7*+2*2' w *2 + 2*' v"' 2*2 + 5*-3'
2x-l , ч 2λ? + 3 , .ч 13ЛГ-19
1+Зл:
х* + х' (х-1)(х + 2)(х
20-х3
-5л: + 6'
(xii)
х2 + 2х3' ^"' (2 + x)(l-2x)2' у™' (2 + х2)(1+х)'
, ** ■ (xivl {Х~1У ■ (хх) χ3 + χ* + χ ■
v ' **(x-2) ·
2. Find numbers А, В, С, D such that
2л:2 + 8л: + 3 = Л(л:-2)3 + Я(л:-2)2 (х-1) + С(л:-2) (х-1) + Щх- 1)
and deduce the partial fraction expansion of
2λ? + 8λ: + 3
(x-2)3(x-l)·
3. Adapt the method of Question 2 to express
9л:3 - Xs + Ίχ -5
(х-1)Чх+4)
η partial fractions.

POLYNOMIALS AND PARTIAL FRACTIONS [18
4. Express
(*+!)(*+2)
in partial fractions.
Hence evaluate n
,?i (r+l)(r + 2)"
Evaluate similarly Σ — .
5. Express
(i) as the sum of two partial fractions with real coefficients;
(ii) as the sum of three partial fractions with complex coefficients.
6. Express 1/(χ»-3χ+2) in partial fractions and deduce the partial fraction
expansion of ?\{2f -3^+1).
Express x2/(Sx2-6x+1) in partial fractions.
4. POLYNOMIAL APPROXIMATIONS:
THE BINOMIAL SERIES
We saw, in Chapter 8, that, for any number χ and any positive integer η
(l+*)»s 1 + $χ+(^χ2+ - + (") *'+··■+*"■
If χ is numerically much less than one, so that, say, its rth and higher
powers are negligible, the identity above may be used to give a polynomial
approximation of degree (/■ -1) for (1 + x)n. Thus, for example, with η = 10,
(1+л;)10« 1 + 10х + 45л;2,
neglecting terms in χ of degree greater than 2 {terms of order 3).
Ex. 25. Use the quadratic approximation to (l+л:)10 obtained above to find
(0-998)10 correct to 4 decimal places.
By using Taylor's series expansionf
f{x + d)= f{a) + xf'(d) + i*«/*(e) + ■ ■ ■ ,
polynomial approximations may be obtained for many other functions.
In particular, taking a = 1, axiaf(x) = xk (к е Q) we have
/'(1) = k, fV) = k(k-l), /w(l) = k(k-l) (k-2)
f See, for example, Siddons, Snell and Morgan: Calculus III.
416

4] POLYNOMIAL APPROXIMATIONS
giving, for sufficiently small *f the binomial series expansion
*Ex. 26. Verify the following expansions:
(i) (l+л:)-1 = \-x+xt-x3 + xi...;
(ii) (1+л:)-2= l-2x+3x2-4x3 + 5x1....
(Notice that this result may be obtained by the formal differentiation of (i).)
(iv) (i+x)i/» = ι + (i*) - ^ (**)■+^ (**)»-(^^ (**)*...;
1.3.5
3!
Ex. 27. Suppose, when χ is very small, that (l+χγ can be approximated by
1 -Vax-Vbx2. Then 1 +x and {1+ax+bx2)2 are nearly equal. Deduce that, for a
'best fit', 2a = 1 and a2 + 26 = 0, giving the first three terms of expansion (iii) of
Ex. 26.
Deduce similarly a quadratic approximation to (1 + x)~* if χ is very small.
Example 7. If χ is sufficiently small for x1 and higher powers to be neglected,
find cubic polynomial approximations for
0) M-9v^9-vV (»)
(1-2*) (2-*)' W(1-*)V(1+*)'
(i) By the 'cover-up' rule
(l-2*){2-*) 3(1-2x) 3(2-*)
* ?(l+2*+4*» + 8*«)-g(l+;[+£+£)
s i + |*+¥*2+f|*3.
t The decision as to what constitutes a 'sufficiently small' χ depends, of course, upon
the accuracy demanded of the answer. In numerical approximations, answers are usually
required to a certain number of significant figures or places of decimals and the point
at which to truncate the binomial series can be found by inspection.
It can be shown that
lim l (»-0(Я-2) :(й_-н:1)х^
for any real n, provided \x\ < 1. The result is also true for χ
for χ = -1 provided η > 0. It is not true for any |x| > 1, ι

POLYNOMIALS AND PARTIAL FRACTIONS
(») Π-vWH+v^C1-*)-1 (!+*>-*
(l-x)V(l+*)
» 1+д:(1-^) + л:2(1-| + |)+Д:3(1-^ + |-А)
ξ 1+^л; + ^л;2+1а6-л;3.
Alternatively, we may write
1
i l+ax + bx2 + cx3.
(ΐ-*)να+*)Λ
Then, using Ex. 26 (v), we have
l-ljc+f*2—hx3 * (1-дс)(1+ах+йдс«+сх»),
giving a-1 = -i, b-a = f, c-6 = -re
or α = i, b = J, с = -!%
as before.
Ex. 28. Solve Example 7 (i) by the method of Example 7(ii).
Example 8. By using the binomial expansion for (1 — 2x)~i, estimate *]5 and
calculate an upper limit for the error in your approximation.
Since (1 -2x J-0-)-i = 1Д/Й) = -W5 we have (see Ex. 26 (v))
W5 = 1-(-А)+У (-А)'-^|г (-Л)'
τ 4|—riw 5[ *. ш -г···,
1+0-1 +0-015+ 0-0025 +0-00044+ 000008.
Thus У5 в 1-11802 and V5 « 2-23604.
To estimate the accuracy of our answer we have to obtain an upper limit
for the sum of the remaining terms in the series. Now the coefficient, ur,
of (Уо)г in the expansion above is given by
Thus
418
1
2r-
.3.5....
r\
=1=2-
< 2wr_!
.{lr-
1
-- <
-1)
2.

4] POLYNOMIAL APPROXIMATIONS
and the sum of the remaining terms in the binomial series above is certainly
less than
= ' ' — (Уй-)6 γ^-j (sum of infinite geometric series)
χ 1-805 χ 10-».
Since 4-4 χ 10~4 and 8 χ 10~6 are overestimates of the fifth and sixth terms
of the series above we may safely conclude that the error in taking 1 · 11802
as JV5 is less than 1-805 χ 10~5 and so our answer of 2-23604 for V5 has an
error of less than 3-61 χ ΙΟ"5.
Ex. 29. In fact, J5 = 2-236068 (6 dp.), and our limit for the error in Example 8
appears to have been rather crude. Explain why this is so. How could a more
accurate estimate of the error be obtained?
Ex. 30. To4d.p.,V5 = 2-2361, while in Example 8 above, we obtain J5 = 2-2360
(to 4 d.p.) although we subsequently show our error to be less than 5 χ Ю-5.
Explain.
The binomial series gives a polynomial approximation to (1 +x)n valid
in the neighbourhood of χ = 0. If we require a polynomial
approximation valid in some other neighbourhood, it is necessary to shift the origin,
as shown in our next example.
Example 9. Obtain a quadratic approximation to (1 +2x)_1 in the
neighbourhood of χ = 3.
(1+2χ)-ι = [7 + 2(*-3)]-1
, where у =
■\№"
since у is approximately zero
= ^-Д-(*-3) + з!з(х-3)2.
The quadratic approximation is best left in this form but it may be
reducedto *&-&* + &*
Notice that this is quite different from the quadratic approximation to
(1 +2*)-1 in the neighbourhood of χ = 0:

POLYNOMIALS AND PARTIAL FRACTIONS
Exercise 18(c)
1. Obtain cubic polynomial approximations for the following expressions in the
neighbourhood of л: = 0:
2. Obtain cubic polynomial approximations for the following expressions in the
neighbourhood of л: = 0.
3. Obtain cubic polynomial approximations for the following expressions, in
the neighbourhood of л: = О:
(i) (1 + x)*; (ii) (1 + 2*)-*; (iii) (1-х)-*; (iv) (2-х)"5; (v) (8-x)*.
4. Obtain cubic polynomial approximations for the following expressions, in
the neighbourhood of χ = 0:
W (\-х)(\-2хУ (U) (х-2)(х-ЗУ (ш) (1+хУ(2 + хУ
x(x+3) _ 1 + 4л:
(1VJ (1 + χ2) (1 + 2x)' W (1 - 2хУ (1 + 2x2)'
5. Obtain quadratic polynomial approximations for the following expressions
m the neighbourhood of χ = 0:
1+x ' w (l+x)V(l + 2x)"
6. Obtain quadratic polynomial approximations for:
(i) (1 + x)-1 in the neighbourhood of χ = 1;
(ii) (2 — x)-1 in the neighbourhood of л: = — 1 ;
(iii) V(3 + 2x) in the neighbourhood of χ = -1 ;
(iv) (2 + Зл:)* in the neighbourhood of χ = 2;
(ν) ——— in the neighbourhood of χ = -1.
7. Prove that . ... „. ,
V(l+x2) + l л:2
and deduce that, if χ is very small,
Vd+*2)+i ^ *~^2"
8. Find, correct to 5 significant figures, the values of (i) (0-998)1'3; (ii) (1-02)1'2;
(iii) (4-01)1'2; (iv) (0-799)1'3.

4] POLYNOMIAL APPROXIMATIONS
9. By substituting χ = 1/1000 in the expansion of (1 - л:)1'3, find (999)1'3 correct to
4 significant figures.
10. Prove that 19-97 < V399 < 19-98.
11. Prove that, if
2-x-2V(l-x)'
then£= 2-x+2V(l-*)-
Deduce that, if χ is small, then Ε is approximately equal to ^x2.
12. If χ is so small that x3 and higher powers of χ may be neglected, express the
fUnCti°n V(4 + x)
l-2x + (l+3x)2'3
in the form, a+bx+cx2. (O & C)
13. Express the function Ε given by
(2x+1)(1 + jc·)
in partial fractions.
Hence prove that, if л: is so large that x~i can be neglected, then
E = 5-^. (O&Q
14. Use the binomial expansion to calculate the value of (16-32)1'4 correct to six
places of decimals. (O & C)
Miscellaneous Exercise 18
1. The polynomial xi + Axs + Bxi-Ax3 + Ax!t+Bx+C is exactly'divisible by
x3+l;fmdA,B,C.
2. If (1+х+хг)п = а0 + а1х+а2хг+ ...+агпхгп, write down the values of a0
and a2n and prove that aT = а^п-г.
In In
Show that Σ aT= 3" and find Σ (-1)^·
If и is an even number, prove that Σ (-1)7+1 a2r-i = 0.
3. Find the value of the constant A for which the expression
2x+3-A(x*+x+l)
has (x—2) as a factor and find the remaining factor.
Hence, or otherwise, put into partial fractions the expression

POLYNOMIALS AND PARTIAL FRACTIONS
4. Provethat χ2 + ^+1 g {x_aHx+2b + a) + a> + 2ab+L
(x-2)(x* + 2bx+l)
into partial fractions
(i) when b = i,
(ii) wheno = 1.
5. Find А, В such that, for all values of χ other than i, -i, -f,
x+1 Ax+B A(x+1) + B
(2л:-1)(2л:+1)(2л:+3) (2x-1) (2x+ 1) (2л:+ 1) (2x+3)'
Find the sum to η terms of the series whose rth term is
(2r-l)(2r+l)(2r+3)·
6. Prove that, if - i < χ «Ξ i,
1.3 , 1.3.5 , 1.3.5.7 ,
Deduce that χ J3 13 ζ 135J
1 + 4+4.8 + 4.8.12 + 4.8.12.164
7. Sum to infinity the series:
,.4 , 1 1.3 1.3.5 1.3.5.7
(1) 1 + 3 + зТб+зХ9 + 3-Х9Л2 + -;
1 1.4 1.4.7 1.4.7.10
(11) +4 + 4.8 + 4.8.12 + 4.8.12.16+""'
,.. 3 3.9 3.9.15 3.9.15.21
} 8 + 8.16 8.16.24 + 8.16.24.32 ;
„ , , 1 1.5 1.5.9 1.5.9.13
(1V) 1 + 6 + 6Л2 + 6Л2Т8 + 6Л2Ж24 + ·-
8. The cubic polynomial P(x) assumes the values 2, 3, 2, 11 respectively for
χ = 1, 2, 3, 4. By writing
Fix) ξΑ + Β(χ-1) + C(x- 1) (x- 2) + D(x- 1) (л:- 2) (л:- 3) find Дл:).
Show how to fit a quadratic polynomial to coincide with the values of
у = sin χ at χ = 0, £π, π.
9. Find a linear approximation to J(3-x) in the neighbourhood of χ = - 1,
and interpret your result graphically.
Show that V3-9 « 1-975.
10. Find the sixth term and the rth term of the series whose first five terms are
on the assumption that the rth term is a polynomial in r of as low a degree
as possible.

19. Complex numbers (2)
1. INTEGRAL POWERS OF
COMPLEX NUMBERS
In Chapter 17 we saw that the multiplication of complex numbers was best
expressed in terms of their moduli and arguments: if ζ = r (cos 0+j sin 0)
and w = s (cos Φ+] sin φ) then zw = rs [cos (θ + φ)+) sin (θ + φ)]. In
particular, if ζ = cos 0+j sin 0 then z2 = cos 20+j sin 20, from which it
follows that z3 = cos 30+j sin 30 and so on; the general result,
z" = cos ηθ+] sin и0 (n е Z+),
may be proved by mathematical induction.
Theorem 19.1. (De Moivre's theorem for positive integral exponent.) For any
positive integer η ^ 0+. ^ ^ = ^ ^+. sin ^
Proof.
(cos 0+j sin 0)"-1 = cos (и-1) 0+j sin (и-1) 0
=> (cos 0+j sin 0)" = [cos (и-1) 0+j sin (и-1) 0] (cos 0+j sin 0)
=> (cos 0+j sin 0)m = cos «0+j sin ηθ
(cos 0+j sin 0)1 = cos 0+j sin 0
and the result holds for all positive integral n, by induction.
Ex. 1. Illustrate de Moivre's theorem using the Argand diagram for the cases
η = 2, 3, 4. What can you say about 0 if (cos 0+j sin#)" = 1 (n a positive
integer) ?
Ex. 2. Express V3 + J 'п modulus-argument form and deduce the value of
(V3+J)9·
De Moivre's theorem may be extended without difficulty to include
negative integral exponents. If we define z°to be 1, this enables us to say
that (cos 0+j sin 0)m = cos «0+j sin ηθ for all integral n.
Theorem 19.2. {De Moivre's theorem for negative integral exponents.) For any
negative integer n,
(cos 0+j sin 0)m = cos «0+j sin ηθ.
423

COMPLEX NUMBERS (2) Г19
Proof. Write η = —m; then m is a positive integer and
(cos (9+j sin &)m = cos mff+) sin mff,
by Theorem 19.1. Thus
(cos 0+j sin 0)" = . . j.
v J ' cos m(9+j sin m(9
= (cos тв-) sin m0), since |cos m(9+j sin m(9| = 1,
= cos >j(9+j sin и0.
Ex. 3. Illustrate de Moivre's theorem, using the Argand diagram, for the cases
η =—2, — 3, —4. What can you say about (i) zn+z~n, (ii) zn—z~n where
Μ = ι?
Ex. A. Evaluate (V3+j)-9.
The results proved in Theorems 19.1 and 19.2 are frequently useful in
deriving further results. We shall illustrate, in the next examples, some of
the techniques most commonly employed.
Example 1. Express cos 6(9 in terms of cos θ and sin Θ.
Since cos 6Θ +j sin 6Θ = (cos 0+j sin Θ)6 we have, using the Binomial
Theorem,
cos 6Θ +j sin 6Θ = c6 + 6jcs5'-15c45'2-20jc35'3+15c25'4+6jc5'5-ie,
where с = cos Θ, s = sin Θ. Comparing the real parts of both sides,
cos 6Θ = cos6 (9-15 cos4 θ sin2 (9+15 cos2 θ sin4 (9-sin6 (9.
Notice that, in Example 1,
(i) by comparing imaginary parts we immediately derive an expression
for sin 6(9 in terms of cos (9 and sin (9;
(ii) the expression for cos 6Θ (but not sin 6Θ) may be written as a
polynomial in either cos (9 or sin Θ.
Ex. 5. Show that
cos 6(9 = 32 cose (9-48 cos4 (9+18 cos2 (9-1.
Ex. 6. Express tan 6(9 as a rational function of tan <9.
Example 2. Express cos6 θ in terms of multiple angles.
Writing ζ = cos θ + j sin (9, we have z-1 = cos θ — j sin (9 and thus
2cos0 = z + z-\
Then 64 cos6 0 = (z + z-1)6
= (z6 + z-6) + 6(z4 + z-4) +15(z2 + z-2) + 20.
424

1] INTEGRAL POWERS
But
ζ6+ζ~6 = (cos 60 +j sin 6(9) +(cos 6(9-j sin 6(9), by de Moivre's theorem
= 2 cos 60,
and similar results hold for z4 + z~4 and z2 + z~2. Thus
32 cos6 0 = cos 60 + 6 cos 40+15 cos 20 + 10.
*£лг. 7. Suggest a quick check on the accuracy of the answer to Example 2.
*Ex. 8. By writing 2j sin 0 = z-z'1, express sin" 0 in terms of multiple angles.
Ex. 9. Evaluate cos6 0d0.
Example 3. Find the sum of the series
χ sin θ+χ2 sin 20 +x3 sin 30 + ... +xn~1 sin («- 1) 0,
w/геге χ is a real number.
The given series is reminiscent of a geometric series, but we have sines
of multiple angles, sin rff, rather than powers of sines, sinr 0. De Moivre's
theorem suggests a possible way of changing from the multiple angle form
into an exponent form. Consider the two series:
С = l+Jccos0 + ;c2cos20+...+;c'l-1cos(n-l)0,
S= xsin0 + .x2sm 20 + ...+ x"'1 sin (n-1)6».
Multiplying the second series by j and adding, this gives
C+jS = 1 +x(cos 0+j sin 0) + x2(cos 20+j sin 20) + ...
+ xn-1(cos(n-l)e+] sin (и-1)0).
Thus, writing ζ = cos 0+j sin 0,
C+jS = 1 +xz + x2z2+... +xn~1zn-\ by de Moivre's theorem
= -: , from the formula for the sum of a geometric series
1— xz
l-x*z* (1-jcz*)
1-jcz -(l-xz*)
= 1-Λ:ζ*-Λ:'Ιζπ+Λ:η+1ζπ-1|ζ|8
l-*(z + z*) + *2|z|2
1 -x(cos 0-j sin θ)-χη (cos «0+j sin и(9) +xn+\cos, (n-1)(9
= +j sin (n-1)0)
l-2xcos0 + x2
since |z| = 1.
425

COMPLEX NUMBERS (2)
Comparing the imaginary parts of both sides we then have
S =
χ sin θ — χη sin ηθ + xn+1 sin (n — 1) θ
l-2xcos0 + ;c2
Ex. 10. What is the value of С:
Ex. 11. If jjtrj < 1, find Σ λ:'si
1. Simplify
(i) (cosiTT+jsiniw)4; (ii) (cos ^π+j sin^)-3;
(Hi) (cosi^—j sin^r)6; (iv) (sin^r+j cos^n)6;
(v) (cos j7r+j sin %π)3 (cos i?r +j sin ^π)4;
(vi) (l+jtaniw)5; (vii) (1 -j cot ^n·)-";
(viii) Q+cos20+jsin20)-4.
2. Express in standard form
(i) (cos in +j sin ^)12; (ii) (1 + j)8; (iii) (V3 -j)4 (1 + JV3)e;
(iv) (cos 0+j sin (9)"/(cos φ -j sin ф)т.
3. Express in terms of sin Θ: (i) cos 4(9; (ii) sin 5(9.
4. Express in terms of cos (9: (i) cos 4(9; (ii) cos 5(9; (iii) . .
5. Express tan 40 in terms of tan Θ.
6. Prove that cos 7(9 = 64 cos7 (9-112 cos5 (9 + 56 cos3 5-7 cos (9.
Write down the seven roots of the equation
64л:7-112x5 +56л:3-7л: = 0,
and also the seven roots of the equation
64л:7-112х5 + 56л:3-7л:-1 =0.
7. Express, in terms of cosines of multiple angles:
(i) cos5 (9; (ii) sin4 (9; (iii) cos7 <9.
8. Express, in terms of sines of multiple angles:
(i) sin5 Θ; (ii) sin7 Θ; (iii) sin3 θ cos Θ.
9. Express sin5 θ cos4 θ in terms of sines of multiple angles.
10. Evaluate
e cos40d0and ι
Jo Jo
11. Sum ton terms the series:
(i) С08л: + С08 2л:+... + С08Лл:;(п) sin л: + 8Ш 2л: + ... + 81П п
12. Sum to η terms the series
cos л: + 2 cos 2л: + 4 cos Зл:+ ... +2"-1 cos nx.
426

1] INTEGRAL POWERS
13. Sum the series
l+Qcostf+Qc
n-l
14. Find Σ 0+1) sin r0.
16. By considering cos (<?i + 02 + θ3 + 0J + j sin (вг + θ2 + θ3 + 04), express
tan (θ1 + 02 + 03 + 0ί) in terms of tan 0b tan 02, tan 03, tan 04.
17. Find Σ sin(2r+l)(?.sin1!'-+1(?.
r=l
18. If ζ = cos 0+j sin (9, show that
z- 1/z = 2j sin (9 and zn- 1/z" = 2j sm л0.
Express sin5 0 in the form a sin 50+ ό sin 3(9 + с sin (9 and hence solve completely
the equation 16 sin5 0 = sin 5(9. (O & C)
2. RATIONAL POWERS OF
COMPLEX NUMBERS
In this section we shall denote a general rational number by pjq, where p, q
are integers and q > 0.
Given a real number r > 0, there is just one positive #th root of rp, that
is, there is just one a > 0 such that r* = a9. We write
a = ri' or a = %r*).
*Ex. 12. Prove that, it p, q are integral and r > 0, there is only one positive
real number a such that rv = a". (Show that, for b > 0, r* = ό5=> ό = α.)
*£лг. 13. Prove that, if? is odd there is just one real number a such that, for real
r, rv = a" and that, if q is even, there are either two or no such numbers.
Now let us consider the problem of finding a complex number w such
that zv = wq, where ζ is the complex number /-(cos 0+j sin 0), r > 0.
Suppose vc = sfcos φ + j sin φ); then
j-^cos />0+j sin pff) = s"{cos qφ +j sin #0).
(/j, q being integers, we may apply de Moivre's theorem to both sides.)
Now two non-zero complex numbers can be equal only if they have the
same moduli, and arguments differing by an integral multiple of 2π. Thus
ρθ + Tkn = дф.

COMPLEX NUMBERS (2)
Whether q is even or odd, we may take s =
,P1±7J^+..,P1±7^
ь-mi·
giving w = .
In this expression, к can take any integral value, positive or negative. Since
values of к differing by q or any multiple of q give rise to the same w, there
are precisely q different values of w given, for example, by taking
£ = 0, 1,2, ...,(9-l);
in other words, a complex number has exactly q #th roots. Since the real
numbers form a subset of the complex numbers, every real number has
q qth. roots; by Ex. 13 at least (q — 2) of these will be complex.
Ex. 14. What are the four fourth roots of: (i) 1; (ii) 4; (iii) -1 ?
If ζ = /-(cos θ +j sin Θ), where θ = arg z, we shall define zvlq to be
r*te(cos^+jsin^).
It is important to realise that zp/s is only one of the qth roots of z*. We
shall sometimes use the notation "Jz11 for zplq; in particular, 4Z = z^·
*Ex. 15. Under what circumstances do (ζ*)1'5 and zTlq represent the same
complex number?
Ex. 16. Verify that the definition given above for z*lq yields the correct value for
г*1", where r > 0 is a real number.
Example 4. Find (— 1 +j)i and the other fifth roots of the complex number
(-i+j).
Writing ζ = -1 + j
= V2(COS|77+jsini77)
and w6 = z,
we have № = 2" cos-— +jsin^-^ .
Distinct values for w are given by к = 0, 1, 2, 3, 4; к = 0 gives (— 1 +j)^.
ThUS (-1 + j)* = 2*(cos *V +J sin 2»·
The remaining four fifth roots of ( —1+j) are 2"llff(cos ^я+j sin y-J-я),
2"lA°"(cos j&r+j sin i&r), 2A"(cos f>+j sin f», and 2^(cos \v+\ sin Ίχή).
428

2] RATIONAL POWERS
Using tables we have
(-1+j)* χ 0-955 + 0-487J,
and similar approximations may be found for each of the other fifth roots
of (-1+j).
Ex. 17. Plot the positions, in the Argand diagram, of the affixes of the complex
number —— + — and of its five fifth roots.
Ex. 18. Find the three cube roots of j, and verify that their sum is zero. Plot their
positions in the Argand diagram.
3. THE nTH ROOTS OF UNITY
The equation zn = 1 has, by the results proved in the last section,
precisely η roots, for
1 = cos2fo7+jsin2fo7,
These η complex numbers are the nth roots of unity. Writing
and applying de Moivre's theorem
Ι ω* = cos hj sin 1
the >jth roots of unity may be written 1, ω, ω2, ω3, ..., ωη_1.
*Εχ. 19. Show that the nth roots of unity are represented by the vertices of a
regular и-sided polygon inscribed in the unit circle \z\ = 1.
Since ωη — 1 =0 and ω Φ 1, we have, by summing the geometric series
in the usual way, the following important result:
1+ω1 + ω2 + ω3+...+ω'1-1 = 0;
that is, the sum of the η nth roots of unity is zero.
*Ex. 20. If ρ is a prime number, and if ξ is anypth root of unity other than 1,
show that the complete set otpth roots of unity may be written as 1,1, ξ2, ..., I*-1.
Discuss possible generalizations of this result for the case where ρ is not prime.
(If you have difficulty in proving the general result, consider the particular cases
ρ = Ъ,р = 4.)
429

COMPLEX NUMBERS (2) [19
Example 5. Prove that (1 + z)n = z*=> Re (z) = -%.
(l+z)- = z-
=> (1 + z) = z£, where ξ is an «th root of unity other than 1,
(1+ζ*) = ζ*ξ*
^> (l+z)(l+z*) = zz*£*
=> l+(z+z*)+zz* = zz* (since ξξ* = 1)
=> Re (z) = -i. (See p. 383 Ex. 14(v).)
Exercise 19(b)
1. Find in standard form the three cube roots of:
(i) 8; (ii) -1; (iii) -j; (iv) (1+j)3.
2. Express in the form a+bj, giving a,bto2 significant figures in (ii)-(iv)
(0 V0-4J); (ii)Vd+2j); (Ш) V(-3-j); (iv)V(i-3i).
3. If ω is a complex cube root of unity, show that the cube roots of z3 are
ζ, ζω, ζω2.
Find the three cube roots of — 2 + 2j and deduce surd expressions for
cos (π/12) and sin (π/12).
4. Find, correct to 2 significant figures, the real and imaginary parts of (2-j)1,e.
5. Simplify:
(i) V(cos θ-i sin 0)5-=-V(cos θ+i sin β);
(ii) (sin Θ-j cos 0)1'3;
K ' л/ \cos30+jsin30/'
(iv) (1 +j cot 0)1'4;
(v) {(cos Θ+) sin Θ) (sin 0-j cos Θ)}111.
6. Plot in an Argand diagram the four fourth roots of 16 and, on a separate
diagram the six sixth roots of 64.
7. Solve the equation z*-z2+1 =0.
8. Solve the equation l + z + z2 + z3 + z4 + z5 = 0.
9. Solve the equation z3-(j-z)3 = 0.
10. Solve the equation (1 + jz)5 - (1 - jz)5 = 0.
11. Show that, if (J—z)" =(jz— 1)", then ζ must be a real number, and find all
the real numbers satisfying this equation.
430

3]
nTH ROOTS OF UNITY
12. If 1, ω, ω2 are the cube roots of unity, prove that:
(i) (α+ω-ωη(α-ω + ωη = α* + 3;
(ii) (l+jW-w2)(l-w+jW2) = 2;
(iii) (a + b) (a + bo>) (α+όω2) = a3 + b3;
(iv)(a+b + c)(a + bb) + c(jj2)(a + bb)2 + c<i)) = a3 + b3 + с3 - ЗаЬс.
13. Describe geometrically the effect of multiplying a general complex number
ζ by ω, a complex cube root of unity. Deduce geometrically that, if \z\ = 1,
then|z + wz| = 1.
14. 1, i, ξ2,Γξ3, ξ4 are the five fifth roots of unity and a, b two given complex
numbers, and if Au A2, A3, Ai; A5 are the affixes of the complex numbers
α + b, a + bi,a + bi2,a + bi3, α + όξ4 show that Λ^ Λ 3Λ4Λ5 is a regular pentagon.
s a factor of the
Show also that
Hence express Δ as a product of linear factors and also as a product of a real
linear and a real quadratic factor.
Factorize into four linear factors the determinant
4. COMPLEX POWERS OF
COMPLEX NUMBERS
{This section may be omitted at a first reading)
If e is the base of the natural logarithms, and if у is a real number, it
seems plausible to assume that, provided d" has a meaning,
On this assumption, ζ = eJ" satisfies the differential equation
dz .
dy = ^

COMPLEX NUMBERS (2) [19
But w = cos y+i sin у also satisfies this differential equation (by direct
verification). Also, when у = 0, ζ = 1 = w and thus ζ = w for all real y:
e3" = cosy+'] siny.
Since ex+ly = e^.eJ»,
it follows that e^+J" = e^cosj+j sin y). (1)
♦Ex. 21. Show that e~Sv = cos^—j sin у and deduce that cos^ = K^+e"1·),
sin^ = — (&y - e~ly). Deduce from these expressions and the infinite series for
ex the series expansions for cos у and sin y.
Ex. 22. Show that cos у = cosh j> and that j sin у = sinh \y.
*Ex. 23. Prove that e*1 = &*<?> z1—zi = 2кщ, к an integer.
Now consider the equation _ w
Given any non-zero ζ we can certainly find a w which satisfies this equation.
For example, if w = In \z\ +j arg z,
e«> _ glnl^l+jarg2
= glnlilelarg»
= \z\ (cos arg z+j sin arg z)
By Ex. 23, any other solution of the equation may be written in the form
w = In \z\ +j(arg z + 2kn). Such an expression is called a logarithm of z;
the particular expression with к = 0 is called the principal logarithm of ζ
and is written In z: , , , . .
In ζ = ln|z|+j argz.
With this definition of the logarithm of a complex number we are in a
position to define a complex power of a complex number:
zW = eWln, (z + 0).
For example, j]' = eJlnl

3] COMPLEX POWERS
while (i+j)i-i = ed-J'^d+J)
= e(l-J){lnV2+i*J>
= 6(1ηΛ/2+ίπ)+ί(ίπ-1ην2)
= V2ei*(cos (i7r-lnV2)+j sin (£π-1η V2))
χ 2-8 + 1-3J.
Ex. 25. Find expressions for:
(i) ln(-l); (ii)ln(-j), (iii)(-l)J; (iv) Kl+j)^.
Ex. 26. Verify that the definition given in Section 2 for zvlq (p/q rational) is in
accordance with the definition given here for a complex power of a complex
number.
Exercise 19(c)
1. If a, b, r, s are real numbers and
и = r&e, ν = se'e,
find:
(i) \au + bv\;
(ii) an argument of the complex number au+bv.
2. Express in the form a + by.
(i) e1-·*»!; (ii) e<"+»J>.
3. Express in the form a + by,
(i) ln(V3-j); (ii) (V3-j)J.
4. If ζ moves once anticlockwise around the unit circle in the Argand diagram,
starting at the point — 1, describe the motion of the point representing zK
5. If ζ is a complex number and sin z, cos ζ are defined by
sin ζ = ^(eJ«-e-i«), cos ζ = ^(fi>z + e-!'),
prove that:
(i) sin2 z+cos2 ζ = 1; (ii) sin 2z = 2 sin ζ cos z;
(iii) cos (in- z) = sin z; (iv) cos (ζλ + z2) = cos zx cos z2 - sin zx sin z2.
6. Show that:
(i) cos (x + jd) = cos χ cosh y-\ sin л: sinh y;
(ii) sin (лг+jd) = sin χ cosh ^+j cos χ sinh y.
7. Show that: . , ... ..
If x+jy = с tan i(u+jv), express хг+уг + сг in terms of u, υ and с If и and с
are positive constants show that the locus of the point (x, y) referred to Cartesian
axes is a circle of radius с cosech υ. (tan ζ = sin z/cos z.) (London)

COMPLEX NUMBERS (2) [1
8. Give a sketch of the representation in the Argand diagram of the two sets:
A = {ze C: \z\ = 1, -%n < Im ζ < \π};
B={weC: zeA, ezw-ez + w + l = 0}.
Miscellaneous Exercise 19
1. Solve the equation zs-zi+1 = 0 and mark the positions of the roots in the
Argand diagram.
2. Prove that, if л is a positive integer,
(cos 0+j sin 0)" = cos ηθ+i sin ηθ.
By putting η equal to 5 in this formula, or otherwise, prove that
sin^=^i. (0&Q
3. Prove that cos 70 = cos7 0(1 -21 tan2 (9+35 tan4 (9-7 tan" (9).
Find the real part of (i+Cos 0+j sin0)« „ . ,
7Γ, a ■ ■ mn· (London)
(l+cos(9-jsin(9)«
4. Express each of the complex numbers zx = (1+j)V2, z2 = 4(-l+j)V2 in
the form r(cos #+J sin 0), where r is positive. Prove that z\ = z2, and find the
other cube roots of z2 in the form r(cos θ+j sin (9). (O & C)
5. Solve the equation z4 + 2z2+3 = 0, giving the real and imaginary parts of
each root correct to 2 significant figures.
6. If α+}β = yj{(a+ib) (c+jd)} where a, b, c, d, a. and β are real, find the value
of a? in terms of a, b, с and d. (London)
7. Prove that the roots of the equation (z+ l)"-(z-1)" = 0 (n > 3) all lie on
the imaginary axis. Illustrate the result geometrically in the case η = 3.
where ω is a complex cube root of unity, prove that £>2 = — 27. (O & Q
9. Evaluate Σ cos (cc+rfi) and Σ sin(a+r/?).
r=0 r=0
10. Ifz= cos 0+jsin#, showthat z+l/z = 2 cos 0 and find the corresponding
result for z—l/z.
Prove that
cos9 0 = тЪ· [cos 80 + 8 cos 60+28 cos 40+ 56 cos 20 + 35].
Evaluate (cos8 0 + sin8 0)d0. (London)
434

3] MISCELLANEOUS EXERCISE
11. If \z\ - 1 and arg ζ = θ Φ 0, express in modulus argument form:
(i) 1+z, (ii) 1-z; (iii) У(^)·
Provethat w = J Ш
lies on a fixed straight line, whatever the value of Θ.
12. Demonstrate the following results geometrically, where ω is a complex cube
root of unity:
(i) (1 + ω) (1+«»■) = 1; (ii) ^~ = ω; (iii) Re(l+2w) = 0.
13. Find the roots of the equation z5 +1 = 0 in the form
cos^+j sin^,
where φ is to be determined.
Deduce, or prove otherwise, that the roots of
16х5-20л:3 + 5л:+1 = О
are cos H(2r+1) π] (r = 0, 1, 2, 3, 4). (London)
14. Solve the equation .,„ .. ., ..
j(l-xj)" = (1+дд)я,
and verify your solution by setting: (i) η - \; (ii) η = 2.
15. Find |w+l+j| and arg(w+l+j) geometrically, where ω is that complex
cube root of unity with positive imaginary part.
Hence, or otherwise, find surd expressions for sin -fer and cos γ^π.
16. Prove that:
(χ+γ + ζ)(χ+ωγ + ω*ζ)(χ+ω*γ + ωζ) = x3+y3 + z3-3xyz,
where ω is a complex cube root of unity.
Hence, or otherwise, solve the following problems:
(i) Prove that the product
(x3 +y3 + z3 - 3xyz) (a3 + b3 + c3 - 3abc)
is expressible in the form
A3 + B3 + C3-3ABC,
where A = ax+by + cz, В = ay + bz + cx, С = az + bx + cy.
(ii) Solve the equation , „
л:3-9л:+12 = 0. (О & С)
17. In the determinant
| a b с d
d a b с
с d a b
b с d a
the cofactors of a, b, c, daxe denoted by А, В, С, D respectively (the expansion of
Δ by its first column being аА+ЬВ+cC+dD).

COMPLEX NUMBERS (2)
By considering the product ΔΩ, where
0 0 1 |
prove that, if ω is any root of the equation x*—l =0, then a+bo) + cu>2 + d(u3
divides into Δ, the quotient being
A + B(jJ3 + C(jJ2 + D(jJ.
Hence show that
A + B+C+D = (a + c-b-d){(a-c)2 + (b-d)2}. (O & С)
18. P, Q are the affixes of the complex numbers p, q and ξ = cos 2π/η +j sin 2π/η.
Locate the affix of the complex numbers (j>-q)i·
If a, b are two complex numbers, prove that the affixes of the complex numbers
zb z2, ..., zn are the vertices of a regular л-sided polygon if and only if
(zT-a)" = b" 0 = 1,2,3,..., n).
19. If Z,, Z2, Z3 are the affixes of the complex numbers zl5 z2, z3 prove that a
necessary and sufficient condition for the triangle ZXZ2Z3 to be equilateral is that
Zl + Zl+ Z$— Z2Z3 — Z3Zi — ZXZ2 = 0.
20. Prove that, if л is a positive integer (n > 1) and
ω = cos 2π/η + j sin 2π/η,
ΐηβη1+ω + ω2+... + ω"-1 = 0.
In the Argand diagram the points Аъ А2, ..., A„ are the vertices of a regular
polygon inscribed in a circle of radius a with its centre at the origin. The complex
numbers represented by the points Ax, Аг, ..., An are zl5 z2, ..., z„. Prove that
zl + z\+... + zl = 0.
The perpendicular distances of the points Аъ А2, ..., An from any given line
through the centre are dlt 4> ···> ^n· Prove that
dl + dl+... rf2 = i«a2. (O & C)
21. If x+jy = tanh (ii+jf) where г/, и, л:, д' are real, find χ and д' in terms of и
andt\
Prove that
лЛ+>'2-2л:соШ2г/+1 = 0
and x2+y2 + 2ycot2v-l = 0.
If и and >> are regarded as variable parameters, and χ and у as Cartesian
coordinates, describe the relationship between the two families of circles.
22. Prove that .,,„.,, . , „ , . , „ . ,
sinh (Р+}ф) = sinh 0 cos 0 +j cosh a sin 0.
For all real or complex values of ζ the sum of the infinite series
«0 + α1ζ + α2ζ2 + ...
is/(ζ). Prove that if ω is a root of the equation ω2 + ω+1 = 0, then
a0 + a3x3 + aex°+...+ = ΗΛ*) +Λωχ) +/(ω2χ)}.
436

3] MISCELLANEOUS EXERCISE
By considering the series for sinh x, prove that
♦ж
23. A sequence uu и2 ··· is denned by иг = 1 and
un+1 = ошп + п+1 (и>1),
where α is independent of л. Find an expression for un when α = 1.
By induction or otherwise show that, when α Φ 1, u, is of the form
un = Accn + Bn + C,
where А, В, С are independent of n, and find А, В and С
If now α is a complex wth root of unity and л is a multiple of m, determine the
real part of u„. (O & C)
24. Prove that n_x . . .
лг2"-1 =(jc-1)(jc+1) Π x2-2xcos — + 1
ы\ я /
and devise a similar expression for x*n+1 -1.
25. Prove that
x*"-2xncosnff+l ="n ix2-2xcos (W — ) + l|.
Deduce the following results:
(i) sin/га = 2""1 Π sin (a+—);
fc=o V n)
(ii) cos noc-cos nfi = 2"_1 Π jcosa-cos (/?+ J J.
From (i) and (ii) deduce, by logarithmic differentiation, the further results:
(iii)cot«a = - Σ cot[oi+ — l, αφ—;
Пк=о \ η) η
(iv) cosec2 ηθ = — Σ cosec2 ((?+— ), 0 Φ —.
26. Prove that л:2+.у2+г2-.уг — ζλτ-λ^ has a linear factor Λ:+ω_ν+ω2ζ, where
ω is a complex cube root of unity. Deduce that, if л: Φ 3, хг + уг + ζ2 —yz - zx - xy
is a factor of (y-z)n+(z-x)n+(x-y)n.

2o. Mappings in the Argand diagram
The equation of a curve in the Argand diagram is the condition imposed
upon the complex number ζ whose affix, P, is any point of the curve.
For example, the equation of a straight line through О and containing the
point В (represented by the complex number b) is
ζ = Xb, λ real.
A straight line not containing О is uniquely defined by the foot, A, of the
perpendicular from О to the line (see Figure 20.1). Suppose A is the affix of
the complex number a and that P(z) is any point on the given line.
Fig. 20.1
Then OP = OA + AP or, in complex number notation,
ζ = a+jAa, λ real. (1)
(Recall that multiplication by j rotates the vector representing a complex
number through %π.) Taking complex conjugates of both sides of (1)
Eliminating λ this gives
z* = a*—jAa*.
a*(z-a) + a(z*-a*) = 0,
a*z + az* = c,
(2)
where с (= 2|α|2) is a real number, as the equation of the line through A
and perpendicular to О A.
*Ex. 1. By writing α = A + &jandz = χ+я, prove conversely that any equation
of type (2) above represents a straight line.

STRAIGHT LINES AND CIRCLES
Ex. 2. What is the equation of the perpendicular bisector of the line joining
the points which represent the complex numbers 0 and 2 — 3j ?
*Ex. 3. Show that
a*z + az* = 2k\a\2, к real,
represents a straight line and describe its relationship to the line
a*z+az* = 2\a\\
Interpret geometrically the quantity Re (a*z).
Ex. 4. Where does the line (1 + 2j) ζ + (1 - 2j) z* = 12 cut
(i) the real axis; (ii) the imaginary axis?
Fig. 20.2
The equation of a circle in the Argand diagram is also easily found.
If the centre of the circle is B, corresponding to the complex number b,
and if the radius is r, then, for any point P(z) of the circumference,
\z-b\=r,
or (z-b)(z*-b*) = r\
which gives, on writing the real number \b\2 — r2 as d,
zz*-b*z-bz* + d = 0 (3)
as the equation of the circle centre В and radius [|6|2— d]i.
Ex. 5. Show conversely, that for real α Φ 0 and real c, the equation
azz* + b*z+bz* + c = 0
represents a circle, provided ас < \Ь\г.
Ex. 6. What is the equation of the circle, centre 1 — 2j and radius 3 in the form
(3)?
Ex. 7. Show that 2zz* + (3-j) z+(3+j) z*+ 1 = 0 is the equation of a circle,
centre -i(3+j) and radius J2. Find the centre and radius of the circle with
equation:
(i) zz*-z(l-3j)-z*(l+3j) + 6 = 0;
(ii) 4zz*-z(2+4j)-z*(2-4j)+l = 0.
3-2 439

MAPPINGS IN THE ARGAND DIAGRAM
[20
We now consider the images of straight lines and circles under three
simple functions /, g, h from the set С into the set С given by
(I) f{,z) = w, where w = ζ+β;
(II) g(z) = w, where w = ccz;
(III) h(z) = w, where w = z_1;
α, /? being complex constants.
(I) w = ζ+β.
This is clearly a translation of the whole plane by an amount equivalent
to the position vector corresponding to the complex number β (see
Figure 20.3).
ζ+β
Fig. 20.3
Geometrically it is obvious that this function maps a straight line into a
parallel straight line, and a circle into an equal circle with its centre
translated an amount β.
Analytically, „ „
crz + az* = с
=>α*(\ν-β) + α(\ν*-β*) = с
=> a*w + aw* = ο + α*β+αβ*.
But α*β+αβ* = 2 Re {α*β); thus the right-hand side is real and the
equation represents a straight line.
Again, for a circle we have
\z—b\=r (r real and positive)
=> \w-{b+jl)\ = r,
which represents a circle of the same radius but with its centre translated
by an amount β.
Ex. 8. Explain why the straight lines a*z+az* = с and
a*w + aw* = β+α*β + αβ*
are parallel.
440

STRAIGHT LINES AND CIRCLES
(II) w = olz.
As shown in Chapter 17 this maps P^ Q where OQ = |a|0i> and
/-POQ = arga; that is, it represents an extension by the factor \a\
followed by an anticlockwise rotation of magnitude arg α (Figure 20.4).
Fig. 20.4
The effect of this mapping upon straight lines and circles may be deduced
analytically as follows:
a*z + az* = с
=> (ад)* w + (oca) w* = c|a|2
and \z-b\ = r
\w-ba\ = r\a\.
Thus straight lines are mapped into straight lines and circles are mapped
into circles under the transformation w = clz.
*Ex. 9 Show that the centre of the ζ circle maps into the centre of the w circle
under this transformation and that the ratio of the two radii is \a\; 1.
Ex. 10. Illustrate geometrically the effect of the transformation w = (1 + j) ζ
upon the circle |z—1| = 1.
(Ill) w = z-\
We must restrict the domain of this function to the whole of С with the
number 0 deleted. To describe the transformation geometrically it is
useful to define the inverse of a point Ρ with respect to a circle. Given a
441

MAPPINGS IN THE ARGAND DIAGRAM
circle, centre О and radius r, the
on OP such that О P.OP' = r2.
of the point Ρ is the point P'
*Ex. 11. Show that the transformation w = z~x maps the point Ρ into the
reflection in the real axis of the inverse of Ρ with respect to the unit circle \z\ = 1
(see Figure 20.5).
Fig. 20.5
Now consider the effect of the transformation w =
straight line and circle. The straight line
a*z + az* = с
1 upon a general
maps into
<№)'■
This represents a circle through O, if с Φ 0, and a straight line through О
if с = 0. Thus, a straight line not containing О maps into a circle through O,
while a straight line through О maps into another straight line through O.
*Ex. 12 Show that a straight line through О maps into its reflection in the real
axis under the transformation w = z_1. What lines map into themselves under
this transformation?
*Ex. 13. Show that, under the transformation w = z_1, the straight line through
A and perpendicular to О A maps into a circle with its centre on the line О A*,
where A, A* axe, the affixes of the conjugate complex numbers a, a*.
Now consider the effect of the transformation w = z_1 upon the circle
zz*-b*z-bz*+d = 0.
442

STRAIGHT LINES AND CIRCLES
The image is the set of points defined by
thatis,by dww* — b*w*— bw +1 = 0.
If d = 0 this represents a straight line, otherwise it represents another
circle. Thus, a circle passing through О maps into a straight line not through
O, while a circle not through О maps into another circle not through O.
*Ex. 14. Show that, under the transformation w = z_1, a circle, centre B, which
passes through the origin maps into a straight line perpendicular to OB*,
where В, В* are the affixes of the conjugate complex numbers b, b*.
*Ex. 15. Show that, under the transformation w = z~\ a circle, centre B, which
does not pass through the origin maps into a circle with its centre lying on the
line OB*, where В, В* are the affixes of the conjugate complex numbers b, b*.
*Ex. 16. Given a circle, centre B, not passing through O, show that the
transformation w = z_1 does not in general map В into the centre of the image circle.
*Ex. 17. Show that a diameter of a circle is mapped onto a diameter of the
image circle under the transformations w = ζ +β and νι> = olz. Show also that,
under the transformation w = z_1, a diameter through the origin is mapped
onto a diameter of the image circle.
To summarize the effect of the mapping w = z_1 upon straight lines and
circles we have (reading -> as 'maps into'):
(i) straight line through О -> straight line through O.
(ii) straight line not through О -> circle through О;
(iii) circle through О -> straight line not through О;
(iv) circle not through О -> circle not through 0.
Example 1. //" г/ге />οί/ζί ζ /ιβί on the circle \z\ = 1, find the locus of the
point w where w = j/(z+j).
Method (i)
Consider the sequence of transformations:
f1:z^u = z+j,
f3:v^W = iv,
upon the circle \z\ = 1. Their effects are shown in the following sequence
of diagrams (Figure 20.6). (For/2, recall the result of Ex. 14.)
The locus is seen to be the line w = i + Aj or w + w* = 1.
443

MAPPINGS IN THE ARGAND DIAGRAM
Method (ii)
-
^>
^>
^>
^>
4*+J) = j
w
|z| = l
IH = |j(i-w)|
N = |(i-w)l
vw* = (l-W)(l-w*)
w + vf* = 1, with the sa
e conclusion as before.
Ex. 18. If ζ moves anticlockwise around the circle \z\ = 1, starting at ζ = j,
how does w move along the line w + w* = 1 ?
Example 2. Show that the transformation
_2z-3+j
W jz-2
maps the unit circle \z\ = 1 into a circle with centre on the real axis and radius
V2.

TRANSFORMATIONS
Method (i)
Write
2z-3+j_2(z + 2j)-3-3j
jz-2 j(z + 2j)
2J z + 2j'
and consider the effect of the successive transformations
l-j)M = 3V2(cos^+jsin^M,
upon the circle \z\ = 1. (Recall the result of Ex. 15 for deducing the effect

MAPPINGS IN THE ARGAND DIAGRAM [20
of/2. Notice that, by Ex. 17, the line joining the successive images of +j
remains a diameter of the corresponding circle.)
Thus, the image is a circle, centre M(l-j)+(3+j)] = 2 and radius
il(3+j)-0-J)l=i|2 + 2j|=V2.
Method (ii)
_2z-3+j
jz-2 '
Qz-2)w = 2z-3+}
_2w-3+)
^ Z jw-2 '
Thus \z\ = 1
|jW-2| = |2W-3+;|
=> (jn, _ 2) (-)W* - 2) = (2w - 3 + j) (2w* - 3 -j)
=> ww*-2)(w-w*) + 4 = 4ww*-6(w + w*)-2j(w-w*) + 10
=> ww*-2(w + w*) + 2 = 0.
This equation certainly represents a circle. Furthermore, since the
interchange of w and w* does not affect the equation, it is symmetrical
about the real axis and thus its centre lies on the real axis. It cuts the real
axis at points given by w = w*,
i.e. x2-Ax + 2 = 0,
i.e. χ = 2 + V2,
giving a radius of л/2, as before.
Ex. 19. With the notation of Example 2, if ζ moves anticlockwiseroundthe circle
\z\ = 1, starting at ζ = j, how does w move around the circle
ww*-2(w+w*) + 2 = 0?
Example 3. The function f: С -> С w defined by Дг) = (ζ— l)2. Prow ί/ζαί
г/ге image о/ г/ге иш7 cw-c/e |z| = 1 ми^ег this mapping is a closed curve,
consisting of all points w with the property that r = 2(1 + cos Θ), where
r = \w\ and θ = argw.
Since \z\ = 1, we may write ζ = cos φ+j sin φ; then
z-1 = cos$5-l+jsini5
= -2 sin2 \φ +2j sin Ц cos y>
= 2 sin i0(-sin i^+j cos i0)
= 2 sin ^{cos (i" + W) +j sin Цтт +Щ).
Thus w = (z -1)2 = 4 sin2 10{cos (я + φ) + j sin (я + φ)) and it follows that
446

TRANSFORMATIONS
;* = 4 sin2 ^φ, θ = π + φ — lkn, where A; = —1, 0, 1, according to which
value is required to bring θ in the range — π < θ < π. Eliminating φ gives
;·= 4cos2i<9 = 2(1+ cos Θ).
In the equation ;* = 2 (1 + cos Θ), any
value of θ in the range 0 < θ < π
determines a unique value of ;* and thus a
unique complex number ;*(cos θ+j sin Θ).
As θ varies the corresponding complex
number moves around a curve Γ which is
clearly closed since ;* returns to its original
value after θ has turned through an angle
of 2π. r = 2(1 + cos ff) is called the polar
equation ο/Γ; the shape of the curve, known
as a cardioid, is shown in Figure 20.8.
Ex. 20. Prove the first part of Example 3 geometrically by constructing the unit
circle and the affixes of numbers z-1, (z-1)2.
Exercise 20
1. The function/has domain {ze C: \z\ = 1}. Find the range of/in each of the
following cases:
(i) /(z) = z + 3; (ii) /(z) = z+j; (iii) flz) = 2z; (iv) /(z) = jz;
(v)/(z)=(l+j)z.
2. In Question 1, if the point ζ is regarded as moving anticlockwise around the
circle |z| = 1, starting at ζ = 1, describe the motion of the image point w = f(z)
in each of the cases (i)-(v).
3. The function/: С -> С is defined by/(z) = z3 + z2 + 2z+1. Find the values of
ζ which remain invariant under this mapping and illustrate your answer by
reference to the Argand diagram.
4. The function /: С -> С is defined by
f(z) = az* + bz + c,
where a, b, c, e C. If Д1) = 0, ДО) = 2j, f(j) = 1 +j, show that just one ζ
remains invariant under this mapping, and find its value.
What two complex numbers map into 3 + j ?
5. As ζ moves once anticlockwise around the unit circle \z\ = 1, starting at
ζ = 1, describe the motion of the point w where:
(i) w = z2, (ii) w = z3; (iii) w = jz4, (iv) w = z~\
6. Show that the affixes of the сотрГех numbers 1, i(-l+V3J)> i( — 1 — V3J)
form the vertices of an equilateral triangle. Describe the effect of the transforma-
447

MAPPINGS IN THE ARGAND DIAGRAM [20
tion w = jz +1 + j upon this triangle, illustrating your result by a sketch of the
Argand diagram.
7. Answer the same question as in Question 6 for the transformation
w=(l+j)z + l+j.
8. Show that, under the mapping defined by w = j/(z-l), the interior of the
circle \z-1| = 1 is mapped into the exterior of the circle |w| = 1. Show that two
points remain invariant under this transformation and locate their approximate
positions on an Argand diagram.
9. Show that, under the mapping defined by w = (j-jz)/(l +z), the interior of
the circle \z\ = 1 is mapped into the half-plane Im (ve) > 0.
10. Show that, under the mapping defined by w = (j + 2jz)/(l —z), the image of
the set {z e C: \z\ < 1} is the set {w e C: Im(vf) > - i).
11. Find the equation, in the form a*z + az* = ό, of the line joining the points
ζ = 1 and ζ = j.
Find the image, under the mappings defined by w = z\(z + 1), of the set of
points represented by the interior of the triangle with vertices ζ = 0, ζ = 1, ζ = j.
In nos. 12-17 find the image of the circle \z\ = 1 under the given
transformation. If the image is a circle, find its centre and radius.
1 + z
18. Prove that, if (z - 8j)/(z + 6) is purely imaginary, the locus of ζ in the Argand
diagram is a circle with centre at the point 4j - 3 and radius 5. (O & C)
19. Show that the set of points {z} satisfying
where θ is a real number in the interval - π < θ < η and α, b are complex
numbers, is represented by the arc of a circle through the affixes А, В of α, b.
What is the condition for a point ζ to lie on the other arc of this circle?
20. Show that, if & is a real number not equal to 1, the set of points {z} satisfying
is a circle, with its centre the point С on AB such that CAjCB = k2, and that A
and В are inverses with respect to this circle.
What can you say about this circle (i) if к = 0; (ii) if к is very large?

EXERCISE 20
Draw on the same diagram the system of circles obtained for various values of
к by taking a = 1, b = -1.
What happens if к = 1 ?
21. Show that the transformation denned by w = 1/(3+j-z) maps the circle
|z-2-j| = 1 into the straight line Re (w) = i. What transformation maps the
line Re(z) = } into the circle |w-2-j| = 1?
22. Show that the transformation defined by w = 1 + z2 maps the unit circle
|z| = 1 into the unit circle \w—1| = 1. Draw a sketch of the Argand diagram,
construct the point 1 + z2 and deduce the above result geometrically.
23. Show that the transformation defined by w = 1/(1 -z)2 maps the unit circle
\z\ = 1 into the curve with polar equation 2r(l +cos Θ) = 1. Draw a sketch of
this curve, indicating on it the images of the points 1, j, —1, -j.
24. Prove that if 1
then the points on the Argand diagram defined by making A constant lie on a
circle, and the points defined by making μ constant lie on a circle.
Prove also that, whatever be the values of the constants, the centres of the two
systems of circles obtained lie on two fixed perpendicular lines.
25. In the transformation defined by w - l/(z-j), describe the motion of the
image point w if ζ moves anticlockwise around the rectangle with vertices 0, b,
b + aj, oj, starting at the origin (where a, b are positive real numbers).
26. The points Ρ and Q represent, in the Argand diagram, the complex numbers
ζ and l/(z2 + 2). The point Ρ describes a quadrant of a circle, from the origin
along the real axis to the point ζ = a, round the arc of the circle \z\ = a to the
point ζ = ja, and back along the imaginary axis to the origin. Describe the path
traced out by Q (i) when a = 1, (ii) when a = J2.
27. Find the transformation of the form
_az + b
W cz + d'
which maps 0 into j, j into 0 and 1 into — 1.
Show, that, if this particular transformation maps the complex number ξ into
the complex number i/, then it also maps η into ξ.
Can you generalize these results in any way?
28. Show that, if the image of the complex number ζ under the transformation
w = z + z-1 is real, then \z\ = 1.
Deduce a geometrical construction for determining the roots of the quadratic
equation z2 + az+1 = 0 where α is a real number in the interval -2 < a < 2.
Extend your results to deal with the quadratic equation z2 + az + b = 0, where
a, b are real numbers such that a2 < 4b, by considering the transformation
w = z + bz~\
449

21. Quadratic equations and
quadratic functions
1. THE QUADRATIC EQUATION
We have shown in Chapter 19 that it is always possible to extract the
mth root of a complex number ρ; that is, it is always possible to solve the
nation ^_p = Q
In particular, if m = 2, the quadratic equation
w2-p = 0 (1)
has roots +Jp and -φ, and the real and imaginary parts of these
two numbers may be calculated to any required degree of accuracy.
The more general quadratic equation
az2 +bz + c = 0 (a, b, с е С, а + 0) (2)
may be reduced to form (1) by the transformation
—*έ·
a process usually known as 'completing the square':
az2 + bz + c = 0
^> z2 + -az=-? (α + 0)
/ b\2 b2-Aac
-b±J(b2-4ac)
Z~ la
If b2 = 4ac, only one root is obtained, otherwise equation (2) has two
distinct roots. For uniformity it is useful, in the case b2 = Aac, to say that
(2) has two coincident roots, or a repeated roof, with this convention, every
quadratic equation has two roots.
Ex. 1. Solve the equation (3+j) z2 - 8jz - 6 + 2j = 0.
450

1] QUADRATIC EQUATION
If the two roots of equation (2) are denoted by α and β then az2 + bz + c
has linear factors (z - a) and (ζ - β) and we may write
az2 + bz + c = α{ζ-α){ζ-β),
or az2 + bz + c ξ αζ2-α(α.+β)ζ + ααβ.
Thus α+β = -δ/α (3)
and a/? = cja. (4)
Relations (3) and (4) frequently enable us to avoid the explicit solution
of a quadratic equation, especially in those cases where we are concerned
with symmetrical relations between the roots. (Particularly important
cases of this arise in analytical geometry. See Chapter 22.)
Example 1. If the roots of the equation (1 +j) z2-2jz+1 -j = 0 are α., β
find:
(i) \+\; (") "2+/?2;
(iii) χβ*+χψ, (iv) a3+/?3;
(v) ct+β*; (vi) |a-/?|.
From (3) and (4).
(i)
(ϋ)
H-
Since a
cc + β
αβ
-, β both
*+β -
χβ =
-J
-i?j-I+i
satisfy the given equation:
(l+j)"2
(l+j)/?2
-2ja+l-j=0:
-2j/?+l-j = 0:
whence, by addition,
(1+])(*2+β2)-2)(χ+β) + 2-2} = 0
Buta+/? = 1+j; thus
(1+))(*2+β2) = 2j(l+j)-2 + 2j
= -4+4j
giving χ2+β2 = 4j.

QUADRATIC EQUATIONS AND FUNCTIONS [21
[Alternatively we could write a2+/?2 = (a+/?)2-2a/?, but this approach
is less easy to generalize to higher powers—see (iv).]
(iii) φ+βα? = *β{α?+β*)
= -J(4j), by (ii),
= 4.
(iv) Since (1 + j) a2 - 2ja +1 - j = 0
we have (1 + j) a3 - 2ja2 + (1 - j) a = 0;
similarly, (1 + j) β* - 2j/?2 + (1 - j) β = 0
whence, by addition,
(1+j) (а3+Я-2Ха2+/?2) + (1-.|) ("+/?) = 0.
Thus (1 + j) (a3 +Л = 2j(4j) - (1 - j) (1 +j), by (ii),
= -10,
giving a3+ β3 = -5 + 5j, on multiplying both sides by i(l -j).
[Alternatively, α3+/?3 = (a+/?) (a2-a/?+/?)
= (a+/?){(a+/?)2-3a/?},etc]
(v) As in (iv)
(1 + j) (<* +/?4) = 2j(a3+Л - (1 - j) (a? + Л
giving
Thus,
= 2j(-5 + 5j)-(l-j)(4j)
= —14—14j
a4 + /?4 =-14.
(1+j) («·+/?«) = 2j(a*+/?«)-(l-J)(as+^
= 2j(-14)-(l-j)(-5 + 5j)
= -38j
giving a5+/?6 = -19-19J.
[Alternatively,
(а2+Л (а3+Л = afi+p* + a*fl\a+fi), etc.]
(vi)
(a-/?)2 = (a+/?)2-4a/?
= 6j,
|a-/?|2 = 6,
|a-/?| = V6.
Example 2. The roots of the equation 2z2 — 3z + 5 = 0 are α and /?. Form
ί/ге equations: (i) vWi/г rooii oc + μ,β + μ; (ii) vf/г/г roote λα, λ/?; (iii) with
roots а2, Л

1] QUADRATIC EQUATION
(i) If w = ζ + μ, then, when 2z2-3z + 5 = 0, w = α + μ or β + μ. Thus,
ct + μ and β + μ are the roots of the equation
2(\ν-μ)2-3(\ν-μ) + 5 = 0,
i.e. 2w2 - w(3 + Αμ) + 2μ2 + 3μ + 5 = 0.
Alternatively, cc + μ and/? + /t are the roots of the equation
w2 + Aw + B = 0,
where A = -(a+/?+2/t), Β = (α + μ) (/?+/*) = αβ + μ{α+β) + μ*.
Thus Л = f — 2/t, Β = \ + \μ + μ2,
leading to the same equation as before.
(ii) With reasoning similar to that of (i), we make the transformation
z-> (l/λ) w: the equation
2νν2-3λνι> + 5λ2 = 0
has roots λα, λβ.
Alternatively, the equation is
w2 +Cw + D = 0,
where С = -λ(α+/?) = -|A; D = λ2<χβ = |λ2, etc.
(iii) If w = z2, then either ζ = wl or ζ = — w%. Thus we have
2z2-3z + 5 = 0
o2z2 + 5 = 3z
■neither 2vi> + 5 = Ъ\Л or 2vi> + 5 =-3vi>£
o{2w + 5-3w^} {2и> + 5 + Зи>*} = 0
oAw2-Uw+25 = 0,
on multiplying out and rearranging terms; and this last equation has roots
cc2, β2.
Alternatively, the required equation is
w2-Ew + F = 0,
Where Ε=α2+β2={*+β)2-2*β, F = a2/?2 = (a/?)2,
giving £=(f)2-5, F=(%)2, etc.
453

QUADRATIC EQUATIONS AND FUNCTIONS [21
Ex. 2. If α, β are the roots of the equation 2z2 - 3z + 7 = 0, find:
(i) a2+/?2; (ii) a3+/?3; (iii) \α-β\; (iv) ^+^2.
£лг. 3. If α, /? are the roots of the equation 3z2 - 5z + 3 = 0 form the equation
(i) with roots 2a, 2/?; (ii) with roots a- 1, /?- 1; (iii) with roots a2, /?2.
2. QUADRATIC EQUATIONS WITH
REAL COEFFICIENTS
If in the equation , .
az2 + bz + c = 0,
a, b, с are all real (and a is non-zero) all the previous results naturally still
hold but the additional restriction placed upon the coefficients enables us
to deduce further results. In this section it will be assumed, unless explicitly
stated otherwise, that a, b, с are real numbers and α φ 0.
*Ex. 4 If fix) = ах? + Ьх + с and if/(a) > 0, /(β) < 0 (μ, β real) prove that
the equation ax2 + bx+ с = 0 has a root lying between α and β. Why is there no
corresponding theorem if a, b, с are complex ?
Theorem 21.1. If the equation
az2 + bz + c = 0 {a, b, с e R, a + 0)
has a complex root a, Im (α) Φ 0, then its other root is the complex
conjugate a*.
Since act2 + bet + с = 0, we have by taking the complex conjugate of both
sides' ew+w+c'-o·.
But a* = a,b* = b, c* = c, 0* = 0 and thus
a(ct*)2 + bct* + c = 0
and the result follows.
*Ex. 5. Explain why the proof of Theorem 21.1 breaks down if a, b, с are not
restricted to the set R. Explain also why the condition Im (α) Φ 0 was added in
the enunciation of the theorem.
*Ex. 6. Prove that a quadratic equation with real coefficients either has no non-
real root or two distinct non-real roots.
Ex. 7. If α, β are two non-real numbers with the property that
Im(a+/?) = Im(a/?) = 0,
prove that α, β are complex conjugates.
There is an analogous result to Theorem 21.1 for irrational roots of a
quadratic equation with rational coefficients.

2]
REAL COEFFICIENTS
Theorem 21.2. If the equation
az2 + bz+c = 0 (a, b, с e Q, a * 0)
/km α rooi p + q^r where p, q, r are rational (q φ 0) and φ is irrational, then
the other root of the equation is ρ - q*]r.
(Notice carefully the change in conditions for this theorem: a,b,c are
now restricted to be rational numbers.)
a(p + q*]rY+b(p+q<Jr) + c must be of the form P+Q-Jr, where Ρ and Q
are rational. Thus p<n ι — η
since ρ +qψ is a root of the given equation.
But this is possible only if Ρ = Q = 0 since ψ is irrational; thus
and it follows that p — q^r is a root of the given equation.
Ex. 8. Explain why the proof of Theorem 21.2 breaks down if a, b, с are not
restricted to the set Q. Explain also why the condition ?Ф0 was added in the
enunciation of the theorem, and where the fact that Jr is irrational is used.
Exercise 21 {a)
1. Write down the sum and product of the roots for each of the following
equations:
(i) z2-3z-7 = 0; (ii) 2z2-4z+ll = 0; (iii) (1+j) z2-z + (l-j) = 0;
(iv) (z+j)2 = (4-j)z; (ν) (z-l+j)« + z» = (2jz-l)2.
2. Write down the equations with roots:
(i) -2,3; (ii) -if; (iii)3-V5,3 + V5; (iv)2-V2,3 + 2V2; (v)3-4j,3 + 4j;
(vi) Kl-ty3), K1+JV3); (vii) l + 2j, 1 -j; (viii) V2-J, W2j.
3. Solve the following equations, giving the real and imaginary parts of the roots
correct to 2 significant figures. Check your solutions by calculating the
approximate product of the roots (using a slide rule):
(i) 2z2-3z-7 = 0; (ii)2z2-3z + 7 = 0; (iii) z2 + (l+j) z+(l-2j) = 0.
4. Find л + β, αβ, α2+/?2, (1/α) + (1//?) and α4/?+α/?4:
(i) when α, β are the roots of the equation z2-3z —9 = 0;
(ii) when α, β are the roots of the equation 3z2 - ζ - 5 = 0;
(iii) when α, β are the roots of the equation z2-jz-(l -j) = 0.
5. α, β are the roots of the equation 2z2-9z-4 = 0. Find the values of
(i) (1/α)+(1/0; (ii)(a//?2) + G%2); (iii) |a-/?|; (iv) |a2-/?2|.
6. α, β are the roots of the equation 3z2-2z-7 = 0. Find the values of :
(i) (a + k) (β + к); (ii) a3 + β3; (iii) (1 + a)-1 + (1 + β)~\
7. α, β are the roots of the equation (z-a)2 = Azb. Find:
(i) a2 + /?2; (ii) (1/α) + (1/Α· (ш) \α-β\.
455

QUADRATIC EQUATIONS AND FUNCTIONS [21
8. The equations
(b2x2 + a2y2 = a2b2,
\ у =mx + c
are solved simultaneously to give two pairs of solutions χ = Χι, у = yi ar»d
x = *2> У = >V Find iUi+*2> and iOi+^a).
9. If a, /? are the roots of the equation 2z2-z —7 = 0, form the equations with
roots:
(i)a-l,/?-l; (ii) 10a, 10/?; (iii) (1/a), (1//?); (iv) a2, β2.
10. If a, /? are the roots of the equation (1 +j) z2-2jz+(2-3j) = 0, form the
equations with roots:
(i) a-2j, /?-2j; (ii) (1 +j) a, (1 +j) β; (iii) a2, β2.
11. If a, /? are the roots of the equation 2z2 - 5z + 4 = 0, form the equations with
roots:
(i) α-/?,/?-α; (ii) α+j/?, /?+ja; (iii) α2 + α,/?2+/?.
12. The roots of the equation 4z2 + az-37 = 0 differ by 1; find the possible
values of a.
13. The roots of the equation z2 — az+ 9j = 0 are λ and jA. Find the possible
values of a.
14. The roots of the equation (2—j) z2 + (3+j) z-4-5j = 0 are α and β. Form
the equations with roots (i) (1/a), (1//?); (ii) α*, β*.
15. If the roots of the equation az2 + bz + c = 0 are α, β prove that
-aw2 + bjw + c = aQw-cc)(jw-fi).
16. If a, b are real and non-zero and z2 + az + b = 0, where ζ is non-real, prove
that |z| = 1 => й = 1 and [«[ < 2.
17. Discuss the application of the method of completing the square to the vector
quadratic equation , „ ,
4 дг.г + Ь.г + с=0 (α Φ 0).
18. The two equations
z2 + az+b = 0,
z2 + Az + B= 0
have a common root. Prove that
b(a-Ay-a(a-A) (b-B) + (b-B)2 = 0.
19. If one root of the equation az2 + bz+c = 0 is the square of the other, find a
relation connecting a, b, с
20. If one root of the equation az2 + bz + с = 0 is j times the other, find a relation
connecting a, b, с
21. Prove that, if one of the roots of the equation z2 + za+1 = 0 has unit
modulus then a is real.

3] QUADRATIC FUNCTION
3. THE QUADRATIC FUNCTION
A function/: R-> R defined by
f(x) = ax2 + bx+ с (a, b,ceR)
is called a (real) quadratic function. Since in this section we shall be
primarily concerned with inequalities, we confine our attention to real
quadratic functions and the coefficients a, b, с as well as the variable x,
will be assumed to be real.
We now derive a necessary and sufficient condition iorf(x) to be positive
for all real values of x.
Theorem 21.3. If a is not zero, then
ax2 + bx+c > 0 for all real χ ο α > 0 and b2 — Aac < 0.
Proof. Write у = ax2 + bx+c.
f , b b2 с b2\
Then y=a\x2 + -x + -r-. + — -^
[ a Aa2 a Aa2)
(( b\2 4ac-b2\
= α[{Χ + 2*) +-4Η- (1)
(i) If a > 0 and b2 < Aac, then у > 0 from (1), since
(Х + Ш > Oforalljc·
(ii) If ax2 + bx + с > 0 for all χ, then certainly α > 0, for we may choose
χ such that
(X+2a) +^F~ > α
But, if b2 > Aac, the equation ax2 + bx + с = 0 has real roots, α, β say, and
ax2 + bx+c = α(χ-χ)(χ-β),
from which it follows that γ<0ϊΐα<χ<β. But we are given that у is
always positive; thus b2 < Aac.
*Ex. 9. Give a geometrical demonstration of the"truth of Theorem 21.3.
*Ex. 10. Prove that
ax* + bx+c <0 for all real x<s>a < 0 and b2-4ac < 0.
The quantity Δ = b2 — Aac occupies a position of central importance
when considering the quadratic expression ax2 + bx+c or the quadratic
457

QUADRATIC EQUATIONS AND FUNCTIONS [21
equation ax2 + bx+c = 0; Δ is called the discriminant of the expression
ax2 + bx+c.
*Ex. 11. If the roots of the quadratic equation ax2 + bx+ с = 0 are α, β, prove
that α\α,-β)2 - Δ. If the coefficients are real, what does this tell us about the
reality of the roots of the quadratic equation?
Example 3. Determine the range of the function f: R-* R defined by
fix) = x+a
JW x2 + ax+l
for different values of a.
If x2 + ax +1 = 0,/(л;) is undefined; excluding this case, if у is a value
assumed by the given expression then
y(x2 + ax+\) = x + a,
or yx2+ (ay-1) x + (y-a) = 0.
Thu sy is a possible image of the function / if this quadraticf in χ has
real roots; that is, if , , 4, . . ч
(ay-I)2 > 4y(y-a)
or Ε ξ (a2-A)y2 + 2ay+\ ^ 0.
To determine what values of у satisfy this inequality we have, by
Theorem 21.3, to consider the discriminant of Ε and the sign of the coefficient
of y2. The discriminant of Ε is
Δ = Aa2-A(a2-A) = 16
and thus, since Δ > 0, the equation Ε = 0 has real roots α, β (where,
in fact, α = (a + 2)-1 and β = (a-2)-1). Thus we have
provided \a\ Φ 2.
The coefficient of y2 in Ε shows us that the critical values of α are +2.
0) N < 2:
For Ε to be greater than or equal to zero, у must lie between α and β;
the range is thus {y e R: *< у ζ β}.
(ii) \a\ > 2:
By a similar argument, the range is
{yeR: у ^ α or j ^ /?}.

3] QUADRATIC FUNCTION
(iii) a = 2:
Ε = Ay+1 and thus the condition Ε > 0 gives us the range
{yeR:y> -$.
(iv)fl=-2:
By a similar argument, the range is
{yeRiy^i}.
Ex. 12. Illustrate the solution of Example 3, by drawing rough sketches of the
curve у = (x+a)/(x* + ax+1) in the separate cases a = -4, a = - 2, a = 0, a = 2,
α = 4. [Express^-1 in the form ζ = .у"1 = χ + (χ+a)'1 and deduce that the graph
of ζ has turning points at x+a = ± 1; hence sketch the у curve.]
Miscellaneous Exercises 21
1. If α,, β are the roots of the equation z2-5z—11 = 0 form the equation
(i) with roots α + (βΙα),β + (α,Ιβ); (ii) with roots (α+j/?, ja+/?).
2. If a, /? are the roots of the equation az2 + bz + c = 0, find the roots of the equa-
tl0n a*z* + (lea - ό2) ζ + с2 = 0 in terms of α, β.
3. If a, b, c, d are real numbers, prove that the roots of the equation
can be complex only if b and с are of opposite signs.
4. Show that, if χ is real,
5. Show that, if χ is real, the expression
2л:2- 13л:-
cannot assume any value between 25/9 and 9.
6. Show that, if л: is real, the expression
л:2-4л:+3
л:2-6л:+8
can attain all real values.
7. If a,/? are the roots of the equation ax2 + bx+c = 0 express the roots of
the equation , , , ,. ... , .
^ ac(x2 +1) - (ό2 - lac) χ = 0
in terms of α and β.
Prove that, if л:ь х2 are roots of the equation
(л:2 + 1)(а2+1)-иал:(ал:+1) = О,
then (*?+l) (*| + 1) = шл:1л:2(х1+л:2). (О & Q
459

QUADRATIC EQUATIONS AND FUNCTIONS [21
8. x, у are real numbers.
(i) Show that, if χ and у are connected by the equation
x*+y*-2x+4y-4 = 0,
then -2 < χ < 4and -5 «Ξ у < 1.
(ii) Show that, if л: and у are connected by the equation
x2 + xy-2y2-3x+3y + 9 = 0,
then χ may have аиу value, but у cannot lie between -1 and 3.
9. Show that, when ,
Хл:-1)(л:-3) = л:-2,
л: is real for every real value of _y.
Sketch the graph of у as a function of л:, indicating the behaviour of the graph
when χ or у is large. (O & C)
10. Two quadratic equations have roots α, β and α + λ, β+λ respectively;
prove that their discriminants are equal, provided they are written with their
leading coefficients as unity.
11. If a, b, c, p, q are all real and if the solutions of the simultaneous equations
px+qy = 1, ax2+by2 = 1
are all real, prove that ab < aq2 + bp2.
12. If a, b, h, λ are all real, discuss the reality of the solution of the simultaneous
equations „ „, , . .
ax2 + 2hxy+by* = A,
x2+y2 = 1.
by putting χ = cos Θ, у = sin θ and considering a quadratic equation in tan Θ.
13. Show that, if , .4, ,.
X*-2)0-3) = лг-1,
then л: is not real when у has any value between - 3 - 2^2 and - 3 + 2^2. (O & Q
14. Show that, if p, q, r are not all equal and if
y (x-q)(x-r)'
then χ is real for every real value of у provided thai
(p-4)(p-r)<o. (o&C)
15. If α, β are the roots of „ л
x2-2ax+ab+c = 0
and y,S are the roots of x,_2bx+ab_c = Q>
provethat ^_^ (у_^ = (а_^ ^_^ = (α_^ ^_^
Hence, or otherwise, prove that, if the quadratic equations have a common root,
it is a repeated root of one of them. (O & Q

22. The parabola and rectangular
hyperbola
1. SETS OF POINTS IN A PLANE
DEFINED BY PARAMETERS
The reader has already met (Chapter 3) the parametric representation of a
line in two and three dimensions. For example, given coordinate axes Ox,
Oy in a plane, the set of points
S = {(x,y)-x = 2t-\,y = t + 2; ί real}
represents a line. The set of points may be written in the alternative form
S= {(x,y):x-2y+5 = 0},
for ~2~= y~2= U
ox-2y+5 = 0
(Notice that, to show that the parametric equations
χ = 2t-l, у = t + 2
represent the line x— 2y+5 = 0,
it is insufficient merely to verify that the point (2t-l, t + 2) lies on the line:
it is necessary also to show that any point of the line has coordinates of this
form. See Ex. 29 below.)
Ex. 1. Show that, if both χ and у are given as linear expressions in a parameter t,
then the equation connecting χ and у is linear and thus represents a straight line.
Ex- 2. Find the Cartesian equation of the line given by the parametric form
x=3i-2, y = 2t+3;
and find a parametric form for the line with Cartesian equation
Ъх-Sy-l = 0.
If the expressions for χ and у involve non-linear expressions in a
parameter t, the set of points so defined generally constitute a curve. For
example, Г = {(*,,):*= i, ,= *V real}
represents a curve, since the expression for у in terms of χ is not linear.
461

PARABOLA AND RECTANGULAR HYPERBOLA [22
Ex. 3. Show that the Cartesian equations of the curve Γ defined above is
у = x\
Distinguish between the curve Г and the curve Г' defined by
Г'= {(*,*):*=/*, * = **;/real}-
Sketch Г and Г' for -3 < χ < 3.
Ex- 4. Show that the set of points
{(x, y) e R: χ = i2, у = ί2-1; t real}
constitute a line segment. (Only in such artificial cases does a non-linear
parametric expression yield a straight line.)
The expressions for χ and у need not be algebraic: the circle with centre
(0, 0) and radius 2 may be expressed in the parametric form
χ = 2 cos Θ, у = 2 sin Θ.
Ex. 5. Find the equation of the curve with parametric representation x= α sec 0,
у = b tan Θ.
The use of a parametric representation greatly facilitates the study of the
geometry of curves with known equations. This is because the position of a
point on the curve may be specified by one variable (the parameter) rather
than two variables (jt and y) connected by an equation (the equation of the
curve).
2. TRANSLATION OF COORDINATE AXES
The equation of a given curve naturally depends upon the choice of origin
and axes. Equations can often be much simplified by retaining the directions
of the χ and у axes (defined by the unit vectors i and j) but shifting the
origin to another point O'. Such a change in coordinate system is referred
to as a translation of coordinate axes.
Suppose we are given a pair of perpendicular unit vectors i and j and a
point Ρ with coordinates (jc, y) referred to the origin О and axes Ox, Oy
in the directions i, j. Then r = OP = xi+yj is the position vector of Ρ
with respect to O. Now let O' be the point (h, k) referred to the axes
Ox, Oy (Figure 22.1). Then
a = OO' = hi + kj
is the position vector of O' with respect to O. It follows (Figure 22.1) that
s = O'P = r-a = (x-h)i + (y-k))
is the position vector of Ρ with respect to O'. Put another way, if O' is
taken as the origin, with axes O'x', O'y' in directions defined by i, j then
462

COORDINATE AXES
the coordinates (x', y) of Ρ with respect to the new axes and origin O'
are given by
-h, у' = у—к.
Ex. 6 The coordinates of A, B, C, D referred to perpendicular axes through О
are respectively (1, 2), (-2, 3), (0, 4) and (-1, -5). What are their coordinates
referred to a pair of parallel axes through O' (2, -1) ?
У
~0
yt
O'V^^^
^^P
"-^*
Example 1. Show, by a translation of coordinates to a new origin O', whose
coordinates in the original system were (2, — 3), that the equation
x2 + xy-2f-x-Uy-20 = 0
represents a pair of straight lines.
Denoting new coordinates by dashes we have
x' = x-2, y' = y+3,
^x = x' + 2, y = /-3,
and the given equation becomes
(*' + 2)2 + (*' + 2)(/-3)-2(/-3)2-(*' + 2)-14(^-3)-20 = 0)
which simplifies to „ , , , . „
F χ 2 + x у -2у2 = 0.
But the expression on the left of this equation factorizes:
(x' + 2y')(x'-y') = 0;
and thus the equation represents the pair of straight lines
x' + 2y = 0, x'-y' = 0
through O'.

PARABOLA AND RECTANGULAR HYPERBOLA
Referred to Ox, Oy, the equations are
(x-2) + 2(y+3) = 0 and (je-2) -(y+3) = 0,
or x + 2y+4 = 0 and x-y-5 = 0.
Exercise 22(a)
Find the equations of the curves given parametrically in Questions 1-6.
1. χ = cos Θ, у = 2 sin Θ.
2. χ = t*,y = i3.
3. χ = l-f,.y = 1 + ί + ί2.
4. л: = cos (9-sin θ,γ = cos(9 + sin(9.
. at2 at
5-х = ш» ^ = йтз·
6. x= t+l/t, y= l+t2.
7. Show that the parametric form
represents the curve x2+y2 = 1,
less the point (-1,0).
8. If the axes are translated to (1, -4) as new origin, find the new coordinates of
the points:
(i) (0,0); (ii) (3, -4); (Hi) (3,4); (iv) (5, -5).
9. By translating the axes to pass through (1,1) as new origin prove that the
equation *-4y*-2x+8y-3 = 0
represents a pair of straight lines, and find their gradients.
10. Prove that the equation
2х2 + Зху-2у2-Пх + 6у + 8 = 0
represents a pair of straight lines intersecting in the point (2, 3).
11. Show, by a translation of axes to pass through (p, q) as new origin, that the
equation ax*+7hxy + by* + 2gx+2fy + c = 0
is transformed into the equation
ax'2 + 2hx'y' + by'* + d = 0
if p, q are chosen so that ap + hq+g = 0,
hp + bq+f= 0.
464

COORDINATE AXES
Deduce that the equation
ax2 + 2hxy + by2 + 2gx + 2fy+ с = 0
represents a pair of straight lines only if det A = 0 where
i\}
12. Use the results of Question 11 to show that the equation
6x2 + 5xy+y2-x-l = 0
represents a pair of straight lines, and find their point of intersection.
3. THE PARABOLA
A parabola is defined as the set of points in a plane whose distances from
a fixed point, S, are equal to their distances from a fixed line / not
containing S: see Figure 22.2. For the two typical points Px andi>2,
SPi = Pi-Μί, SP2 = P2M2.
X
Mx
P, /
0\
M2
I
s\
^2\
The curve is clearly symmetrical about the line through S perpendicular
to /; this line is called the axis of the parabola. The axis cuts the parabola
at O, the vertex of the parabola.
Ex. 7. In Figure 22.2 why does SO = OTl
The point S referred to in the definition is the focus of the parabola;
the line / is the directrix. From the symmetry of the figure about the axis,
the tangent at the vertex is parallel to the directrix.

PARABOLA AND RECTANGULAR HYPERBOLA [22
Ex. 8. If the chord PQ of a parabola, when produced, cuts the directrix at K,
prove that SK bisects the exterior angle between SP and SQ. What can you say
about the angle PSZ, where Ζ is the point at which the tangent at Ρ cuts the
directrix ? What does this tell you about the angle SPM, where Μ is the foot of
the perpendicular from Ρ on to the directrix ?
Ex. 8 shows that a number of properties of the parabola may be derived
very quickly from the definition by pure geometry, but the further study
of the curve is best made by analytical methods which we shall now develop.
As has been pointed out already, the equation of a parabola depends
not only upon the relative positions of the point S and the line / but
also upon the choice of coordinate axes. Let us suppose (Figure 22.3) that
Μ
X
У
Ί
О
Р^"
S >
Fig. 22.3
SX = la. The fact that SX is an axis of symmetry suggests strongly that
this line should be taken as one of the axes of coordinates. The absence
of a perpendicular axis of symmetry leaves us without an immediately
obvious choice for second axis; however, if we take the origin to be a
point of the curve, we shall avoid a constant term in the equation. Thus we
shall take О as the origin, OS as the χ axis and the perpendicular through
О as the у axis. Now suppose that Ρ is any point on the parabola, Γ, and
that the coordinates of Ρ referred to our axes of coordinates are (x,y);
let Μ be the foot of the perpendicular from Ρ on to /.
Since XO = OS, S is the point (a, 0) and / has equation x + a = 0.
ThuS SP>=(x-af+y\
PAP = (x + a)2,
and Γ = {(*, у): (x-af+f = {x+af}
= {(x,y):y2 = 4ax}.
466

3] PARABOLA
Thus, referred to its axis as the χ axis, its vertex as the origin and S as the
point {a, 0), the equation of a parabola is
y2 = Aax. (1)
*Ex. 9. Show that χ = at2, у = lat is a parametric form for the parabola
y2 = Aax. Need a be positive in this equation ?
From the equation derived above it may be seen that all parabolas have
a similar shape but differ in size (depending upon the distance of the focus
from the directrix). The chord of the parabola through S and perpendicular
to the axis is called the latus rectum; the length of the latus rectum
determines the size of the parabola.
*Ex. 10. Show that the length of the latus rectum is 1SX.
*Ex. 11. What is the length of the latus rectum of the parabola y2 = Aaxf
Answer the same question for the parabolas y2 = —ax and x2 = Ъау.
Ex. 12. Using the same axes, plot, on graph paper, the parabola y2 = Aax, for
a = 2,l,i,Tfo, -1,-2.
*Ex. 13. By writing the equation
y2—Aax—lay-За2 = О
in the form (y — a)2 = Aa(x+a)
and translating the axes to the new origin ( — a, a), show that it represents a
parabola of latus rectum Aa. Show also that, with the original coordinate system,
the vertex is the point ( — a, a), the focus (0, a) and the directrix x+la = 0.
Ex. 14. Show that the equation
y2 -lax-lay + 5a2 = 0
represents a parabola and find its focus and directrix. Answer the same question
for the equations:
(i) y2 + 3ax+6a2 = 0; (ii) x2 + lax - Say - la2 = 0.
The chord joining the points P^at2, lat^, P2{atl, 2at2) of the parabola
is easily obtained. Its gradient, m, is given by
m ~ at\-at\
2
h+h'
and thus its equation is
y-2ah = -^-{x-atb
which reduces to 2x-{h+t^ у+2а^2 = 0. (2)
467

PARABOLA AND RECTANGULAR HYPERBOLA [22
By letting i2-> tx this leads immediately to the equation of the tangent at
ЛК,2<>: x-by+atl-0. (3)
*Ex. 15. Interpret the parameter tL geometrically.
*Ex. 16. Л is the point {atl, 2αία), P„ is the point {atl, 2at^). Show that РгР2
passes through the focus, S, of the parabola if and only if ίιί2 = — 1. (A chord
which passes through the focus is called a focal chord.)
*Ex. 17. Obtain the gradient of the tangent to the parabola y2 = Aax at the
point {at2, 2at) by differentiating the equation y2 = Aax.
*Ex. 18. If lx+my + na = 0 cuts the parabola at the points Λ(αίί, 2ah),
P2{at\, 2at2), show that ib t2 are the roots of the quadratic equation
lt2 + 2mt + n = 0.
Write down the values of m/l, n/l in terms of ii and t2 and deduce the
equation of the chord РгР2. How may this method be modified to obtain
the equation of the tangent at P(at2, 2at)?
*Ex. 19. Show that the line
(χ-χύ/l = (У-Уд1т = Я
meets the parabola y2 = Aax at points with parameters Ab λ2 which are roots of
the quadratic equation
»г2Л2 + (2тед'1-4аОЛ + 0'?-4ал:1) = О.
Deduce that, if (лг1; уг) lies on the parabola and the given line is a tangent, then
2my1-Aal = 0.
Hence show that the tangent at (лг1; у^ has equation
(x-xi)lyi = (y-y1)l2a.
The equation of the normal to the parabola y2 = Aax at the point
P{at2, 2at) (that is, the line through Ρ perpendicular to the tangent) is now
easily found. For, since the gradient of the tangent is \jt, the gradient of the
normal must be — t and thus its equation is
y-2at =-t(x-at2),
which reduces to tx+y = 2at+at3. (4)
Ex. 20. Show that, if the normal to the parabola y2 = Aax at the point
P{ap2, Zap)
cuts the parabola again at the point Q(aq2, 2aq), then q is a root of the quadratic
equation in λ «,„■»-» ·,
ρλ2 + 2λ-2ρ-ρ3 = 0.
What is the other root of this equation? Find q in terms of p.
468

3] PARABOLA
The parabola is rich in geometrical properties which may be easily
derived analytically from the results already found. We shall content
ourselves here by proving two famous results; the reader will find further
examples in Exercise 22(6).
In proving the following two properties, we shall take the parabola in
the standard form y2 = Aax by making the vertex the origin, the axis of the
parabola the χ axis, the tangent at the vertex the у axis, and writing the
length OS as a.
(i) Tangents at the extremities of a focal chord meet at right-angles on the
directrix.
The chord joining Ρχ{αί\, 2atJ and P2{at\, 2at2) is, from equation (2),
Ix-fa+Qy+lat^z = 0.
[f ΡΧΡ2 is a focal chord, (a, 0) lies on this line and thus
From equation (3), the gradient of the tangents at Ρλ and P2 are l/ix
and l/t2 and thus, since t±t2 = -1 for a focal chord, tangents at the
extremities of such a chord certainly meet at right-angles.
On solving the equations
x-^y+atf = 0,
x-t2y+ati = 0,
we obtain the point of intersection R of the two tangents in the form
(at^, a(h+ Q)· For a focal chord РхРг, this reduces to {-a, a{tx+12))
and thus R lies on the directrix. (See Ex. 22 for an alternative method of
obtaining the coordinates of R.)
(ii) If Ρ is any point on the parabola, focus S, and Μ is the foot of the
perpendicular from Ρ on to the directrix, then the angle SPM is bisected by the
tangent at P.
The reader who has successfully completed Ex. 8 will have already proved
this property by pure geometry.
In Figure 22.4 we have
tana = gradient of tangent at Ρ {at2, 2at)
= \\t, from equation (2),
tan β = gradient of SP
2at
at2 —a
It
t2-\
2(1/0
i-(i/02·
4 PPM π 469

PARABOLA AND RECTANGULAR HYPERBOLA [22
= tan 2a
and the result follows.
Property (ii) is usually known as the parabolic reflection property since
it shows that a ray of light emanating from the focus will be reflected by a
parabolic mirror along a line parallel to the axis.
Μ
y.
0
^
p^^
rf .
5
Fig. 22.4
Ex. 21. Explain the use of parabolic reflection in car headlights and reflecting
telescopes. Explain, with the aid of diagrams, the ' headlights on' and 'headlights
dipped'positions.
Parabolas frequently occur both in nature and in design. For example,
a first approximation to the path of a projectile moving under gravity is a
parabola. Again, in architecture, it is becoming quite common to find roofs
of buildings designed so that certain cross-sections are parabolic. The
curves of suspension bridges are also approximately arcs of a parabola.
We conclude this section with an example illustrating the method of
determining a locus associated with a parabola.
Example 2. PQ is a focal chord of a parabola, focus S. Show that the locus
of the mid-point ofPQ is another parabola and locate its focus and directrix.
Set up coordinate axes with the vertex, O, as origin, the axis of the
parabola as χ axis, the tangent at the vertex as у axis and call the length of
OS, a. The equation of the parabola is then
y2 = Aax.
470

3]
PARABOLA
= 0, has the form
Any line through the focus, S(a, 0), other than л
у = m(x-a).
(Notice that m is a parameter: different values of m give different lines
through the focus and all such lines, with the single exception mentioned
above, may be put in this form.)
The line у = m(x-a) cuts the parabola y2 = Aax in points with у
coordinates y1 and y2 given by the roots of the equation
that is, by my2 — Aay — Aa2m = 0.
Let (h, k) be the mid-point of this focal chord; then
к = Кл+Л) = 2a/m
by a property of the roots of a quadratic equation.
Again, since (h, к) lies on the chord,
к = m(h—a)
or h = a+(2alm2),
on substituting for к.
The locus is thus given parametrically by
x = a+(2alm2), у = (2a/m).
Elimination of m gives this locus as the curve
y2 = 2a(x-a).
(Strictly speaking, to complete the
question we should also prove that any
point of the curve y2 = 2a(x— a) is a
point of the locus. The reader may care
to supply the details.)
By translating the axes to the new
origin (a, 0) it is easily seen that this
equation represents a parabola of latus
rectum la. In the original coordinate
system, the focus is <S"(fa, 0) and the
directrix, /', χ = $a. The directrix of the
locus cuts the common axis at A where
О A = AS = SS' (see Figure 22.5). Fig- 22-5
Notes on Example 2.
(i) The use of symmetric functions of the roots of a quadratic equation
is frequently applicable to problems involving the parabola and has the
advantage of retaining symmetry in the calculation.
4-2 471

PARABOLA AND RECTANGULAR HYPERBOLA [22
(ii) An alternative method of solving the question would have been to
write Ρ as the point (atf, 2at^) and Q as the point (at\, 2at2). The coordinates
of R could then be found in terms of tx and t2, again giving the locus of
R parametrically.
(iii) If LL' is the latus rectum of the given parabola, the mid-point of
the particular chord LL' is S which must therefore lie on the required
locus, giving us a check on our working. Again, the symmetry of the locus
about the χ axis was to be expected from the symmetry of the parabola
about its axis.
Ex. 22. Show that if the tangent at the point P(at2,2at) passes through the point
R(h, k) then г is a root of the quadratic equation
aA2-kA + h = 0.
Deduce that the coordinates of the meet of tangents at Λ(α?ί, 2at^ and P2(at\, 2at£
is Riat^aifi+Q).
Exercise 22(b)
1. Find the latus rectum, vertex, focus, and directrix of each of the following
parabolas:
(i) (y-2)2 = Aix-3); (ii) (x-3)* = 2(y + l);
(iii) (y + 1)2 = -(x-1); (iv) (x+2y = 3(y + 3);
(ν) (y + 3)2 = 4(x-5); (vi) (x-2y = 80>-2);
(vii) (y-4)* + 2x= 0.
2. Find the equations of the parabolas having the given points as foci and the
given lines as directrices:
(i) (2, 1), χ = 0; (ii) (-3,2),y = 3.
3. Find the equations of the parabolas: (i) with focus (4, 3) and vertex (1, 3);
(ii) with focus (-1, - 7/4) and vertex (-1, -1).
4. Find the equations of the tangents to the following parabolas at the points
given:
(i) y* = 2{x+1), (7, -4); (ii) 0> + 2)2 = x, (9, 1);
(iii) y*-3y-5x = 0, (2, 5); (iv) л:2-2х-3y-21 = 0,(-3, -2).
5. Show that the line x+y-2 = Oisa tangent to the parabola x2 + Sy = 0 and
find the point of contact.
6. Show that the line 5x+2y + S = 0 is a tangent to the parabola
л:2-2л:-4.у = О,
and find the point of contact.
7. Any element Ρ of the set S of points in a plane has a position vector of the
f0m r = U-Di + i%
where г is a scalar parameter. Show that S defines a parabola and determine the
position vectors of its vertex and focus.
472

3]
PARABOLA
8. A ball is thrown obliquely into the air in such a way that its position vector
at time ns given by r = 10гН-(Ш-Ш*) к,
where i is a horizontal unit vector and к is a unit vector pointing vertically
upwards. Show that the path of the ball is a parabola, and determine the distance
of its directrix from the origin.
9. Any element Ρ of the set S of points in a plane has a position vector of the
form r = cos20.i-(l+sin0).j,
where θ is a scalar parameter. Show that S forms part of a parabola with its axis
parallel to i and sketch the set of points S.
Questions 10-15 refer to Figure 22.6. О is the vertex, S the focus of the parabola,
AM is the directrix, ANPM is a rectangle. The tangent at Ρ cuts the axis at Τ
and the normal at Ρ cuts it at G. PT and SM intersect at Z. You may use any
method which you regard as suitable.
Fig. 22.6
10. Prove that SZ and PT are perpendicular and that Ζ lies on the tangent at
the vertex. Suggest how, with the aid of a set square, this property may be used
to draw a parabola.
11. Prove that ST = SP and deduce that SPMTh a rhombus. (Notice that this
gives the parabolic reflector property.)
12. Prove that ST = GS and deduce that NG is of fixed length.
13. Prove that NT is bisected at O.
14. Prove that SZ2 = SO.SP.
15. Prove that TZ = ZN and that GP = 2SZ.
473

PARABOLA AND RECTANGULAR HYPERBOLA [22
16. PQ is a focal chord of a parabola. If the tangents at Ρ and Q meet at Γ and
the normals at Ρ and Q meet at [/ prove that TV is parallel to the axis of the
parabola.
17. If R is the mid-point of the focal chord PQ of a parabola and V is the foot
of the perpendicular from R on to the directrix, prove that RV = \PQ.
18. Ρ is a variable point on a parabola, focus 5, and R is the mid-point of PS.
Prove that the locus of R is a parabola and find its axis, focus and directrix.
What checks can you apply to your result?
19. The chord PQ of a parabola subtends a right angle at the vertex. Find the
locus of the mid-point of PQ.
20. In Figure 22.6 the line UOV is perpendicular to the tangent PT, meeting
PT at U and the parabola at О and V. Prove that UO.OV is constant for all
positions of Ρ on the parabola.
21. Ρ is a variable point of the parabola y2 = Aax. Find the locus of the
midpoint of OP and sketch in the same diagram the given parabola and the locus.
22. With the notation of Figure 22.6, К is the mid-point of PG. Prove that SK
is parallel to TP and find the locus of K.
23. Prove that the equation of the tangent at the point (at2, lot) on the parabola
A and В are any two points on the parabola y2 = Aax, and the tangents at
A and В meet at P. The line PM is parallel to the axis of the parabola and meets
the line AB at M. Prove that Μ is the mid-point of AB.
If the parameters of the points А, В are t and 2i respectively and the tangents
meet at P, find the coordinates of Ρ and show that it always lies on the parabola
y2 = 9ax/2. (O & C)
24. Show that any line through the fixed point (ha, ka) may be written in the
form , , , ,
y—ka= m(x—ha),
where m is a parameter.
Deduce that, if the chord PQ of the parabola y2 = Aax passes through a fixed
point, then the tangents at Ρ and Q meet on a fixed line.
25. РгР2 and Qr Q2 are parallel chords of a parabola. If Qr Q2 produced cuts
the tangents at Д and P2 at ^ and R2 then QiRt = Q2R2·
26. Prove that the equation of the chord joining the points (at2,2at) and
(aT2,2aT) on the parabola y2 = Aax is
2x-(t+T)y + 2atT= 0,
and deduce the equation of the tangent at the point whose parameter is t.
The parameters of the three points А, В and С on the parabola are tu t2 and
t3. The tangents at A and В meet the tangent at С at Q and Ρ respectively; the
tangents at A and В meet at R. Prove that if Ρ is the mid-point of QC, then
2ί2 = ί!+ί3.
Prove also that RP is parallel to А С. (О & Q
474

3] PARABOLA
27. Prove that the equation of the normal at the point P(at2,2at) to the parabola
y* = 4ax'is tx + y= at3 + 2at,
and that it meets the parabola again at QiaT2, 2aT) where Τ = -t- lit.
The tangents at Ρ and Q meet again at R; prove that, if Ρ is a variable point
on the parabola, the locus of R is
y2(x + 2a) + 4a2 = 0. (O&C)
28. The tangents at two points Ρ and Q on the parabola y2 = Aax meet at R.
Find the equation of the locus of R in each of the following two distinct cases:
(i) when the sum of the ordinates of Ρ and Q is 4a;
(ii) when the sum of the squares of the ordinates of Ρ and Q is 4a2. (O & Q
(The ordinate of a point is the у coordinate.)
29. The chord PQ of the parabola y2 = Aax subtends a right angle at the vertex,
Ρ and Q being the points (at2,2at) and (aT2,2aT). Prove that tT+ 4 = 0 and that
the locus of the point of intersection of the normals at Ρ and Q is
уг = 16a(x-6a). (O&Q
30. Show that the differential equation
d2y/dx2 = a,
where α is a constant, defines a family of parabolas (that is, any solution of the
given equation represents a parabola). If a = 6, find the equation of the parabola
of the family which has its vertex at the point (f, - f).
31. Prove that, in general, a unique parabola with its axis parallel to a given
direction may be drawn through three given points. What exceptional cases
may arise?
Find the equation of the parabola with axis parallel to the у axis and passing
through the points (-1, 5), (0,2), (1, 3).
32. A man rows a boat across a river flowing at a constant speed v. If his speed
in still water is also ν and he always aims at a point on the further bank exactly
opposite his starting point, show that his path is a parabola.
4. THE RECTANGULAR HYPERBOLA
The parabola proved particularly amenable to analytical treatment
because of its simple parametric form when referred to suitable axes.
Another curve with a simple parametric representation is the rectangular
hyperbola. We shall see later that the rectangular hyperbola is a special
case of a more general type of curve, namely, the hyperbola, and, for this
reason, we shall postpone a geometrical definition (see Chapter 27) and
content ourselves with a purely analytical treatment by defining a
rectangular hyperbola as the set of points
{(x,y):xy = c2},
475

PARABOLA AND RECTANGULAR HYPERBOLA [22
where the coordinates are referred to perpendicular axes Ox, Oy. (c is a
constant which appears here as c2 since it will later be interpreted as a
length. It is not necessary for the constant to be positive and the set of
also represents a rectangular hyperbola.)
*Ex. 23. Show that χ =
rectangular hyperbola.
ct, у — cjt is a suitable parametric representation of the
Fig. 22.7
The shape of the curve is shown in Figure 22.7. It is seen to consist of
two branches (notice that, if (A, k) lies on the curve, so also does (- h, — k)).
The point О is called the centre of the rectangular hyperbola (see Ex. 24)
and the curve is symmetrical about the lines x—у = 0 and x+y = 0.
When χ = 0, there is no corresponding value of у and vice-versa;
in fact, as x-^0 through positive values, j-^αο and as x^O through
negative values, у ->—oo. Thus, the line χ = 0 is an asymptote and
similarly so is у = 0.
Ex. 24. Show that, if any line through О cuts the rectangular hyperbola at Ρ
and Q, then О is the mid-point of PQ. (This corresponds to a property of the
centre of a circle.)
Ex. 25. Trace the position of the point P{ct, c/t) as t varies from -oo to +oo.
The chord joining the points P^ch, cjt^) and P2(ct2, c/t2) on the
rectangular hyperbola has equation
с φ2-Ί) , ,ч
У— = — ,—^ч (х— cU)
476

4] RECTANGULAR HYPERBOLA
which reduces to ± ^ .. , „ ч
The tangent at Д may be obtained immediately by letting t2 -> ^:
♦Ял:. 26. Obtain the gradient of the tangent at Pt by differentiating the equation
xy = c2.
*£Ά\ 27. Obtain the equation of the chord РгРг by the method of Ex. 18.
Ex. 28. Show that the equation of the normal to the curve at Px is
t\x-y= ci/i-l)//i.
Problems concerning the rectangular hyperbola are solved by methods
reminiscent of those employed for the parabola. We conclude this chapter
with a worked example.
Example 3. A rectangular hyperbola has centre O. Through a fixed point A
lines are drawn to cut the hyperbola at Ρ and Q. Show that the mid-point,
M, ofPQ lies on another rectangular hyperbola and find its centre and
asymptotes.
Set up coordinate axes with О as origin and the asymptotes of the given
rectangular hyperbola as the axes of coordinates, labelled in such away that
one branch of the hyperbola lies in the first quadrant (positive x, positive;;).
Let A be the point (α, β) and let PQ be any chord through A, where Ρ is the
point (ctv c/ij and Q is the point (ct2, cjt2). Then PQ has equation
x + ht2y = (fti + h)
(as shown above) and thus, since PQ contains the point A,
a+ ht^ = c(?! +12). (1)
Let (h, k) be the mid-point of the chord PQ; then
2h=c(t1 + t2), 2*=C(I+})=^,
t,t2
_<h + t2)
2lc
Hence, using (1) we see that h, к are connected by the equation
which may be rewritten as
(Α-*α)(*-*/ί) = *α£

PARABOLA AND RECTANGULAR HYPERBOLA [22
Thus {h, k) lies on the curve
(x-fr)(y-W)=W (2)
which represents the required locus (or, more accurately,1 contains the locus:
see Ex. 29).
By translating the coordinate axes to pass through the new origin
(ia> iA)> this is seen to represent a rectangular hyperbola, centre (^α, %β),
with axes parallel to the axes of coordinates. Thus, Μ lies on a rectangular
hyperbola, centre the mid-point of О A and with asymptotes parallel to
those of the given hyperbola.
The reader should compare the choice of parameters in Example 3
with that in the similar problem of Example 2.
Ex. 29. The locus sought in Example 3 may be definitely only part of the curve
(x— ia) (y- \β) = ia/3. Show this by taking A as the origin.
Exercise 22(c)
1. The tangent to a rectangular hyperbola at the point Ρ meets the asymptotes
at Q and R. Prove that Ρ is the mid-point of QR.
2. An axis of symmetry cuts a rectangular hyperbola at A, A'. P is any point of
the hyperbola and N is the foot of the perpendicular from Ρ to AA'. Prove that
PN2= AN.NA'.
3. With the notation of Question 2, if О is the centre of the hyperbola and G is
the point of intersection of the normal at Ρ with the line AA', prove that
ON ■= NG.
4. With the notation of Questions 2 and 3, if Ζ is the foot of perpendicular from
О to the tangent at P, prove that OZ.OP - ОАг.
5. With the notation of Questions 2 and 3, if the tangent at Ρ meets AA' at T,
prove that ON .ОТ = OA\
6. Show that the equation
(x-a)(y-b)= c2
represents a rectangular hyperbola and find its centre and asymptotes.
Find the centre and asymptotes of the rectangular hyperbola
ху-хЛ-Ъу-Ί - 0.
7. A variable line passes through the point (1, 1) and meets the χ and у axes at
P, Q respectively. The rectangle OPRQ is completed. Prove that R lies on a
rectangular hyperbola, and find its centre and asymptotes.
8. The tangent to a rectangular hyperbola at the point Ρ meets the asymptotes
at Q and R. If О is the centre of the hyperbola, prove that QR = 2θΡ and that
the area of the triangle OQR does not depend upon the position of P.
478

4] RECTANGULAR HYPERBOLA
9. Ρ is a variable point on a rectangular hyperbola and Q is the foot of the
perpendicular from Ρ on to one of the asymptotes. If R divides PQ in a fixed ratio
show that R lies on a rectangular hyperbola with the same asymptotes as the
original hyperbola.
10. Find the equations of the normals to the hyperbola xy = c2 which are
parallel to the line x— Ay = 0.
11. The tangents at Ρ, β to a rectangular hyperbola meet at R. If О is the centre
of the hyperbola, show that OR bisects PQ.
12. Л, P2, P3, Pi are four points on a rectangular hyperbola. Prove that, if Р±Р2
is perpendicular to P3Pi, then each of the four points is the orthocentre of the
triangle formed by the other three.
13. The normal to the rectangular hyperbola xy — c2 at the point A(cib c/O
meets the curve again at P2(ct2, c/t2). Prove that t\t2 = — 1 and deduce that the
locus of the mid-point Μ of РгР2 has equation
4x3y3 + c2(x2—y2)2 = 0.
Miscellaneous Exercise 22
1. Prove that the line px+qy = 1 is a tangent to the parabola
y2 = 4a(a-x) if a{p2+q2) = p.
A line / and a point О not on / are given; Ρ is a variable point on / and Q
divides OP in a fixed ratio. Prove that the line through Q perpendicular to OP
touches a fixed parabola with focus at О and directrix parallel to /. (O & C)
2. The coordinates of the mid-point of the line joining the points (au2,2au),
(av2, lav) are (X, Y). Express u + v and uv in terms of X and Y.
A variable chord of the parabola y2 = Aax passes through the fixed point
(b, 0). Prove that the locus of the mid-point of the chord is a parabola, and
find the coordinates of its vertex and focus. (O & C)
3. The normal at a point Ρ meets the parabola again at Q. Find the length of
PQ in terms of the acute angle θ between the normal and the axis of the parabola.
4. Two parabolas, y2 = 4a(x + b) and x2 = 4a(y + b), where a > 0, are given.
Prove that each point of intersection lies either on the line у = л: or on the line
y + x+4a= 0.
Hence, or otherwise, prove that, if the parabolas have four real points of
intersection, then b > Ъа. (О & Q
5. Lines are drawn through a variable point Ρ oi the parabola y2 = Aax, making
angles oc. and π— ос. with the axis of the parabola. The lines meet the parabola
again at Q and R. Prove that:
(i) the point of intersection of the tangents at Q and R lies on the parabola
y2 = 4a(x+4acot2a);
(ii) the line QR touches the parabola y2 = 4a(x-4a cot2 a). (O & Q
479

PARABOLA AND RECTANGULAR HYPERBOLA [22
6. Prove that the line у — mx+a/m touches the parabola^2 = Aax for all values
of m. Deduce that, if two tangents to a parabola are perpendicular, then their
intersection lies on the directrix.
7. Two directed lines meet at A, and points Ρ and P' are taken on the two lines
so that AP+AP' is constant, sense being taken into account. It is required to
prove that PP' touches a fixed parabola. Prove this by first showing that the
equations of the lines may be taken in the form
у = (x + c) tan α and у = - (x + c) tan a,
where 2a is the angle between the lines and
AP+AP' = 2c sec a,
and that the coordinates of Ρ and P' may be taken in the form
(t cos a, t sin a + с tana) and (-i cos a, t sin a— c tan a).
Deduce that PP' touches the parabola уг = Acx tan2 a. (O & C)
8. Prove that the mid-points of parallel chords of a parabola lie on a straight
line (called a diameter of the parabola) parallel to the axis.
Given a parabola traced on paper, find a construction for its focus.
9. If a point is marked on a rectangular sheet of paper, one of whose sides is AB
and the paper is then folded in such a way that AB passes through the point,
show that the crease will always touch a fixed parabola.
10. The tangents /, m from a point Ptoa parabola meet the directrix at L, Μ
respectively. The other tangents to the parabola from L, Μ meet m, /at Q, R
respectively. Prove that QR passes through the focus, S, and that the angle PSQ
is a right angle.
11. A straight line passes through the pomt P^x^y^ and has gradient m = tana.
Prove that a general point of the line has coordinates (xt+ r cos а, уг + г sin a),
where r is a parameter, and that the values of r giving the points of intersection of
the line with the parabola y2 = Aax are the roots of the quadratic equation
r2sin20+2f-O>1sin0-2acos0)+>'2-4ax1 = 0.
Deduce that, if Д is the mid-point of the chord QR, then the gradient of QR
is 2а/у! and its equation is „ . „
4 угу-2ах = yl-2axv
12. Two chords Д Qls P2 Q2 of a parabola intersect at K. Prove that the value of
thefraction (ΚΡ^ΚΩύΚΚΡ,.ΚΩύ
depends on the directions of the two chords, but not upon the position of the
point K.
[This result, known as Newton's Theorem, holds for all conies (see Chapter 27).
Use the analysis of Questions 11 to prove the result for the parabola.]
13. Follow through the work of Questions 11 and 12 to prove Newton's Theorem
for the rectangular hyperbola xy = c\

MISCELLANEOUS EXERCISE
14. Three points P(ap2, lap), Q(aq2, 2aq), R(ar2, 2ar) taken on the parabola
y2 = Aax are such that PQ subtends a right angle at R. Show that
(p + r)(q + r)+4= 0.
Show that every chord of the parabola which subtends a right angle at R
intersects the normal at R at the same point F. Show also that, as R varies, F describes
another parabola. (London)
15. The chord AB joining points A(ctu c/iO and B(ct2, clQ on the rectangular
hyperbola xy - c2 is of constant length /. Show that, as the position of the chord
varies, the centroid G of the triangle AOB, where О is the origin, moves on the
CUrVe (9xy-4c2) (x*+y2) = Pxy.
Find the area of the triangle AOB when the coordinates of G are (c, 2c).
(London)
16. Prove that the equation of the tangent PT at the point P(ct, c/t) on the
rectangular hyperbola xy = c2 is
x + t2y = let.
The perpendicular to PT from the origin О meets PT at Q and the normal at
Ρ meets the hyperbola again at R.
Prove that:
(i) as Ρ varies, the locus of Q is
(x2+y2)2 = 4c2xy;
17. Prove that the equation of the chord joining the points P{ct, c/t) and
QicT, c/T) on the rectangular hyperbola xy = c2 is
x+tTy = c(t + T).
Μ is the mid-point of PQ and PQ meets the л: axis at N. Prove that OM = MN,
where О is the origin.
The line through N parallel to OM meets the hyperbola at R and S whose
parameters are h and i2; С is the mid-point of RS. Prove that:
(i) CM is parallel to the у axis;
(ii) iti2 = - tT (O & C)
18. Prove that the equation of the tangent at the point (A, k) on the parabola
y2 = Aax and the equation of the tangent at the point (Я, К) on the rectangular
hyperbola xy = c2 are, respectively,
ky = 2a(x+h) and Хх+Яу = 2c2.
Find the coordinates of the point of intersection, P, of the parabola y2 — Aax
and the rectangular hyperbola xy = 4а%/2 and prove that the tangent to the
parabola at Ρ is the normal to the rectangular hyperbola at P.
Prove that, if this normal meets the rectangular hyperbola again at Q, then
the abscissa of Q is - 4 a. (O & C)
481

PARABOLA AND RECTANGULAR HYPERBOLA [22
19. Prove that the normal to the rectangular hyperbola xy - c2 at the point
P(ct, c/t) meets the curve again at Q{- c/t3, - ct3).
The circle on PQ as diameter meets the hyperbola at R; prove that PR passes
through the origin. (O & C)
20. Given a point Ρ on a rectangular hyperbola, prove that one and only one
real chord PQ can be drawn which is normal to the hyperbola at Q. Deduce that
there is only one (real) chord AB of the hyperbola which is normal at both A and
В and locate the two points A, B.
21. Points P, Q, R, S are taken on the rectangular hyperbola xy = c2. Prove
that PQRS is a rectangle if and only if the parameters of the four points are of the
form t, t~\ -t, -1-1. Deduce that it is impossible to inscribe a square in a
rectangular hyperbola.
22. The tangent to a rectangular hyperbola Г atthe point Ρ meets the asymptotes
at Q and R. О is the centre of the hyperbola and OQSRis a rectangle. If SQ, SR
cut Г at Q', R' prove that Q'R' touches a second rectangular hyperbola whose
asymptotes coincide with those of Г.
23. Prove that all chords of a parabola which subtend a right angle at a point
Ρ on the parabola pass through a fixed point Q.
Q is called the Frigier point of P. If a given chord subtends right angles at
two points, Ρ and P' of the parabola, prove that the join of the two Fregier
points, Q and Q', is parallel to the given chord.
24. Find the equation of the tangent to the parabola 5Ί: y2 = Aax at the point
P(at2, 2at).
If this tangent cuts the rectangular hyperbola SV· xy = k2 at real points Q, R
and Μ is the mid-point of QR show that Μ lies on part of the parabola Σ:
2y2 + ax= 0.
Sketch in the same diagram the curves Si and 52 and the set of all such points
Μ in the case a > 0. Mark in your diagram the line other than χ = 0 which
is a tangent to both 5t and 52.
482

23. Polynomial equations
1. SOME PRELIMINARY OBSERVATIONS
The results we have obtained in Chapter 18 generalize very readily to
general polynomial equations. In particular, it is possible to find
expressions for symmetric functions of the roots of polynomial equations in
terms of the coefficients. We shall pursue the question of symmetric
functions in Section 2; in this section we shall discuss a number of other results
connected with the solution of general polynomial equations of degree n.
If we are given a specific equation, say
z* + (3+2j)z2-(l-j)z + 4=0
it is not at all obvious whether or not the equation has a solution. Indeed,
two questions arise:
(i) Does every equation possess a solution ? (That is, is there a complex
number which satisfies the equation ?)
(ii) If a solution exists, can it be found by processes similar to those
employed for solving a quadratic equation ?
The answer to the first question is 'Yes' but the proof of this is rather
difficult (but not beyond the understanding of a really enthusiastic pupil: see,
for example, Hardy's Pure Mathematics, Appendix II or Courant and Rob-
bins What is Mathematics ?, Chapter 5). The result was first proved, like so
many other central theorems of mathematics, by C. F. Gauss and is
generally known as 'The Fundamental Theorem of Algebra': explicitly, it states
that any polynomial equation with complex numbers as coefficients (and
in particular, any equation with real coefficients) has a root which is a
complex number (R is regarded as a subset of С in this result). We shall
content ourselves with assuming the truth of this theorem.
The answer to the second question is 'Yes, if the degree of the given
equation is less than five' but 'No, in general, if the degree is five or more'.
The first part of this result is comparatively easy to prove and is due to a
number of Italian mathematicians of the Sixteenth Century; the second
part is very much harder and is due to two mathematicians, E. Galois
(1811-32) and N. H. Abel (1802-29). (The truly dedicated reader may care
to follow the matter up in, say, Birkhoff and Maclane, 'Survey of Modern
Algebra. Brief readable accounts of the life and work of Galois and Abel
may be found in Bell, Men of Mathematics, Volume 11.)
483

POLYNOMIAL EQUATIONS [23
Ex. 1. Although it is not possible, in general, to solve a given quintic equation
except by approximate numerical methods, this is not to say that no quintic
equation is soluble. Solve the equation z5— 32j = 0.
If P(z) is a polynomial of degree n, and if P(z) has a factor (z - cc)r (but
not (z~x)r+1) where r is an integer ^ n, the equation P(z) = 0 is said
to have a root of multiplicity r. For example, the quartic equation
(z- 1) (z + 2j)3 = 0 has a root ζ = 1 and a root ζ = - 2j, of multiplicity 3.
Ex. 2. Solve the cubic equation z3+3z2 — 4 = 0, given that it has an integral
root of multiplicity 2.
*Ex. 3. Assuming that every polynomial equation has a root, prove, by induction,
that a polynomial equation of degree η has η roots, where a root of multiplicity
r is counted as r roots.
*Ex. 4. By writing P(z) = (z-α)' Q{z) prove that, if P(z) = 0 has a root of
multiplicity r, then P'(z) = 0 has a root of multiplicity 0- 1).
Prove that the converse is false.
*Ex. 5. If P(z) ■= 0 has a root of multiplicity r > 2, what can you say about the
equation P"(z) = 0?
As with quadratic equations, additional restrictions upon the coefficients
of a polynomial equation enable us to make further assertions about the
roots. In particular, if all the coefficients are real we have the following
three important results which enable us to decide whether we have an odd
or even number of roots and to locate such roots.
Theorem 23.1. If all the coefficients of
P(z) s a0zn + a1zn-1+...+a„
are real and if the complex number zx is a root ofP{z) = 0 so also is zf.
Proof. Since „ , „ , , , _
α^ϊ + α^ϊ^+.,.+αη = 0,
we have a0{zt)n + a^zf)71'1 + ...+an = 0,
on taking the complex conjugate of each side, and noting that a* = ait since
the at are real. The result follows immediately.
Ex. 6. Given that — 1 + j is a root of the equation
z4+3z3 + 5z2+4z+2 = 0,
solve the equation completely.

1] PRELIMINARY OBSERVATIONS
Theorem 23.2. If all the coefficients of
P(z)sa0z>+o1z«-4...+a» (a0 * 0)
are real and if the equation P(z) = 0 has no real root, then P(z) is always
positive or always negative for all real z.
Proof (The theorem is intuitively obvious from graphical considerations.)
By Theorem 23.1 the roots of P(z) = 0 occurs in conjugate pairs:
Oi+JA, «,±JA. ....
But [*-(«,+j&)] [*-K-j/?r)] = (*-«,)'+#
and thus P(z) = α0[0-θι)2+Λ2] [(.z-atf+ffl -
and every square bracket is positive for all real z. The sign of P(z) is thus
determined by the sign of a0.
*Ex. 8. Prove that, if a0 and an have opposite signs, the equation P(z) = 0 has
a real root.
Theorem 23.3. If all the coefficients of
P(z) = α0ζη + α1ζ"-1 + ...+Λ„ (α0 * 0)
are real and if 'xx, x2 are real numbers such that P{x^), Р{Хг) have opposite
signs, then P(z) = 0 has an odd number of real roots between χλ and x2
(and, in particular, of course, at least one real root). For the purposes of
this theorem a root of multiplicity к is counted as к roots.
Proof. (Again, the result is intuitively obvious from graphical
considerations.)
Let ocj, a2, ..., ccr be all the real roots of P(z) = 0. Then
P(z) = ίψ-αύ{ζ-αύ ... (ζ-Ο Q(z),
where Q(z) is either positive or negative for all real ζ (by Theorem 23.2).
Thus (x1 — cc1)(x1 — cc2) ... (x1 — ccr) and (χ2—α.1)(χ2—οί2)...(χ2—α,τ) have
opposite signs and
K*l-«l) (*.-«!)] K*l-"2) (*2-"2)] .·. К*!"*,) (*,-«,)] < 0.
Thus (x1—oci) (x2— a,·) is negative in an odd number of cases and the result
follows.
Ex. 9. Prove that x3-x + 3 = 0 has a root between л: = -2 and χ = - 1.
We conclude this Section with two worked examples. The first shows
how one equation may be transformed into another in such a way that
485

POLYNOMIAL EQUATIONS [23
the roots of the two equations bear a given relationship to one another.
The second example indicates how calculus methods may be used to
solve problems on polynomial equations.
Example I. If α, β, γ are the roots of the equation
3z3-5z2 + 2z + 2 = 0
form the equation: (i) with roots α - 2, β — 2, γ — 2;
(ii) with roots 2a, 2β, 2γ;
(iii) with roots α?, β2, γ2.
(i) As in Example 2, Chapter 21, a- 2, β-2, γ- 2 are the roots of the
3(w + 2)3-5(w + 2)2 + 2(w + 2) + 2 = 0.
equation
This may be expanded directly. Alternatively, since we are essentially
making the transformation ζ -+ w + 2 we may rewrite the polynomial
3z3-5z2 + 2z + 2
in the form a(z -2)3 + b(z- 2)2 + c(z - 2) + d
and the required equation will be
aw3 + bw2 + cw + d = 0.
By Horner's Method (Chapter 18, Section 1) we have
3-5 2 2(2
3
3
3
6
1
6
7
6
13
2
4
14
18
8
10
giving the required equation
3w3+13w2+18w+10 = 0.
(ii) In a similar way, to form the equation with roots 2a, 2β, 2γ we
make the transformation ζ -+ wj2, giving the required equation:
3(iw)3-5(iw)2 + 2(iw) + 2 = 0.
or 3w3-10w2+8w+16 = 0.

1] PRELIMINARY OBSERVATIONS
(iii) If w = z2, then either z= wb or ζ = — w*. Thus
3z3-5z2 + 2z + 2 = 0
ο z(3z2+2) = 5z2-2
о either w*(3w + 2) = 5w-2
or -v^(3w + 2)= 5w-2
о {5w-2-w*(3w + 2)}{5w-2+wi(3w + 2)} = 0
о 9w3- 13w2 + 24w-4 = 0,
on multiplying out and rearranging terms, and this equation has roots a2,
β\ У2·
Example 2. Discuss the nature of the roots of the equation
3xi-4x3-l2x2 + k= 0
for different real values ofk.
P(x) = Ъх*-4х*-\2х2;
P'(x) ξ \2χ3-\2χ*-2Αχ
P'(x) = 0 when χ = - 1,0, or 2.
Write
then
and thus
Since P(x) > 0 for sufficiently large x, the values - 1, 0, 2 for χ yield
respectively a minimum, maximum and minimum of P(x). The graph of
у = P(x) thus assumes roughly the shape shown in Figure 23.1.
487

POLYNOMIAL EQUATIONS [23
Now the given equation may be written as
P(x) = -k
and the number of real roots of the equation may be deduced immediately
by considering the intersections of the curve у = P(x) and the straight
line у = —k:
к < 0: the equation has two unequal real roots and two complex
conjugate roots.
к = 0: the equation has three unequal real roots, one of multiplicity 2.
0 < к < 5: the equation has four unequal real roots.
к = 5: the equation has three unequal real roots, one of multiplicity 2.
5 < к < 32: the equation has two unequal real roots and two conjugate
complex roots.
к = 32: the equation has a real root of multiplicity 2 and two conjugate
(complex) roots.
к > 32: the equation has two pairs of conjugate complex roots.
Ex. 10. What are the roots of the equation in Example 2 in the cases
(i) к = 0; (ii) к = 5; (iii) к = 32?
Exercise 23(a)
1. Form the equation with roots 1, -2, 3~V5, 3 + V5.
2. Form the equation with roots 1 + 2j, 1 -2j, -2, 3.
3. Form the equation with a root — 1 of multiplicity 3 together with a pair of
roots +j, -j.
4. Solve the equation „„,„„.,
~* z4- 2z3 + 3z2- 4z+2 = 0,
given that it has a root of multiplicity 2.
5. Use the Remainder Theorem to solve the equation
z3-3z2-8z + 30 = 0,
given that it has an integral root.
6. Theequation 24zi+100z3+ 126z*+27z_27 = 0
has a root of multiplicity three. Solve the equation completely.
7. If oc, β, γ are the roots of the equation
z3-z-4= 0,
form the equation with roots;
... 1 1 1. _.,«+! β+1 7+1

1] PRELIMINARY OBSERVATIONS
8. If α, β, γ are the roots of the equation
z3-3z2-3z-4 = 0,
form the equation with roots:
(i) 2a, 2β, 2γ; (ii) a\ β*, y\
9. If α, β, γ are the roots of the equation
z3+jz+l+j = 0,
form the equation with roots:
0) К tf, Ь; (ϋ) (1 +JK (1 + j)A (1 + j)7·
10. If a, /?, γ are the roots of the equation
2z3-z2+4z+7 = 0,
form the equation with roots
(i) α + 2,/? + 2,γ+2; (ii) α-3, /?-3, γ-3.
11. If a, /?, γ, £ are the roots of the equation
z4-z-5= 0,
form the equation:
(i) with roots - a, - β, - γ, - δ; (ii) with roots a + 2, /?+2, γ+2, <?+2;
(iii) with roots α2,/?2, γ2, <?2.
12. If a, /?, γ are the roots of the equation
3z3-20z2+39z-18 = 0,
form the equation with roots a- 3, β— 3, γ- 3. Hence find a, /?, γ.
13. Solve the equation z4+4za + 5z2+2z_2 = α
by first increasing the roots by 1.
14. Solve the equation
z3+3jz2-5z-3j = 0
by first increasing the roots by j.
15. Show that the cubic equation
z3 + 4z2-3z+25 = 0
has three real roots, one of which is positive and the other two negative.
16. Show that the cubic equation
z3~3z+k= 0
has three distinct real roots if and only ii k is real and — 2 < к < 2.
17. Discuss the nature of the roots of the equation
2ζ3-15ζ2+24ζ+& = 0
for different real values of к.
18. Discuss the nature of the roots of the equation
ζ4-4ζ3-2ζ2+12ζ+&= 0
for different real values of к.
489

POLYNOMIAL EQUATIONS [23
19. Discuss the nature of the roots of the equation
3z4-4z3-24z2 + 48z+&= 0
for different real values of k.
20. Find a necessary and sufficient condition that the three roots of the cubic
equati°n z°+3tfz+G=0
should be real and unequal.
2. RELATIONS BETWEEN ROOTS AND
COEFFICIENTS FOR POLYNOMIAL EQUATIONS
Consider first the cubic equation
α0ζ3 + α1ζ2 + α2ζ+α3 = 0
with roots α, β, γ. We have
ας?3 + a1z2 + a2z+a3
Sflb(z-a)(z-yS)(z-y)
ξ α0ζ3-€φί+β+γ) ζ2 + α^βγ + γα, + ΰίβ) ζ-α0χβγ
and, equating coefficients of z2, z1 and z°,
cc+β+γ =-aila0,
βγ + αα, + χβ = +а2/а0,
αβγ =-α3/α0.
Similarly, for the quartic equation
a0zi + a1z3 + a2z2 + a3z + ai = 0,
with roots α, β, γ, δ we have
a0zi + a1z3 + a2z2 + a3z+ai
= α£ζ-*)(ζ-β)(ζ-γ)ίζ-ί)
s α0ζί-α(£ΰΐ,+β+γ + δ)ζ3 + αο(αβ + αγ + χδ+βγ+βδ+γδ)ζ2
-αάβγδ+γδα + δχβ+χβγ) ζ + α0*βγδ
from which, by equating coefficients,
cc+β+γ+δ = -αι/α0,
а£+ау + а*+у?у+уЮ+у* = +a2/a0,
βγδ+γδα, + δχβ+α,βγ = -α%\α*
χβγδ = +а^
490

2]
RELATIONS BETWEEN ROOTS
In words, the sum of the roots one at a time is - ajcio, two at a time is
+ a2ja0, three at a time is —a3/a0, and four at a time is aja0. For brevity,
we denote χ+β+y + S by Σα, αβ+αγ + αδ+βγ+βδ+γδ by Σα/?, etc.;
no ambiguity arises provided we know the degree of the equation under
consideration (in this case four).
Ex. 11. The roots of the equation
2z3-3z2-5z-13 = 0
are α, /?, у. Write down the numerical values of Σα, Σα/?, αβγ.
Ex. 12. The roots of the equation
z4-4jz+l = 0
are a, /?, y, δ. Write down the numerical values of Σα, Σα/?, Σα/?γ, α/?γ£.
*£лг. 13. If the coefficients of a cubic equation are all real, it follows that
Σα, Σα/? and αβγ are all real. Explain how this can be so, even if the equation
has complex roots. Elustrate your answer by considering the equation z3- 1 = 0.
Now suppose that the general polynomial equation
α0ζη + α1ζη~1+α2ζη~2+...+αη = 0
has roots ocj, a2 a„. Then it may be shown by induction that the results
proved previously in the particular cases η = 2, 3, 4 hold generally.
Σα; = — flj/flo
Σα^-α,- = +α2/α0
Σ^α,-α^ = -α3/α0
cc1cc2cc3...ccn = (-1)иак/а0.
(Let the equation with roots ocj, a2, ..., aK-1 be
a0zn~1 + b1zn-2 + b2zn-3+...+bn_1 = 0
and consider the identity
(z-aj (a0zn-1 + b1zn~2 + b2zn-3 + ... +bn_1)
ξ α0ζη + α1ζη-1 + α2ζη-2 + ...+αη.)
Ex. 14. By considering the equation zn— 1 = 0, prove that the sum of the nth
roots of unity is zero (n > 2).
Example 3. Find the quartic equation with roots 1, 3, —2, —4.
We have Σα = - 2,
Σα/? = 3-2-4-6-12+8 = -13,
Σα/?γ = 24+8-12-6 = 14,
χβγδ = 24,
491

POLYNOMIAL EQUATIONS [23
and the required equation is
z4 + 2z3-13z2-14z + 24 = 0.
Example 4. If the roots of the equation z3 —4z—6 = 0 are ot-, β, у find
(i) Σα2, (ϋ) Σα2/?, (iii) Σα3, (iv) Σα*.
We have Σα = 0, Σα/? = - 4, α,βγ = 6:
(i) Σα2 = (Σα)2-2Σα/?
= 0 + 8.
= 8.
(ii) Notice first that Σα2/? contains six terms:
Σα2/? = a^+a^+^y+y^a+r^ + y2/?.
Consider the product (Σα) (Σα/?): each term such as a2/? occurs just once,
but the terms α/?γ appears three times, as α./?γ, β.γα, and γ.α/?. Thus
Σα2/? = ΣαΣα/?-3α/?γ
= -18.
(iii) Since α is a root of the given equation, we have
α3-4α-6 = 0
and similarly for β and γ. Adding these three results
Σα3-4Σα-18 = 0.
Thus Σα3 = 18.
(iv) By an argument similar to that adopted in (iii),
Σα*-4Σα2-6Σα = 0.
Thus Σα* = 32,
on using the result of part (i).
Example 5. Solve the equation
8z3-12z2-66z+35 =0
given that the roots are consecutive terms of an arithmetic sequence.
From the equation we have
Σα = f, Σα/? = -^, αβγ = -ψ.
Since the roots are consecutive terms of an arithmetic sequence, we take
α = a—d, β = α, γ = a + d.
492

2] RELATIONS BETWEEN ROOTS
Then Σα = За, Σχβ = 3a2-d2, giving
За = I, 3a2-d2 = -ψ.
Thus a = \, d = + 3 and the roots are -2\, \, 3\.
Example 6. Solve the simultaneous equations
x+y + z = 2,
x2+y2 + z2 = 30,
x3+y3 + z3 = 116.
Let x, y, ζ be the roots of the cubic equation
t3-at2 + bt-c = 0.
Then v
a = Σχ = 2,
b = Σχγ = ${(Σχ)2-Σχ2} = K4-30} =-13.
To find c, we make use of the identity
x3+y3 + z3-3xyz ξ (x+y + z)(x2+y2 + z2-yz-zx-xy);
с = xyz = %{Σχ3 — Σχ{Σχ2 — Σγζ]}
= i{116-2[ 30 + 13]}
= 10.
Thus χ, у, ζ are the roots of the cubic equation
t3-2t2-l3t-l0 = 0.
By observation, this has a root t = — 1:
0+1)02-Зг-10) = 0,
i.e. 0+1) 0 + 2) 0-5) = 0,
and χ = — 1, у = —2, ζ = 5 (or any equivalent permutation).
Example 7. If the roots of the cubic equation z3 + 3Hz + G = 0 are α, β, у,
form the cubic equation with roots (β—γ)2, (у—α)2, (α,—β)2.
w = (β-γ)2
=> w = Σχ2-χ2-2βγ
^> aw = α(Σα)2-2αΣα/?-α3-2α/?γ (since (Σα)2 = Σα2 + 2Σα/?),
*ад= -6tfa-a3 + 2G (since Σα = 0, Σα.β = 3tf),
^> a = 3G/(w + ЗЯ) (since a3 = - 3tfa- G)
=> 27G3 + 9HG(w + 3#)2 + G(w + ЗЯ)3 = 0 (since a3 + 3tfa + G = 0)
=> κ;3+18#κ;2 + 81#2νι; + 27((72 + 4#3) = 0.
493

POLYNOMIAL EQUATIONS [23
Thus, {β—γ)2 is a root of the equation
κ;3+18#κ;2+81#ν+27((72+4#3) = 0
and, by symmetry, so also are (γ —α)2, (a—/?)2.
Note: the equation derived in Example 7 is of importance in studying the
nature of the roots of cubic equations—see Section 4.
Exercise 23(b)
1. If α, β, γ are the roots of the equation
z3-4z2 + z + 2 = 0,
find:
(i) Σα*; (ϋ) Σα3; (iii) Σα*; (iv) Σα*/?;
(ν) Σα2/?*; (vi) (α+ 2) (/?+ 2) (у + 2); (νϋ) Σ1/α;
(viii) Σα/?(α2 + /?2); (ix) Σ1/α/?; (χ) Σ(α+/?)*.
2. If α, /?, γ, δ are the roots of the equation
z4-4z3-z + 9 = 0,
find:
(ί)Σα*; (ϋ)Σα*/?; (iii) Σ1/α; (iv) Σα3;
(ν) Σα3/?; (vi) Σ(α+/?+γ)*; (νϋ) Σ(α+ 1) (β+ 1).
3. If α, /?, γ are the roots of the equation
z3+pz+q = 0,
find:
(i) Σα4; (ii) Σ1/α4; (iii) Σ(α+/?-2γ).
4. Solve the equation
-3z2-23z+12 = 0,
given that its roots are successive terms of an arithmetic sequence.
5. Solve the equation 32z3+48z2 + 6z_5 = 0>
given that its roots are successive terms of an arithmetic sequence.
6. Solvetheequation 24zs_38z2+19z_3 = 0>
given that its roots are successive terms of a geometric sequence.
7. Solve the equation
-5z + 50 =
given that one of the roots is twice a second root.
8. Solve the equations:
x+y+z = -4,
yz + zx+xy = 1,

2]
RELATIONS BETWEEN ROOTS
x+y + z = 1,
x2+y2 + z2 = 29,
x3+y3+z3 = -29.
лг+^ + z = 4,
λτ2+^2 + ζ2 = 14,
x3 + >,3 + z3 _ 34
11. The roots of the equation:
ax3+3bx2+3cx + d = 0
are in arithmetic progression. Express с in terms of a, b and d. (O & C)
12. Prove that, if the cubic equation
ax3 + bx2+cx + d= 0
has a pair of reciprocal roots (that is, α and 1/a), then
a2—d2 = ac—bd.
Verify that this condition is satisfied for the equation
6х3+Пх2~2Ах~9 = 0
and hence, or otherwise, solve the equation. (O & C)
13. If α, β, γ are the roots of the equation
z3+pz+q = 0,
form the equations with roots:
(i) α2, β2, γ2; (ii) fly, γα, α/?;
(iii) /? + γ, γ + α, α+/?; (ίν) β + γ-οί,γ + α-β, α,+β-γ.
14. If α, /?, γ are the roots of the equation
z3+pz+q = 0,
form the equations with roots:
(i) α3, β3, γ3; (ii) α(β+γ), β(γ+ά), γ(α,+β).
15. If α., β, γ are three non-zero complex numbers such that α,+ β+γ = 0,
prove that α2-βγ = β2-γα, = у2—αφ.
16. Given that α, /?, γ are the roots of the equation
л:3+.рл:2 + 9л: + г = О,
and that a = fiy—a,2,b = γα—β2,ο= α-β—y2, prove that
a = poc + q, b = pfi + q, c = py + q.
Hence, or otherwise, prove that
ax + bfi+cy = (a + b + c)(a+fi+y). (O&C)
17. If α, β, γ are the roots of the equation
x3+px2+qx + r = 0,
495
9. Solve the equations:
10. Solve the equations:

POLYNOMIAL EQUATIONS [23
find the equations whose roots are:
(i) α2-/?γ, /?2-γα, γ2-α/?; (ii) /?+γ-2α,γ + α-2/?, α+β-2γ.
What can be said about a, /?, γ if
(i) q3 = pV; (ii) 2p3~9pq+21r =0? (O &Q
18. By considering the product (/?+γ) (γ + α) (μ+β), find a necessary and
sufficient condition that two of the roots of the equation
z3+pz2+qz+r = 0 HO)
should be equal in magnitude but of opposite signs.
3. THE CUBIC EQUATION
For practical purposes an algebraic solution of the general cubic is of
limited value since, in practice, only approximate numerical solutions are
required and these are best found by the iterative methods to be described
in Chapter 26. Nevertheless, a brief survey of the method of attack seems
worthwhile on at least three counts: having seen a solution to the general
quadratic it is natural to wonder whether a similar method holds for the
cubic; such a question did occur to mathematicians of the past and the
study of the cubic is of considerable historical interest; finally, the question
whether general polynomial equations are soluble leads one to some of the
central ideas of modern mathematics.
The general cubic equation was first solved in the sixteenth century by
Tartaglia and published by Cardan in one of the most famous of all acts
of mathematical plagiarism. (See Cardan: the Gambling Scholar by O. Ore
for a biography of this unsavoury yet fascinating character.) The general
quartic soon followed (Ferrari) but, as has been pointed out already in
Section 1, the general quintic evaded all attempts at solution, for the reason
discovered independently by Abel and Galois in the nineteenth century.
An account of the Italian mathematicians of the sixteenth century whose
work we have referred to will be found in History of Mathematics by
D. E. Smith, Volume 1, Chapter 8.
We shall take the general cubic in the form
a0z8 + 3a1z2+3a2z + ai = 0. (1)
If we apply the transformation ζ -> (w—a-^ja0 we have
a0z3 + 3a1z2 + 3a2z + a3 = 0
=> alz3 + 3a1a\z2 + 3a2a0a0z + azal = 0
=> (w - aj3 + За^и; - аг)2 + 3a2a0(w - a-,) + a3 a\ = 0
^> w3 + 3w(a2a0 -af) + (2a? - ЗодС^ a2 + a3al) = 0
=> w3+3Hw + G = 0, where Я = a2a0-a\, G = Ъа^ЗооО^а^ + а^.
496

3] CUBIC EQUATION
Thus any cubic equation may be reduced to
w3 + 3Hw + G = Q (2)
and we shall, from now on, regard this as the general cubic equation which
it is our intention to solve.
Ex. 15. Show how to reduce the equations:
(i) x3- 9x2 + 22л:- 3 = 0, (ii) 2л:3 + 6л:2+ 1 Ox- 1 = 0,
to the standard form x3 + 3Hx + G = 0.
If the roots of (1) are zlt z2, z3 and the roots of (2) are κί, h>2, w3, then the
geometrical interpretation of the transformation z->(w—aj/flo is that it
transforms the triangle z1z2zs into the similar triangle \νχ\ν2\νβ which has its
centroidat the origin (и>! + и>2+и>3 = 0). (See Figure 23.2.)
Fig. 23.2
Ex. 16. Why are the triangles ZiZ2z3 and w1w2w3 similar?
Cardan's solution of (2) consists in expressing the roots as the sums of
pairs of complex numbers lying at the vertices of two equilateral triangles
with centroid at the origin. (There is no 'completing the cube' process in
general and thus it is not possible to find a further transformation that
maps the triangle и^и^и^ into an equilateral triangle.)
Since
w3-3wpq-(p3+q3) = (w-p-q)(w-a)p-u?q)(w-u)2p-<1>q)
(see Exercise 19(6), Question 12) we have
w3 — 3wpq — (p3+q3) = 0
о w = p+q or w = b)p + o)2q or w = ω2ρ + ως. (3)
Now write -pq = H, -(p3+q3) = G; (4)
497

POLYNOMIAL EQUATIONS [23
if we can find values p, q satisfying (4) then we shall have a complete
solution of the cubic equation.
From (4), p3 and q3 are the roots of the quadratic equation
ZZ + GZ-H3 = 0. (5)
Let α, β be the roots of the quadratic equation (5). Take ръ q1 as any
cube roots of α and β; then
Α+£ = α+β= -G,
p\ql = *β= -Η3.
If Я = 0, the original cubic is immediately soluble; if Я Ф 0, then
giving && = e,
where e = 1, ω or ω2, and^ and qje are suitable values to take for;? and q.
In summary, to solve the equation
ц;3 + ЗЯц; + С7 = 0 (Я + 0),
let α, β be the roots (repeated if necessary) of the quadratic equation
z2+Gz-IP = 0,
and let ρ = ca,q = fie where e = oaftl(-H). Then
p+q, ωρ + ω% aj^p + ajq (6)
are the required roots for the cubic equation.
For
(w-{p+q)){w-{o)p + u)*q)){w-{o)*p + o)q)) = w3-3wpq-(p3+q3),
and p, q have been so chosen that
pq=—H, p3+q3 =-G.
Example 8. Solve the equation z3 — 6z + 6 = 0.
We seek p, q such that „ „ ,
p3 + q3=-6,
pq = 2.
Thus, p3 and q3 are the roots of the quadratic
Z2 + 6Z+8 = 0,
498

CUBIC EQUATION
from which it follows that^3 = 2, q3 = 4 = 22.
Thus z3-6z + 6 = 0
=> ζ3 + (2*)3 + (2Ϊ)3-3ζ(2*) (2?) = 0
^> (z + 2* + 2*) (ζ+ω2» + ω2ΐ) (ζ + ω22* + ω2?) = 0
^>z=-2*-2i or -ω2*-ω22ΐ or -ω22ϊ-ω2?
^>z=-(2* + 2t) or {(2* + 2t)±j(2t-2*)V3}/2.
*Ex. 17. Show that w= Z-(H/Z) transforms equation (2) into
Z°+GZ3-H3= 0
the roots of which are ρ, ωρ, οβρ, q, o)q, ω\.
*Ex. 18. With the notation of Ex. 17, show that, to each w correspond two
values of Ζ and that to the triangle κί κί w3 formed by the roots of (2) correspond
two equilateral triangles.
Suppose now we have determined ρ and q as two complex numbers.
Then, by (6), we have
p+q, \ν2 = ωρ+ώ
w3 = rfp + uq.
(See Figure 23.3.)
Ex. 19. Show that κ>2, w3 may be found by constructing equilateral triangles on
—p, —q as base.

POLYNOMIAL EQUATIONS
[23
From now on we shall suppose that the coefficients of (2) are real: then
either one root is real and the other two are conjugate (complex) numbers,
or all three roots are real. From (5), since G, Η are real, either^3 or q3 are
both real or else p3 and q3 are complex conjugates, in which case we must
have \p\ = \q\.
Case (i). p3 and q3 both real (and unequal)—in which case we may take
ρ and q as both real and the geometrical construction (Figure 23.4) shows
that, of the solutions of (2), one is real and two are complex conjugates.
Case (ii). p3 and q3 are complex conjugates (and thus p, q may be taken
as complex conjugates). In this case, we must have \p\ = \q\ and the
geometrical construction (Figure 23.5) shows that all three roots of (2) are
real.
Fig. 23.4
Since this exhausts all the possibilities, we have arrived at the famous
paradox that, if only one root of (2) is real, then;? and q are both real, but,
if all three roots of (2) are real, then they arise from complex values of ρ
and q {the irreducible case). Put another way, if all three roots of a cubic
equation are real then the working of Cardan's solution necessarily
involves complex numbers ρ and q at the intermediate stages.
Ex. 20. Explain geometrically how (i) a root of multiplicity two, (ii) a root of
multiplicity three, arise.
e expression д = ^ _ ^ ^ _ ^ ^ _ ^2
500

3] CUBIC EQUATION
is called the discriminant of the equation
w3 + 3Hw + G = 0.
FromExample?, Δ = -27(G" + 4/7»).
Consider now the various cases that may arise (remember that we are
considering cubic equations with real coefficients).
(i) One real root a, two conjugate complex roots β+ ')"/.
A = (2/r)2(a-/?-Jy)2(/?-Jy-*)2
= _4γ2(_α2_^2_72+2α^)2
,-тС
~'\^
τ
\ I
\ 1
\Щ
\P
ys
/1
(ii) A multiple (real) root. д _ η
(iii) Three unequal real roots
Δ > 0.
Thus, the nature of the roots determines the sign of Δ.
*Ex. 21. Show, conversely, that the sign of Δ determines the nature of the roots
of the real cubic equation w3 + 3Hw + G = 0.
The case Δ > 0 leads to three real solutions via complex values of ρ
and q: the consequent working in Cardan's solution is frequently rather

POLYNOMIAL EQUATIONS [23
tedious, but can be avoided in this case by employing the trigonometric
solution illustrated in Example 10, which employs the identity
cos 3(9 ξ 4 cos3 0-3 cos Θ.
(See p. 244.)
Example 9. Solve the equation z3 — 6z + 4 = 0.
Here Я = -2, G = 4 and Δ > 0: there are three real roots. Put
ζ = 2(2)i cos Θ.
16^2 cos3 Θ-12^2 cos θ = -4
4 cos3 0 — 3 cos θ = — ^
cos30 = -Vi,
giving θ = \π or -γ^π or fytf and ζ = 2 or 2* cos -Д-7Г or 2* cos \\π.
*Ex. 22. Show that if Δ > 0, the substitution ζ = 2(-#)*cos<9 will always
yield the solution of the cubic equation z3 + 3Hz + G = 0.
Solve the cubic equations in Questions 1-5, using a trigonometric substitution
if possible and otherwise employing Cardan's solution.
1. z3-2z+4 = 0.
2. 4z3-5z + 6 = 0.
3. z3-3z-l = 0.
4. 12z3-9z-2 = 0.
5. z3-9z + 12 = 0.
6. Reduce the equation ^ fe2+ lfe_ ^ = Q
to the form z3 + 3ffz + G = 0, and hence solve it completely.
7. Show that the equation ,,_,,,_ , , „
az3 + 3bz2 + 3cz + d = 0
is reduced to the equation , ., , 4,
p(w~q)3 = q(w-pY
by the transformation w = az + b where p,q are the roots (supposed unequal)
of the equation
(ac - b2) A2 + (А/- ЪаЬс + 2b3) λ - (ас - ό2)2 = 0.
Solvetheequation za_15z2 + 57z_5 = 0. (O & Q
8. Solve the equation z3 + 5z2 + 5z + 4 = 0.
502

3] CUBIC EQUATION
9. Solve the equation z3 + 5z + 2j = 0.
10. Iip,q are real, find the turning points (if any) of the graph of у = x3+px+q.
Deduce a necessary and sufficient condition for all the roots of the equation
x3+px+q = 0
to be real.
Prove that the roots of the cubic equation det (A—Al) = 0 where
A =
В
are all real. Can two of the roots be equal?
11. (The Quartic Equation.) Show that the general quartic equation
z* + 4pz3 + 6qz2 + 4rz + s = 0
can be reduced to the form „ ,
wi + aw1 + bw+c = 0.
By identifying this equation with the equation
(w> + ay = β(νν+γ)*
show that α is a root of the cubic equation
8a3-4aa2-8ca+4ac-62 = 0.
If one of the roots of this equation is a,, and the corresponding values of/?, γ
are β а, У a, show that
w* + aw2 + bw+c = (w2-β\\ν+α0~γαβΙ) (w2+β^+α,,+ΎοβΙ).
Solve the quartic equation . „
xl + Ax-\ = 0.
Miscellaneous Exercise 23
Is or otherwise, that ;
:3 + bx2 + cx + d= 0
1. Prove, by graphical methods or otherwise, that a cubic equation (with real
coefficients) of the form
has at least one real root.
Prove that, if A is real, the equation
χ3-3χ* + 3χ+λ = 0
is satisfied for one and only one real value of л:.
2. By considering two alternative expansions of the expression
\η{(1+χχ)(1+βχ)(1+γχ)}
(ос, β, γ all real), prove that
&+βι+γι = pl-4p2q + 4pr+2qi,
where α, β, γ are the roots of the equation
x3+px*+qx + r = 0.
3. In the equation . , ,„ „ ,
xi + ax3~20x2 + bx~516 = 0,

POLYNOMIAL EQUATIONS [23
a and b are to be chosen so that two of the roots are equal and the sum of the
other two roots is zero.
Find all possible values of a and b, supposing them chosen so that the roots
of the given equation are all real. (O & Q
4. Find the value or values of a for which the roots of the equation
2х* + 6х* + 5х+а = 0
are in arithmetical progression. (O & Q
5. Given that — 2+jk is a root of the equation
2xl + 8x3+Ux2 + 4x+S = 0,
find к and the other three roots of the equation.
6. Given that α, β, γ are the roots of the equation
x3-px2+qx-r = 0,
and that sn = α.η+βη+γη, prove that
Si = P, s2 = p2—2g, s3 = i"2—qsx+Ъг.
Calculate s2 and s3 for the equation
χ3-12x4 30л:-20= 0.
Show that (s2)i and (j3)* are both near to 9 and verify that 9 approximates to a
root of the equation. (O & C)
7. If ω is a root other than 1 of the equation ω7 = 1, prove that the other roots
are ω2, ω3, ω4, ω5, ωβ.
Prove that 1+ω + ω2 + ω3 + ω4 + ω5 + ω6 = 0.
If α = ω + ω6, β = ω2 + ω5, γ = ω3+ωι, prove that the equation with roots
a>/?> 7 is „ л
'H' ' z3 + z2-2z-l = 0.
Hence, or otherwise, find the values of
(i) cos \n+cos \n + cos fπ. (ii) cos fyr cos |я cos fn-. (O & C)
8. When f{x) and^(x) are given polynomials having no common factor, prove
that the values of the constant к for which the equation
f(x)~kg(x) = 0
has a repeated root are given by/(a)/^(a) where a is a root of the equation
f(x)g'(x)~f'(x)g(x) = 0.
Hence, or otherwise, find the values of & for which the equation
л:3-Зл:2 + ЗЬ:-1 = 0
has a repeated root, and find also all the roots of the equation for each case.
(O&C)
9. Discuss the reality of the solution of the equation
2л:3 + Зл:2-36л: + А: = 0
for different real values of k.
504

MISCELLANEOUS EXERCISE 23
10. Iff[x) is a polynomial in χ which has a factor (x—oCf, prove that (л:—a) is а
factor of/"(*)· Prove conversely that, iff{x) and f\x) have a common factor
x-cc, then (лг-α)2 is a factor off(x).
Solve the equation ^^^^«^ = 0
given that it has a repeated root. (O & C)
11. If α, β, γ are the roots of the equation
x3+px+q = 0,
find the equation with roots β/γ + γ /β, γ/α + α/γ, α/β + β/α. (Ο & C)
12. The sum of the Mi powers of the roots of the equation
xt+px^+qx+r = 0
is denoted by sk. Prove that
j2 = -2p, s3 = -3q.
sn+i+psn+2+qsn+1+rs„ = 0 (n > 0),
and deduce that st = 2p*-4r, s7 = 7q(r-p%).
Given that the roots a, b, c, d of the equation are all real and that s, = 0,
prove that q = 0 and deduce that
(a+b)(a+c)(a+d)= 0. (O & Q
13. Show that, if ζ = 2(-#)*cos Θ, where ζ is a root of the cubic equation
z4-3tfz + G=0,then cos3, = V(_G2/4JP).
Deduce that, if Η < 0 and Ga+4#3 < 0 the cubic equation has three real
roots.
By using the expression ^ ^ = ^ + e_^
solve the equation when G2 + AIP > 0,
treating separately the cases ff < Q ^ ff>Q_

24. Vector products and their
applications
1. THE VECTOR PRODUCT
We have already seen, in Chapter 11, that it is frequently useful to find a
vector which is perpendicular to two given vectors; for example, to find
the equation of the plane through three given points А, В, С it is sufficient
to know the unit normal vector; that is, a unit vector perpendicular to
both AB and AC. We therefore introduce a new method of combining two
vectors a and b to form a third vector perpendicular to both of them and
called their vector product, written а л b (or a x b). It is defined by
a Ab = |a||b| sin 6>έ,
where θ is the angle between a and b (see
Chapter 11) and t is a unit vector perpendicular
to both a and b and with sense that makes a,
b, с (in that order!) a right-handed triple (see
Chapter 3). The relative positions of a, b,
and а л b are shown in Figure 24.1 where a,
а л b lie on the page and b points into the
*Ex. 1. Showthata Л а = 0. 6'
*Ex. 2. Show that а Л Ab = A(a Л b).
*Ex. 3. Show that, if a and b are parallel, then a A b = 0.
*Ex. 4. Show that, if a A b = 0, then either a and b are parallel or one or other
or both of a, b is zero.
*Ex. 5. Show that the vector product is anti-commutative; that is, that
aAb = -bAa.
*Ex. 6. Verify that iAi = jAj = kAk=Oand that
jAk=-kAj = i, kAi=-iAk = j, iAj=-jAi = k.
Ex. 7. a is a vector of magnitude 2 units pointing due east ;b has magnitude 3 units
and points N60°E. с is a vertical unit vector. Find a A b and b A a.

1] VECTOR PRODUCT
So far, apart from the result of Ex. 2, the vector product appears
remarkably dissimilar to the product of ordinary numbers. However, there
is one rule which vector products obey and which, in a sense, justifies the
use of the word 'product': it is distributive over vector addition; that is
а л(Ь + с) = (а лЬ) + (а л с).
We shall deduce this very important result in a manner reminiscent of
that used to prove the analogous result for scalar products:
a.(b + c) = (a.b) + (a.c).
In Chapter 11, we saw that the scalar product of a = OA with a unit
vector Й could be interpreted geometrically as the (scalar) projection of a
Fig. 24.2
in the direction Й and it was this fact that enabled us to deduce the
distributive law for scalar products. Consider the plane, П, perpendicular to fi
and through O; let the foot of the perpendicular from A to Π be D; then
the vector OD is called the (vector) projection of a on the plane Π (see
Figure 24.2).
It follows from the definition of the vector product that
OD= |алй|,
but OD Φ а л Й,
since OD and а л Й are perpendicular to one another.
As in Chapter 11, Section 1, the vector projection of a on Π is unique
and, ifthe projection of a! and a2 on Π are ΟΌ1 and OD2, then the projection
of OB = a! + a2 is ОС = OD! + OD2 (see Figure 24.3 and consider the
mid-point of Λ^2).
In words, the (vector) projection, in a plane, of the sum of two vectors is
the sum of the separate projections of the two vectors.
507

VECTOR PRODUCTS
[24
Now consider й л (a! + a2): from what has just been demonstrated,
|й л(а!+а2)| = |(й ла^ + СЙ ла2)|.
But й л (ах + а2) is perpendicular to the plane О ВС and thus to ОС, й л at
is perpendicular to OD^ й л a2 is perpendicular to OD2 and thus, by
rotating the parallelogram OD-^CD^ through a right angle in the plane П,
it follows that . . ч . .
й л (а! + а2) = й л а!+й л а2.
Fig. 24.3
*Ех. 8. By writing а = Ai, deduce that
аЛ(Ь + с) = (аЛЬ) + (аЛс).
Show also that (b+c) Л a = (b Л а) +(с Л а).
With the distributive rules behind us, it is an easy matter to express the
vector product а Л b in terms of components in three mutually
perpendicular directions i, j, k. If a = a!i+a2j+a3k andb = ^i+^j + ftgk, then
a Ab = Oii+aJ + азк) л (^i + ftj + ^k)
= (a2b3-a3bj i +(<%6i-ai6s) j +{а1Ьг- а2Ъд к,
on repeated application of the distributive rule and using the results proved
in Ex. 6. The result is best remembered written formally as a determinant:
j к |
bi b2 b3 I
Ex. 9. If a = i+2j + 3kandb = ί-j-k, find а Л b.
Since |a Ab| = |a| |b| sin Θ, we see that the magnitude of the vector
product gives us twice the area of AOAB (see Figure 24.4). The vector
508

1]
VECTOR PRODUCT
i(a л b) is often referred to as the vector area of AOAB. (Notice that the
vector area of AOAB is minus the vector area of AOBA.) More generally,
the vector area of the triangle ABC is i(AB л AC).
Ex. 10. OABC is a tetrahedron; prove that the sum of the vector areas of the
triangles OBC, OCA, OAB is the vector area of the triangle ABC.
Fig. 24.4
Example 1. ABCD is a plane quadrilateral whose sides CD, В A intersect
at O; if P, Q are the mid-points of the diagonals AC, BD, prove that the
area ofAOPQ is one-quarter of the area of the quadrilateral ABCD.
О d С
Fig. 24.5
Take origin О and a = OA, b = Ла, с = ОС, d = μο. We have
Ρ = i(a + c),
q = i(Aa + /*c),
and the vector area of OPQ
= ipAq
= l(a + c) л(Ла + /*с)
= 1(/<-Л)(алс)
on using the distributive and anti-commutative rules. But since ABCD
is a plane quadrilateral, its area is the magnitude of the sum of the vector
509

VECTOR PRODUCTS [24
areas of AADB and ABDC (notice that we letter the triangles in the same
sense). Thus
ABCD = i(d лЬ+b ла + а Ad)+i(d лс + с лЬ + Ь Ad)
= ^(bAa + aAd + dAc + cAb)
= MaA(d-b)-cA(d-b)]
= i(a-c)A(d-b)
= i(a-c)A(/*c-Aa)
= i(/*— λ) (a A c), and the result follows.
Ex. 11. The position vectors, relative to some origin O, of the vertices of a
convex polygon Р^РгРъ ...P„ are denoted by Pi, p2, p3,..., p„. Extend the definition
of the vector area of a triangle to the vector area of a convex polygon and show
that the vector area о{РгРгР3...Рп is
i(Pi Λ ρ2 + ρ2 Λ ρ3+ ... + Ρ„-ι Λ ρ„ + ρ„ Λ Pl).
Prove the result of Example 1 by considering the vector areas of ABCD and
OPQ.
2. PRODUCTS OF THREE VECTORS
If from two vectors, b and c, we form the vector product b л с, we may
multiply a by this new vector in two quite different ways, to give either a
scalar or a vector:
(i) the scalar triple product, a.fb л с);
(ii) the vector triple product, а Л (Ъ л с).
The component form for a. (b л с) is easily found. Recall that
X У = Х1У1 + Х2У2 + ХгУг
«li + fly + ^k and
b Ac =
а.(Ьлс) =
i
bi
Cl
ai
bi
Cl
J к
ь2 h
c2 c3
a2 a3
b2 b3
c2 c3
*Ex. 12. Prove that a.(b A c) = b.(c л а) = с. (а Л b) (that is, the scalar triple
product is unaffected by cyclic interchange of the letters) and that
a.(bAc) = (a Ab).c
(the . and A may be interchanged).
510

2] PRODUCTS OF THREE VECTORS
The scalar triple product a. (b л с) has an interesting and important
geometrical interpretation. Suppose Й is perpendicular to the plane OBC
(see Figure 24.6). Then b Л с = 5Й, where S is the area of the
parallelogram OBDC. Thus
а.(Ьлс) = (а.Й)^
= volume of the parallelepiped formed by a, b, с
Fig. 24.6
Ex. 13. Show that a.(b л с) = 0<> О, А, В, С, lie in a plane.
Ex. 14. Prove that x.(y Λ ζ) = (χ Λ у).ζ without using components (and thus
without assuming the distributive law аЛ(Ь + с) = аЛЬ + аЛс).
Ex. 15. The fact that χ. (у Λ ζ) = (χ Λ у). ζ may be used to verify the distributive
law: define d = a Λ (b + c)-a Λ b-a Л с and form the product u.d, where u
is an arbitrary unit vector. Show that u.d = 0. Why does this imply that d = 0?
*Ex. 16. Show that the volume of the tetrahedron О ABC is |a. (b Л с).
*Ex. 17. Prove that, if any one of the vectors a, b, с is a scalar multiple of one
of the other two, then a. (b Л с) = 0.
The vector triple product а л (b л с) is a little more difficult to express
in component form. First observe that, if a is perpendicular to b and also
perpendicular to c, then it is parallel to b л с and thus а Л (b л с) = 0;
let us assume that none of a, b, с is zero and that a is not perpendicular to b
and thus that a.b φ 0. Again, if b = Ac, а Л (b л с) = 0; let us assume
that b φ Ac. Since b л с is perpendicular to both b and с and since
а л (b л с) is perpendicular to b л с it follows that а Л (b л с) lies in the
plane of b and с Thus, а л (b л с) can be expressed uniquely as λЪ+μc;
we have to determine A and μ. By Ex. 17,
а. [а л(Ь л с)] = 0,
.·. A(a.b)+/i(a.c) = 0,
.·. * = -^^ (since a.b Φ 0),
μ (a.b) ^ h
511

VECTOR PRODUCTS [24
and thus а л (b Λ c) = v[(a.c) b-(a.b) c].
To find v, consider the i component of both sides:
<h{b\ c2—b2c^ — a3{b3 c-t-b-L c3)
= v[(axcx + a2c2 + a3c3) bx-(axbx + a2b2 + a3Ьэ) cj,
.·. 6i(a2C2 + a8cs)-ci(«2*2 + as*s) = ^Ь^^ + ^с^-с^а^-а^].
.·. ν = 1.
Thus we have finally
а л(Ь Л с) = (a.c)b-(a.b) с.
*Ex. 18. Prove that the vector triple product is not associative; that is
а Л (b Л с) Ф (а Л b) Л с.
Ex. 19. Prove that а Л (b Л c) + b Л (с Л а) + с Л (а Л b) = 0.
Exercise 24(a)
1. Prove that (a-b) Л (a + b) = 2a Л b and interpret the result geometrically in
terms of areas.
2. What can you deduce from the equation гЛа = гЛЬ?
3. Prove that a + b + c =0*ЬЛс= сЛа = аЛЬ and interpret the result
geometrically.
Is it true that ЬЛс = сЛа = аЛЬ* a + b + c = 0?
4. Find а Л b in the following cases:
(i) a= 3i-j + k, b= 2i+j + k; (ii) a= 5i + 4j + 3k, b= 3i + 4j + 5k;
(iii) a = i-k, b = j + k; (iv) a = 2i-j-k, b = 2i+j+k;
(v) a = ai + ck, b = b].
5. If a = 3i+j + 2k, b = i + 2j + 3k, с = i-3j-4k, find а.(ЬЛс). What can
you deduce about the points А, В, С?
6. If a = i+j, b = 2i+j, с = i + k, form the vector products а ЛЬ and
(а Л b) Л с and mark the position of the five vectors a, b, с, а Л b, (а Л b) Л с
in a rough sketch.
7. The points А, В, С have position vectors a = i-4j + 3k, b=2j-3k,
с = 3i- 6k. Find the unit vector perpendicular to the plane OAB and deduce
the perpendicular distance from С to this plane.
8. With the notation of Question 7, find the area of b.OAB and hence the volume
of the tetrahedron О ABC.
9. If А, В, С are non-collinear points with position vectors a, b, c, prove that
the vector ЬЛс + сЛа + aAbis normal to the plane ABC.
If d is the position vector of the point D and a = 5i+j + k, b = i+j + 3k,
с = 3i + 4j-k, d = 8i + 3j + 2k, find:
(i) the unit normal to the plane ABC;
(ii) the area of the triangle ABC;
(iii) the volume of the tetrahedron ABCD.
512

PRODUCTS OF THREE VECTORS
«i (h a3\
i Ui h b,\ cubic units-
I ct c2 c3 I
If d = i^i + iy + i^k, write down the volume of the tetrahedron ABCD.
What is the condition that A, B, C, D should be coplanar. Is this a necessary
and sufficient condition?
11. If a, b are perpendicular vectors and if г Л а = b, prove that
г = Ла + /«(аЛЬ),
where λ may be chosen arbitrarily. Find μ in terms of |a|.
12. If /t... m3 are real numbers such that
l\ + ll+ll = ml + ml + ml = 1 and /^Ч- /2w2 + l3m3 = 0
prove that (/2 m3-l3m^2 + (l3 тг - /ι w3)2 + (/i w2 - /2 w^2 = 1.
13. Prove that (aAb).(cAd) = (b.d) (a.c)-(b.c) (a.d). What trigonometric
identity follows if O, A, B, C, D are coplanar and the angles AOB, COD are both
right angles?
14. The vectors a = OA, b = OB, с = ОС are non-coplanar; prove that a
given vector г may be expressed uniquely in the form
г = ab Л с+/?с Л а+уа Л b
and find a, /?, γ in terms of a, b, с and r.
Ifr = OP, locate Ρ in terms of О, А, В, С: (i) if α = β = γ = 1; (ii) if α = О,
/9 = 7=1.
15. Points β, R are taken on the sides CA, AB {not produced) respectively of a
triangle ABC in such a manner that the triangles ABQ, ARC are equal in area.
Prove that RQ is parallel to ВС.
16. G is the centroid of the triangle ABC, and E, F are the mid-points of CA,
AB respectively. Prove that the quadrilateral AFGE and the triangle GBC have
the same area.
17. ABCD is a skew quadrilateral; the mid-points of the opposite sides ВС, DA
are P, Q, while the mid-points of the diagonals AC, BD are R, S. Prove that
PRQS is a parallelogram whose area is i\&ABC-AABD\.
18. ABC is a triangle and points L, M, Ware taken on ВС, CA, AB respectively
such that
BL _ CM _ ЛЛГ _
LC ~ ' МЛ ~ μ' ΝΒ~ V'
Prove that ALMN = (1 + λ/tν) ДЛВС and deduce the theorem of Menelaus, that,
iiLMN is a straight line, A^v = - 1.

VECTOR PRODUCTS
[24
19. The sides (and sides produced) of the parallelogram ABCD separate the plane
into nine regions (see Figure 24.7). Prove that, if a point О is taken within
region I, II, VI or IX the area of О AC is the sum of the areas of OAB and
О AD but that, if О is taken within any other region, the area of О AC is the
difference of the areas of OAB and О AD.
(The reader who has studied mechanics may be able to interpret this result in
terms of moments.)
Fig. 24.7
20. ABCD is a tetrahedron. The lengths of AB and CD are a and b and the angle
between AB and CD is Θ. A plane is drawn parallel to AB and CD and meets the
edges CA, CB, BD, DA at P, Q, R, S. Prove that PQRS is a parallelogram and
that its area is
where χ is the distance of the plane from AB and h is the distance between AB
and CD.
Hence, or otherwise, prove that the volume of the tetrahedron is
\abh sin Θ.
3. APPLICATIONS TO COORDINATE
GEOMETRY
Throughout this section we shall assume that an origin О and a right-
handed set of coordinate axes Ox, Oy, Oz specified by the unit vectors
i, j, к are given. The position vector of the point A... with respect to О will,
as usual, be denoted by a ....
Consider first the line through A in the direction of the unit vector u.
If R is any point on this line, then AR = λύ and thus
ARaU = 0,
giving the equation of the line in the form
(r-a) ли = О.
514

3] APPLICATIONS TO COORDINATE GEOMETRY
If г = xi+yj+zk, a = axi+OiJ+a^k, й = Γι+mj+nk, we can transform
this to (or from) the familiar coordinate form as follows:
(r-a) Ли = 0
=> [(x-ai)i + (y-a2)i + (z-a3)k] А(П + т\ + пк) = 0
o(y-a£n-(z-(Qm = {z-a^l-{x-a^n = {x-a^)m-{y-a2) I = 0
χ—αχ _ y—a2 _ z—a3
^ I ~ m ~ η "
Fig. 24.
The distance from the point В to the line, the length BN (see Figure
24-8».'s |(Ь-.)лО|.
If В is the point (bu b2, b3) this gives
BN = \[(b2-aj n-(b3-a3) m] 1 + Wb-aJ l-ih-a^ n] j
+ [(b1-a1)m-(b2-a2)[\k\.
Ex. 20. Show that the perpendicular distance from the point (3, 3, 2) to the line
x-l _y-2_z-3 .
/83
1S Jt4-
The vector product is particularly useful in giving us the vector
perpendicular to two given vectors. For example, consider the plane through the
(non-collinear) points А, В, С The vector
d = AB л AC
is perpendicular to this plane. Thus
d = (Ь-а)л(с-а)
= b лс + с ла + а лЬ
and the equation of the plane is now found easily.

VECTOR PRODUCTS
[24
Example 2. Find the equation of the plane through the points A(l, 1, —3),
B(2, -1, 10), C(-l,-l, 1).
AB= i-2j+13k,
AC=-2i-2j+4k,
AB лАС= 18i-30j-6k,
and thus the vector 3i—5j —к is perpendicular to ABC Since A lies in the
plane the equation of ABC is therefore
3(x-l) -5(y-l)-(z + 3) = 0
which gives, on simplification,
3x-5y-z = 1.
*Ex. 21. If г = xi+y) + zk is the position vector of the point Ρ show that the
volume of the tetrahedron Ρ ABC is
1(г-а).[(Ь-а)л(с-а)].
(See Ex. 15.)
Hence obtain the equation of the plane ABC of Example 2 in determinant
form, and verify that the expansion of this determinant gives the same equation
as that obtained above.
Example 3. Find the equation of the plane through the point (2, — 5, — 1)
perpendicular to the line of intersection of the planes
x + Jy + z = 0, x-5y-2z = 0.
The line of intersection is perpendicular to both of the vectors
i+7j+k and i—5j —2k
and thus its direction is given by
(i+7j+k) A(i-5j-2k) = -9i + 3j-12k.
Thus 3i — j + 4k is normal to the required plane, whose equation must be
3(x-2)-(y + S) + 4(z+l) = 0
or Ъх-y + Az = 7.
Example 4. Find the length of the common perpendicular to the two skew lines
x y+l z + 2 x-2 y-6 z-Ъ
1~ 2 ~ 4 ' -1~ 4 ~ 2 ■
The direction of the common perpendicular is given by
(i + 2j+4k) A(-i + 4j + 2k) =-12i-6j + 6k.
516

3] APPLICATIONS TO COORDINATE GEOMETRY
A unit vector in the direction of the common perpendicular is thus
fl = (2i+j-k)A/6.
The lines pass through the points A(Q, — 1, — 2), B(2, 6, 3) respectively and
we have the length of the common perpendicular equal to the projection
of AB in the direction u
= (b-a).U
= (2i + 7j + 5k).(2i+j-k)/V6
= (4+7-5)/V6
The vector product may be used to obtain an alternative method of
solving three linear simultaneous equations. Consider the three equations
3x + 5y + Sz=-l Pu
2x + 3y + 4z = 3 P2,
4x+2y + 5z =-2 P3.
These three equations represent three planes, Pu P2) P3; their solution
consists in finding the point Q common to each plane (assuming such a point
exists).
From equations P1 and P2 we obtain (3PX +P2), the equation of the plane
through the line of intersection of Ръ Р2 containing the origin. Thus
Ux + 18y+28z = 0 Pi
is a plane containing О and Q. Similarly, ДО-Д),
2x + 8j + llz = 0 P5
is a plane containing О and Q.
Thus OQ is perpendicular to both 1 li +18j +28k and 2i + 8j +1 Ik; that is
OQ = A(lli + 18j428k) A(2i + 8j + llk)
= -13A(2i + 5j-4k),
and Q has coordinates (—26λ, — 65λ, 52λ) where Λ is some scalar. Writing
— 13λ = μ, Q has coordinates (2μ, 5μ, —4μ). Substituting in Ръ we find
that μ = 1 giving Q as the point (2, 5, — 4). The solution of the given set
of equations is thus „ , .
n χ = 2, у = 5, ζ = — 4.
517

VECTOR PRODUCTS [24
Ex. 22. Use the method outlined above to find a necessary and sufficient
condition for the existence of a non-trivial solution to the set of three homogeneous
linear equations: , , ,
a-LX+Wy + CiZ = 0,
агх + Ьгу + czz = 0,
a3x+b3y + c3z = 0.
Exercise 24(b)
1. Find the equations of the planes through the following sets of points:
(i) (1, -1,1), (2, 1,2), (3,1, -1);
(ii) (1, 1, 1),(0, -3, -2), (-1,3, -1);
(iii) (1,1,5),(1, -1, -3),(2, 1,1);
(iv) (4, 0, 1), (7, 3, 5), (-10, -4, -1);
(v) (a, b, с), (За, b, 0), (4a, 2b, c).
2. Find, in each part, the equation of the plane through the given point and
perpendicular to the line of intersection of the given planes:
(i) (1, -1, 3); 4x-y + 4z = 0, 12x-5y + 8z = 0;
(ii) (-5, -4, -1), 3x+4y + 3z = 15, 9x-2y-12z = 3;
(iii) (2, 3, 4), 5x+y + 2z = 7, x+y+z = -4;
(iv) (9, 3, 1), 3x+2y+2z = 0, 5x+2^+z = 4;
(ν) (1, 1, -1), 3x+2y-4z = 0, 4x+y + 3z = 0.
3. Find the length of the common perpendicular to the following pairs of skew
lines :
ω -з~ 2 ~ 2 ' ■>-_-;-*;
(ii) x-6 = y-3 = z-l,
x-4 = y-7 =
3 ~ 5
x+l _y+l _ ζ
2' 2 -1 2 "
4. Find the length of the common perpendicular to the two lines
x-4 у z+3 . x-6 y+1 z + 5
What deductions can you make from your answer?
5. Find the equation of the plane containing the points A (6, 3, 0), B(—2, 0, — 5)
which is parallel to the vector i + j + k.
6. Find the equation of the plane containing the points A(3, 5, 10), B(4, 3, —3)
which is parallel to the vector 3i + 5j - 6k.
7. Find the equation of the plane parallel to the two lines
f(x-6)=i0>-6) = |(z-l), $x = -{y-2) = z + A
which passes through the point (2, 0, — 1).
518

EXERCISE 24
8. Given the plane P: 2x+3y-z = -1 and the line L: %x = -y = -(z-5),
find·.
(i) the coordinates of the point of intersection of Ρ and L;
(ii) a vector parallel to Ρ and perpendicular to L;
(iii) a vector in the direction of the projection of I, in P;
(iv) the equation of the projection of I, in P.
9. Show that the equation of the plane containing the points А, В, С has
equation
(г-я).(Ь-я)Л(с-я) =
Write down, in determinant form, the equation of the plane through the points
А(аъ Ch, a3), В{ЬЪ Ьг, b^), С(съ с2, с3).
10. Find a vector perpendicular to both ca + bj + ck and li + nq + nk. Hence
write down the equation of the plane through the origin containing the line
x—a _ y—b _ z—c
Use this result to find the line through the origin which intersects each of the
ППе8 K*+D = y-2 = i(z + 2),
i(*+D= Ky-4) = (г-3),
giving your answer in standard form.
11. Find the vector equation of the line through the point С (position vector c)
which intersects both of the lines
r = a!+Abi, r = a2 + /tb2.
Discuss any special cases that might arise.
12. Find the volume of the tetrahedron formed by the planes
βγ+γζ = 0, γζ + αχ = 0, αχ + βγ = 0, αχ + βγ + γζ = δ.
13. A square piece of paper ABCD of side a is folded along the diagonal BD so
that the planes of the triangles ABD, CBD are perpendicular. Find the shortest
distance between the edges A B and CD.

25. Continuous probability distributions
1. DENSITY FUNCTIONS
In Chapters 7, 10 and 16 we discussed probability distributions in discrete
outcome spaces, that is, in spaces whose elementary events we could count
by putting them into one-to-one correspondence with the set N of natural
numbers. Associated with each elementary event 5* of the outcome space
S was a number x{, usually an integer—the value of our associated random
variable X.
Random experiments occur, however, in which it is mathematically
more convenient to take as outcome space a set which cannot be counted
and as random variable a subset X of the set R of real numbers. For
example, suppose we take a point Ρ at random on a line AB of length / m.
Since we can only measure lengths to within some given limits of accuracy,
say 0-5 mm, we could take as values of a (discrete) random variable the
number of intervals of length 0-5 mm Ρ is from A, giving 201 possible values
in all (0-200). Mathematically the situation may be simpler to analyse if
we take as values of our (continuous) random variable the supposed' exact'
length of AP, measured as a real number of mm; that is
X={xeR:0< χ «ξ 100}.
If we are dealing with a subset of R, calculus methods are available and it is,
for example, easier in general to integrate a function than it is to sum a
series.
Given a continuous random variable X, it will generally be impossible
to assign a probability to a specific value of X; rather we assign a
probability to each interval
Ε = {x eX; хг < χ < x2}
as follows. Let/be an integrable function such that
(i) f(x) Ζ 0, for all χ e X;
(ii) $xAx)dx= 1;
where the notation denotes that the integration is to be performed over
the set X. Then we define
Pr(£)=£V(*)d*.
520

1] DENSITY FUNCTIONS
The function/so defined is called a. probability density function. As in
the discrete case the choice of/ constitutes an assumption about our
random experiment; that is, to choose/is to choose a particular mathematical
model. Geometrically, probabilities are measured as areas under the curve
y=f{x).
Consider again the random experiment of marking a point on the line
AB. To analyse the situation we must first set up a mathematical model;
in other words, we seek to make the vague statement ' we mark a point Ρ
at random on AB' mathematically precise by stating explicitly what we
feel 'at random' could mean.
0 50 100
Fig. 25.1
We may regard AB as an interval on the χ axis, with A as origin and В
100 units to the right. Suppose we conjecture that a point is more likely to
be chosen towards the middle of AB than near the ends of the interval.
Taking, as is customary, the whole set R as the range of our random
variable Χ,-f a possible choice of/(x) would be
(0 if x < 0
kx if 0 < χ < 50
£(100-x) if 50 < χ < 100
0 if χ ^ 100,
where к is some number. Figure 25.1 shows the graph of у =f(x).
f(x) = 0if;c<0or.x;> 100 simply means that we are restricted to a
choice of Ρ within AB.f(x) = kx if 0 < χ < 50 and f(x) = £(100-x) if
50 < χ < 100 ensures that our probability distribution is symmetrical
(reflecting a feeling that we are as likely to take χ to be between, say, 35 and
45 as we are to take χ to be between 55 and 65) and that the probability of
selecting a point Ρ within an interval of some specified length becomes
progressively greater the nearer the interval is to the centre of AB (see
Figure 25.1 again).
t For consistency we shall take R to be the range of X throughout this chapter.
521

люо
Jo
CONTINUOUS PROBABILITY DISTRIBUTIONS [25
To find к in this case, we must have
JO
f(x)dx= 1
Го люо
kxdx+\ !c(100-x)dx= 1
^ik[x^0+k[l00x-ix2]^= l
^>i.£.502 + A:.100.50—1£.150.50= 1
2500'
люо
(Figure 25.1 is not drawn to scale!) Since f(x) dx represents the area of
AAPB, integration is a heavy tool to use here—we employ it to illustrate
the general method.
Now suppose we wish to find the probability that Ρ lies between 30 and
60 mm from A. We define the event Ε by
Ε = {χ e R: 30 < χ ^ 60}
and obtain
Pr (E) = Γ 2Ao xdx + Г 25ТЙ) (100-jc) dx
J30 J50
= Ц= 0-54 (see Figure 25.2).
(Again, integration is heavy in the circumstances.)
A different mathematical model, that is, a different choice for/, leads to
a different estimate of the probability of the event Ε as we shall now show.
We shall suppose a function/defined thus:
if χ < 0
f{x) = ]fce(100-je) if 0 < χ < 100
if χ > 100

И
DENSITY FUNCTIONS
(see Figure 25.3). This distribution is again symmetrical about χ = 50
and gives a greater probability to points near the centre. To find Pr (E)
we must first calculate k.
We have
"*" кх{Ш-х)ах = 1
(Again, Figure 25.3 is not drawn to scale!) Defining Ε as before we have:
3 ,mn ч j
/•60
Jso 500000л^"" л'ил
= 6xlO-6[5Qx2-l;c3]!0°
= Λψ5- = 0-432.
Ex. 1. Why would you expect the second answer for Pr (E) to be less than the
first?
Ex. 2. Write down the value of Pr (E) on the assumption that all points of the
interval are equally likely. What form does f(x) take in this case?
Ex. 3. lif(x) = kx, 0 < χ < 100 and f(x) = 0 otherwise, find & and Pr (E).
A mode of a random variable with given density function/is a value of
the random variable for which / has a local maximum. The median, m, is
the value of the random variable defined by the equation
i-J(x]

CONTINUOUS PROBABILITY DISTRIBUTIONS
Similarly, the lower and upper quartile scores L, Q, are defined by
£j(x) dx = i, jQJ(x) dx = I
Ex. 4. Interpret the median and quartiles in terms of probabilities.
2. DISTRIBUTION FUNCTIONS
The density function, /, of a continuous probability distribution bears the
same relation to probability as mass density does to mass:mass density
is the mass per unit volume, probability density is the probability per unit
interval. It must be remembered that f(x) itself is not a probability, but
δρ = f(x) δχ is.
We now introduce a function, closely related to the density function,
whose values do represent probabilities. Suppose the domain of our
random variable is R and that we are given a density function/: R-* R. Then
weknow (i)/(*)>o,
(ii) £j(x)dx=l.
Let Ε be the set of real numbers less than, or equal to x, that is, the set of
real numbers to the left of χ on the real axis:
Ε = {у e R: у < χ}.
Then Pr (E) = ^Jlx) dx
depends upon χ and we may define a function F: R-> R where
F(x) = jX_/(x)dx=PT(E).
F is called the distribution function for the probability distribution;
F(x) is the probability that, given a probability distribution with density
f(x), a point chosen at random will be less than x.
From the original definition we have (provided/is continuous)
dF
f.
dx~
Since/(л;) > 0 for all x, F(x) increases as χ increases. Also we have
lim _F(;c) = 0, lim _F(;c) = 1,
which gives the range of F as the set {x e R: 0 < χ < 1}.
524

2] DISTRIBUTION FUNCTIONS
Again, given an event Ε = {χ e R: x1 < χ < x2} we have
Pr(E) = £7(*)dx
= F(x2)-F(Xl).
Since the sets Ε and {x eR: χ < xj are mutually exclusive, this illustrates
the result that the probability of the union of mutually exclusive events is
the sum of the probabilities of the separate events.
*Ex. 5. If Еъ E2, ..., E„ are mutually exclusive events in a continuous outcome
space, prove that
Pr^ + Pr^ + .-. + Pr^J = Pr(E1uEtu...uEJ.
Example 1. The density function, f, is defined by
10 if χ < 0
kx(l-x) if 0 ^ x< 1
0 if χ > 1.
Find
(i) the value ofk;
(ii) the distribution function F;
(iii) the density and distribution functions, g and G, of the random
variable W where ,
w = x2.
Sketch the graphs off and F.
(i) Since ГV(*) ax = 1,
we have Г kx{\ -x) ax = 1
=> k= 6.
(ii) If χ < 0, F(x) = 0; if χ > 1, F(x) = 1.
If 0 ^ χ < 1,
F{x)= J 6jc(l-*)cb:
= [3x2-2x*]0*
= *2(3-2*).
Thus F is the function F: R -> {j e Д: 0 < у ζ 1} defined by
if χ < 0
F(x)= {хг(3-2л;) if 0 < χ «ξ 1
if je > 1.
525

CONTINUOUS PROBABILITY DISTRIBUTIONS [25
(iii) If w < 0, G(w) = 0; if w > 1, G(w) = 1.
IfO < w < 1,
G(w) = Pr {z e R: 0 < ζ < w}
= vi>(3 — 2.» (since л; is positive, x = V^)·
(0 if №<0
G(w) = J3w-2wt if 0 < w < 1
[l if w > 1.
i0 if w < 0
g(w)= 3(1-» if 0< w<l
(θ if w > 1.
The graphs of/, F are shown in Figures 25.4(i) and (ii).
(i) 0i)
Fig. 25.4
Notice in this question that, since Jx increases as χ increases, as we move
from left to right along the χ axis so we move from left to right along the
w axis. More generally, if a transformation у = h(x) of a random variable
is made, the method of Example 1 may be used directly if у either increases
or decreases steadily with χ but, if this is not the case, the domain of/
should be split into those parts for which it is an increasing function and
those for which it is decreasing. (See Example 4.)
The value of transforming from one random variable to another is
that a complicated form for the density function may be changed into a
simpler (or more familiar) density function.
F is sometimes referred to as the cumulative distribution function, to
emphasize the property it has of increasing from left to right. The graph
526
Thus
Also,
giving

2] DISTRIBUTION FUNCTIONS
of F, which usually has a shape very similar to that shown in Figure 25.4,
is called a cumulative probability curve (or, occasionally, an ogive curve).
Ex. 6. Explain how the cumulative probability curve may be used to find the
median and upper and lower quartiles.
In the case of discrete distributions, if we had a random variable X
which could take all integral values r in the range 1 ^ r ^ n, where
PT(X=r)=Pn
we defined the expectation of X by
*(X) = Д rpr.
For continuous distributions, integration takes the place of summation
and we define the expectation of the random variable X, whose density
function is fi by
S{X)=Llxf(x)dx·
${X) is usually referred to as the mean of X, and is denoted by μ.
Example 2. Find the mean of the random variable X whose density function,
f L· defined by
(0 if χ < 0
f{x) = -LjcO-jc2) // 0< *< 1
10 if χ > 1.
We have
μ = *(Χ)= J*^xf{x)dx
= Γ 4x2(l -x2) dx (since Дх) = О outside the interval
0 ^ χ < 1),
More generally, if we have a function g: R-* R, the expectation of the
function g of the random variable X is defined by
<?[g(x)]= £\(х)Лх)<1
Particularly important is the case where
g(x) = (χ-μΥ.

CONTINUOUS PROBABILITY DISTRIBUTIONS [25
The variance of X is the expectation of the function g so defined:
σ* = £[{χ-μγ\ = Ι*\χ-μγΑχ)άχ.1
Since
/_+Γ (*-Ό'Λ*) <** = J"_+J (*-2μχ + μ*)Λχ) α*
= £J **/(*) d*-2/i £^*/W dx+μ2 J^/OO &c
= £" Χ*/[χ)άχ-2μ.μ+μ*Λ,
we have σ2+μ2= x2f(x)dx,
a result which may be compared with that on page 178.
*Ex. 7. Show that the variance of the random variable in Example 2 is given by
о-2+Йт = ΓΑχ\\-χ2)άχ.
Deduce that σ2 = J^·.
tion ah
g(x) = \χ-μ\.
*Ex. 8. The mean deviation about the mean, η, is defined as the expectation of
the function g where
Find η if
[0, x<
10, xi
Ax) = \i, -1 < χ < ι
► 1.
Exercise 25(a)
1. /(x) = &л-2 if χ > 1, Длг) = 0 if χ < 1; find the value of к and hence find
Pr (x < 2).
2. f(x) = ке~2х if χ > 0, /O) = 0 if χ < 0; find fc and hence find
Pr (1 < χ < 2).
3. If Дх) = к sin πχ if 0 < χ < 1 and is zero otherwise, find к and hence find
Pr (x > i).
4. Find the distribution function for each of the density functions in Questions
t The use of σ2 to denote variance is sometimes insufficiently explicit. If it is necessary
to refer to the random variable, X, under consideration, the associated variance may be
written σ| or, more commonly, "f{X).

Find the mean, median, mode and
function defined as in Questions 5-8.
DISTRIBUTION FUNCTIONS
for the distribution with the density
r
χ) if
if
0 if
(6x+l)/4 if
0 if
0
\π cos πχ
0
0 < χ < 2
x>2.
χ < 0
0< лг« 1
χ > 1.
if *< -i
if -*<x
if x>i.
<i
/W=W- if x>a
9. If the density function, /, of a distribution is given by
ix
i
1(3-*)
0
if 0<
if 1 <
if 2<
if χ >
find the distribution function, F, and sketch the graphs of/and F.
Find the density and distribution functions, g and G, of the new random
variable Y, where у = л:2, and sketch the graphs of g and G.
10. The density function, /, of a probability distribution is given by
if 0 < χ <. 1
: if 1 < χ < 2
if x>2.
Find the density function, g, of the new random variable, Y, where у = *Jx
Их > 0,y = Oifx <0 and sketch the graphs of/and g.
Find the mean and variance of X and the mean of Y.
11. A probability density function of a random variable Xh defined as follows:
Cx(x~l)(x-2) for 0<л:<1
f(x) = \ λ for 1 < χ < 3
lo otherwise,
where Λ is a suitable constant. Calculate the expectation μ of x. What is the
probability that χ is less than or equal to /t? (M.E.I.)
529

CONTINUOUS PROBABILITY DISTRIBUTIONS [25
12. A continuous probability distribution has density function/, where
fix) = a+bx+cx2 for 0 < χ < 1, and f(x) = 0 outside this range.
The mean is 2/3 and the variance is 4/45. Find the values of a, b, and c.
13. The probability density function pit) of the length of life, t hours, of a certain
component is given by , ., ,n
pit) = kerkt (0 < t < oo),
where & is a positive constant. Show that the mean and standard deviation of this
distribution are each equal to l/k.
Find the probability that the life of a component will be at least t0 hours.
Given that a particular component is already tt hours old and has not failed,
show that the probability that it will last at least a further t0 hours is е-*'».
An apparatus contains three components of this type and the failure of one
may be assumed independent of the failure of the others. Find the probability
that:
(i) none will have failed at t0 hours;
(ii) exactly one will fail in the first t0 hours, another in the next t0 hours and a
third after more than 2i0 hours. (M.E.I.)
14. A random variable X has the distribution function
Fix) =0 (χ < 0),
F{x) = kx3 (0 < χ < 2),
F{x) =1 ix > 2),
where & is a constant. Find the mean, median and variance of Χ. (Μ.Ε.Ι.)
15. A random variable X has probability density given by
fix) = A/il + x*) if 0< x< 1,
and fix) = 0 otherwise.
Find A the mean and the variance of the distribution.
Find also the median and the 90th percentile.
(The 90th percentile is the value of Xior which Fix) = i%.) (O & С adapted)
3. SOME IMPORTANT CONTINUOUS
DISTRIBUTIONS
(i) The uniform (or rectangular) distribution
If a continuous random variable can assume all real values between л; = а
and χ = b and if the density function, /, is independent of position within
this interval, then the random variable is said to possess a uniform dis-
530

3] CONTINUOUS DISTRIBUTIONS
tribution. / and F are thus given, for the uniform distribution over the
interval a ^ χ ^ b, by
0 if χ < a
τ-ί- if a ^ χ υ b
b — a
0 if χ > b;
0 if χ < a
x~a -с ^ г.
-, if a 4: x < b
b — a
1 if χ > b.
*Ex. 9. Sketch the graphs of/, F as defined above.
*Ex. 10. Prove that the mean and variance of the uniform distribution defined
over the interval a < χ < b are given by
μ = Ца+Ь), σ* = &а-ЬУ.
The continuous uniform distribution arises as a direct extension of the
discrete uniform distribution. As in the discrete case, the assumption of a
uniform distribution is often not made explicit in the formulation of
problems. This is a weakness in the statement of the problem and the reader
must be on his guard. For example, such a statement as 'a point is chosen
at random on the line AB' is likely to imply, in the eyes of an examiner,
that a uniform distribution is the required mathematical model, although
there may be little physical justification for this. The dangers of a loose
enunciation of a problem are illustrated in the following very well-known
example, due to J. Bertrand.
Example 3. A chord of a circle is drawn at random. Find the probability
that its length will exceed that of the side of the inscribed equilateral triangle:
(i) if one end, A, of the chord is regarded as fixed and the other end, P,
obeys a uniform distribution on the circumference of the circle;
(ii) if the direction of the chord is regarded as fixed and the perpendicular
distance, x, of the chord from the centre obeys a uniform distribution over
the interval 0 ^ χ ^ a, where a is the radius of the circle.
(i) Let ABC be an equilateral triangle inscribed in the circle (Figure
25.5). The chord AP will exceed the length AB if Ρ lies on the arc ВС and
the probability of this occurring, on the assumption that Ρ is distributed
uniformly on the circumference of the circle, is j.
(ii) Let AB, CD be the two chords in the given direction which are
distant \a from the centre of the circle (and thus are of length а^Ъ, the side
of the inscribed equilateral triangle). If x is the distance of a random chord
531
Ax) =

CONTINUOUS PROBABILITY DISTRIBUTIONS [25
from О in the given direction, and if χ is uniformly distributed in the
interval — a =ξ χ =ξ a, the chord will exceed а^Ъ in length (e.g. PQ, in Figure
25.6) if — £a =ξ χ =ξ \α giving a probability of \.
Ex. 11. If, in Example 3, the mid-point of the chord is taken to have a uniform
distribution over the interior of the circle, prove that the probability that the
chord will exceed the length of the side of the inscribed equilateral triangle is \.
Fig. 25.5 Fig. 25.6
Example 4. A point Ρ is taken on a line AB, of length la, the distribution
being uniform. A rectangle is then drawn with adjacent sides of lengths AP,
PB. Find:
(i) the expected value of the area of this rectangle; (ii) the distribution
function, G,for the random variable, Y, whose value represents this area.
Let AP = χ and the area of the corresponding rectangle = y. Then
у = x(2a-x).
Now, if/is the density function of X, the random variable whose value for
given Ρ is the length AP,
10 if χ < 0
l/2a if 0 ^ χ ^ 2a
0 if χ > la,
and we have
C2a 1
£(Υ) = γ x(2a - χ) άχ, by the definition of expectation,
As χ varies from 0 to а, у varies from 0 to its maximum value, a2, and,
532

3] CONTINUOUS DISTRIBUTIONS
as χ varies from a to 2a, у varies from a2 to 0. If F is the distribution
function of X,
(0 if x<0
F(x) = \x/2a if 0 ^ χ ζ la
(l if χ > 2a
and, by the symmetry of the curve у = x(2a—x) about the line χ = a
(see Figure 25.7), we have, for any point (x, y) on the curve,
Pr (z e R: ζ < у) = Pr (ζ e R: ζ < x) + Pr (z e Д: ζ > 2a-χ)
= F(x)+[l-F(2a-x)]
χ 2a—χ
2a
2a
But, since
we have
giving
Thus we have
у = 2ax—x2,
χ = a-J{a2-y) (x < a),
G{y) = Pi(zeR:z < y)
_a-J(a2-y)
a
(0 if у < 0
1-7(1-$) if 0^<Я
1 if у > а2.
The graphs of F and G are shown in Figure 25.8.

CONTINUOUS PROBABILITY DISTRIBUTIONS
Example 5. Χ, Υ are both uniformly distributed between 0 and 1. Values
x, у are chosen independently at random and a new random variable, Z,
is formed where
z = x+y.
Show that Ζ has a triangular distribution {that is, a distribution with density
function whose graph is triangular).
Like most problems concerned with independent uniformly distributed
random variables, it is best to consider this question graphically. Since
Χ, Υ are distributed independently, the choice of two numbers, χ
and y, may be represented by plotting the point R(x, y), all points
of the square О ABC (see Figure 25.9) being equally likely. If ζ = x+y,
A\ .β
m ρ с *
Fig. 25.9
for given ζ all possible points R lie on the line segment PQ, where Ρ is the
point (z, 0). For 0 < ζ < 1, PQ ос ζ; for 1 < ζ < 2, PQ cc (2-z) and
thus the density function, /, of Z, whose value for given ζ is proportional
to z, is triangular. In fact
Ό
if
if
if
if
ζ < 0
0 ^ ζ < 1
1 < ζ < 2
ζ > 2.

3] CONTINUOUS DISTRIBUTIONS
Ex. 12. Find the distribution function, F, for the random variable Ζ of Example 5
and sketch its graph.
Ex. 13. If, with the notation of Example 5, the random variable И^ is defined by
w = x—y, describe the distribution of W.
(ii) The normal distribution
Suppose that shots are fired at the centre, O, of a large circular target. Set
up coordinate axes as shown in Figure 25.10 and let P(x, y) be the point
at which one of the shots hits the target. If we suppose that shots aimed at
the centre are liable to errors, equally likely to occur above or below Ox
and to the left or right of Oy, we may reasonably assume that the
probability of x lying between χ and x + Sx, the 'x error', depends upon the
Fig. 25.10
numerical value of x; that is, the probability is of the form φ(χ2) δχ. On the
assumption of the symmetry of the errors made above, the probability of
the 'y error' is ф(у2) Sy and the probability of lying in a small area
δ A = SxSy
around Ρ is ф(х2) ф(у2) δ A, on assuming that the errors occur independently.
But on these assumptions the probability may also be written
Ф(х'2) Ф(У'2) δ A, (approx.)
where new axes Ox', Oy' are taken with Ox' passing through P, as shown
in Figure 25.10. Thus, since x'2 = x2+y2 and y' = 0 at the point P,
ф(х2) ф(у*> = ф(х'2) ф(у'2) = ф(х2+у2) ф(0)
and φ is a function with values satisfying the equation
ф(х2)ф(у2) = кф(х2+у2).
6-2 535

CONTINUOUS PROBABILITY DISTRIBUTIONS [25
A solution of this equation is φ(χ2) = ks~Xx2 where A is taken as a
positive constant, since it is reasonable to assume that the probability decreases
the further we move from O.
The argument above (due to Thompson and Tait) suggests that the
function Дл;) = ks~Xxl could form a suitable model for the probability density
of errors occurring in observations. This indeed turns out to be the case
and we shall therefore make the following definition.
The random variable X is said to have a normal distribution if its density
function, ф(х), is of the form
The letter φ is customarily used in place of/to denote the density function
of the normal distribution; the corresponding distribution function is
The factor (2π)-% appearing in the forms for φ and Φ ensures that the
requirement r+00
)_βαφ(χ)άχ=1
is met—that is, that φ(χ) is a genuine density function. This follows from
the well-known integral
j_ooe-^d^ = V(2^)>
a result whose proof will be found in any sufficiently advanced book on
calculus. (The reader should not be too discouraged by the remark,
attributed to Sir William Thompson, about this integral: 'No-one can call
himself a mathematician to whom this result is not obvious!')
The shape of the curve of the normal density function is shown in Figure
25.11. As is to be expected, the curve is symmetrical about χ = 0, which
clearly gives the mean of the distribution:
<?(X) = 0.
Furthermore, the variance of a normally distributed random variable
X is l.for
= ,,- , е-**2 dx, on integrating by parts,
= 1, on quoting the integral result above.

3] CONTINUOUS DISTRIBUTIONS
Statistical tables (e.g. The Cambridge Elementary Statistical Tables by
Lindley & Miller) give values of Ф(л;) for values of χ from 0 to about 4.
(Since Φ(4) « 0-99997 values of Ф(л;) for χ > 4 are very rarely required.)
By the symmetry of the curve
about the line χ = 0, values of Φ(χ) for χ < 0 may be deduced immediately.
Fig. 25.11
The normal distribution is frequently used as a probability model.
Suppose that the random variable X has mean μ and standard deviation
σ; then the new random variable Υ defined by
has mean 0 and standard deviation 1. If Υ is normally distributed it is
customary to say that ' X is normally distributed about the mean μ with
standard deviation σ\ sometimes written as ' X is distributed Ν(μ, σ)', and
that' X has been standardized to the normal random variable Υ by the given
substitution'. For emphasis, Υ is often referred to as ' the standard normal
distribution JV(0, 1)'.
Example 6. If X is normally distributed N(0, I) find
(i) Pr(* < 1-4); (ii) Pr(0-8 < χ < 1-4);
(iii) Pr (* < -1-4); (iv) Pr (-1 ·4 < χ < 0-8).
// Υ is normally distributed N(2, 0-75), find
(v) Pr(j < 0); (vi) Pr (1 < у < 3).
537

CONTINUOUS PROBABILITY DISTRIBUTIONS
From statistical tables,
(i) Pr(* < 1-4) = Φ(1·4)
« 0-9192
(Figure 25.12).
(ii) Pr(0-8 < χ < 1-4) = Φ(1 ·4) - Φ(0·8)
и 0-9192-0-7881
= 0-1311
(Figure 25.13).
Fig. 25.14 Fig. 25.15
(iii) Pr (x < -1-4) = Pr (x > 1-4) by symmetry,
= 1-Φ(1·4)
« 1-0-9192
(Figure 25.14).
(iv) Pr (-1-4 < χ < 0-8) = Pr (* < 0-8)-Pr (* < -1-4)
= Φ(0·8)-[1-0(1-4)]
«0-7881-0-0808
~. oc, „ = 0-7073
(Figure 25.15).
538

3] CONTINUOUS DISTRIBUTIONS
(ν) The random variable Z, where
z=^
0-75
has standardized normal distribution. Thus
Pr(y< 0) = Pr(z < -2-67)
= 1-Φ(2·67)
« 1-0-9962
= 0-0038.
(vi) Again,
Pr(l < у < 3) = Pr(-l-33 < ζ < 1-33)
= 2Φ(1·33)-1
« 1-8164-1
= 0-8164.
Example 7. Г/ге heights of a large number ofschoolchildrenare measuredcorrect
to the nearest centimetre and the mean and standard deviation of the resulting
frequency distribution are calculated and found to have the values 122 cm
and 5-2 cm respectively. As a model of the situation it is assumed that the
heights, x, of the children are distributed normally about a mean μ = 122
with standard deviation σ = 5-2. Calculate the probabilities for each
of the class intervals χ < 105, 105 < χ ^ 110, 110 < χ ^ 115, ...,
130 < χ < 135, χ > 135.
Since the heights are measured to the nearest centimetre, the upper
limits for χ for the class-intervals above are successively
105-5, 110-5, 115-5, .... 135-5
(there is no upper limit, of course, for the last class).
We now standardize our variable, x, by the transformation
'- 5-2 '
so that у has the standard normal distribution (/* = 0, σ =1) and using
statistical tables, we find Ф(у) for the upper limit of each class-interval.
Finally, the probability associated with each class interval may be
calculated , = Ф^-Ф^).
539

CONTINUOUS PROBABILITY DISTRIBUTIONS
The working is set out in tabular form below:
Class χ у (у)
130-135
125-130
120-125
115-120
110-115
105-110
< 105
135-5
1305
125-5
120-5
115 5
110-5
105-5
10000
0-9953
0-9495
0-7517
0-3859
0-1038
0-0132
0-0007
00047
0-0458
0-1978
0-3658
0-2821
00906
0-0125
00007
1-0000
It should be noticed that a continuous distribution has been used here
as a mathematical model for a discrete situation (heights measured to the
nearest centimetre). Again, it might be objected that, whereas heights could
reasonably be expected to fall, say, within the interval 90-180 cm, the
normal distribution is defined for all real values of x; however, as has been
pointed out earlier, for the standard normal distribution,
Pr (\y\ > 4) * 0-00006
and the two tails of the distribution, beyond у = + 4, may be neglected.
As well as appearing as a distribution in its own right, the normal
distribution is often used as an approximation to the binomial distribution
B(n, p) for large n, provided neither ρ nor 1 — ρ is too near zero. In Figure
25.16, the probabilities of obtaining the scores 0,1,2, ...,20 in the

3]
CONTINUOUS DISTRIBUTIONS
binomial distribution 5(20, \) are proportional to the lengths of the
corresponding vertical lines. The outline shape is strongly reminiscent of the
normal curve. Even in a skew case, where ρ Φ \, the outline is still
approximately normal: see Figure 25.17, in which the length of the vertical lines are
proportional to the probabilities of obtaining the scores 0, 1, 2,..., 20 in
the binomial distribution 5(20, f).
In fact, it may be shown that, given a binomial distribution B(n,p)
where η is large and neither ρ nor 1 — ρ is too near zero, then the
probability of obtaining a score r, that is,
lftpV-РУ"
is approximately equal to σ_1 times the ordinate of the standard normal
curve at the point
where μ, σ are the mean and standard deviation of the corresponding
binomial distribution
μ = Пр, σ = *]{npq).
Furthermore (and more importantly for applications) we can relate
areas under the normal curve to sums of successive terms of the given
541

CONTINUOUS PROBABILITY DISTRIBUTIONS [25
binomial distribution; in fact, the probability that the variable takes a
value in the interval
/■i ^ r ζ r2
is approximately
■т-»ж
where, as before, μ = ηρ, σ = J(npq). (The limits rb r2 are changed to
>Ί— i »2+i respectively in order to avoid complications arising out of
approximating to a discrete distribution by a continuous distribution;
for example, we wish to associate a non-zero probability with a single value,
r, of the variable.)
The reader is referred to one of the standard texts on probability for a
proof of the results outlined above. As a working rule, the approximations
give reasonable results if the lesser of the two numbers, the mean number of
successes and the mean number of failures is greater than about five.
Example 8. A coin is spun 250 times and turns up tails 139 times. Does this
provide any evidence of bias?
Take as the null hypothesis
We have to assess the probability of getting a result as bad as, or worse
than, 139 tails on the assumption that Я is true; that is, we have to deter-
mme Pr (139 or more tails |Я) + Рг (139 or more heads \H).
The calculation of this from the binomial distribution 5(250, £) would
be prohibitively laborious; we therefore use the normal approximation
to the binomial distribution as our model where
/* = 250х£= 125,
σ2 = 125xi= 62-5,
giving σ = 7-906.
We seek 2 Pr (je > 139),
where χ is distributed 5(250, £), and this is approximately
'['-•Н1Л]
*2[1-Φ(1-71)]
« 0-087.
There is thus insufficient evidence of bias at the 5 % level.
542

3] CONTINUOUS DISTRIBUTIONS
Note. Tables of the normal distribution show us that
Pr (b| S; 1-96) «0-05
and thus a normal variable greater than 1 -96 indicates significance at the
5% level. In Figure 25.18 the two shaded areas each contain 2-5% of the
total area under the curve.
One further application of the normal distribution should be mentioned:
if хъ x2, ...,xn are independent observations of any random variable X
and a new random variable Ζ is defined by
ζ = (x1+x2+... + xn)ln
then, if the mean and variance of Zare respectively μ and σ2, the distribution
of Ζ will be approximately normal, with mean μ and variance σ2/η. (The
Central Limit Theorem.) The proof of this result is beyond the scope of this
book.
(iii) The exponential distribution
The exponential distribution is defined by the density function
(0 if χ < 0,
/(Нле- if x>0,
where λ is a positive constant.
*Ex. 14. Show that, for the exponential distribution,
SiX) = l/λ, -Г(Х) = 1/λ2.
The result of Ex. 14 suggests a resemblance between the exponential
and Poisson distributions. We shall exhibit the relation between the two
distributions in the following problem. (See Exs. 16-18.)
543

CONTINUOUS PROBABILITY DISTRIBUTIONS [25
Consider the calls received at a telephone exchange, the rth call occurring
at time tT, where the time, t, is measured from t = 0. We shall assume that
the number of calls constitutes a purely random process; that is, the number
of calls received in any interval (t, t') is independent of anything that has
occurred previously. Furthermore, we shall assume that the purely random
process is a stationary process: that is, the number of calls received in the
interval (t, t') depends only upon its length, not on its position.
Ex. 15. Discuss the validity of the assumptions made above in the light of what
you imagine would be a typical exchange.
Suppose now that
Pr {no calls are received in the interval (0, t)) = pJit),
and let us assume thatp0 is a differentiable function. Since, by the definition
of a purely random process, the probability of receiving no calls in the
interval (t, t+St) is independent of receiving no calls in the interval (0, t),
p0(t+St)=p0(t)p0(St).
Put t = St = 0: ,m r .....
and thus, since p^O) Φ 0, we must have^0(0) = 1. Again
p<lt+St) = p<lt)pa(St)
=> Po(t+$t)-Po(t)=Po(t)[p0(St)-l]
=> p0{i + St) -pJit) = ρ Jit) [p0(St)-p0(0)]
p,(t + St)-PJjt) _ p0(St)-p0(0)
Έ ~p<kt) st ·
pO{t)=pS)p'SH)
=> p0(t) = At-*, (1)
where λ is a constant (-p'0(0)) which must be positive since p0{t) < 1 and
A = 1 since;? o(0) = 1.
Result (1) now enables us to prove that if ΧΊ& the random variable whose
value is the length of the time interval up to the first call, then X has an
exponential distribution. If F is the distribution for X then, for χ ^ 0,
F(x) = Pr (length of the time interval up to the first call ^ x)
= Pr {at least one call has been made by the time χ has elapsed)
= 1-е-л*
and thus f{x) = Ae~Ax for χ > 0.
Since/(л;) = 0 for χ < 0 it follows that X has an exponential distribution.
LeUi^O:

3] CONTINUOUS DISTRIBUTIONS
Ex. 16. If pn(t) is the probability that there are η telephone calls in the time
interval (0, t) and if St is sufficiently small for there to be a negligible probability
of more than one call in the interval (i, t+St) show that
pn(t+St) = (1-λ&)Α,(0 + λΡ»-ι(0&
and deduce that —- = λ(ρη^—pn).
Ex. 17. Prove by induction that the probability, pn(t), defined in Ex. 16, is
8"еПЬу м e-^tr
Ex. 18. Show that the distribution of the number, n, of calls received during a
fixed time interval is Poisson and account for this in terms of the Poisson
distribution being a limiting form of the binomial distribution.
Exercise 25(b)
1. The random variable, X, has uniform distribution in the interval 0 < χ < 10.
Find Pr (X = x, where хг - 5x + 4 > 0).
2. The line AB has length 10 cm. A point Ρ is taken at random on AB, all points
being equally likely; what is the probability that the area of the circle of radius
AP will exceed 10 cm2?
3. The line AB has length 10 cm. An interval of length 2 cm is marked at
random on the line, the positions of the interval being uniformly distributed. What
is the probability that it will contain the mid-point of ΑΒΊ
4. A circular disc of radius 10 cm is placed on a table. Another disc, of radius
3 cm, is now placed on the table so that it is at least partially in contact with the
first disc. On the assumption that the permissible positions of the smaller disc
are uniformly distributed, what is the probability that it covers the centre of the
larger disc?
5. A point Ρ is taken at random on the side AB (and between A and B) of the
square ABCD, the positions of Ρ being uniformly distributed. If PC cuts BD at
X, find the probability (i) that BX < \BD; (ii) that BX < \BD.
6. Figure 25.19 shows a square wooden frame
ABCD with a square hole A'B'C'D' cut
symmetrically in it. AB = 50 cm, A'B' = 30 cm.
A ball of diameter 5 cm is dropped on to the
frame. Assuming that its centre falls within
the square ABCD and that it is equally likely
to meet the plane of the frame at any point
within ABCD, what is the probability that the
ball will pass straight through the hole without
touching the frame ?
7. A point A is marked on the circumference of
a circle of radius r and a chord AP is drawn at Fig. 25.19
random, the positions of the point Ρ on the
circumference being uniformly distributed. Find the expected length of the chord.
545

CONTINUOUS PROBABILITY DISTRIBUTIONS [25
8. Determine the variance of the length AP of Question 7.
9. A point Ρ is marked on the side AB of the square ABCD, the points within
AB being uniformly distributed. Find the mean and variance of the area of the
triangle APD.
10. ABC is an isosceles triangle right-angled at B. Through A a line is drawn
at random to cut ВС at P, the angle BAP being uniformly distributed between 0
and \π. If AB = a, show that the expected area of the triangle ABP is (In 2) α2/π
and find its variance.
11. The intelligence quotients of 500 schoolchildren are assumed to be normally
distributed with mean 105 and standard deviation 12. How many children may
be expected:
(i) to have an intelligence quotient greater than 140;
(ii) to have an intelligence quotient less than 90;
(iii) to have an intelligence quotient between 100 and 110?
12. If an unbiased die is thrown 600 times, what is the probability of throwing
a six less than 90 times ?
13. X is normally distributed with mean 2 and standard deviation \. Find
numbers p, q such that Pr (x > p) = 0-2 and Pr (j? < χ < q) = 0-1.
14. A multiple choice test has 100 questions, each question having written
beside it five answers, only one of which is correct. If the pass mark is 30%,
what is the probability that a student who makes a completely random guess at
each answer will pass ?
15. Rods are manufactured with a mean length of 20-2 cm and standard
deviation 0-09 cm, the distribution being normal. If rods of length less than 20-1 cm
are rejected, what is the probability that a rod that is retained has a length in
excess of 20-3 cm?
16. Experience has shown that when a certain machine is functioning
satisfactorily it produces capacitors with capacitances which are distributed normally
with standard deviation 0-080 /tF. Under these circumstances find the mean
capacitance if 99% of the output has a capacitance of at least 2 /tF. Find also
what proportion of the output will then have a capacitance between 2-10 and
2-30 /tF.
Tests on a large batch of capacitors just produced reveal that their mean
capacitance is 2-20 /tF and that 10 % of them have capacitance below 2 /tF.
What do you deduce about the variability of the capacitance of the capacitors
in this batch? (M.E.I.)
17. The average proportion ρ of insects killed by administration of χ units of
insecticide is given by Пх-ма
p= (2 л)-1/2 е-'2'2 di,
where μ and σ are constants.
Whenx= \0,p= 0-400 and when χ = \5,p = 0-900. What dose will be lethal
to 50% of the insect population, on average?
546

EXERCISE 25
If a dose of 17-5 units is administered to each of 100 insects, how many will
be expected to die? What is the probability that just two insects will survive?
(M.E.I.)
18. Packets are advertised as containing 500 g of sugar. Tests carried out show
that 6-7 % of such packets contain a mass greater than 508 g, while 0-6 % have
a mass less than 492 g. Estimate the average mass of the contents of a packet
on the assumption that a normal distribution constitutes an acceptable
mathematical model.
19. A firm advertises that their runner bean seeds give a 95 % germination rate.
Of 200 such seeds, 14 fail to germinate. Have you cause for complaint ?
20. A man claims that he can forecast rainy or dry weather 48 hours ahead.
Careful records are kept over 80 days and his forecast is found to be correct
47 times. Is his claim justified, or is he merely lucky ?
21. A 'chance of failure' distribution is given for time t by the probability
density function , . ,., . ,, .„
pit) = (1/a) e-"a (0 < / < со).
Show that a is the mean time of failure and that the variance is a2.
Two components in a machine have failure time distributions corresponding to
means a and 2a respectively. The machine will stop if either component fails and
the failures of the two components are independent. Show that the chance of the
machine continuing to operate for a time a from the start is e~3'2. (O & Q
Miscellaneous Exercise 25
1. AB is a rod of length 2a. A point Ρ is taken on AB, the points being uniformly
distributed, and the stick is broken at P. Find the expected value of AP2+PB2.
The two parts of the rod are placed on a table in such a way that A and В
are Ол/3 apart. What is the probability that the triangle APB will be obtuse?
2. The mean deviation about μ for a distribution with density function / is
defined by the equation . m
V = J_ \χ-μ\/(χ)άχ.
Prove that the mean deviation of a random variable distributed normally about
μ with standard deviation σ is approximately |σ.
Rods are manufactured with a mean length of 18-4 cm and standard deviation
0-25 cm. If only those rods with a length greater than 18-4 cm are retained, what
is their average length on the assumption that the rods were originally normally
distributed about 18-4 cm with standard deviation 0-25 cm?
3. If Χ, Υ are independently and uniformly distributed between 0 and 2, find
Pr (xy < 1).
4. The side of a square is uniformly distributed between 0 and 1. Find the
density and distribution functions for the area of the square and sketch their graphs.
Determine the expected value of the area and its variance.

CONTINUOUS PROBABILITY DISTRIBUTIONS [25
5. The probability density function for the life л: of a motor car tyre is given by
Ax) = Ле~л*
for χ =s 0, where λ = 0-04 and χ is measured in units of 1500 km. What is the
probability that a single tyre will last more than 30 units?
A car has four tyres in use and they are all of the same age. What is the
probability that all of them will need replacing before 30 000 km? (M.E.I.)
6. A random variable X has the probability density function
nexp{-nxl^3)}
ПХ} ^3{1+ехр[-™/(^3)]}2
for all real x. Show that the distribution is symmetrical about χ = 0. Determine
the cumulative distribution F(x) = Pr {X < x].
You are given that the variance of X\s σ2. Compare the value of F(cr), Щ&),
Ρ(3σ) with the corresponding values for the normal distribution with zero mean
and variance σ2. (Μ.Ε.Ι.)
7. A random variable Xis distributed normally with expectation μ and standard
deviation σ. Find the mode and median of the distribution.
What is the proportion of the population lying between the points of inflexion
of the curve of the probability density function? (M.E.I.)
8. The diameters of some machined components are distributed normally with
mean 5-00 cm and standard deviation 0-05 cm. Find the expected proportion
of the components which will be outside the range 4-925 to 5-075 cm and the ratio
of the expected proportion in the range 5-025 < d < 5-050 to the expected
proportion in the range 5-050 < d «Ξ 5-075.
It is desired to adjust the mean of the process so that there are, on average,
twice as many components in the range 5-025-5-050 cm as in the range 5-050-
5-075 cm. Show that this can be done if the mean is adjusted to a value between
5-00 cm and 4-95 cm, and find the value by trial to the nearest 0-01 cm.
(M.E.I.)
9. The probability density function of a distribution is given by
Ях) = Ту—ттт (x> 0,λ an integer > 0).
(Λ-1)!
Find the expectation and variance of X. И^(Х) = μ, find ${(X— μ)3}.
Sketch fix) when λ = 2. (M.E.I.)
10. A grocer sells bread and can buy batches of 120 loaves. The number of daily
customers for bread is distributed normally with mean 100 and variance 100.
The net profit on the sale of a loaf is 2-5p and the net loss on an unsold loaf is
3-5p. What is the average daily net profit to the grocer? (M.E.I.)
11. The random variable X has normal distribution. If a new random variable
Υ is denned by у = x2
show that the distribution function, F, for Υ is given by
548

MISCELLANEOUS EXERCISE 25
Deduce that Υ has α χ2 distribution with one degree of freedom, defined by the
density function /, where
12. Two chords are drawn independently at random in a circle. What is the
probability that they intersect?
13. Two men, A and B, are allowed a lunch-break of an hour. A is at liberty
to leave the office at any time between 12.00 and 12.45, while В may leave the
office between 12.45 and 2.00. What is the probability that they will both be out
together, on the assumption that the permissible starting times for both A and
В are uniformly distributed?
14. A needle is pivoted at the point (0, a) and is rotated. When it comes to rest,
the point Ρ at which its axis (produced if necessary) cuts the x axis is marked.
The random variable X is defined to have as its value the x coordinate of P.
On the assumption that the angle that the needle makes with the у axis is
distributed uniformly between ± \π, prove that X has a Cauchy distribution,
defined by the density function
л*)=й^Ь) <-»<*<«»·
Discuss the existence of S(X) and Ψ~(Χ) for this distribution.
Sketch the graph of the density function for a = 1.
15. The Laplace distribution has probability density function
fix) = Се-Л'*' (-oo < χ < сю)
where С is a constant and \x\ denotes the magnitude of x. Find Pr {|χ| < λ}
and the variance of X. (M.E.I.)
16. A famous early example of the 'Monte Carlo' method for solving problems
by random numbers was Buffon's determination of π. Small rods of length Lt
were dropped at random on to a sheet of paper ruled with thin parallel lines a
distance L2 (> LJ apart. A' success' is recorded when any part of the rod touches
a line. The ratio of 'successes' to the total number of tosses is recorded over
several thousand tosses. How may π be estimated from this ratio? (C.S.)
17. Let X be a random variable uniformly (rectangularly) distributed over the
interval 0 < χ < 1. Derive the probability density function of the following
random variables: (a) Υ = X2-l,(b)Z= sin πΧ. Find the mean and standard
deviation of Υ and Z. (C.S.)
18. The chance that the customer at the head of a queue completes service in
any interval of length St is μδί, and the chance that a new customer arrives in any
interval is ASt, arrivals and departures are independent and λ < μ. The chance
that at time t there are η customers in the queue (including the one being served)
is denoted by pn(t). Show that
pn(t) = λ8ίΡη_1(ί-δί) + (1-λδί-μδί)ρη(ί-δί) + μδίρη+1(ί-δί) (η > 1)
and obtain the corresponding equation for p0(t). (over)
549

CONTINUOUS PROBABILITY DISTRIBUTIONS [25
By considering the forms that these equations take under the steady-state
condition pk(t) = pk(t— St), where к is zero or any positive integer, or by any other
means, obtain pn and the mean queue size (in terms if λ and μ) in the steady
state. (C.S.)
19. X is a continuous random variable with mean μ and variance σ2. λ is a
positive constant. Prove Chebyshev's inequality
Ψΐ(\Χ-μ\ > λσ) < 1/λ2.
Show how this inequality may be sharpened, in the case λ = 2, if the
distribution of X is assumed to be (i) normal, (ii) uniform.
20. Spacecraft land on a spherical planet of centre O. Each is able to transmit
messages to, and receive messages from, any spacecraft on the half of the surface
of the planet nearest to it.
(i) It is known that spacecraft have landed at points A and В on the surface
of the planet. Show that the probability that a space craft, landing at random on
the planet, will be able to communicate directly with the spacecraft at A and В is
(π-0)/2π
where θ is the angle AOB.
(ii) What is the probability that three spacecraft, all landing at random on the
planet, will be in direct contact with each other? (C.S.)
21. Engine crankshafts are manufactured so that the diameters, in centimetres,
form a normal distribution with mean 5 and standard deviation 0-03.
Crankshafts with diameters less than 4-94 or greater than 5-06 are rejected. The
accepted product is classified into three grades of size 4-940-4-988, 4-988-5-012,
5-012-5-060.
Show that:
(i) fxe-*a/2dx = -e-*!'2,
(ii) Γ x2 e-^'2 dx = - χ е-*2'2 + Г е-*2'2 dx.
Hence find the average diameter in each of the three grades and the ratio of the
standard deviation in the middle grade to the standard deviation of the
unclassified product before any rejection of under- and oversize shafts. (M.E.I.)
22. The police force in a certain district carries out tests on the brakes of
automobiles chosen at random on the road. Each man is required to test 20 cars.
Calculate the distribution of the number of cars with defective brakes in sets of
20 cars if the probability that a single car has defective brakes is 10 %.
Show the distribution graphically together with a plot of the normal
distribution with the same mean and variance. Comment on the relation between the
two distributions. (M.E.I.)
23. A population contains two strata I and II in proportions }, f respectively.
Both strata are exponentially distributed the first with mean 1 and probability
density function ДлтО = λιβ-Λ'^ (χ1>0);
the second with mean 3 and probability density function
Αχύ = Л2е~л^ (х2 > 0).
550

MISCELLANEOUS EXERCISE 25
Find the mean and standard deviation of the population and the probability
that a randomly chosen member of the population is greater than 2. (M.E.I.)
24. Initially a machine is in good running order but is subsequently liable to
breakdown. As soon as a breakdown occurs repairs begin. If the machine is in
good order at time t then the probability that a breakdown occurs in a small
interval it, t + d t) is α di, and if it is under repair at time t the probability that the
repair is completed in time(i, ί + di) is/?di. Let pit) be the probability that the
machine is under repair at time t.
Write down an equation relating pit + di) to pit) and hence show that pit) is
^{l-exp[-(a+/?)i]}. (C.S.)
551

26. Numerical solution of equations
1. ACCURACY
When equations arise in practical problems, their solutions are generally
required only to some given order of accuracy. General algebraic methods
of solution (for example, for the cubic equation), even if they are available,
are often less suitable than approximate methods for deriving numerical
solutions. Thus, to find the real root of the equation
х3-2-7л;-5-3 = 0
by Cardan's method would require burdensome calculations and no such
method exists to solve, for example, the equation
χ6-2·7λ;-5·3 = 0.
Because of its great practical importance, the estimation of roots of
numerical equations has been extensively studied; in this chapter we shall
confine ourselves to some of the most elementary methods available: the
reader who wishes to study the subject further should consult one of the
standard texts on numerical methods (for example, Henrici, Elements of
Numerical Analysis). It may be mentioned in passing that this subject allows
more scope than most others in elementary mathematics for students to
devise their own methods; although these may often prove less efficient
than the standard techniques, much profit will be gained by developing
them as far as possible—apart from the obvious enjoyment of producing
something original.
Before considering the solution of equations it is worth pausing to
consider some problems connected with the accuracy of calculations. In
numerical calculations, errors occur through rounding-offto a given number
of significant figures. If a number is to be rounded-off to N digits and the
discarded digits form exactly half a unit, round-off to the nearest even digit.
For example, to 3 significant figures,
3-864 к 3-86; 21990 m 2-20 χ 104;
0-03765 at 3-76 χ 10~2; 21-55 в 2-16 χ 10.
*Εχ. 1. Suggest any advantage you see in adopting the convention above. Can
you suggest any reason why even rather than odd digits have been selected ?
552

1] ACCURACY
Once numerical data have been rounded-off, their subsequent use in
calculations introduces further errors which will be cumulative. It would
be pleasant to be able to calculate exactly the error in any numerical answer.
This is usually impossible, but we may be able to produce a positive
'acceptance' for the answer; that is, a quantity which we are sure is more
than the absolute error in the answer. For example, suppose we round-off
a given angle Θ, measured in degrees, to 3 significant figures to give 37-2°.
The actual angle lies between 37-2°-0-05° and 37-2° +0-05°. From four-
figure tables, tan 37-2° = 0-7590 and the difference for +0-05° is
1(0-7618 -0-7563) к 0-0014.
Thus tan0 = 0-7590 + 0-0014;
in other words, tan θ is 0-7590 with an acceptance 0-0014.
*Ex. 2. If possible inaccuracies in the tangent tables are taken into account, show
that tan θ = 0-7590 ± 0-0015.
Now suppose у is a rational approximation to the number x; thus
x = y+e,
where e is the error involved in writing у for x. The absolute error is
|e| while the relative error is \e\jy (more correctly \e\jx, but the two forms
are nearly equal for small relative errors and the form \e\jy has the
advantage of possessing a known denominator).
Usually we do not know the exact error, only an upper limit to the
absolute error—the maximum absolute error. For example, if a number is
rounded to 4 decimal places, the maximum absolute error is 5 χ Ю-5.
Similarly, we have a maximum relative error. In a calculation in which we
know the maximum absolute errors in the given numbers it is important
to be able to estimate the maximum absolute error in the answer.
Suppose that x1 is written as yx with maximum absolute error \ex\
(we use modulus signs to emphasize that maximum absolute errors are
positive) and that x2 is written as y2 with maximum absolute error |e2|.
Then the maximum absolute error in writing yx+y2 for x1+ x2 is \ex\ +\e2\,
while the maximum relative error is
(Ν + Ν)/0ί+λ).
Ex. 3. Show that the maximum absolute and relative errors in writing у^-Уг for
xi-x2 are respectively |et| + |e2| and fle^ + \ег\)/(у1-у2).
*Ex. 4. Neglecting terms such as |e1e2|, show that the maximum absolute and
relative errors in writing у^уг for хгхг are respectively
ΐΛβ,Ι + ΐΛ^Ι and Μκ\ + Μγ%\.
Show also that, writing у^уг for xtlx2 they are respectively
ЫУг\ + МА\ and \еЫ + \чЫ-
553

NUMERICAL SOLUTION OF EQUATIONS [26
Ex. 5. If you have a calculating machine available, calculate:
(i) Х1 + Х2,У1 + Уг\ (ii)*i-*a, Уу-Уг, (Ш) ХуХг, У\Уг,
(iv) x-ijx^, Уг1уг (6 significant figures) where
Xl = 2-914, x2 = 0-3472, yi = 2-9, y2 = 0-35.
Ex. 6. What are the maximum absolute and relative errors in writing
(i) y\ for xl; (ii) Vtt for V*i5 ("О VCfi+^a) f°r Vfe+^2)?
Before leaving the subject of errors, it is worth noting one particular
case in which significant figures may be lost. Consider 3-144-3-097: both
of these numbers are given to 4 significant figures, but their difference,
0-047, is correct to only 2 significant figures. It is sometimes possible to
modify a calculation in such a way to minimize this loss of accuracy.
For example, if it is required to calculate the difference between the
values of a function of χ for two given values of χ that are close together,
it may help to use a Taylor expansion. Thus, using four figure tables,
sin3r-sin30° = 0-0150,
but, by Taylor's expansion,
sin 31° = sin 30°+(я/180) cos 30°-KW180)2 sin 30°...
= 0-5000+0-01512-0-00076
and sin ЗГ-sin 30° к 0-01504; again using four figure tables and
obtaining a result correct to 4 significant figures.
Again, suppose we have to solve the quadratic equation
х*-\6х-\ = 0.
16 + V260 . 16-V260
The roots are ^ and ~ .
Direct evaluation of these two quantities gives
16-06 and -0-06
(using four figure tables). However, in obtaining the negative root, we
lost significant figures by subtraction. A better approximation may be had
by calculating the negative root using the more accurate positive root 16-06
and the known relation that the product of the roots is — 1:
««-las*- °·0627·
Before leaving the subject of significant figures, it is worth noticing that
a calculation can be carried out to a high degree of accuracy and yet not
furnish a correct result even when corrected to 1 significant figure. For
554

1] ACCURACY
example, if the correct answer is 0-649 999 9 and our numerical answer is
0-6500001 then, to 1 significant figure our answer is 0-7 but the correct
answer is 0-6 although our calculation was only 0-0000002 in error.
Example 1. Given that J31 =5-568 and J 30= 5-477, find J 31-J 30.
Direct computation gives ^31—^30 = 0-091; but using
(V31 - V30) (V31 + V30)=ι
we have ^31-^30= 1/(5-568+ 5-477)
= 0-090539 (from 5 figure reciprocal tables)
and this is in fact correct to 5 s.F.
2. LOCATING THE ROOTS OF THE
EQUATION f(x) = 0
Before embarking upon the accurate estimation of a root of the equation
f(x) = 0 it is usually necessary first to locate the root roughly.
One method of doing this is to draw the graph of у = f(x) and find
where it cuts the χ axis. Indeed, it is possible, by accurately plotting a
succession of graphs of the relevant regions of f(x) on increasing scales,
to obtain quite good approximations to roots of given equations. When
using a graphical technique it is sometimes convenient to write the equation
in a form other than f(x) = 0, e.g. g(x) = x. See, for example, Ex. 7.
*Ex. 7. Verify from a freehand sketch that the equation
χ = 2 sin л:
has a root lying between \π and Jw. Draw the graphs of у = χ and у = 2 sin л:
accurately on graph paper, taking values of χ from \π to \n and hence obtain
a better estimate of this root. Suggest how you could continue the process. If
you have a table of sines of angles in radians, show how these may be used to
obtain rapidly a good approximation.
The graphical method just mentioned suggests a simple analytic method
for locating the presence of a root: if /is continuous and f(a), f(b) have
opposite signs, then the equation f(x) = 0 has at least one root lying
between χ = a and χ = b.
Ex. 8. Show that the equation _
has just one real root and that this root lies between χ = 1 and χ = 2.
Ex. 9. Show graphically that it is possible for roots of the polynomial equation
f(x) = 0 to occur between χ = a and χ = k without f(a), /φ) being of different
signs.
555

NUMERICAL SOLUTION OF EQUATIONS
[26
When a root has been located between χ = x0 and χ = хъ an
approximate value for the root may be obtained by linear interpolation; that is,
by approximating to the graph of у =f(x) between χ = x0 and χ = хг
by a straight line (see Figure 26.1, in which О A = x0, OB = xx; we take
ОС as our approximate root, the exact root being OD). Since the equation
of the linePg, in the notation of Chapter 18 with x1—x0 = h, is
Ky-fo) = A/o(*-*o)
we have, putting у = 0,
as an approximation to the required root.
_hfo
0 Δ/ο
*Ex. 10. Show from graphical considerations how the computation of Δ2/0
enables one to tell whether the approximation by linear interpolation is likely to
be too big or too small.
Example 2. Show that the equation
x3 + 2x-4 = 0
has only one real root, and find its value, correct to 1 decimal place, by linear
interpolation.
WriteДх) = x*+2x-A; then/'(x) = 3x2+2, and/"(x) = 6x.
Thus we see immediately that, since the graph of/has no maximum or
minimum, it can cut the χ axis only once, giving just one real root (Figure
26.2).
556

2] LOCATING ROOTS
By direct computation,/(1) = —1, /(2) = 8 and the real root lies
between χ = 1 and χ = 2, probably nearer to χ = 1.
By linear interpolation, a second approximation is given by
x= l+h
Since f\x) > 0, f"(x) > 0 between χ = 1 and л- = 2 (see Figure 26.2)
the root χ = Ц will be an underestimate.
Fig. 26.2
By direct computation again we have
Λ1·1)= -0-469, /(1-2) =+0-128,
showing that the root lies between χ = 1-1 and χ = 1-2, and our next
approximation to the root is л л , 469 1
1·1+Τ9ΤχΤο·
As before, this is an underestimate and we can be confident, without
further working, that χ = 1-2 is the value of the required root correct to
1 decimal place.
Another valuable method of locating roots of a polynomial equation is
based on a consideration of the number of sign changes among the
coefficients in the expression „ , „ , ,
Ex. 11. Prove that, if all of the coefficients are positive, then the equation
a0xn + a1x"-1+...+an = 0
can have no positive roots.
Prove further that, if the coefficients of all odd powers of χ are zero and
remaining coefficients are positive, then the equation can have no real roots

NUMERICAL SOLUTION OF EQUATIONS [26
The result of Ex. 11 may be extended to the following result, known as
Descartes's Rule of Signs: the number of positive roots of a polynomial
equation cannot exceed, and has the same parity as, the number of sign
changes among the coefficients, reading from left to right. (Two numbers
have the same parity if they are both even or both odd.)
For example, the equation
2χ»-ΊχΊ-4χ* + 1χ· + 2χζ + χ*-5χ-4 = 0
+ -- + + +--
ft t
has three sign changes and thus either one or three positive roots. Again,
theequation л.-бх.-4л» + Зх«-^+2 = 0
has four sign changes and thus has zero, two or four positive roots.
By writing — χ for χ one may similarly find an upper limit to the number
of negative roots.
Ex. 12. Show that the equation
χ7-3χ*-χ2-ί = 0
has just one real root, by considering separately the possible positive and negative
roots.
The proof of Descartes's Rule is not difficult, but is complicated by the
number of special cases that have to be considered. The reader who wishes
to follow the proof through should try Exs. 13-17; in all these equations
we adopt the notation
P(x) = aoxn + a^-i+.-.+a^x + an,,
where an > 0 and we suppose that P(x) has к sign changes.
Ex. 13. Prove that P\x) has к sign changes if a„_t > 0 and (k-1) sign changes
if e„-i < 0.
Ex. 14. Prove graphically that, if P'(0) > 0, no roots of the equation P(x) = 0
lie between χ = 0 and the least positive root of P'(x) = 0 and that if P'(0) < 0,
at most one such root of P(x) = 0 exists.
Ex. 15. What happens in Exs. 13, 14 if α„_ι = 0?
Ex. 16. Complete the proof of Descartes's Rule of Signs using mathematical
induction.

2] LOCATING ROOTS
Exercise 26(a)
1. If χ is rounded-off to 3 significant figures to give the numerical value 48-7,
give the value and range of acceptance for (i) sin x°; (ii) tan x°, using 4-figure tables.
2. If x, у are rounded-off to 3 significant figures, their values are χ = 17-2,
у = 5-16; give the range of acceptance for (i) x+y; (ii) xy; (iii) x/y.
3. Compare the relative accuracy obtained from your square root tables for
x = V3-V2 with that obtained for χ = (V3 + V2)-1·
If V2 = 1-41421... and V3 = 1-73205... find as accurately as you can a value for
V3-V2-
4. Given that V130 = 11-4018 and V132 = 11-4891 find as accurately as you
can a value for V132-V130.
5. The triangle ABC is right angled at В. AC, АВ are measured to the nearest
millimetre, their lengths being found to be 7-4 and 4-4 cm respectively. Use the
theorem of Pythagoras to calculate the length of ВС, stating what confidence
you have in the reliability of your answer.
6. If χ = y + e, use the Taylor expansion to find the absolute and relative error
in taking tan χ to be equal to tan y.
7. With the notation of Question 6, find the absolute and relative error in
taking (1+ЛГ2)-1'2 to be (1+У)-1'2.
8. If χ = 4-6 ± 005 and f(x) = x3 - 2x +1, and if χ is taken to be 4-6, find the
relative errors in (i) x; (ii) f(x). Comment upon your result.
9. Find graphically, using the method of enlarging scales, the least positive root
of the equation , χ
correct to 2 decimal places.
10. Solve graphically the equation 2x = 1 + In x.
11. Solve graphically x3 = 10.
12. Find 21'5, correct to 2 decimal places, by a graphical method.
13. Locate the three roots of the equation
2х3-6х-Ъ = 0
and find their approximate values, using linear interpolation.
14. Locate the two real roots of the equation
*4 + x410x-24 = 0
and find their approximate values, using linear interpolation.
15. Use Descartes's Rule of Signs to prove that the equations
(О^-глг'-З = 0; (11)^-^ + 5^ + 2 = 0; (iii) χ6 + **-4*3 + 5л: + 2 = 0
each have at least two complex roots.
559

NUMERICAL SOLUTION OF EQUATIONS [26
16. Use Descartes's Rule of Signs to show that the equation
x* + x+l = 0
has only one real root, and that this root is negative.
Show that this result can also be obtained from simple graphical considerations.
17. What can you say about the reality of the roots of the equation
xn + x2 + 2 = 0
where и is an integer greater than 2?
3. ITERATIVE PROCESSES FOR
SOLVING EQUATIONS
In this section we consider the application of iterative methods for finding
numerical solutions of equations; that is, methods which develop
successive approximations to a root of a given equation by a simple repetitive
process depending upon a recurrence relation.
As a first example, consider the equation
x2-5*-5 = 0.
(A quadratic equation is chosen for simplicity, but the method to be
outlined below is applicable to any polynomial equation.)
The given equation has roots lying between — 1 and 0 and between 5 and
6. We shall denote successive approximations to one of the roots by
X0, Χι, X2, ....
We first seek the negative root: take x0 = — 1. Since the equation may
be rewritten , , .
χ = jx2— 1
we try, as a plausible attempt at developing successive approximations,
the recurrence relation , , .
Xr+i = yx?-l.
Starting with x0 = — 1, this gives successive approximations
-1, -0-8, -0-872, -0-847, -0-857.
Thus xt = —0-857, and the process is seen to be converging (albeit rather
slowly) towards the negative root —0-8541 ....
Now suppose we adopt the same recurrence relation to find the root lying
between 5 and 6: , . , „ , „ A
x0 = 6 gives 6-2, 6-69, etc.
diverging.
x0 = 5 gives 4, 2-2, -0-32, -0-980, -0-808, etc.
and we are clearly converging towards the wrong root!
560

3] ITERATIVE PROCESSES
To see what has gone wrong let us write
where α is the exact root of the given equation and er is the error in the
estimate xT. _ j. _ 1
gives α - er+1 = |(a - erf -1
which reduces to er+1 = fae„
on neglecting e2. and recalling that α = ^-α2—1. Thus |eT+1| < |eT| only if
\oc\ < 2-5; in other words, the absolute errors diminish in this iterative
process only if the root to which we are approximating is less than 2-5
a magnitude.
Graphically, the solution о
= ΐχ2-1
means finding the χ coordinate of the intersection of the straight line
with the parabola у = jx2 — 1.
Our iterative process consists in starting with a given value of x,
Oo = — 1) finding the corresponding point on the parabola, moving from
there to the line у = χ (χχ = jxl—l), and thence to the parabola, then
on to the line у = x, again (x2 = \x\-1), and so on, spiralling in to the
root χ = — 0-854... in the cobweb pattern shown in Figure 26.3.
Fig. 26.3
However, if we start at x0 = 5, a quite different pattern emerges. The
reader should draw the graph and illustrate the various stages of the
iterative process: he will see that the path taken moves rapidly across the
graph to the negative root.
561

NUMERICAL SOLUTION OF EQUATIONS
*Ex. 17. Write the equation 2 _
in the form χ = 5 + 5/л:
and obtain the recurrence relation
Show that, with x0 = 6, this gives xb = 5-8541 which is correct to four decimal
places.
Show further that, in this case,
*Ex. 18. Illustrate graphically the iterative process of Ex. 17 by sketching the
graphs
у = χ and у = 5 + 5/x.
Ex. 19. Explain geometrically why convergence is more rapid for finding the
positive root from xr+i = 5 + 5/xr than for finding the negative root from
Ex. 20. Explain why, in both the iterative processes considered so far, the
successive approximations oscillate from side to side of the exact root.
Now consider a polynomial equation rewritten in the form
(We have already seen that there are various ways of doing this: our
object now is to choose the best one for a given root.)
Suppose we take „ ч
Xr+l = f(Xr),
where xr, xr+1 are successive approximations to the exact root a, with
errors er, er+1:
oc = xr + er, cc = xT+1 +eT+1.
Since a is a root of the given equation,
α =/(α).
But xr+i=f(xr),
giYing a-er+1=/(a-er)
» /(a) - erf'{a), on using Taylor's expansion.
Thus er+1 « erf'(oc).
Successive applications of this result give
er+1 « erf{cc) « e^JfW *... * ejf (*)]'*;
thus, provided \f'(oc)\ < 1, |er+1| < |e0|
562

3] ITERATIVE PROCESSES
and er decreases in magnitude as r increases. If, however, |/'(a)| > 1,
the error increases as r increases. The reader should now reconsider the
two iterative processes for the quadratic equation considered previously,
in the light of this analysis.
Ex. 21. Explain graphically what happens to the iterative process if /'(a) = 1.
*Ex. 22. Illustrate the significance of the condition |/'(a)| < 1 for the
convergence of the iterative process
by drawing four graphs with:
(i) fix) < l,/"(a) < 0; (ii) /'(a) < l,/"(a) > 0;
(iii) Γ (μ) > 1,/"(α) < 0; (iv) /'(a) > l,/"(a) > 0.
As we have already seen, the iterative procedure outlined above
converges to the required root rather slowly. A more powerful iterative
procedure is provided by the Newton-Raphson process. Consider the equation
f{x) = 0
and suppose, as before, that α is an exact root, with x0 the first
approximation and error e0; that is _
We then have Да) = О,
and thus, on using a Taylor expansion and regarding e0 as being sufficiently
small for us to be able to ignore e£ and higher powers,
f(x0)+ej'(x0)*0.
This yields an expression for the error term
F ~ /(*o)
0 ~ ГЫ
and we may take as our next approximation
r _ r Я*о)
10 /w
Repetition of this process leads us to the recurrence relation
f{Xr)
*+1 - Xr f'(xrY
Example 3. Show that the cubic equation
л:3-5л:-8 =0
has just one real root, and find its value, correct to 2 decimal plac

NUMERICAL SOLUTION OF EQUATIONS
we have
and the α
f'(x) = 3^-5,
= f(x) is seen to have two stationary points, a maximum
at χ κ —■s/l-l, and a minimum at χ « +^\·Ί. But —^/1-7 » —1-3 and
/(—1-3) < 0 therefore the curve cuts the χ axis just once and the equation
thus has only one real root (which is clearly positive). (See Figure 26.4.)
Fig. 26.4
The next step in the solution is to locate the root: to do this we substitute
integral values for χ until we discover a sign change. In order to write
down the differences Д/, Д2/,... we tabulate our working, writing the values
of/in a vertical line:
/
-8
-12
-10
Δ/
Δ2/
6
12
Δ3/
The root we seek lies between 2 and 3; using linear interpolation we
take as our first approximation
x0 = 2 + ifxl t
2-7.
Since ДД2) > 0, the curve у = f(x) is increasing between χ = 2 and
χ = 3 and since Δ2/(2) > 0, it is increasing at an increasing rate: we
564

3]
ITERATIVE PROCESSES
deduce that 2-7 is an underestimate of the exact root (see Figure 26.5,
which has not been drawn to scale).
SinceA*) ξ 3χ2-5,
Xl = 2-7-
= 2-7+
a 2-
Л2-7)
/'(2-7)
1-817
16-87
У
0
1 2 /
/Ax =
1
.1
ι ·
4Λ2) =
Fig. 26.5
The process is now repeated, taking, xx = 2-808 as our second
approximation. (The process is much facilitated by the use of a hand-calculating
machine: if one is available, recall the method of nested-multiplication for
the evaluation of polynomials—see Chapter 18.)
= 2-8(
Л2-8(
/'(2-8
Since we may regard this as a more accurate approximation to the
required root than 2-808, we take
χ = 2-80
as the root to 2 decimal places.
Ex. 23. Use a calculating machine to obtain the root of the equation
л:3-5л:-8 = О
correct to 4 decimal places.
A feature of the Newton-Raphson process is that, since at each stage the
value off(xr) is calculated, a running check may be kept on the residuals;
7 ppmii 565

NUMERICAL SOLUTION OF EQUATIONS [26
that is, the values obtained by substituting our successive approximations
into the polynomial. (We want the residuals to be zero eventually.)
We now investigate, as before, how rapidly the process converges.
Suppose that the exact root of the equation J{x) = 0 that we seek is α
and that we obtain a sequence of approximations x0, хъ x2, ..., where
From the recurrence relation
we thus have
giving
Now write
then
and also, since
we have
Xr+1-*r f(Xry
a_e x c Л«-*г)
Λ«-«γ)
8(x)=Mr for /,(*) + 0;
g(a) = 0, provided/'(a) + 0
1&)f(x)=№,
SW'M+£(*)/"(*) =/'(*)
and g"(x)f(x) + 2g'(x)f'(x) + g(x)f"(x) =f"(x).
Thus
From the equation
g'(pi)=l, g"(cc)=-f(cc)lf'(cc).
er+1 = er + g(cc-er)
we now have, using Taylor's theorem and ignoring powers of er higher
than the second, . N ,. N , 2 „, N
- 2/'(a)e-
The analysis above shows us that each error is proportional to the square
of the error in the preceding term: we deduce that convergence is more
rapid in the Newton-Raphson process than in the first iterative method
discussed in this Section. (The Newton-Raphson method is a second-order
process.)
Geometrically, the Newton-Raphson process is equivalent to drawing
a sequence of tangents to the curve у = f(x).
Let y0 =f(x0)', then the tangent at the point (xq, y0) on the curve
У = Λχ) has equation „. ч . ч
566

and this meets the χ axis
ITERATIVE PROCESSES
(see Figure 26.6).
Ex. 24. Show by graphical considerations, that, once the process starts to
converge to a root, it always does so from one side. Try to produce an analytical
argument to support this assertion.
Ex. 25. Explain by a graphical argument how in accuracies may arise in the
neighbourhood of two nearly equal roots.
The Newton-Raphson process involves division by the awkward
number/'^,.). A simplification is effected by using the von Mises iteration:
Xr+1 Xr ПхоУ
where the variable denominator f'(xr) is replaced by the constant f'(x0)·
The convergence is less rapid but nevertheless fairly good.
Ex. 26 Find the negative root of the equations
by Newton-Raphson and von Mises's iteration, taking x0 = - 1.
Ex. 27. Interpret the von Mises process graphically. Explain why a very efficient
procedure is to use the Newton method strictly for a suitable number of stages
and then stick with a constant value of/', e.g.f'(x^) after two stages.
We conclude this chapter with an example of how an iterative process
of a required order may be developed—in this case determining the
reciprocal of a number.
7-3 567

NUMERICAL SOLUTION OF EQUATIONS [26
Example A.Iflja is calculated from the recurrence relation
xr+1 = 2xr — ax\
prove that er+1 = αή..
Find also the connection between eT, eT+1 if I/a is calculated from the
recurrence relation „ „ „ „ „
xr+1 = Зхг-3ах}-а2х1
and suggest a fourth-order iterative process for finding \\a.
If xr+1 = 2xr-a4
we have xr+er = -
a
1
4
.. l_^x
a4=1-(2xr~a4)
=> er+1 = ее*.
Similarly, if xr+1 = 3χτ-3αχ} + α24,
-*--1,
. ι 3χτ 34 ,
=> а*4=--(3хт-3а4 + а24)
=> er+1 = а2$.
Now suppose we seek a recurrence relation such that erH
xr+er =-
n«4-
=> a34 = - - (4xr - 6ax2r + 4a24 - aa4).

3] ITERATIVE PROCESSES
Thus, if we set xr+1 = 4χτ-6α4+4α24~α3χ%
it follows that er+1 = a3e*
and we have a fourth-order iterative process for finding \\a.
Exercise 26(b)
Find the real roots of the equations 1-8, using any suitable iterative process. Use
linear interpolation to find the first approximation and give your final answer to 3
significant figures.
I. x3-3 = 0. 2. л:3-100 = 0.
3. x*-5 = 0. 4. хг-Ъх-П = 0.
5. χ3- 6x2 + \0x- 9 = 0 (1 root). 6. х3-Ъх2-Ъх-1 = 0 (1 root).
7. jc*-7x-12 = 0(2 roots). 8. х3 + Ъх2-9х-\6 = 0(3roots).
9. Find to 4 significant figures the least positive root of the equation
л4- 13л:2- 18л:- 5 = 0.
10. Show that the equation
2лл-10л:3+10л:-1 = 0
has two roots between 1 and 2 and find their numerical values, correct to 4
significant figures.
II. Use the Newton-Raphson method to find an approximate value for the least
positive root of the equation
3 tan χ = Ax (4 decimal places).
12. Find an approximate value of χ such that л: + е* = 3.
13. Solve, correct to 2 decimal places, the equation
sin^=3*-l.
14. Show that the equation
(2k+l)x3-k(x+l) = 0,
where к is large and positive, has a root near to 1.
Find the equation of the tangent to the curve
y= (2k+l)x3-k(x+l)
at the point χ = 1. From the equation of this tangent find a better
approximation to the indicated root of the original equation. (O & C)
15. Find the root of the equation
sin χ = хг
other than χ = 0, to 3 decimal places.
569

NUMERICAL SOLUTION OF EQUATIONS [26
16. Establish Newton's formula for obtaining a closer approximation to a real
root of the equation f(x) = 0.
Use this method to find, correct to 3 significant figures, the positive root of the
^иай°П 4cos*-2*-l = 0. (L.)
17. By using Newton's method of approximation, or any other method, find
the value of χ correct to 3 decimal places for which the expression
x+l
\nx
has a stationary value. (L.)
Miscellaneous Exercise 26
1. Find the greatest root of the equation
х3-Ъх+\ = О,
correct to 3 decimal places.
2. The roots of the quadratic equation ax2 + bx— 1 = 0 are calculated from the
recurrence relation .
Χτ+1 = ^Го-
Interpret this process geometrically, and prove that
eT+1 = -aX4T,
where Xis the exact value of the root being calculated and X = xT + eT.
With the help of reciprocal tables, use this method to solve the equation
3. Give a sketch showing the general shape of the graph of у = sec л: for values of
χ from χ = 0 to χ = %π.
Deduce from the graph that large roots of the equation
are approximately equal to (и+ i) π, where и is a large integer; and prove that
closer approximations are given by
where the positive sign is taken when η is odd, and the negative sign when η is
even. (O & C)
4. A root of the equation
sin3 ix0 + cosx0-f£§ = 0
is close to 60. Find the value of the root, correct to 0· 1. (O & Q
5. Draw an accurate graph of у = \π sin χ between χ = 0 and χ = π. Draw in
the same diagram the lines у = mx for m = f, f, f, f, 1. Determine the values
of χ where these lines cut the graph of у = \π sin χ, giving your answers in the
form ктт, where к is correct to 2 decimal places.
570

MISCELLANEOUS EXERCISE 26
Use the values obtained to draw a separate graph of у = in sin x/x between
χ = in and χ = η. (Ο & С)
6. Verify that χ = § η is an approximate solution of the equation cos χ = ix,
and show that a better approximation is 1-03. (O & Q
7. If Or + er)2 = a, prove that
Ι/β \ er2
and deduce the recurrence relation
—1Ы
for finding Vft
Suggest an intuitive argument leading to this recurrence relation and show
that the same relation is obtained by applying the Newton-Raphson process to
the equation x2-a = 0.
8. Vй is calculated from the recurrence relation
_ α2 + 6αχΙ + χξ.
Xr+1~ AxM+xl) '
""""""" ^~ 4xr(a+xl)
and find Vll correct to 6 decimal places.
9. Develop a recurrence relation for finding av* in which er+1 = ke\ (where к
depends upon r) and hence find $6 to 5 decimal places.
10. Prove graphically, or otherwise, that the equation
cos χ = mx (иФО)
has one and only one root in the interval — in < χ < \π.
The angle α is denned as that root of the equation
cot α = -α
which lies between %n and n. Prove that, when m lies in the range
the equation cos χ = mx has three and only three roots in the interval
-in <x<\n.
By means of careful graphs of the functions cot χ and — x, or otherwise, obtain
the value of a, giving your answer in radians to 2 significant figures.
11. If a root of the equation f(x) = 0 is obtained by an iterative process in which
er+1 « keT (k constant) show that the expression
(Xr-Xr-d*
xT — 2лгг_! + л:г_2
may be taken as an approximation to the required root.

NUMERICAL SOLUTION OF EQUATIONS [26
12. Find approximately the root of the equation
χ* + 3χ3-7χ* + 18χ-1& = 0
which lies between 1 and 2.
13. Prove, graphically or otherwise, that the equation
/(*)= l+|2-tanx=0
has a root near χ = ^π + kn, where к is any large integer.
Denoting (&+£) π by K, prove that a better approximation to the root is given
by K+A, where /да + Л/'Ш = 0,
and that λ « —.
Find an approximation to the root correct to the term of order K~*. (O & Q
14. The equation хг — ax + b = 0 has roots α, β. Form the equation with roots
a2,/?2 and, if the equation with roots a2",/?2" is x2-a„x+b„ = 0, write down
the equation with roots a2"+1, /?2"+1.
If α = λ/? where |λ| < 1, show how the formation of a sequence of such
equations may be used to give successive approximations to β and hence a.
Under what circumstances does this method provide a very satisfactory
method for solving quadratic equations ? Discuss the magnitude of the errors
involved and consider the cases of (i) equal, (ii) complex, roots.
Discuss the application of this method of solution to equations of degree
higher than the second.
15. Prove that the equation e* = λχ has a real solution if and only if λ > e,
but that xex = μ has a real solution for all real values of μ.
Solve the equation xsx = 1 to 5 decimal places.
16. xT, xT+u xT+2 are successive approximations to a root of the equation f(x) = 0
obtained by the Newton-Raphson process. Writing CT for the correction
-АхЖ(Хт), so that CV+1 = -fixT + Cr)/f'(Xr + CT) show that
CV+1 lffe)
Cr2 ~ If'ixr)
by using Taylor's result,
Ax+h) *Ax)+hfXx) + m"(x),
where h is small.
Deducethat _ иытгы
χ»* χτ+1« 2[Πχτ)Υ ·
17. It is required to find the nth root of the number a by the Newton-Raphson
process. Show that this can be done by solving the equation xn+k—axk = 0.
Using the analysis of Question 16, show that most rapid convergence is
obtained by choosing к = £(1 - η). Use this method to obtain a value for $2,
carrying out two iterations.
572

27. The ellipse and hyperbola
1. CONICS: FOCI AND DIRECTRICES
In Chapter 22 we considered the geometry of the parabola and rectangular
hyperbola; both these curves are examples of a type of plane curve called
a conic, which we shall now define. Given a point S and a line / not
containing S, a conic is the set of all points Ρ in the plane of / and S such that
the ratio of the distance of Ρ from S to the distance of Ρ from / is a fixed
number.
If we call the foot of the perpendicular from Ρ to /, M, and the fixed
ratio e (not to be confused with the base (e) of the natural logarithms) then
a conic is the set of points Ρ in the plane containing S and /, such that
\~\ = e (Figure 27.1).
The point S is called a. focus of the conic; /is a directrix. The number e
is called the eccentricity of the conic. The type of conic is determined by
Fig. 27.1 Fig. 27.2
the value of e. The reader will no doubt already have noticed that, with
e = 1, the conic defined is a parabola. More generally, we define the
following types of conic:
(i) if 0 < e < 1, the conic is called an ellipse;
(ii) if e = 1, the conic is called a. parabola;
(iii) if e > 1 the conic is called a hyperbola.
The shape of the three types of conic are shown in Figures 27.2 (ellipse),
27.3 (parabola) and 27.4 (hyperbola). (The dotted lines in Figure 27.4 are
573

ELLIPSE AND HYPERBOLA [27
not part of the hyperbola: they are, in fact, asymptotes and are included
as an aid towards drawing the curve.)
Ex. 1. Explain why the set of points
{(х,у):* = 41(х-1У + <у-1П
represents an ellipse.
Fig. 27.3 Fig. 27.4
2. THE ELLIPSE
The ellipse has been defined in Section 1 as a conic with eccentricity e < 1.
Before attempting to obtain the Cartesian equation of an ellipse, it is
worthwhile to consider a little of the geometry of the curve, in order that
we may be able to choose the most suitable coordinate axes. Suppose that
S is the given focus and / the corresponding directrix and let К be the foot
of the perpendicular from S on to /.
If e is the given eccentricity, then there are two points A, A' on the line
SK which belong to the ellipse—namely the points dividing SK internally
and externally in the ratio e : 1 (see Figure 27.5 and notice that A' lies on
KSproduced, since e < 1). Let О be the mid-point of AA' and set OS = s,
OK= k. Then, if A A' = 2a,
SA = a—s, A'S = a+s,
AK=k-a, A'K= k+a,
and thus, by the definition of A, A' as points of the ellipse,
a—s = e(k — a)
a+s = e(k + a).
Solving these two equations we have
s = ae, к = ale.
574

2]
ELLIPSE
Let us now take О as the origin, OK as the л: axis and the perpendicular
to this line through О as the у axis. Then, by what we have just shown, S
is the point (ae, 0) and /is the line x — aje = 0. LetP(x, y) be any point of
the ellipse; by the definition of the ellipse
SP2 = e2PAP,
where Μ is the foot of the perpendicular from Ρ on to / (Figure 27.6).
У
в
л\ о
В'
Ρ
\)
s Ια
μ
Fig. 27.6
Thus, the ellipse is the set of points
{(x,y): {x-aef+y* = e\ale-xf).
The equation may be rewritten
x2 — 2aex + a2e2+y2 = a2 — 2aex + e2x2
=> x\\-e2)+y2 = a2{\-e2)
If we write
a2^a2(l-e2) ' L
b2 = a\\-e2),

ELLIPSE AND HYPERBOLA
the equation becomes 2
-2 + Γ2 = ^
a2 b2
which is called the standard form of equation for the ellipse.
*Ex.2. Show that the ellipse ,, „
b2x2 + a2y2 = a2b2
is symmetrical about the χ and the у axes and that (with the notation of Figure
27.6) В is the point (0, b) and B' is the point (0, - b).
*Ex. 3. Deduce, by an appeal to symmetry, the existence of a second focus S'
(—ae,0) and second corresponding directrix x+a/e = 0.
*Ex. 4. О is called the centre of the ellipse. Show that every chord PP' of the
ellipse which passes through О is bisected at O.
AA', BB' are respectively the major and minor axes of the ellipse; a is
the length of the major semi-axis, b is the length of the minor semi-axis
(b < a, hence the word 'minor'). From the original definition of b, we have
the important equation connecting a and b:
b2 = a\\-e2).
Ex. 5. If a circle is regarded as a special case of an ellipse with equal major and
minor axes, show that it corresponds to a conic with zero eccentricity.
Ex. 6. A chord drawn through a focus perpendicular to the major axis of an
ellipse is called a latus rectum of the ellipse. Show that the length of the latus
rectum is 2b2/a.
Ex. 7. Show that the eccentricity of the ellipse
is у and deduce the coordinates of its foci.
Ex. 8. Show, by translating the coordinate axes, that the equation
(x-1)2 Q4-2)a =
3 + 2
represents an ellipse with centre (1, — 2). Find the coordinate of its foci and show
the eccentricity to be yV3·
The ellipse arises in nature as the typical orbit of objects moving under
the action of an attractive force varying inversely as the square of their
distance from a fixed point. For example, planets describe ellipses (with
slight variations and discrepancies due to the attractions exerted on each
other) with the sun at one focus—hence the use of the letter S for the focus.
Similarly, for the motion of satellites around the Earth, the centre of the
Earth being situated at a focus (again with a slight discrepancy, this time
576

2]
ELLIPSE
because the Earth is neither exactly spherical, nor uniform). The
aesthetically pleasing shape of the ellipse has long been admired: it can be seen, for
example, in the elliptic arches of some bridges which with their reflection
in still water yield complete ellipses.
Ex. 9. The eccentricity of the Earth's orbit is approximately -^ while that of the
planet Pluto is \. Compare the shapes of their orbits.
Ex. 10. An astronomical unit (a.u.) is the mean distance of the Earth from the
Sun. (It is approximately 500 light-seconds.) At its nearest approach to the Sun,
the planet Mercury is distant about 0-308 a.u. from the Sun, while its maximum
distance is about 0-466 a.u. Calculate the approximate eccentricity of the orbit.
*Ex. 11. Show that χ = a cos Θ, у = b sin θ gives a parametric representation for
points of the ellipse b2x2 + a2y2 = a2b2.
A rod has three points P, Q, R marked on it, where PQ = a, QR = b. If
P, Q are constrained to move along two fixed perpendicular lines, show that R
moves along the arc of an ellipse. (This is the engineer's paper trammel method
for drawing ellipses.)
*Ex. 12. Draw in the same diagram the ellipse
b2x2 + a2y2 = a2b2
and the circle x2+y2 = a2
(the auxiliary circle of the ellipse). Given a point Ρ of the ellipse, let the
perpendicular through Ρ to the major axis of the ellipse cut the auxiliary circle at Q.
If Ο β makes an angle θ with the major axis, show that Q is the point (a cos Θ,
a sin Θ) and that Ρ is the point (a cos Θ, b sin θ), θ is called the eccentric angle of
PQ.
The relationship between the ellipse
b2x2 + a2y2 = a2b2
and its auxiliary circle x2 + y2 = a2
outlined in Ex. 11 and Ex. 12 is worth a little further study.
It will be recalled (see Chapter 13) that linear transformations of the
plane into itself map straight lines into straight lines and in particular,
parallel straight lines into parallel lines. They also map ratios of lengths
on a line into the same ratio of corresponding lengths on the image line;
in particular they map the mid-point of a line segment into the mid-point
of the image line segment. We shall now consider the particular linear
transformation, T, defined by the equation
x' = x, y' = by/a.
Ex. 13. Show that perpendicular lines do not map into perpendicular lines
under T, unless a2 = b2. Identify the two linear transformations for which a = b
and a = —b, and explain why these do preserve perpendicularity.
577

ELLIPSE AND HYPERBOLA [27
*Ex. 14. Show that the image of the circle.
{(x,y):x2+y2 = a2}
under Τ is the ellipse , ,.,.„,. „ ....
{(x , у ): b2x2 + a2y2 = a2b2).
It follows from the preceding remarks and the result of Ex. 14 that
properties of parallel chords of circles, mid-points of chords of circles, etc.
are preserved by Τ and thus correspond to identical properties of the
ellipse. For example, since the mid-points of parallel chords of a circle lie
on a straight line through the centre, the same is true for an ellipse. The
line obtained is called a diameter of the ellipse. Furthermore, the two lines
parallel to the given chords passing through the intersections P, Q of the
corresponding diameter with the circle are tangents: the same is true of the
ellipse (see Figure 27.7).
Again, in the circle, the centres of chords parallel to PQ define another
diameter, RS, such that all chords parallel to RS are bisected by PQ. The
same property is thus true of the ellipse: P'Q' and R'S' are called
conjugate diameters; they have the property that each bisects all chords parallel
to the other (see Figure 27.8).
The geometrical interpretation of Τ is as follows. Given a point P, draw
578

2] ELLIPSE
Ρ X perpendicular to the χ axis to meet it at X {PX is the ordinate of X);
then P' is the point on PX such that
P'X = (b/a) PX.
In particular, using the language of Ex. 12, the image of a point Ρ on the
auxiliary circle lies on the ellipse (see Figure 27.9).
Fig. 27.9
*Ex. 15. Show that Γ is a non-singular linear transformation, that is, that T~x
exists.
Example 1. LP is the tangent at Ρ to an ellipse, centre O, and PR is the chord
parallel to LO. Show that, if RQ is a diameter of the ellipse, then LQ is a
tangent.
Suppose that the ellipse is taken in standard form and let us denote image
elements under T'1 (see Ex. 15) by dashes. The corresponding property
of the circle (see Figure 27.10) is immediately obvious, either by an appeal
to symmetry, or, equivalently, by proving the triangles L'P'O, L'Q'O
congruent. Thus, since the properties with which we are concerned remain
invariant under the linear transformation Τ (under which the image of P'
is P, etc.) the result is also true for the ellipse (see Figure 27.11).
Notice that no appeal can be made to symmetry in the figure for the
ellipse, but since the final result is couched in terms of tangents, diameters,
and parallel lines only, the results for ellipse and circle correspond
completely.
In solving problems on the ellipse it is sometimes necessary to obtain the
equations of particular lines, notably the tangent and normal. The quickest
579

ELLIPSE AND HYPERBOLA [27
way to arrive at these equations is to use calculus, although alternative
methods are available.
Fig. 27.10
*Ex. 16. Show that, if
then
6V + sy = aV,
Deduce that the equation of the tangent at the point (лгь y^ on the ellipse is
■-X\_ У-Ух _
а2уг -b*xi
and show that this may be rewritten in the form
xxi УУ1 _ j
*Ex. 17. Show that the equation of the tangent to the ellipse at the point
(a cos Θ, b sin Θ) is
: 1,
χ cos 0 ysma
~V~ + ~b~
and that the equation of the normal is
axsind-bycosd = (α2-ό2)ο
*Ex. 18. Show that the line L, with equation
cuts the ellipse
/ m
Ьгхг+агуг = a2b2

at points with parameters λ1; λ2 which are roots of the quadratic equation
Deduce that the point Piixi, уг) is the mid-point of the chord L if
and that this gives the equation of the chord with mid-point Px as
*-*i = У-У\ =
Show how to deduce from this the equation of the tangent at the point
-P2U2, уг) of the ellipse.
The ellipse, like the parabola, has a rich geometry. We shall now prove
two of its most famous properties. (Since both are focal properties, neither
is derivable from the circle.)
(i) Focal distance property
For any point Ρ on the ellipse with foci S, S' and major axis la,
SP + S'P = 2a.
Take the ellipse in the standard form
b2x2 + a2y2 = a2b2
and let M, M' be the feet of the perpendiculars from Ρ on to the two
directrices (see Figure 27.12).
У
[ S' О
Ρ
s J
Μ
Let P(h, k) be any point on the ellipse; then, by the focus-directrix
definition of the ellipse
SP+S'P = e(PM+M'P)

ELLIPSE AND HYPERBOLA [27
*Ex. 19. Prove that SQ + S'Q < la for all points Q within the ellipse and that
SR + S'R > la for all points R outside the ellipse.
*Ex. 20. Prove a converse of the above result, namely that if S, S' are fixed
points and Ρ is any point such that SP+S'P is constant and greater than SS',
then the locus of Ρ is an ellipse with S, S' as foci. (If the constant distance is
taken as la, show that a unique ellipse exists with S, S' as foci and major axis
of length la; then use Ex. 19.)
*Ex. 21. If a, b are complex numbers such that \a—b\ < c, where с is a real
positive constant, describe the set of points
{zeC: \z-a\ + \z-b\ < c)
in the Argand diagram.
Ex. 11. Explain the theory underlying the following well-known mechanical
construction for an ellipse. Two drawing pins are stuck in a sheet of paper and a
loop of cotton is placed loosely around the two pins. The loop is made taut by
the point of a pencil which is then made to trace out a curve on the paper,
keeping the loop taut at all stages of the construction.
(ii) Reflection property
The tangent and normal at any point Ρ of an ellipse bisect the angle SPS'.
Take the ellipse in standard form and let Ρ be the point
{a cos Θ, b sin Θ).
With the notation of Figure 27.13, the equation of the normal, PG, is
(see Ex. 17)
ax sin Θ — by cos (9 = (a2—b2) cos θ sin Θ,
У
---
is' о
^p
7V
G Si
Μ
Τ >
Fig. 27.13
and thus ΰ is the paint ([(a2_ft2) cos ,]/a> 0).
Using b2 = a\\ — e2), this reduces to
(ae2 cos Θ, 0).
582

Thus S'G = ae + ae2 cos Θ, GS = ae2 cos θ
, GS^ _ ae-ae2 cos θ _ aje-a cos θ _ PA£ _ PS
an S'G ~ ae + ae2 cos θ ~ aje + a cos θ ~ M'P ~ PS'
and so, by the angle bisector theorem for a triangle, PG bisects LSPS'
internally. Since LGPTis a right angle, it follows immediately that PTbisects
£SPS' externally.
Ex. 23. ABC is any triangle and Ρ is any point on the external bisector of the
angle ВАС. Prove that
BP+CP^ BA + CA,
with equality only if Ρ coincides with A.
Example 2. Prove that the chord of contact of tangents from the point
C(h, k) to the ellipse 2 2
is the line -5-+7T = 1·
Deduce that, if the tangents at the extremities of a variable chord through
the fixed point D{<x, β) meet at R, then R lies on the straight line
αχ /S> _
a2 + b2 ~
1.
From Ex. 16, we know that the equation of the tangent to the ellipse
at the point Ρχ{χν jj is
a2 + b2 ~ l-
Now suppose that Ρ^, jj, P2(x2, Уг)are the points of contact of tangents
from C(h, k) to the ellipse. Then the coordinates (h, k) satisfy the equations
and we have ^+^=1 and Щ+^ = 1.
α' οΔ α ο*
Thus PjOj, yx) and P2(x2, у2) lie on the line
hx ky= j
a2 b2 '
which must represent the required chord of contact.
If Ζ»(α, /?) lies on this chord of contact,
Ы k£_
a2 + b2 ~

ELLIPSE AND HYPERBOLA [27
for all positions of the point (h, k). Thus (h, k) lies on the line
Example 3. The normal at the point Ρ to the ellipse Σ with equation
b2x2 + a2y2 = а2Ьг
meets the χ axis at Η and the у axis at K, and Q is the fourth vertex of the
rectangle OHQK. Prove that the locus of Q is a concentric similar ellipse Σ',
but with the minor axis ο/Σ' lying along the major axis ο/Σ.
[Locus questions are usually best attempted using the parametric form
(acos Θ, b sin Θ): the coordinates of the point whose locus is sought are
then obtained in terms of the parameter Θ, which may be eliminated by
using the identity , . , . , .
6 J cos2 Θ+ sin2 θ = 1
to yield the equation of the locus.]
Let Ρ be the point . . , . ..
{a cos Θ, b sin Θ).
r
\ ·
к
1
=».
/
tl
h
•)
Q
Then the normal atP has equation ax sin Θ — by cos θ = (a2—b2) cos θ sin θ
(see Ex. 17). Thus we may write down the coordinates of the points
Я (у = 0) and Κ (χ = 0) and hence of Q (see Figure 27.14):
„ /a*-b* a A
a2-b2 .
b S1
ta2-b2
Κ: (θ, ■

If we call the coordinates of Q(x, y) we have
and the equation of the locus of Q is
a2*8 by
which, by comparison with the equation
A2 + B* L'
is seen to be an ellipse, Σ', with
a ' b
The centre of Σ' is the origin and Α.Β = b:a; the ellipses Σ and Σ' are
thus concentric and similar. Furthermore, if a > b, then A < В and
the minor axis of Σ' lies along the χ axis, that is, the major axis of Σ.
(Strictly speaking, the first part of our solution is incomplete until we
have verified that every point of Σ' is a point of the locus. The reader may
care to supply the details.)
Exercise 27(a)
1. Find the eccentricities of the following ellipses:
(i) |+^2 = 1; (ϋ) ^+Уг = Ι"; («Ο 3^+5^ = 1.
2. Draw a rough sketch of the ellipse
Find its eccentricity, the coordinates of its foci and the equations of the
corresponding directrices.
3. Show that the equation
3(x-2)2+4(y+l)2 = 36
represents an ellipse and find its eccentricity and the coordinates of its foci.
4. Showthattheequation χ2 + 2γ2+6χ_4γ + 9 = Q
represents an ellipse and find its eccentricity and the coordinates of its foci.
585

ELLIPSE AND HYPERBOLA [27
5. Show that the equation
Ах2 + Ъу2-\6х+\2у+\6 = 0
represents an ellipse and find its eccentricity and the coordinates of its foci.
6. The arch of an elliptic bridge is in the form of half of an ellipse. The span is
12-5 m and the maximum height of the arch is 1-75 m. Find the eccentricity of the
ellipse.
7. Find the equation of the tangent to the ellipse
at the point (3, - 2).
8. Find the equation of the tangent to the ellipse
4-^ =
at the point (1, 2).
Prove that the line x-y = 3 also touches the ellipse.
9. Prove that the line χ-Ъу-П = О
touches the ellipse 2л:2 + 3у2 = 14,
and find the point of contact.
Where does the line ^x—y + 7 = 0
meet the ellipse. 3x2 + 2y2 =14?
10. An ellipse has foci S, S', minor axis BB' and major axis of length 2a. Prove
that BSB'S' is a rhombus of side a.
11. Ρ is a point on the ellipse
6¥+аУ = a2b2
and TV is the foot of the perpendicular from Ρ to AA', where A is the point
(a, 0) and A' the point (-a, 0). Prove that
PN2 _ b2
A'N.NA ~ a2*
If the tangent at Ρ meets the directrix corresponding to the focus S at T, prove
that LPST is a right angle.
12. The feet of the perpendiculars from the foci S, S' of the ellipse
b*xs+a2y* = a2b2
to the tangent at the point Ρ are Υ, Υ'. Prove that Υ, Υ' lie on the auxiliary
circle of the ellipse and that , _
13. Ρ is a variable point of an ellipse, focus S. Prove that the locus of the
midpoint of PS is an ellipse and locate its centre.

14. The tangents at the points P, Q of an ellipse, centre O, meet at the point
T. Prove that ОТ bisects PQ.
15. The parallel chords Ρλ Qu P2 Q2 of an ellipse are bisected by the diameter UV.
Prove that Λ Q2 and P2 Qx meet on UV, as also do ΡλΡ2 and βα β2.
16. С/У is a diameter of an ellipse and Ρ is any point on the ellipse. Prove that
the diameters parallel to PU, PVais conjugate.
17. With the notation of Question 16, the tangent at U meets PV at Τ and the
tangent at Ρ meets TU at M. Prove that Μ is the mid-point of TU.
18. The diameter UV of an ellipse bisects the chord PQ. UP and VQ meet at
X, UQ and VP meet at У; prove that XY and ^Q are parallel.
19. What is the locus of the mid-points of chords of an ellipse which pass through
a common point?
20. UVis a diameter of an ellipse and the tangents at U, Fare u, ν respectively;
the tangent at any point Ρ meets u, υ at Xand Y. Prove that Xand У lie on
conjugate diameters of the ellipse.
21. Prove that tangents at the extremities of a focal chord of an ellipse meet on
the corresponding directrix.
22. The tangent to the ellipse 2 2
at the point Ρ meets the axes at Q and R. Find the locus of the mid-point of QR.
23. The perpendicular from the centre of an ellipse with focus S to the tangent
at a point Ρ meets SP produced at Q. Prove that the locus of β is a circle, and
find its centre and radius.
24. Define geometrically the eccentric angle φ of a point Ρ on the ellipse
-У^ =1 (a > b)
a2 b2
and express the coordinates of Ρ in terms of φ.
Prove that the equation of the normal at Ρ is
О is the centre of the ellipse and QP is the ordinate of P, the normal at Ρ cuts
the χ axis at TV. Show that
NQ = — cos φ.
If the normal at Ρ bisects the angle OPQ prove that the eccentricity e satisfies
the equation ^(1 + sin^) = 1,
that OP = ae, and that , „ _
iV2 < e < 1. (O & Q
587

ELLIPSE AND HYPERBOLA [27
25. S, Sf are the foci of the ellipse
аг b2
and Pia cos φ, b sin φ) is a point on the curve. Calculate the lengths PS, PS' in
terms of a, e and φ, where e is the eccentricity of the ellipse and verify that
PS+PS' is constant.
The tangents to the ellipse at Ρ cuts the χ axis at T, and the normal at Ρ cuts
the χ axis at N. Prove that
(i) ОТ. ON = a2e2; (ii) PT/PN = tan ф/(1 - e2)
where О is the centre of the ellipse. (O & Q
26. Show that the coordinates of any point Ρ on the ellipse
can be expressed as (a cos φ, b sin φ). Prove that the equation of the tangent to
Ρ and Q are two points on the ellipse, such that φ has the value & at Ρ and φ
has the value фг at β. If the tangents to the ellipse at Ρ and Q meet on the line
ay = олг, prove that , , , . . /л_ . _.
Ф1 + Ф2 = in or ψ. (O&C)
27. Prove that, ifa2P + b2m2 = n2, then the line
lx+my + n = 0
touches the ellipse —+ — = 1
and find the coordinates of the point of contact.
Find the equations of the common tangents to the two ellipses
28. If Ζ is the foot of the perpendicular from the centre of the ellipse
b2x2 + a2y2 = a2b2
to the tangent at a variable point P, prove that the locus of Ζ is the curve
(x2+y2)2 = b2y* + a2x*.
3. THE HYPERBOLA
A hyperbola is a conic with eccentricity e > 1. The analysis of the
hyperbola follows closely that for the ellipse and there are many striking
similarities between the geometry of the two curves. To obtain the standard
form of the equation of the hyperbola we first derive certain geometrical

3]
HYPERBOLA
results, just as we did for the ellipse in Section 2. Suppose that S is the
given focus and / the corresponding directrix and let К be the foot of the
perpendicular from S on to /. There are two points, A, A', on the line
SK which belong to the hyperbola—namely, the points dividing SK
internally and externally in the ratio e:l. Since, for the hyperbola, e > 1,
/ will lie between A and A' (see Figure 27.15, and compare with Figure
27.5). Again we take О as the mid-point of AA' and set OS = s, OK = k.
Then, if A A' = 2a,
SA --
К А --
a-k,
A'S =
A'K =
a + k,
and thus, by the definition of A, A',
= e(a + k).
Solving these two equations we have
-a/e.
(Compare these results with the corresponding results obtained for the
ellipse in Section 2.)
Following closely the corresponding analysis for the ellipse, we take О
as the origin, OK as the χ axis, and the perpendicular to this line through О
as the у axis. Then, exactly as for the ellipse, Sis the point (ae, 0) and / is
the line x — a/e = 0. If P{x,y) is any point of the hyperbola, by the focus-
directrix definition we have
SP2 = ίΡΡΜ2,
where Μ is the foot of the perpendicular from Ρ on to /.
Thus, the hyperbola is the set of points
{(x,y):(x-aef+f = ^(x-a/ef}.

ELLIPSE AND HYPERBOLA
The equation may be written
x2-2aex+a2e2+y2 = e2x2-2aex + a2
=> x\e2-\)-y2 = a\e2-\)
=> t t_ = l
a2 a2(e2-l) '
Since e2 > 1, a\e2-\) > 0 and we may write
b2 = a2(e2-l).
у
Μ
к
/л/
\s
/
JP
(Notice that this is the first slight point of departure from the
corresponding analysis fortheellipse. Notice also that, forahyperbola, b can be greater
than a.) With this notation the equation may be written as
which is called the standard form of equation for the hyperbola.
*Ex. 24. Show that the hyperbola is symmetrical about both the χ and у axes
and deduce by an appeal to symmetry, the existence of a second focus S'(—ae, 0)
and second corresponding directrix.
*Ex. 25. Prove that all chords through the centre, O, are bisected at О and that
the length of the latus rectum (see Ex. 6) is 2b2/a.
The form of the complete curve is shown in Figure 27.17. Notice that,
if P(x, y) lies on the hyperbola,
590

3] HYPERBOLA
and thus no part of the curve lies within the interval — a < χ < a. It
follows that the points B(0, b) and B'(0, -b) do not lie on the curve in
contrast to the corresponding points of the ellipse. A {a, 0) and A'{ — a, 0)
are called the vertices of the hyperbola; AA' is the transverse axis of the
hyperbola, while BB' is the conjugate axis (see Ex. 26).
Fig. 27.17
The equation of the hyperbola may be written in the form
(H)(->
that is
As χ, j> both tend tooo, (--r) tends to zero and the equation of the curve
approximates to
This line is thus an asymptote to the hyperbola; by a similar argument so
also is the line
*Ex. 26. Show that the equations of the asymptotes may be written in the form
а* Ьг
and prove that the acute angle between them is 2 arctan (b/a).

ELLIPSE AND HYPERBOLA [27
*Ex. 27. Sketch in the same diagram the two hyperbolas
-,-ζ= 1 and -,-ζ= -1
a2 b2 a2 b2
and mark in the points A(a, 0), A'(- a, 0), 5(0, b), B'(0, - b). Two hyperbolas of
this form are said to be conjugate; show that conjugate hyperbolas have the same
asymptotes.
A rectangular hyperbola was defined in Chapter 22 as the curve which,
with a suitable choice of axes, has equation
To justify the use of the word hyperbola in this definition, that is, to
show that such a curve has the required focus-directrix property, consider
the linear transformation, T, of the plane into itself defined by the equations
It is not difficult to see that Γ preserves distance; for, if the matrix of
Τ is A, then
V2 V2
\ V2 V2/
fcos( — $π)
(,sin(-i7r)
-sin(-M|
cos (-in))
which represents a rotation through an angle -\n (see p. 243, vol. 1).
It follows that, under T, the image of a curve С has precisely the same
appearance and geometrical properties as the curve С itself, but is rotated
through an angle of —\v. But, since
y = j2(X'+y%
we see that any point of the set
{(x,y):xy = c2}
maps into a point of the set
{(х,уУ.х'*-у'* = 2с*}.
592

3] HYPERBOLA
This latter set clearly represents a hyperbola, with a2 = b2 = 2c2 and
asymptotes , „
Thus, the equation xy = c2
represents a hyperbola with perpendicular asymptotes: we may therefore
redefine a rectangular hyperbola as a hyperbola with perpendicular
asymptotes.
Ex. 28. Prove that the eccentricity of a rectangular hyperbola is V2 and that the
latus rectum is equal to the distance between the vertices of the curve. (A
rectangular hyperbola is sometimes called an equilateral hyperbola. All rectangular
hyperbolas are similar to one another.)
Ex. 29. Show that the foci of the rectangular hyperbola
are the points (± cJ2, ± сц/2) and find the equation of the corresponding
directrices.
Ex. 30. A parametric form for a point on the curve
xy = i<z2
is (aty/l, a/(tJ2)); by considering the linear transformation T, deduce that the
parametric form for a point on the curve
is (a(l/t+t)/2, a(l/t—t)/2). Examine how this point moves along the curve as t
varies between —oo and +oo; in particular, explain what happens as t approaches
the value zero from below and from above.
Ex. 31. Show that χ = a sec θ, у = btand gives a parametric representation
for all points of the hyperbola
6V-es/ = Μ*
and explain how P(a sec Θ, b tan Θ) moves on the hyperbola as θ varies from 0
to 2π. In particular, what happens as θ approaches the values %π, ftf from below
and from above?
Ex. 32. The circle. x* + y2 = а2
is called the auxiliary circle of the hyperbola
(see Ex. 12). Sketch in the same diagram a hyperbola and its auxiliary circle. Ρ is
a point of the hyperbola with positive coordinates, TV is the foot of the
perpendicular from Ρ to the transverse axis and NQ is a tangent to the auxiliary circle
at the point Q with positive coordinates. The angle NOQ = Θ. Show that Q
is the point (a cos Θ, a sin Θ) and that Ρ is the point (a sec в, b tan Θ).
593

ELLIPSE AND HYPERBOLA
Ex. 33. Show that the equation of the tangent to the hyperbola
at the point P(a sec Θ, b tan Θ) has equation
χ sec θ у tan θ _
Corresponding to the property SP+S'P = 2a for points of an ellipse
we have the following property for the hyperbola.
Focal distance property
If Ρ is any point on a hyperbola, foci S, S', then \SP—S'P\ = la.
Take the hyperbola in the standard form
UV
-azf = aW
and let M, M' be the feet of the perpendiculars from Ρ on to the two
directrices (see Figure 27.18). Let Ρ be the point (h, k); then, by the definition
of the hyperbola sp = ePM, S>P = ePM
and \SP-S'P\ = e\MM'\ = 2a.
It is not difficult to see that SP-S'P = -la if Ρ lies oi
hyperbola enclosing S, and that SP-S'P = +2a if Ρ 1:
branch.
the branch of the
ss on the opposite
*Ex. 34. Prove that, if S, S' are fixed points and Ρ moves in such a way that
SP-S'P is constant, then the locus of Ρ is a branch of a hyperbola.

3] HYPERBOLA
*Ex. 35. Interpret the conditions \SP-S'P\ < 2a and \SP-S'P\ > 2a
geometrically.
Ex. 36. Devise a mechanical construction for a branch of a hyperbola with given
foci and length of transverse axis, based upon the relation SP— S'P = 2a.
*Ex. 37. By adapting the method used for the ellipse, prove that the tangent
and normal at the point Ρ οι a hyperbola with foci S, S' bisect the angle SPS'.
Hyperbolic orbits arise under the action of forces of repulsion varying
inversely as the square of the distance from the centre of force, such as arise
in the case of electrically charged particles. They also arise under
attractive forces, such as gravity, when the energy content of the orbit is too
great for it to be an ellipse, such as, for example, comets which orbit the
sun only once before retreating into outer space. (Comets such as Halley's
comet, which reappear, obviously follow elliptic orbits: the orbits are
usually highly eccentric, that is, e χ 1.) The property \SP-S'P\ = 2a is
used in range-finding. If a gun at Ρ is fired and the times taken for the sound
to reach two listening posts, S and S', recorded, \SP—S'P\ = 2a may be
determined and Ρ lies on the unique hyperbola with foci S, S' and
transverse axis 2a. If the same experiment is conducted from S and a further
listening post S", a second hyperbola is obtained and Ρ lies at the
intersection of the two hyperbolas. A more sophisticated application of the
same idea is employed in navigation. [See the article 'Sound Ranging' in
No. 195 of the Mathematical Gazette (July 1928) by W. Hope-Jones.]
The ellipse and hyperbola have very similar geometrical properties—
most of the properties peculiar to the hyperbola are associated with its
asymptotes. Both the ellipse and hyperbola possess a centre, that is, a
point at which all chords are bisected and, for this reason, they are called
central conies; the parabola is not a central conic. If we are concerned with
a property common to all central conies, we may take the equation of a
typical conic in the form 2 „ 2 _ 1
(where α, β are not both negative).
Example 4. Prove that the line
у = mx+c
touches the central conic
xx2 + £y2=1 (a,/? + 0)
if and only if c2 = (a+/?m2)/(a/?).
Prove that perpendicular tangents to a central conic meet, in general, on a
circle, the director circle of the conic, and point out what exceptional cases
may arise.
595

ELLIPSE AND HYPERBOLA [27
The given line and conic meet in points with χ coordinates given by
*χ2+β(τηχ + ό)2 = 1,
that is, by (a+/3m2) x2 + Ifimcx + (fie2 -1) = 0.
The line у = mx + c touches the conic αχ2+βγ2 = 1
о fi2m2c2 = (a+/?m2) (/?c2-l) (the condition for double roots of the
quadratic equation above)
о a/?c2 = χ+βτη2
о с2 = (χ+βηι2)Ι(ΰφ), since α,/? + 0.
Now let у = тх+сЪеа tangent through the point (h, k). Then we have
к = mh+c
and thus, using the condition proved above for the line to be a tangent,
^(к-mh)2 = *+βτη2,
which reduces to m2(ocfih2-/3)-2oLfihkm+(oLfik2-oL) = 0. This is a
quadratic equation in m (showing that two tangents can, in general, be
drawn through the point (h, k)). Let the roots be тъ т2: then
mim2 = Wk2-cc)IWh2-p).
Thus, (h, k) lies on perpendicular tangents if
afiV-jl = -Wk*-a),
that is, if h2 + k2 = -+\,
and the locus of (h, k) is thus
χ2 + ^ = ϊ+-β·
This is a circle provided (l/a) + (l//?) > 0. This is certainly always true
for an ellipse. For a rectangular hyperbola, (l/a) + (l//?) = 0 and the
director circle reduces to a point (the origin); for any other hyperbola, the circle
exists provided (1/a) > —(1//?), that is, in the usual notation, provided
a2 > b2: a hyperbola in which the conjugate axis is greater than the
transverse axis has no director circle. Since the angle between the
asymptotes is 2 arctan (b/a), a hyperbola has no director circle if the angle
between those parts of the asymptotes containing the curve is greater
than \ti.
Ex. 38. An elliptic lamina moves in a plane in such a way that it touches each
of two fixed perpendicular lines. What is the locus of its centre?
596

HYPERBOLA
Exercise 27(b)
1. Find the eccentricities of the following hyperbolas:
(i) ix*-b* = 1; (ii) 2x2-y2 = 1; (Ш) (x-l)2-20>-2)2 = 1.
2. Sketch roughly the hyperbola
i*2-ib2=l
and find its eccentricity, the coordinates of its foci and the equations of the
corresponding directrices.
3. Sketch roughly the hyperbola
find its eccentricity, the coordinates of its foci and the equations of the
corresponding directrices.
4. Show that the equation , . „ r
x2-4y2 + 6x + 8y + l = 0
represents a hyperbola, and find its eccentricity, the coordinates of its foci and
the equations of the corresponding directrices.
5. Find the equations of the asymptotes of the hyperbolas
(i) 9x2-4y2 = 12; (ii) 3x2-y* = 1.
6. Find the equations of the asymptotes of the hyperbolas
(i) x2-4y2 + 2x+8y-5 = 0; (ii) 2x2-4y2-8y-5 = 0.
7. Find the equation of the hyperbola with asymptotes
2x-y = 0, 2x+y = 0,
which passes through the point (2, 3).
8. Find the equation of the hyperbola with asymptotes
3x-2y-9 = 0, 3x + 2y+3 = 0,
which passes through the point (2, —4).
9. Find the equation of the tangent to the hyperbola 2л:2- Ъу2 = 6 at the point
(3, —2) and prove that the line x + y+1 = 0 also touches the curve.
10. Find the equation of the normal to the hyperbola 3x2—y2 = 2 at the point
(1,1) and find the χ coordinate of the point where it meets the curve again.
11. Find the equation of the tangent to the rectangular hyperbola
x2-y2 = 3
which passes through the point (3, — 3).
12. Prove that a hyperbola intersects a line parallel to one of its asymptotes at
just one point.
13. Find the equation of the rectangular hyperbola which has the points (4, 0),
(-4,0) as foci.
14. If the eccentricity of a hyperbola is e, prove that the eccentricity of the
conjugate hyperbola is e/V(e2-l).

ELLIPSE AND HYPERBOLA [27
15. Prove the results of Exercise 27(a) 11 for the hyperbola
b2x2-a2y2 = alb\
16. Prove the result of Exercise 27(a) 12 for the hyperbola
Ъ2х2-Фу2 = a2b2.
17. If Ρ is a variable point on a hyperbola with vertex A, show that the locus of
the mid-point of PA is another hyperbola and find its centre and eccentricity.
18. Find the equation of the hyperbola (x2/a2) - (y2jb2) = 1 when the origin is
changed to the point whose coordinates are (h, k), and the new axes are parallel
to the original axes.
The coordinates of a point are given by
χ = 2 cos2 0/(2 cos2 Θ-1), у = 2 tan 20.
Show that the locus of the point is a hyperbola, and find the coordinates of the
centre.
Find the equation of the tangent to the curve at the point θ = \v. (O & Q
19. Obtain the equations of the tangent and normal to the hyperbola
4хг-у2 = 36
at the point Д5, 8).
The tangent at Ρ meets the у axis at Q and the normal at Ρ meets the χ axis
at R. Prove that the area of the triangle PQR is 145 square units.
Obtain the coordinates of the point S such that PQRS is a rectangle. (O & Q
20. Prove that the normal to the hyperbola
α2"^2 = l
at the point (a sec φ, b tan φ) is given by the equation
ax sin φ + by = (α2 + ό2) tan φ.
The normal at a variable point Ρ on the hyperbola meets the axes at Хгпа Y;
the mid-point of XY is Z. Prove that, if О is the centre of the hyperbola:
(i) the length of OZ is not less than (a2 + b2)/2a;
(ii) OZ is inversely proportional to the perpendicular distance from О to the
tangent at P;
(iii) if a = b, then Ζ always coincides with P. (O & Q
21. The tangent at the point Ρ of the rectangular hyperbola
xy = c2
meets the asymptotes at Q, R. Prove that PQ = RP.
By considering a linear transformation Τ with matrix of the form
show that the result above is true also for the hyperbola
Ь2х*-а2у2 = aV.
598

3] HYPERBOLA
Repeat the above process to prove the following more general result: if a chord
PQ of a hyperbola meets the asymptotes at P', Q', then PP' = QQ'.
Suggest a further generalization of this result by considering a second
rectangular hyperbola xy = d2 in place of the asymptotes.
4. POLAR EQUATIONS OF CONICS
Given coordinate axes Ox, Oy, the position of a point is uniquely
determined by its Cartesian coordinates (x, y). However, other systems of
coordinates are available; in Figure 27.19 the position of the point Ρ is
determined if, given the point О and the line OX, we know the length
r = OP and the angle θ = ХОР. {r, θ) are called the polar coordinates
of Ρ relative to the origin or pole О and the initial line OX. We adopt the
usual convention that positive values of θ are measured in the
anticlockwise sense.
The polar coordinates of a point are not uniquely determined. For
example, consider the point Ρ with Cartesian coordinates (1, — ^3) (see
Figure 27.20). Taking О as origin and Ox as initial line, the polar
coordinates of Ρ can be taken in any of the alternative forms
(2,- fr), (%br), (-2, in), etc.
If unique polar coordinates are desired, we define the principal polar
coordinates of a point to be the ones for which
r > 0, -π < θ < тт.
Generally, however, we allow r to take both positive and negative values.
In the same way that a set of points in a plane could be defined by an
equation or inequality involving Cartesian coordinates, sets may be
defined using polar coordinates. For example, the set
{ir, в): г
-a)

ELLIPSE AND HYPERBOLA [27
represents a circle, centre О and radius a; the set
{(r, Θ): r cos θ = a)
represents a straight line (traversed twice) perpendicular to the initial line
and at a distance a from O; the set
{{r,0):a<r<b}
represents the annulus defined by two concentric circles of radii a and b.
Ex. 39. Describe in words and draw sketches of the following sets of points:
(i) {(r, ff):rsme = a}; (ii) {(r, θ):θ=ο};
(iii) {(r, Θ): r = a sin α cosec (β-a)}.
{Hint: rewrite the defining equation and think of the Sine Rule.)
(iv) {(r, 0):a<r < b, 0 < θ < Щ; (ν) {(r, Θ): r sec θ < 2a}.
Ex. 40. Find the polar equation of:
(i) the line passing through the points with polar coordinates (1, 0) and
(1, Ϊ");
(ii) the circle through the points with polar coordinates (2, in) and touching
the initial line at the origin;
(iii) the circle of radius 1 with centre at the point with polar coordinates
(V2, iw).
Using the relation „ . a
χ = r cos О, у = r sin a
it is a simple matter to rewrite the defining equation of a set of points in
the plane in terms of polar coordinates. For example, the ellipse Г may
be written in the alternative forms
Г = {(x,y):bV+aY = azb2}
= {(r, Θ): r\b* cos2 θ+α2 sin2 Θ) = azb2}.
However, it may well happen that a curve is best expressed in polar form
without recourse to Cartesian coordinates. Suppose, for example, that
we are given a conic of eccentricity e and semi-latus rectum /. The focus-
directrix definition suggests that it would be reasonable to take the origin,
O, at a focus and the initial line along the major axis.
In Figure 27.21, О is the focus of the conic, MN is the corresponding
directrix and OL = /is the semi-latus rectum.
Then, since Qp = ePM>
we have, r = ePM.
But PM = LN-r cos θ
= Ije—r cos Θ,
giving us r{\ + e cos Θ) = I.
600

4] POLAR EQUATIONS OF CONICS
This equation is the standard form for the equation of a conic in polar
coordinates with the origin at a focus and the initial line along the major
axis.
Ex. 41. Sketch the parabola r{\ +cos Θ) = 2.
Ex. 42. Show that the equation r(2 + cos Θ) = 2 represents an ellipse and sketch
the curve.
^^
L
\p
/v\
0 J
N
Μ
X
Fig. 27.21
The polar form for a conic is of great value in dealing with properties
of focal chords. We conclude this section with an example illustrating its
use in this context.
Example 5. PQ, UV are perpendicular focal chords of a rectangular
hyperbola, focus S. Prove that:
(i) \PS.SQ\ = \ US.SV\, (ii) PQ = UV.
Taking S as origin and the major axis SX as initial line, the polar
equation of the rectangular hyperbola is
<1+V2cos0)= /.
Let P, Q, U, V have polar coordinates
(гъв), (г„ <?+*), (r„ <?+**), (rtoe+\ri).
Then, from the equation of the conic we have
r1= /(1+V2COS0)"1,
r2= /(1-V2 cos θ)~\
r3 = /(l-V2sin0)-\
rx = /(1 + 42 sin θ)~\
601

ELLIPSE AND HYPERBOLA [27
(Notice that θ cannot be an odd multiple of \π if the chords PQ, UV are
to exist.) Taking sense along each chord into account, this gives
PS =/(l+V2 cos ff)~\
SQ = /(1-V2COS0)-1,
US = /(l-^sintf)-1,
SV= /(l+^sintf)-1.
(i) PS. SQ = /2(1 - 2 cos2 Θ)-1 = - /2 sec 2Θ,
US. SV = /2(1 - 2 sin2 Θ)-1 = I2 sec 2Θ.
Λ PS.SQ+US. SV=0.
(ϋ)
and sir
nilarly
PQ = PS+SQ
I
1+V2cos0' 1
2/
l-2cos2<9'
\PQ\ = \2l sec 2Θ\,
\UV\ = |2/sec2(9|.
/
-V2cos^
Ex. 43. .Ρβ is a focal chord through the focus S of a conic of semi-latus rectum
/. Prove that

4] POLAR EQUATIONS OF CONICS
Ex. 44. PP', QQ' are perpendicular focal chords of a conic; prove that
PP' QQ'
is constant.
Ex. 45. Show that the polar equation of the directrix corresponding to the focus
Show also that a polar equation of the form
l/r = acos0+osin<?
represents a straight line.
Deduce (with the help of Exercise 27(a), question 11 (second part)), that
the equation
-= cos(0-a) + ecos6>
is the equation of the tangent to the standard conic at the point with vectorial
angle a.
Prove that the tangents at the ends of a focal chord of a conic meet on the
corresponding directrix.
5. SECTIONS OF A CONE
This section is included for the attention of readers interested in the
historical development of the geometry of the conies.
The conies, or conic sections, were first studied, as their name suggests,
as sections of a right circular cone. Their history extends back to the time
of Ancient Greece: they were first extensively studied by Apollonius
of Perga (247-205 B.C.) who wrote a treatise on their properties. The focus-
directrix property, which we have made the basis of our definition, was
not discovered until later.
The various types of conic arise as sections of a cone made by planes
making various angles with the axis of the cone. For the purposes of the
following definitions, the cone is taken to extend infinitely in both
directions from the vertex. (See Figure 27.23.)
First observe that any plane perpendicular to the axis of the cone (and
not through the vertex) cuts the cone in a circle, which may therefore be
regarded as a particular type of conic.
If the plane is oblique, not parallel to one of the generating lines of the
cone and cuts only one half of the cone, the resulting section is an ellipse
(Figure 27.24).
If the plane is parallel to one of the generating lines of the cone, the
resulting section is a parabola (Figure 27.25).
If the plane is oblique and cuts both halves of the cone, but does not
603

ELLIPSE AND HYPERBOLA [27
pass through the vertex of the cone, the section is a hyperbola (Figure
27.26).
Ex. 46. Show how a pair of straight lines arises as a conic section.
Fig. 27.25
Fig. 27.26
The connection between the conic section and the focus-directrix
definitions is exhibited in Exs. 47-52. The notation refers to Figure 27.27
(for convenience of drawing, we take an elliptic section and show only one
half of the cone).
604

5] SECTIONS OF A CONE
V is the vertex of a right circular cone with its axis vertical and we
consider the section of the cone by the plane Π. A sphere may be drawn to
touch the plane Π at S and also to touch the cone in a circle lying in a
horizontal plane Π'. The vertical plane containing V and S cuts Π and
the cone at A and A' and Π' and the cone at С and C". /is the line of
intersection of Π and Π' and A'A meets latK.P is any point on the cone lying
in the plane Π and VP touches the sphere atP'. Μ is the point of / such that
Ρ Μ and AK are parallel, NPQ is a horizontal section, N lying on A A' and
Q lying on VA.
Fig. 27.27
Ex. 47. Show that PNKM is a rectangle.
Ex. 48. Show that SP = QC and PM = NK.
Ex. 49. Show that QC/NK = AC/AK.
Ex. 50. Deduce that SP/PM is fixed for a given plane Π, and locate a focus and
its corresponding directrix for the resulting conic section.
Ex. 51. Show how the parabolic cross-section arises.
Ex. 52. Show that, both for elliptic and for hyperbolic sections, a second sphere
may be drawn to touch the cone and the plane Π, giving rise to a second focus
and a second directrix.
Miscellaneous Exercise 27
1. Show that the equation
хг + 2/ -2x +12^+8 = 0
represents an ellipse, and find its eccentricity and the coordinates of its centre.
Prove that „
3x-2y+2 = 0
is a tangent to the ellipse and find the point of contact.
605

ELLIPSE AND HYPERBOLA [27
2. The tangents drawn from a point P(xu yd to the circle x2+y2 = r2 touch the
circle at L, M. Show that the equation of LM is ххг+ууг = r2.
If Ρ moves on a hyperbola of eccentricity e with its centre at the origin, show
that LM touches a hyperbola of eccentricity e, where
e2 e2
Draw a figure for the case when e < J2 and the circle touches the given
hyperbola. (London)
3. Show that the equation of the tangent at P(a cos a, b sin a) to the ellipse
ftV+flV = fl№
is bx cos a+ ay sin a = a6.
The tangent at Ρ cuts the χ and .y axes at A and 5 respectively and the normal
at Ρ cuts the χ and д' axes at С and £> respectively. Find the ratio PC.PD.
If AD and ЯС meet at E, prove that Я£ is perpendicular to AD and hence, or
otherwise, find the equation of the circle through А, В, Е. (London)
. Prove that the conies
X*l_
a^b* «""" cP+λ b* + X
-- 1
have the same foci, whatever value A takes (provided Α Φ - φ or - b").
Find the equation of the rectangular hyperbola whose foci coincide with those
of the ellipse b2x2 + d>y2 = <fb2.
Show further that two ellipses which have the same foci cannot intersect in
real points but that if an ellipse and a hyperbola have the same foci, they
intersect orthogonally at four real points.
5. Ρ is any point on a rectangular hyperbola, centre О and foci S, S'. Prove that
OP2 = SP.S'P.
6. Ρ is the point (a cos Θ, b sin Θ) of the ellipse
and Q(a cos Θ, a sin Θ) is the corresponding point of the auxiliary circle. Prove
that the perpendicular distance from S to the tangent at Q to the circle is equal to
SP.
7. Write down the equation of
(i) an ellipse which has its minor axis along the у axis and touches the χ axis;
(ii) an ellipse which has its minor axis along the χ axis and touches the у axis.
Two such ellipses are given. Assuming that they meet in four (real) points,
write down the equation of any conic through these four points, and prove that
its centre lies on a certain rectangular hyperbola. (O & C)
8. A tangent to the ellipse , „ „ , , , ., „
b2x2 + a2y2 = <fb2
(a, b > 0) meets the ellipse
at the points P, Q. Prove that the tangents to the second ellipse at Ρ and β
meet on its director circle.
ab(a+b)

MISCELLANEOUS EXERCISE 27
9. Two hyperbolas, S and S', have the equations
a· w
-- A,
where A < 1. Prove that the tangent to S at any point Ρ meets S' in two points
Q and R which are equidistant from P.
If A = -1, prove that the tangents to S' at Q and Λ meet on S at i*, the
reflection of Ρ in the origin. (O & C)
10. The asymptotes of the hyperbola
cfi b*
(ii) PX.PY =
(Ш) py.py' = ""Г7™· (°&Q
are / and /', and Ρ is a point on the hyperbola. The perpendicular from Ρ to I
meets / and /' at X and Y, and the perpendicular from Ρ to /' meets /' and / at
X' and Y'. By expressing the coordinates of Ρ in parametric form, or otherwise,
prove that, for all positions of P,
®PX-PX' = ^->
" a*-b*;
11. Show that, if the point (хг + r cos Θ, yx + r sin в) lies on the central conic
ax2 + by2 = 1, then r satisfies the quadratic equation
r\a cos2 θ+b sin2 Θ) + 2^-*! cos (9+6л sin ff) + ax\ + by\ - 1 = 0.
Deduce that, if (xj, >ί) is the mid-point of the chord PQ, then PQ has gradient
-axjbyt.
Prove that the locus of mid-points of parallel chords of a central conic is a
diameter of the conic
12. Use the analysis of Question 11 to prove Newton's Theorem for a central
conic: if PQ, RS are two chords intersecting at X, then the ratio
PX.XQ
RX.XS
depends only upon the directions of the chords PQ and RS, and not on their
positions.
13. Given the outline of an ellipse, show how you would construct the centre,
the axes, the foci and the directrices.
14. Prove that, if tangents are drawn to a hyperbola from any point of the
conjugate hyperbola, their chord of contact touches the opposite branch of
the conjugate hyperbola and is bisected by it.

ELLIPSE AND HYPERBOLA [27
15. Find the equation of the perpendicular bisector of the line joining the points
(xu yu, (*2, yd-
A fixed circle has centre С and radius 2a. A is a fixed point inside the circle and
Ρ is a variable point on the circumference. Prove that the perpendicular bisector
of AP touches the ellipse whose foci are at С and A, and whose major axis is of
length 2a. (C.S.)
16. A point Ρ is taken at random inside an ellipse of eccentricity e. Calculate
the probability (in terms of e) that the sum of the focal distances of Ρ should be
not greater than the distance from a focus to the opposite end of the major axis.
(C.S.)
(Note: the area of an ellipse with major and minor semi-axes of lengths a and b
respectively is nab—a result easily deduced by the usual calculus methods, or
alternatively by considering the effect on areas of the linear transformation Τ of
Section 2 of this Chapter.)
17. Σ! is a circle, centre O, radius b, Σ2 is a circle, centre A radius a(a<b) which
touches Σ! internally. Describe the locus of the centre, P, of a variable circle, Σ,
which touches Σ! internally and Σ2 externally.
If Σ! is a straight line, Σ2 is a circle, centre A and radius a which touches Σ!
and Ρ the centre of a variable circle Σ which has Σ! as a tangent and touches Σ2
externally, describe the locus of P.
18. Prove that at most four normals may be drawn from a point A to a central
conic with centre O.
Prove further that, if the normals at the points Pu P2, P3, P4 on a central conic
intersect at A, then P^PaPi lie on a rectangular hyperbola which passes through
О and A and has its asymptotes parallel to the axes of the given conic. (The
rectangular hyperbola of this question is known as the hyperbola of Apollonius.)
19. Prove that the vector equation (referred to the vertex as origin) of the tangent
to the parabola уг = Aax at the point ρ = at4+2at] is г = p+Au where u is
the unit vector (ii+j)/(l+i2)*.
Prove that the locus of the meets of tangents to the parabola уг = Aax which
cut at a fixed angle α is a hyperbola of eccentricity sec a.
20. Prove that, if the chord PQ of any conic with focus S, when produced, cuts
the corresponding directrix at R, then SR is a bisector of the angle PSQ. Deduce
that, if the tangent at Ρ cuts the directrix at T, then the angle PST is a right angle.
21. If X is any point on the tangent at the point Ρ of any conic with focus S,
and if Η, Κ are respectively the feet of the perpendiculars from X to SP and the
directrix corresponding to S, prove that SH = eXK, where e is the eccentricity.
608

28. Further matrices
Throughout this chapter it is to be assumed, unless explicitly stated
otherwise, that the matrix A of a linear transformation Τ is referred to the base
vectors i, j {or i, j, к in three dimensions).
1. EIGENVALUES AND EIGENVECTORS
FOR 2x2 MATRICES
Let us write the matrix of the linear transformation, T: R2 -» R2 as A
where
Μ ьл
U ν*
Whatever form T, and therefore A, has, the origin О is mapped into itself.
In general, no other point remains fixed under Τ but, if there is a point P,
distinct from O, such that T(P) = P, then all points of the line OP are
mapped into themselves.
Ex. 1. Prove the assertion that if Τ maps a point Ρ (other than the origin) into
itself, then it maps every point of the line OP into itself.
Ex. 2. Prove that, if Τ maps two points, Ρ and Q, into themselves, where О, Р, Q
are distinct and not collinear, then Τ is the identity transformation, that is, the
transformation with matrix
We may now investigate the answer to the question: Even if Τ does not
map any point other than the origin into itself, is there a line / through the
origin such that every point of / is mapped into a point of /? If such a line
exists, we say that / is an invariant line and that / maps into itself under T.
Ex. 3. If S is a point such that T(S) e OS, prove that every point of the line OS
is mapped into a point of OS.
Suppose that S is a point other than the origin, with the property that
T(S) e OS; write OS = s. Then As = As and thus
(A-AI)s = 0.
Since s Φ 0, we must have det (A —AI) = 0 (see p. 294). This is a quadratic
equation in A, the characteristic equation of the matrix A, with, in general,
609

FURTHER MATRICES [28
two distinct roots, λ1 and A2, called the eigenvalues of the matrix A. Any
non-zero solution, s^, of the homogeneous equation
(A-A1I)s = 0
is called an eigenvector corresponding to the eigenvalue λχ; similarly, we
can find eigenvectors corresponding to the eigenvalue A2.
Example 1. Find the eigenvalues of the matrix
Ά
■С
and determine their corresponding unit eigenvectors.
The characteristic equation
2 \_n
*(Y
о А2-9А + 8=0
о A = 1 or A = 8.
Consider first the eigenvalue A = 1: if Sj is any corresponding
eigenvector then, writing . ,
we have (A-I)s! = 0,
6
that is
Э©-©·
On solving these equations we obtain χ = 2μ, у = — 3μ, giving as an
eigenvector
In particular, a unit eigenvector is given by
2/V13\
-3/V13J
All the points of the line г = к&,
that is, all points of the line 3x+1y = 0,
map into points of the same line (in fact, in this case, into themselves
since A = 1): 3x + 2y = 0 is an invariant line under the transformation.
Now consider the eigenvalue A = 8: if s2 is any corresponding eigen-
vector we have (A-8I)s2 = 0,
610

1]
(1 -W-Q-
giving s2 = (^ .
In particular, a unit eigenvector is given by
е2 = (ад-
All points of the line г = к2е2,
that is, of the line 2x-y = 0,
2x2 MATRICES
map into points of the same line (but, apart from O, not this time into
themselves): 2x—у = 0 is a second invariant line under the
transformation.
We shall now analyse the linear transformation represented by the
matrix A of Example 1 in greater detail. The matrix
formed by taking as its columns the eigenvectors
s1 = (_2) and s2=Q
(these eigenvectors being chosen for their simplicity, although any
nonzero multiples of S! and s2 would do just as well) has an important property.
Since . , . 0
Α% = Si and As2 = 8s2
it follows that the product AP is a 2 χ 2 matrix with s^ as its first column
and 8s2 as its second column. Thus
/1 0\
>(l I
But, since det Ρ = 7, Ρ is non-singular and we may multiply both sides
of the above equation on the left by P_1 to obtain
The matrix A has been reduced to diagonal-form. Notice that the
elements of the diagonal are precisely the eigenvalues of A.

FURTHER MATRICES
[28
The linear transformation T, with matrix A relative to i, j as base vectors,
maps the point with position vector as!+/?s2 where s^, s2 are defined above,
into the point with position vector as!+8/?s2; it follows that, if we take
Si, s2 as base vectors, the matrix of Τ is
as required.
Thus, when considering the transformation T, it is simpler to express
vectors in terms of the eigenvectors sb s2 rather than in terms of the more
usual base vectors i, j. For example, the point Q, where
q = 2^ + ^,
maps into the point Q', where
q' = 2s!+4s2.
Figure 28.1 shows the effect of the transformation Τ upon Q:
OQ = 2^ + ie, = OP + PQ,
OQ' = 2s!+4s2 = OP'+P'Q',
where OP'Q'P is a parallelogram.
Fig. 28.1
612

1] 2x2 MATRICES
Ex. 4. Calculate the eigenvalues and corresponding unit eigenvectors of the
matrix
■UD·
Hence describe geometrically the linear transformation Τ which has matrix В
relative to the base vectors i, j.
Reverting to Example 1, the reduction to diagonal form enables us to
calculate powers of A. For example,
/1 0\
\0 83)
= (P-iAP) (P"1AP) (P-1AP)
= P-1A(PP"1)A(PP-1)AP
= p-iA3P
■*GJ·)-
More generally we have
7\-3 2Д0 512ДЗ
_ 1 / 2 512\ /2 -1\
71-3 1024J \3 2J
_ 1/1540 1022\
7^3066 2051 j
_ /220 146\
Ц38 293 j'
*-H 2 l)(l °W2
7^-3 2J l0 8"j ^3
-и.
4+3.8й -2+2.8η
-6+6.8" 3+4.8"J
Ex. 5. Verify by direct calculation the form given for A™ above and also prove
the result by mathematical induction.
*Ex. 6. If st and s2 are eigenvectors corresponding to the distinct real
eigenvalues Яь λ2 of a matrix A and if Ρ is the matrix with first column Sj and second
column s2, and assuming that Ρ is non-singular, show that
■Gi)·
Eigenvalues need not be real. For example, if
■U !)

FURTHER MATRICES [28
the characteristic equation of A is
(A-l)2+l =0
giving eigenvalues λ1 = 1 + j, A2 = 1 — j, with corresponding eigenvectors
(9· (-1
In this case, the invariant lines are imaginary.
Of the linear transformations of the plane into itself, those which
preserve distance are of particular importance—that is, linear transformations
Τ with the property that, if T(P) = PT,f then OP = OPT for all points Ρ
of the plane. In Theorem 28.1 we shall show that the matrix A of such a
transformation is of a special type; to this end we make the following
definition (which holds for η χ η matrices, although we are considering here
only 2x2 matrices): A is called an orthogonal matrix if AA' = I.
*Ex. 7. Show that, if A is an orthogonal matrix then det A = ± 1, A is non-
singular, and A-1 = A'.
Ex. 8. In the course of the proof of Theorem 28.1 we shall require the fact that
the transpose of a product AB of two matrices is the product of the transposes of
A and В in reverse order; that is
(AB)' = B'A'.
Prove this result
(i) when A is 2x2 and В is 2x1;
(ii) when A is 2 χ 2 and В is 2 χ 2.
Show how your proofs may be generalized to cover all cases of conformable
matrices А, В up to order 3 χ 3 (or m χ η if you can manage it).
Theorem 28.1. Let Τ be a linear transformation of the plane into itself
with matrix A. Then Τ is distance preserving {that is, |OP| = ΙΟΡ^Ι for all
positions of the point P) if and only if A is orthogonal.
First observe that, if OP = г = xi+yj, then
r'r = (*J0Q =(х2+У2У,
that is, the 1 χ 1 matrix r'r represents OP2.
Now suppose that Ρ is any point of the plane, that OP = r, and that
OPy = rT.
Then ГуГг = (Ar)' (Ar) = r'A'Ar (by Ex. 8)
and thus r' r _ r'r
о r'(A'A) r = r'r
■^ r'(A'A) r = rlr
or'(A'A-I)r = 0.
t We use the notation Рт rather than P' here to avoid possible confusion with the
notation for the transposed matrix.
614

1] 2x2 MATRICES
(i) Suppose A is orthgonal.
Then A'A = I and thus ГуГг = r'r for all points Ρ of the plane and Τ is
distance preserving.
(ii) Suppose Τ is distance preserving
Then r'(A'A-I)r = 0 for all position vectors г and, in particular, for
Г! = i, r2 = j and r3 = i+j. Writing
we thus have
Thus bx =
But (A'A)'
<">G80-©^-*
<"«(!) =©-'+™·=°·
bt = 0 and b2 = —b3 = b say, and we
B-(_J ») .nd A'A-(_
= A'A and therefore
and 6 = 0. This shows that A'A = I and thus A ii
have
1 Μ
b l)'
i orthogonal.
*Ex. 9. Prove that an orthogonal matrix Ρ preserves separations, in the sense
that , „ . . .
|Px-Py| = |x-y|.
*Ex. 10. Prove that, if Ρ is a 2 χ 2 orthogonal matrix, then either
о a rotation through an angle Θ, or
sin*\
-COS0/
(COS
sin
corresponding to a rotation through — θ followed by a reflection in the χ axis
(that is, a reflection in the axis у = χ tan $θ).
*Ex. 11. Prove that the eigenvalues of a 2 χ 2 orthogonal matrix Ρ are
(i) conjugate complex numbers of modulus 1 if det Ρ = +1, or
(ii) the numbers ± 1 if det Ρ = - 1.

FURTHER MATRICES [28
The reduction of a matrix A to diagonal form P_1AP is of particular
importance if A is a symmetric matrix (A' = A). We shall show (Theorem
28.5) that, in this case, the reducing matrix Ρ may be taken to be orthogonal.
Before proving this, it is necessary to obtain some simple preliminary
results. In the following four theorems, A is a 2 χ 2 matrix with eigenvalues
Ab A2 and corresponding eigenvectors sl5 s2 and Ρ is the matrix with first
column Si and second column s2.
Theorem 28.2. If the eigenvalues of the 2x2 matrix A are distinct then the
matrix Ρ is non-singular.
Proof det Ρ = 0
=> Si = &s2, where к is a non-zero number,
=> A^-Msa = 0, on multiplying by A,
=> Ад —A&s2 = 0
=> (At — Ag)^ = 0, since ks2 = s^,
=> λ1 = A2, since Si φ 0.
Thus, since det Ρ = 0 => At = A2, we have At Φ λ2 => det Ρ φ 0.
Theorem 28.3. //^ and s^ are unit eigenvectors of A then s^ and s2 are
perpendicular if and only ifP is orthogonal.
Proof
write * = (;;), S2 = (;:);
" Vi У J
and pt = (*1 *) fa *») = ( *+* *х;+у?>).
\X2 у J \л yj ил+лл χΙ+Α ι
(i) Suppose Ρ is orthogonal then
/ x\+y\ Xlx2+yiy2\ = /1 0\
и*2+лл 4+A 1 \o ι/
giving Xi+yt = xl+yl = 1, ΧχΧι+УхУг = 0 and thus ^ and s2 are
perpendicular unit vectors.
(ii) If Si and s2 are perpendicular unit vectors then
P'P = / *?+■>* *А+ЙЛ]
-Π
and Ρ is orthogonal.
616
then Ρ =

1] 2x2 MATRICES
Theorem 28.4. If к is a 2x2 real symmetric matrix Φ kl then its
eigenvalues are real and unequal.
Proof. Write . ,4
Then det(A-AI) = 0,
о (a-A)(c-A)-Z>2 = 0
oA2-(a+c)A + (ac-62) = 0.
But the discriminant of this quadratic equation is
(a + cf-Щас-Ъ*)
= (a-c)2+4b2
> 0, since, if b — 0, а Φ с.
Thus, At and A2 are real and distinct.
Theorem 28.5. 77ге reduction to diagonal form P^AP/oj- a symmetric matrix
Афй may always be effected, and Ρ may be taken to be orthogonal.
Proof Notice first that, since st and s2 are non-zero,
st perpendicular to s2osis2 = 0.
Now suppose that st and s2 are unit eigenvectors of A.
ASi = A^
=> SgASi = A1S2S1, on premultiplying by s2,
<*- (s^Asl)' = A^sisg), on taking the transpose of each side,
о s[As2 = A1sis2, by Ex. 8 and noting that A' = A,
о A2sis2 = A1sis2, since As2 = A2s2,
o(A1-A1)sis2 = 0
<*- sis2 = 0, since At Φ λ2 by Theorem 28.4.
Thus, Si and s2 are perpendicular and, by Theorem 28.3, Ρ is orthogonal.
Furthermore, by Theorem 28.2, P_1 exists and the theorem is complete.
(Notice that orthogonal matrices are always non-singular.)
■G3-
find an orthogonal matrix Ρ such that
and interpret Ρ, Ρ' as matrices of a rotation transformation.

FURTHER MATRICES [28
Ex. 13. A is a symmetric matrix corresponding to the linear transformation T.
ВУ writing A = PDP',
where Ρ is orthogonal and D is diagonal, give a geometrical interpretation of T.
Illustrate your answer with the particular matrix A of Ex. 12.
Exercise 28(a)
1. Find the eigenvalues of the matrix
■еэ
and the corresponding unit eigenvectors. What lines in the plane map into
themselves under the linear transformation Τ with matrix A ?
and the corresponding unit eigenvectors. What lines in the plane map into
themselves under the linear transformation Τ with matrix A ?
3. Find the eigenvalues of the symmetric matrix
«
and hence find an orthogonal matrix Ρ such that P_1AP is diagonal.
4. Find an orthogonal matrix Ρ such that the matrix P_1AP is diagonal, where A
is the symmetric matrix . _ .
(-1 5Γ
Give a geometrical interpretation of the linear transformation Γ with matrix A.
5. Give a geometrical description of the linear transformation Τ with matrix A,
where /2 2\
6. Give a geometrical description of the linear transformation Γ with matrix A,
where
■e -:)·
7. The 2x2 matrix A has the property that
A2 + I = 0.
Prove that the eigenvalues of A are ±j.
8. A matrix A which has the property
A2 = A
is said to be idempotent.
618

1] 2x2 MATRICES
If A is a 2 χ 2 idempotent matrix, prove that its eigenvalues are either 0 or 1
and that, if Α Φ I, then A is singular.
Show that the matrix
G3
is idempotent and give a geometric interpretation of the linear transformation
for which A is the matrix.
9. Find An in the following cases:
10. A is a 2 χ 2 matrix and the eigenvalues of the matrix A-1 are ± j. Prove
that det A = 2.
Is the converse of this result true?
11. a, b, c, d are unequal non-negative real numbers such that
a+b = c+d = 1.
Prove that the eigenvalues of the matrix
la b\
are 1 and λ, where 0 < |λ| < 1.
Find λ for the matrix
СЭ
and describe geometrically the linear transformation with matrix An where η
is a large positive integer.
12. Find the eigenvalues of the matrix
A ~ ll4 -20J
/11.2n+1-21 -33.2"+3:
\ 7.2n+1-14 -21.2- + 2:
and deduce that /(, ,„„ „, „ r+33\
;n+22J·
= An + An-1+A"-2
find an explicit form for B.
What are the eigenvalues of В ?
2. EIGENVALUES AND EIGENVECTORS
FOR 3x3 MATRICES
The results we have proved for eigenvalues and eigenvectors of 2x2
matrices hold in large part for 3 χ 3 matrices too. As before, we define the
619

FURTHER MATRICES [28
eigenvalues of the 3 χ 3 matrix A to be the roots A1; A2, A3 of the character-
isticequation det (A - AI) = 0,
and, corresponding to each Аг-, any non-zero vector sf such that
Авг = Агвг
is called an eigenvector of A.
Geometrically, each real A,· gives rise to a real яг which defines a line
through the origin, the points of which all map into points of the same line
under the linear transformation for which A is the matrix. In contrast to
the two-dimensional case, since a real cubic must possess at least one real
root, there must be at least one such line.
Ex. 14. Show that, if T(S) = S where S is not the origin, then Г maps each point
of the line OS into itself.
Ex. 15. U, V, W are distinct non-collinear points in space with T(U) = U,
T(V)= V, T(W) = W. Prove that, if the plane UVW does not contain the origin,
then Τ is the identity transformation.
As in two dimensions, if the 3 χ 3 matrix A has three distinct eigenvalues
\, λ2> λ3 with corresponding eigenvectors Sj, s2, s3 and if the matrix P,
with first column slt second column s2, and third column s3 is non-singular,
then P_1AP is a diagonal matrix. For
Asi = ^ι18!» As2 = A2s2, As3 = A3s3
AP =
Mi 0 0\
Ρ I 0 A2 0 I
\0 0 Aj
Mi 0 0\
= I 0 A2 0 J, i
\0 0 A3/
г Р is non-singular.
If such a diagonalizing procedure exists, A is said to be reducible
to a diagonal matrix. Not all 3 x 3 matrices are reducible to a diagonal
matrix; the discussion of necessary conditions upon A for it to be so and
the possibility of reducing a matrix A which does not satisfy these
conditions to a form (the Jordan canonical form) which approximates to a
diagonal matrix is beyond the scope of this book: the interested reader
should consult one of the more advanced algebra texts mentioned in the
bibliography in Volume 1. (See, however, Exercise 28(6), Question 25.)
*Ex. 16. Given two 3x3 matrices A and B, if a non-singular matrix Ρ exists
such that в = Р-АР,
В is said to be similar to A.
620

2] 3x3 MATRICES
Prove that:
(i) A is similar to A;
(ii) if В is similar to A, then A is similar to B;
(iii) if В is similar to A and С is similar to B, then С is similar to A.
Example 2. Find the eigenvalues and corresponding unit eigenvectors of the
matrix
/10 OX
= (10 -7 10
\ 7 -5 8/
and interpret the results geometrically.
Find a matrix Ρ such that P"1AP is diagonal.
The characteristic equation of A is
Ι 1-λ 0 0 I
10 -7-Λ 10 =
I 7 -5 8-Λ|
that is (1-λ)(λ2-λ-6) = 0,
giving the three eigenvalues λ = 1, λ = — 2, λ =
(i) λ = 1
The equations
10x-8j>+10z = 0,
7x-5y+7z = 0,
have solutions χ = 1, у = 0, ζ = —1 and the corresponding unit
eigenvector is
V-1/V2/
All points of the line
map into points of the same line (in fact, in this case, into themselves, s
λ=1).
(ϋ)λ=-2
The equations „ _ _
Kk-5j>+10z = 0,
7x-5y+10z = 0,

FURTHER MATRICES [28
have solution χ = 0,y = 2,z = 1 and the corresponding unit eigenvector
= (2/V5).
\1/V5/
All points of the line - = У = -
0 2 1
map into points of the same line (but not, apart from the origin, into
themselves).
(iii) λ = 3
The equations — 2x = 0,
10x-10j>+10z = 0,
7x- 5y+ 5z = 0,
have solution x = 0,y=l,z=l and the corresponding unit eigenvector
All points of the line
map into points of the same lim
The matrix
P =
has the required property, that
P1AP =
3=(l/V2).
\1/V2/
c =l = z-
) 1 1
/1 0 0\
0 -2 0 .
\0 0 3/
Exactly as in the two-dimensional case (see Example 1) the geometrical
interpretation of the linear transformation T, which has matrix A, relative
to i, j, к as base vectors, is facilitated by expressing the position vector of
a point in terms of s^ s2 and s3. For the point Q, with position vector
q = а^ + ^г + ГЯз,
maps into the point Q' with position vector
q' = a&i-2fis2 + 3ysz.
622

2] 3x3 MATRICES
Since the coefficient of s^ is unchanged, the displacement QQ' is seen to
be parallel to the plane determined by s2 and s3.
Ex. 17. Verify that, in the notation of Example 2, P_1AP is a diagonal matrix.
Pursuing the analogy with 2x2 matrices, we define a 3 χ 3 orthogonal
matrix Ρ as a matrix which has the property that PP' = I.
*Ex. 18. If Ρ is a 3x3 orthogonal matrix, prove that Ρ is non-singular,
det Ρ = ± 1 and P"1 = P'.
*Ex. 19. Follow through the steps of the proof of Theorem 28.1 to show that a
3x3 matrix is distance preserving if and only if it is orthogonal.
Theorem 28.6. If Ρ is a 3x3 orthogonal matrix with detV = + 1, then 1 is
an eigenvalue of P.
Proof
SinCe (P-I)P' = I-P'=-(P-I)',
we have det(P-I)detP' = det (-(P-I)').
But det P' = det Ρ = 1, and also
det (-(P-I)') = det (-(P-I)) = - det (P-I)
(since P —Iisa3x3 matrix)
and thus det (P-1) = - det (P-I)
=> det (P-I) = 0
=> 1 is an eigenvalue of P.
Corollary. If J? is a 3x3 orthogonal matrix with det Ρ = — 1, then — 1 is an
eigenvalue ofP.
This follows immediately on writing —Ρ for P, since det (—P) = +1.
*Ex. 20. In the case det Ρ = +1 (Ρ orthogonal), let s be an eigenvector
corresponding to the eigenvalue λ = 1. The linear transformation Γ of which Ρ is the
matrix:
(i) preserves distances between all pairs of points;
(ii) leaves all points of the line OS fixed.
Show that Γ may be interpreted geometrically as a rotation about the axis OS.
*Ex. 21. Show that, if det Ρ = — 1 (Ρ orthogonal), then the corresponding linear
transformation may be interpreted as a rotation about the axis denned by the
eigenvalue λ = — 1 followed by a reflection in the origin.

FURTHER MATRICES
Ex. 22. Show that the matrix
(cos θ -sin θ 0\
sin θ cos θ 0)
0 0 1/
is orthogonal and represents a rotation about the ζ axis through a positive
angle Θ.
Ex. 23. Show that the columns of a 3 χ 3 orthogonal matrix Ρ represent three
mutually perpendicular unit vectors and interpret this result geometrically.
In summary, if Ρ is a 3 χ 3 orthogonal matrix representing the linear
transformation T, then the columns of Ρ are the components of three
mutually perpendicular unit vectors and Τ represents either a pure
rotation, or a rotation followed by a reflection in the origin.
Ex. 24. If
α ι :i)·
prove that Ρ is orthogonal and interpret Ρ as the matrix of a linear transformation.
*Ex. 25. If u is a unit vector, show that it is possible to construct an orthogonal
matrix with u as its first column.
*Ex. 26. If P, Q are two 3x3 orthogonal matrices, prove that:
(i) P_1 is orthogonal; (ii) PQ is orthogonal.
Interpret both these results geometrically.
Ex. 27. Find a result corresponding to Theorem 28.6 for 4 x 4 orthogonal
matrices. (Be careful when you take determinants!)
We saw earlier (Theorems 28.4 and 28.5) that the eigenvalues of a 2 χ 2
symmetric matrix Афй were always real and distinct and this enabled us
to construct an orthogonal matrix Ρ such that P_1AP was diagonal. 3x3
symmetric matrices are not quite so amenable: although it can be shown
that their eigenvalues are always real, they are not necessarily distinct.
Nevertheless, the diagonalization process is still always possible as we shall
now show.
Theorem 28.7. Given any 3x3 symmetric matrix A it is always possible to
find an orthogonal matrix Ρ such that P_1AP is diagonal.
Proof. We reduce the problem to the 2 χ 2 case. First, if λι is a real
eigenvalue (such an eigenvalue must exist, since the characteristic equation is a
cubic) with Si the corresponding unit eigenvector, then Asx = A1s1 and thus
AP =
Ρ ο Α βλ,
\0 7i yj

2] 3x3 MATRICES
where Ρ is an orthogonal matrix with sx as its first column and at... γ2 are
constants (see Ex. 25). Premultiplying by P"1 we then have
IK -ι «Λ
P-!AP= 0 β1 βΛ.
\o η yJ
But (P-iAP)' = P'A'CP"1)' = P'A(P')' = Р~1АР (see Ex. 8), since
A' = A and Ρ' = Ρ-1. It follows that P_1AP is a symmetric matrix:
Mi 0 0\
P1AP = (θ Α βλ.
\0 βг УJ
Now, by Theorem 28.5, there exists an orthogonal matrix
e: *;)
which reduces the symmetric matrix
U yJ
to diagonal form. Thus the matrix
/1 0 0\
Q= о 9l 9J
\0 rx rj
reduces P_1AP to diagonal form:
Mi 0 0\
Q-ip-lAPQ = / 0 μ% Ο .
\0 0 fj
Finally, we may write PQ = R where, by Ex. 26 (ii), R is orthogonal.
Ex. 28. Is the converse result true that, if Ρ is orthogonal and P"1AP is diagonal,
then A is a symmetric matrix ?
The determination of an inverse matrix is generally a burdensome
operation, but, if Ρ is orthogonal, we have
p-i = F
and thus the diagonalization process for a symmetric matrix is particularly
simple to effect.
625

FURTHER MATRICES [28
3. THE CAYLEY-HAMILTON THEOREM AND
POWERS OF MATRICES
A matrix A is said to satisfy the equation
α0λη + α1λη~1 + α2λη~2+... + αη-1λ+αη = 0,
if a0An + % A""1 + a2An~2 +... + ап_г\ + аи1 = 0.
Theorem 28.8. {The Cayley-Hamilton Theorem.) Every square matrix A
satisfies its own characteristic equation.
Proof. (We prove the result for a 3 χ 3 matrix A, but the proof clearly
generalizes very readily to the и х и case.)
If det (A-AI) = α0+α1λ + α2λ2-λ3, we have to show that
αοΙ + ύ^Α + α^-Α3 = 0.
Now each element of the matrix adj (A-AI), being a cofactor of
(A —AI), is a polynomial of degree at most two in λ. Thus
adj (Α-ΛΙ) = C0+Ad + A2C2,
where C0, d, C2 are 3 χ 3 matrices whose elements do not contain λ. But
(A-AI) adj (A-AI) = I det (A-AI) (Theorem 14.2)
and thus (A-AI) (Co+Ad + A2C2) s Ι(α0+αιλ + α2λ2-λ3).
Comparing coefficients of powers of A in this identity
AC0 = a0I, (1)
Ad-C0 = aj, (2)
AC2-d = αΛ (3)
-C,=-I. (4)
Multiplying (2) by A, (3) by A2, and (4) by A3 and adding this gives
0 = αοΙ + ^Α + α^-Α3
and the proof is complete.
*Ex. 29. Verify the Cayley-Hamilton theorem for 2x2 matrices by direct
substitution.
Ex. 30. If
\ 0 0 -3/
find A3 by applying the Cayley-Hamilton theorem.
626

3] CAYLEY-HAMILTON THEOREM
Example 3. If
A-(2 ~з)·
./ЫА8.
The characteristic equation of A is
А2-4Л + 5 = 0
and thus by the Cayley-Hamilton theorem
A2-4A+5I = 0.
But λ8 ξ (Λ2 - 4λ + 5)/(λ) + αλ + β,
where α, β are integers. Since Λ = 2+j => λ2-4λ + 5 = 0 we have
(2+j)8 = a(2+j)+/?.
Using the Binomial Theorem and equating real and imaginary parts,
|28-28.26 + 70.24-28.22+l = 2a+/?
l8.27-56.2B+56.23-8.2 =a
Π6[16-112 + 70-7]+1 = 2a+/?
* ll6[64-112 + 28-1] =a
(a =-336
* I/? = 145.
Thus λ8 = (λ2 - 4λ + 5)/(λ) - 336λ +145
and A8 = -336A+145I, since A2-4A+5I = 0,
_ /-336 336\ /145 0\
1-672 -1008J + 1 0 145 j
/ — 191 33(
1-672 -863;
Alternatively, one can 'build up' A8:
A2 = 4A-5I, as before;
A4 = 16A2 —40A+25I, using the commutative rule,
= 16(4A-5I)-40A+25I
= 24A-55I;
A8 = 242A2-48.55A+552I
= 242(4A-5I)-48.55A+552I
= -48.7A-5(576-605)I
= -336A+145I,
and proceed as before.
627
!36\
S63J'

FURTHER MATRICES [28
Notice that, by multiplying the Cayley-Hamilton result by Ar we
establish a recurrence relation between successive powers of A :
Ar+2 = 4A-+1-5A7.
Exercise 28(b)
1. Find the eigenvalues of the matrix
and determine corresponding unit eigenvectors.
2. Find the eigenvalues of the matrix
/1 -2 IX
A= 3 -4 1 .
\3 -7 4/
What does the fact that zero is one of the eigenvalues tell us about A?
Find a matrix Ρ such that P_1AP = D, where D is a diagonal matrix.
3. Find the eigenvalues of the matrix
/3 1 -2V
A = 14 0 -21
\4 -1 -1/
and hence write down the value of det A.
Find A-1 and determine the eigenvalues of A-1.
Suggest and prove a general result about the eigenvalues of an inverse matrix.
4. Find the eigenvalues of the matrix
/1 2 -2V
A = 6 4 -6
\6 5 -7/
and corresponding unit eigenvectors. Find a matrix Ρ such that Ρ"*ΑΡ = D
where D is a diagonal matrix.
Show that, if A is the matrix of the linear transformation T, then Γ maps all
points of the plane 2x+y-2z = 0 into points of the same plane. Find the
equations of the other two planes through the origin which have this property.
5. Find the eigenvalues of the symmetric matrix
/-1 -6 -4\
A = H-6 -2 -2
\-4 -2 3/
and hence find an orthogonal matrix Ρ such that P_1AP = D, where D is
diagonal.
628

EXERCISE 28
6. Find the equations of the planes through the origin, which map into
themselves under the linear transformation Τ with matrix
*e ■ э-
7. Ρ is a 2 χ 2 orthogonal matrix with det Ρ = 1. Show that the eigenvalues of
Ρ are eje and e->°, and interpret Θ.
Show that, although the geometrical property of a 2x2 orthogonal matrix Ρ
is characterized by its eigenvalues if det Ρ = +1, this is no longer true if
det Ρ =-1.
8. The trace of the matrix A, written tr A, is defined to be the sum of elements of
A in the leading diagonal (top left to bottom right). If A is a 3 χ 3 matrix with
eigenvalues Ab λ2, λ3, prove that
tr A = λ^λ, + λ^
Prove also that tr (A + B) = tr A + tr В and that tr (M) = /* tr (A).
9. A and В are both 2x2 matrices. Prove that A and В have the same eigenvalues
if and only if both det A = det В and tr A = tr B. (See Question 8 for definition
of trace.)
10. Prove that, if A is a non-singular 3x3 matrix, then
det(A-AI) = detA-AdetAtr(A-1) + A2tr Α-λ3.
11. If А, В have a common eigenvector s, with corresponding eigenvalues λ, /t,
prove that s is an eigenvector of (i) A + B; (ii) AB. What are the corresponding
12. A is a 3 χ 3 skew-symmetric matrix (A = -A'). Prove that
det(A-AI) =- det(A + AI)
and deduce that if A possesses a non-zero eigenvalue a, then - a is also an
eigenvalue.
If ДА) = det (A2 - λΐ), prove that - /(λ)/λ is the square of a linear
polynomial in λ.
13. If В = Р_1АР and s is an eigenvector of A prove that p-"s is an eigenvector
ofB.
14. If λ is an eigenvalue of the 3 χ 3 matrix A prove that λη is an eigenvalue of
An. Deduce that, if ж
Ar = a^A' + a-iA + aol,
15. A matrix A is said to be nilpotent if there exists a positive integer η such that
An = 0. Prove that, if A is nilpotent, then all the eigenvalues are zero.
16. Find A4 + A2 +1 if
■и -Э-

FURTHER MATRICES
-(ΰ :?)·
find An, where и is a positive integer.
18. If
■U
findA3 + 3A2+12A.
19. Show that the characteristic equation of the matrix
is λ3-11λ2+15λ-1 = 0.
Deduce that A is non-singular and that
A"1 = A2-11A+15I.
L2+ 15A-1 =
that
A2-11A+15
show that A3 = (5A -1) (A -1)
ind A"1 = (A -31) (A -21).
Deduce explicit forms for A3 and A-1.
21. The matrix A is denned by
с; *
Show that A3 - ЗА2 -16A - 161 = 21 - A
and express (21-A)-1 as a quadratic polynomial in A.
22. Show that 1, 2, 3 are the eigenvalues of the matrix
(17 -16 8\
10 -8 61.
-10 11 -3/
Hence, or otherwise, find an explicit form for An.
Write down the inverse matrix of A.
23. The 2x2 matrix A has equal eigenvalues λ. Prove that, for и > 2
A" = иА»-1А-(и-1)А"1.
Prove further that, if λ Φ 0, this formula is true for all integral n.
630

EXERCISE 28
24. If S is a 3 χ 3 skew-symmetric matrix, prove that its eigenvalues are 0 and
±ja, α real.
If Ρ is a 3 χ 3 orthogonal matrix and Ρ +1 is non-singular, prove that
S = (Ρ-ΙΧΡ + Ι)"1
is skew-symmetric, and that, if λ is an eigenvalue of S, then
A = Ρ-Ι-λ(Ρ+Ι)
is singular.
The eigenvalues of Ρ are 1, eie, s~ie; what are the corresponding eigenvalues
ofS?
25. The 3x3 matrix A has characteristic equation
(Л-ЛОЧА-Я.) = 0,
where Ab λ2 are real and At Φ λ2.
Show geometrically that, if the equations
Ax = Atx
have a plane of solutions, then it is possible to form a matrix Ρ such that
(At 0 0\
0 At 0 1.
0 0 V
If A is the matrix of the linear transformation T, and if / is the invariant line
through the origin corresponding to the eigenvalue λ2, show that every plane
containing / is mapped into itself by T.
If
(-4 -10 30\
-3 -5 18
-2 -4 13/
show that rieaves all the points of a certain plane containing the origin fixed and
find the equation of this plane. Show also that every plane containing the line
- =y- = -
5 3 2
is mapped into itself by T.
Find a diagonalizing matrix Ρ for this particular matrix A.
26. The linear transformation Τ has matrix A where A is non-singular with an
eigenvector d = OD. Ρ is a given point not on OD and Ί\Ρ) = Q. Show that
Τ maps PQ into itself if PQ is parallel to OD. Show further that, if this is the
case, then the linear transformation S with matrix A-I maps the line PQ into
the line OD.
9-2 631

FURTHER MATRICES [28
27. Ρ is a non-singular 3x3 matrix, ω is a cube root of unity and r is an integer.
Prove that, if
,.r 0 OX
1 0 ω^1 Ο Ρ
\0 0 ω-"/
Prove conversely, that if Ω is a 3 χ 3 matrix, with real determinant, having the
property
(ω' Ο 0 \
0 W+1 0 J Ϊ
0 0 ω'+2/
for some matrix Ρ and integer r.
Prove further that, if Ω Φ I, then the matrix
Ι+Ω + Ω2
is singular.
28. Show that the linear transformation Τ with matrix
■(:! i Э
leaves just two lines through the origin invariant and find their equations.
Explain geometrically why it is impossible to find a matrix Ρ such that the
matrix p_1Ap
is diagonal.

29. Further coordinate geometry
1. PARAMETRIC FORMS FOR
PLANE CURVES
We have seen in Chapters 22 and 27 that the coordinates of points on
certain plane curves may be given in terms of parameters. Indeed, we could
define, for example, the parabola as the set of all points Ρ with position
veCtOTS r = at4+2ati
relative to some origin O, where t is a scalar parameter. More generally a
plane curve is completely specified if we are given a parametric representa-
tion r=f(t)i+gm
where/, g are two (continuous) functions. We shall also demand that there
is a 1-1 correspondence between points of the curve and values of the
parameter - with the possible exception of a limited number of multiple
points, for which several values of the parameter may yield the same point
(see Ex. 5). If f{t), g(t) happen to be algebraic expressions, the resulting
analysis is much simplified and we are often able to employ the theory of
polynomial equations in problem solving.
Ex. 1. Show that an algebraic parametric form for the ellipse
Wx* + a*y* = а2Ьг
α(1-ί2). 2bt .
r = -TT^1+Τι?1·
What point of the ellipse is excluded by this parametric representation?
Find a similar algebraic parametric form for the hyperbola.
i¥-flY = a*b*.
In the following two examples we illustrate some of the methods available
for the solution of geometrical problems employing algebraic parameters.
In the first example, we define a curve parametrically and deduce a
geometric property; in the second, we obtain a geometrical property of the
parabola using the standard parametric form.
Example 1. The semi-cubical parabola is defined parametrically by
г = at2\ + ataj.
633

FURTHER COORDINATE GEOMETRY [29
Prove that a straight line not parallel to the у axis cuts the curve in either
one or three real points and that, if a straight line not through the origin cuts
the curve at P, Q, R and the tangents to the curve at these three points cut the
curve again at P', Q', R', then P'Q'R' is a straight line.
Any line not parallel to the у axis has an equation of the form
This cuts the curve in points with parameters given by the roots of the
equation .4 .2
at3 = mat2+c,
that is, by at3-mat2-c = 0.
Since this is a cubic equation in t, it has either one or three real roots,
due regard being paid to their multiplicity. This proves the first result
Now suppose that the parameters of the points P, Q, R are respectively
p, q, r. Then, if the equation of the line PQR is у = mx + с we see that the
roots of the cubic equation
at3-mat2 -с = 0
are/>, q, r; thus qr+rp+pq = 0.
But the line does not contain the origin; thus p,q,r + 0 and we have
l/p+l/?+l/r = 0.
Again, since the tangent atPcuts the curve atP', parameterp', we have,
as above, ,,, ч ,,. ч ,,, ,ч .
(i/p)+(i/p)+(i/p) = Q,
giving, lip' = —2jp, similarly \\q' = —1\q, and \\r' = —1\r and we have
α/ρΟ+α/ϊΟ+α/ο = -2[(ΐ//0+(ΐ/?)+(ΐ/ι-)] = o.
It follows that^', q', r' are the roots of a cubic equation of the form
at3-m'at2-c = 0
(where m' = p' + q' + r' and c'ja = p'q'r') and thus the points P\ Q', R'
lie on the line , ,
у = mx + c .
The shape of the semi-cubical parabola is shown in Figure 29.1. О is
called the pole and Ox is the axis of the curve. The Cartesian equation is
x3 = ay2. A connection between the parabola and the semi-cubical
parabola is obtained in Example 2.
Example 2. Given a parabola and a point C, prove that, in general, either one
or three normals may be drawn to the parabola to pass through С but that,
if just two such normals may be drawn, then С lies on a certain semi-cubical
parabola.
634

1] PARAMETRIC FORMS
Fig. 29.1
If the normals at three points, P, Q, R on a parabola are concurrent,
show that the circumcircle of the triangle Ρ QR passes through the vertex of
the parabola.
Take the parabola in the standard form y2 = Aax and let С be the point
(h, k). The equation of the normal at the point P(at2, 2at) is
tx+y = at3 + 2at
and this passes through С if
at3 + (2a-h)t-k = 0.
This is a cubic equation in t, with, in general, either one or three real roots,
and the first part of the question is proved.
If just two normals pass through C, then the cubic equation above has
a repeated root and the condition for this (see Chapter 23) is
4 /. h\3 к2 п
showing that С in this case lies on the curve
4(x-2a)3 = Π ay2.
By Example 1, this is a semi-cubical parabola with pole at (2a, 0) and axis
coinciding with the axis of the parabola.
Now suppose that the normals at P(ap2, lap), Q(aq2, 2aq), R(ar2, 2ar)
meet at C; then p, q, r are the roots of the equation
at3 + (2a-h)t-k = 0
and thus p+q + r = 0.
635

FURTHER COORDINATE GEOMETRY [29
Suppose that the circumcircle of triangle PQR has equation
This cuts the parabola at points with parameters given by the quartic
equation
(at*-x)4(2at-py = y\
But the coefficient of t3 in this equation is zero and thus the sum of the
roots is zero. Since three of the roots are, by definition p, q and r, and since
p + q+r = 0, the fourth root must be zero and we have proved the final
result.
Ex. 2. Prove that the normals to the parabola у2 = Лах touch the semi-cubical
parabola of Example 2. Draw in the same sketch the parabola y2 = 4ax and the
semi-cubical parabola 4(x-2a)3 = Пау2. (It is worth drawing the parabola
accurately and then constructing a large number of normals—which may be
done very rapidly using the geometrical property that, in the notation of Exercise
22 (ft), NG = 2a.)
Ex. 3. Draw the parabola y2 = Aax and shade in the set of all points from
which three real normals may be drawn to the parabola.
Ex. 4. The Folium of Descartes has Cartesian equation
By considering the intersection of the line у = tx with the folium, show that the
curve may be represented parametrically by
_ 3at _ 3at2
X ~ 1 + f3' У ~ 1 + f3'
Taking t = 0 it is seen that the origin lies on the curve and that the χ axis is a
tangent there; by writing и = Г1, show that the у axis is also a tangent at the
origin (which is thus a double point of the curve, that is, a point through which
pass two separate branches of the curve).
Show that the line χ - у = 0 is an axis of symmetry and that the line χ+у = a
is an asymptote. Sketch the curve.
Ex. 5. Using the method of Ex. 4, find a parametric form for the curve
y2 = x\l + x),
and deduce that no point of the curve has an χ coordinate less than -1. Prove
also that the χ axis is an axis of symmetry for the curve.
Show that the origin lies on the curve and that this point arises from two
distinct values of the parameter. Deduce that the origin is a double point of the
curve and determine the equations of the two tangents there.

И
PARAMETRIC FORMS
Exercise 29(a)
1. Show that, if OP = г = at'\ + at3], then Ρ lies on the curve Γ: x3 = агу.
Prove that any line in the plane cuts Г in either one or three real points.
The tangents at three points P, Q, R of the curve meet Г again at P', Q', R'
respectively. Prove that:
(i) if the centroid of the triangle PQR lies on the у axis, so also does the cen-
troid of the triangle P'Q'R';
(ii) if P, Q, R are collinear, so also are P', £>', R'.
2. The cissoid is defined parametrically by
at2 . at3 .
Find the Cartesian equation of the cissoid and give a rough sketch of the curve.
Prove that, if the chord PQ subtends a right angle at O, then the mid-point
MoiPQ lies on a fixed straight line.
3. A curve Г is given parametrically by
r^i + '-^J HO).
Prove that, if the points with parameters tu i2, t3 are collinear, then
1 + ίι+ί2 + ί3 + ίιί2ί3 = 0,
and conversely.
Show that, given a point Ρ lying on Γ, two lines may be drawn through Ρ to
touch the curve at Q and R. If QR cuts Γ again at S find the parameter of the
point S in terms of the parameter of P.
Show that the roots of the equation i2 +1-1 = 0 give the double point of the
curve.
4. A curve Γ is given parametrically by
r = (i2+l/i)i + (i2-l/OJ·
Obtain a necessary and sufficient condition for the three points P, Q, R of the
curve to be collinear.
The tangent at Д meets Γ again at P2, the tangent at P2 meets Γ again at P3
and the tangent at P3 meets Γ again at Pt. РгР3 meets the curve at Q while PaP4
meets the curve at R. Prove that QR is a tangent to Г at Q.
5. A curve is given parametrically by the equations
x = a(\-t\ y=a(t-t3).
Prove:
(i) that an arbitrary line Ix+my+na = 0 meets the curve in three points;
(ii) that if three points t = tu t = i2, t = t3 are collinear then
hh + tsti + htz - -1; (over)
637

FURTHER COORDINATE GEOMETRY 29
(iii) that if tu t2 and t3 satisfy the above equation then the points
t = tj, t = t2, t = t3
are collinear.
A chord through the point (a, 0) meets the curve again at the points Ρ and Q.
Prove that the locus of the middle point of PQ is a curve with parametric equa-
ti°nS x = -iaT\ у=-ОХ7* + 2). (О&С)
6. Prove that, in general, three tangents may be drawn from a point С to the
cubic curve x3 = a2y and that, if these tangents cut the curve again at P, Q, R
then the tangents at P, Q, R are concurrent.
7. A rectangular hyperbola is given parametrically by the equations
χ = ct, y= c/t.
If the four points of the curve with parameters h, t2, t3, i4 lie on a circle, show
that ίιί2Μ4 = 1· Show conversely that, if ht2tzti = 1, then the four points lie
on a circle.
A variable circle passes through the fixed points Α, Β οία rectangular
hyperbola, and meets the hyperbola again at P, Q. Show that the direction of PQ is
fixed.
8. P, Q are variable points on the parabola y2 = Aax such that PQ is parallel to
the fixed line x + ky = 0. The normals to the parabola at Ρ and Q meet at R.
Prove that the locus of R is the normal to the parabola at a fixed point on the
parabola, and find the coordinates of this point. (O & C)
9. Find the equation of the normal to the rectangular hyperbola xy = c2 at the
point (ct, c/t).
The normals to the rectangular hyperbola at the points P, Q, R, S are
concurrent; prove that each of these points is the orthocentre of the triangle formed by
the other three.
10. The normals at three points P, Q, R of the parabola y2 = Aax meet at a
point N. Prove that the centroid of the triangle PQR lies on the axis of the
parabola.
If N coincides with P, prove that QR passes through a fixed point (that is, a
point whose position is independent of P, Q, R). (O & C)
11. Find the coordinates of the point P, other than the origin in which the line
у — tx meets the curve x3+y3 = axy. (t is called the parameter oiP.)
If a line meets the curve in three points whose parameters are h, t2, t3, prove
that h t2t3=-1.
If Q is any point on the curve with negative parameter, not equal to -1, prove
that there are two points РъРг of the curve (other than Q) such that the tangents
at Д and P2 pass through Q. Prove that ОД and OP2 make equal angles (apart
from the sense) with either of the coordinate axes. (O & C)
12. Prove that the equation of the normal to the curve у = χ3 at the point

1] PARAMETRIC FORMS
By considering the maximum and minimum values of a certain function of t
and drawing a rough graph of the function, or otherwise, prove that three normals
can be drawn to curve у = χ3 from a point (0, b) when b > 4/3V3.
How many real normals can be drawn to the curve from a point (0, c) when
0< с < 4/3V3? (O&Q
13. A circle has a diameter О A of length a, and the tangent at A is /. A variable
line through О meets the circle again at Q, and / at R; Ρ is the point on OR
such that OP = QR. If О is the origin and OA is the χ axis, show that Ρ has
coordinates (at2/(1 + t2), at3/(1 + i3)), where t is a suitable parameter.
Prove that, if a line meets the locus of Ρ in three points with parameters
tu t2, tz then t2t3+ t3t!+ ht2 = 0.
Hence, or otherwise, prove that the tangent to the locus at Ρ meets the locus
again at the point with parameter — %t. (O & C)
14. A point on the curve ay2 = χ3 is given parametrically in the form (at2, at3).
If the points on the curve with parameters ii, t2, t3 are collinear, prove that
(l//i) + (l//i) + (l//s) = 0.
Hence show that, if the tangent at the point with parameter it meets the curve
again at the point with parameter i4, then it+2i4 = 0.
Perpendicular lines through О meet the curve at P, Q; PQ meets the curve
again at R and S is the point of the curve such that the tangent at S passes through
R. Prove that OP, OQ are the bisectors of the angles between OS and Ox.
(O&C)
15. The rectangular hyperbola xy = k2 is met by a circle, passing through its
centre O, in four points Au A2, Bu B2. The lengths of the perpendiculars from
О to АгА2 and ВгВ2 are a and b. Prove that ab = k2. (C. S.)
16. Show that there are three values of t, not necessarily real, for which the point
(t2, t3) lies on a given straight line.
P, Q and R are distinct points (p2, p3), (q2, q3) and (r2, r3) on the curve j>2 = x3.
Show that:
(i) if these points are collinear then ~Zpq = 0;
(ii) if the tangents at these points are concurrent, then Σρ = 0;
(iii) there are no real points on the curve for which these two conditions
co-exist.
The tangent at the point Ρ on this curve meets the curve again at P'. Find
the ratio in which PP' is divided by the χ axis. (London)
2. SURFACES AND CURVES IN
IN THREE DIMENSIONS
The simplest surface in three-dimensional space is the plane. Given three
points А, В, С, the vector equation of the plane through ABC is
r = a + A(b-a) + /t(c-a),
where λ, μ are scalar parameters. Notice that we need two parameters to
define the surface.
639

FURTHER COORDINATE GEOMETRY [29
Two surfaces intersect in a curve (which need not necessarily be a
plane curve). Thus, two planes intersect in a line; the vector equation of
the line through А, В is ч
6 г = а + Л(Ь-а),
where we have the single scalar parameter λ.
Another familiar surface is the sphere, which is defined as the set of
points in three dimensions lying at a fixed distance from a fixed point.
If the fixed distance {radius) is с and the fixed point {centre) is A, then the
vector equation of the sphere is
(r-a).(r-a) = c2.
Ex. 6. Prove that
г = a + cos θ cos 0i + sin Θ cos iij + sin фк
gives a parametric representation of the sphere. (Notice now that we have two
parameters; suggest names for them.)
*Ex. 7. Prove that the vector equation (r-a).(r-b) = 0 represents a sphere,
and locate the points A and В as points on the sphere.
The Cartesian equation of a sphere, centre А{аъ а2, a3) is thus
{x-a1)2 + {y-a2)2 + {z-a3)2 = c2,
which may be rewritten
x2+f+z2+2ux + 2vy+2wz + d = 0.
Conversely, an equation of the form
x2+f + z2+2ux+2vy+2wz + d= 0
may be rewritten as
{x+uf + {y+vf + {z+wf = и2+1?+п*-а,
and thus represents a sphere, centre {—u, —v, —w), provided
ιΡ + ΰ+ν? > d.
Ex. 8. Show that the equation
хг+уг + гг-2х-4у+2г-\0 = О
represents a sphere, and find its centre and radius.
*Ex. 9. Show that four points in space in general define a unique sphere. What
exceptional cases may arise?
The intersection of the sphere
{x-a1)2+{y-a2f + {z-a3f = c*
640

2] SURFACES IN THREE DIMENSIONS
with the plane ζ = 0
is the curve in the Oxy plane with equation
{x-a^f + iy-a^ = сг-а\,
which is seen to be a circle provided c2 > c%. If c2 = a$, the equation
represents a single point and, if c2 < of, the sphere and plane do not intersect.
Thus, a sphere and a plane, if they intersect at all, intersect in a circle,
which may be of zero radius if the plane is tangential to the sphere. The
centre of the circle lies on the perpendicular from the centre of the sphere
on to the plane. If the sphere and the plane have vector equations
(r-a).(r-a) = c\ r.n=p (where |n| = 1),
this perpendicular will be r = a + An.
To find the value of λ which gives the centre of the circle determined by
the sphere and the plane we solve
r = a + An and r.n=^,
giving (α + λή).η = p.
Thus, λ =p — a. n, since η. η = 1 and the position vector of the centre of the
r = a + (p-a.n)n.
circle ii
Ex. 10. Find the centre and radius of the circle determined by the sphere
(x-l)2 + (y-3)2 + (z-2)2=20
and the plane x—y-3z=3.
Now consider two spheres, with equations
(r-a).(r-a) = c2, (r-b).(r-b) = d\
These equations may be rewritten
r.r-2r.a = c2-a.a and r.r-2r.b = uP-b.b
which on subtraction, give
2r.(b-a) = c2-uP-a.a+b.b.
But this is the equation of a plane, perpendicular to the vector b—a,
that is, to the line of centres AB and, since a plane and a sphere determine
a circle, we see that two spheres, provided they intersect at all, intersect in
a circle.
641

FURTHER COORDINATE GEOMETRY [29
The line г = a + λη (|n| = 1)
cuts the sphere (r-b).(r—b) = c2
in points with parameters given by
(An + a-b).(An + a-b) = c2;
that is, by the quadratic equation
A2 + 2An.(a-b) + (a-b).(a-b)-c2 = 0.
Thus, a line cuts a sphere in two points (which may be coincident or
imaginary).
If η is given and we take A to be the mid-point of any chord in the
direction defined by n, the two roots λ1 and λ2 of the above quadratic equation
have the same magnitude but opposite sign; that is, λι + λ2 = 0. It follows
that n.(a —b) = 0 and A lies on the plane n.(r—b) = 0 which passes
through the centre, B, of the sphere and is perpendicular to n; thus the locus
of the mid-points of parallel chords of a sphere is a plane through the
centre of the sphere (a diametral plane).
Example 3. Find the equation of the tangent plane to the sphere
x2+y2 + z2-Sx-6z-2 = 0
at the point (1, 3, 0).
Prove that the plane . , , „„ _
F x-5y+z + 20 = 0
is a tangent to the sphere and find its point of contact.
Rewriting the equation of the sphere in the form
(*-4)2+/ + (ζ-3)2 = 27
we see that the centre of the sphere is the point (4, 0, 3).
(i) The vector (4i + 3k)- (i + 3j) = 3i- 3j + 3k is normal to the required
tangent plane whose equation is thus
l(*-l)-l(y-3)+l(z-0) = 0,
that is x-y+z + 2 = 0.
(ii) The radius of the sphere is ^27; but the perpendicular distance from
(4, 0, 3) to the plane x-5y+z + 20 = 0 is
4+3 + 20 _
V(l2+52+l2)- VZ/·
Thus the plane x— 5y+z+20 = 0 is tangential to the sphere.
A normal to the given plane is i —5j + k and thus any point on the
642

2] SURFACES IN THREE DIMENSIONS
radius to the point of contact has coordinates (4 + λ, — 5λ, 3 + λ). This lies
on the given plane if
(4 + Λ)-5(-5λ) + (3 + λ) + 20 = 0,
that is, if λ = — 1. This gives the coordinates of the point of contact as
(3, 5, 2).
The vector equation of the tangent plane at a point Τ of the sphere,
centre A and radius c, is easily derived by the same method as that used
in the last example. For a unit vector in the direction AT is (t —a)/c and
thus the equation of the tangent plane is
(r-a).(t-a) = c\
Notice that the equation of the tangent plane is obtained from the
equation of the sphere by writing t for г in one of the brackets; this simple rule
enables us to write down the equation of the tangent plane at any point of
the sphere. For example, the equation of the tangent to the sphere
x*+y2 + z*-2x + 4y+12z + 4 = 0
at the point (1, -3,0) is
*.l+j>.(-3) + z.0-(*+l) + 20>-3) + 6(z+0) + 4 = 0
or y-6z+3=0.
Example 4. Determine whether or not the circles
Si:xz+f + z2-lSx-6y-4z+14 = 0, x+3j>+5z = 0,
S2: x2+y2 + z2-6x+l4y-12z + 22 = 0, x+y+z = 2,
are linked (as in the links of a chain).
The planes of the two circles meet in the line, /, with equations
x + 3j>+5z = 0, x+y+z = 2
,•1 j j. ·* —3 У+1 z
which reduce to —j— = —у = у
and any point on this line has coordinates (3 + λ, — 1—2λ, λ). Thus /
meets the given sphere through 5Ί at points given by
(3 + λ)2+(-1-2λ)2+Λ2-18(3 + λ) + 6(1+2λ)-4λ+14 = 0,
or 6λ2-24 = 0,
giving λ = +2 or —2.
643

FURTHER COORDINATE GEOMETRY [29
Again, / meets the given sphere through S2 at points given by
(3 + λ)2+(-1-2λ)2+Λ2-6(3 + λ)-14(1+2λ)-12λ + 22 = 0,
or 6λ2-36λ = 0
giving λ = 0 or 6.
Thus, if Si. meets / at A1 (λ = -2) and B1 (λ = +2) and S2 meets / at
Α2(λ = 0) and52(A = 6), the order of the points on the line is A1A2B1B2
and the circles must therefore be linked.
Another familiar surface in three dimensions is the circular cylinder.
If the axis of the cylinder passes through the point A(alt a2, a3) and is in
the direction of the unit vector u = /i + mj + ик, any point P(x, y, z) lies
on the cylinder if its perpendicular distance from the axis is a constant, b.
Thus, if OA = a, OP = r, we have (see Figure 29.2) |(r-a) л u| = b,
or in Cartesian form,
Z[n(y-a2)-m(z-a3)]* = b\
Fig. 29.2
If we take the χ axis as the axis of the cylinder
(and thus 1=1, m = η = 0 and аг = a2 = a3 = 0)
this equation reduces to the much simpler form
y2+z* = b\
*Ex. 11. Show that a parametric form for a cylinder, radius b, with its axis along
the ζ axis, is , a. , , . a. ,,
г = b cos θι+b sin 0j + Ak.
Notice once again that we have two parameters, θ and λ for this surface.
Ex. 12. What does the surface ,
-A = '
a2 b2
represent in three dimensions?
A cylindrical spring is a three-dimensional curve lying on the surface
644

2] SURFACES IN THREE DIMENSIONS
of a circular cylinder. The curve is called a helix and is defined parametric-
ally by θ
r = 6cos0i + 6sin0j+|-k
where;> (thspitch of the helix) and b (the radius of the cylinder on which the
helix lies) are constants. Note that the parametric representation of the
helix involves only one parameter.
*Ex. 13. Interpret geometrically the constant p, the pitch of the helix.
Ex. 14. Show that the helix defined above is one of the two curves of intersection
of the cylinder x2+y2 = b2 and the corrugated surface χ = a cos . Show also
Ρ
that the other curve formed is an oppositely twined helix.
The right-circular cone is the surface traced out by a variable line through
a fixed point О (the vertex of the cone) and making a constant angle with
a fixed line though О (the axis of the cone). If we take О as the origin and
the unit vector u = li + m\ + rik as defining the direction of the axis we have,
for any point Ρ of the cone,
r.u = |r| cos a,
where r = OP and α is the angle of the cone (Figure 29.3). In Cartesian
form, this gives ., N, , „ , , , „4 ,
6 (lx + my+nz)2 = (x*+y2 + z2) cos2 α
Fig. 29.3
and we see that the equation of a cone with its vertex at the origin is
homogeneous in x, y, z. We saw in Chapter 26 that the section of a plane
with a cone is a conic.
Example 5. Obtain the equation of the cone with vertex V (2, —1, —2)
which touches the sphere x^+yP+z2 = 1 at the points of a circle on the
surface of the sphere.
Since the centre of the sphere is the origin, O, and the cone touches the
sphere, VO is the axis of the cone. Let FT be a line on the surface of the
645

FURTHER COORDINATE GEOMETRY [29
cone (a generator of the cone) (see Figure 29.4). Then, if LOVT = a, since
OV = 3 and the radius of the sphere is 1,
_2V2
cosa 3 .
Now consider any point P(x, y, z) on the cone. A unit vector along
OF is u = f i — Jj — f к and thus, since
VP.u= |VP|cosa
we obtain
[f(x-2)-Ky+l)-f(z + 2)]2 = l[(x-2y + (y + lf + (z + 2n
which may be rewritten as
4(*-2)2 + 7(y+l)2+4(z + 2)2-4(>+l)(z + 2) + 8(z + 2)(;c-2)
+ 4(*-2)O+l) = 0.
3. CHANGING THE COORDINATE SYSTEM
IN A PLANE: ROTATION OF AXES
In Chapter 13, a linear transformation of the plane into itself of the form
x' = ax + by, y' = cx + dy
was regarded as a mapping of the plane into itself, in which the point
P(h,k) is mapped into the point P'(ah + bk, ch + dk), coordinates being
referred to the same axes. In particular, the transformation Τ whose
matrix P, is orthogonal, . „ . _4
_ /cos0 -sin0\
Г - Ism θ cos θ)
has the effect of rotating any line О A through the angle Θ, measured
in a positive sense, into the position OA', where О A = О A'.
646

3] CHANGES OF COORDINATE SYSTEM
There is, however, an alternative way of looking at an orthogonal linear
transformation. Suppose we leave each point of the plane where it is but
re-name the coordinates by taking a new pair of perpendicular axes, with
the origin still at O, but with the new axes Ox', Oy' making an angle — θ
with the old axes Ox, Oy (Figure 29.5).
Fig. 29.5
Let OP = r, and suppose OP makes an angle α with Ox. Then
Thus,
*' = г cos (« + <?),
x' = r cos α cos 6
= л; cos θ—ysin Θ,
y' = r cos α sin θ + r sin α cos
= л; sin θ+ycos Θ.
In matrix notation, this means
sin(a+0).
;sin0
(Й-
\sin0
cos θ
and we have precisely the orthogonal linear transformation discussed ii
the previous paragraph.
In summary, the linear transformation T, with orthogonal matrix
may be regarded either (i) as mapping each point A into another point A',
where О A = О A' and АО A' = + θ or (ii) leaving each point in its original

FURTHER COORDINATE GEOMETRY [29
position but rotating the axes of coordinates through an angle — θ (i.e.
through θ clockwise).
We shall now show how this new interpretation of an orthogonal matrix
may be employed to find the major and minor axes of a central conic.
Consider first the equation
ax2 + by2 = 1 (a, b not both negative),
which represents a conic with its centre at the origin and its major and
minor axes along the axes of coordinates. If we rotate the axes through
an angle - Θ, that is, if we rename each point by the rule
χ' = χ cos Θ—у sin Θ,
у' = л; sin θ+y cos Θ,
then χ = x' cos θ+y' sin Θ,
у = — χ' sin θ+y' cos Θ,
and the new equation of the conic takes the form
a(x' cos θ+y' sin 0)2 + b(—x' sin θ+y' cos Θ)2 = 1,
or
(acos2e + bsin20)x'2 + '2(a-b)x'y' sin θ cos θ + (a sin2 θ + b cos2 Θ) y* = 1.
We may rewrite this as
a'x'2+2h'x'y' + b'y'2 = 1,
which is the typical form of equation of a central conic with its centre at the
origin.
Now consider the converse problem: given the equation of a central
conic in the form ^ + lhxy +by2 = \,
how can we rotate the axes to obtain the equation in the form
a'x'2 + b'y'2 = 1?
The equation a^ + lhxy+by2 = 1 may be rewritten in matrix form as
or u'Au = 1,
where o=(J) and Α- β J).
The problem thus reduces to that of diagonalizing the matrix A for
then the equation would become, in the new coordinate system,
648

3] CHANGES OF COORDINATE SYSTEM
or λ1Λ;'2 + λ2>''2 = 1.
But A is a symmetric matrix and, by Theorems 28.3 and 28.4, we can find
an orthogonal matrix Ρ such that PAP' = D where
Ai and λ2 being the eigenvalues of A. (Recall that, for an orthogonal matrix,
P"1 = P'. Notice too that, to obtain PAP' in diagonal form (rather than
P'AP), the matrix Ρ is obtained by transposing the matrix with columns
which are the eigenvectors of the matrix A; that is, the eigenvectors appear
as the rows) The fact that Ρ is orthogonal tells us that the new axes, Ox',
Oy' will be perpendicular. The details of the transformation are given
below.
Consider the linear transformation defined by
ν = Pu,
where ν = I , I, the new position vector of the point u = I I. Then
P'v = Р-Ч = u
and the equation u'Au = 1
transforms into the equation
(P'v)' A(P'v) = 1
or, since (P')' = P, and using the results of Ex. 8, Chapter 28,
v'PAP'v = 1.
But PAP' = D, a diagonal matrix and the equation has been transformed
into
<*»(*£)(?)■
or λιχ'*+λ?' = \.
If we define the trace of the matrix A, tr A, as
trA = a+b
(see Exercise 28 (b), Questions 8-10), the characteristic equation of A,
Ι α-λ h I
\ h b-X\ '
may be written in the form
A2-AtrA + detA = 0.

FURTHER COORDINATE GEOMETRY
Thus we have
but
tr A = λ1 + λ2,
detA = AiA2;
trD = Ai + A^
detD = λ^,
and thus both the trace and the determinant of the matrix denning the
conic remain invariant under the given linear transformation.
*Ex. 15. Show how the invariance of the trace and determinant enables us to
deduce immediately the nature (ellipse, hyperbola) of a central conic
ax2+2hxy + by2 = 1.
Ex. 16. Prove that, if Q is any non-singular 2x2 matrix, then
tr (Q^AQ) = tr A and det (Q^AQ) = det A.
Can these results be generalized for higher-order matrices (the trace still being
denned as the sum of the elements in the leading diagonal)?
Example 6. Determine the nature of the central conic
5хг + %ху+\\уг = 1
and find the lengths of its axes.
The matrix A of the conic is given by
Let the eigenvalues of A be Λ1( λ2; then
Λ1+λ2= 5+11 = 16,
λ1λ2 = 55-16 = 39
giving λι = 3, λ2 = 13.
The equation of the conic may thus be reduced to the form
3x'2+13/2= 1.
This is seen to be an ellipse, with major semi-axis 1/^/3 and minor semi-
axis 1/V13.
To draw a sketch of the ellipse, showing it in relation to the original
axes, we need to find the inclination of the axes of the conic to the axes
Ox, Oy. This may be done by obtaining an explicit form for the reducing
matrix P.
Corresponding to the eigenvalue λ1 = 3 we have the unit eigenvector
(_ ι/ /5) > wnile corresponding to λ2 = 13 we have U, /5) ·
650

CHANGES OF COORDINATE SYSTEM
Thus
>-4 D-
and
The transformation ν = Pu
may be interpreted as a rotation of axes clockwise through a
θ = arctan i
y,
xtCx
(Figure 29.6).
/
1 ^4
^__J/^^*
Fig. 29.6
In other words, in terms of the original coordinate system, the major and
minor axes of the conic lie respectively along
x + 2y = 0 and Ix-y = 0.
*Ex. 17. Show that the equation
f+ihxy+by* ■■
(i) represents an ellipse if h2-ab < 0, and a > 0;
(ii) represents a hyperbola if h2-ab > 0;
(iii) represents a rectangular hyperbola oi a+b = 0;
(iv) represents a pair of parallel straight lines if h2 = a
Ex. 18. Let us write the conic of Ex. 17 in the form
x'Ax = 1,
И·»©'
u is a unit direction vector and Ρ is a point with position vector p. The point
with position vector p + λιι lies on the conic if
(р + Ли)'А(р + Ли) = 1;

FURTHER COORDINATE GEOMETRY [29
show that this reduces to
A2u'Au + 2Au'Ap + p'Ap = 1.
If Ρ is the mid-point of a chord parallel to u, prove that
u'Ap = 0;
what does this tell us about the vector Au ?
The axes of a central conic (other than a circle) may be defined as the (unique)
pair οι perpendicular conjugate diameters: use the results proved above to deduce
that u is in the direction of an axis if there exists a number к such that
Au = ku.
Interpret your results in terms of eigenvectors.
Exercise 29(b)
1. Obtain the equation of the sphere:
(i) with centre (0, - 1, 0) and radius 1;
(ii) with centre (2, - 3, - 1) and radius 5;
(iii) with centre (a, b, c) and radius J(a2 + b2 + c2).
2. Find the centre and radius of the sphere:
(i) withequationx!+y! + z2-6;c-2>' + 2z-5 = 0;
(ii) with equation x2 + y2 + z2 + Ax -Sy-2z- 60 = 0;
(iii) with equation x2+y2 + z2-2ax-2by + 2cz + 2bc-2ca+2ab = 0.
3. Find the equation of the sphere:
(i) through the points (4, 4, 4), (5, 6, 1), (0, -4, 2), (7, 3, 2);
(ii) through the points (2, 4, 4), (5, 1, 4), (3, 4, 3), (-3, - 1, 0);
(iii) through the points (0, 0, 0), (a, 0, 0), (0, b, 0), (0, 0, c).
4. Find the centre and radius of the circle:
(i) with equations x2+y2 + z2-12x + 2y-10z+A4 = 0, x-y + z = 6;
(ii) with equations x2+y2 + z2-lSx- 10y+ 12z- 55 = 0, 2x+3y-Az = -1.
5. The position vectors of points A and В relative to some origin О are
respectively a and b. Show that the locus of a point Ρ which moves in space in such
a way that PA2 + PB2 = d2, where d is a constant, is a sphere with its centre at
the mid-point of AB. Find the radius of the sphere in terms of a, b and d. What
can you say about the sphere if d = |a- b| ?
6. With the notation of Question 5, show that the locus of a point Ρ which moves
in space in such a way that РАг+РВ2 +РС2 = d2, where С is a point with
position vector c, is a sphere and find its centre.
Can you generalize this result in any way?
7. Show that the line г = Au, where u is a unit vector, cuts the sphere
(r-3i-2j-2k).(r-3i-2j-2k) = 6
at two points with parameters given by the quadratic equation
A2-2Au.(3i+2j + 2k)+ll = 0.
652

EXERCISE 29
Deduce that the line through the origin in the direction of the vector i + j + 3k
touches the given sphere.
Prove more generally that the line through the origin in the direction of the
vector /i + wj + ик is tangential to the sphere if and only if
(3/+2»г + 2и)2 = \\{P + m2 + n2),
and hence show that the equation of the cone with vertex the origin which
circumscribes the sphere is
(3x+2y + 2z)2 = lHx'+y + z2).
8. Find the equation of the cone obtained by rotating the line
x/p = y/g = z\r
about the line x/a = y/b = z/c.
9. A right-circular cone of semi-angle α has its vertex at the origin and contains
lines in the directions i, i + j and i +j + k. Show that
sec2 α = 9-2V2-2V6.
10. Show that the equation
yx+zx+xy = 0
represents a cone, vertex the origin and semi-angle arctan V2· What is the
curve of intersection of this cone:
(i) with the plane x+y+z = 1;
(ii) with the plane x-2y + z = 0;
(iii) with the plane x- 2y + ζ = 1 ?
11. A circle, S, is defined by the vector equation
(r-a).(r-a)-c2 = 0, r.n = p.
Show that any sphere passing through the circle S has vector equation
(г-а).(г-а)-с2 + Л[г.п-р] = О
and deduce that, if В is a point not on the plane r.n = ρ and if OB = b, then
the vector equation of the sphere through S and containing the point В is
(Ь.п-р)[(г-а).(г-а)-с2]-[(Ь-а).(Ь-а)-с2](г.п-р) = О.
Find the centre and radius of the sphere which passes through the point
(1, -1, 1) and contains the circle.
x2+y2 + z2= 4, x+2y+z=l.
12. A is the fixed point with position vector ak, and L, Μ are variable points
with position vectors λΐ, μ) respectively, where λ, μ are scalar parameters. If
Ρ is a point such that the angles LPM, MPA, APL are all right angles show that,
whatever the values of λ and μ, Ρ lies on a sphere, centre A and radius a.
13. Find the equation of the tangent to the sphere
x2+y2 + z2- 10x-2y-12z+ 35 = 0
at the point (4, 2, 1).
Prove that the plane x+y + z = 3 is also a tangent to the sphere and find the
point of contact.
653

FURTHER COORDINATE GEOMETRY [29
14. Find the values of к for which the plane
4x+2y+3z = к
touches the sphere „ „ „ ,„
x2+y*+z*-10x-6y-10z+30 = 0.
15. The point В lies outside the sphere (r-a).(r-a)-c2 = 0. If OB = b,
prove that the length of the tangent from В to the sphere is (b - a). (b - a) - c2.
Prove that the set of points from which the tangents to two non-intersecting
spheres are of equal length is a plane which is perpendicular to the line of centres
of the two spheres.
Deduce that the set of such points for three non-intersecting spheres is a line
and that, given four non-intersecting spheres with non-coplanar centres, there is a
point В from which the tangents to the four spheres are of equal lengths.
Explain how the last result can be used to show that a sphere may be
circumscribed about any tetrahedron.
16. Describe the curve of intersection of the surface with equation
хг + уг-гг = О;
(i) with the plane ζ = 1;
(ii) with the plane χ = 1.
What surface does the equation represent?
17. Show that the parametric equation of the helix of pitch 8 which lies on the
surface of the cylinder x2 + z2 = 1 is
40
r= cos0i+— j + sin#k.
Find the coordinates of the point in which the helix cuts the plane
J2x-y+2j2z = 0.
Show also that the helix cuts the plane x+z = 0 at an infinite number of
points, and explain this geometrically.
18. Prove that the mid-points P, Q, R, S, T, U of the edges ВС, CA, AB, AD,
BD, CD of a tetrahedron ABCD, in which the opposite edges (ВС, AD),
(CA, BD), (AB, CD) are perpendicular pairs, lie on a sphere. Is the converse
result, that if the points P, Q, R, S, T, U lie on a sphere then the opposite pairs
of edges are perpendicular, true or false?
If the sphere cuts ВС again at P', prove that ВС is perpendicular to the plane
P'AD.
19. Two spheres, centres Аг and Аг and radii cx and c2 intersect in a circle of
radius d. If АгАг = /, where Ρ = c\ + <%, prove that d = c^/Zand find the ratio
in which the centre of the circle divides the line АгАг.
20. Prove that there is one and only one sphere which contains a given circle
and passes through a given point not in the plane of the circle. Two circles, not
in the same plane, meet at points A and B. A plane meets one of these circles
at С and D and the other circle at Ε and F. Prove that:
(i) the points C, D, E, F lie on a circle;
(ii) the lines AB, CD, EF are either concurrent or parallel. (O & C)
654

3 + Г, + й=1·
EXERCISE 29
21. Show that, if the ellipse (x2/a2) + (y2/b2) = 1 lying on the plane ζ = 0 is
rotated through four right angles about the χ axis, the equation of the surface
obtained is „
x2 y2 z2
α2 ό2 ό2
The surface thus obtained is called an oblate spheroid if a < b and a prolate
spheroid if a > b. Both surfaces are particular cases of the ellipsoid.
2 y2 Z2
By considering sections of this ellipsoid with planes χ = ρ, у = q, ζ - r,
describe the shape of the general ellipsoid and in particular, show that it is a closed
surface.
22. Describe the nature of the hyperboloid of one sheet
a2 + b2 с2
and of the hyperboloid of two sheets
й2 о2 С2 '
23. Describe the nature of the elliptic paraboloid
cP Ь* с
and οι the hyperbolic paraboloid
α2 ό2 _ с '
24. Prove that the hyperboloid of one sheet (see Question 22)
xl+yl-t = ι
a2 b2 c2
contains the line determined by the two planes
?+5= λ(ΐ+^) and λ(^-ή= 1-
Call this line a λ generator. Prove also that the surface contains μ generators of
the form
-ί-Ή-'ίΝ)-1-
pair of λ generators and any ρε
lut that each λ generator meets
of the form
/α(λ + μ) Κ1-λμ) <*λ-μ)\
\ 1 + λμ ' 1+λμ ' 1 + λμ ) '
Prove also that any pair of λ generators and any pair of μ generators are
necessarily skew lines but that each λ generator meets every μ generator at a
point with coordinates of the form

FURTHER COORDINATE GEOMETRY [29
(Notice that we have obtained a parametric representation of the hyperboloid
of one sheet in terms of the two parameters λ, μ.)
25. (See Questions 23, 24.) Prove that the hyperbolic paraboloid
a2 b2 с
contains a system of λ generators and a system of μ generators with the
properties that any pair of λ generators are skew, any pair of μ generators are skew, but
each λ generator meets every μ generator in a point of the form
(α(λ + μ), Κλ-μ), Ιολμ).
26. Determine the nature of the central conic
7x*-8xy+13y* = 150
and find its eccentricity and the equations of its axes.
27. Determine the nature of the central conic
6x* + 5xy-6y2 = 169
and find its foci and the equations of its axes.
28. Show that the equation
x* + 6xy+y* + 6 = 0
represents a hyperbola and find its eccentricity and the equations of its
asymptotes.
29. Two lines, h and /2, intersect at an angle a. Find the locus of the centre of а
sphere of given radius which touches both lines.
656

Revision exercise С
1. The polynomials ^(л:), h(x) both leave the same remainder on division by
(x—a). Prove that the polynomial
xgQc)-ah(x)
is divisible by (x—a).
The polynomial/(л:) (degree > 4) leaves a remainder of rx+s on division by
(х-ЬУ. Prove that f"(x) is divisible by (x-b).
2. If ζ Φ 1 and -
show that \z\ = 1 if and only if Re (w) = 0.
3. If a < b => a" < bn for all α, ό e Λ where и е Z, what further conditions does η
satisfy ? Are these further conditions still necessary if a, b are restricted to positive
real numbers? (S.M.P.)
4. The function/: R -» R is defined by
№ = χ*-2\χ\.
Sketch the graph of/and find what values of л: are invariant under/.
5. Solve the equation 3 sin (9 + 2 sin (60° - Θ) = 2
for values of θ lying between 0° and 360°.
6. A random number table consists of a succession of digits each chosen at
random independently from the set 0, 1, 2,..., 9. Two successive digits are taken
from such a table; show that the probability that their sum is 9 is 1/10.
Four successive digits are taken from the table. Calculate the probability, p,
that the sum of the first two equals the sum of the third and fourth. (M.E.I.)
7. Prove that ,- _ n,
a + ar + ar2 + ...+arn-1 = ^~-^,
where гФ1,
Prove that in general
(1 + x+x2 + ... + x*") (1 - x+x2 -... +x2n) = l+x2 + xi+...+xi\
State any values of χ for which your proof does not apply and obtain the
appropriate results in each of these cases. (O & C)
8. Prove that, if the points О, А, С are non-collinear, then the position vector
with respect to О of any point in the plane О AC may be expressed uniquely in
terms of a = OA and с = ОС. (over)
657

REVISION EXERCISE С
О ABC is a parallelogram, L is the mid-point of AB and Τ is the point which
divides OB in the ratio 3:2. CT meets О A at X and LT meets ОС at У. XY
meets 05 at Z. Find the ratio OZ.ZB in which Ζ divides ОЯ.
■π·
prove by induction that
An = i(6n-2") (A-21)+ 2*1.
form the products MQ and NQ. What law, which is true for products in the
algebra of real numbers but which is not true for products of matrices, is
exemplified by this? (S.M.P.)
11. Express ^
* = гЩ and q = Y^b
in the form x+jy, where χ and у are real.
Find also the modulus and the argument of ρ and of q. Hence write down the
argument of ρ +q. (M.E.I.)
12. Express in partial fractions:
6-Зл: 2х-3 Зл:2-7л: + 3
W (x-D(x + 2)' W х(1+х2У (Ш) (1-х)2 (1-2*)'
13. Two concentric circles have radii 10 and 15 cm. If a point is taken at random
in the annulus formed by them (all points of the annulus being equally likely) find
the expected distance of the point from the centre of the circles. Explain why
you would expect your answer to be greater than 12-5 cm.
14. Ρ is the point (at2,2at) on the parabola^2 = 4ax and OQ is the chord passing
through the origin О and parallel to the tangent at P. Find the coordinates of
the point of intersection, R, of the tangents at Ρ and Q. Give a reason to show
that the locus of R is another parabola.
If S is the mid-point of О Q, prove that PSR is a right-angled triangle, and that
the area of triangle PQR is \аНг. (О & С)
15. The sets А, В, С, D are denned as follows:
A = {xeR: -2< x^l}, В = {xe R: -1 ^ χ < 2},
C= {xeR:x> 1}, D = {xe R: χ *ζ -2}.
Express in the form {:}, the sets:
&A(]B; (ЩЛ'ПС; (iii) А П C; (iv) Β' η С П D';
(ν) (Α η Β) υ (Β η С')·
Express the sets
Е= {xeR-.x <-l} and F={xeR:x<2}
in terms of unions and intersections of the sets А, В, С, D and their complements.

REVISION EXERCISE С
16. OAB is a triangle, right-angled at О; О A = a, OB = ό. Points X, У are
taken on OA, OB (or on these lines produced) in such a way that
OX+OY= a+b.
A У and BX meet at P. Find the locus of P.
17. Assuming a population in which equal numbers of births occur in each of
the twelve months of the year calculate the probabilities that, of four persons
taken at random:
(i) no two will be found to have birthdays in the same month;
(ii) exactly two will have birthdays in the same month and the others in
different months. (M.E.I.)
18. The polynomial ax2 + bx+c can be represented by the vector
Θ-
Find a 3 χ 3 matrix Μ which premultiplies this vector to give the vector
representing the derivative of the polynomial. Check that M3 = 0 and comment on
the significance of this result. (S.M.P).
19. Prove that
1*_2*+3*-4*+... + (-1)п-1и* = (-1)"-Чи(и3 + 2и2-1).
20. Find all angles in the interval 0 =ξ θ =ξ 2π satisfying the equation
sin 20 + cos (0-;|π) = 0.
21. If к is very small, show that χ = 1 - \k is an approximation to a root of the
equation **4-*·-ι = ο,
and find approximate values for the other two roots.
22. Solve the inequality , _..., ..
(x-7)(*+3)<1-
23. Express Αχ_χ
"(2x-l)(3x-l)
in partial fractions and deduce a quadratic approximation for^ in the
neighbourhood of the point (0, -1).
Deduce that the line x+y = -1
is the tangent to the graph of у at the point (0, -1) and that, near this point, the
curve lies in the region „ ^
{(x,y):x+y > -1}.
24. a,p,q, r are four non-zero complex numbers such that
p+q + r = a, p2+q2 + r2 - a2, pqr = a*.
Find, in terms of a,
(i) \{a-p){a-q){a-r)\; (Щ l/p+l/q + l/r.

REVISION EXERCISE С
25. Three points P,Q,R vary in position on the surface of a fixed sphere in such
a way that PA>+QA> + RA>
is constant, where A is a fixed point. Find the locus of the centroid of the triangle
PQR.
26. If θ is real and ζ = cos Θ+j sin Θ, find
1 , 1
zn-\—- and zn—-.
zn zn
Determine a, b, c, d such that
27 cos3 θ sin3 θ = a sin 80 + b sin 6Θ + с sin Λθ + d sin 20. (M.E.I.)
27. Show that the cubic equation
2x?+3x*-12x+k = 0 (ureal)
has three distinct real roots if and only if
- 20 < к < 7.
Solve the equation completely if к = -13.
28. Two real numbers are taken at random from the set of real numbers
{xeR:0^ x^ 1}.
Find the probability:
(i) that the square of their sum is greater than 1;
(ii) that the sum of their squares is greater than 1.
29. Show that the point
(β(1-ί2)/(1+ί2), 26ί/(1 + ί2))
χ* ν2
lies on the ellipse — + -^ = 1
for all values of t.
Deduce that, if the line lx+ my — 1
is a tangent to the ellipse, then
IW + mW = 1.
30. Prove that Р + 22+... + и2 = £и(и+1) (2и+1).
If Sl= 12 + 42 + 72 + ... + (Зи-2)2,
and j2 = 22 + 52 + 82 + ... + (Зи-1)2
show that j2 — Ji = Зи2.
and Ji+j2 = 6n3-n.
Hence, or otherwise, find Ji and s2. (O & C)
31. Prove that the medians of a triangle ABC are concurrent at a point G.
О is an arbitrary point in the plane of triangle ABC and L, Μ, Ν are the
midpoints of ВС, CA, AB. Prove that the lines through L, Μ, Ν drawn parallel to
О A, OB, ОС respectively are concurrent at a point H.
Prove also that the line GH passes through О and determine the ratio OG: GH.
660

REVISION EXERCISE С
32. It is observed that
1 + 2 = 3
4+5+6 = 7+8
9 + 10+11 + 12 = 13+14+15
16+17+18+19 + 20 = 21+22 + 23 + 24.
Enunciate a general result of which the above are particular cases and prove
your conjecture.
33. For what values of θ in the interval 0° < θ < 360° is the expression
6 cos2 0-cos#
less than one ? Give your answers to the nearest tenth of a degree.
Answer the same question for the expression
6 sin2 θ — ύτιθ.
34. Express
(x-3)(x-2)
in partial fractions.
If л: is so large that (l/л:)3 and higher powers of 1/x may be neglected, show that
the given expression is approximately equal to (_хг + х- 1)1 χ3.
35. Show that there is just one real value of the constant λ for which the equations
λχ+y + z = 2x + Xy+3z = 4x—2y + Xz = 0 have a common solution other than
χ = у = ζ = 0. Find, for this value of λ, the solution of these equations which
satisfies also the equation x+y + z = 3. (M.E.I.)
36. A and В play a match of five games, each of which must be won or lost. In
each of the first three games the probability that A will win is 2/3 and in the
remaining two games the probability is 3/4. Find:
(i) the generating function for the probability that A wins r games;
(ii) the probability that A will win the match;
(iii) the average number of games that A will wia (O and С modified)
37. Prove that, if A, B are two 3x3 matrices, then
(BA)' = A'B'.
What is the transpose of a 1 χ 1 matrix ?
Let Τ, Τ be two linear transformations of three-dimensional space into itself,
with matrices A, A', where A' is the transpose of A. Prove that, if
T(P) = Q and T'(P) = R,
then OP is perpendicular to QR.
38. Show that the points O(0, 0, 0), /1(0, 2, 2), 5(2, 0, 2), C(2, 2, 0) are the
vertices of a regular tetrahedron. (over)
10 PPMll 661

REVISION EXERCISE С
L, Μ are the mid-points of the edges О A, OB respectively. The tetrahedron
О ABC is now cut through by a plane CML. Find the cosines of the acute angles
between the plane CML and
(i) the plane OML; (ii) the plane OCM.
39. Prove that, for any complex number z, zz* = \z\2.
The reflection of the point zt in the line θ = α through the origin in the Ar-
gand diagram is z2 and the reflection of z2 in the line θ = β through the origin is
z3. Prove that , „ . . „ .
z2 = z1{ooi2a+ism2a)
and that z3 = zjcos (2/?- 2a)+j sin (2/?- 2a)].
40. Prove, by induction, or otherwise, that
Σ r(r+l) (iy-1 = 16-(и2+5и + 8) (i)«-i. (M.E.I.)
41. If OA = a, OB = b, prove that the position vector, relative to O, of the
mid-point of AB is K» + b). Deduce that the lines joining the mid-points of
opposite edges of any tetrahedron are concurrent.
If ABCD, A'B'C'D' are any two parallelograms, not necessarily coplanar,
and if the mid-points of AA', BB', CC, DD' are respectively W, Χ, Υ, Ζ, prove
that WXYZ is a parallelogram.
42. Give rough sketches of the curves:
(i) у = sinx; (ii) у = sin(x+|77); (iii) у = l + 2sin(x+3-7r).
How many real roots has the equation
x= l + 2sin(x + -37r)?
43. Sketch the graph of
* x2-5x-W
For what values ofxis^ < — 1 ?
44. Find the coordinates of the point of intersection, P, of the plane
х+Ъу+z = 0,
and the line joining ДО, 1, 15) and 5(3, - 5, 3).
Find the ratio АР: РВ.
45. The line / is a tangent to a parabola with focus F and N is the foot of the
perpendicular from Fto /. Prove that N lies on the tangent at the vertex and that
FN produced meets the directrix in a point Μ such that FN = NM.
The triangle ABC is right-angled at B. It is reflected in a line m in such a way
that the image of С lies in the original line of AB. Prove that m is a tangent to a
parabola with focus at С and directrix AB. (M.E.I.)
46. Interpret geometrically the transformations performed by premultiplying
the position vectors of points in a plane by each of the matrices
. № 0\
■С'^Э-'ЧЗ·
State in the form of a single matrix each of the expressions R", S", T", where
ι is a positive integer, and prove one of these results by induction. (M.E.I.)

REVISION EXERCISE С
47. Show that there are two complex numbers ζ such that
|z-2-j| = 1
where a, b and с are non-zero constants, are α, β and the roots of the equation
ca? + 2bx + c = 0
are γ, δ. Without solving either equation, show that the equation whose roots
are ουγ+βδ and οώ+βγ is
a3x2-2ab*x + c(5b2-4ac) = 0.
Show also that if this last equation has equal roots then one and only one of the
original equations has equal roots. (M.E.I.)
49. Show that:
w K L -*· + **+!'
(ii) (,l+x)*<l+x/3 (x>0),
and deduce that, when χ > 0,
(1+XJ ^27 + 9^ + ^·
Hence show that the error in taking (1 + x)* as 1 + x/3 is 1
9л:2 + χ3
81+27л: + Зл:а"
ess than
(M.E.I.)
50. A machine produces components to any required length specification with a
standard deviation of 1-40 mm. At a certain setting it produces to a mean length
of 102-30 mm. Assuming the distribution of lengths to be normal, calculate:
(i) what percentage would be rejected as less than 100 mm long;
(ii) to what value, to the nearest 001 mm, the mean should be adjusted if this
rejection rate is to be 1 %;
(iii) whether at the new setting more than 1 % of components would exceed
107 mm in length. (M.E.I.)
51. Prove that, if z, w are complex numbers such that
|z-2| = \w + 2\ = 1,
then -%π =ξ arg (z-w) < %π.
Find the limits between which \z—w\ must lie.

REVISION EXERCISE С
52. If A = ί! 3X u = P\, A«u = П prove that, for large n, xn « yny/3
and interpret your result geometrically.
53. Prove that, for any positive integer r,
tan (2Г-Щ = cot (2τ~1θ)-2 cot (2rff).
Hence evaluate Σ 2r_1 tan (2r_10).
54. Two players stake £1 each in a game in which eight counters are tossed on
the ground. The counters are painted black on one side and white on the other.
If the number of blacks showing is odd the thrower wins the other stake, if all
counters are of the same colour he wins a double stake, and otherwise he loses
his stake. Calculate the expected gain of the thrower and state what assumptions
you make. (M.E.I.)
55. Prove that „
Zr! = \n(n+\)(2n+\).
The χ axis from χ = 1 to χ = 2 is divided into η equal intervals by the points
(1,0), (l + ^.o), (l + ^.o), ..·, il+^,oj, (2,0).
Rectangles with sides parallel to the axes are drawn on these intervals with one
vertex on the curve у = χ2 and the rest of each rectangle between the curve and
the χ axis. Show that the sum of the areas of these rectangles is
1+1
-9(Ю-
Show further that as η increases this sum increases but remains less than J.
56. Show that, if the roots α and β of the quadratic
ax* + (b + c)x + d = 0
are unequal, the equation
aun г^+1 + bun + cun+1 + d = 0
can be put in the form -^^—- = λ ——-,
where λ is given by the equation
57. If a/b - 1 is a small positive quantity not greater than 10"^, show that
/д\* За+ 6
W a + 3b
is not greater than 10_3iv_1.
664

REVISION EXERCISE С
58. The linear transformation, T, has matrix (relative to i, j, k)
.π -: -:v
\ 0 0 1/
Show that Τ leaves every point of the plane
2x+3y + z = 0
fixed and that, if Π is any plane containing the line of intersection of the two planes
x+y = 0, ζ = 0
then Π is mapped into itself by T.
59. OAB is a fixed triangle and points X, У are taken on О A, OB (or these sides
produced) in such a way that AB and XY are parallel. A Y and BX meet at P.
Show that, for varying positions of X, Y, the locus of Ρ is ОС, where С is a vertex
of the parallelogram OACB.
60. Explain what is meant by the set Rx R.
The function /; R χ R -» R is denned by the equation
Ax,y) = x+y.
Find the image, under /, of:
(i) {(x,y)eRxR:2x+y= 1}; (ii) {(x,y) e Rχ R: x*+y* = 1}.
61. Find the modulus and argument of 1 +cos θ+j sin Θ, where —π<θ<π,
and hence or otherwise express (1 +cos θ+j sin Θ)" in the form a+jb, where η is
a positive integer.
By writing cos Θ+j sin θ = ζ, and using the binomial expansion, prove that
*0—Θ
1+ COS0+ COS20+...+COS«0 Ξ (2 COS 45)" COS in0,
and obtain a similar identity with sines on the left-hand side. (O & Q
62. If the equation x3 + ax2 + bx + c = 0
has the property that one root is equal to the product of the other two, prove
that (b-cy + cia-iy = 0.
If a = f- and b — \, find the two values of с which satisfy this condition, and
solve the resulting cubic equations. (O & C)
63. Prove that the ellipse „ „
and the hyperbola xy = c2
touch each other if 2c2 = ab. Is the converse true?
Let D be one of the points of contact in the case 2c2 = ab. Show that if the
common normal at D meets the hyperbola again at A and the coordinate axes аХР,
Q, then the mid-point of PQ coincides with the mid-point of AD. (M.E.I.)
665

REVISION EXERCISE С
64. A probability density p(x) is given by
t N fC*(4-x) (0^x^4),
(О (л: < 0, 4 < χ),
where С is a positive constant. Find the value of С and sketch the distribution.
Explain why the standard deviation of this distribution is the same as the
standard deviation of the distribution with probability density
Pix)
jC(4-x2) (|x| <2),
" to (|x| > 2).
For the first distribution, calculate the mean, the standard deviation and the
mode. Calculate also the probability that a value of the variable χ taken at
random will lie further than one standard deviation away from the mean. (M.E.I.)
65. If a circle passes through the origin, show that its equation, written in
complex form, is zz*+g*z+gz* = 0, where ζ = x+jy, z* = x—\y, and g is a
complex constant.
If a variable point Ζ on the circle is transformed into a point W by means of
the equation w = a2/z, where α is a real constant, find the equation satisfied by w.
What geometrical locus does this equation represent? (O & C)
66. By solving the set of equations
-x1+x2 = b,
Злг!+2л:2 + x3 = с
or otherwise, find the ii
\0 0 1/
find B_1 and also, by any other method, the inverse of AB. State a set of
equations which this inverse (AB)-1 would help you to solve. (M.E.I.)
67. An unbiased six-sided die is thrown. If a six is thrown the die is thrown a
second time and the scores for the two throws are added; find
(i) the chance of a score of more than eight;
(ii) the average score.
If the die is thrown again whenever a six has been thrown, find the average
number of throws and the average total score. (O & C)
68. The position vectors relative to the origin О of the points Α,Β,Ρ are
respectively a, b, p. X is the point of trisection of AP nearer A, and Υ is the mid-point
of BP. Ζ is a vertex of the parallelogram PXZY. Write down, in terms of a,
b, p, the position vectors of Χ, Υ, Ζ.
666

REVISION EXERCISE С
If А, В are fixed points on a unit sphere, centre O, and Ρ is a variable point
on the sphere, deduce that the locus of Ζ is a sphere and find its radius and the
position vector of its centre.
). Prove that the lines
K*- 3) =
x+2 =
K*+2) =
y + A
Ky-3)
-b
= z-3,
= z+3,
= iz
are skew and find the equations of the line through the origin which intersects
all three lines.
70. UV is a diameter of an ellipse and P, Q are points on the ellipse. PU, QV
meet at X and PV, QU meet at Y; prove that the chord defined by XY is
bisected by UV.
71. Prove that, if ξ, is any root of the equation
z"=(z-2j)",
thenlm© = 1.
Draw a sketch to show the position of the roots in the case η = 3 and show
how they are related to the equations
(i) z3 = (z + 2j)*; (ii) z*=(z + 2)3.
72. Express the vector i-4j- 5k as
(i) the sum of three vectors in the directions i, i+j, i + j + k;
(ii) the sum of two vectors, one in the plane 2x + 4y + 3z = 1 and one
perpendicular to this plane.
73. A single die is thrown six times. Find the probability that:
(i) exactly one six is thrown;
(ii) at least two sixes are thrown;
(iii) each face of the die turns up once.
Find the probability that, when a die is thrown repeatedly, a six appears for
the first time at the rth throw. Find the least value of r such that there is more
than a 50 per cent chance of obtaining a six on the rth throw or earlier.
Write down expressions, in terms of ρ = f and q = f, for the expectations
Sir), £(r*) and calculate the numerical values of these expressions. (M.E.I.)
74. A transformation is defined by
: Ш
Show that the set of planes perpendicular to the у axis maps into a set of parallel
lines. What is the direction of these lines ? Which of these planes is mapped onto
a line through the origin? (S.M.P.)
75. In the binomial expansion of (l-л:)*, where χ is positive and less than 1,
show that the ratio of the term in x"+1 to that in xn is

REVISION EXERCISE С
By comparing the sum of all the terms after the first three with a suitable
infinite geometric series, prove that the magnitude of the error involved in
neglecting these terms is less than (5л:3/81) (1 -л:)-1.
Show that, if χ =ί 0-1, this error is, in magnitude, less than 0-00007 and hence,
using the binomial expansion, evaluate (0-9)* correct to three decimal places.
76. Prove that the points with coordinates (0, -2, -\\ (1,1,1) (5, 3, 4) and
(8,2, 5£) are the vertices of a trapezium in which the non-parallel sides are equal
in length. (S.M.P.)
77. The fixed point С and a variable point Ρ have position vectors с, г
respectively in three-dimensional Euclidean space. Describe in geometrical terms the
locus Г denned by г. с = Irl |c| cos α
where α is a given acute angle. If С is the point (a, b, c) and Ρ the point (x, y, z)
express the equation of Γ in cartesian form and show that, for all real numbers λ,
{χ, y,z)eV^. (λχ, Ay, λζ) e Г.
Taking the special case with С the point (4, 0, 3), cos α = 4/5, show that Γ
meets the plane ζ = 6 in a parabola and that the sphere |r- c| = 3 touches the
plane at the focus of this parabola. (M.E.I.)
78. Obtain a quadratic approximation for the expression
Vd+x)/(l-2x)
in the neighbourhood of χ = 0.
Write down the equation of the tangent to the curve
y= V(l+*)/0-2x)
at the point (0,1) and draw a sketch showing the relative position of the curve
and the tangent at this point.
79. ABCDA'B'C'D' is a parallelepiped, with faces ABCD, A'B'C'D' and
parallel edges AA', BB', CC, DD', etc. The mid-points of CC, AB', B'C', CD' are
respectively W, X, Y, Z. Prove that the plane XYZ cuts A W externally in the
ratio 1:3.
80. Find the five roots of the equation
z5+l = 0,
and plot their positions on an Argand diagram.
A regular pentagon ABCDE is inscribed in a circle of unit radius. Prove that,
if Ρ is any point of the circle, then PA .PB.PC.PD.PE ^ \j2.
81. If A is a 2 χ 2 singular matrix, prove that there is a number a such that, for
any integer к > 0, д* _ at-i\
Find α if A is the matrix
В is the matrix
prove that
for any positive integer n.
668

REVISION EXERCISE С
By considering the product (1 - x)" (1 + x)n,
prove that Д,(-1)'[ОТ = C«)"
83. A boy and girl play a game of' spotting' approaching red cars when travelling
along a road on which 25 % of the cars are red. If a car is red the first one to
'claim' it gains one point (a claim is always made by either the boy or girl for a
red car, but never by both simultaneously). If a car is not red it is found that on
half the occasions either the boy or the girl (but not both) makes a claim, for which
there is a penalty of two points. The first to be five or more points ahead is the
winner.
The events 'a claim by the boy' and 'the car is red' are independent and the
probability that he makes a claim on a car is 2/5.
(i) Draw a tree diagram in which the first two branches lead to the events
'red' and 'not red', and mark on the secondary branches the probabilities of
claims by the children.
(ii) If it is known that a claim has been made for a car which is not red, what
is the probability it was made by the girl ?
(iii) If the score is 8-5 in favour of the girl, show that the probability that she
will have won by the time two more cars have passed is 0-48. (S.M.P.)
84. Find the solution set of the equations
in the following cases :
(i) a = 9, b = - 1; (ii) a = 8, b =
For what value of a does the equation
7 J not have a unique solution;
Θ-ί
J 7 j not have a unique solution?
w
85. Use de Moivres's Theorem to prove that
cos ηθ = Σ(-1)' Γ" ) cos"-21· θ sin2' Θ,
and deduce that cos ηθ may be expressed as a polynomial in cos Θ.
Write д 2, n
cos ηθ - Σ ar cosr Θ.
By taking 0=0 prove that an = 2"-1. What is a„ if η is even ?

REVISION EXERCISE С
86. Find and illustrate graphically the set of points (x, y) that satisfy
simultaneously the inequalities
(x- l)3 < x- 1,
87.
M =
\\
By considering the matrix product M'M, or otherwise, prove that
I 2 a+a2 b+b2 I
a+a2 a2 + cfi ab + a2b2 = 0.
| b + b2 ab+cPb2 b2 + b*
Evaluate the determinant
x+y
x2+y2
2xy
x+y
2xy
x2+y2
88. Describe geometrically the sets of points in the Argand diagram defined by
(i) |z-l| = |z+j|; (ii) |z-l| = 2|z+j|;
(iii) |z-l| + |z+j| = 2; (iv) Re(z+1) = |z+j|.
89. The points P(ca,c/a) and Q (cfi.c/β) lie on the rectangular hyperbola
xy = c2, and (X, Y) is the mid-point of PQ. Find Xand Yin terms of c, α and
β. Write down a quadratic equation in t whose coefficients involve only c, X
and Υ and whose roots are α and β.
Determine what points (X, Y) can arise as mid-points of real chords of the
hyperbola. On a rough sketch shade in the regions they occupy. (O & C)
90. Show how the matrix
/cos 2
-cos 20/
can be associated with reflection in the line у = χ tan 0.
If В is the matrix associated similarly with reflection in the line у = х^З,
find the values of θ for which AB = BA.
Describe how these values could be predicted by geometrical argument
(M.E.I.)
91. A rectangular box has faces OABC,0'A'B'C and OO', etc. vertical. О А = 2
units, ОС = 1 unit, OO' = 1 unit. Taking О as origin and axes Ox, Oy, Oz along
О А, ОС, OO', find the equations of the planes OA'C, ACM, where Μ is the
mid-point of B'C Hence show that
(i) the line of intersection of these planes is parallel to the face О ABC;
(ii) the cosine of the acute angle between planes is 7V6/18.
92. А, В are two fixed points with position vectors a, b relative to some origin O.
Χ, Υ are two variable points with position vectors x, y. Show that, if
x+y = a and χ л у = b,
then Ο, Χ, Υ, Α are coplanar and OB is perpendicular to this plane. Hence find
670

REVISION EXERCISE С
the most general solution, in terms of a, b and a scalar parameter, of the vector
equations
x + y = а, хлу = b.
93. Prove that the matrix
is orthogonal and interpret Ρ as the matrix of a linear transformation. Show that,
if χ is very small, the matrix
Q=L 1 0
\0 0 1/
represents a small rotation, if we ignore powers of χ higher than the first. If SS
is a matrix all of whose elements are very small, and if
1+SS
is orthogonal, prove that the matrix <SS is skew-symmetric.
94. Two men, A and B, play a match consisting of separate games, the probability
of A's winning a game being ρ and the probability of B's winning being q.
(p+q is not necessarily equal to one.) They start the match with η counters each
and the winner of each game receives a counter from the loser; the first player to
win all In counters wins the match. If the probability that, when A has к counters
he will eventually win the match, is denoted by uk, prove that
(p+q) "к = puk+1+quk-!.
What is the initial probability that A will win the match?
95. Prove that, if и is a positive integer, then
0< (2-V2)"< 1.
Use the Binomial Theorem to deduce the existence of an integer m such that
(2m- 1) (2-V2)" < 2" < 2m(2-J2)n.
96. In the complex equation
zn+an_1z»-1+a„_„z«-a+...+a1z+l = 0
it is given that the complex numbers α„_1; α„_2,..., аг satisfy
K_!| < 1, |a„_.| < 1, ..., k| < 1.
Show that any root of the equation in the complex plane must lie in the annular
regi°n }<И<2. (C.S.)
97. The expressions Еъ Е2, E3 are defined as follows:
£Ί = x+y-2z-3,
E, = 2x-3y-4z + 4,
E3 = 3x + 5y + 3z - 5. (over)

REVISION EXERCISE С
If λ, μ are two numbers, explain the geometrical significance of the equation
Ε!+λΕ2 + μΕ3 = 0.
Find the equation of the plane which contains the line
(x+l) = -7(y-l) = -7(z-l)
and passes through the point of intersection of the planes £Ί = 0, Ег = 0
Ε, = 0.
E3 = 0.
98. Show that the equation
х*-Ъх2-Ъх+\ =
has real roots between 0 and 1 and between 2 and 3. Find the larger of these roots
correct to three significant figures and check that three figure accuracy has been
achieved. (O & C)
99. The plane Π is said to be invariant under the linear transformation Τ of three-
dimensional space into itself if every point of Π maps into another (possibly the
same) point of Π.
Find the equations of the planes through the origin which are invariant under
the linear transformation Γ with matrix A (relative to the usual base vectors) given
by
-1
100. The complex numbers z, z' are represented in the complex plane by the
points P(x, y) and P\x', y'). State in geometrical terms the transformation of the
plane Ρ -> Ρ' given by z' = z(cos Θ+j sin Θ). Verify your statement by deriving
from z' = z(cos 0+j sin0) the matrix form
β-Θ·
What geometrical transformation is given by
z' — c = (z—c) (cos 0+j sin Θ),
where с is a complex constant? If the transformation ζ ->z' corresponds to a
turn of 120° about the point (1,0) and the transformation z' -» z" corresponds to
a turn of 60° about the point (-3,0), show that z" — -z + 2^3j, and hence
express in geometrical terms the single transformation equivalent to the two
transformations in the order given. (S.M.P.)
101. The points Flt F2 are the foci of hyperbola and Pisa point of the branch
defined by F^B- F2B = к > 0. Indicate on a sketch the position of points P, Q
such that PlP-FtP>k, FlQ-F>Q<k,
and justify your answer.
By reflecting the figure in the line / which bisects the angle F2PFi internally, or
otherwise, show that В is the unique point of the hyperbola which lies on /.
Deduce a theorem relating the tangent at a point of the curve and the lines joining
that point to the foci. (M.E.I.)
672

REVISION EXERCISE С
102. If α is one of the complex fifth roots of unity, and χ = α- α4, show that
χί + 5χ2 + 5 = 0.
Express the other roots of this equation in terms of a. (O & Q
103. The tangent to the cubic curve у = ax3 at Д(лг1; y^ meets the curve again
at Рг(х2, Уг)- (Л is not at the origin.) The tangent at P2 meets the curve again at
Рз(*з, Уз) and so on. Show that xu x2, x3, ■·■ are in geometric progression.
The tangent at Pt meets the perpendicular from P3 to the χ axis at P. Show that
Pj, is the mid-point of P2P and that the locus of.P is another cubic curve. (M.E.I.)
104. S stands for three-dimensional space in which points are specified in the
usual way by coordinates (x, y, z), and Ρ stands for the plane ζ = 0 in which
points are specified by just two coordinates (x, y). A transformation
'(")
Л1 °
\0 1
maps points of S onto points of P; and a transformation
4Vg :!)©
maps points of Ρ onto points of S. Show that for 7Ί any given image point in
Ρ is associated with a set of points of S, and describe this set; and that for Тг
the range is a proper subset of Slt to be specified.
Show that for both transformations the line joining a point and its image is in
a fixed direction. Hence describe in geometrical terms the transformations
rtra and ГаГ!. (S.M.P.)
105. Show that the number of solutions of the equation
x+2y+3z = n,
where x, y, ζ, η are positive integers, is equal to the coefficient of tn in
1/{(1 _i)(l_,.)(!_,»)}.
106. Show that the roots of the equation
where А, В, С are positive real numbers and a > b > с lie one between a and b
and one between b and с
If A is small compared with В and С show that one of the roots is
approximately
a +
A(b-a)(c-a)
3(с-аНСф-аУ

REVISION EXERCISE С
107. Under what circumstances is it true that a.b =0?
Prove that, for any angle θ and any positive integer n,
Σ cos #+
r=o \
Prove also that Σ cos2 [Θ +
2m\
!>*K)·
108. A square matrix is called 'magic' if there exists a number к such that the
sum of the elements of each row is к and the sum of the elements of each column
is k. Prove that, if A is a non-singular magic matrix, so also is adj A.
109. Χ, Υ and (и- 2) other players play a game in which each player has an
equal chance of winning each round. The game is won by winning 8 consecutive
rounds of the game. Let
кк = 1,2, .
Show that
(и-
qk = Pr (Xwins\Yhas wt
rk= Pr (X wins\Xhas w,
..,7.
-Di* + r*= 1,
on the last к times),
on the last к times)
r* = ->Vn+11--I?i, for & = 1,2,...,6,
and find the corresponding result for к = 1.
Hence show that the chance that X will win the game is (и7- Щи8-!)
if Υ has just won one round but did not win the round before that. (M.E.I.)
110. The remainder when a polynomial Ρ in χ is divided by (л:-a)2 is the linear
polynomial rx+s. Prove that
r = P'(«) and s = P(*)-aP'(a).
The expression - s/r is of significance in the solution of the equation P(x) = 0
by a certain approximate method. Explain this, and show how the division
process for polynomials could be used to approximate to the roots of this equation.
Illustrate your answer by finding in this way closer approximation than -1,
3, 5, to the roots of , „ , „
x3 - 7л:2 + Ίχ +14 = 0.
Find the sum and product of your three approximations and explain how these
serve to check the answers. (S.M.P.)
111. Let Η be the set of matrices of the form
_ (a + bj -c + d}\
X-[c + di a-bi)'

REVISION EXERCISE С
where j = V(~ 1) and a> b, c, d are real numbers not all 0. Show that Η
contains a subset representing the complex numbers and prove that each element of
Η has an inverse in H.
The matrix X is expressed in the form
X = al + cJ + bK + dL,
""--Π-'-ΠΠ-β-^-Π·
Prove that J2 = K2 = L2 = -1 and that JK = -KJ = L.
Given that KL = -LK = J and LJ = - JL = K, evaluate XY, where
(a' + b'j -c' + d'h
\c' + d'i a'-b'i)'
by multiplying the expressions for X and Υ in terms of I, J, K, L. (М.ЕД.)
112. X and Υ are independent Poisson variates from distributions with
parameters μ and λ respectively. Find
Pr(X= r\X+Y= n)
forr= 0,1, ...,η.
The telephone exchange in a large firm handles both internal and external
calls. It is desired to investigate the distribution of internal calls and equipment
is available for recording the number of internal calls in each set of 10 calls.
Calculate the probabilities of 0, 1,2 and 3 internal calls in sets of 10 if the average
number of calls per set is 7. What is the most likely number of internal calls ?
(M.E.I.)
113. Show that if tabular values are given for χ = 0, 1, 2, 3 of the function
Rx) = a + bx+ cxix - 1) + dx(x- D (*- 2),
then the values of b, c, d can be obtained from the differences derived from the
table, e.g. b from/(l)-/(0).
Given the following table for a cubic polynomial g(x), find and check the
values of a, b, c, d required to express it in the form given above; and hence or
otherwise evaluate #(5), #(6), g(J) and #(8),
л: 0 1 2 3 4
g(x) 87-25 89-37 91-49 96-85 108-69 (M.E.I.)
114. Three bags each contain N tickets numbered 1 to N. Three tickets are
drawn, one from each bag; find the probability that the sum of the numbers
obtained is 2N.
115. Two circles, centres A and В and radii one unit, are drawn respectively in the
planes r.n = pu r.n2 = p2- Prove that they lie on the surface of a sphere if and
ОПу (a-b).(nt л n2) = 0 and Oi-^.b)2 = 02-n2-a)2.
675

REVISION EXERCISE С
116. ABCDEF is a hexagon inscribed in a circle centre O. If
ΔΑΟΒ = ACOD = Z£OF = я/3,
prove that the mid-points of ВС, DE, FA are the vertices of an equilateral
triangle.
117. The region E„ is denned by
шР + Ьу1 < 1,
where a > 0 and b > 0. Without assuming any properties of conies, prove that,
if Pi, Рг are the position vectors of points of Ea, then
(i) -ъеЕо, (η) ¥Pi+P,)eE0,
(iii) 2pt e £, where Ε is the region ал:2 + by2 < 4.
A system of elliptical discs congruent to E0 is obtained by translating the centre
О of E0 to all points g whose coordinates are integers. Prove that, if the discs with
centres at gb g2 overlap, then
(iv) for a common t, g1—teE0 and t- g2 e E0,
(v) i(gi-g2)e£o, (vi)gl-g2e£.
Deduce that if Ε contains no point, other than the origin, with integral
coefficients, then no members of the system S overlap. (M.E.I.)
118. The log-normal distribution is a probability distribution given by
for 0 < χ < oo. By using the transformation χ = /эе" and comparing with the
normal distribution show that:
(i) f'jtx) ax = 1, (ii) JJ0 xf(x) dx = /9e*a, (iii) j" xV(x) ax = p*d>«.
Deduce the values of the mean and variance of the distribution. (O & C)
119. If л: is a close rational approximation to the cube root of 2, and
x' = (x2 + x + 2)/(x2 + x + l),
show, by expressing л:'-л: in the form (2*-х)Дл:) and verify ingt hat 1 < f(x) < 2,
or otherwise, that x' is a better approximation than x, and that if one is below the
correct value the other is above. Use this method to find the cube root of 2
correct to two places of decimals. (O & C)
120. The surface of a sphere is divided into regions by r circles; prove that, if a
further circle is drawn on the surface, the number of regions is not increased by
more than 2r.
Prove that η circles on the surface of a sphere divide the surface into at most
n2-n + 2 regions.
Hence, or otherwise, prove that η spheres divide space into at most
\{пг-Ъп2 + Щ
regions. (O & Q
676

REVISION EXERCISE С
121. Prove that Σ ———-—— = -- — — —.
r=i r(r+l)(r+2) 4 2(и+1)(и+2)
It is desired to estimate 1000 л
Σ
,=i r(r+ 101) (r + 2001)
to within 0-001 by summing only the first N terms. Find a suitable value for N.
(M.E.I.)
122. Six integers a,- (i = 1 to 6) are independently and uniformly distributed in
the interval - Ν < α,- < N Find the probability that the simultaneous equations
агХ+а2у = a3, atx+aby = ae
have a unique solution.
Check your answer by enumerating the various possibilities for N = 1.
Show that the probability of obtaining a unique solution tends to 1 as N tends
to oo.
123. Show that, if a, b, c, d,p and q are positive integers such that
ape
b<q<d'
then p>(a + c)/A and q > (b + d)/A,
where Δ = be—ad.
Construct the rational numbers of denominator 20 or less between γ§ and j.
(CS.)
124. If in the expression
g(x) = a0xn + atx"-1 +... + an (a4 all real),
α> = α.-+ι = 0, for some ί in the interval 1 < ι < n—2, prove that the equation
g(x) = 0 has at least two complex roots.
The polynomial f(x) is given by
fix) = b0xn + b1xn-1+ ... + b„ (t>i all real).
Show that, by a suitable choice of α, β, the equation
audt.b.x-bt¥dAxi + №-1x-buf&) =0 (1 < j < и-1)
has at least two complex roots.
Deduce that, if b\ < ЬВ^Ь^Ъ then the equation f(x) = 0 has at least two
complex roots.
125. If ζ, ζ* are conjugate complex numbers, give a geometric description of
those numbers z for which . „. . „..
\ζ-ζ\ < \ζ-ζ*\.
Let z1; ...z„ be η complex numbers, the imaginary parts of which are strictly
positive, and put
Π (z-zd = z" + (a1+jb1)z''-1+... + (an+}bn),
where the аъ ..., an, Ьъ ..., bn are real. Show that the roots of
xn+a1xn~1+... + an = 0

REVISION EXERCISE С
126. From a population of N butterflies which are restricted to a certain habitat
a sample of и is taken. Each butterfly is marked and released again in the habitat.
A second sample of η is taken on the following day. Show that the probability
Ρ that exactly m butterflies in the second sample will be found to be marked is
of the form AuN, where A is independent of N and
uN = (N-n)\j{N\ (N-2n + m)y.
Show that the value of N which makes Ρ greatest is approximately equal to
nVm. (C.S.)
127. Let 5ГО denote the set of all 3x3 real matrices and define an operation *
between members of 5ГО as follows:
X*Y = X+Y-XY.
We define a matrix X to be quasi-regular if we can find a matrix Μ e $t such that
X * Μ = Μ * X = 0.
Prove the following results:
(i) 0 is quasi-regular.
(ii) If X is non-singular and quasi-regular, so also is X-1.
(iii) If Χ, Υ are both quasi-regular, then X * Υ is quasi-regular,
(iv) If X, Y, Z, are quasi-regular, then (X * Y) * Ζ = X * (Υ * Ζ),
(ν) If Xn = 0 for some positive integer n, then X is quasi-regular.
128. The polynomial P(x) is denned by
Длг+1) = P(x) + (l+x)n,
where л is a positive integer. If P(l) = 1, find P(0) and P(-1).
Prove that:
(i) P(-x) ξ Ρ(-χ-1) + (-1)ηχη;
(ύ)Ρ(χ)Ξ(-1)"-ιΡ(-χ-1).
Deduce that, if л is even, then the sum of the nth powers of the first к natural
numbers is a polynomial in k, with integral coefficients, which is divisible by
Щ+1)(2&+1).
What can you say about the sum of the nth powers of the first к natural
numbers in the case when η is odd ?
129. In the tetrahedron ABCD, the perpendiculars from A, B to the opposite
faces intersect; prove that the edges AB, CD are perpendicular.
Prove also that the perpendiculars from C, D to the opposite faces intersect.
130. The parametric vector equation of a line / through the origin in
three-dimensional Euclidean space is
г = tk,
where к is a constant unit vector and t denotes distance measured along / from
the origin. A point Ρ has position vector s. Find the position vector of the
reflection of Ρ in /, i.e. of the point Q such that PQ is bisected at right angles by Λ
If г = tki (i = 1, 2,) are two distinct lines through the origin, and
Si (i = 1, 2) are the operations of reflection with respect to these lines, prove
that 525Ί = SlS2 if and only if the two lines are perpendicular. (C.S.)
678

REVISION EXERCISE С
131. It is required to find the matrix A (relative to i, j, k) of the linear
transformation representing a positive rotation θ about the axis
Find the explicit form for A by proving the following results,
(i) Ifu = /i + wj + лк, ν = l'i + m') + n"k, w = ri + 7n"j + 7i*k
are three mutually perpendicular unit vectors, then
II V
Q=l
is orthogonal.
(ii) If the point Ρ has position vector χ referred to i, j, к as base vectors, then
it has position vector X = Q'x
referred to u, v, w as base vectors.
(iii) The orthogonal matrix
\0 sin θ с
represents a positive rotation of θ about the χ axis,
(iv) A = QRQ'.
(v) Г* + П=\-Р and m'n"-m"n' = l.
(vi)
(/2 + (l-/2)cos6> w/(l-cos0) -nsinff nl(l-cosff) + ,
lm(l-cos0) + nsme m2 + (l-m*) cos Θ nm(l-cos0)-
ln(l-cosff)-msme mn{\ - cos Θ) +1 sin θ л2 + (1-л2) cos θ
132. If a, b are real and a > 0, and if the equation
z3-az+b = 0
has a pair of complex roots Я ± j/t, prove that the point (λ, μ) is one of the points
of intersection of the curves
Зл:2-^2 = a, 2x(y2 + a) = 3b.
Draw rough sketches of these curves and deduce the condition for the given
cubic to have complex roots.
133. The coefficients a, bin the quadratic polynomial
p(z) = z* + az + b
are complex numbers and α2 Φ 4b. Prove that the locus of the point ζ which
varies in the Argand diagram in such a way thatp(z) takes real values is a
rectangular hyperbola with asymptotes parallel to the real and imaginary axes.
By a change of origin, or otherwise, determine the asymptotes.
Determine also the locus of ζ if it varies in such a way that p(z) is purely
imaginary. Sketch both loci on the Argand diagram and interpret algebraically
their points of intersection.
Prove that there are two real values of ζ and two purely imaginary values of ζ
for all four of which p(z) is purely imaginary if
{Re(a)}2 > 4Retf>) > -{Im(a)}2. (О &С)
n-a 679

Bibliography
The following books may usefully be added to the more comprehensive
bibliography of Volume 1.
Armitage, J. V. and Griffiths, Η. Β. A Companion to Advanced Mathematics
(Cambridge). Surveys the underlying concepts of modern algebra and analysis:
an exacting book for sixth-formers but could be read with profit by a student
embarking upon a university course. Contains an excellent selection of
questions.
Brand, T. and Sherlock, A. Matrices: Pure and Applied (Arnold). Illustrates
many of the interesting applications of matrix theory.
Budden, F. J. Complex Numbers and their Applications (Longmans). An
entertaining book illustrating applications of complex numbers in both pure and
applied mathematics.
Cundy, Η. Μ. and Rollett, A. P. Mathematical Models (Oxford). Gives
instructions for making many models. Full of stimulating ideas: very strongly
recommended.
Durran, J. H. Statistics and Probability (Cambridge). One of the best
introductory texts available; contains an outstanding selection of examples: very
strongly recommended.
Hardy, G. H. A Mathematician's Apology (Cambridge). A leading mathematician
writes about what the subject means to him.
Hardy, G. H. and Wright, Ε. Μ. An Introduction to the Theory of Numbers
(Oxford). A famous work on one of the most fascinating of all mathematical
topics—the properties of the integers and their generalizations. Although the
book goes well beyond school level it is charmingly written and contains much
that will appeal to an intelligent sixth-former.
Henrici, P. Elements of Numerical Analysis (Wiley). A very good book on
numerical methods, although rather advanced for sixth-formers.
Hunter, J. Number Theory (Oliver and Boyd). A more modern approach to the
subject than in Hardy and Wright: the text is somewhat exacting as it is
written for university students. Contains some good examples.
Kemeny, J. G. and Snell, J. L. Finite Markov Chains (van Nostrand).
Applications of matrix theory to probability. An interesting but not always easy book.
Kline, M. and others. Mathematics in The Modern World (Freeman). Readings
from 'Scientific American'. Contains a series of fascinating essays on a wide
variety of subjects.
Maynard Smith, J. Mathematical Ideas in Biology (Cambridge). Among much
other interesting material there is an account of the applications of probability
to genetics. The same subject is dealt with in greater detail in D. S. Falconer's
Quantitative Genetics (Oliver and Boyd).
Mendelson, B. Introduction to Topology (Blackie). Perhaps the most readable
introduction available to topology, an important branch of modern
mathematics.

BIBLIOGRAPHY
Midonick, Η. The Treasury of Mathematics (Pelican, 2 volumes). Selected
writings of famous mathematicians—all within the understanding of the sixth-
former.
Ogilvy, C. S. and Anderson, J. T. Excursions in Number Theory (Oxford). A very
elementary introduction to number theory. Contains an interesting chapter
on Fibonacci numbers.
Ore, O. Number Theory and its History (McGraw-Hill). Another very good
introductory work: more advanced than Ogilvy and Anderson, less so than
Hardy and Wright.
Rahman, N. A. Exercises in Probability and Statistics (Griffin). A formidable
collection of problems, going far beyond school level but containing much
challenging material suitable for a sixth-former. Also A Course in Theoretical
Statistics, which takes the subject well beyond school level.
Ribbans, J. Basic Numerical Analysis (Intertext). A good elementary account of
numerical methods.
Uspensky, J. V. Introduction to Mathematical Probability (McGraw-Hill). A
classic: advanced but not too difficult.
Watson, W. Α., Philipson, T. and Oates, P. J. Numerical Analysis (Arnold,
2 volumes). An elementary introduction to numerical methods: contains many
worked examples.
681

^

Answers
CHAPTER 17
PAGE
381 Ex.3, (i) -l+2j; (ii)l-4j; (iii) 7+ 4j; (iv) -8 + 6j;
(v) -26-18J.
382 Ex. 5. (i) - 5 +18j; (ii) - 99 + 20j; (iii) 27 + 36j.
Ex. 6. (i) (2-j)/5; (ii) (5 +j)/2; (iii) (4-3j)/5.
383 Ex. 8. (i) 3,-2; (ii) 5,-12; (iii) Д; ^.
Ex. 9. b, 0.
Ex.10. (iv)8-j; (v) (2-j)/5; (vi)ll-5j; (vii) 8-8j.
Ex. 11. R.
Ex. 12. (i) 5; (ii) 2; (iii) 1Д/2; (iv) 1; (v) 2 sin i<9.
Exercise 17(a)
1. (i)7 + 6j; (ii)9 + 17j; (iii)4 + 3j; (iv)ll+6j; (v)ll+22j;
(vi) 1 + 32j; (vii) 0; (viii) (3 - j)/5; . (ix) - 72 + 368j;
(x)-6-12j; (xi) (7+j)/50; (xii) - (11 + 29j)/37.
2. cos (θ1 - θ2) +j sin (0j - θ2); cos 2вг + j sin 2θλ;
cos 2(θ1 + ej + j sin 2(θ1 + θ2).
3. (i) 2±j; (ii) (1 ±j)/2; (iii) (5±j73)/2; (iv) ± 1 -j.
384 4. 3-2j; (i) (1 +j)A/2; (ii) 2+j. 5. 3 + 2j, 1-j.
6.(i)l, -I; (ii)2-j,2+j.
7. 1 - cos θ + j sin Θ, (1 + j cot Щ/2. 10. 2, pure imaginary.
12. (i)*2-4* + 5 = 0; (ii)*2-4* + 13 = 0; (iii) *2- 4x + 7 = 0;
(iv) *2 - 3(1 + j> + 5j = 0; (v) *2 - (5 + j>+4(1 + j) = 0.
13. (i) (x- 1-j) (x-1 +j); (ii) (x + 2)y) (x-2jy);
(iii) (j*;+jV3^) (^-jV3^); (iv)(j*-l)(* + l-j);
(w)(jx-2y)(x + 2}y); (vi) (x + 1) (2*-1-jV3) (2*-l +W3)/4;
(vii)IT(x±l/V2±j/V2).
14.2,3; (i)2; (ii)* + l.
15. l,j, -1, -j; 1, l+j,j, 0 according as η = 4m,4m+l,4m + 2 or
4m+3.
385 17. -22.
388 Ex. 21. (i) 2, ^π; (ii) 2л/2, £я; (iii) 1, π; (iv) 1, in; (v) 1, -*я;
(vi) 2, -|π; (vii) 2, -\n; (viii) |sec α|, α;
683

ANSWERS
PAGE
388 (ix) |sec «|, (π-2α)/2; (χ) 2 sin \a, (α- π)/2]
(in the last three parts the principal argument depends on a).
390 Ex. 25. (V3-1) + j(V3 +1), cos 75° = (д/3- 1)/2V2, etc.
Exercise 17(b)
396 1. (i) 5, -53°; (ii) Vl3, 56°; (iii) V5, -117°; (iv) V5/2, 63°; (v) V29,112°;
(vi) v'5, -153°; (vii) 13, -23°; (viii) 41, -103°.
2. (i) cos 30 + j sin 3(9; (ii) cos (Θ + φ) +j sin (Θ + φ); (iii) j; (iv) (1 + j) V2;
(v)j; (vi)(l-JV3)/2; (vii)j;
(viii) 4 cos \6 sin i$5(sin (Θ + $5)/2-j cos (θ + φ)/2).
3. (i) (π + 2φ-20)/2; (ii) (-3π + 2φ-20)/2.
7. Circle, centre 2+j, radius 1. 12. — j.
397 13. (i) (1 +j)/2; (ii) (1 ±jV3)/2; (iii) j, 1 -j; (iv) -1 +ij.
14. 9 + 6j. 16. (i) — f, f; (ii) semicircle, radius 1.
17. (1 -j tan Щ/2, (1 + j cot Щ/2. 18. Square.
Miscellaneous Exercise 17
1. (V3 - j) (1 + j), 2V2, ^π. 2. 8, 2.
3. l±V3+j(5±V3).
398 5. -l + 2j, (x-iy + (y-iY = i
399 14. (i) \z\ = V7; (ii)2+j, 3 + 2j.
400 15. #· = 4, 0 = 147-5°.
CHAPTER 18
402 Ex. 1. *> - 9*2 - Зл: + 4, 2.
Ex. 2. 25086278.
403 Ex. 3. 2600.
Ex.4. y= 1957л:-7373.
Ex.5. 2412, 1957, 1190.
405 Ex.7. -26, -8.
Ex.9. -0099,0-357.
407 Ex. 13. 2*2- 5* + 3.
Exercise 18(a)
1. (i)*2-3*-5;6; (ii) *3-9* + 4; 2;
(iii) 3*3-5*2-2*-7; -10; (iv) x3 + 4x* +16x + 59; 231;
(v) 7*4 - 35л3 +175л:2 - 875л: + 4372; - 21865.
684

ANSWERS
408 2. (ϊ)χ*-3χ-7; 7; (ii) x*-x+3; 0; (iii) л:4-4л:3-л:2+Зл:+5; 2;
(iv) (16л:3-44л:2-14л:+11)/8; f;
(v) (9xi - 63л:2 + 6л: - 40)/9; ^.
3.(i)159-l; C") 31-4; (iii) 283-3.
4. (i) (л-- l)3 + 4(л-1)2 + 7(x-1) + 5;
(ii) 2(л:-2)3-5(л:-2)2-14(л:-2) + 8;
(iii) (л-+3)3-2(л-+3)2-21(л:+3) + 50;
(iv) (л: + 2)4 - 8(л: + 2)3 + 24(л: + 2)2 - 32(л: + 2) + 15.
5. (ϊ)λ:3-^2 + 4^ + 2; 1-j;
(ii) *3 + (l+j)*2-(2-j)* + (l + 2j); -j.
6. (i) 2-6; (ii) 3-0. 7. (л: + 4)4 + 4(л: + 4)2; minimum; -4, -4, -4±2j.
8. л:5-Зл:3 + 2л:+1; 1. 9. A = 0, В = I. 10. A = Л, В = 4Л-17.
11.-6,139. 12.86,68.
409 13. л:2-8л:-8. 14. 2л:3-4л:2-х + 6.
15. /г+ 15/г(«- 1)/2+ 25/г(/г-1) (/г-2)/3 + 5/г(/г-1) (п-2) (п- 3)/2
+ /г(/г-1)(/г-2)(/г-3)(/г-4)/5.
Ex. 14. e.g. a = 5, Ъ = -2. Ex. 15. β = 3, й = 3, с = -1.
411 Ex. 17. (i) 4/(л- 1)-3/(л- + 2); (ii) 3/2(л-3)-1/2(л-+3);
(iii) 1/(λ-α)-1/(λ-+α); (iv) 1/(λ-α)+ 1/(лЧ-а).
412 Ex. 18. (i) In л/1 ·5; (ii) In (f).
Ex. 19. (i) 4(л--2)-2-6(л—3)-2;
(п){(л-+1)-2-36(л--6)-2}/7.
415 Ex. 21. (i) 2/(л—2)-3/(л-2)2-2/(л:+1);
(п)1/(л-+1)-4/(л-+1)2-1/(2л-+1).
Ex.22. (ϊ)1/(λ—1)-λ:/(λ:2 + 2);
(п)2/3(л--1) + (л--1)/3(л-2 + л-+1).
Ex.24. (ϊ)1/(λ-1)-1/(λ-2-2);
(π)1/(Λ-1)-{1/(λ-ν2)-1/(Λ-+ν2)}/2ν2.
Exercise 18(b)
1. (i) 2/(3 - χ) + 3/(1 - 2л-); (ii) 2/χ - 3/(χ + 2);
(iii) 7/(л- + 3) - 5/(2л- - 1); (iv) 1 /2(1 - χ) - 3/2(1 + л");
(ν) 2 + 3/л-5/(л-+1); (vi) 1/(л-1)-3/(лЧ-2) + 2/(л:-3);
(vii) 11 χ - 3/(1 - л") - 5/(1 + л"); (viii) - χ + 3/(л- 2) - 7/(* + 2);
(ix) л:2 +5л:+19+81/(л:-3)-16/(л:-2);
(х)1/л:2+1/л:-2/(1 + 2л:);
(xi) 1 /(1 - 2л:)2-1 /(1 -2л") +11(2 +х); (xii) л:/(2 + л:2)-1 /(1 + л");
(xiii) (2л-+1)/3(1-2л:+4л:2)-1/3(1 + 2л:); (xiv) 4/(л—3)2+1/(л—2);
(xv) 1-1/3(л-+1) + (л-4)/3(л-2 + 2);
(xvi) 1/(л-2)-1/л-4-2/л-3-2/л-2-1/л-.
2. А = -13, В =13, С = -11, D= 27.
3. 2/(л- Ι)* + 6/(х-1)3 + 4(л-1)2 + 1/(л- 1)- 1Цх + 4).
685

ANSWERS
PAGE
416 4. }-1/(л + 2); ^-(3/г2+18я + 26)/3(/г + 2) (л+3) (л+ 4).
5. (О 1β(1+χ)-(χ-2)ΐ3(1-χ+χ*);
(ii) 1 /3(1 + χ) - (1 + V3j)/3(2x -1 - V3j) - (1 - V3J)/3(2* -1 + V3J).
6. 1/(jc-2)-1/(jc-1); i-ll2(2y-l) + ll(y-l);
*-1/8(4*-l)+l/(2x-l).
Ex. 25. 0-9802.
Exercise 18(c)
420 1. (i) 1 +3*+9*2 + 27*3; (ii) y+l*+2V*2+sT*3!
(iii) 1 + 4л: +12*2 + 32л3; (iv) 1 + 2л:2;
2. (i) Ι+λ+Ι^ + Ιλ3; (u)1-2a:-2a:2-4^; (iii) 2-|*-^*2-rh*3;
(iv) ΗΑ^ + ϊΙβ^ + ϊΑβ*3; (ν) (i+^+^2 + 2T6^V2.
3. (i) 1 +jx-}x2 + riX3; (ii) 1 -8л:+40л:2-160л3;
(iii) 1+!*+!** +1?*»; (iv) 3V +Α^ + Ί^ + ίίήί*3;
(у)2-1^л-^л:2-То1зб^.
4. (i) 1 + 3* + 7*2 +15л3; (ii) \ + ^х + -2ψέχ* + уЦ^л3;
(iii) i-U+Ч-х2-^; (iv) Зл:-5л:2 + 7л3;
(у)1+8л:+26л:2 + 64л3.
5. ®1-|л: + ^л:2;
(ii)l+* + i*2; (iii) l+fjc+^jc2; (iv) 1-|jc+V*2;
(\)1-3χ+ψχ2.
6.(i)i-K^-i)+K^-D2; (ηΗ+Κ^+υ+^+Ό2;
(iii) 1 + (л-+1)-Кл-+1)2; (iv) l + ^x-D-Ux-^f;
(v) -2-K*+l)-m*+l)2·
8. (i) 0-99933; (ii) 1-0100; (iii) 2-0025; (iv) 0-92793.
421 9.9-997. \2.l+\x + f^x\
13. 2/(2*-1) + (1 -*)/(l + *2). 14. 2-009926.
Miscellaneous Exercise 18
1.Α = λ,Β= arbitrary, С = -Л-1.
3. 1, -(Л-+1), l/(*-2)-(* + l)/(*2 + *+l).
422 4. l/7(*-2)-(* + 3)/7(*2 + *+l); l/9(*-2)-l/9(*+l)-l/3(* + l)*.
5. i, i; -^ - (4л + 5)/8(2и + 1) (2я + 3).
7.(i)V3; GO (4)*; (Ш) 2/^/7; (iv)(3)i.
8. 2л3-13л:2 + 26л:-13, 4(ял:-л:2)/я2.
9. (7-*)/4; tangent at (-1, 2).
10. 249, /·4-8/·3 + 24/·2-34/·+21.

CHAPTER 19
PAGE
423 Ex.2. -29j.
424 Ex.4.j/2».
Ex. 6. 2/(3- 10i2 + 3i4) (1 - 15ί2 + 15ί*- ί6).
425 Ex. 8. (10- 15 cos 2(9+ 6 cos 4(9-cos 60)/32.
Ex. 9. 5π/32.
426 Ex. 10. (1 - χ cos 0 - xn cos ηθ + *n+1 cos (и - 1)0) (1 - 2x cos (9 + л:2)"1.
Ex. 11. χ sin (9(1 -2x cos (9 +л:2)-1.
Exercise 19(a)
1.(0-1; (ii)-j; (iii)j; (iv) (V3+j)/2; (v)l;
(vi) 16(-V3+j)/9V3; (vii) -(l+JV3)/32;
(viii) sec4 0(cos 46·—j sin 40)/16.
2.(i)-l; 00 16; ("О -28(l+jV3);
(iv)cos(n + mW+}sm(n + m)e.
3.(i) 1-8 sin2 (9+8 sin4 (9;
(ii) 5 sin 0- 20 sin3 0+16 sin5 0.
4.(i) 1-8 cos2 0+8 cos4 0;
(ii) 16 cos5 0-20 cos3 0 + 5 cos 0;
(Hi) 6 cos 0- 32 cos3 0+32 cos5 0.
5. 4f(l-f2)(l-6f2 + f4)-1.
6. cos {(2k+1)/14}, A: = 0 to 6; cos (2λ:π/7), A: = 0 to 6.
7. (i) (cos 50 + 5 cos 30+10 cos 0)/16;
(ii)(cos40-4cos20+3)/8;
(iii) (cos 70 + 7 cos 50 + 21 cos 30 + 35 cos 0)/64.
8. (i) (sin 50- 5 sin 30+10 sin 0)/16;
(ii) (35 sin 0-21 sin 30+7 sin 50-sin 70)/64;
(iii)(2sin20-sin40)/8.
9. (sin 90 - sin 70 - 4 sin 50 + 4 sin 30 + 6 sin 0)/256.
10. (i) 3π/16; (ii) 3π/512.
11. (i) (cos 0+cos /z0-cos (n+1)0-1) (2-2 cos 0)"1;
(ii) (sin 0 + sin «0-sin (n+ 1)0) (2-2 cos 0)"1.
12. (cos 0 + 2n+1 cos «0-2-2" cos (n+ 1)0) (5-4 cos 0)"1.
427 13. 2" cos" (0/2) cos (и0/2).
14. i cosec2 (0/2) {sin 0 + (и+ 1) sin (n- 1)0-/г sin ηθ].
15. (sin2 0-sinn+10 sin (n+1)0+ sinn+2 0 sin ηθ) (1 + sin2 0-sin 20)-1.
16. (Σί! - Σί213 Q (1 + h h h h ~ Σίι h)-\
17. (sin 30 sin3 0 - sin (2/г + 3)0 sin2n+3 0 - sin6 0 + sin (2/г + 1)0 sin2n+5 0)
x (1 - 2 sin2 0 cos 20 + sin4 0)-1.

ANSWERS
427 18. a = 1xs,b=-^E, с = f; kn±\n, rm.
428 Ex.l4.(i) ±1, ±j; (ii) ±2, ±2j; (iii) ±(1 ±j)/V2.
429 Ex. 18. -j, (±V3+j)/2.
Exercise 19(b)
430 l.(i)2,(-l±V3j); (ii)-l,(l±V3j)/2; (iii) j, (±V3-J)/2;
(iv) 1+j, {-(V3 + l) + (V3-l)j}/2, {(V3-l)-(V3 + l)j}/2.
2. (i)2-j, (ii) 1-3 + 0-79J; (iii) -0-28-1-8j; (iv) 1-4-1-Oj.
3. 1 + j, { - (V3 + 1) + (V3 - 1) j}/2, {(л/3 - 1) - (л/3 +1) Л/2;
cos π/12 = (1 +V3)/2V2, sin π/12 = (^/3-l)/ZJ2.
4. 1-1-0-088J.
5. (i) cos 3 — j sin 3;
(ii) cos (20^)/6+j sin (20-π)/6;
(iii) cos 20 - j sin 20; (iv) cosec1 0 cjs (π - 20)/8;
(ν)φ(40-π)/8.
7.(±V3±j)/2. 8. -l,(±l±V3j)/2.
9. ij, — wj, — w2j where ω is a complex cube root of one.
10. tan kn/5, к = 0 to 4.
11. cos 0(1 -sin 0)"1, 0 = 2kn/n.
431 15. (α + ύ + ο)(α + ωί, + ω2ο)(α+ω2ύ + ωο),
(α + ύ + c) (a2 + b2 + c2-bc-ca-ab);
(a + b + c + d)(a-b + c-d)(a+jb-c-}d)(a-jb-c+jd)
433 Ex. 25. (i) .ίπ; (ii) -^π; (iii) e""; (iv) e4
Exercise 19(c)
l.\ar + bs\,e. 2. (i) e^(l+j)/V2; (ii) e-».
3. (i) In 2^π/6; (ii) eb(cos In 2+j sin In 2).
4. Moves along real axis from e" to e~". 7. 2c2 cosh ν (cos к + cosh в)-1.
Miscellaneous Exercise 19
434 1. cos (2A: + l)7r/12+j sin (2k + l)n/12, к = 0,2, 3, 5, 6, 8, 9, 11.
3. 2-" cos 2/г0(1+ cos 0)"n.
4. (i) 2 cjs (π/4), 8 cjs (3π/4); (ii) 2 cjs (11 π/12), 2 cjs ( - 5π/12).
5. ±(-0-61 + l-2j), ±(-0-61-l-2j).
6. \{ac - bd+ (α2 + b2)* (с2 + tf2)*}.
9. (i) cos {a + K«-1)/?} sin \ηβ cosec }/ff;
(ii) sin {ot + iin-1)/?} sin ^ cosec ifi.
10. 35π/256.

ANSWERS
PAGE
435 11. (i) 2 cos \θ cjs \6; (ii) 2 sin \6 cjs (θ-π)/2;
(iii) V(cot Щ cjs \tt.
13. $5 = (2/·+ 1)π/5 14. tan {4/c-3)я}/4и.
15. (V3 + 1V2V2, 1/(л/6 + л/2).
16. - (3* + 3*), - (3*ω + 3V), - Ρ*ω2 + 3&ω).
437 23. in(n + l); A =-C = α(1-α)-2, 5 = (1-α)"1.
24. (*-1)Π (*2-2*cos2for/(2« + l) + l).
CHAPTER 20
439 Ex. 2. (2 + 3j)z + (2 - 3j)z* = 13.
Ex.4, (i) 6; (ii) -3j.
Ex.6. zz*-(l+2j)z-(l-2j)z*-4 = 0.
Ex.7. (i)l + 3j, 2; (ii) i-j, 1.
Exercise 20
447 1. (i) |h--3| = 1; (ii) |и>—j| = 1; (ш) |и>| = 2; (iv) |и>| = 1;
(ν) N = V2.
3. -1, ±j. 4. 1+j; 2+j, -1+j.
448 11. z(l -j) + z*(l +j) = 2. Interior of 'shark's fin' bounded by real axis,
\*-i\ = i, k-i-ij| = V6/4.
12. Circle, 2j/3, f. 13. Line. 14. Line. 15. Point j.
16. Circle, 2, V5. 17. Line. 19. arg i^|] = π-θ.
449 21. и- = 3+j- 1/z. 25. Arcs of three circles through origin, centres
j/2, 1/26, j'/2(a-1) and imaginary axis.
26. (i) 3ww*-2(w + w*) + l = 0; (ii) w + w* = }.
27.w = (z-j)(jz-l)-\
CHAPTER 21
450 Ex.1. 1+j, (-l+7j)/5.
454 Ex.2. (i)-Jf; (ii) -^; (iii) V47/2; (iv) ^.
Ex.3, (i) 3z2-10z+12 = 0; (ii) 3z2 + z+l = 0;
(iii)9z2-7z + 9 = 0.
Exercise 21(a)
455 1.3,-7; (ii) 2, 5*; (iii) (l-j)/2, -j; (iv)4-3j, -1;
(v)(2-6j)/5,-(l+2j)/5.
689

ANSWERS
PAGE
455 2. (i) z2-z-6 = 0; (ii) 6z2-z-2 = 0; (iii) z2-6z + 4 = 0;
(iv) z2- (5 + V2)z+ (2 + V2) = 0; (v) z2 - 6z + 25 = 0;
(vi)z2-z+l = 0; (vii)z2-(2+j)z+(3+j) = 0;
(viii)z2-(V2 + l)(l-j)2-3j = 0.
3. (i) 2-8, -1-3; (ii) 075±l-7j; (iii) 0-42 + 0-86J, -1-42-1-86J.
4. (i) 3, -9, 27, -i -972; (ii) i -f, ^-, - j, -^;
(iii)j, -1+j, l-2j,(l-j)/2, -5+j.
5.(0-1; (ii)W. (iii) л/113/2; (iv) 9^113/4.
6. (i) (3/c2 + 2/c- 7)/3; (ii) ψ^; (iii) - 4.
7. (ϊ)2(α2 + 8βό+8ύ2); (ii) 2(α+2ύ)/α2; (iii) 4|0(fl+u)}*|.
456 8. -a2mc/(62 + a2m2), 62c/(62+aW).
9. (i)2z2 + 3z-6 = 0; (ii) z2-5z-350 = 0; (iii) 7z2 + z-2 = 0;
(iv)4z2-29z + 49 = 0.
10. (i) 2z2-2(l -3j)z-5-9j = 0; (ii) z2-2jz + 5-j = 0;
(iii)z2-(2+14j)z-12 + 5j = 0.
11. (i) 4z2 + 7 = 0; (ii) 4z2-(10+ 10j)z+ 9j = 0;
(iii) 4z2-19z+44 = 0.
12. ±24j.
13. ±3(l+j).
14. (i) (4 + 5j)z2-(3+j)z-(2-j) = 0;
(ii)(2+j)z2 + (3-j)z-4+5j = 0.
19. (ca*)*+(c*a)i + b = 0.
20. ύ2 = lea.
Miscellaneous Exercise 21
459 1. (i) llz2-8z-400 = 0; (ii) z2-5(l+j)z + 47j = 0.
2. α2, β\ 7. a/A /?/a.
CHAPTER 22
461 Ex. 2. 2х-Ъу+\Ъ = 0, (5ί-1, 3i-2).
462 Ex.5. bV-αψ = агЪ\
463 Ex.6. (-1,3), (-4,4), (-2,5), (-3, -4).
Exercise 22(a)
464 1. 4*2+.y2 = 4. 2.x3 = y*. 3.y=x2-3x + 3.
4. *2+.y2 = 2. 5. x>+y* = axy. 6. ХЧ.У2 = x2y.
8. (i) (-1, 4); (ii) (2, 0); (iii) (2, 8); (iv) (4,-1).
9. ±i.
465 12. (-2, 5).
690

ANSWERS
PAGE
466 Ex. 8. PSZ = 90°, SPM bisected by tangent.
467 Ex. 11. 4a, a, 3a.
Ex. 14. (Jail, a), χ = 5a/2; (i) (-Πα/4, 0), x + 5a/4 = 0;
(ii)(-a, a),y + 3a = 0.
468 Ex. 20. я =-р-2/р.
Exercise 22(b)
472 1. (i) 4, (3, 2), (4, 2), * = 2; (ii) 2, (3, -1), (3, -i), 2y = -3;
(iii) 1,(1, -1), (f, -1), 4*= 5;
(iv)3,(-2, -3), (-2, -f),4j/ = 11;
(v) 4, (5, -3), (6, -3), χ = 4, (vi) 8, (2, 2), (2, 4), у = 0;
(vii)2, (0,4), (-i,4), 2*= 1.
2. (i)y_4*-2j/ + 5 = 0; (ii) x3+6x + 2y+4 = 0.
3. (i)y*-12x-6y+21 = 0; (ii) *>+2л:+3.у + 4 = О.
4. (i) x+4y + 9 = 0; (ii) x-6y-3 = 0; (iii) 5x-ly + 25 = 0;
(iv) Sx + 3y+30 = 0.
5.(4,-2). 6. (-4, 6). 7. -i,-i+ij.
473 8.^.
474 18. Same axis, focus; directrix is tangent at vertex of original parabola.
19. Parabola. 21. j>2 = lax. 22. Parabola, latus rectum a.
475 28. у = la, y*-2ax = a2. 30. у = 3*2-4*.
31. у = 2jc2-* + 2.
Exercise 22(c)
478 6.x=-3,y= 1. 7.(1, 1).
479 10. 2x-Sy±15c = 0.
Miscellaneous Exercise 22
2. Y/a, (У—2аХ)\2а*\ (b, 0), (b + ia, 0).
481 15. (3V7)c2. 18. (la, 2a J2).
CHAPTER 23
484 Ex. 1. 2 cjs (2k+ 1)я/10, /с = 0 to 4, i.e. iV0« + 2V5) +Кл/5- l)j, etc.
Ex.2. 1, -2,-2.
Ex.6. -l±j, ω, ω2.
488 Ex. 10. (i) 0, 0, ^(2±2V10); (ii) -1,-1, \(5± V10);
(iii)2,2,K-4±j2V2).
691

ANSWERS
Exercise 23(a)
PAGE
488 1. χ*-5*3-4*2 +16*-8 = 0. 2. x*-3x* + x* + 7x-30 = 0.
3. ;t5+3;c4 + 4*3 + 4*2+3*+l = o. 4. 1, 1, ±JV2.
5. -3,3±j. 6.i-|, -|, -f.
7. (i)4z3 + z2-l = 0; (ii)4z3-llz2+10z-4 = 0.
489 8. (i) z3-6z2-12z-32 = 0; (ii) и>3-15и>2-15и>-16 = 0.
9. (i) 27z3+ 3jz+1 +j = 0; (ii) ^-2г-4 = 0.
10. (i) 2z3-13z2 + 32z-21 = 0; (ii) 2z3+17z2 + 52z+64 = 0.
11. (i) z4 + z-5 = 0; (ii) z4-8z3 + 24z2-33z+ 13 = 0;
(iii)z4-10z2-z+25 = 0.
12. 3,3, f. 13. -l±j, -1±V2.
14. -j, ± V2-j.
17. 3 real roots if and only if -11 < к < 16.
18. 4 real if - 7 < к < 9, etc.
490 19. 2 real if к < -23 or -16 < к < 112;
4 real if -23 < к < -16; none real if к > 112.
20. G2 + 4#3 < 0.
491 Ex. 11. f, -f, ψ.
Ex. 12. 0, 0, 4j, 1.
Exercise 23(b)
494 1. CO 14; (ii) 46; (iii) 162; (iv) 10; (v) 17; (vi) 24; (vii) -};
(viii)22; (ix) -2; (x) 30.
2.(i) 16; (ii) -3; (iii) |; (iv) 67; (v) 32; (vi) 48; (vii) 18.
3.(ΐ)2^2; (ii)(p* + W)/9·; ("0 0.
4. -3, i, 4.
5. -|, -i,i 6. f, i, |. 7.-2,21,5.
8. 1, -2, -3.
495 9. 2, 3, -4. 10. -1, 2, 3. 11. ЪаЪс = аЧ+2Ь\
12. -3, -if.
13. (i) ζ3+2ρζ2+^2ζ-^2 = 0; (ii) z3-pz*-q* = 0;
(iii) z3+.pz-$ = 0; (iv) z3 + 4pz-8tf = 0.
14. (i) zi + 2,qz2 + (iq2+ps)z+qs = 0;
(п)23-2^2+^-^2 = 0.
17. (i) x'+Oq-p^+W-P^x+^-p'r = 0;
(ii) *3 + (9<?-3^2)л:-2^3 + 9^-27/· = 0;
(i)G.P.; (ii)A.P.
496 18. r = pq.
497 Ex. 15. (i) *>-5* + 9 = 0; (ii) 2*3 + 4*-7 = 0.
692

ANSWERS
Exercise 23(c)
PAGE
502 1. -2, l±j. 2. -|,(3±jV7)/4.
3. 2 cos π/9, - 2 cos 4я/9, - 2 cos 2π/9.
4. cos a, cos (§π + α), cos (|π + α) where cos 3a = f.
5.3*+3&, ω3* + ω23&,ω23* + ω3&.
6. 2-2*+ 2*, 2-ω2*+ω2*, 2-ω22* + ω2*.
7. 5+ 6*+6*, 5 + ω6* + ω26*, 5 + ω6* + ω26*.
8. -4, -ω, -ω2.
503 9. -2j,j(l±V2).
И- MV2±JV(2 + 4V2)}, M-V2±JV(2-4V2)}.
Miscellaneous Exercise 23
3. ±8, +288.
4. 1. 5. -2±j, ±j/V2. 6. 84, 708.
7. -i,i- 8. 1, -f;(l, 1, 1),(4, -*, -«.
9.3 roots if -81 =S A: < 44.
505 10. -§, —f, 1±V5.
ll.i20'+l)3+jp30' + 2) = 0.
CHAPTER 24
506 Ex.7. ±3c.
508 Ex.9. i+4j-3k.
Exercise 24(a)
512 4. (i)-2i-j + 5k; (ii) 8i-16j + 8k; (iii)i-j + k;
(iv) -4j+4k; (v) -bci + abk.
5. 0; coplanar with O. 6. -k, -j.
7. (6i + 3j + 2k)/7, f. 8. 3i, 1.
9.i(i+2j + 2k), 9, 9.
513 11. |a|"2. 14. α = r.a/{a.(b Л с)}.
Exercise 24(b)
518 1. (i) 3x-2y+z = 6; (ii) lx+2y-5z = 4; (iii) 4x-4y + z = 5;
(iv) χ - 5y + 3z = 7; (v) bcx -3cay + labz = 0.
2. (i) 3*+4j/-2z = -7; (ii) 2*-3j/+2z = 0; (Щ) x+3y-4z = -5;
(iv) 2*-7j/+4z = 1; (v) 2*-5j/-z = -2.
3. (i)6; (ii)V14; (iii) 2V29.
693

ANSWERS
PAGE
518 4. 0; lines intersect. 5. 2x + 3y-5z-21 = 0.
6. 7x-3y+z-16 = 0. 7.x+y-2z = 4.
519 8.(i)(4,-2,3); (ii)i+2k; (Hi) 6i-5j-3k;
(iv) K*-4) = -iC + 2) = -К2-З).
10. л: = iy = \z.
11. r = c+K[bi Л (а!-с)] Л [b2 Л (а,-с)]}.
12. 2δη$α.βγ). 13. aV6/3.
CHAPTER 25
523 Ex. 2. ^5, f{x) = fh>, 0 < χ < 100.
Ex. 3. 2 χ ΙΟ-4, τ^.
528 Ex. 8. i.
Exercise 25(a)
1. 1, i. 2. 2, 0-117. 3. in, I
4. (i) (*- l)jx for * > 1; (ii) 1 -e-2* for χ > 0;
(iii) Kl-cos π*) for 0 < x < 1.
529 5.1,1,1,1- 6. f, f, 1, тй-
7. 0, 0, 0, (π2-8)/4π2. 8. 1/Λ, (In 2)/Л, 0, 1/Л2.
9. Я*) = 0, *2/4, (2*-1)/4, (6*-2*>-5)/4, 1;
G(y) = 0, \y, K2Jy-D, etc.; ^ = dG/dj/.
10. £θ>) = 2/, 0 < у < 1, 2Х2-У), 1 < у < J2;
1, i;8(2V2-l)/15.
И-t£ т&.
530 12. 1, -4, 6.
13. е-*Ч е-3*'», oe-^Hl-e-*'»)2·
14. 1-5, Ц, 015.
15. 4/я, (2/я) Ь 2, 0-0785, V2-1, 0-854.
535 Ex.12, iz2, -l+2z-iz2.
Ex. 13. Triangular.
Exercise 25(b)
545 1. ■&. 2. l-1/ναθπ). 3. i 4. y£g.
5. 1, i. 6. i 7.4/-/Я.
546 8. 2/·2-16/·2/π2. 9. /2/4, /4/48.
10. α4{4π - π2 - (In 4)2}/4я2. 11. 1, 53, 160.
12. 0-149. 13.2-421, 2-641. 14.0-004.
15. 0-154. 16. 2-19, 78%, σ = 0156.
17. 10-825, 98, 0-271.
547 18. 502 g.
694

Miscellaneous Exercise 25
PAGE
547 1. 8a2/3, 1/V2. 2.18-6. 3. (l+ln4)/4.
4. l/(2Jy), jy, h &.
548 5. 0-301, 0-092. 6. 1 + exp [ - πχ/(σ^3)].
7. 0-683. 8. 01336, 1-630, 4-98.
9. Λ, Λ, 2λ. 10. 183р.
549 12. ^ 13. i. 15. 1-β-λ2,2А-2.
17. «1+jO-*, 2/W(1-z2); -f, 2/V45; 2/π, J{(n*-8)/2π2}.
18. λ/Ο*-λ).
550 19. (i)j> < 1/22-4; (ϋ),ρ = 0. 20. |.
21. 4-471, 5-000, 5-029; 0-2280.
23.1, V65/3, 0-387.
CHAPTER 26
554 Ex. 6. Absolute: 2|Ле,|, |е,/2>/л|, (Ы + Ы)1^(У1 + уг)-
Exercise 26(a)
559 1. 0-7513 ± 0-0006, 1-138310-0020.
2.(i) 22-3610-055; (ii) 88-7510-35; (Ш) 3-333 10-014.
3.0-31784. 4.0-043685(5). 5.5-95 + 0-10.
6. |е| sec2 у. 7. \уе\/(1 +у*)К 8. (i) 0-011; (ii) 0-034.
9.0-39. 10.0-23(1). 11. 2-15(4). 12. 1-15.
13. 1-94, -0-56, -1-38. 14. 1-56, -2-56.
17. η even: no real roots; η odd: one real (negative) root.
569 1. 1-44. Ζ4-64. 3. 1-38. 4. 5-14, -
5. 4-10. 6. 4-13 (5). 7. 2-30, -1-30.
8.2-65,-1-42,-4-23. 9.4-193.
10. 1-092, 1-931. 11.0-8447. 12. 0-7920.
13.0-60. 14.1 - (5k + 3)-1. 15. 0-877.
570 16. 0-838. 17. 3-591.
Miscellaneous Exercise 26
1. 1-532. 4. 60-3.
5. 0-88, 0-78, 0-69, 0-60, 0-50.

PAGE
571 8. 3-316625. 9. 1-81713.
572 12.1-162(28). 13. Κ+1/2Κ3-1/4Κ\
CHAPTER 27
576 Ex. 7. ( ± 3, 0). Ex. 8. (2, - 2), (0, - 2).
577 Ex. 10.0-205.
Exercise 27(a)
585 1. (i) i; (ii) V3/2; (iii) V2/V5. 2. f, (0, ± 2), у = ± §.
3. i, (2±V3, -1). 4. 1/V2, (-2, 1), (-4, 1).
586 5. i, (2, -1), (2, - 3). 6. |f. 7. л— 2j/ =
8. 4x+y=6. 9. (-1,2), (-2,1).
13. Mid-point of OS.
587 22. fly+i¥ = 4*2jA 23. 5, b.
27.*±3j/= ±5V2.
596 Ex. 37. Arc of a circle.
Exercise 27(b)
597 1. (i)|; (ii)V3; (iii) V0/2). 2. 2V3/3, (±2^/2, 0), J2x + 3 = 0.
3.|, (6, -l),(-4, -1), 5л:-21 = 0,5л:+11 = 0.
4.V!,(-3±V5,lX*=-3±4/A
5. (i) Зл:± 2j/ = 0; (ii) J3x±у = 0.
6. (i) л:-2^ = -3, л:+2.у = 1; (ii) x±<J2y = +V2.
7. 4л:2-^2 = 7. 8. 9л:2-4.у2-18л:-24.)/-32 = 0.
9. л:+.у-1=0. 10. л:+3>> = 4, -jf.
И. 2л:+^+3 = 0. 13. л:2-^2 = 8.
598 18. (1, 0), j/-2V2*-+2(V2+ 1) = 0.
19. 5a:-2j/-9 = 0, 2*+5j/-50 = 0, (20, -ψ).
600 Ex. 38. (i) Line parallel to θ = 0. (ii) Circle, (iii) Line at α to θ =
(iv) Quadrant of annulus. (v) Interior of circle, centre (a, 0).
Ex. 39. (i) /-(sin Θ+ cos Θ) = 1; (ii) r = 2 sin 0;
(iii)/-2-2/-(sin0+cos0)+l = 0.
Miscellaneous Exercise 27
605 1. 1/V2, (1, -3), (-2, -2).
606 3. ύ2/α2, л:2+уг — ax sec a — by cosec a = 0.
608 15. 2x(Xl -x2) + 2ΧΛ -j/2) = л-2 + y\ -x\-y%.
16. \ V(l + 4e + 3e2). 17. Ellipse, parabola.

PAGE
613 Ex.4. 3, -2.
617 Ex. 12.-L (J -
CHAPTER 28
ANSWERS
Exercise 28(a)
*»·(-» C)· —-
2x+y = 0, Ax- Ъу =
-.-.mi; ;,. 4.42(; "I).
-3/V13,
-2/V5,
1/V5\ (!),
619 9.(i)
)■ <M
5. x+2y = 0 and jc—у = 0 invariant.
6. x—Ay = 0 and 2x+y = 0 invariant. 8. Maps into χ = y.
3.2n+1-5.3n -5.2n+1 + 10.3n\
З.ги-З'н-1 -5.2n + 2.3n+1 }'
7.2"+1 + 15 З.г^-бХ
35.2" + 35 15.2n-14J'
f4(-l)»-3.2» 6(-l)«+
д_1)п_2п+1 3(-1Г+Ч-2"+
/5.3"-22"+2 -2.3" + 22"+1\
UVJ \l0.3n-5.22n+1 -4.3n + 5.22nj·
10. No. 11. -^; maps everything close to χ =
/11.2"+2-21/г-43 -ЗЗ.г^+Ч-ЗЗ/г + ббХ
• \7.2"+2-14/г-28 -21.2"+1 + 22и + 43/'
(« + l)and(2n+1-l).
mr~1J ~J'" "K~1J --rj-2n+1)·
V' l')^_1M»_')n4-l V_1W+1j_1n+! I»
(1 0 -28^
-7 8 7
0 0 -27У
/-5/V33\ / 1/3 V3\ /1/V3\
2/V33 , 1/3 V3, 1/V3 ·
\ 2/V33/ \-5/3V3/ \l/V3/
2. -2, 3, 0; A singular,

ANSWERS
PAGE
628 3.1, -1,2; detA
— 1, 1, i; eigenvah
4.1,
/1 1
\l/V2/ \f/ \l/V2/'
(ίί4
/1 -2 2\
5.1,2, -3; Ш -1 -2 .
\2 2 1/
629 6. ,y-z = 0, x-y = 0, 15*+13j/-8y = 0.
ΙΙ,λ+μ,λμ. 16. (^ _2°).
630 17. A if «odd, I if «even. 18. (J™ ~^.
(10 9 23\ /4 -2 -3\
5 9 14 J, A-1 = I 1 0 -1 J.
9 5 19/ \-2 1 2/
21. (17I + A-A2)/52.
(2n+4_15 _2n+3_4.3n + 12 2"+4-4.3n-12\
10.2"-10 -5.2n-2.3n + 8 10.2n-2.3n-8 .
-io.2n+io s^+s^-s -Б.г^+з^+я)
631 24. 0, ±jtani(9. 25. x + 2y-6z =
, where аг + 2/?г-6уг = 0, det Ρ φ 0.
2/
632 28. i* = \y =
= ΙΛ A 3 ,w
\7i 72 2/
CHAPTER 29
633 Ex. 1. (-α,Ο).
636 Ex. 5. x±y = 0.
Exercise 29(a)
637 3. (2- t)l(2t + 1). 4. i> +<? + r = 0.
638 8. (4ak\ 4ak). 11. (αί/(1 +13), αί2/(1 + ί3)).

ANSWERS
639 16.8:1.
640 Ex. 7. Ends of diameter.
Ex.8. (1,2, -1), 4.
641 Ex. 10. (2, 2, -1), 3.
644 Ex. 12. Elliptic cylinder.
Exercise 29(b)
652 1. (i) x2+y2 + z2 + 2y = 0; (ii) *2+j/2 + z2-4*+6j/ + 2z-ll = 0;
(iii) x*+y* + z2-2ax-2by-2cz = 0.
2. (i) (3, 1, -1), 4; (ii) (-2, 4, 1), 9; (iii) (a, £>, -c), \a-b + c\.
3. (i) *2+j/2 + z2-4*-2j/ + 2z-32 = 0;
(ii)*2+j/2 + z2-2;c-4z-16 = 0;
(iii) x2+y2 + z2 — ax—by — cz = 0.
4. (i) (4, 1, 3), V6; (ii) (5,-1, 2), 9.
5. i*J{2d2-(a-b)2}. AB is diameter.
6. Centroid of A ABC. ΣΡΑ\ = d2; locus a sphere, centre at centroid of
polygon Ax...An.
653 8. (p2+q2 + r2) (ax + by + czf = (ap + bq + erf (x2 +y2 + z2).
10. (i) Circle; (ii) pair of straight lines; (iii) hyperbola.
11. (i, 1, i), 3V2/2.
13. x-y + 5z= 7,(2, -2,3).
654 14. 12, 70. 16. Circle; rectangular hyperbola.
17. (1Д/2, -1, - 1/V2). 18. True. 19. c\:c\.
656 26. Ellipse; 2^/2/3; x-2y = 0, 2jc+j/ = 0.
27. Rectangular hyperbola. (± 5, ± 1), 5x+y = 0, x-5y = 0.
28. V(|); x + O±2j2)y = 0.
Revision exercise С
657 3. η > 0 and odd; η > 0 only. 4. -1, 0, 3. 5. 8-2°, 90°.
6.0-067. 7.-1,1. 8.1:1.
658 10. Cancellation.
11. (V2, -$π), (V2, Ю; £π.
12. (i) l/(*-l)-4/(*+2); (ii) (2 + 3*)/(l +*2)-3/*;
(iii) 1 /(1 - x) +1 /(1 - x)2 +1 /(1 - 2x).
13. 12-67. 14. (2αί2, 3at).
I5.(i){-1 <*< 1}; (ii){*>l}; (iii) {x = 1};
(iv) {-2 < * < -1}; (v){-l<A;<l);£=i'nC,F=BuC.
659 16. Line * +j/ = α + b. 17. (i) ff; (ii) -&.
699

ANSWERS
/О О 0\
659 18. 12 О 01.
\0 1 О/
20. 7я/12, 5я/4, 23я/12.
21. -Ι+ik, -1/к. 22. л: < -5 or -3 < χ < 7.
23. 1/(1- 3*)- 2/(1 -2л:); -1-х + х\
24. |о|, 0.
660 25. Plane perpendicular to О Α. 26. 1, - 2, - 2, 6.
27. -1, (-1 ± Vl05)/2. 28. \, 0-215.
30. Кби3 - Зи2 - и), ЩгР + Зи2 - и). 31. 2:1.
661 33. (i) 60° < θ < 109-5°, 250-5° < θ < 300°;
(ii) 0° < θ < 30°, 150° < θ < 199-5°, 240-5° < θ < 360°.
35. Λ =-2, (l,f,f).
36. (i) (2ί +1)3 (3ί +1)2/432; (ii) ^; (Hi) 3-5.
38. (i) 1/V33; (ii) 3/V33.
662 42. 3. 43. -2 < χ < О, 2 < χ < 7. 44. (2, -3, 7), 2.
663 47. л/2, 2V2. 50. 5-02, 103-26, no (1 % exceed 106-52).
51. 2, 6.
664 53. cot 0-2" cot (2n0). 54.3/128.
665 60. (i) R; (ii) {x e J?: - V2 < χ < л/2}·
61. 2 cos i<9, £0; (2 cos Wn sin i«0.
62. -1, -};i, -1, -2; -i, -l±Vi
666 64. ^; 2, 0-89, 2; 0-374. 65. ^и>+£-*и>* +α2 = 0.
/1 0 0\ /1 -4 14\ /-73 -32 14\
66. /1 1 01, JO 1 -з), I 16 7 -3J.
\5 -2 l/ \0 0 1/ \ -5 -2 1/
67. iff, f,^-. 68.£;£(4a + 3b).
667 69. * = -j//5 = z/2.
72. (i) 5i + (i+j)-5(i+j + k); (ii) (3i-2k)-(2i+4j + 3k).
73. (f)5, 1 -(f)6-(f)5, (6!) (£)·, (f)-V6; 4; 6, 66.
74. -2i+j + 3k, j/=0. 75.0-966.
668 77. 16j/2-24*z + 7z2 = 0, 4f = 36*- 63.
78. 1 + 5*/2 + 39*2/8, 5х-2у + 2 = 0, curve above tangent.
669 83. i 84. (i) (1, -2, 1); (ii) no solution; (iii) (Л, -ЗЛ, Λ + i); 8; 16.
85. (-1)4
670 87.0. 88. (i) Line; (ii) circle; (iii) ellipse; (iv) parabola.
89. c(a+/?)/2, c(a+/?)/(2a/?); cYt*-2XYt + cX = 0.
90.1(3и-1)я.
700

ANSWERS
PAGE
670 92. χ = Ла+(а л b)/|a|2, у = (1 -Л)а-(а л b)/|a|2.
671 94.pnl(pn+qn). 97. 2x+9y+5z-12 = 0.
672 98.206. 99. 5x + 3y+4z = 0,y-z = 0, x-y = 0.
100. Half-turn about (0, V3).
673 102. a4 - a, a2 - a3, a3 - a2.
104. Line in direction 3i + 2j + k; plane x+y-4z = 0.
674 110. -0 96, 2 88, 5 08.
675 112. (") μ"λ»-η(λ+μ)η; Ο, 00001, 0 0014, 00090; 7.
113. 130-25, 164-77, 215-49, 285-65.
114. (iV-l)(iV+4)/(2iV3).
676 И8.реЧр2(е2а-еа). 119.1-26.
677 121. Л^ > 21.
122. 8Л^(Л^+ l)/(2iV+ l)3.
123.1 ϋ, Ц.
678 128. Integral polynomial in k divisible by k(k+1).
679 133. 2z = - Re (a), 2z = - j Im (a). Rectangular hyperbola. p(z) = 0.

^

Index
Abel, 483
absolute error, 553
affix, 385
ApoIIonius, 603
Argand diagram, 385
argument, 386
auxiliary circle
of ellipse, 577; of hyperbola, 593
Buffon, 549
C, 380
Cardan, 496
cardioid, 447
Cauchy distribution, 549
Cayley-Hamilton theorem, 626
central limit theorem, 543
characteristic equation, 609
chi-squared distribution, 549
complex numbers, 381
addition of, 381; argument of, 386;
conjugate of, 382; division of, 382;
equality of, 381; imaginary part of, 382;
logarithm of, 432; modulus of, 382;
real part of, 382
complex plane, 385
cone, 645
conic, 573
central, 595; conjugate diameters of,
578; diameter of, 578; directrix of, 573;
eccentricity of, 573; focus of, 573
conjugate complex number, 382
с roots of real equation, 484
continuous random variable, 520
cover-up rule, 411
cubic equation, 496
discriminant of, 501; irreducible case,
500
cumulative distribution function, 526
cylinder, 644
de Moivre's theorem, 423
density function, 521
Descartes's rule of signs, 558
diagonal form, 617, 620
diameter, 578
diametral plane, 642
differences, 403
director circle, 595
distribution function, 524
double point, 636
eigenvalue, 610
eigenvector, 610
ellipse, 573
construction of, 577, 582
ellipsoid, 655
equations
of curve in Argand diagram, 438;
polynomial, 483; quadratic, 450; sum
and product of roots, 451, 490
equilateral hyperbola, 593
expectation, 527
exponential distribution, 543
field, 379
focal distance property
of ellipse, 581; of hyperbola, 594
folium of Descartes, 636
Fregier point, 482
fundamental theorem of algebra, 380,483
Galois, 483
generator of cone, 646
Gregory-Newton formula, 406
helix, 645
Horner's method of synthetic division,
402
hyperbola, 573
asymptotes, 591; conjugate axis, 591;
of ApoIIonius, 608; rectangular, 593;
transverse axis, 591
hyperboloid, 655
initial line, 599
invariant line, 609
inversion, 441
iterative methods, 560
linear interpolation, 556
log-normal distribution, 676
major axis, 576
703

INDEX
mean, 527
mean deviation, 528
median, 523
minor axis, 576
mode, 523
modulus, 382
modulus-argument form, 387
multiple roots, 484
nested multiplication, 402
Newton
conic theorem, 480, 607; -Raphson, 563
nilpotent, 629
normal distribution, 535
oblate spheroid, 655
ogive curve, 527
orthogonal matrix, 614
parabola, 465, 573
axis, 465; directrix, 465; family of, 475;
focal chord, 468; focus, 465; Iatus
rectum, 467; vertex, 465
paraboloid, 655
parity, 558
partial fractions, 409
polar
coordinates, 599; equations, 447
probability density function, 521
prolate spheroid, 655
pure imaginary, 385
purely random process, 544
quartic, 503
quartiles, 524
real quadratic function, 457
rectangular distribution, 530
rectangular hyperbola, 475, 593
reflection property
of ellipse, 582; of parabola, 470
relative error, 553
roots, coincident (repeated, multiple), 450,
484
roots of unity, 429
rounding-off, 552
scalar triple product, 510
second-order process, 566
semi-cubical parabola, 633
similar matrices, 620
skew symmetric matrix, 629
sphere, 640
standard normal distribution, 537
stationary process, 544
symmetric matrix, 616
synthetic division, 401
trace, 629
translation of axes, 462
triangle inequalities, 390
triangular distribution, 534
uniform distribution, 530
variance, 528
vector
area, 509; product, 506; projection,
570; triple product, 510
volume
of parallelepiped, 511; of tetrahedron,
511
von Mises's iteration, 567