Where two lines meet, there is an angle. We can add angles together, we
can multiply them by numbers (yielding angles), we can divide one angle by
another to get a number; in all this, angles are just like lengths. We can
measure the sizes of angles relative to one another but, until we chose an
angle to use as unit
, angles aren't numbers. A quarter turn isn't
π/2 or, indeed, 1/4; it's a quarter turn; and it is a right angle;
but it isn't a number. Here turn
serves as a unit of angle entrenched
in the English language: it's the angle through which one has to turn to get
back to where one started. A quarter of that angle is the right
angle
one gets between perpendicular
lines.
Angles also have direction: so there are two quarter turns
– one clockwise, the other anticlockwise. I'll distinguish these as
the right
angle and the left
angle. For compatibility with the
notion of turn right
or turn left
, taken in context of some
orthodox descriptions, presume that a right angle is a quarter turn clockwise
while a left angle is a quarter turn anticlockwise; and describe the left
angle as positive, right angle as negative; however, all that matters is that
left and right be in opposite senses and that one is negative, the other
positive.
The SI unit of angle is the radian; 2.π radians = one turn. I'll come back to why later. The right (or left) angle, already mentioned, is another candidate for use as a unit of angle; as is the about-face or half turn. I can't say I'm much interested in the degree, though – it's a hang-over from the Babylonian approximation to the year as 360 days, albeit 360 is a highly factorisable number (it has 5 as a factor once, 3 twice and 2 thrice). There's also a unit called the grad, equal to turn / 400, which I believe originated in military usage (among gunners); this subdivides the right angle into 100 equal parts, making it marginally tidier than the degree; but I do not find it a compelling choice for unit of angle – at best, it is the centi-unit when the chosen unit is the quarter turn.
Rotation through a whole turn gets you back to where you were when the turn started. In two dimensions, a half turn negates both co-ordinates; repeating it to make a whole turn illustrates (−1)**2 = 1; the quarter turn will in due course illustrate what a square-root of −1 looks like. It is worth noting that one needs at least two dimensions to have angles; with more dimensions, more complex options enter the picture, but our discussion of any given angle can, at least, be reduced by projection onto the (at most) two dimensions spanned by the lines forming the angle. In one dimension, we still – sort of – have angles; but only half and whole turns: as before, the half turn takes the rôle of −1, we see its square is 1 and might pause to dream of the quarter turn as its square root.
One way or another, there are good reasons in geometry and linear algebra for wanting to measure angles in turns or nice easy fractions of the turn. So why have the learned bods of the international institutes chosen the radian, an irrational fraction of the turn, as the SI unit ?
To answer that, it will be necessary to discuss some trigonometry; which shall reinforce the case for the turn but equip us to state the case for the radian.
Figure: two lines come out of a right angle, one going forward the other upwards; the front of the former is joined to the top of the latter by a line, called the hypotenuse; this is longer than either of the original two lines. At the front of the triangle, the hypotenuse meets the forward line; the angle between them is labelled a. The sides are also labelled with: h on the hypotenuse, h.Sin(a) on the upright and h.Cos(a) on the forward edge. The labels are the size of the angle and lengths of the sides.
By such a diagram or
otherwise, the functions Sin (short for sine
and pronounced like sign)
and Cos (for cosine) are defined; each takes an angle as input and produces a
ratio (a pure number) as output; a is an angle, Sin(a) is the length ratio of
the upright to the hypotenuse, as Cos(a) is for forward to hypotenuse. [The
ratio of upright to forward is called the tangent, written Tan(a) and equal to
Sin(a) / Cos(a). But it and the rest of
that family are peripheral to the present discussion.] We can use Sin and
Cos to express Pythagoras' theorem as:
The diagram only really gives us Sin(a) and Cos(a) for a between zero
and a quarter turn; and, at that, interprets a as positive, so we'd better
call that quarter turn the left angle
(positive by the conventions
chosen above). However, various structural truths about
Sin and Cos emerge, in particular it is possible to compute the Sin and Cos of
a+b and a−b from the Sin and Cos of a and b, at least wherever a+b or
a−b was an angle for which Sin and Cos are defined; by accepting the
answers this gives where Sin and Cos aren't defined, we can extend the two
functions to arbitrarily large angle. Here are the relevant
formulae:
These addition and subtraction formulae can (for instance) be obtained by using the given definition to infer the co-ordinates of the linear map which implements a rotation through an angle a; (: [x.Cos(a) −y.Sin(a), x.Sin(a) +y.Cos(a)] ← [x,y] :); applying this for angles a and b we can compose the linear maps, by matrix multiplication, to obtain an implementation of rotation through angle a+b, entirely in terms of the cosine and sine of a and b; we can equally apply the same reasoning for a+b as we did for a and b to obtain rotation through a+b in terms of the cosine and sine of a+b itself; comparing this with the composite yields the first two formulae given above. Either by similar reasoning or by substituting c=a+b, d=a, c−d=b and subsequent re-naming, one can obtain the last two. Note that the corresponding equations for tangent can be cast purely in terms of Tan:
For the subtraction formulae with a = b, we promptly obtain Sin(zero) = 0 and, via Pythagoras' theorem, Cos(zero) = 1. Applying the same formulae with a = zero instead, we obtain
i.e. Sin is an odd function (negating its input negates its output) and Cos is an even function (negating input doesn't change output). These, in turn, turn the formulae for a−b into special cases of the formulae for a+b, by simply replacing b with −b. (They also imply that Tan = Sin/Cos is odd.)
Just as the original triangle only established Sin and Cos for angles between zero and a quarter turn, it only strictly lets us assert Pythagoras' theorem for those angles. However, by establishing that the above sum and difference formulae yield results which satisfy Pythagoras' theorem for a+b and a−b when it holds for a and b, we can induce the theorem's validity for arbitrary angles, just as we can infer the Sin and Cos of arbitrary angle using the above formulae. From Sin being odd and Cos even, we can immediately infer that Pythagoras' theorem holds for −b whenever it holds for b, saving us the need to examine the a−b case. So, given Pythagoras' theorem for angles a and b:
by applying Pythagoras first to a and then to b.
Furthermore, we can (by adding and subtracting suitable formulae above) infer the product formulae:
I'll use left
as a positive quarter turn and right
a
negative one. In our original triangle, the angle at the top of the
hypotenuse is just left−a (because the sum of angles in the triangle is
a half turn, 2.left, and the right angle accounts for left of that, leaving
left for the other two). This is equal to a precisely when a is
left/2. Consequently the two sides other than the hypotenuse are equal; so
Sin(a).Sin(a) = Cos(a).Cos(a) and their sum is 1, so each is 1/2, whence
Sin(left/2) = 1/√2 = Cos(left/2). Thus
and we can use the fact that Sin is odd and Cos is even to infer the corresponding results for multiples of the right angle (i.e. negative multiples of left).
It follows immediately that
These tell us that the two sinusoids are periodic (they repeat themselves after a turn), advancing either by a half turn negates it, and each may be obtained by advancing or backing up the other by a quarter turn. We should also note one further important structural fact (for symmetry, expressed here in two ways):
which you can actually read off from the original triangle which I
used to define Sin and Cos, by looking at the angle at the top of the
hypotenuse, which is left−a. We can use the addition formulae to
compute Sin and Cos of any multiple of an angle,
as polynomials in the Sin and Cos of the
original angle. In particular, this lets us express the (known) Sin and Cos
of the turn as polynomials in the Sin and Cos of the result of dividing the
turn by any positive integer; by solving the resulting polynomial equations,
we can (with suitable care) infer the Sin and Cos of such fractions of the
turn; by scaling those fractions up, we are thus able to compute the Sin and
Cos of any rational (i.e. integer divided by positive integer) multiple of the
turn. Since the relevant polynomials all have integer coefficients and the
Sin and Cos of the (quarter, half and whole) turn are integers, we also infer
that the Sin and Cos of any rational multiple of the turn is
an algebraic number
(i.e. one that's a solution to some polynomial
equation whose coefficients are all integers). This could sanely be construed
as a powerful argument in favour of using the turn (or a rational multiple of
it) as unit of angle.
That pretty much re-iterates the virtues of the turn and its fractions as
units of angle; but it also sets the scene for the perspective that leads SI
to measure angles in radians. The attentive will notice that, when a
is small
(by comparison to a quarter turn or other angle of roughly its
size) Sin(a) is approximately proportional to a; the very careful will see
that, in fact, Sin(a) is approximately 2.π.a/turn = a/radian for such small
a. However, rather than approaching this approximately, I'll derive the
constant of proportionality as a side-effect of …
Draw a circle of radius R: use its centre as the origin for cartesian co-ordinates, [x, y]. Each position on the circle may be characterised by the radial line from the origin to it; which, in turn, may be uniquely identified by the angle between it and a chosen co-ordinate direction. Indeed, if this angle is a, measured from the positive x-axis in the sense which has the positive y-axis's angle equal the positive quarter turn (i.e. left), the position on the circle may readily be shown to be [x, y] = R.[Cos(a), Sin(a)]. Draw a tangent to the circle at our position at angle a. This tangent is at right angles to the radius, so is parallel to the unit vector [−Sin(a), Cos(a)].
Now, for a given angular velocity
w (whose units are angle/time),
we can consider an object moving around the circle so that its angle at time t
is just w.t: its position is then [x, y] = R.[Cos(w.t), Sin(w.t)] and its
velocity is V.[−Sin(w.t), Cos(w.t)] for some speed V (whose units are
length/time). It is easy enough to verify that the absolute value of V is the
speed of our moving object, which is just the circumference divided by the
period. The circumference is 2.π.R, the period is turn/w, so we obtain
abs(V) = 2.π.R.w/turn = R.w/radian. As to the sign of V, observe that
(when w is positive) y is increasing exactly when x is positive while x is
increasing exactly when y is negative: whence we find V = R.w/radian.
Thus (cancelling out the common factor of R), [Cos(w.t), Sin(w.t)] ←t has derivative [−Sin(w.t), Cos(w.t)].w/radian ←t, from which we may infer that the derivative of Sin is Sin' = Cos/radian and that of Cos is Cos' = −Sin/radian. These may be re-written as:
Thus differentiation of the sinusoids is add a quarter turn to the
input, divide the output by radian
. SI uses the radian as its unit of
angle because one can read this as saying that differentiation wants to
measure angles in radians; yet, even differentiation also uses the quarter
turn.
For the object going round a circular path, above, let its position be p;
so p = R.[Cos(w.t), Sin(w.t)] for some angular velocity, w, and the object's
velocity is R.[−Sin(w.t), Cos(w.t)].w/radian. Its accelleration is then
−R.[Cos(w.t), Sin(w.t)].w.w/radian/radian, so ddp/dt/dt =
−p.(w/radian).(w/radian). Thus, again, the radian shows up as the
convenient unit of angle for use when looking at the (very common) case of
the simple harmonic oscillator
, a system whose dynamics are described
by the second derivative of a quantity being equal to a negative multiple of
the quantity itself. This special case likely contributed heavily to SI's
choice in the matter.
Of course, in these terms, radian wants to be an imaginary unit (as time is, in relation to spatial distance; the speed of light is more compellingly a square root of −1 than the radian is a dimensionless unit); then the negation involved in the above gets swallowed by radian. This also fits nicely with the fact that 2.π is the period along the imaginary axis of the exponential function on the complex plane.
Now it's possible to take (pretty much) any function (U: f |V), with U and V both real vector spaces, and apply a transformation, F, to it, defined (using i as a square root of −1) by:
for any choice we like of scalings P, Q, R, S and T. (The Fourier transform uses R = 2.π, Q = −1, P = S = T = 1; its inverse uses P = Q = R = S = T = 1.) It is easy enough to show that replacing S with 1 and Q with Q/S doesn't change F; nor does replacing R with 1, P with P/R and Q with Q/R; nor does replacing R with 1 and T with T/power(dim, R) where dim is the dimension of V. If we apply this transformation twice we get
for some scale computable from P, Q, R, S and T. Aside from the scaling, F∘F is just (: f∘negate ←f :), with negate = (: −x ←x :). Clearly repeating this yields the identity as F∘F∘F∘F. Thus, give or take a scaling, F is a fourth root of the identity; if we chose P, Q, R, S and T suitably, so as to get scale = ±1, it'll deserve to be regarded as having unit size. The choice P = Q = S = 1 with R = √(2.π) yields scale = 1; but so does the choice Q = 2.π, P = R = S = T = 1. While the former is most useful in justifying the Fourier transform, it is hard to motivate other than as the way to achieve unit size; the latter, on the other hand, is easily enough motivated.
Chosing Q = 2.π effectively choses exp(2.π.i.w·x) as
the canonical sinusoid
being used by Fourier's analysis. The inner
product w·x of the inputs (to the original function, f, and its
transform, F(f)) is the natural scalar to obtain from a member of V and a
member of dual(V); while the function exp(2.π.i.t) ←t is a sinusoid
with period 1. Thus, in effect, the Fourier transform begs us to use period =
1 sinusoids rather than period = 2.π.i ones; in effect, it's asking us to
use the turn as our unit of angle, in preference to the radian.
This preference becomes particularly stark in the case of the real transform, where no complex numbers are involved at all. For this we define
and obtain f = C(C(f)) +S(S(f)) cleanly without any factors of 2.π showing up at all; whereas the radian-based equivalents (using cos(k·x), i.e. Cos(k·x.radian), rather than Cos(k·x.turn); and likewise for Sin) are doomed to a scattering of factors of 2.π.
solid angle
While we're at it, note that SI also defines the steradian, a unit
of solid angle
. Just as the whole of a circle subtends an angle of one
turn about its centre, so equally a sphere's surface subtends
a solid angle
of one whole shell (for want of a better word) about its
centre. The idea behind solid angle is that, for example, if a light source
is radiating energy out in all directions equally, any object receives a share
of that energy in proportion to the fraction of a whole shell that's covered
by the radial projection of the given object onto the chosen shell. Thus, if
a light-bulb hangs in the mouth of a cave, half the light from the bulb goes
into the cave and half of it goes to the outside world; when projected onto a
sphere about the bulb, the mouth of the cave appears as a great circle of the
sphere, dividing it into equal parts, one of which faces into the cave, the
other outwards.
Since we live in a Minkowskian universe, it might also make
sense to consider what analogue of fraction of a whole spherical shell
can be made intelligible for the metric of space-time, whose spheres
are hyperboloids (of one sheet if of space-like radius, of two sheets if of
time-like radius) and hence, in particular, infinite.
Where two lines meet in an angle, if we draw a circle about the point
where they meet, small enough that both lines cut it, we can measure the
length of the arc of the circle between where the two lines cut the
circle. (Measure the arc's length along the circumference, not the length of
the chord that takes a short-cut via the interior of the circle.) Dividing
the length of the arc by the radius of the circle gives you the size of your
angle, measured in radians. Equally, if you start with an arc of a circle and
connect its end-points to the centre by straight lines, the angle these make
at the centre is, in radians, the result of dividing the length of the arc by
the radius of the circle. Analogously, the steradian is defined by: if we
connect the boundary of a region on the surface of a sphere to the centre, by
straight lines forming a (not necessarily circular) conical
surface,
the solid angle at the apex of the cone
, measured in steradians, is
just the area of the region divided by the square of the radius.
If we look at a distant object, the lines from our eyes to the apparent
boundary of the object form just such a (not necessarily
circular) cone
; we can intersect this with any sphere between us and
the object, centred on our eyes, and measure the solid angle by looking at the
portion of the sphere contained within the cone. That portion is what's
covered by projecting the object radially, towards the sphere's centre, onto
the sphere. The object is said to subtend
the resulting solid
angle at
our eyes. Likewise, in two dimensions, if we project a figure
radially onto a circle, to cover a portion of its circumference, we obtain the
angle that the figure subtends at
the centre of the circle.
Astronomers use a measure of solid angle called the square degree; this is the solid angle subtended by a square on the sphere whose sides subtend one degree. For any tiny enough angle k, a similar square with sides subtending k will have area equal to square(radius.k/radian); dividing this by the square of radius we get the number of steradians it corresponds to, which is just the square of k/radian. Thus the steradian is, in a meaningful sense, just the square of the radian. One full shell, 4.π sr, is thus 2.turn.radian or turn.turn/π; in square degrees, that comes to 129600/π or just under 41253.
Strictly speaking, the appropriate notion of a square
on the surface of a sphere requires us to use, as its straight
edges,
arcs of great circles (these are the circles in which the sphere meets planes
through its centre; the sense in which they are straight
is that the
shortest path, in the surface of the sphere, between two points on it, is
always an arc of a great circle); and the figure is square
if the four
angles in which the edges meet are equal – in which case they are (at
least a little) bigger than quarter turns, thanks to the sphere's
curvature. For small enough angles, this makes no practical difference, so we
can properly define the square of an angle
a by: find some number k
large enough that for squares whose sides subtend a/k at a sphere's centre,
there is negligible difference in area between a Euclidean square in a plane
tangent to the sphere at the plane's centre and a square in the surface of the
sphere; now measure the solid angle such a square subtends at the centre and
multiply it by k.k; this is the square of a. The important thing is that all
sufficiently large k give the same answer (give or take negligible
differences); or, to put it formally, the solid angle this gives you as
a×a tends to a definite limit as 1/k tends to zero. If the sphere's
radius is R, the tiny squares have side R.a/k/radian and area
R.R.(a/radian).(a/radian)/k/k, hence subtend solid angle
steradian.(a/radian).(a/radian)/k/k, whence the square
of a is
a×a = steradian.(a/radian).(a/radian). In particular, substituting a =
radian, we get radian×radian = steradian. Technically, this ×
isn't scalar multiplication, but we can now see that it behaves about as much
like scalar multiplication as the multiplication of lengths of sides of a
square, so it makes sense to denote it the same way and just write a.a for the
square of the angle a. Which is just a long-winded way of formalising what I
said previously.
Now, the total area of a sphere's surface is 4.π times the square of the sphere's radius; this is the product of the sphere's circumference and diameter. Crucially, it grows in proportion to the square of the sphere's radius.
In the two-dimensional world, the one-dimensional circle's circumference grows proportional to the radius of the circle; the fraction of the circle covered by the projection, radially onto it, of any given object is just the angle (measured in turns) subtended by that object as seen from the circle's centre. The length of the relevant piece of the circumference is then just the angle multiplied by the radius times 2.π/turn. The radian = turn/(2.π) equips us to restate that as: the length of the piece of circumference is just the angle, measured in radians, times the radius.
Likewise, in three dimensions, a two-dimensional portion of a sphere's surface, covered by the projection, radially onto it, of some given object, has area equal to the solid angle it subtends, times the square of the sphere's radius, times 4.π/shell. Just as measuring an angle in radians gave us what to multiply the radius by to get the length of a piece of circumference, we can introduce a solid angle steradian = shell/(4.π) and, by measuring solid angles in this unit, get the scalar by which to multiply the square of radius to obtain the area of a portion of our sphere that subtends the given solid angle.
The question I want to address is: can we derive the solid angle from the
angle (e.g. angle.angle), is it a separate and independent entity, or is it in
fact properly dimensionless ? SI, in chosing to define the steradian,
declares solid angle to be an independent kind of dimension, separate from the
angle. This begs the question of what must happen at each higher dimension; do
we get separate and independent n-dimensional angles
for each positive
integer n ?
An immediate thought from that is to follow down in dimension; at dimension 1, the 1-sphere's perimeter comprises two points, independent of radius, and a zero-angle counts how many of those two points it embraces; it's clearly just a number. At dimension zero, the 0-sphere is a point with no boundary (or, if you will, with a boundary made of 0 pieces, each of which is the −1 simplex and varies in size proportional to 1/radius), so begs no notion of angle. Having no geometric intuition about – and, in particular, no notion of angle for – negative dimensional analogues, it doesn't make sense to chase below dimension zero. It is, however, noteworthy that this strange domain gives zero area and volume to negative even dimension and steadily growing (albeit alternating in sign) volume and area for negative odd dimension: even dimension is special.
Going up in dimension, we'd get the 4-sphere, of
total area
2.π.π times the cube of radius, requiring a unit of
4-angle
equal to 1/(2.π.π) of a 4-shell; the 5-sphere
with area
8.π.π/3 times the fourth power of radius, requring a
unit of 5-angle equal to 3/(8.π.π) of a
5-shell; and so on. Taking A as mapping
from dimensions to the surface measure of the unit sphere at each, we obtain
A(n+2) = 2.π.A(n)/n for each positive dimension, n. Note that A(dim) <
1 for dim > 18, so the required units of angle
for dimension greater
than 18 would exceed the whole shell
at such dimensions. The two-step
nature of the iteration might point to the right unit of angle at each
dimension being a suitable power of solid angle times, for even dimensions,
plain old angle; if solid angle was honestly dimensionless, we'd thereby
escape from needing endlessly many new forms of angle
unit.
The ancient HAKMEM document contains the following fascinating information:
- PROBLEM 45 (Gosper):
- Take a unit step at some heading (angle).
- Double the angle, step again.
- Redouble, step, etc.
- For what initial heading angles is your locus bounded?
- PARTIAL ANSWER (Schroeppel, Gosper):
When the initial angle is a rational multiple of [a half turn], it seems that your locus is bounded (in fact, eventually periodic) iff the denominator contains as a factor the square of an odd prime other than 1093 and 3511, which must occur at least cubed. (This is related to the fact that 1093 and 3511 are the only known primes satisfying
- power(2, P) = 2 mod P.P)
But a denominator of 171 = 9 * 19 never loops, probably because 9 divides phi(19). Similarly for 9009 and 2525. Can someone construct an irrational multiple of [a half turn] with a bounded locus? Do such angles form a set of measure zero in the reals, even though the
measurein the rationals is about .155? About .155 = the fraction of rationals with denominators containing odd primes squared = 1 − product(: 1 − 1/P/(P +1) ←P :{odd primes}). This product = .84533064 ± a smidgen, and is not, alas, sqrt(pi/2) ARCERF(1/4) = .84534756. This errs by 16 times the correction factor one expects for 1093 and 3511, and is not even salvaged by the hypothesis that all primes > a million satisfy the congruence. It might, however, be salvaged by quantities like 171.
If, in fact, all solution angles are rational multiples of a half turn, this could be construed as geometry favouring the turn, again …

