This post is about the axiom of choice, arguably the most famous and the most controversial of the axioms underpinning the set theoretic foundations of mathematics today. The axiom of choice is about making choices: choosing an element each from an arbitrary collection of nonempty sets. The axiom basically says that this can always be done, although it provides no recipe for doing so.
Before formally presenting the axiom we state some definitions. The treatment is essentially taken from here.
Definition: A function from a set
onto a set
is said to index the set
by
. The set
is called the index and
is the indexed set. If
, we write
for
.
Definition: Let be a nonempty set indexed by a set
. The cartesian product of
is defined to be the set
of all functions
with domain
and codomain
, satisfying the condition
.
Let us consider an example before we go further. We let be indexed by
where
. Now there are four functions possible from
to
which satisfy the definition:
The function for which
.
The function for which
.
The function for which
.
The function for which
.
The collection is the cartesian product. Now intuitively the behavior of the function
can be captured by simply writing
since the fact that the first coordinate corresponds to
and the second to
indicates the function
. Similarly the behavior of the function
are also captured by
respectively. Note that a particular ordered pair always corresponds to a unique function
. So the cartesian product may be simply represented as
.
In fact if has at most countable number of elements within it, we can always consider it as an subset of the form
(which may or may not terminate), and index appropriately. Then the cartesian product obtained can be equivalently thought of as a collection of n-tuples or infinite sequences where the ith coordinate corresponds to the image of
in the indexing. For example if
and
then all zero-one sequences correspond to the requisite functions and their collection forms the cartesian product.
Definition: Any function which satisfies the conditions required to make it a member of a cartesian product is called a choice function.
We now give three formulations of the axiom of choice:
Axiom of choice:
1. For every nonempty set whose elements are nonempty sets and are indexed by a set there exists a choice function.
2. If is a family of nonempty sets, indexed by a nonempty set
, then there exists a family
with
such that
for each
. (This corresponds roughly to our informal comment about choices in the first paragraph.)
3. The cartesian product of a nonempty collection of nonempty sets is nonempty.
Theorem: The three formulations are equivalent.
Proof: : Let
be a family of sets, indexed by the nonempty set
. Let
. (Note that the elements of
are themselves sets.) We index
by itself, and consider the cartesian product of
. By the assumption and by
, the set
is a nonempty set of nonempty sets. Hence owing to (1) there exists a choice function, i.e. a function
such that
for all
. For
let
. Then
is a family with the required properties.
: Let
be a nonempty set whose elements are themselves nonempty where
is indexed by
. By (2) we have a family
with
. We define our choice function to be
where
.
: This is obvious by the definitions.
Due to this result any of these three statements may be referred to as the axiom of choice. In fact, there are many other equivalent statements. The term ZFC is usually used to denote the Zermelo Fraenkel axioms together with the axiom of choice, and these axioms form a basis of most of the mathematics done today.
Why is this axiom so relevant? Firstly if one refuses to accept this axiom, one loses a lot of interesting results. Secondly, many times assuming the axiom of choice is so much easier. Many results such as the Cantor-Berstein-Schroeder theorem were proved originally using the axiom of choice, although later a “choice-free” proof was discovered. Similarly the axiom of choice makes undergraduate analysis easier by enabling one to say that is continuous at
if
tends to
for each sequence
that tends to
.
Are there any negative issues with the axiom of choice? Well, firstly it is not constructive and doesn’t actually tell us the choices to make. It just guarantees that some choice exists. In one formulation it tells us that the real numbers can be well ordered but it gives us no recipe of doing so. In other words, it doesn’t actually construct a well order of the reals for us. Secondly, some counter intuitive results such as the Banach Tarsky paradox are true in ZFC and many mathematicians find this troubling and “fishy”.
Thirdly, and probably most importantly, there is history. After it was introduced (in another form) in 1904 by Zermelo, many mathematicians wondered whether or not the axiom was consistent with ZF. In other words people asked whether accepting it would lead to contradictions. Many decades passed before in 1938 Godel showed that ZFC is indeed consistent. Another doubt was whether the axiom of choice was independent of ZF. It was only by 1963 that Cohen proved that the axiom of choice cannot derived from ZF and so together with Godel’s result this implied that the axiom of choice is independent of ZF. During this long period from 1904 to 1963, many bizzare results such as the Banach Tarsky paradox, existence of non-Lebesgue measurable sets etc were found. All this sowed a suspicion in people’s minds as to whether this axiom was “false” at some fundamental level. As a result mathematicians began to pay close attention to results in which the axiom was used, and to try and device proofs which avoided the axiom as much as possible. This habit somewhat lessened after the independence results, but it has not been dropped by everyone even today.
The fact of the matter is that while rejecting the axiom of choice leads to many weird results, so does accepting it. See the chapters entitled “Disasters without Choice” and “Disasters with Choice” in this book for details. However in the long run, the advantages of accepting the axiom outweigh the advantages of rejecting it primarily because the sheer quantity of beautiful and aesthetic results flowing from the axiom of choice provide substance to mathematics as a whole.


Pingback: On unabridged versions of Ramsey’s theorem | Notes on Mathematics
Pingback: Konig’s lemma | Notes on Mathematics
Pingback: The Zermelo Fraenkel axioms | Notes on Mathematics