Category Archive

You are currently browsing the category archive for the ‘Puzzles’ category.

Self-referential Paradoxes, Incompleteness and Fixed Points

July 11, 2010 in Category Theory, Mathematics, Philosophy & Logic, Puzzles | Tags: Cantor's theorem, cartesian closed category, Category Theory, fixed points, Gödel's incompleteness theorem, self-refential paradoxes | by Vishal | 17 comments

Who doesn’t like self-referential paradoxes? There is something about them that appeals to all and sundry. And, there is also a certain air of mystery associated with them, but when people talk about such paradoxes in a non-technical fashion indiscriminately, especially when dealing with Gödel’s incompleteness theorem, then quite often it gets annoying!

Lawvere in ‘Diagonal Arguments and Cartesian Closed Categories‘ sought, among several things, to demystify the incompleteness theorem. To pique your interest, in a self-commentary on the above paper, he actually has quite a few harsh words, in a manner of speaking.

“The original aim of this article was to demystify the incompleteness theorem of Gödel and the truth-definition theory of Tarski by showing that both are consequences of some very simple algebra in the cartesian-closed setting. It was always hard for many to comprehend how Cantor’s mathematical theorem could be re-christened as a“paradox” by Russell and how Gödel’s theorem could be so often declared to be the most significant result of the 20th century. There was always the suspicion among scientists that such extra-mathematical publicity movements concealed an agenda for re-establishing belief as a substitute for science.”

In the aforesaid paper, Lawvere of course uses the language of category theory – the setting is that of cartesian closed categories – and therefore the technical presentation can easily get out of reach of most people’s heads – including myself. Thankfully, Noson S. Yanofsky has written a nice paper, ‘A Universal Approach to Self-Referential Paradoxes, Incompleteness and Fixed Points’, that is a lot more accessible and fun to read as well.Yanofsky employs only the notions of sets and functions, thereby avoiding the language of category theory, to bring out and make accessible as much as possible the content of Lawvere’s paper. Cantor’s theorem, Russell’s Paradox, the non-definability of satisfiability, Tarski’s non-definability of truth and Gödel’s (first) incompleteness theorem are all shown to be paradoxical phenomena that merely result from the existence of a cartesian closed category satisfying certain conditions. The idea is to use a single formalism to describe all these diverse phenomena.

(Dang, I just found that John Baez had already blogged on this before, way back in 2006!)

An Experiment in Cognitive Reasoning

December 12, 2009 in Puzzles | Tags: cognitive reasoning, puzzle | by Vishal | 12 comments

Among several things, these days, I have been doing some (serious) reading of literature on psychology, cognitive development, learning, linguistics, philosophy and a few other subjects. Well, the ones I just named happen to be parts of interdisciplinary areas, which are precisely the ones I am interested in. Of course, on many levels those parts also have a lot to do with mathematics, especially mathematical education. Ok, that was just a little background I wanted to provide for the content of today’s post.

I need to do a small (online) experiment in order to test a hypothesis, which will be the subject of my next post. Let me not reveal too much for now. The experiment is in the form of two puzzles that I ask readers (you!) to solve. They are both “multiple choice” puzzles with exactly two correct answers to each. Please bear in mind that this is NOT an IQ test. It is also not meant to test how good you are at solving puzzles individually. I am really interested in “aggregate” results. That is, for testing my hypothesis, I am only interested in what the majority thinks are the right answers. What is more, you won’t be graded, and no one (not even me) will ever know if you got the right answers. Please submit answers to both the puzzles.

Lastly, please don’t cheat or try to look for answers offline or online. As I said, this is NOT a test!

Let us now look at the puzzles.

Puzzle 1:

There are four cards, labelled either X or Y on one side and either 3 or 7 on the other. They are laid out in a row with their top (visible) sides shown like this: X Y 3 7. A rule states: “If X is on one side then there must be a 3 on the other.” Which two cards do you need to turn over to find out if this rule is true?

1) X

2) Y

3) 3

4) 7

Puzzle 2:

As you walk into a bar, you see a large sign that reads, “To drink alcohol here you must be over 18.” There are four people in the bar. You know the ages of two of them, and can see what the other two are drinking. The situation is: Alisa is drinking beer; Dymphna is drinking Coke; Maureen is 30 years old; Lauren is 16 years old. Which two people would you need to talk to in order to check that the “over 18 rule” for drinking alcohol is being followed?

1) Alisa

2) Dymphna

3) Maureen

4) Lauren

If you think you have the answers to those puzzles, then please click here Puzzles to submit your answers. (I couldn’t use PollDaddy to embed the above puzzles in this post because I am not allowed more than 160 characters in a single question. What a pain!) So, please go ahead and click the above link to submit yours answers.

Note: I will keep this “poll” open for a week to collect as much data as possible. Thanks!

Update: It seems many readers weren’t aware of the short duration of the “poll” and that they would have very much liked to participate. So, I am extending the poll till Jan 7, 2010 for them. (Doing so would also help me in collecting more data.)

Number of idempotent endofunctions

March 28, 2009 in Elementary Math Problem Solving, Problem Corner, Puzzles | Tags: André Joyal, combinatorial species, endofunction, fixed point, generating function, idempotent | by Vishal | 26 comments

Last summer, Todd and I discussed a problem and its solution, and I had wondered if it was fit enough to be in the POW-series (on this blog) when he mentioned that the problem might be somewhat too easy for that purpose. Of course, I immediately saw that he was right. But, a few days back, I thought it wouldn’t be bad if we shared this cute problem and its solution over here, the motivation being that some of our readers may perhaps gain something out of it. What is more, an analysis of an egf solution to the problem lends itself naturally to a discussion of combinatorial species. Todd will talk more about it in the second half of this post. Anyway, let’s begin.

PROBLEM: Suppose $A = \{ 1,2, \ldots , n \}$ , where $n$ is a positive natural number. Find the number of endofunctions $f: A \rightarrow A$ satisfying the idempotent property, i.e. $f \circ f = f$ .

It turns out that finding a solution to the above problem is equivalent to counting the number of forests with $n$ nodes and height at most $1$ , which I found here (click only if you wish to see the answer!) at the Online Encyclopedia of Integer Sequences. If you haven’t clicked on that link yet and wish to solve the problem on your own, then please stop reading beyond this point.

SOLUTION: There are two small (and related) observations that need to be made. And, both are easy ones.

Lemma 1: $f$ has at least one fixed point.

Proof: Pick any $i \in A$ and let $f(i) = j$ , where $j \in A$ . Then, using the idempotent property, we have $f(f(i)) = f(i)$ , which implies $f(j) = j$ . Therefore, $j$ is a fixed point, and this proves our lemma.

Lemma 2: The elements in $A$ that are not fixed points are mapped to fixed points of $f$ .

Proof: Suppose $j \in A$ is not a fixed point such that $f(j) = k$ . Then, using the idempotent property again, we immediately have $f(f(j)) = f(j)$ , which implies $f(k) = k$ , thereby establishing the fact that $k$ itself is a fixed point. Hence, $j$ (which is not a fixed point) is mapped to some fixed point of $f$ .

In both the lemmas above, the idempotent property “forces” everything.

Now, the solution is right before our eyes! Suppose $f$ has $m$ fixed points. Then there are $\displaystyle \binom{n}{m}$ ways of choosing them. And, each of the remaining $n - m$ elements of $A$ that are not fixed points are to be mapped to any one of the $m$ fixed points. And, there are a total of $m^{n-m}$ ways of doing that. So, summing over all $m$ , our final answer is $\displaystyle \sum_{m=0}^{n} \binom{n}{m} m^{n-m}$ .

Exponential Generating Function and Introduction to Species

Hi; Todd here. Vishal asked whether I would discuss this problem from the point of view of exponential generating functions (or egf’s), and also from a categorical point of view, using the concept of species of structure, which gives the basis for a categorical or structural approach to generatingfunctionology.

I’ll probably need to write a new post of my own to do any sort of justice to these topics, but maybe I can whet the reader’s appetite by talking a little about the underlying philosophy, followed by a quick but possibly cryptic wrap-up which I could come back to later for illustrative purposes.

Enumerative combinatorics studies the problem of counting the number $a_n$ of combinatorial structures of some type on an $n$ -element set, such as the number of idempotent functions on that set, or the number of equivalence relations, and so on. A powerful idea in enumerative combinatorics is the idea of a generating function, where we study the series $a_n$ by rolling them into a single analytic function, such as

$\displaystyle A(x) = \sum_{n \geq 0} \frac{a_n x^n}{n!},$

(this the so-called “exponential” generating function of $\{a_n\}_{n \geq 0}$ ). In many cases of interest, the function $A(x)$ will be recognizable in terms of operations familiar from calculus (addition, multiplication, differentiation, composition, etc.), and this can then be used to extract information about the series $a_n$ , such as explicit formulas, asymptotics, and so on. If you’ve never seen this idea in action, you should definitely take a look at Wilf’s book generatingfunctionology, or at the book Concrete Mathematics by Graham, Knuth and Patashnik.

Each of the basic operations one performs on analytic functions (addition, multiplication, composition, etc.) will, it turns out, correspond to some set-theoretic operation directly at the level of combinatorial structures, and one of the trade secrets of generating function technology is to have very clear pictures of the combinatorial structures being counted, and how these pictures are built up using these basic structural operations.

In fact, why don’t we start right now, and figure out what some of these structural operations would be? In other words, let’s ask ourselves: if $A(x)$ and $B(x)$ are generating functions for counting combinatorial structures of type (or species) $A$ and $B$ , then what types of structures would the function $A(x) + B(x)$ “count”? How about $A(x)B(x)$ ? Composition $A(B(x))$ ?

The case of $A(x) + B(x)$ is easy: writing

$\displaystyle A(x) + B(x) = \sum_{n \geq 0} \frac{a_n x^n}{n!} + \sum_{n \geq 0} \frac{b_n x^n}{n!} = \sum_{n \geq 0} \frac{(a_n + b_n) x^n}{n!},$

and thinking of $a_n$ as counting structures of type $A$ on an $n$ -element set, and $b_n$ as counting structures of type $B$ , the quantity $a_n + b_n$ counts elements in the disjoint union of the sets of $A$ -structures and $B$ -structures.

In the categorical approach we will discuss later, we actually think of structure types (or species of structure) $A$ as functors, which take an $n$ -element set $S$ to the set $A\left[S\right]$ of structures of type $A$ on $S$ . Here, we have to be a little bit careful about what categories we’re talking about, but the general idea is that if we have a bijection $f: S \to T$ from one $n$ -element set to another, then it should always be possible to “transport” $A$ -structures on $S$ to $A$ -structures on $T$ , simply by relabeling points along the bijection $f$ . So, let us define a species to be a functor

$A: FB \to Set$

where $FB$ is the category of finite sets and bijections (not all functions, just bijections!), and $Set$ is the category of sets. In enumerative combinatorics, the set $A\left[S\right]$ is normally assumed to be finite, but in other applications of the notion of species, we actually allow a lot more latitude, and allow the functor $A$ to map into other categories $C$ , not just $Set$ (“ $C$ -valued species”). But if we stick for now just to set-valued species $A$ , $B$ , then we define the species $A + B$ by the functorial formula

$\displaystyle (A + B)\left[S\right] = A\left[S\right] \sqcup B\left[S\right]$

where $\sqcup$ denotes disjoint union. So addition of generating functions will correspond to the concrete operation of taking disjoint unions of sets of combinatorial species.

More interesting is the case of multiplication. Let’s calculate the product of two egf’s:

$\displaystyle A(x) B(x) = (\sum_{j \geq 0} \frac{a_j x^j}{j!})(\sum_{k \geq 0} \frac{b_k x^k}{k!}) = \sum_{n \geq 0} (\sum_{j + k = n} \frac{n!}{j! k!} a_j b_k) \frac{x^n}{n!}$

The question is: what type of structure does the expression $\displaystyle \sum_{j+k = n} \frac{n!}{j! k!} a_j b_k$ “count”? Look at the individual terms: the binomial coefficient $\displaystyle \frac{n!}{j! k!}$ describes the number of ways of decomposing an $n$ -element set into two disjoint subsets, one with $j$ elements and the other with $k$ , where $j$ and $k$ add to $n$ . Then, $a_j$ is the number of ways of putting an $A$ -structure on the $j$ -element part, and $b_k$ is the number of $B$ -structures on the $k$ -element part.

This suggests a new operation on structure types: given structure types or species $A, B$ , we define a new species $A \otimes B$ according to the formula

$\displaystyle (A \otimes B)\left[S\right] = \bigsqcup_{T \sqcup U = S} A\left[T\right] \times B\left[U\right]$

(that is, a structure of type $A \otimes B$ on a set $S$ is an ordered pair, consisting of an $A$ -structure on a subset of $S$ and a $B$ -structure on its complement). This functorial operation is usually called the “convolution product” of the combinatorial species $A, B$ : it is the concrete set-theoretic operation which corresponds to multiplication of generating functions.

Finally, let’s look at composition $A(B(x))$ . Here we make the technical assumption that $b_0 = 0$ (no $B$ -structures on the empty set!), so that we won’t have divergence issues blowing up in our faces: we want to remain safely within the realm of finitary structures. Okay, then, what type of combinatorial structure does this egf count?

Perhaps not surprisingly, this is rather more challenging than the previous two examples. In analytic function language, we are trying here to give a meaning to the Taylor coefficients of a composite function in terms of the Taylor coefficients of the original functions — for this, there is a famous formula attributed to Faà di Bruno, which we then want to interpret combinatorially. If you don’t already know this but want to think about this on your own, then stop reading! But I’ll just give away the answer, and say no more for now about where it comes from, although there’s a good chance you can figure it out just by staring at it for a while, possibly with paper and pen in hand.

Definition: Let $A, B: FB \to Fin$ be species (functors from finite sets and bijections to finite sets), and assume $B\left[\emptyset\right] = \emptyset$ . The substitution product $A \circ B$ is defined by the formula

$\displaystyle (A \circ B)\left[S\right] = \sum_{E \in Eq(S)} A\left[S/E\right] \times \prod_{c \in S/E} B\left[c\right]$

This clearly requires some explanation. The sum here denotes disjoint union, and $Eq(S)$ denotes the set of equivalence relations on the finite set $S$ . So $E$ here is an equivalence relation, which partitions $S$ into nonempty sets $c$ ( $E$ -equivalence classes). And the quotient $S/E$ denotes the set of such equivalence classes (so we think of each class $c$ as a point of $S/E$ ). What this formula says is that a structure of type $A \circ B$ on $S$ consists of a partition of $S$ into a bunch of non-empty blobs, a $B$ -structure on each blob, and then an $A$ -structure on the set of blobs.

It’s high time for an example! So let’s look at Vishal’s problem, and see if we can picture it in terms of these operations. We’re going to need some basic functions (or functors!) to apply these operations to, and out of thin air I’ll pluck the two basic ones we’ll need:

$\displaystyle E(x) = \exp(x) = \sum_{n \geq 0} \frac{x^n}{n!}$

$F(x) = x$

The first is the generating function for the series $e_n = 1$ . So for the species $E$ , there’s just one structure of type $E$ for each set $S$ (in categorical language, the functor $E: FB \to Set$ is the terminal functor). We can just think of that structure as the set $S$ itself, if we like, with no other structure appended thereon.

For $F$ , we have $f_n = 0$ unless $n = 1$ , where $f_1 = 1$ . So $F$ is the species for the one-element set structure (meaning that $F\left[S\right] = \emptyset$ unless $S$ has cardinality 1, in which case $F\left[S\right] = \{S\}$ ).

Okay, on to Vishal’s example. He was counting the number of idempotent functions $f: S \to S$ , and now, as promised, I want to determine the corresponding egf. You might be able to find it by looking at his formula, but obviously I want to use the ideas I’ve developed thus far, which focuses much more on the pictures. So, let’s picture $f: S \to S$ , first as Vishal did, by thinking of the elements of $S$ as ‘nodes’, and then drawing a directed edge from node $x$ to node $y$ if $f(x) = y$ . (Then, by idempotence of $f$ , $y$ will be a fixed point of $f$ . Let’s agree not to bother drawing an edge from $y$ to itself, if $y$ is already a fixed point.)

In this picture, we get a directed graph which consists of a disjoint union of “sprouts”: little bushes, each rooted at a fixed point of $f$ , whose only other nodes are “leaves” joined to the root by an edge. We can simplify the picture a little: if you put a circle around each sprout, you don’t need the edges at all: just mark one of the points inside as the root, and you know what to do.

So we arrive at a picture of an idempotent function on $S$ : a partition of $S$ into a collection of (nonempty) blobs, and inside each blob, one of the points is marked “root”. In terms of our operations, what does it mean to mark a point in a blob? It just means: break the blob into two pieces, where one piece is given the structure of “one-element set”, and the other piece is just itself. In terms of the ideas developed above, this means each blob carries a $F \otimes E$ structure; we’ll suggestively write this structure type as $X \otimes \exp(X)$ .

In this picture of idempotent $f$ , there is no extra combinatorial structure imposed on the set of blobs, beyond the set itself. In other words, in this picture, the set of blobs carries merely an “ $E$ -structure”, nothing more.

So, putting all this together, we picture an idempotent function on $S$ as a partition or equivalence relation on $S$ , together with an assignment of a marked point in each equivalence class. In the language of species operations, we may therefore identify the structure type of idempotent functions with

$E \circ (F \otimes E)$

or more suggestively, $\exp \circ (X \otimes \exp(X))$ . The exponential generating function is, of course, $e^{x e^x}$ !

In summary, the theory of species is a functorial calculus which projects onto its better-known “shadow”, the functional calculus of generating functions. That is to say, we lift operations on enumeration sequences $\{a_n\}$ , as embodied in their generating functions, directly up to the level of the sets we’re counting, where the functorial operations become both richer and more concrete. The functorial analogue of the generating function itself is called the “analytic functor” attached to the species (the species itself being the concrete embodiment of the enumeration).

Much more could be said, of course. Instead, here’s a little exercise which can be solved by working through the ideas presented here: write down the egf for the number of ways a group of people can be split into pairs, and give an explicit formula for this number. Those of you who have studied quantum field theory may recognize this already (and certainly the egf is very suggestive!) ; in that case, you might find interesting the paper by Baez and Dolan, From Finite Sets to Feynman Diagrams, where the functorial point of view is used to shed light on, e.g., creation and annihilation operators in terms of simple combinatorial operations.

The literature on species (in all its guises) is enormous, but I’d strongly recommend reading the original paper on the subject:

André Joyal, Une théorie combinatoire des séries formelles, Adv. Math. 42 (1981), 1-82.

which I’ve actually referred to before, in connection with a POW whose solution involves counting tree structures. Joyal could be considered to be a founding father of what I would call the “Montreal school of combinatorics”, of which a fine representative text would be

F. Bergeron, G. Labelle, and P. Leroux, Combinatorial Species and Tree-like Structures, Encyclopedia of Mathematics and its Applications 67, 1998.

More to come, I hope!

A Mathematical Chess (Dissection) Paradox

March 25, 2009 in Elementary Math Problem Solving, Problem Corner, Puzzles | Tags: cassini's identity, chess, dissection, fibonacci, mathematical, paradox | by Vishal | 1 comment

I thought I would share with our chess-loving readers the following interesting (and somewhat well-known) mathematical chess paradox , apparently proving that $64=65$ , and the accompanying explanation offered by Prof. Christian Hesse, University of Stuttgart (Germany). It shows a curious connection between the well-known Cassini’s identity (related to Fibonacci numbers) and the $8 \times 8$ chessboard ( $8$ being a Fibonacci number!). The connection can be exploited further to come up with similar paradoxes wherein any $F_n \times F_n$ -square can always be “rerranged” to form a $F_{n-1} \times F_{n+1}$ -rectangle such that the difference between their areas is either $+1$ or $-1$ . Of course, for the curious reader there are plenty of such dissection problems listed in Prof David Eppstein’s Dissection page.

Solution to POW-12: A graph coloring problem

January 4, 2009 in Problem of the Week (POW), Puzzles | by Todd Trimble | 22 comments

The solutions are in! This problem of the week was interesting for me: the usual pattern has been that I pose problems that I’ve solved myself at some point in the past, giving me a kind of “inside edge” on understanding the solutions that come in. But, as I said in POW-12, the difference this time is that the solution I knew of came from someone else (Arin Chaudhuri). What was interesting for me — and given the circumstances, it was probably inevitable — is that some of the solutions we received forced me to make contact with some mathematics I’d only heard about but never studied. Let me get to that in a minute.

Another interesting thing for me — speaking now with my category theorist’s hat on — is how utterly simple and conceptual Arin’s proof was! I was pleased to see that regular problem solver Philipp Lampe also spotted it. Wait for it…

Solution I by Philipp Lampe, University of Bonn: The answer is 8.

Among the eight neighbors of an arbitrary vertex, all colors of an admissible coloring must occur. Thus, 8 is an upper bound for the maximum number of colors one can use. We have to show that there is an admissible coloring with eight colors.

The vertices of the 8-dimensional cube may be represented by vectors $(a_1, a_2, \ldots, a_8)$ with $a_i$ in $\{0,1\}$ , in other words as vectors in $\mathbb{F}_{2}^8$ , the 8-dimensional vector space over the field $\mathbb{F}_2 = \{0, 1\}$ with two elements (i.e., the integers modulo 2). Two such vectors $u, v$ are neighbors iff their coordinate vectors differ in exactly one place, in other words if $u = v + e_i$ considered modulo 2, where $e_i$ is one of the eight standard basis elements (with $i$ -th coordinate 1, and the other coordinates 0).

Now let our “colors” be the 8 elements $x_1, x_2, \ldots, x_8$ of $\mathbb{F}_{2}^3$ , the 3-dimensional vector space over $\mathbb{F}_2$ . Let the vertex “coloring”

$C: \mathbb{F}_{2}^8 \to \mathbb{F}_{2}^3$

be the unique $\mathbb{F}_2$ -linear map such that $C(e_i) = x_i$ ; that is, define the color of a vertex/vector by

$\displaystyle C\left(a_1,\ldots, a_8\right) = \sum_{i=1}^{8} a_i x_i$

Now, if $v$ is any vector, the colors of its neighbors are $C(v + e_i) = C(v) + C(e_i) = C(v) + x_i$ . The colors of these neighbors are all distinct since the $x_i$ are distinct. Hence all 8 colors are represented among the colors of the neighbors of any given vertex $v$ , QED.

What I love about this solution it is how natural it is. I’ll say a little more about this in remarks below.

But I also learned a thing or two by studying the next solution. It relies on the theory of Hamming codes, with which I was not conversant. Up to some small edits, here is exactly what Sune Jakobsen submitted:

Solution II by Sune Jakobsen (first-year student), University of Copenhagen: Since each vertex only has 8 neighbors, the answer cannot be greater that 8.

Now we construct such a coloring with 8 colors. An 8-dimensional cube can be represented by the graph with vertices in $\{0, 1\}^8$ , and with an edge between them iff the Hamming distance between them is 1. We color a vertex with color 8 if the seven first bits in the vertex is a “correct” Hamming message (cf. Hamming code (7,4)), and color it in color $i$ if the first seven bits give a correct Hamming message upon changing bit $i$ . This is a well-defined coloring, since each element in $\{0, 1\}^7$ is either a correct Hamming message, or is Hamming distance 1 away to exactly one correct Hamming message.

It remains to show that no vertex is neighbor to two vertices of the same color. The Hamming distance between these two vertices is 2, thus the Hamming distance between the first 7 bits of two neighbors must be 1 or 2. If two neighbors had the same color $i$ , then by definition one would get two correct Hamming messages by changing bit $i$ in both of them, and the Hamming distance between these messages would be the same as before — either 1 or 2. But the distance between any two correct Hamming messages is at least 3. Contradiction.

Remarks:

1. Let me give a little background to Sune’s solution. Mathematically, the Hamming code called “(7, 4)” is the image of injective linear map

$G: \mathbb{F}_{2}^4 \to \mathbb{F}_{2}^7$

given by the $7 \times 4$ matrix

1101
1011
1000
0111
0100
0010
0001

The Hamming code is what is known as an “error-correcting code”. Imagine that you want to send a 4-bit message (each bit being a 0 or 1) down a transmission line, but that due to noise in the line, there is the possibility of a bit being flipped and the wrong message being received. What can you do to help ensure that the intended message is received?

The answer is to add some “parity check bits” to the message. In the code (7, 4), one adds in three parity checks, so that the transmitted message is 7 bits long. What one does is apply the matrix $G$ to the 4-vector, to get a 7-vector (remember we’re working modulo 2), and this 7-vector is sent down the line. Assuming only a moderate amount of noise in the transmission line, perhaps the 7-bit message will remain intact or perhaps a single bit will be flipped, more rarely two bits will be flipped (and we’ll assume the event that more than two are flipped has negligible probability). Now, the Hamming encoding $G$ is rigged up so that if the 7-bit vector is received as sent, then the parity checks will report 0 errors. If the parity checks report an error, they report precisely where the error occurred if the received vector was off by one bit. (If two flips occurred, then the receiver can still detect from the parity checks that an error occurred. The parity checks can never detect how many bits were flipped, but if the receiver assumes correctly that just one bit got flipped, he can restore the intended message with accuracy. If three or more got flipped, then the receiver got the wrong message, but he would never know it if the parity checks reported back 0 errors.)

How does this work? By having the receiver apply a parity-check matrix $H$ to the received message, given by the $3 \times 7$ matrix

1010101
0110011
0001111

In the first place, $HG = 0$ , and so if the message received belongs to the image of $G$ (is a “correct Hamming message” in the terminology of Solution II), which will indeed be the case if there were no errors in the transmission, then $H$ applied to the message will produce the zero vector. In the case where one bit got flipped in the transmission, $H$ applied to the received vector will return a nonzero 3-vector, but the beauty of the Hamming code is that the 3-vector will spell out in binary the bit the error occurred (for example, if the output vector is $(0 1 1)^t$ , then error occurred in the bit with binary number 011, that is bit 3). In that case, the receiver flips that bit to restore the original message. In these two cases, the original 4 bits are then read off as the 3rd, 5th, 6th, and 7th coordinates of the (possibly corrected) 7-vector.

By the way, Sune reminded us that Hamming codes also came up in a post by Greg Muller over at The Everything Seminar, who used the existence of a Hamming code in every dimension $2^k - 1$ to solve general hat-guessing puzzles.

2. Within minutes of posting the original problem, we received a message from David Eppstein, who reported that the solution of 8 colors is essentially contained in a paper he coauthored with T. Givaris (page 4); his solution is close in spirit to Sune’s.

Arin Chaudhuri also surmised the connection to Hamming codes, and mentions that the problem (in a slightly different formulation) originally came from a friend of a friend, who blogs about a number of related problems on this page. Presumably the author had things like error-correcting codes in mind.

3. Sune and David noted that their solutions generalize to show that on the $2^k$ -dimensional cube (for any $k$ ), it is possible to color the vertices with $2^k$ colors (the maximum number possible) so that all of the colors show up as colors of the neighbors of any given vertex.

This is very easy to see by adapting Philipp’s method. Indeed, for each $k$ just take the color set to be the $\mathbb{F}_2$ -vector space $S = \mathbb{F}_{2}^k$ . The set of vertices of the $2^k$ -dimensional cube may identified with the $\mathbb{F}_2$ -vector space $V(S)$ generated by the set $S$ (as basis), and the desired coloring is then just the $\mathbb{F}_2$ -linear map $C: V(S) \to S$ that extends the identity function on $S$ . As mathematical constructions go, you can’t get much more canonical than that! No fuss, no muss — it’s a purely categorical construction (categorists call it a counit of an adjunction). So then why didn’t I see that myself? 🙂

4. We’re still not sure what the story is in other dimensions. If $a_n$ is the maximum number of admissible colors in dimension $n$ , then about all any one of us knows right now is that $a_n = n$ for $n = 2^k$ , $a_n$ is weakly monotone increasing, and that $a_n < n$ if $n$ is not a power of 2. There’s a conjecture afloat that $a_n = 2^{\left[\log_2 n\right]}$ where $\left[x\right]$ is the floor function, but for now that should be treated as a wild guess. If anyone has further information, please let us know!

Solved by Arin Chaudhuri, David Eppstein, Sune Jakobsen, and Philipp Lampe. Thanks to all those who wrote in!

Using Mathematical Induction to “disprove” the foundation of Calculus!

November 12, 2008 in Puzzles | Tags: 0.999... = 1, mathematical induction | by Vishal | 29 comments

Or, at least, that’s what this blog post at Science and Math Defeated aims to do. Normally, I avoid writing on such a topic but I think the following example could be instructive to a few people, at least, in learning how not to infer from mathematical induction. The author of that blog post sets to “disprove” the foundation of Calculus by showing that the “assumption” $0.999 \ldots = 1$ leads to a contradiction (which I am sure most of you have seen before.) And this is supposed to be achieved through the use of Mathematical Induction.

Let $P(n)$ be the statement $\displaystyle 0.\underbrace{999...9}_{n \, 9's} < 1$ for all $n \in \mathbb{N}$ and $n \ge 1$ .

Claim: $P(n)$ is true for all $n \ge 1$ .

Proof: $0.9 < 1$ , and so, $P(1)$ is true. This takes care of the base case. Now assume $P(n)$ is true for some $k \in \mathbb{N}$ , where $k \geq 1$ . Now, it is easy to show that $P(k+1)$ is true as well (I just skipped some details!). Hence, $P(k) \Rightarrow P(k+1)$ holds. This takes care of the induction step. (Note that $P(k+1)$ is shown to be true independent of $P(k)$ !) And, this proves our claim.

(Erroneous) Conclusion: Hence, $0.999\ldots < 1$ .

Notwithstanding the inductive proof (which is correct) above, why is the above conclusion wrong?

Ans. Because “infinity” is not a member of $\mathbb{N}$ .

(Watch out for Todd’s next post in the ETCS series!)

POW-8: Ups at bat

July 24, 2008 in Problem of the Week (POW), Puzzles | by Todd Trimble | 8 comments

Time for another Problem of the Week! This one is rather elementary, but is connected with a pet topic of mine which I plan to write a post on soon:

If a baseball player’s batting average is .334, what is the minimum number of times he’s been up at bat?

[Note for those who are not fans of baseball: a player’s batting average is (number of hits made)/(number of times at bat), rounded to three decimal places.]

Please submit solutions to topological[dot]musings[At]gmail[dot]com by Wednesday, July 30, 11:59 pm (UTC); do not submit solutions in Comments. Everyone with a correct solution will be inducted into our Hall of Fame! We look forward to your response.

Solution to POW-7: Midpoints between points on a curve

July 10, 2008 in Problem of the Week (POW), Puzzles | by Todd Trimble | 2 comments

The solutions are in! There was quite a bit of activity behind the scenes on this one; POW-7 might look intuitively obvious, as if it should succumb to an easy application of the intermediate value theorem, but for a number of people who fought through it, this problem fought back! (And that was true for me too, when I first encountered it.)

This problem appears as problem B-4 from the 1977 Putnam Competition. A couple of readers hit upon the snappy solution proposed by the problem compilers [as given in The American Mathematical Monthly vol. 86 (1979), pp. 749-757]. I’ll give that solution, and follow up with a few remarks on alternate approaches, and on some of the thoughts the problem evoked in our readers. Thanks to all who wrote in!

Composite solution by Kenneth Chan and Paul Shearer: By translation, we may assume that the given point $P$ is the origin. Let $-C = \{-x: x \in C\}$ . It suffices to show that $C \cap -C$ is nonempty, for if $x$ belongs to this set, then both $x \in C$ and $-x \in C$ , so that $\frac{x + -x}{2} = 0$ is the midpoint $P$ .

The interior regions $R, -R$ of $C, -C$ intersect (e.g., $P = 0$ is in both regions), so if $C \cap -C = \emptyset$ , then by connectedness of the curves $C, -C$ , one of them must be contained in the interior of the other. But this is absurd; for example, it is clear that

$\displaystyle \mbox{sup} \{|x|: x \in C\} = \mbox{sup} \{|x|: x \in -C\}$

and so if the sup is realized at a point $x \in -C$ which is interior to $C$ , then the boundary of the nonempty set $\{tx \in R: t > 1\}$ contains a point of $C$ whose distance from the origin exceeds this shared sup, contradiction.

Remarks:

1. One could just as easily observe that the regions $R, -R$ are congruent and therefore have the same area, so that one of these regions cannot be strictly contained in the other. Therefore, unless $C = -C$ , some of $R$ is interior to $-R$ and some is exterior; since $R$ is connected, it must therefore intersect the border $-C$ of $-R$ . Similarly, the exterior of $C$ must be partly inside $-C$ and partly outside, so it too must intersect $-C$ . In other words, some of $-C$ is interior to $C$ and some is exterior to $C$ , and so by connectedness of $-C$ , it must cross $C$ , as desired. This argument is substantially the one given by Sune Jakobsen.

2. Assuming $P$ is at the origin, a number of people tried to argue that (in polar coordinates) the radius $r$ of a point on $C$ is a function of its angle $\theta$ , and since $r(\theta) - r(\theta + \pi)$ sometimes takes non-positive values and sometimes non-negative values, it should take on the value 0 somewhere by the intermediate value theorem. The pitfall though is that $r$ is not necessarily a well-defined function of $\theta$ (consider non-convex curves $C$ where a radial line meets $C$ in more than one point). Some other readers seemed to notice this and tried a surrogate function where along the radial line at angle $\theta$ , you choose the closest distance $r$ from $C$ to the origin; the trouble there is that this new function $r(\theta)$ might not be continuous (there will typically be a jump discontinuity at angles where the radial line is tangent to $C$ ).

3. The obstructions mentioned in remark 2. are of a sort which is ubiquitous in topology, where one would like to construct a continuous choice function (e.g., in the theory of vector bundles or fiber bundles, where one is interested in whether continuous sections of a bundle projection $\pi: E \to X$ exist). À propos of that, Paul Shearer made an intriguing suggestion (in a comment here), that POW-7 would follow from the stronger conjecture that the space $S$ of pairs of points on $C$ which “straddle” $P$ (i.e., where the points and $P$ live on the same line, but the points are on opposite sides of $P$ ) should be path-connected. That is, given two pairs $c, c' \in S$ , the conjecture is that we can choose a continuous path through $S$ which gets us from $c$ to $c'$ . Subsequently, Paul found a counterexample to this conjecture. Can you find one? [Edit: I had invited Paul to comment on this, but then missed the fact that he had already done so! My bad – TT.]

4. Miodrag Milenkovic came up with a related problem which can be posed in any dimension. Consider any $n$ -sphere $S^n$ embedded in $\mathbb{R}^{n+1}$ (POW-7 concerns the case $n = 1$ ), and let $P$ be a point interior to the embedded sphere. [Note: the complement of such an embedded sphere consists of two connected components, i.e., an “interior” and “exterior”, although this fact isn’t completely trivial; see theorem 36.3 (generalized Jordan curve theorem) in Munkres’s Elements of Algebraic Topology. These regions can be topologically complicated, as we know from the example of the Alexander horned sphere.] Miodrag asks: is it true that there exists a plane through $P$ so that $P$ is the center of mass (barycenter) of the $(n-1)$ -dimensional region where the interior intersects the plane?

I am not entirely decided on the answer to this question, although I think I have a nice topological argument that the answer is yes, if we assume some mild smoothness assumption on the embedding. I am frankly a little scared of the problem in full topological generality, due to the pathology one may encounter in the topological category!

POW-7 was also solved by Arin Chaudhuri, Sune Jakobsen, Philipp Lampe, and Peter LeFanu Lumsdaine. Thanks again to all for the stimulating correspondence!

POW-7: Midpoints between points on a curve

July 3, 2008 in Elementary Math Problem Solving, Problem Corner, Problem of the Week (POW), Puzzles | by Todd Trimble | 7 comments

Okay, folks, time for another Problem of the Week! I hope it generates more response than last week’s problem:

Let $C$ be a simple closed curve in the plane, and let $P$ be any point strictly in the region interior to $C$ . Show there are two points on $C$ whose midpoint is $P$ .

Please submit solutions to topological[dot]musings[At]gmail[dot]com by Wednesday, July 9, 11:59 pm (UTC); do not submit solutions in Comments. Everyone with a correct solution will be inducted into our Hall of Fame! We look forward to your response.

POW-6: Tiling with Triominoes

June 27, 2008 in Elementary Math Problem Solving, Problem Corner, Problem of the Week (POW), Puzzles | by Todd Trimble | 3 comments

This week’s problem is offered more in the spirit of a light and pleasant diversion — I don’t think you’ll need any deep insight to solve it. (A little persistence may come in handy though!)

Define a triomino to be a figure congruent to the union of three of the four unit squares in a $2 \times 2$ square. For which pairs of positive integers $(m, n)$ is an $m \times n$ rectangle tileable by triominoes?

Please submit solutions to topological[dot]musings[At]gmail[dot]com by Wednesday, July 3, 11:59 pm (UTC); do not submit solutions in Comments. Everyone with a correct solution will be inducted into our Hall of Fame! We look forward to your response. Enjoy!

	Strong equidistribut… on Platonic Solids and Euler…
	m.s.mohamed ansari on The 54th Carnival of Math…
	Convictions ·… on Continued fraction for e
	Wayne J. Mann on Solution to POW-12: A graph co…
	erneststephen on The 54th Carnival of Math…
	anhtraisg on p^q + q^p is prime
	prof dr drd horia or… on My first post
	prof drd horia orasa… on My first post
	prof dr mircea orasa… on Inequality with log
	notedscholar on Self-referential Paradoxes, In…
	prof dr mircea orasa… on Inequality with log
	prof dr mircea orasa… on Inequality with log
	prof dr mircea orasa… on 2010 in review
	kenji on Basic category theory, I
	prof dr mircea orasa… on Solution to POW-10: Another ha…

Category Archive

Exponential Generating Function and Introduction to Species

Our other blog

Visitors to this blog

Blog Stats

Wikio Ranking

Current Online Readers

Recent Comments

Top Posts

Archives

Categories

Blogroll

chess

Computers and Technology

Elementary Math Problem Solving

Fun, Humor

Higher Mathematics

Math Education

Non-technical

Opinions

People in Computer Science

People in mathematics

Philosophy & Logic

Physics

Politics, News

Sciences

Useful general links

Useful Math Links

Tag Cloud

Admin/Users Login

Pages