W. T. Tutte
William Thomas Tutte was an English and Canadian code breaker and mathematician. During the Second World War, he made a fundamental advance in cryptanalysis of the Lorenz cipher, a major Nazi German cipher system which was used for top-secret communications within the Wehrmacht High Command.
The high level, strategic nature of the intelligence obtained from Tutte's crucial breakthrough, in the bulk decrypting of Lorenz-enciphered messages specifically, contributed greatly, and perhaps even decisively, to the defeat of Nazi Germany. He also had a number of significant mathematical accomplishments, including foundational work in the fields of graph theory and matroid theory.
Tutte's research in the field of graph theory proved to be of remarkable importance. At a time when graph theory was still a primitive subject, Tutte commenced the study of matroids and developed them into a theory by expanding from the work that Hassler Whitney had first developed around the mid-1930s. Even though Tutte's contributions to graph theory have been influential to modern graph theory and many of his theorems have been used to keep making advances in the field, most of his terminology was not in agreement with their conventional usage and thus his terminology is not used by graph theorists today. "Tutte advanced graph theory from a subject with one text toward its present extremely active state."
Early life and education
Tutte was born in Newmarket in Suffolk. He was the younger son of William John Tutte, an estate gardener, and Annie, a housekeeper. Both parents worked at Fitzroy House stables where Tutte was born. The family spent some time in Buckinghamshire, County Durham, and Yorkshire before returning to Newmarket, where Tutte attended Cheveley Church of England primary school in the nearby village of Cheveley. In 1927, when he was ten, Tutte was awarded a scholarship to the Cambridge and County High School for Boys. He took up his place there in 1928.In 1935, he won a scholarship to study natural sciences at Trinity College, Cambridge, where he specialized in chemistry and graduated with first-class honours in 1938. He continued with physical chemistry as a graduate student, but transferred to mathematics at the end of 1940. As a student, Tutte and three of his friends were one of the first to solve the problem of squaring the square, and the first to solve the problem without a squared subrectangle. Together the four created the pseudonym Blanche Descartes, under which Tutte published occasionally for years.
Second World War
Soon after the outbreak of the Second World War, Tutte's tutor, Patrick Duff, suggested him for war work at the Government Code and Cypher School at Bletchley Park. He was interviewed and sent on a training course in London before going to Bletchley Park, where he joined the Research Section. At first, he worked on the Hagelin cipher that was being used by the Italian Navy. This was a rotor cipher machine that was available commercially, so the mechanics of enciphering was known, and decrypting messages only required working out how the machine was set up.In the summer of 1941, Tutte was transferred to work on a project called Fish. Intelligence information had revealed that the Germans called the wireless teleprinter transmission systems "Sägefisch". This led the British to use the code Fish for the German teleprinter cipher system. The nickname Tunny was used for the first non-Morse link, and it was subsequently used for the Lorenz SZ machines and the traffic that they enciphered.
Telegraphy used the 5-bit International Telegraphy Alphabet No. 2. Nothing was known about the mechanism of enciphering other than that messages were preceded by a 12-letter indicator, which implied a 12-wheel rotor cipher machine. The first step, therefore, had to be to diagnose the machine by establishing the logical structure and hence the functioning of the machine. Tutte played a pivotal role in achieving this, and it was not until shortly before the Allied victory in Europe in 1945, that Bletchley Park acquired a Tunny Lorenz cipher machine. Tutte's breakthroughs led eventually to bulk decrypting of Tunny-enciphered messages between the German High Command in Berlin and their army commands throughout occupied Europe and contributed—perhaps decisively—to the defeat of Germany.
Diagnosing the cipher machine
On 31 August 1941, two versions of the same message were sent using identical keys, which constituted a "depth". This allowed John Tiltman, Bletchley Park's veteran and remarkably gifted cryptanalyst, to deduce that it was a Vernam cipher which uses the Exclusive Or function, and to extract the two messages and hence obtain the obscuring key. After a fruitless period during which Research Section cryptanalysts tried to work out how the Tunny machine worked, this and some other keys were handed to Tutte, who was asked to "see what you can make of these".At his training course, Tutte had been taught the Kasiski examination technique of writing out a key on squared paper, starting a new row after a defined number of characters that was suspected of being the frequency of repetition of the key. If this number was correct, the columns of the matrix would show more repetitions of sequences of characters than chance alone. Tutte knew that the Tunny indicators used 25 letters for 11 of the positions, but only 23 letters for the other. He therefore tried Kasiski's technique on the first impulse of the key characters, using a repetition of 25 × 23 = 575. He did not observe a large number of column repetitions with this period, but he did observe the phenomenon on a diagonal. He therefore tried again with 574, which showed up repeats in the columns. Recognising that the prime factors of this number are 2, 7 and 41, he tried again with a period of 41 and "got a rectangle of dots and crosses that was replete with repetitions".
It was clear, however, that the first impulse of the key was more complicated than that produced by a single wheel of 41 key impulses. Tutte called this component of the key . He figured that there was another component, which was XOR-ed with this, that did not always change with each new character, and that this was the product of a wheel that he called . The same applied for each of the five impulses So for a single character, the whole key K consisted of two components:
At Bletchley Park, mark impulses were signified by x and space impulses by •. For example, the letter "H" would be coded as ••x•x. Tutte's derivation of the chi and psi components was made possible by the fact that dots were more likely than not to be followed by dots, and crosses more likely than not to be followed by crosses. This was a product of a weakness in the German key setting, which they later eliminated. Once Tutte had made this breakthrough, the rest of the Research Section joined in to study the other impulses, and it was established that the five chi wheels all advanced with each new character and that the five psi wheels all moved together under the control of two mu or "motor" wheels. Over the following two months, Tutte and other members of the Research Section worked out the complete logical structure of the machine, with its set of wheels bearing cams that could either be in a position that added x to the stream of key characters, or in the alternative position that added in •.
Diagnosing the functioning of the Tunny machine in this way was novel for the time, in the citation for Tutte's induction as an Officer of the Order of Canada, was described as "one of the greatest intellectual feats of World War II".
Tutte's statistical method
To decrypt a Tunny message required knowledge not only of the logical functioning of the machine, but also the start positions of each rotor for the particular message. The search was on for a process that would manipulate the ciphertext or key to produce a frequency distribution of characters that departed from the uniformity that the enciphering process aimed to achieve. While on secondment to the Research Section in July 1942, Alan Turing worked out that the XOR combination of the values of successive characters in a stream of ciphertext and key emphasised any departures from a uniform distribution. The resultant stream was called the difference because XOR is the same as modulo 2 subtraction.The reason that this provided a way into Tunny was that although the frequency distribution of characters in the ciphertext could not be distinguished from a random stream, the same was not true for a version of the ciphertext from which the chi element of the key had been removed. This was the case because where the plaintext contained a repeated character and the psi wheels did not move on, the differenced psi character would be the null character. When XOR-ed with any character, this character has no effect. Repeated characters in the plaintext were more frequent both because of the characteristics of German, and because telegraphists frequently repeated the figure-shift and letter-shift characters.
To quote the General Report on Tunny:
Turingery introduced the principle that the key differenced at one, now called ΔΚ, could yield information unobtainable from ordinary key. This Δ principle was to be the fundamental basis of nearly all statistical methods of wheel-breaking and setting.
Tutte exploited this amplification of non-uniformity in the differenced values and by November 1942 had produced a way of discovering wheel starting points of the Tunny machine which became known as the "Statistical Method". The essence of this method was to find the initial settings of the chi component of the key by exhaustively trying all positions of its combination with the ciphertext, and looking for evidence of the non-uniformity that reflected the characteristics of the original plaintext. Because any repeated characters in the plaintext would always generate •, and similarly would generate • whenever the psi wheels did not move on, and about half of the time when they did – some 70% overall.
As well as applying differencing to the full 5-bit characters of the ITA2 code, Tutte applied it to the individual impulses. The current chi wheel cam settings needed to have been established to allow the relevant sequence of characters of the chi wheels to be generated. It was totally impracticable to generate the 22 million characters from all five of the chi wheels, so it was initially limited to 41 × 31 = 1271 from the first two. After explaining his findings to Max Newman, Newman was given the job of developing an automated approach to comparing ciphertext and key to look for departures from randomness. The first machine was dubbed Heath Robinson, but the much faster Colossus computer, developed by Tommy Flowers and using algorithms written by Tutte and his colleagues, soon took over for breaking codes.