Claude Shannon | |
Birth Date: | 30 April 1916 |
Birth Name: | Claude Elwood Shannon |
Birth Place: | Petoskey, Michigan, U.S. |
Death Place: | Medford, Massachusetts, U.S. |
Fields: | Mathematics, computer science, electronic engineering, artificial intelligence |
Workplaces: | |
Doctoral Advisor: | Frank Lauren Hitchcock |
Doctoral Students: | |
Awards: |
|
Thesis1 Title: | A Symbolic Analysis of Relay and Switching Circuits |
Thesis1 Url: | https://dspace.mit.edu/handle/1721.1/11173 |
Thesis1 Year: | 1937 |
Thesis2 Title: | An Algebra for Theoretical Genetics |
Thesis2 Url: | https://dspace.mit.edu/handle/1721.1/11174 |
Thesis2 Year: | 1940 |
Spouse: | Norma Levor (1940–41) Betty Shannon (1949–2001) |
Education: | University of Michigan (BS, BSE) Massachusetts Institute of Technology (MS, PhD) |
Claude Elwood Shannon (April 30, 1916 – February 24, 2001) was an American mathematician, electrical engineer, computer scientist and cryptographer known as the "father of information theory" and as the "father of the Information Age".[1] Shannon was the first to describe the Boolean gates (electronic circuits) that are essential to all digital electronic circuits, and was one of the founding fathers of artificial intelligence.[2] [3] [4] [5] [6] He is credited alongside George Boole for laying the foundations of the Information Age.[7] [8] [9]
At the University of Michigan, Shannon dual degreed, graduating with a Bachelor of Science in both electrical engineering and mathematics in 1936. A 21-year-old master's degree student at the Massachusetts Institute of Technology (MIT) in electrical engineering, his thesis concerned switching circuit theory, demonstrating that electrical applications of Boolean algebra could construct any logical numerical relationship,[10] thereby establishing the theory behind digital computing and digital circuits.[11] The thesis has been claimed to be the most important master's thesis of all time, as in 1985, Howard Gardner described it as "possibly the most important, and also the most famous, master's thesis of the century",[12] while Herman Goldstine described it as "surely... one of the most important master's theses ever written... It helped to change digital circuit design from an art to a science."[13] It has also been called the "birth certificate of the digital revolution",[14] and it won the 1939 Alfred Noble Prize. Shannon then graduated with a PhD in mathematics from MIT in 1940, with his thesis focused on genetics, with it deriving important results, but it went unpublished.
Shannon contributed to the field of cryptanalysis for national defense of the United States during World War II, including his fundamental work on codebreaking and secure telecommunications, writing a paper which is considered one of the foundational pieces of modern cryptography,[15] with his work described as "a turning point, and marked the closure of classical cryptography and the beginning of modern cryptography."[16] The work of Shannon is the foundation of secret-key cryptography, including the work of Horst Feistel, the Data Encryption Standard (DES), Advanced Encryption Standard (AES), and more. As a result, Shannon has been called the "founding father of modern cryptography".[17]
His mathematical theory of communication laid the foundations for the field of information theory,[18] [19] with his famous paper being called the "Magna Carta of the Information Age" by Scientific American,[20] along with his work being described as being at "the heart of today's digital information technology".[21] Robert G. Gallager referred to the paper as a "blueprint for the digital era".[22] Regarding the influence that Shannon had on the digital age, Solomon W. Golomb remarked "It's like saying how much influence the inventor of the alphabet has had on literature." Shannon's theory is widely used and has been fundamental to the success of many scientific endeavors, such as the invention of the compact disc, the development of the Internet, feasibility of mobile phones, the understanding of black holes, and more, and is at the intersection of numerous important fields.[23] [24]
Shannon made numerous contributions to the field of artificial intelligence, writing papers on programming a computer for chess, which have been immensely influential,[25] [26] and also his Theseus machine was the first electrical device to learn by trial and error, being one of the first examples of artificial intelligence.[27] He also co-organized and participated in the Dartmouth workshop of 1956, considered the founding event of the field of artificial intelligence.[28] [29] He also made contributions to multiple other fields, such as detection theory and combinatorics.[30]
Rodney Brooks declared that Shannon was the 20th century engineer who contributed the most to 21st century technologies. Shannon's achievements are considered to be on par, in his field, with those of Albert Einstein and Sir Isaac Newton in theirs.[31] [32]
The Shannon family lived in Gaylord, Michigan, and Claude was born in a hospital in nearby Petoskey.[3] His father, Claude Sr. (1862–1934), was a businessman and, for a while, a judge of probate in Gaylord. His mother, Mabel Wolf Shannon (1880–1945), was a language teacher, who also served as the principal of Gaylord High School. Claude Sr. was a descendant of New Jersey settlers, while Mabel was a child of German immigrants.[3] Shannon's family was active in their Methodist Church during his youth.[33]
Most of the first 16 years of Shannon's life were spent in Gaylord, where he attended public school, graduating from Gaylord High School in 1932. Shannon showed an inclination towards mechanical and electrical things. His best subjects were science and mathematics. At home, he constructed such devices as models of planes, a radio-controlled model boat and a barbed-wire telegraph system to a friend's house a half-mile away.[34] While growing up, he also worked as a messenger for the Western Union company.
Shannon's childhood hero was Thomas Edison, whom he later learned was a distant cousin. Both Shannon and Edison were descendants of John Ogden (1609–1682), a colonial leader and an ancestor of many distinguished people.[35] [36]
In 1932, Shannon entered the University of Michigan, where he was introduced to the work of George Boole. He graduated in 1936 with two bachelor's degrees: one in electrical engineering and the other in mathematics.
In 1936, Shannon began his graduate studies in electrical engineering at the Massachusetts Institute of Technology (MIT), where he worked on Vannevar Bush's differential analyzer, which was an early analog computer that was composed of electromechanical parts and could solve differential equations.[37] While studying the complicated ad hoc circuits of this analyzer, Shannon designed switching circuits based on Boole's concepts. In 1937, he wrote his master's degree thesis, A Symbolic Analysis of Relay and Switching Circuits,[38] with a paper from this thesis published in 1938.[39] A revolutionary work for switching circuit theory, Shannon diagramed switching circuits that could implement the essential operators of Boolean algebra. Then he proved that his switching circuits could be used to simplify the arrangement of the electromechanical relays that were used during that time in telephone call routing switches. Next, he expanded this concept, proving that these circuits could solve all problems that Boolean algebra could solve. In the last chapter, he presented diagrams of several circuits, including a digital 4-bit full adder.
Using electrical switches to implement logic is the fundamental concept that underlies all electronic digital computers. Shannon's work became the foundation of digital circuit design, as it became widely known in the electrical engineering community during and after World War II. The theoretical rigor of Shannon's work superseded the ad hoc methods that had prevailed previously. Howard Gardner hailed Shannon's thesis "possibly the most important, and also the most noted, master's thesis of the century."[40] One of the reviewers of his work commented that "To the best of my knowledge, this is the first application of the methods of symbolic logic to so practical an engineering problem. From the point of view of originality I rate the paper as outstanding."[41] Shannon's master thesis won the 1939 Alfred Noble Prize.
Shannon received his PhD in mathematics from MIT in 1940.[35] Vannevar Bush had suggested that Shannon should work on his dissertation at the Cold Spring Harbor Laboratory, in order to develop a mathematical formulation for Mendelian genetics. This research resulted in Shannon's PhD thesis, called An Algebra for Theoretical Genetics.[42] However, the thesis went unpublished after Shannon lost interest, but it did contain important results. Notably, he was one of the first to apply an algebraic framework to study theoretical population genetics.[43] In addition, Shannon devised a general expression for the distribution of several linked traits in a population after multiple generations under a random mating system, which was original at the time,[44] with the new theorem unworked out by other population geneticists of the time.[45]
In 1940, Shannon became a National Research Fellow at the Institute for Advanced Study in Princeton, New Jersey. In Princeton, Shannon had the opportunity to discuss his ideas with influential scientists and mathematicians such as Hermann Weyl and John von Neumann, and he also had occasional encounters with Albert Einstein and Kurt Gödel. Shannon worked freely across disciplines, and this ability may have contributed to his later development of mathematical information theory.[46]
Shannon had worked at Bell Labs for a few months in the summer of 1937,[47] and returned there to work on fire-control systems and cryptography during World War II, under a contract with section D-2 (Control Systems section) of the National Defense Research Committee (NDRC).
Shannon is credited with the invention of signal-flow graphs, in 1942. He discovered the topological gain formula while investigating the functional operation of an analog computer.[48]
For two months early in 1943, Shannon came into contact with the leading British mathematician Alan Turing. Turing had been posted to Washington to share with the U.S. Navy's cryptanalytic service the methods used by the British Government Code and Cypher School at Bletchley Park to break the cyphers used by the Kriegsmarine U-boats in the north Atlantic Ocean. He was also interested in the encipherment of speech and to this end spent time at Bell Labs. Shannon and Turing met at teatime in the cafeteria. Turing showed Shannon his 1936 paper that defined what is now known as the "universal Turing machine". This impressed Shannon, as many of its ideas complemented his own.
In 1945, as the war was coming to an end, the NDRC was issuing a summary of technical reports as a last step prior to its eventual closing down. Inside the volume on fire control, a special essay titled Data Smoothing and Prediction in Fire-Control Systems, coauthored by Shannon, Ralph Beebe Blackman, and Hendrik Wade Bode, formally treated the problem of smoothing the data in fire-control by analogy with "the problem of separating a signal from interfering noise in communications systems."[49] In other words, it modeled the problem in terms of data and signal processing and thus heralded the coming of the Information Age.
Shannon's work on cryptography was even more closely related to his later publications on communication theory.[50] At the close of the war, he prepared a classified memorandum for Bell Telephone Labs entitled "A Mathematical Theory of Cryptography", dated September 1945. A declassified version of this paper was published in 1949 as "Communication Theory of Secrecy Systems" in the Bell System Technical Journal. This paper incorporated many of the concepts and mathematical formulations that also appeared in his A Mathematical Theory of Communication. Shannon said that his wartime insights into communication theory and cryptography developed simultaneously, and that "they were so close together you couldn't separate them".[51] In a footnote near the beginning of the classified report, Shannon announced his intention to "develop these results … in a forthcoming memorandum on the transmission of information."[52]
While he was at Bell Labs, Shannon proved that the cryptographic one-time pad is unbreakable in his classified research that was later published in 1949. The same article also proved that any unbreakable system must have essentially the same characteristics as the one-time pad: the key must be truly random, as large as the plaintext, never reused in whole or part, and kept secret.[53]
See main article: information theory. In 1948, the promised memorandum appeared as "A Mathematical Theory of Communication", an article in two parts in the July and October issues of the Bell System Technical Journal. This work focuses on the problem of how best to encode the message a sender wants to transmit. Shannon developed information entropy as a measure of the information content in a message, which is a measure of uncertainty reduced by the message. In so doing, he essentially invented the field of information theory.
The book The Mathematical Theory of Communication reprints Shannon's 1948 article and Warren Weaver's popularization of it, which is accessible to the non-specialist. Weaver pointed out that the word "information" in communication theory is not related to what you do say, but to what you could say. That is, information is a measure of one's freedom of choice when one selects a message. Shannon's concepts were also popularized, subject to his own proofreading, in John Robinson Pierce's Symbols, Signals, and Noise.
Information theory's fundamental contribution to natural language processing and computational linguistics was further established in 1951, in his article "Prediction and Entropy of Printed English", showing upper and lower bounds of entropy on the statistics of English – giving a statistical foundation to language analysis. In addition, he proved that treating space as the 27th letter of the alphabet actually lowers uncertainty in written language, providing a clear quantifiable link between cultural practice and probabilistic cognition.
Another notable paper published in 1949 is "Communication Theory of Secrecy Systems", a declassified version of his wartime work on the mathematical theory of cryptography, in which he proved that all theoretically unbreakable cyphers must have the same requirements as the one-time pad. He is credited with the introduction of sampling theorem, which he had derived as early as 1940,[54] and which is concerned with representing a continuous-time signal from a (uniform) discrete set of samples. This theory was essential in enabling telecommunications to move from analog to digital transmissions systems in the 1960s and later.
In 1950, Shannon, designed, and built with the help of his wife, a learning machine named Theseus. It consisted of a maze on a surface, through which a mechanical mouse could move through. Below the surface were sensors that followed the path of a mechanical mouse through the maze. After much trial and error, this device would learn the shortest path through the maze, and direct the mechanical mouse through the maze. The pattern of the maze could be changed at will.
Mazin Gilbert stated that Theseus "inspired the whole field of AI. This random trial and error is the foundation of artificial intelligence."[55]
Shannon wrote multiple influential papers on artificial intelligence, such as his 1950 paper titled "Programming a Computer for Playing Chess", and his 1953 paper titled "Computers and Automata".[56]
Shannon co-organized and participated in the Dartmouth workshop of 1956, alongside John McCarthy, Marvin Minsky and Nathaniel Rochester, and which is considered the founding event of the field of artificial intelligence.[57] [58]
In 1956 Shannon joined the MIT faculty, holding an endowed chair. He worked in the Research Laboratory of Electronics (RLE). He continued to serve on the MIT faculty until 1978.
Shannon developed Alzheimer's disease and spent the last few years of his life in a nursing home; he died in 2001, survived by his wife, a son and daughter, and two granddaughters.[59] [60]
Outside of Shannon's academic pursuits, he was interested in juggling, unicycling, and chess. He also invented many devices, including a Roman numeral computer called THROBAC, and juggling machines.[61] [62] He built a device that could solve the Rubik's Cube puzzle.[35]
Shannon designed the Minivac 601, a digital computer trainer to teach business people about how computers functioned. It was sold by the Scientific Development Corp starting in 1961.[63]
He is also considered the co-inventor of the first wearable computer along with Edward O. Thorp.[64] The device was used to improve the odds when playing roulette.
Shannon married Norma Levor, a wealthy, Jewish, left-wing intellectual in January 1940. The marriage ended in divorce after about a year. Levor later married Ben Barzman.[65]
Shannon met his second wife, Mary Elizabeth Moore (Betty), when she was a numerical analyst at Bell Labs. They were married in 1949. Betty assisted Claude in building some of his most famous inventions.[66] They had three children.[67]
Shannon presented himself as apolitical and an atheist.[68]
There are six statues of Shannon sculpted by Eugene Daub: one at the University of Michigan; one at MIT in the Laboratory for Information and Decision Systems; one in Gaylord, Michigan; one at the University of California, San Diego; one at Bell Labs; and another at AT&T Shannon Labs.[69] The statue in Gaylord is located in the Claude Shannon Memorial Park.[70] After the breakup of the Bell System, the part of Bell Labs that remained with AT&T Corporation was named Shannon Labs in his honor.
According to Neil Sloane, an AT&T Fellow who co-edited Shannon's large collection of papers in 1993, the perspective introduced by Shannon's communication theory (now called information theory) is the foundation of the digital revolution, and every device containing a microprocessor or microcontroller is a conceptual descendant of Shannon's publication in 1948:[71] "He's one of the great men of the century. Without him, none of the things we know today would exist. The whole digital revolution started with him."[72] The cryptocurrency unit shannon (a synonym for gwei) is named after him.[73]
Shannon is credited by many as single-handedly creating information theory and for laying the foundations for the Digital Age.[74] [75] [76] [77]
A Mind at Play, a biography of Shannon written by Jimmy Soni and Rob Goodman, was published in 2017.[78] They described Shannon as "the most important genius you’ve never heard of, a man whose intellect was on par with Albert Einstein and Isaac Newton".[79]
On April 30, 2016, Shannon was honored with a Google Doodle to celebrate his life on what would have been his 100th birthday.[80] [81] [82] [83] [84] [85]
The Bit Player, a feature film about Shannon directed by Mark Levinson premiered at the World Science Festival in 2019.[86] Drawn from interviews conducted with Shannon in his house in the 1980s, the film was released on Amazon Prime in August 2020.
Shannon's The Mathematical Theory of Communication,[87] begins with an interpretation of his own work by Warren Weaver. Although Shannon's entire work is about communication itself, Warren Weaver communicated his ideas in such a way that those not acclimated to complex theory and mathematics could comprehend the fundamental laws he put forth. The coupling of their unique communicational abilities and ideas generated the Shannon-Weaver model, although the mathematical and theoretical underpinnings emanate entirely from Shannon's work after Weaver's introduction. For the layman, Weaver's introduction better communicates The Mathematical Theory of Communication, but Shannon's subsequent logic, mathematics, and expressive precision was responsible for defining the problem itself.
"Theseus", created in 1950, was a mechanical mouse controlled by an electromechanical relay circuit that enabled it to move around a labyrinth of 25 squares.[88] The maze configuration was flexible and it could be modified arbitrarily by rearranging movable partitions. The mouse was designed to search through the corridors until it found the target. Having travelled through the maze, the mouse could then be placed anywhere it had been before, and because of its prior experience it could go directly to the target. If placed in unfamiliar territory, it was programmed to search until it reached a known location and then it would proceed to the target, adding the new knowledge to its memory and learning new behavior. Shannon's mouse appears to have been the first artificial learning device of its kind.
See main article: Shannon number. In 1949 Shannon completed a paper (published in March 1950) which estimates the game-tree complexity of chess, which is approximately 10120. This number is now often referred to as the "Shannon number", and is still regarded today as an accurate estimate of the game's complexity. The number is often cited as one of the barriers to solving the game of chess using an exhaustive analysis (i.e. brute force analysis).[89] [90]
On March 9, 1949, Shannon presented a paper called "Programming a Computer for playing Chess". The paper was presented at the National Institute for Radio Engineers Convention in New York. He described how to program a computer to play chess based on position scoring and move selection. He proposed basic strategies for restricting the number of possibilities to be considered in a game of chess. In March 1950 it was published in Philosophical Magazine, and is considered one of the first articles published on the topic of programming a computer for playing chess, and using a computer to solve the game.[89] [91] In 1950, Shannon wrote an article titled "A Chess-Playing Machine",[92] which was published in Scientific American. Both papers have had immense influence and laid the foundations for future chess programs.
His process for having the computer decide on which move to make was a minimax procedure, based on an evaluation function of a given chess position. Shannon gave a rough example of an evaluation function in which the value of the black position was subtracted from that of the white position. Material was counted according to the usual chess piece relative value (1 point for a pawn, 3 points for a knight or bishop, 5 points for a rook, and 9 points for a queen). He considered some positional factors, subtracting ½ point for each doubled pawn, backward pawn, and isolated pawn; mobility was incorporated by adding 0.1 point for each legal move available.
Shannon formulated a version of Kerckhoffs' principle as "The enemy knows the system". In this form it is known as "Shannon's maxim".
The Shannon centenary, 2016, marked the life and influence of Claude Elwood Shannon on the hundredth anniversary of his birth on April 30, 1916. It was inspired in part by the Alan Turing Year. An ad hoc committee of the IEEE Information Theory Society including Christina Fragouli, Rüdiger Urbanke, Michelle Effros, Lav Varshney and Sergio Verdú,[93] coordinated worldwide events. The initiative was announced in the History Panel at the 2015 IEEE Information Theory Workshop Jerusalem[94] [95] and the IEEE Information Theory Society newsletter.[96]
A detailed listing of confirmed events was available on the website of the IEEE Information Theory Society.[97]
Some of the activities included:
The Claude E. Shannon Award was established in his honor; he was also its first recipient, in 1972.[105] [106]