McEliece cryptosystem explained

In cryptography, the McEliece cryptosystem is an asymmetric encryption algorithm developed in 1978 by Robert McEliece.^[1] It was the first such scheme to use randomization in the encryption process. The algorithm has never gained much acceptance in the cryptographic community, but is a candidate for "post-quantum cryptography", as it is immune to attacks using Shor's algorithm and – more generally – measuring coset states using Fourier sampling.^[2]

The algorithm is based on the hardness of decoding a general linear code (which is known to be NP-hard^[3]). For a description of the private key, an error-correcting code is selected for which an efficient decoding algorithm is known, and that is able to correct

errors. The original algorithm uses binary Goppa codes (subfield codes of algebraic geometry codes of a genus-0 curve over finite fields of characteristic 2); these codes can be efficiently decoded, thanks to an algorithm due to Patterson.^[4] The public key is derived from the private key by disguising the selected code as a general linear code. For this, the code's generator matrix

is perturbated by two randomly selected invertible matrices

and

(see below).

Variants of this cryptosystem exist, using different types of codes. Most of them were proven less secure; they were broken by structural decoding.

McEliece with Goppa codes has resisted cryptanalysis so far. The most effective attacks known use information-set decoding algorithms. A 2008 paper describes both an attack and a fix.^[5] Another paper shows that for quantum computing, key sizes must be increased by a factor of four due to improvements in information set decoding.^[6]

The McEliece cryptosystem has some advantages over, for example, RSA. The encryption and decryption are faster.^[7] For a long time, it was thought that McEliece could not be used to produce signatures. However, a signature scheme can be constructed based on the Niederreiter scheme, the dual variant of the McEliece scheme. One of the main disadvantages of McEliece is that the private and public keys are large matrices. For a standard selection of parameters, the public key is 512 kilobits long.

Scheme definition

McEliece consists of three algorithms: a probabilistic key generation algorithm that produces a public and a private key, a probabilistic encryption algorithm, and a deterministic decryption algorithm.

All users in a McEliece deployment share a set of common security parameters:

n,k,t

Key generation

The principle is that Alice chooses a linear code

from some family of codes for which she knows an efficient decoding algorithm, and to make

public knowledge but keep the decoding algorithm secret. Such a decoding algorithm requires not just knowing

, in the sense of knowing an arbitrary generator matrix, but requires one to know the parameters used when specifying

in the chosen family of codes. For instance, for binary Goppa codes, this information would be the Goppa polynomial and the code locators. Therefore, Alice may publish a suitably obfuscated generator matrix of

More specifically, the steps are as follows:

Alice selects a binary

(n,k)

-linear code

capable of (efficiently) correcting

errors from some large family of codes, e.g. binary Goppa codes. This choice should give rise to an efficient decoding algorithm

. Let also

be any generator matrix for

. Any linear code has many generator matrices, but often there is a natural choice for this family of codes. Knowing this would reveal

so it should be kept secret.

Alice selects a random

k x k

binary non-singular matrix

Alice selects a random

n x n

permutation matrix

Alice computes the

k x n

matrix

{\hatG}=SGP

Alice's public key is

({\hatG},t)

; her private key is

(S,P,A)

. Note that

could be encoded and stored as the parameters used for selecting

Message encryption

Suppose Bob wishes to send a message m to Alice whose public key is

({\hatG},t)

Bob encodes the message

as a binary string of length

Bob computes the vector

c=m{\hatG}

Bob generates a random

-bit vector

containing exactly

ones (a vector of length

and weight

)^[1]

Bob computes the ciphertext as

c=c^\prime+z

Message decryption

Upon receipt of

, Alice performs the following steps to decrypt the message:

Alice computes the inverse of

(i.e.

P^-1

Alice computes

{\hatc}=cP^-1

Alice uses the decoding algorithm

to decode

{\hatc}

{\hatm}

Alice computes

m={\hatm}S^-1

Proof of message decryption

Note that

{\hatc}=cP^-1=m{\hatG}P^-1+zP^-1=mSG+zP^-1

,and that

is a permutation matrix, thus

zP^-1

has weight

The Goppa code

can correct up to

errors, and the word

mSG

is at distance at most

from

cP^-1

. Therefore, the correct code word

{\hatm}=mS

is obtained.

Multiplying with the inverse of

gives
m={\hatm}S^-1=mSS^-1

, which is the plain text message.

Key sizes

Because there is a free choice in the matrix

, it is common to express

{\hatG}

in "systematic form" so that the last

columns correspond to the identity matrix

{\hatG}=({\tildeG}|I)

. This reduces the key size to

(n-k) x k

.^[8] ^[9] McEliece originally suggested security parameter sizes of

n=1024,k=524,t=50

,^[1] resulting in a public key size of . Recent analysis suggests parameter sizes of

n=2048,k=1751,t=27

for 80 bits of security when using standard algebraic decoding, or

n=1632,k=1269,t=34

when using list decoding for the Goppa code, giving rise to public key sizes of and respectively.^[5] For resiliency against quantum computers, sizes of

n=6960,k=5413,t=119

with Goppa code were proposed, giving the size of public key of .^[10] In its round 3 submission to the NIST post quantum standardization the highest level of security, level 5 is given for parameter sets 6688128, 6960119, and 8192128. The parameters are

n=6688,k=128,t=13

n=6960,k=119,t=13

n=8192,k=128,t=13

respectively.

Attacks

An attack consists of an adversary, who knows the public key

({\hatG},t)

but not the private key, deducing the plaintext from some intercepted ciphertext

y\in

	n
F
	2

. Such attempts should be infeasible.

There are two main branches of attacks for McEliece:

Brute-force / unstructured attacks

The attacker knows

\hatG

, the generator matrix of an

(n,k)

code

\hatC

that is combinatorially able to correct

errors.The attacker may ignore the fact that

\hatC

is really the obfuscation of a structured code chosen from a specific family, and instead just use an algorithm for decoding with any linear code. Several such algorithms exist, such as going through each codeword of the code, syndrome decoding, or information set decoding.

Decoding a general linear code, however, is known to be NP-hard,^[3] however, and all of the above-mentioned methods have exponential running time.

In 2008, Bernstein, Lange, and Peters^[5] described a practical attack on the original McEliece cryptosystem, using the information set decoding method by Stern.^[11] Using the parameters originally suggested by McEliece, the attack could be carried out in 2^60.55 bit operations. Since the attack is embarrassingly parallel (no communication between nodes is necessary), it can be carried out in days on modest computer clusters.

Structural attacks

The attacker may instead attempt to recover the "structure" of

, thereby recovering the efficient decoding algorithm

or another sufficiently strong, efficient decoding algorithm.

The family of codes from which

is chosen completely determines whether this is possible for the attacker. Many code families have been proposed for McEliece, and most of them have been completely "broken" in the sense that attacks have been found that recover an efficient decoding algorithm, such as Reed-Solomon codes.

The originally proposed binary Goppa codes remain one of the few suggested families of codes that have largely resisted attempts at devising structural attacks.

Post-quantum encryption candidate

A variant of this algorithm combined with NTS-KEM^[12] was entered into and selected during the third round of the NIST post-quantum encryption competition.^[13]

External links

Book: . Handbook of Applied Cryptography. CRC Press. 1996. http://cacr.uwaterloo.ca/hac/. Alfred J. Menezes. Scott A. Vanstone. A. J. Menezes. Paul C. van Oorschot. Chapter 8: Public-Key Encryption. 978-0-8493-8523-0. registration.
Book: McEliece Messaging: Smoke Crypto Chat - The first mobile McEliece-Messenger published as a stable prototype worldwide. Article TK Info Portal. 2023. Rahmschmid, Claudia. Adams, David.
Web site: Quantum Computers? Internet Security Code of the Future Cracked. 2008-11-01. Science Daily. Eindhoven University of Technology.
Web site: Classic McEliece. (Submission to the NIST Post-Quantum Cryptography Standardization Project)

Notes and References

A Public-Key Cryptosystem Based on Algebraic Coding Theory . McEliece . Robert J. . 1978DSNPR..44..114M . DSN Progress Report . 44 . 114–116 . 1978.
2011 . McEliece and Niederreiter cryptosystems that resist quantum Fourier sampling attacks . Dinh . Hang . Moore . Cristopher . Russell . Alexander . 761–779 . Heidelberg . Springer . 2874885 . 10.1007/978-3-642-22792-9_43 . Philip . Rogaway . Lecture Notes in Computer Science . 6841 . 978-3-642-22791-2 . Advances in cryptology—CRYPTO 2011 . free.
Berlekamp . Elwyn R. . McEliece . Robert J. . Van Tilborg . Henk C.A. . 1978 . On the Inherent Intractability of Certain Coding Problems . IEEE Transactions on Information Theory . IT-24 . 3 . 384–386 . 10.1109/TIT.1978.1055873 . 0495180 .
N. J. Patterson . 1975 . The algebraic decoding of Goppa codes . IEEE Transactions on Information Theory . IT-21 . 2 . 203–207 . 10.1109/TIT.1975.1055350.
Book: Bernstein . Daniel J. . Lange . Tanja. Tanja Lange . Peters . Christiane . Post-Quantum Cryptography . Attacking and Defending the McEliece Cryptosystem . 8 August 2008 . Lecture Notes in Computer Science . 5299 . 31–46 . 10.1007/978-3-540-88403-3_3 . 978-3-540-88402-6 . 10.1.1.139.3548.
Daniel J. . Bernstein . 2010 . Grover vs. McEliece . Springer . Berlin . Lecture Notes in Computer Science . 6061 . Post-quantum cryptography 2010 . 2776312 . 73–80 . 10.1007/978-3-642-12929-2_6 . Nicolas . Sendrier . 978-3-642-12928-5.
Web site: eBATS: ECRYPT Benchmarking of Asymmetric Systems . 2018-08-25 . bench.cr.yp.to . 2020-05-01.
Web site: Classic McEliece: conservative code-based cryptography: cryptosystem specification . Classic McEliece Team . 2022-10-23 . Round 4 NIST Submission Overview.
Web site: Code-based cryptography III - Goppa codes: definition and usage . Tanja Lange . . 2021-02-23.
Web site: Initial recommendations of long-term secure post-quantum systems . Daniel Augot . 2015-09-07 . PQCRYPTO: Post-Quantum Cryptography for Long-Term Security . etal.
Book: Jacques Stern . Coding Theory and Applications . 1989 . A method for finding codewords of small weight . Lecture Notes in Computer Science . Springer Verlag . 388 . 106–113 . 10.1007/BFb0019850 . 978-3-540-51643-9.
Web site: 2017-12-29. NTS-KEM. 2020-12-09. https://web.archive.org/web/20171229103229/https://nts-kem.io/. 29 December 2017.
Status Report on the Third Round of the NIST Post-Quantum Cryptography Standardization Process. NISTIR. 31.