Semantic security explained

In cryptography, a semantically secure cryptosystem is one where only negligible information about the plaintext can be feasibly extracted from the ciphertext. Specifically, any probabilistic, polynomial-time algorithm (PPTA) that is given the ciphertext of a certain message

m

(taken from any distribution of messages), and the message's length, cannot determine any partial information on the message with probability non-negligibly higher than all other PPTA's that only have access to the message length (and not the ciphertext).[1] This concept is the computational complexity analogue to Shannon's concept of perfect secrecy. Perfect secrecy means that the ciphertext reveals no information at all about the plaintext, whereas semantic security implies that any information revealed cannot be feasibly extracted.[2] [3]

History

The notion of semantic security was first put forward by Goldwasser and Micali in 1982.[4] However, the definition they initially proposed offered no straightforward means to prove the security of practical cryptosystems. Goldwasser/Micali subsequently demonstrated that semantic security is equivalent to another definition of security called ciphertext indistinguishability under chosen-plaintext attack.[5] This latter definition is more common than the original definition of semantic security because it better facilitates proving the security of practical cryptosystems.

Symmetric-key cryptography

In the case of symmetric-key algorithm cryptosystems, an adversary must not be able to compute any information about a plaintext from its ciphertext. This may be posited as an adversary, given two plaintexts of equal length and their two respective ciphertexts, cannot determine which ciphertext belongs to which plaintext.

Public-key cryptography

For an asymmetric key encryption algorithm cryptosystem to be semantically secure, it must be infeasible for a computationally bounded adversary to derive significant information about a message (plaintext) when given only its ciphertext and the corresponding public encryption key. Semantic security considers only the case of a "passive" attacker, i.e., one who generates and observes ciphertexts using the public key and plaintexts of their choice. Unlike other security definitions, semantic security does not consider the case of chosen ciphertext attack (CCA), where an attacker is able to request the decryption of chosen ciphertexts, and many semantically secure encryption schemes are demonstrably insecure against chosen ciphertext attack. Consequently, semantic security is now considered an insufficient condition for securing a general-purpose encryption scheme.

Indistinguishability under Chosen Plaintext Attack (IND-CPA) is commonly defined by the following experiment:[6]

  1. A random pair

(pk,sk)

is generated by running

Gen(1n)

.
  1. A probabilistic polynomial time-bounded adversary is given the public key

pk

, which it may use to generate any number of ciphertexts (within polynomial bounds).
  1. The adversary generates two equal-length messages

m0

and

m1

, and transmits them to a challenge oracle along with the public key.
  1. The challenge oracle selects one of the messages by flipping a fair coin (selecting a random bit

b\in\{0,1\}

), encrypts the message

mb

under the public key, and returns the resulting challenging ciphertext

c

to the adversary.

The underlying cryptosystem is IND-CPA (and thus semantically secure under chosen plaintext attack) if the adversary cannot determine which of the two messages was chosen by the oracle, with probability significantly greater than

1/2

(the success rate of random guessing). Variants of this definition define indistinguishability under chosen ciphertext attack and adaptive chosen ciphertext attack (IND-CCA, IND-CCA2).

Because the adversary possesses the public encryption key in the above game, a semantically secure encryption scheme must by definition be probabilistic, possessing a component of randomness; if this were not the case, the adversary could simply compute the deterministic encryption of

m0

and

m1

and compare these encryptions with the returned ciphertext

c

to successfully guess the oracle's choice.

Semantically secure encryption algorithms include Goldwasser-Micali, ElGamal and Paillier. These schemes are considered provably secure, as their semantic security can be reduced to solving some hard mathematical problem (e.g., Decisional Diffie-Hellman or the Quadratic Residuosity Problem). Other, semantically insecure algorithms such as RSA, can be made semantically secure (under stronger assumptions) through the use of random encryption padding schemes such as Optimal Asymmetric Encryption Padding (OAEP).

References

  1. [Shafi Goldwasser|S. Goldwasser]
  2. Shannon. Claude. Communication Theory of Secrecy Systems. Bell System Technical Journal. 28. 4. 656–715. 1949. 10.1002/j.1538-7305.1949.tb00928.x. 10338.dmlcz/119717. free.
  3. [Oded Goldreich|Goldreich, Oded.]
  4. Goldwasser . Shafi . Micali . Silvio . 1984-04-01 . Probabilistic encryption . Journal of Computer and System Sciences . 28 . 2 . 270–299 . 10.1016/0022-0000(84)90070-9 . 0022-0000.
  5. [Shafi Goldwasser|S. Goldwasser]
  6. Book: Katz. Jonathan. Lindell. Yehuda. Introduction to Modern Cryptography: Principles and Protocols. 2007. Chapman and Hall/CRC. 978-1584885511.