A mask generation function (MGF) is a cryptographic primitive similar to a cryptographic hash function except that while a hash function's output has a fixed size, a MGF supports output of a variable length. In this respect, a MGF can be viewed as a extendable-output function (XOF): it can accept input of any length and process it to produce output of any length. Mask generation functions are completely deterministic: for any given input and any desired output length the output is always the same.
A mask generation function takes an octet string of variable length and a desired output length as input, and outputs an octet string of the desired length. There may be restrictions on the length of the input and output octet strings, but such bounds are generally very large. Mask generation functions are deterministic; the octet string output is completely determined by the input octet string. The output of a mask generation function should be pseudorandom, that is, if the seed to the function is unknown, it should be infeasible to distinguish the output from a truly random string.[1]
Mask generation functions, as generalizations of hash functions, are useful wherever hash functions are. However, use of a MGF is desirable in cases where a fixed-size hash would be inadequate. Examples include generating padding, producing one-time pads or keystreams in symmetric-key encryption, and yielding outputs for pseudorandom number generators.
Mask generation functions were first proposed as part of the specification for padding in the RSA-OAEP algorithm. The OAEP algorithm required a cryptographic hash function that could generate an output equal in size to a "data block" whose length was proportional to arbitrarily sized input message.[1]
NIST Special Publication 800-90A[2] defines a class of cryptographically secure random number generators, one of which is the "Hash DRBG", which uses a hash function with a counter to produce a requested sequence of random bits equal in size to the requested number of random bits.
Perhaps the most common and straightforward mechanism to build a MGF is to iteratively apply a hash function together with an incrementing counter value. The counter may be incremented indefinitely to yield new output blocks until a sufficient amount of output is collected. This is the approach used in MGF1.
MGF1 is a mask generation function defined in the Public Key Cryptography Standard #1 published by RSA Laboratories:[1]
Options
Hash
hash function (
denotes the length in octets of the hash function output)hLen
Input
Z
seed from which mask is generated, an octet string
l
intended length in octets of the mask, at most
232(hLen)
Output
mask
mask, an octet string of length
; or "mask too long"l
Steps
Below is Python code implementing MGF1:
def mgf1(seed: bytes, length: int, hash_func=hashlib.sha1) -> bytes: """Mask generation function.""" hLen = hash_func.digest_size # https://www.ietf.org/rfc/rfc2437.txt # 1. If l > 2^32(hLen), output "mask too long" and stop. if length > (hLen << 32): raise ValueError("mask too long") # 2. Let T be the empty octet string. T = b"" # 3. For counter from 0 to \lceil\rceil-1, do the following: # Note: \lceil\rceil-1 is the number of iterations needed, # but it's easier to check if we have reached the desired length. counter = 0 while len(T) < length: # a. Convert counter to an octet string C of length 4 with the primitive I2OSP: C = I2OSP (counter, 4) C = int.to_bytes(counter, 4, "big") # b. Concatenate the hash of the seed Z and C to the octet string T: T = T || Hash (Z || C) T += hash_func(seed + C).digest counter += 1 # 4. Output the leading l octets of T as the octet string mask. return T[:length]
Example outputs of MGF1: