Shamir's secret sharing explained

Shamir's secret sharing (SSS) is an efficient secret sharing algorithm for distributing private information (the "secret") among a group. The secret cannot be revealed unless a quorum of the group acts together to pool their knowledge. To achieve this, the secret is mathematically divided into parts (the "shares") from which the secret can be reassembled only when a sufficient number of shares are combined. SSS has the property of information-theoretic security, meaning that even if an attacker steals some shares, it is impossible for the attacker to reconstruct the secret unless they have stolen the quorum number of shares.

Shamir's secret sharing is used in some applications to share the access keys to a master secret.

High-level explanation

SSS is used to secure a secret in a distributed form, most often to secure encryption keys. The secret is split into multiple shares, which individually do not give any information about the secret.

To reconstruct a secret secured by SSS, a number of shares is needed, called the threshold. No information about the secret can be gained from any number of shares below the threshold (a property called perfect secrecy). In this sense, SSS is a generalisation of the one-time pad (which can be viewed as SSS with a two-share threshold and two shares in total).^[1]

Application example

A company needs to secure their vault. If a single person knows the code to the vault, the code might be lost or unavailable when the vault needs to be opened. If there are several people who know the code, they may not trust each other to always act honestly.
SSS can be used in this situation to generate shares of the vault's code which are distributed to authorized individuals in the company. The minimum threshold and number of shares given to each individual can be selected such that the vault is accessible only by (groups of) authorized individuals. If fewer shares than the threshold are presented, the vault cannot be opened.
By accident, coercion or as an act of opposition, some individuals might present incorrect information for their shares. If the total of correct shares fails to meet the minimum threshold, the vault remains locked.

Use cases

Shamir's secret sharing can be used to

share a key for decrypting the root key of a password manager,^[2]
recover a user key for encrypted email access^[3] and
share the passphrase used to recreate a master secret, which is in turn used to access a cryptocurrency wallet.^[4]

Properties and weaknesses

SSS has useful properties, but also weaknesses^[5] that make it unsuited to some uses.

Useful properties include:

Secure: The scheme has information-theoretic security.
Minimal: The size of each piece does not exceed the size of the original data.
Extensible: For any given threshold, shares can be dynamically added or deleted without affecting existing shares.
Dynamic: Security can be enhanced without changing the secret by occasionally changing the polynomial (keeping the same free term) and constructing a new share for each of the participants.
Flexible: In organizations where hierarchy is important, each participant can be issued different numbers of shares according to their importance inside the organization. For instance, with a threshold of 3, the president could unlock the safe alone if given three shares, while three secretaries with one share each must combine their shares to unlock the safe.
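
The "Extensible" property can be illustrated with a short Python sketch. It assumes that whoever issues the new share can evaluate the polynomial, here by interpolating from three existing shares. The points and the prime 1613 are taken from the finite field example later in this article; the helper name `interpolate` is hypothetical, and `pow(..., -1, p)` needs Python 3.8 or later.

```python
P = 1613  # prime modulus reused from this article's finite field example

def interpolate(points, x, p=P):
    """Evaluate at x the unique polynomial of degree < len(points) through points, mod p."""
    total = 0
    for j, (xj, yj) in enumerate(points):
        term = yj
        for m, (xm, _) in enumerate(points):
            if m != j:
                # multiply by (x - xm) / (xj - xm) in the field
                term = term * (x - xm) * pow((xj - xm) % p, -1, p) % p
        total = (total + term) % p
    return total

existing = [(1, 1494), (2, 329), (3, 965)]    # threshold k = 3
new_share = (7, interpolate(existing, 7))     # a share for a new participant
print(new_share)                              # (7, 550)

# The new share combines with any two existing ones; old shares are unchanged.
print(interpolate([existing[0], new_share, existing[2]], 0))  # 1234
```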

Weaknesses include:

No verifiable secret sharing: During the share reassembly process, SSS does not provide a way to verify the correctness of each share being used. Verifiable secret sharing aims to verify that shareholders are honest and not submitting fake shares.
Single point of failure: The secret must exist in one place when it is split into shares, and again in one place when it is reassembled. These are attack points, and other schemes including multisignature eliminate at least one of these single points of failure.
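
The lack of verification can be seen in a minimal Python sketch. It reuses three points and the prime 1613 from the finite field example later in this article; the function name `recover` is a hypothetical helper, and `pow(..., -1, p)` needs Python 3.8 or later. Altering a single share silently changes the reconstructed value, and nothing in the scheme flags it.

```python
P = 1613  # prime modulus reused from this article's finite field example

def recover(points, p=P):
    """Interpolate f(0) mod p from (x, y) points with distinct x values."""
    total = 0
    for j, (xj, yj) in enumerate(points):
        weight = 1
        for m, (xm, _) in enumerate(points):
            if m != j:
                # weight_j = product of x_m / (x_m - x_j) over m != j
                weight = weight * xm * pow((xm - xj) % p, -1, p) % p
        total = (total + yj * weight) % p
    return total

honest = [(1, 1494), (2, 329), (3, 965)]
tampered = [(1, 1494), (2, 329), (3, 966)]  # last y value altered by one

print(recover(honest))    # 1234
print(recover(tampered))  # 1235, returned without any error
```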

History

Adi Shamir, an Israeli scientist, first formulated the scheme in 1979.

Mathematical principle

The scheme exploits the Lagrange interpolation theorem, specifically that $k$ points on the polynomial uniquely determine a polynomial of degree less than or equal to $k-1$. For instance, 2 points are sufficient to define a line, 3 points are sufficient to define a parabola, 4 points to define a cubic curve and so forth.
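
The following Python sketch illustrates the principle with exact rational arithmetic. The points are made up for illustration and `interpolate_at` is a hypothetical helper: two points force the value of a line at $x=0$, but the same two points leave a parabola open, so its value at $x=0$ depends on the third point.

```python
from fractions import Fraction

def interpolate_at(points, x):
    """Evaluate at x the unique polynomial of degree < len(points) through points."""
    total = Fraction(0)
    for j, (xj, yj) in enumerate(points):
        term = Fraction(yj)
        for m, (xm, _) in enumerate(points):
            if m != j:
                term *= Fraction(x - xm, xj - xm)
        total += term
    return total

# Two points fix a line (here y = 3 + 2x), so the value at x = 0 is forced.
line_points = [(1, 5), (2, 7)]
print(interpolate_at(line_points, 0))  # 3

# The same two points do not fix a parabola: each choice of a third
# point gives a different value at x = 0.
for third_y in (9, 10, 11):
    print(third_y, interpolate_at(line_points + [(3, third_y)], 0))
# 9 3
# 10 4
# 11 5
```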

Mathematical formulation

Shamir's secret sharing is an ideal and perfect $(k,n)$-threshold scheme based on polynomial interpolation over finite fields. In such a scheme, the aim is to divide a secret $S$ (for example, the combination to a safe) into $n$ pieces of data $S_1,\ldots,S_n$ (known as shares) in such a way that:

Knowledge of any $k$ or more shares $S_i$ makes $S$ computable. That is, the entire secret $S$ can be reconstructed from any combination of $k$ shares.
Knowledge of any $k-1$ or fewer shares $S_i$ leaves $S$ completely undetermined, in the sense that the possible values for $S$ remain as likely with knowledge of up to $k-1$ shares as with knowledge of $0$ shares. The secret $S$ cannot be reconstructed with fewer than $k$ shares.

If $n=k$, then all of the shares are needed to reconstruct the secret $S$.

Assume that the secret $S$ can be represented as an element $a_0$ of a finite field $GF(q)$ (where $q$ is greater than the number $n$ of shares being generated). Randomly choose $k-1$ elements, $a_1,\ldots,a_{k-1}$, from $GF(q)$ and construct the polynomial $f(x)=a_0+a_1x+a_2x^2+\cdots+a_{k-1}x^{k-1}$. Compute any $n$ points out on the curve, for instance set $i=1,\ldots,n$ to find points $(i,f(i))$. Every participant is given a point (a non-zero input to the polynomial, and the corresponding output).^[6] Given any subset of $k$ of these pairs, $a_0$ can be obtained using interpolation, with one possible formula for doing so being

$$a_0=f(0)=\sum_{j=0}^{k-1}y_j\prod_{\substack{m=0\\ m\neq j}}^{k-1}\frac{x_m}{x_m-x_j},$$

where the list of points on the polynomial is given as $k$ pairs of the form $(x_i,y_i)$. Note that $f(0)$ is equal to the first coefficient of polynomial $f(x)$.
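
As a rough check of the two threshold conditions, the following Python sketch builds shares over a small prime field and confirms that every subset of $k$ shares gives back $a_0$. The field size 257, the secret 123 and the names `make_shares` and `recover` are illustrative assumptions, not part of the scheme's definition; `pow(..., -1, q)` needs Python 3.8 or later.

```python
import secrets
from itertools import combinations

Q = 257  # assumed small prime field size; must exceed the number of shares n

def make_shares(a0, k, n, q=Q):
    """Return n points (i, f(i)) of a random polynomial of degree <= k-1 with f(0) = a0."""
    coeffs = [a0] + [secrets.randbelow(q) for _ in range(k - 1)]

    def f(x):
        acc = 0
        for c in reversed(coeffs):  # Horner's rule, reduced mod q
            acc = (acc * x + c) % q
        return acc

    return [(i, f(i)) for i in range(1, n + 1)]

def recover(points, q=Q):
    """a0 = sum_j y_j * prod_{m != j} x_m / (x_m - x_j), computed mod q."""
    total = 0
    for j, (xj, yj) in enumerate(points):
        num, den = 1, 1
        for m, (xm, _) in enumerate(points):
            if m != j:
                num = num * xm % q
                den = den * (xm - xj) % q
        total = (total + yj * num * pow(den, -1, q)) % q
    return total

shares = make_shares(a0=123, k=3, n=5)
# Every 3-element subset of the 5 shares reconstructs the same secret.
print(all(recover(list(s)) == 123 for s in combinations(shares, 3)))  # True
```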

Example calculation

The following example illustrates the basic idea. Note, however, that calculations in the example are done using integer arithmetic rather than using finite field arithmetic to make the idea easier to understand. Therefore, the example below does not provide perfect secrecy and is not a proper example of Shamir's scheme. The next example will explain the problem.

Preparation

Suppose that the secret to be shared is 1234 $(S=1234)$.

In this example, the secret will be split into 6 shares $(n=6)$, where any subset of 3 shares $(k=3)$ is sufficient to reconstruct the secret. $k-1=2$ numbers are taken at random. Let them be 166 and 94.

This yields coefficients $(a_0=1234;a_1=166;a_2=94)$, where $a_0$ is the secret.

The polynomial to produce secret shares (points) is therefore:

$$f(x)=1234+166x+94x^2$$

Six points $D_{x-1}=(x,f(x))$ from the polynomial are constructed as:

$$D_0=(1,1494);\ D_1=(2,1942);\ D_2=(3,2578);\ D_3=(4,3402);\ D_4=(5,4414);\ D_5=(6,5614)$$

Each participant in the scheme receives a different point (a pair of $x$ and $f(x)$). Because $D_{x-1}$ is used instead of $D_x$, the points start from $(1,f(1))$ and not $(0,f(0))$. This is necessary because $f(0)$ is the secret.

Reconstruction

In order to reconstruct the secret, any 3 points are sufficient.

Consider using the 3 points $(x_0,y_0)=(2,1942);\ (x_1,y_1)=(4,3402);\ (x_2,y_2)=(5,4414)$.

Computing the Lagrange basis polynomials:

$$\ell_0(x)=\frac{x-x_1}{x_0-x_1}\cdot\frac{x-x_2}{x_0-x_2}=\frac{x-4}{2-4}\cdot\frac{x-5}{2-5}=\frac{1}{6}x^2-\frac{3}{2}x+\frac{10}{3}$$

$$\ell_1(x)=\frac{x-x_0}{x_1-x_0}\cdot\frac{x-x_2}{x_1-x_2}=\frac{x-2}{4-2}\cdot\frac{x-5}{4-5}=-\frac{1}{2}x^2+\frac{7}{2}x-5$$

$$\ell_2(x)=\frac{x-x_0}{x_2-x_0}\cdot\frac{x-x_1}{x_2-x_1}=\frac{x-2}{5-2}\cdot\frac{x-4}{5-4}=\frac{1}{3}x^2-2x+\frac{8}{3}$$

Using the formula for polynomial interpolation, $f(x)$ is:

$$\begin{aligned}
f(x)&=\sum_{j=0}^{2}y_j\cdot\ell_j(x)\\
&=y_0\ell_0(x)+y_1\ell_1(x)+y_2\ell_2(x)\\
&=1942\left(\frac{1}{6}x^2-\frac{3}{2}x+\frac{10}{3}\right)+3402\left(-\frac{1}{2}x^2+\frac{7}{2}x-5\right)+4414\left(\frac{1}{3}x^2-2x+\frac{8}{3}\right)\\
&=1234+166x+94x^2
\end{aligned}$$

Recall that the secret is the free coefficient, so $S=1234$, and the secret has been recovered.
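
The arithmetic of this reconstruction can be checked mechanically. The sketch below uses Python's exact `Fraction` type to expand the Lagrange basis polynomials for the three chosen points and sum them; `poly_mul` and `lagrange_polynomial` are hypothetical helper names introduced only for this illustration.

```python
from fractions import Fraction

def poly_mul(a, b):
    """Multiply two polynomials given as coefficient lists (lowest degree first)."""
    out = [Fraction(0)] * (len(a) + len(b) - 1)
    for i, ai in enumerate(a):
        for j, bj in enumerate(b):
            out[i + j] += ai * bj
    return out

def lagrange_polynomial(points):
    """Return the interpolating polynomial's coefficients, lowest degree first."""
    result = [Fraction(0)] * len(points)
    for j, (xj, yj) in enumerate(points):
        basis = [Fraction(1)]
        for m, (xm, _) in enumerate(points):
            if m != j:
                # multiply the basis polynomial by (x - xm) / (xj - xm)
                factor = [Fraction(-xm, xj - xm), Fraction(1, xj - xm)]
                basis = poly_mul(basis, factor)
        for i, c in enumerate(basis):
            result[i] += yj * c
    return result

coeffs = lagrange_polynomial([(2, 1942), (4, 3402), (5, 4414)])
print([int(c) for c in coeffs])  # [1234, 166, 94]
```

The first entry of the list is the free coefficient, that is, the secret.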

Computationally efficient approach

Using polynomial interpolation to find a coefficient in a source polynomial $S=f(0)$ using Lagrange polynomials is not efficient, since unused constants are calculated.

Considering this, an optimized formula to use Lagrange polynomials to find $f(0)$ is defined as follows:

$$f(0)=\sum_{j=0}^{k-1}y_j\prod_{\substack{m=0\\ m\neq j}}^{k-1}\frac{x_m}{x_m-x_j}$$
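
Applied to the three points of the reconstruction example, the optimized formula needs only one weight per share. The following Python sketch (integer example, exact fractions; `weights_at_zero` is a hypothetical helper name) computes those weights and the resulting secret.

```python
from fractions import Fraction

def weights_at_zero(xs):
    """For each x_j, return the product over m != j of x_m / (x_m - x_j)."""
    weights = []
    for j, xj in enumerate(xs):
        w = Fraction(1)
        for m, xm in enumerate(xs):
            if m != j:
                w *= Fraction(xm, xm - xj)
        weights.append(w)
    return weights

points = [(2, 1942), (4, 3402), (5, 4414)]
weights = weights_at_zero([x for x, _ in points])
print([str(w) for w in weights])  # ['10/3', '-5', '8/3']

secret = sum(y * w for (_, y), w in zip(points, weights))
print(secret)  # 1234
```

The weights are the constant terms of the three Lagrange basis polynomials, so no other coefficient has to be expanded.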

Problem of using integer arithmetic

Although the simplified version of the method demonstrated above, which uses integer arithmetic rather than finite field arithmetic, works, there is a security problem: Eve gains information about $S$ with every $D_i$ that she finds.

Suppose that she finds the 2 points $D_0=(1,1494)$ and $D_1=(2,1942)$. She still does not have $k=3$ points, so in theory she should not have gained any more information about $S$. But she could combine the information from the 2 points with the public information: $n=6,\ k=3,\ f(x)=a_0+a_1x+\cdots+a_{k-1}x^{k-1},\ a_0=S,\ a_i\in\mathbb{Z}$. Doing so, Eve could perform the following algebra:

1. Fill the formula for $f(x)$ with $S$ and the value of $k$: $f(x)=S+a_1x+\cdots+a_{k-1}x^{k-1}\Rightarrow f(x)=S+a_1x+a_2x^2$
2. Fill (1) with the values of $x$ and $f(x)$ from $D_0$: $1494=S+a_1\cdot 1+a_2\cdot 1^2\Rightarrow 1494=S+a_1+a_2$
3. Fill (1) with the values of $x$ and $f(x)$ from $D_1$: $1942=S+a_1\cdot 2+a_2\cdot 2^2\Rightarrow 1942=S+2a_1+4a_2$
4. Subtract (3)-(2): $(1942-1494)=(S-S)+(2a_1-a_1)+(4a_2-a_2)\Rightarrow 448=a_1+3a_2$ and rewrite this as $a_1=448-3a_2$
5. Now, Eve can replace the result from (4) into (2): $1494=S+(448-3a_2)+a_2\Rightarrow S=1046+2a_2$

which leads her to the information that $S$ is even.
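
Eve's reasoning can be replayed in a few lines of Python. The sketch below solves the two share equations for each guess of $a_2$ and confirms that every consistent candidate for $S$ is even. The final part adds an assumption that is not made in the text above, namely that the coefficients and the secret are non-negative, to show how much further the candidate set would then shrink. The helper name `consistent_secret` is hypothetical.

```python
# Eve's two shares, taken from the integer example above.
(x0, y0), (x1, y1) = (1, 1494), (2, 1942)

def consistent_secret(a2):
    """Solve the two share equations for (S, a1) given a guess for a2."""
    # y1 - y0 = a1*(x1 - x0) + a2*(x1**2 - x0**2), and x1 - x0 == 1 here
    a1 = (y1 - y0) - a2 * (x1**2 - x0**2)
    s = y0 - a1 * x0 - a2 * x0**2
    return s, a1

candidates = [consistent_secret(a2) for a2 in range(-5, 6)]
print(sorted({s % 2 for s, _ in candidates}))  # [0]: every candidate S is even

# Extra assumption (not in the text): a1, a2 and S are non-negative integers.
non_negative = [s for a2 in range(0, 1000)
                for s, a1 in [consistent_secret(a2)] if a1 >= 0 and s >= 0]
print(len(non_negative), min(non_negative), max(non_negative))  # 150 1046 1344
```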

Solution using finite field arithmetic

The above attack exploits constraints on the values that the polynomial may take by virtue of how it was constructed: the polynomial must have coefficients that are integers, and the polynomial must take an integer as value when evaluated at each of the coordinates used in the scheme. This reduces its possible values at unknown points, including the resultant secret, given fewer than $k$ shares.

This problem can be remedied by using finite field arithmetic. A finite field always has size $q=p^r$, where $p$ is a prime and $r$ is a positive integer. The size $q$ of the field must satisfy $q>n$, and $q$ must be greater than the number of possible values for the secret, though the latter condition may be circumvented by splitting the secret into smaller secret values, and applying the scheme to each of these. In our example below, we use a prime field (i.e. $r=1$). The figure shows a polynomial curve over a finite field.

In practice this is only a small change. The order $q$ of the field (i.e. the number of values that it has) must be chosen to be greater than the number of participants and the number of values that the secret $a_0=S$ may take. All calculations involving the polynomial must also be calculated over the field (mod $p$ in our example, in which $p=q$ is taken to be a prime) instead of over the integers. Both the choice of the field and the mapping of the secret to a value in this field are considered to be publicly known.

For this example, choose $p=1613$, so the polynomial becomes $f(x)=1234+166x+94x^2 \bmod 1613$, which gives the points:

$$(1,1494);\ (2,329);\ (3,965);\ (4,176);\ (5,1188);\ (6,775)$$

This time Eve doesn't gain any information when she finds a $D_x$ (until she has $k$ points).

Suppose again that Eve finds $D_0=(1,1494)$ and $D_1=(2,329)$, and the public information is: $n=6,\ k=3,\ p=1613,\ f(x)=a_0+a_1x+\cdots+a_{k-1}x^{k-1} \bmod p,\ a_0=S,\ a_i\in\mathbb{N}$. Attempting the previous attack, Eve can:

1. Fill the $f(x)$-formula with $S$ and the values of $k$ and $p$: $f(x)=S+a_1x+\cdots+a_{3-1}x^{3-1} \bmod 1613$
2. Fill (1) with the values of $x$ and $f(x)$ from $D_0$: $1494\equiv S+a_1\cdot 1+a_2\cdot 1^2\pmod{1613}\Rightarrow 1494\equiv S+a_1+a_2\pmod{1613}$
3. Fill (1) with the values of $x$ and $f(x)$ from $D_1$, noting that $329\equiv 1942\pmod{1613}$: $1942\equiv S+a_1\cdot 2+a_2\cdot 2^2\pmod{1613}\Rightarrow 1942\equiv S+2a_1+4a_2\pmod{1613}$
4. Subtract (3)-(2): $(1942-1494)\equiv(S-S)+(2a_1-a_1)+(4a_2-a_2)\pmod{1613}\Rightarrow 448\equiv a_1+3a_2\pmod{1613}$ and rewrite this as $a_1\equiv 448-3a_2\pmod{1613}$

There are $p$ possible values for $a_1$. She knows that $[448,445,442,\ldots]$ always decreases by 3, so if $p$ were divisible by $3$ she could conclude $a_1\in[1,4,7,\ldots]$. However, $p$ is prime, so she cannot conclude this. Thus, using a finite field avoids this possible attack.

Also, even though Eve can conclude that $S\equiv 1046+2a_2\pmod{1613}$, it does not provide any additional information, since the "wrapping around" behavior of modular arithmetic prevents the leakage of "S is even", unlike the example with integer arithmetic above.
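
A brute-force check in Python makes the same point for this small field. For each possible $a_2$ the sketch derives the $a_1$ and $S$ that are consistent with Eve's two shares, and then counts how many distinct secrets remain possible. It is only an illustration for $p=1613$ and these two shares, not a proof; the variable names are hypothetical.

```python
P = 1613
(x0, y0), (x1, y1) = (1, 1494), (2, 329)  # the two shares Eve holds

secrets_seen = {}
for a2 in range(P):
    a1 = (y1 - y0 - a2 * (x1**2 - x0**2)) % P  # a1 = 448 - 3*a2 (mod P)
    s = (y0 - a1 * x0 - a2 * x0**2) % P        # S = 1046 + 2*a2 (mod P)
    # sanity check: this (s, a1, a2) reproduces both of Eve's shares
    assert (s + a1 * x0 + a2 * x0**2) % P == y0
    assert (s + a1 * x1 + a2 * x1**2) % P == y1
    secrets_seen[s] = (a1, a2)

print(len(secrets_seen) == P)  # True: every value 0..1612 is still possible
print(secrets_seen[1234])      # (166, 94), the coefficients actually used
```

Each candidate secret corresponds to exactly one pair $(a_1,a_2)$, so two shares do not make any value of $S$ more likely than another.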

Python code

To keep the code clear, a prime field is used here. In practice, for convenience a scheme constructed using a smaller binary field may be separately applied to small substrings of bits of the secret (e.g. GF(256) for byte-wise application), without loss of security. The strict condition that the size of the field must be larger than the number of shares must still be respected (e.g., if the number of shares could exceed 255, the field GF(256) might be replaced by say GF(65536)).

    """
    The following Python implementation of Shamir's secret sharing is
    released into the Public Domain under the terms of CC0 and OWFa:
    https://creativecommons.org/publicdomain/zero/1.0/
    http://www.openwebfoundation.org/legal/the-owf-1-0-agreements/owfa-1-0

    See the bottom few lines for usage. Tested on Python 2 and 3.
    """

    from __future__ import division
    from __future__ import print_function

    import random
    import functools

    # 12th Mersenne Prime
    _PRIME = 2 ** 127 - 1

    # SystemRandom must be instantiated before its randint method is used
    _RINT = functools.partial(random.SystemRandom().randint, 0)

    def _eval_at(poly, x, prime):
        """Evaluates polynomial (coefficient tuple) at x, used to generate a
        shamir pool in make_random_shares below.
        """
        accum = 0
        for coeff in reversed(poly):
            accum *= x
            accum += coeff
            accum %= prime
        return accum

    def make_random_shares(secret, minimum, shares, prime=_PRIME):
        """
        Generates a random shamir pool for a given secret, returns share points.
        """
        if minimum > shares:
            raise ValueError("Pool secret would be irrecoverable.")
        poly = [secret] + [_RINT(prime - 1) for i in range(minimum - 1)]
        points = [(i, _eval_at(poly, i, prime))
                  for i in range(1, shares + 1)]
        return points

    def _extended_gcd(a, b):
        """
        Division in integers modulus p means finding the inverse of the
        denominator modulo p and then multiplying the numerator by this
        inverse (Note: inverse of A is B such that A*B % p == 1). This can
        be computed via the extended Euclidean algorithm
        http://en.wikipedia.org/wiki/Modular_multiplicative_inverse#Computation
        """
        x = 0
        last_x = 1
        y = 1
        last_y = 0
        while b != 0:
            quot = a // b
            a, b = b, a % b
            x, last_x = last_x - quot * x, x
            y, last_y = last_y - quot * y, y
        return last_x, last_y

    def _divmod(num, den, p):
        """Compute num / den modulo prime p

        To explain this, the result will be such that:
        den * _divmod(num, den, p) % p == num
        """
        inv, _ = _extended_gcd(den, p)
        return num * inv

    def _lagrange_interpolate(x, x_s, y_s, p):
        """
        Find the y-value for the given x, given k (x, y) points;
        k points determine a polynomial of degree at most k - 1.
        """
        k = len(x_s)
        assert k == len(set(x_s)), "points must be distinct"
        def PI(vals):  # upper-case PI -- product of inputs
            accum = 1
            for v in vals:
                accum *= v
            return accum
        nums = []  # avoid inexact division
        dens = []
        for i in range(k):
            others = list(x_s)
            cur = others.pop(i)
            nums.append(PI(x - o for o in others))
            dens.append(PI(cur - o for o in others))
        den = PI(dens)
        num = sum([_divmod(nums[i] * den * y_s[i] % p, dens[i], p)
                   for i in range(k)])
        return (_divmod(num, den, p) + p) % p

    def recover_secret(shares, prime=_PRIME):
        """
        Recover the secret from share points
        (points (x,y) on the polynomial).
        """
        if len(shares) < 3:
            raise ValueError("need at least three shares")
        x_s, y_s = zip(*shares)
        return _lagrange_interpolate(0, x_s, y_s, prime)

    def main():
        """Main function"""
        secret = 1234
        shares = make_random_shares(secret, minimum=3, shares=6)

        print('Secret: ', secret)
        print('Shares:')
        if shares:
            for share in shares:
                print('  ', share)

        print('Secret recovered from minimum subset of shares: ',
              recover_secret(shares[:3]))
        print('Secret recovered from a different minimum subset of shares: ',
              recover_secret(shares[-3:]))

    if __name__ == '__main__':
        main()
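
The byte-wise GF(256) variant mentioned before the listing could look roughly like the sketch below. It is not taken from any particular library: the reduction polynomial $x^8+x^4+x^3+x+1$ (0x11B, the one used by AES) is an assumption, since the text does not name one, and `gf_mul`, `gf_inv`, `split_bytes` and `recover_bytes` are hypothetical names. Addition and subtraction in this field are both XOR.

```python
import secrets

def gf_mul(a, b):
    """Multiply in GF(2^8) modulo x^8 + x^4 + x^3 + x + 1 (assumed polynomial)."""
    result = 0
    while b:
        if b & 1:
            result ^= a
        a <<= 1
        if a & 0x100:
            a ^= 0x11B  # reduce back below 2^8
        b >>= 1
    return result

def gf_inv(a):
    """Inverse as a^254, since a^255 == 1 for every non-zero element."""
    result = 1
    for _ in range(254):
        result = gf_mul(result, a)
    return result

def split_bytes(secret, k, n):
    """Share each byte independently; share i is (x=i, bytes of f_b(i))."""
    assert 1 <= k <= n <= 255  # field size must exceed the number of shares
    shares = [(x, bytearray()) for x in range(1, n + 1)]
    for byte in secret:
        coeffs = [byte] + [secrets.randbelow(256) for _ in range(k - 1)]
        for x, out in shares:
            acc = 0
            for c in reversed(coeffs):  # Horner's rule in GF(256)
                acc = gf_mul(acc, x) ^ c
            out.append(acc)
    return [(x, bytes(out)) for x, out in shares]

def recover_bytes(shares):
    """Interpolate every byte position at x = 0."""
    xs = [x for x, _ in shares]
    length = len(shares[0][1])
    secret = bytearray(length)
    for j, (xj, data) in enumerate(shares):
        weight = 1
        for m, xm in enumerate(xs):
            if m != j:
                # x_m / (x_m - x_j), where subtraction is XOR
                weight = gf_mul(weight, gf_mul(xm, gf_inv(xm ^ xj)))
        for pos in range(length):
            secret[pos] ^= gf_mul(data[pos], weight)
    return bytes(secret)

shares = split_bytes(b"vault code", k=3, n=5)
print(recover_bytes(shares[1:4]) == b"vault code")  # True
```

Each share is as long as the secret, and at most 255 shares can be issued because the x coordinates must be distinct non-zero bytes.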

References

Krenn, Stephan; Loruenser, Thomas (2023). An Introduction to Secret Sharing: A Systematic Overview and Guide for Protocol Selection. doi:10.1007/978-3-031-28161-7. ISBN 978-3-031-28160-0.
Seal/Unseal. 2022-10-02.
PreVeil Review. 2022-10-02.
Rusnak, Pavol; Kozlik, Andrew; Vejpustek, Ondrej; Susanka, Tomas; Palatinus, Marek; Hoenicke, Jochen (2017-12-18). SLIP-0039: Shamir's Secret-Sharing for Mnemonic Codes. SatoshiLabs. Retrieved 2022-10-03. "This SLIP describes a standard and interoperable implementation of Shamirs secret-sharing (SSS) and a specification for its use in backing up Hierarchical Deterministic Wallets described in BIP-0032."
Lopp, Jameson (2020-10-01). Shamir's Secret Sharing shortcomings. Retrieved 2022-10-03. "Variations of Shamir's Secret Sharing (SSS) have been implemented several times in the cryptocurrency space, only for developers to later realize that the additional complexity ended up reducing the security of the system."
.

External links

Shamir's Secret Sharing in the Crypto++ library
Shamir's Secret Sharing Scheme (ssss) – a GNU GPL implementation
sharedsecret – implementation in Go
s4 - online shamir's secret sharing tool utilizing HashiCorp's shamir secret sharing algorithm
Shamir39 - webversion on iancoleman.io
kn Secrets - webversion on a dedicated website, aiming to make Shamir's secret sharing as accessible as possible
