Private set intersection explained

Private set intersection
Related To:homomorphic encryption

Private set intersection is a secure multiparty computation cryptographic technique[1] that allows two parties holding sets to compare encrypted versions of these sets in order to compute the intersection. In this scenario, neither party reveals anything to the counterparty except for the elements in the intersection.

Other variants of this exist, such as the server-client scenario, in which only the client learns the intersection of her set with the set of the server, without the server learning intersection of his set with the clients.[2]

For the comparison of data sets by cryptographic hashes on a small or predictable domain, precautions should be taken to prevent dictionary attacks.[3]

Apple uses this technique in Password Monitoring.[4] It has proposed using the technology for its announced Expanded Protections for Children [5]

In general, PSI protocols can be categorized into two broad categories: (1) traditional PSI and (2) delegated PSI. In the traditional PSI category, the data owners interact directly with each other and need to have a copy of their set at the time of the computation, e.g.,.[6] In the delegated PSI the computation of PSI and/or the storage of sets can be delegated to a third-party server (that is itself might be a passive or active adversary). The delegated PSI category can be further divided into two classes: (a) those that support one-off delegation, and (b) those that support repeated delegation. The PSI protocols that support one-off delegation require the data owner to re-encode its data and send the encoded data to the server for each computation, e.g.,.[7] Those that support repeated delegation allow the data owners to upload their (encrypted) data to the server only once, and then re-use it many times for each computation carried out but the server, e.g.,[8]

Recently, researchers have proposed a variant of PSI protocol (in both traditional and delegated categories) that support data update, e.g., .[9] [10] This type of PSI protocol lets data owners insert/delete set elements into/from their data with low overheads and in a privacy-preserving manner.

Educational example

This educational example demonstrated the key idea of PSI, but does not provide real-world cryptographic security (hence should not be used with real-world data).

  1. Example sets

party_a_set = party_b_set =

  1. Hashing the elements in both sets

hashed_party_a_set = hashed_party_b_set =

  1. Finding the intersection of the hashed sets

intersection = hashed_party_a_set.intersection(hashed_party_b_set)

  1. Printing hashed intersection for demonstration

print(intersection)

References

  1. Book: Fast Private Set Intersection from Homomorphic Encryption. Chen. Hao. Laine. Kim. Rindal. Peter. 2018-05-16. Association for Computing Machinery . 9781450349468. en-US.
  2. Book: Pinkas, Benny. Private Set Intersection. en-US.
  3. Book: Ihle. Cornelius. Schubotz. Moritz. Meuschke. Norman. Gipp. Bela. Proceedings of the ACM/IEEE Joint Conference on Digital Libraries in 2020 . A First Step Towards Content Protecting Plagiarism Detection . 2020-08-02. en. Virtual Event China. ACM. 341–344. 10.1145/3383583.3398620. 2005.11504. 978-1-4503-7585-6. free.
  4. Web site: Password Monitoring. 8 August 2021.
  5. Web site: Child Safety. 8 August 2021.
  6. Michael J. Freedman. Kobbi. Nissim . Benny. Pinkas . Efficient private matching and set intersection. International Conference on the Theory and Applications of Cryptographic Techniques'04: Proceedings . Lecture Notes in Computer Science . 1–19. 2004 . 3027 . 10.1007/978-3-540-24676-3_1 . 978-3-540-21935-4 . 10184294 .
  7. Seny. Kamara. Payman. Mohassel . Mariana. Raykova. Saeed. Sadeghian. Scaling private set intersection to billion-element sets. International Conference on Financial Cryptography and Data Security'14: Proceedings. 195–215. 2014 .
  8. Aydin. Abadi. Sotirios. Terzis. Changyu. Dong. VD-PSI: verifiable delegated private set intersection on outsourced private datasets. International Conference on Financial Cryptography and Data Security'16: Proceedings. 149–168. 2016.
  9. Aydin. Abadi. Changyu. Dong. Steven J. Murdoch. Sotirios. Terzis. Multi-party Updatable Delegated Private Set Intersection. International Conference on Financial Cryptography and Data Security'22: Proceedings. 2022.
  10. Saikrishna. Badrinarayanan. Peihan . Miao. Tiancheng. Xie. Updatable Private Set Intersection. Privacy Enhancing Technologies'22:Proceedings. 2022. 2022. 2. 378–406. 10.2478/popets-2022-0051. 239000070.