Paper 2025/054

Doubly Efficient Fuzzy Private Set Intersection for High-dimensional Data with Cosine Similarity

Hyunjung Son, Hanyang University
Seunghun Paik, Hanyang University
Yunki Kim, Hanyang University
Sunpill Kim, Hanyang University
Heewon Chung, Independent Researcher
Jae Hong Seo, Hanyang University
Abstract

Fuzzy private set intersection (Fuzzy PSI) is a cryptographic protocol for privacy-preserving similarity matching, which is one of the essential operations in various real-world applications such as facial authentication, information retrieval, or recommendation systems. Despite recent advancements in fuzzy PSI protocols, still a huge barrier remains in deploying them for these applications. The main obstacle is the high dimensionality, e.g., from 128 to 512, of data; lots of existing methods, Garimella et al. (CRYPTO’23, CRYPTO’24) or van Baarsen et al. (EUROCRYPT’24), suffer from exponential overhead on communication and/or computation cost. In addition, the dominant similarity metric in these applications is cosine similarity, which disables several optimization tricks based on assumptions for the distribution of data, e.g., techniques by Gao et al. (ASIACRYPT’24). In this paper, we propose a novel fuzzy PSI protocol for cosine similarity, called FPHE, that overcomes these limitations at the same time. FPHE features linear complexity on both computation and communication with respect to the dimension of set elements, only requiring much weaker assumption than prior works. The basic strategy of ours is to homomorphically compute cosine similarity and run an approximated comparison function, with a clever packing method for efficiency. In addition, we introduce a novel proof technique to harmonize the approximation error from the sign function with the noise flooding, proving the security of FPHE under the semi-honest model. Moreover, we show that our construction can be extended to support various functionalities, such as labeled or circuit fuzzy PSI. Through experiments, we show that FPHE can perform fuzzy PSI over 512-dimensional data in a few minutes, which was computationally infeasible for all previous proposals under the same assumption as ours.

Metadata
Available format(s)
PDF
Category
Cryptographic protocols
Publication info
Preprint.
Keywords
Fuzzy Private Set IntersectionFuzzy MatchingHomomorphic Encryption
Contact author(s)
dk9050rx @ hanyang ac kr
whitesoonguh @ hanyang ac kr
dbsrl7665 @ hanyang ac kr
ksp0352 @ hanyang ac kr
tyler heewonchung @ gmail com
jaehongseo @ hanyang ac kr
History
2025-01-14: revised
2025-01-14: received
See all versions
Short URL
https://ia.cr/2025/054
License
Creative Commons Attribution
CC BY

BibTeX

@misc{cryptoeprint:2025/054,
      author = {Hyunjung Son and Seunghun Paik and Yunki Kim and Sunpill Kim and Heewon Chung and Jae Hong Seo},
      title = {Doubly Efficient Fuzzy Private Set Intersection  for High-dimensional Data with Cosine Similarity},
      howpublished = {Cryptology {ePrint} Archive, Paper 2025/054},
      year = {2025},
      url = {https://eprint.iacr.org/2025/054}
}
Note: In order to protect the privacy of readers, eprint.iacr.org does not use cookies or embedded third party content.