Paper 2025/422

Private Computation on Common Fuzzy Records

Kyoohyung Han, Samsung SDS
Seongkwang Kim, Samsung SDS
Yongha Son, Sungshin Women's University
Abstract

Private computation on common records refers to analyze data from two databases containing shared records without revealing personal information. As a basic requirement for private computation, the databases involved essentially need to be aligned by a common identification system. However, it is hard to expect such common identifiers in real world scenario. For this reason, multiple quasi-identifiers can be used to identify common records. As some quasi-identifiers might be missing or have typos, it is important to support fuzzy records setting. Identifying common records using quasi-identifiers requires manipulation of highly sensitive information, which could be privacy concerns. This work studies the problem of enabling such data analysis on the fuzzy records of quasi-identifiers. To this end, we propose ordered threshold-one (OTO) matching which can be efficiently realized by circuit-based private set intersection (CPSI) protocols and some multiparty computation (MPC) techniques. Furthermore, we introduce some generic encoding techniques from traditional matching rules to the OTO matching. Finally, we achieve a secure efficient private computation protocol which supports various matching rules which have already been widely used. We also demonstrate the superiority of our proposal with experimental validation. First, we empirically check that our encoding to OTO matching does not affect accuracy a lot for the benchmark datasets found in the fuzzy record matching literature. Second, we implement our protocol and achieve significantly faster performance at the cost of communication overhead compared to previous privacy-preserving record linkage (PPRL) protocols. In the case of 100K records for each dataset, our work shows 147.58MB communication cost, 10.71s setup time, and 1.97s online time, which is 7.78 times faster compared to the previous work (50.12 times faster when considering online time only).

Metadata
Available format(s)
PDF
Category
Cryptographic protocols
Publication info
Published elsewhere. PoPETs 2025(1)
DOI
https://doi.org/10.56553/popets-2025-0031
Keywords
secure multiparty computationprivacy-preserving record linkageprivate set intersection
Contact author(s)
kh89 han @ samsung com
sk39 kim @ samsung com
yongha son @ sungshin ac kr
History
2025-03-05: approved
2025-03-05: received
See all versions
Short URL
https://ia.cr/2025/422
License
Creative Commons Attribution
CC BY

BibTeX

@misc{cryptoeprint:2025/422,
      author = {Kyoohyung Han and Seongkwang Kim and Yongha Son},
      title = {Private Computation on Common Fuzzy Records},
      howpublished = {Cryptology {ePrint} Archive, Paper 2025/422},
      year = {2025},
      doi = {https://doi.org/10.56553/popets-2025-0031},
      url = {https://eprint.iacr.org/2025/422}
}
Note: In order to protect the privacy of readers, eprint.iacr.org does not use cookies or embedded third party content.