ARM: Authenticated Approximate Record Matching for Outsourced Databases
In this paper, we consider the outsourcing model in which a third-party server provides data integration as a service. Identifying approximately duplicate records in databases is an essential step for the information integration processes. Most existing approaches rely on estimating the similarity of potential duplicates. The service provider returns all records from the outsourced dataset that are similar according to specific distance metrics. A major security concern of this outsourcing paradigm is whether the service provider returns sound and complete near-duplicates. In this paper, we design ARM, an authentication system for the outsourced record matching. The key idea of ARM is that besides the similar record pairs, the server returns the verification object (VO) of these similar pairs to prove their correctness. First, we design an authenticated data structure namedMB-Tree forVO construction. Second, we design a lightweight authentication method that can catch the service provider's various cheating behaviors by utilizing VOs. We perform an extensive set of experiment on real-world datasets to demonstrate that ARM can verify the record matching results with cheap cost.
MSU Digital Commons Citation
Dong, Boxiang and Wang, Wendy, "ARM: Authenticated Approximate Record Matching for Outsourced Databases" (2016). Department of Computer Science Faculty Scholarship and Creative Works. 125.