Lecture Notes in Computer Science, 2006, Volume 4165/2006, 84-99, DOI: 10.1007/11844662_7

Privacy Preserving BIRCH Algorithm for Clustering over Vertically Partitioned Databases

P. Krishna Prasad and C. Pandu Rangan

View Related Documents

Abstract

BIRCH algorithm, introduced by Zhang et al. [15], is a well known algorithm for effectively finding clusters in a large data set. The two major components of the BIRCH algorithm are CF tree construction and global clustering. However BIRCH algorithm is basically designed as an algorithm working on a single database. We propose the first novel method for running BIRCH over a vertically partitioned data sets, distributed in two different databases in a privacy preserving manner. We first provide efficient solutions to crypto primitives such as finding minimum index in a vector sum and checking if sum of two private values exceed certain threshold limit. We then use these primitives as basic tools to arrive at secure solutions to CF tree construction and single link clustering for implementing BIRCH algorithm.

Fulltext Preview

Image of the first page of the fulltext document