Bera, Sabyasachi; Chatterjee, Snigdhansu - In: Statistics in Transition New Series 21 (2020) 4, pp. 123-143
We develop a technique for record linkage on high-dimensional data, where the two datasets may not have any common variable, and there may be no training set available. Our methodology is based on sparse, high-dimensional principal components. Since large and high-dimensional datasets are often...