Maintaining tail dependence in data shuffling using t copula
Data shuffling is a recently proposed technique for masking numerical data where the confidential values are shuffled between records while maintaining all monotonic relationships between the variables in the data set. Data shuffling is based on the multivariate normal copula which assumes that there is no tail dependence in the data set. In many practical situations, however, tail dependence plays a crucial role in decision making. Hence, it is desirable that the data masking procedure be capable of preserving tail dependence when present. In this study, we provide a new data shuffling approach based on t copulas that is capable of maintaining tail dependence in the masked data in a large number of applications.
Year of publication: |
2011
|
---|---|
Authors: | Trottini, Mario ; Muralidhar, Krish ; Sarathy, Rathindra |
Published in: |
Statistics & Probability Letters. - Elsevier, ISSN 0167-7152. - Vol. 81.2011, 3, p. 420-428
|
Publisher: |
Elsevier |
Keywords: | Statistical confidentiality Copulas Data shuffling Disclosure risk Data dissemination |
Saved in:
Saved in favorites
Similar items by person
-
Muralidhar, Krish, (2014)
-
Data disclosure limitation as a decision problem
Trottini, Mario, (2008)
-
A Re-examination of the Census Bureau Reconstruction and Reidentification Attack
Muralidhar, Krish, (2022)
- More ...