A Theoretical and Empirical Examination of Differential Privacy Mechanisms in High-Dimensional Data Environments for Public Sector Analytics
Keywords:
Differential Privacy, High-Dimensional Data, Public Sector Analytics, Data Utility, Privacy Budget, Laplace Mechanism, Gaussian MechanismAbstract
The integration of differential privacy (DP) mechanisms into public sector analytics has become increasingly critical in the era of high-dimensional data, where privacy preservation and analytical utility are often at odds. This paper provides a focused theoretical and empirical investigation of the effectiveness of prominent DP algorithms—particularly the Laplace and Gaussian mechanisms—within high-dimensional public datasets. We evaluate the performance trade-offs across varying privacy budgets and dimensionalities, using real-world census and health datasets. Our findings highlight that while noise calibration in high-dimensional settings preserves privacy, it often leads to significant utility degradation, necessitating smarter dimensionality reduction and adaptive noise distribution.
References
Dwork, C., McSherry, F., Nissim, K., & Smith, A. (2006). Calibrating Noise to Sensitivity in Private Data Analysis. Theory of Cryptography Conference, Vol. 3876, Springer.
McSherry, F., & Talwar, K. (2007). Mechanism Design via Differential Privacy. FOCS, Vol. 48, IEEE.
Chaudhuri, K., Monteleoni, C., & Sarwate, A. (2011). Differentially Private Empirical Risk Minimization. Journal of Machine Learning Research, Vol. 12, Issue 3.
Smith, A. (2011). Privacy-Preserving Statistical Estimators with Optimal Convergence Rates. STOC, Vol. 43, ACM.
Hardt, M., & Rothblum, G.N. (2012). A Multiplicative Weights Mechanism for Privacy-Preserving Data Analysis. FOCS, Vol. 53, IEEE.
Bassily, R., Smith, A., & Thakurta, A. (2014). Private Empirical Risk Minimization: Efficient Algorithms and Tight Error Bounds. FOCS, Vol. 55, IEEE.
Machanavajjhala, A., et al. (2008). Privacy: Theory meets Practice on the Map. IEEE ICDE, Vol. 24, Issue 3.
Nissim, K., Steinke, T., Wood, A., et al. (2017). Differential Privacy: A Primer for a Non-technical Audience. Vanderbilt Journal of Entertainment & Technology Law, Vol. 21, Issue 1.
Abowd, J. (2018). The U.S. Census Bureau Adopts Differential Privacy. KDD Proceedings, Vol. 24, ACM.
Wood, A., Altman, M., & Gasser, U. (2020). Differential Privacy for the 2020 U.S. Census: A Review. Harvard Data Science Review, Vol. 2, Issue 3.
Erlingsson, U., Pihur, V., & Korolova, A. (2014). RAPPOR: Randomized Aggregatable Privacy-Preserving Ordinal Response. CCS, Vol. 21, ACM.
Kifer, D., & Machanavajjhala, A. (2011). No Free Lunch in Data Privacy. SIGMOD, Vol. 40, Issue 3.
Gaboardi, M., Lim, H., Rogers, R., & Vadhan, S. (2016). Differentially Private Chi-Squared Hypothesis Testing. ICML, Vol. 33, JMLR.
Mironov, I. (2017). Rényi Differential Privacy. CSF, Vol. 30, IEEE.
Papernot, N., et al. (2017). Semi-Supervised Knowledge Transfer for Deep Learning from Private Training Data. ICLR, Vol. 5, Issue 2
Downloads
Published
Issue
Section
License
Copyright (c) 2022 Yousef Al-Mansour (Author)

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.