File Download
There are no files associated with this item.
Supplementary
-
Citations:
- Scopus: 0
- Appears in Collections:
Conference Paper: Differentially private clustering in high-dimensional Euclidean spaces
Title | Differentially private clustering in high-dimensional Euclidean spaces |
---|---|
Authors | |
Issue Date | 2017 |
Citation | 34th International Conference on Machine Learning, ICML 2017, 2017, v. 1, p. 501-522 How to Cite? |
Abstract | We study the problem of clustering sensitive data while preserving the privacy of individuals represented in the dataset, which has broad applications in practical machine learning and data analysis tasks. Although the problem has been widely studied in the context of lowdimensional, discrete spaces, much remains unknown concerning private clustering in highdimensional Euclidean spaces Rd. In this work, we give differentially private and efficient algorithms achieving strong guarantees for k-means and κ-median clustering when d = Ω(polylog(n)). Our algorithm achieves clustering loss at most log3 (n) OPT + poly (log n, d, κ), advancing the state-of-the-art result of VdOPT+ poly(log n, dd, κd). We also study the case where the data points are s-sparse and show that the clustering loss can scale logarithmically with d, i.e., log3(n)OPT + poly(log n,log d,κ,S). Experiments on both synthetic and real datasets verify the effectiveness of the proposed method. |
Persistent Identifier | http://hdl.handle.net/10722/341228 |
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Balean, Maria Fiorina | - |
dc.contributor.author | Dick, Travis | - |
dc.contributor.author | Liang, Yingyu | - |
dc.contributor.author | Mou, Wenlong | - |
dc.contributor.author | Zhang, Hongyang | - |
dc.date.accessioned | 2024-03-13T08:41:10Z | - |
dc.date.available | 2024-03-13T08:41:10Z | - |
dc.date.issued | 2017 | - |
dc.identifier.citation | 34th International Conference on Machine Learning, ICML 2017, 2017, v. 1, p. 501-522 | - |
dc.identifier.uri | http://hdl.handle.net/10722/341228 | - |
dc.description.abstract | We study the problem of clustering sensitive data while preserving the privacy of individuals represented in the dataset, which has broad applications in practical machine learning and data analysis tasks. Although the problem has been widely studied in the context of lowdimensional, discrete spaces, much remains unknown concerning private clustering in highdimensional Euclidean spaces Rd. In this work, we give differentially private and efficient algorithms achieving strong guarantees for k-means and κ-median clustering when d = Ω(polylog(n)). Our algorithm achieves clustering loss at most log3 (n) OPT + poly (log n, d, κ), advancing the state-of-the-art result of VdOPT+ poly(log n, dd, κd). We also study the case where the data points are s-sparse and show that the clustering loss can scale logarithmically with d, i.e., log3(n)OPT + poly(log n,log d,κ,S). Experiments on both synthetic and real datasets verify the effectiveness of the proposed method. | - |
dc.language | eng | - |
dc.relation.ispartof | 34th International Conference on Machine Learning, ICML 2017 | - |
dc.title | Differentially private clustering in high-dimensional Euclidean spaces | - |
dc.type | Conference_Paper | - |
dc.description.nature | link_to_subscribed_fulltext | - |
dc.identifier.scopus | eid_2-s2.0-85048681197 | - |
dc.identifier.volume | 1 | - |
dc.identifier.spage | 501 | - |
dc.identifier.epage | 522 | - |