File Download
There are no files associated with this item.
Supplementary
-
Citations:
- Appears in Collections:
Conference Paper: Representative clustering of uncertain data
Title | Representative clustering of uncertain data |
---|---|
Authors | |
Issue Date | 2014 |
Publisher | ACM. |
Citation | The 20th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD'14), New York, NY., 24-27 August 2014. In KDD'14 Conference Proceedings, 2014, p. 1-10 How to Cite? |
Abstract | This paper targets the problem of computing meaningful clusterings from uncertain data sets. Existing methods for clustering uncertain data compute a single clustering without any indication of its quality and reliability; thus, decisions based on their results are questionable. In this paper, we describe a framework, based on possible-worlds semantics; when applied on an uncertain dataset, it computes a set of representative clusterings, each of which has a probabilistic guarantee not to exceed some maximum distance to the ground truth clustering, i.e., the clustering of the actual (but unknown) data. Our framework can be combined with any existing clustering algorithm and it is the first to provide quality guarantees about its result. In addition, our experimental evaluation shows that our representative clusterings have a much smaller deviation from the ground truth clustering than existing approaches, thus reducing the effect of uncertainty. |
Persistent Identifier | http://hdl.handle.net/10722/199311 |
ISBN |
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Zuefle, A | en_US |
dc.contributor.author | Emrich, T | en_US |
dc.contributor.author | Schmid, KA | en_US |
dc.contributor.author | Mamoulis, N | en_US |
dc.contributor.author | Zimek, A | en_US |
dc.contributor.author | Renz, M | en_US |
dc.date.accessioned | 2014-07-22T01:13:04Z | - |
dc.date.available | 2014-07-22T01:13:04Z | - |
dc.date.issued | 2014 | en_US |
dc.identifier.citation | The 20th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD'14), New York, NY., 24-27 August 2014. In KDD'14 Conference Proceedings, 2014, p. 1-10 | en_US |
dc.identifier.isbn | 978-1-4503-2956-9 | - |
dc.identifier.uri | http://hdl.handle.net/10722/199311 | - |
dc.description.abstract | This paper targets the problem of computing meaningful clusterings from uncertain data sets. Existing methods for clustering uncertain data compute a single clustering without any indication of its quality and reliability; thus, decisions based on their results are questionable. In this paper, we describe a framework, based on possible-worlds semantics; when applied on an uncertain dataset, it computes a set of representative clusterings, each of which has a probabilistic guarantee not to exceed some maximum distance to the ground truth clustering, i.e., the clustering of the actual (but unknown) data. Our framework can be combined with any existing clustering algorithm and it is the first to provide quality guarantees about its result. In addition, our experimental evaluation shows that our representative clusterings have a much smaller deviation from the ground truth clustering than existing approaches, thus reducing the effect of uncertainty. | - |
dc.language | eng | en_US |
dc.publisher | ACM. | - |
dc.relation.ispartof | 20th ACM SIGKDD International Conference Proceedings 2014 | en_US |
dc.title | Representative clustering of uncertain data | en_US |
dc.type | Conference_Paper | en_US |
dc.identifier.email | Mamoulis, N: nikos@cs.hku.hk | en_US |
dc.identifier.authority | Mamoulis, N=rp00155 | en_US |
dc.identifier.hkuros | 230469 | en_US |
dc.identifier.spage | 1 | - |
dc.identifier.epage | 10 | - |
dc.publisher.place | United States | - |