Representative clustering of uncertain data

Zuefle, A; Emrich, T; Schmid, KA; Mamoulis, N; Zimek, A; Renz, M

File Download

There are no files associated with this item.

Supplementary

Citations:
Appears in Collections:
- Computer Science: Conference papers

Conference Paper: Representative clustering of uncertain data

Title	Representative clustering of uncertain data
Authors	Zuefle, A Emrich, T Schmid, KA Mamoulis, N Zimek, A Renz, M
Issue Date	2014
Publisher	ACM.
Citation	The 20th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD'14), New York, NY., 24-27 August 2014. In KDD'14 Conference Proceedings, 2014, p. 1-10 How to Cite?
Abstract	This paper targets the problem of computing meaningful clusterings from uncertain data sets. Existing methods for clustering uncertain data compute a single clustering without any indication of its quality and reliability; thus, decisions based on their results are questionable. In this paper, we describe a framework, based on possible-worlds semantics; when applied on an uncertain dataset, it computes a set of representative clusterings, each of which has a probabilistic guarantee not to exceed some maximum distance to the ground truth clustering, i.e., the clustering of the actual (but unknown) data. Our framework can be combined with any existing clustering algorithm and it is the first to provide quality guarantees about its result. In addition, our experimental evaluation shows that our representative clusterings have a much smaller deviation from the ground truth clustering than existing approaches, thus reducing the effect of uncertainty.
Persistent Identifier	http://hdl.handle.net/10722/199311
ISBN	978-1-4503-2956-9

DC Field	Value	Language
dc.contributor.author	Zuefle, A	en_US
dc.contributor.author	Emrich, T	en_US
dc.contributor.author	Schmid, KA	en_US
dc.contributor.author	Mamoulis, N	en_US
dc.contributor.author	Zimek, A	en_US
dc.contributor.author	Renz, M	en_US
dc.date.accessioned	2014-07-22T01:13:04Z	-
dc.date.available	2014-07-22T01:13:04Z	-
dc.date.issued	2014	en_US
dc.identifier.citation	The 20th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD'14), New York, NY., 24-27 August 2014. In KDD'14 Conference Proceedings, 2014, p. 1-10	en_US
dc.identifier.isbn	978-1-4503-2956-9	-
dc.identifier.uri	http://hdl.handle.net/10722/199311	-
dc.description.abstract	This paper targets the problem of computing meaningful clusterings from uncertain data sets. Existing methods for clustering uncertain data compute a single clustering without any indication of its quality and reliability; thus, decisions based on their results are questionable. In this paper, we describe a framework, based on possible-worlds semantics; when applied on an uncertain dataset, it computes a set of representative clusterings, each of which has a probabilistic guarantee not to exceed some maximum distance to the ground truth clustering, i.e., the clustering of the actual (but unknown) data. Our framework can be combined with any existing clustering algorithm and it is the first to provide quality guarantees about its result. In addition, our experimental evaluation shows that our representative clusterings have a much smaller deviation from the ground truth clustering than existing approaches, thus reducing the effect of uncertainty.	-
dc.language	eng	en_US
dc.publisher	ACM.	-
dc.relation.ispartof	20th ACM SIGKDD International Conference Proceedings 2014	en_US
dc.title	Representative clustering of uncertain data	en_US
dc.type	Conference_Paper	en_US
dc.identifier.email	Mamoulis, N: nikos@cs.hku.hk	en_US
dc.identifier.authority	Mamoulis, N=rp00155	en_US
dc.identifier.hkuros	230469	en_US
dc.identifier.spage	1	-
dc.identifier.epage	10	-
dc.publisher.place	United States	-

File Download

Supplementary

Conference Paper: Representative clustering of uncertain data

Export via OAI-PMH Interface in XML Formats

OR

Export to Other Non-XML Formats