File Download

There are no files associated with this item.

Supplementary

Conference Paper: Representative clustering of uncertain data

TitleRepresentative clustering of uncertain data
Authors
Issue Date2014
PublisherACM.
Citation
The 20th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD'14), New York, NY., 24-27 August 2014. In KDD'14 Conference Proceedings, 2014, p. 1-10 How to Cite?
AbstractThis paper targets the problem of computing meaningful clusterings from uncertain data sets. Existing methods for clustering uncertain data compute a single clustering without any indication of its quality and reliability; thus, decisions based on their results are questionable. In this paper, we describe a framework, based on possible-worlds semantics; when applied on an uncertain dataset, it computes a set of representative clusterings, each of which has a probabilistic guarantee not to exceed some maximum distance to the ground truth clustering, i.e., the clustering of the actual (but unknown) data. Our framework can be combined with any existing clustering algorithm and it is the first to provide quality guarantees about its result. In addition, our experimental evaluation shows that our representative clusterings have a much smaller deviation from the ground truth clustering than existing approaches, thus reducing the effect of uncertainty.
Persistent Identifierhttp://hdl.handle.net/10722/199311
ISBN

 

DC FieldValueLanguage
dc.contributor.authorZuefle, Aen_US
dc.contributor.authorEmrich, Ten_US
dc.contributor.authorSchmid, KAen_US
dc.contributor.authorMamoulis, Nen_US
dc.contributor.authorZimek, Aen_US
dc.contributor.authorRenz, Men_US
dc.date.accessioned2014-07-22T01:13:04Z-
dc.date.available2014-07-22T01:13:04Z-
dc.date.issued2014en_US
dc.identifier.citationThe 20th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD'14), New York, NY., 24-27 August 2014. In KDD'14 Conference Proceedings, 2014, p. 1-10en_US
dc.identifier.isbn978-1-4503-2956-9-
dc.identifier.urihttp://hdl.handle.net/10722/199311-
dc.description.abstractThis paper targets the problem of computing meaningful clusterings from uncertain data sets. Existing methods for clustering uncertain data compute a single clustering without any indication of its quality and reliability; thus, decisions based on their results are questionable. In this paper, we describe a framework, based on possible-worlds semantics; when applied on an uncertain dataset, it computes a set of representative clusterings, each of which has a probabilistic guarantee not to exceed some maximum distance to the ground truth clustering, i.e., the clustering of the actual (but unknown) data. Our framework can be combined with any existing clustering algorithm and it is the first to provide quality guarantees about its result. In addition, our experimental evaluation shows that our representative clusterings have a much smaller deviation from the ground truth clustering than existing approaches, thus reducing the effect of uncertainty.-
dc.languageengen_US
dc.publisherACM.-
dc.relation.ispartof20th ACM SIGKDD International Conference Proceedings 2014en_US
dc.titleRepresentative clustering of uncertain dataen_US
dc.typeConference_Paperen_US
dc.identifier.emailMamoulis, N: nikos@cs.hku.hken_US
dc.identifier.authorityMamoulis, N=rp00155en_US
dc.identifier.hkuros230469en_US
dc.identifier.spage1-
dc.identifier.epage10-
dc.publisher.placeUnited States-

Export via OAI-PMH Interface in XML Formats


OR


Export to Other Non-XML Formats