File Download
Links for fulltext
(May Require Subscription)
- Publisher Website: 10.1109/TKDE.2010.82
- Scopus: eid_2-s2.0-77955150205
- WOS: WOS:000280134800003
- Find via
Supplementary
- Citations:
- Appears in Collections:
Article: Clustering uncertain data using voronoi diagrams and R-tree index
Title | Clustering uncertain data using voronoi diagrams and R-tree index | ||||
---|---|---|---|---|---|
Authors | |||||
Keywords | clustering indexing methods object hierarchies Uncertainty | ||||
Issue Date | 2010 | ||||
Publisher | I E E E. The Journal's web site is located at http://www.computer.org/tkde | ||||
Citation | Ieee Transactions On Knowledge And Data Engineering, 2010, v. 22 n. 9, p. 1219-1233 How to Cite? | ||||
Abstract | We study the problem of clustering uncertain objects whose locations are described by probability density functions (pdfs). We show that the UK-means algorithm, which generalizes the k-means algorithm to handle uncertain objects, is very inefficient. The inefficiency comes from the fact that UK-means computes expected distances (EDs) between objects and cluster representatives. For arbitrary pdfs, expected distances are computed by numerical integrations, which are costly operations. We propose pruning techniques that are based on Voronoi diagrams to reduce the number of expected distance calculations. These techniques are analytically proven to be more effective than the basic bounding-box-based technique previously known in the literature. We then introduce an R-tree index to organize the uncertain objects so as to reduce pruning overheads. We conduct experiments to evaluate the effectiveness of our novel techniques. We show that our techniques are additive and, when used in combination, significantly outperform previously known methods. © 2006 IEEE. | ||||
Persistent Identifier | http://hdl.handle.net/10722/129978 | ||||
ISSN | 2023 Impact Factor: 8.9 2023 SCImago Journal Rankings: 2.867 | ||||
ISI Accession Number ID |
Funding Information: This research is supported by Hong Kong Research Grants Council Grant HKU 7134/06E. | ||||
References | |||||
Grants |
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Kao, B | en_HK |
dc.contributor.author | Lee, SD | en_HK |
dc.contributor.author | Lee, FKF | en_HK |
dc.contributor.author | Cheung, DW | en_HK |
dc.contributor.author | Ho, WS | en_HK |
dc.date.accessioned | 2010-12-23T08:45:06Z | - |
dc.date.available | 2010-12-23T08:45:06Z | - |
dc.date.issued | 2010 | en_HK |
dc.identifier.citation | Ieee Transactions On Knowledge And Data Engineering, 2010, v. 22 n. 9, p. 1219-1233 | en_HK |
dc.identifier.issn | 1041-4347 | en_HK |
dc.identifier.uri | http://hdl.handle.net/10722/129978 | - |
dc.description.abstract | We study the problem of clustering uncertain objects whose locations are described by probability density functions (pdfs). We show that the UK-means algorithm, which generalizes the k-means algorithm to handle uncertain objects, is very inefficient. The inefficiency comes from the fact that UK-means computes expected distances (EDs) between objects and cluster representatives. For arbitrary pdfs, expected distances are computed by numerical integrations, which are costly operations. We propose pruning techniques that are based on Voronoi diagrams to reduce the number of expected distance calculations. These techniques are analytically proven to be more effective than the basic bounding-box-based technique previously known in the literature. We then introduce an R-tree index to organize the uncertain objects so as to reduce pruning overheads. We conduct experiments to evaluate the effectiveness of our novel techniques. We show that our techniques are additive and, when used in combination, significantly outperform previously known methods. © 2006 IEEE. | en_HK |
dc.language | eng | en_US |
dc.publisher | I E E E. The Journal's web site is located at http://www.computer.org/tkde | en_HK |
dc.relation.ispartof | IEEE Transactions on Knowledge and Data Engineering | en_HK |
dc.rights | ©2010 IEEE. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE. | - |
dc.subject | clustering | en_HK |
dc.subject | indexing methods | en_HK |
dc.subject | object hierarchies | en_HK |
dc.subject | Uncertainty | en_HK |
dc.title | Clustering uncertain data using voronoi diagrams and R-tree index | en_HK |
dc.type | Article | en_HK |
dc.identifier.openurl | http://library.hku.hk:4550/resserv?sid=HKU:IR&issn=1041-4347&volume=22&issue=9&spage=1219&epage=1233&date=2010&atitle=Clustering+uncertain+data+using+voronoi+diagrams+and+R-tree+index | - |
dc.identifier.email | Kao, B: kao@cs.hku.hk | en_HK |
dc.identifier.email | Cheung, DW: dcheung@cs.hku.hk | en_HK |
dc.identifier.email | Ho, WS: wsho@cs.hku.hk | en_HK |
dc.identifier.authority | Kao, B=rp00123 | en_HK |
dc.identifier.authority | Cheung, DW=rp00101 | en_HK |
dc.identifier.authority | Ho, WS=rp01730 | en_HK |
dc.description.nature | published_or_final_version | - |
dc.identifier.doi | 10.1109/TKDE.2010.82 | en_HK |
dc.identifier.scopus | eid_2-s2.0-77955150205 | en_HK |
dc.identifier.hkuros | 176926 | en_US |
dc.relation.references | http://www.scopus.com/mlt/select.url?eid=2-s2.0-77955150205&selection=ref&src=s&origin=recordpage | en_HK |
dc.identifier.volume | 22 | en_HK |
dc.identifier.issue | 9 | en_HK |
dc.identifier.spage | 1219 | en_HK |
dc.identifier.epage | 1233 | en_HK |
dc.identifier.isi | WOS:000280134800003 | - |
dc.publisher.place | United States | en_HK |
dc.relation.project | Computational issues in mining uncertain data | - |
dc.identifier.scopusauthorid | Kao, B=35221592600 | en_HK |
dc.identifier.scopusauthorid | Lee, SD=7601400741 | en_HK |
dc.identifier.scopusauthorid | Lee, FKF=36195751200 | en_HK |
dc.identifier.scopusauthorid | Cheung, DW=34567902600 | en_HK |
dc.identifier.scopusauthorid | Ho, WS=7402968940 | en_HK |
dc.identifier.issnl | 1041-4347 | - |