K-means-type algorithms on distributed memory computer

Ng, M. K.

File Download

There are no files associated with this item.

Links for fulltext

(May Require Subscription)

Publisher Website: 10.1142/S0129053300000096
Scopus: eid_2-s2.0-0034196966
WOS: WOS:000170680800001
Find via

Supplementary

Citations:
- Scopus: 0
- Web of Science: 0
Appears in Collections:
- Mathematics: Journal/Magazine Articles

Article: K-means-type algorithms on distributed memory computer

Title	K-means-type algorithms on distributed memory computer
Authors	Ng, M. K.
Keywords	Clustering Data mining K-means-type algorithm Parallel algorithms
Issue Date	2000
Citation	International Journal of High Speed Computing, 2000, v. 11, n. 2, p. 75-91 How to Cite? DOI: http://dx.doi.org/10.1142/S0129053300000096
Abstract	Partitioning a set of objects into homogeneous clusters is a fundamental operation in data mining. The k-means-type algorithm is best suited for implementing this operation because of its efficiency in clustering large numerical and categorical data sets. An efficient parallel k-means-type algorithm for clustering data sets on a distributed share-nothing parallel system is considered. It has a simple communication scheme which performs only one round of information exchange in every iteration. We show that the speedup of our algorithm is asymptotically linear when the number of objects is sufficiently large. We implement the parallel k-means-type algorithm on an IBM SP2 parallel machine. The performance studies show that the algorithm has nice parallelism in experiments.
Persistent Identifier	http://hdl.handle.net/10722/276541
ISSN	0129-0533
ISI Accession Number ID	WOS:000170680800001

DC Field	Value	Language
dc.contributor.author	Ng, M. K.	-
dc.date.accessioned	2019-09-18T08:33:55Z	-
dc.date.available	2019-09-18T08:33:55Z	-
dc.date.issued	2000	-
dc.identifier.citation	International Journal of High Speed Computing, 2000, v. 11, n. 2, p. 75-91	-
dc.identifier.issn	0129-0533	-
dc.identifier.uri	http://hdl.handle.net/10722/276541	-
dc.description.abstract	Partitioning a set of objects into homogeneous clusters is a fundamental operation in data mining. The k-means-type algorithm is best suited for implementing this operation because of its efficiency in clustering large numerical and categorical data sets. An efficient parallel k-means-type algorithm for clustering data sets on a distributed share-nothing parallel system is considered. It has a simple communication scheme which performs only one round of information exchange in every iteration. We show that the speedup of our algorithm is asymptotically linear when the number of objects is sufficiently large. We implement the parallel k-means-type algorithm on an IBM SP2 parallel machine. The performance studies show that the algorithm has nice parallelism in experiments.	-
dc.language	eng	-
dc.relation.ispartof	International Journal of High Speed Computing	-
dc.subject	Clustering	-
dc.subject	Data mining	-
dc.subject	K-means-type algorithm	-
dc.subject	Parallel algorithms	-
dc.title	K-means-type algorithms on distributed memory computer	-
dc.type	Article	-
dc.description.nature	link_to_subscribed_fulltext	-
dc.identifier.doi	10.1142/S0129053300000096	-
dc.identifier.scopus	eid_2-s2.0-0034196966	-
dc.identifier.volume	11	-
dc.identifier.issue	2	-
dc.identifier.spage	75	-
dc.identifier.epage	91	-
dc.identifier.isi	WOS:000170680800001	-
dc.identifier.issnl	0129-0533	-

File Download

Links for fulltext

(May Require Subscription)

Supplementary

Article: K-means-type algorithms on distributed memory computer

Export via OAI-PMH Interface in XML Formats

OR

Export to Other Non-XML Formats