File Download
Links for fulltext
(May Require Subscription)
- Publisher Website: 10.1109/IPDPS.2010.5470461
- Scopus: eid_2-s2.0-77954027160
Supplementary
-
Citations:
- Scopus: 0
- Appears in Collections:
Conference Paper: Adaptive sampling-based profiling techniques for optimizing the distributed JVM runtime
Title | Adaptive sampling-based profiling techniques for optimizing the distributed JVM runtime |
---|---|
Authors | |
Keywords | Access locality Correlation tracking Distributed Java virtual machine Distributed shared memory systems Dynamic load balancing Object sharing Profiling Sampling Thread affinity Thread migration Thread stack |
Issue Date | 2010 |
Publisher | IEEE, Computer Society. |
Citation | The 24th IEEE International Symposium on Parallel & Distributed Processing (IPDPS 2010), Atlanta, GA., 19-23 April 2010. In Proceedings of the 24th IPDPS, 2010, p. 1-11 How to Cite? |
Abstract | Extending the standard Java virtual machine (JVM) for cluster-awareness is a transparent approach to scaling out multithreaded Java applications. While this clustering solution is gaining momentum in recent years, efficient runtime support for fine-grained object sharing over the distributed JVM remains a challenge. The system efficiency is strongly connected to the global object sharing profile that determines the overall communication cost. Once the sharing or correlation between threads is known, access locality can be optimized by collocating highly correlated threads via dynamic thread migrations. Although correlation tracking techniques have been studied in some page-based sof Tware DSM systems, they would entail prohibitively high overheads and low accuracy when ported to fine-grained object-based systems. In this paper, we propose a lightweight sampling-based profiling technique for tracking inter-thread sharing. To preserve locality across migrations, we also propose a stack sampling mechanism for profiling the set of objects which are tightly coupled with a migrant thread. Sampling rates in both techniques can vary adaptively to strike a balance between preciseness and overhead. Such adaptive techniques are particularly useful for applications whose sharing patterns could change dynamically. The profiling results can be exploited for effective thread-to-core placement and dynamic load balancing in a distributed object sharing environment. We present the design and preliminary performance result of our distributed JVM with the profiling implemented. Experimental results show that the profiling is able to obtain over 95% accurate global sharing profiles at a cost of only a few percents of execution time increase for fine- to medium- grained applications. © 2010 IEEE. |
Persistent Identifier | http://hdl.handle.net/10722/125697 |
ISBN | |
References |
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Lam, KT | en_HK |
dc.contributor.author | Luo, Y | en_HK |
dc.contributor.author | Wang, CL | en_HK |
dc.date.accessioned | 2010-10-31T11:46:40Z | - |
dc.date.available | 2010-10-31T11:46:40Z | - |
dc.date.issued | 2010 | en_HK |
dc.identifier.citation | The 24th IEEE International Symposium on Parallel & Distributed Processing (IPDPS 2010), Atlanta, GA., 19-23 April 2010. In Proceedings of the 24th IPDPS, 2010, p. 1-11 | en_HK |
dc.identifier.isbn | 978-1-4244-6442-5 | - |
dc.identifier.uri | http://hdl.handle.net/10722/125697 | - |
dc.description.abstract | Extending the standard Java virtual machine (JVM) for cluster-awareness is a transparent approach to scaling out multithreaded Java applications. While this clustering solution is gaining momentum in recent years, efficient runtime support for fine-grained object sharing over the distributed JVM remains a challenge. The system efficiency is strongly connected to the global object sharing profile that determines the overall communication cost. Once the sharing or correlation between threads is known, access locality can be optimized by collocating highly correlated threads via dynamic thread migrations. Although correlation tracking techniques have been studied in some page-based sof Tware DSM systems, they would entail prohibitively high overheads and low accuracy when ported to fine-grained object-based systems. In this paper, we propose a lightweight sampling-based profiling technique for tracking inter-thread sharing. To preserve locality across migrations, we also propose a stack sampling mechanism for profiling the set of objects which are tightly coupled with a migrant thread. Sampling rates in both techniques can vary adaptively to strike a balance between preciseness and overhead. Such adaptive techniques are particularly useful for applications whose sharing patterns could change dynamically. The profiling results can be exploited for effective thread-to-core placement and dynamic load balancing in a distributed object sharing environment. We present the design and preliminary performance result of our distributed JVM with the profiling implemented. Experimental results show that the profiling is able to obtain over 95% accurate global sharing profiles at a cost of only a few percents of execution time increase for fine- to medium- grained applications. © 2010 IEEE. | en_HK |
dc.language | eng | en_HK |
dc.publisher | IEEE, Computer Society. | - |
dc.relation.ispartof | Proceedings of the IEEE International Symposium on Parallel and Distributed Processing, IPDPS 2010 | en_HK |
dc.rights | ©2010 IEEE. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE. | - |
dc.subject | Access locality | en_HK |
dc.subject | Correlation tracking | en_HK |
dc.subject | Distributed Java virtual machine | en_HK |
dc.subject | Distributed shared memory systems | en_HK |
dc.subject | Dynamic load balancing | en_HK |
dc.subject | Object sharing | en_HK |
dc.subject | Profiling | en_HK |
dc.subject | Sampling | en_HK |
dc.subject | Thread affinity | en_HK |
dc.subject | Thread migration | en_HK |
dc.subject | Thread stack | en_HK |
dc.title | Adaptive sampling-based profiling techniques for optimizing the distributed JVM runtime | en_HK |
dc.type | Conference_Paper | en_HK |
dc.identifier.openurl | http://library.hku.hk:4550/resserv?sid=HKU:IR&issn=978-1-4244-6442-5&volume=&spage=1&epage=11&date=2010&atitle=Adaptive+sampling-based+profiling+techniques+for+optimizing+the+distributed+JVM+runtime | - |
dc.identifier.email | Wang, CL:clwang@cs.hku.hk | en_HK |
dc.identifier.authority | Wang, CL=rp00183 | en_HK |
dc.description.nature | published_or_final_version | - |
dc.identifier.doi | 10.1109/IPDPS.2010.5470461 | en_HK |
dc.identifier.scopus | eid_2-s2.0-77954027160 | en_HK |
dc.identifier.hkuros | 179413 | en_HK |
dc.relation.references | http://www.scopus.com/mlt/select.url?eid=2-s2.0-77954027160&selection=ref&src=s&origin=recordpage | en_HK |
dc.identifier.spage | 1 | - |
dc.identifier.epage | 11 | - |
dc.description.other | The 24th IEEE International Symposium on Parallel & Distributed Processing (IPDPS 2010), Atlanta, GA., 19-23 April 2010. In Proceedings of the 24th IPDPS, 2010, p. 1-11 | - |
dc.identifier.scopusauthorid | Lam, KT=26031004100 | en_HK |
dc.identifier.scopusauthorid | Luo, Y=35759395100 | en_HK |
dc.identifier.scopusauthorid | Wang, CL=7501646188 | en_HK |