- Title
Optimizing UPC programs for multi-core systems.
- Authors
Yili Zheng
- Abstract
The Partitioned Global Address Space (PGAS) model of Unified Parallel C (UPC) can help users express and manage application data locality on non-uniform memory access (NUMA) multi-core shared-memory systems to get good performance. First, we describe, with examples and quantitative performance results, several UPC program optimization techniques that are important to achieving good performance on NUMA multi-core computers. Second, we use two numerical computing kernels, parallel matrix-matrix multiplication and parallel 3-D FFT, to demonstrate the end-to-end development and optimization of UPC applications. Our results show that the optimized UPC programs achieve very good and scalable performance on current multi-core systems and can even outperform vendor-optimized libraries in some cases.
- Subjects
UPC (Computer program language); COMPUTER storage devices; PROGRAMMING languages; DIGITAL libraries; ARTIFICIAL languages
- Publication
Scientific Programming, 2010, Vol 18, Issue 3/4, p183
- ISSN
1058-9244
- Publication type
Article
- DOI
10.1155/2010/646829