Effective Automatic Parallelization and Locality Optimization Using the Polyhedral Model

Author : Uday Kumar Reddy Bondhugula
Total Pages : 172
Release : 2008
OCLC : 263071805

Book Synopsis: Effective Automatic Parallelization and Locality Optimization Using the Polyhedral Model, by Uday Kumar Reddy Bondhugula

Effective Automatic Parallelization and Locality Optimization Using the Polyhedral Model was written by Uday Kumar Reddy Bondhugula and released in 2008. It runs to 172 pages.

Abstract: Multicore processors have now become mainstream. The difficulty of programming these architectures to effectively tap the potential of multiple processing units is well known. Among several ways of addressing this issue, one of the most promising and simultaneously hardest approaches is automatic parallelization, which requires no effort on the part of the programmer to parallelize a program.

The polyhedral model for compiler optimization is a powerful mathematical framework based on parametric linear algebra and integer linear programming. It provides an abstraction that represents nested loop computations and their data dependences using integer points in polyhedra. Complex execution reorderings that improve performance through parallelization as well as locality enhancement are captured by affine transformations in the polyhedral model. With several recent advances, the polyhedral model has reached a level of maturity in various aspects, in particular as a powerful intermediate representation for performing transformations and for code generation after transformations. However, an approach to automatically find good transformations for communication-optimized coarse-grained parallelization together with locality optimization has been a key missing link.

This dissertation presents a new automatic transformation framework that solves this problem. The approach works by finding good affine transformations through a powerful and practical linear cost function that enables efficient tiling and fusion of sequences of arbitrarily nested loops. This in turn allows simultaneous optimization for coarse-grained parallelism and locality. Synchronization-free parallelism and pipelined parallelism at various levels can be extracted. The framework can be targeted to different parallel architectures, such as general-purpose multicores, the Cell processor, GPUs, or embedded multiprocessor SoCs.

The proposed framework has been implemented in a new end-to-end transformation tool, Pluto, that can automatically generate parallel code from regular C program sections. Experimental results from the implemented system show significant performance improvement for single-core and multicore execution over state-of-the-art research compiler frameworks as well as the best native production compilers. For several dense linear algebra kernels, code generated by Pluto beats, by a significant margin, the same kernels implemented with sequences of calls to highly tuned vendor-supplied libraries. The system also allows empirical optimization to be performed in a much wider context than has been attempted previously. In addition, Pluto can serve as the parallel code generation backend for several high-level domain-specific languages.
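The idea of treating loop iterations as integer points and reordering them by tiling can be illustrated with a small sketch. This is not code from the dissertation or from Pluto. It is a minimal Python illustration, assuming a square two-dimensional iteration domain and a hypothetical loop body such as `a[i][j] = a[i-1][j] + a[i][j-1]`. The function names (`original_order`, `tiled_order`, `respects_dependences`) and the dependence vectors in `DEPS` are assumptions made only for this example.

```python
from itertools import product


def original_order(n):
    """Integer points of the domain {(i, j) : 0 <= i, j < n} in loop order."""
    return [(i, j) for i in range(n) for j in range(n)]


def tiled_order(n, tile):
    """The same integer points, visited one tile-by-tile block at a time."""
    order = []
    for it in range(0, n, tile):          # loop over tile origins in i
        for jt in range(0, n, tile):      # loop over tile origins in j
            for i in range(it, min(it + tile, n)):
                for j in range(jt, min(jt + tile, n)):
                    order.append((i, j))
    return order


def respects_dependences(order, deps):
    """True if each iteration runs after every iteration it depends on.

    deps holds distance vectors (di, dj): iteration (i, j) reads a value
    produced by iteration (i - di, j - dj).
    """
    position = {point: k for k, point in enumerate(order)}
    for (i, j), (di, dj) in product(order, deps):
        source = (i - di, j - dj)
        if source in position and position[source] >= position[(i, j)]:
            return False
    return True


# Assumed dependence distances for a[i][j] = a[i-1][j] + a[i][j-1].
DEPS = [(1, 0), (0, 1)]
```

Under these assumptions, `tiled_order(6, 2)` visits exactly the points of `original_order(6)` and `respects_dependences(tiled_order(6, 2), DEPS)` holds, so the tiled schedule is a legal reordering. With a dependence distance of `(1, -1)` the same rectangular tiling fails the check, which hints at why a framework needs to search for a suitable affine transformation before tiling rather than tiling the loops as written.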


Effective Automatic Parallelization and Locality Optimization Using the Polyhedral Model Related Books

Effective Automatic Parallelization and Locality Optimization Using the Polyhedral Model
Language: en
Pages: 172
Authors: Uday Kumar Reddy Bondhugula
Categories: Algebras, Linear
Type: BOOK - Published: 2008 - Publisher:

Algorithms and Architectures for Parallel Processing
Language: en
Pages: 502
Authors: Joanna Kolodziej
Categories: Computers
Type: BOOK - Published: 2013-12-09 - Publisher: Springer

This two volume set LNCS 8285 and 8286 constitutes the proceedings of the 13th International Conference on Algorithms and Architectures for Parallel Processing,
Parallel Processing and Applied Mathematics
Language: en
Pages: 669
Authors: Roman Wyrzykowski
Categories: Computers
Type: BOOK - Published: 2018-03-22 - Publisher: Springer

The two-volume set LNCS 10777 and 10778 constitutes revised selected papers from the 12th International Conference on Parallel Processing and Applied Mathematic
Compiler Construction
Language: en
Pages: 318
Authors: Laurie Hendren
Categories: Computers
Type: BOOK - Published: 2008-03-18 - Publisher: Springer Science & Business Media

This book constitutes the refereed proceedings of the 17th International Conference on Compiler Construction, CC 2008, held in Budapest, Hungary, in March 2008
Information Systems Architecture and Technology: Proceedings of 39th International Conference on Information Systems Architecture and Technology – ISAT 2018
Language: en
Pages: 384
Authors: Leszek Borzemski
Categories: Technology & Engineering
Type: BOOK - Published: 2018-08-28 - Publisher: Springer

This three-volume set of books highlights major advances in the development of concepts and techniques in the area of new technologies and architectures of cont