Automated Derivation of Parametric Data Movement Lower Bounds for Affine Programs
Researchers and practitioners have for long worked on improving the computational complexity of algorithms, focusing on reducing the number of operations needed to perform a computation. However the hardware trend nowadays clearly shows a higher performance and energy cost for data movements than computations: quality algorithms have to minimize data movements as much as possible.
The theoretical operational complexity of an algorithm is a function of the total number of operations that must be executed, regardless of the order in which they will actually be executed. But theoretical data movement (or, I/O) complexity is fundamentally different: one must consider all possible legal schedules of the operations to determine the minimal number of data movements achievable, a major theoretical challenge. I/O complexity has been studied via complex manual proofs, e.g., refined from $\Omega(n^3/\sqrt{S})$ for matrix-multiply on a cache size $S$ by Hong & Kung to $2n^3/\sqrt{S}$ by Smith et al. While asymptotic complexity may be sufficient to compare I/O potential between broadly different algorithms, the accuracy of the reasoning depends on the tightness of these I/O lower bounds. Precisely, exposing constants is essential to enable precise comparison between different algorithms: for example the $2n^3/\sqrt{S}$ lower bound allows to demonstrate the optimality of panel-panel tiling for matrix-multiplication.
\emph{We present the first static analysis to automatically derive non-asymptotic parametric expressions of data movement lower bounds with scaling constants, for arbitrary affine computations}. Our approach is fully automatic, assisting algorithm designers to reason about I/O complexity and make educated decisions about algorithmic alternatives.
Fri 19 JunDisplayed time zone: Pacific Time (US & Canada) change
16:00 - 17:00 | Static AnalysisPLDI Research Papers at PLDI Research Papers live stream Chair(s): Julian Dolby IBM Research, USA | ||
16:00 20mTalk | Automated Derivation of Parametric Data Movement Lower Bounds for Affine Programs PLDI Research Papers Auguste Olivry Inria, France, Julien Langou University of Colorado at Denver, USA, Louis-Noël Pouchet Colorado State University, USA, Saday Sadayappan University of Utah, USA, Fabrice Rastello Inria, France | ||
16:20 20mTalk | Fast Graph Simplification for Interleaved Dyck-Reachability PLDI Research Papers Yuanbo Li Georgia Institute of Technology, USA, Qirun Zhang Georgia Institute of Technology, USA, Thomas Reps University of Wisconsin-Madison, USA | ||
16:40 20mTalk | Static Analysis of Java Enterprise Applications: Frameworks and Caches, the Elephants in the Room PLDI Research Papers Anastasios Antoniadis University of Athens, Greece, Nikos Filippakis CERN, Switzerland, Paddy Krishnan Oracle Labs, Australia, Raghavendra Ramesh ConsenSys, Australia, Nicholas Allen Oracle Labs, Australia, Yannis Smaragdakis University of Athens, Greece Pre-print |