CGO 2019
2019 IEEE/ACM International Symposium on Code Generation and Optimization (CGO)
Powered by
Conference Publishing Consulting

2019 IEEE/ACM International Symposium on Code Generation and Optimization (CGO), February 16–20, 2019, Washington, DC, USA

CGO 2019 – Proceedings

Contents - Abstracts - Authors

Frontmatter

Title Page
Message from the General Chair
Message from the Program Chairs
CGO 2019 Organization
Report from the Artifact Evaluation Committee
Sponsors
ACM Student Research Competition

Keynote

Rethinking Compilation in a Heterogeneous World (Keynote)
Michael O'Boyle
(University of Edinburgh, UK)

Research Papers

Binary Optimization

BOLT: A Practical Binary Optimizer for Data Centers and Beyond
Maksim Panchenko, Rafael Auler, Bill Nell, and Guilherme Ottoni
(Facebook, USA)
Artifacts Available Artifacts Reusable Results Replicated
Janus: Statically-Driven and Profile-Guided Automatic Dynamic Binary Parallelisation
Ruoyu Zhou and Timothy M. Jones
(University of Cambridge, UK)
Artifacts Available Artifacts Functional Results Replicated

Bugs and Security

Smokestack: Thwarting DOP Attacks with Runtime Stack Layout Randomization
Misiker Tadesse Aga and Todd Austin
(University of Michigan, USA)
Automatic Equivalence Checking for Assembly Implementations of Cryptography Libraries
Jay P. Lim and Santosh Nagarakatte
(Rutgers University, USA)
Publisher's Version Info Artifacts Available Artifacts Functional Results Replicated
CSOD: Context-Sensitive Overflow Detection
Hongyu Liu, Sam Silvestro, Xiaoyin Wang, Lide Duan, and Tongping Liu
(University of Texas at San Antonio, USA)
Reasoning about the Node.js Event Loop using Async Graphs
Haiyang Sun, Daniele Bonetta, Filippo Schiavio, and Walter Binder
(USI Lugano, Switzerland; Oracle Labs, USA)
Artifacts Available Artifacts Functional Results Replicated

GPUs and Tensors

Automatic Generation of Warp-Level Primitives and Atomic Instructions for Fast and Portable Parallel Reduction on GPUs
Simon Garcia De Gonzalo, Sitao Huang, Juan Gómez-Luna, Simon Hammond, Onur Mutlu, and Wen-mei Hwu
(University of Illinois at Urbana-Champaign, USA; ETH Zurich, Switzerland; Sandia National Laboratories, USA)
A Code Generator for High-Performance Tensor Contractions on GPUs
Jinsung Kim, Aravind Sukumaran-Rajam, Vineeth Thumma, Sriram Krishnamoorthy, Ajay Panyala, Louis-Noël Pouchet, Atanas Rountev, and P. Sadayappan
(Ohio State University, USA; Pacific Northwest National Laboratory, USA; Colorado State University, USA)
Artifacts Available Artifacts Functional Results Replicated

Potpourri

Transforming Query Sequences for High-Throughput B+ Tree Processing on Many-Core Processors
Ruiqin Tian, Junqiao Qiu, Zhijia Zhao, Xu Liu, and Bin Ren
(College of William and Mary, USA; University of California at Riverside, USA)
Artifacts Available Artifacts Functional Results Replicated
Quantifying and Reducing Execution Variance in STM via Model Driven Commit Optimization
Girish Mururu, Ada Gavrilovska, and Santosh Pande
(Georgia Institute of Technology, USA)
Artifacts Available Artifacts Functional Results Replicated
White-Box Program Tuning
Wen-Chuan Lee, Yingqi Liu, Peng Liu, Shiqing Ma, Hongjun Choi, Xiangyu Zhang, and Rajiv Gupta
(Purdue University, USA; University of California at Riverside, USA)
Generation of In-Bounds Inputs for Arrays in Memory-Unsafe Languages
Marcus Rodrigues, Breno Guimarães, and Fernando Magno Quintão Pereira
(Federal University of Minas Gerais, Brazil)
Info

Code Generation

Function Merging by Sequence Alignment
Rodrigo C. O. Rocha, Pavlos Petoumenos, Zheng Wang, Murray Cole, and Hugh Leather
(University of Edinburgh, UK; Lancaster University, UK)
Artifacts Available Artifacts Functional Results Replicated
An Optimization-Driven Incremental Inline Substitution Algorithm for Just-in-Time Compilers
Aleksandar Prokopec, Gilles Duboscq, David Leopoldseder, and Thomas Würthinger
(Oracle Labs, Switzerland; JKU Linz, Austria)
Artifacts Available Artifacts Functional Results Replicated
Tensor Algebra Compilation with Workspaces
Fredrik Kjolstad, Peter Ahrens, Shoaib Kamil, and Saman Amarasinghe
(Massachusetts Institute of Technology, USA; Adobe, USA)

Kernel Optimization

Tiramisu: A Polyhedral Compiler for Expressing Fast and Portable Code
Riyadh Baghdadi, Jessica Ray, Malek Ben Romdhane, Emanuele Del Sozzo, Abdurrahman Akkas, Yunming Zhang, Patricia Suriana, Shoaib Kamil, and Saman Amarasinghe
(Massachusetts Institute of Technology, USA; Politecnico di Milano, Italy; Google, USA; Adobe, USA)
Info Artifacts Available Artifacts Reusable Results Replicated
Super-Node SLP: Optimized Vectorization for Code Sequences Containing Operators and Their Inverse Elements
Vasileios Porpodas, Rodrigo C. O. Rocha, Evgueni Brevnov, Luís F. W. Góes, and Timothy Mattson
(Intel, USA; University of Edinburgh, UK; PUC-MG, Brazil)
Locus: A System and a Language for Program Optimization
Thiago S. F. X. Teixeira, Corinne Ancourt, David Padua, and William Gropp
(University of Illinois at Urbana-Champaign, USA; MINES ParisTech, France)

GPUs

Decoding CUDA Binary
Ari B. Hayes, Fei Hua, Jin Huang, Yanhao Chen, and Eddy Z. Zhang
(Rutgers University, USA)
Artifacts Available Artifacts Reusable Results Replicated
From Loop Fusion to Kernel Fusion: A Domain-Specific Approach to Locality Optimization
Bo Qiao, Oliver Reiche, Frank Hannig, and Jürgen Teich
(University of Erlangen-Nuremberg, Germany)
Publisher's Version Info Artifacts Available Artifacts Reusable Results Replicated
IGC: The Open Source Intel Graphics Compiler
Anupama Chandrasekhar, Gang Chen, Po-Yu Chen, Wei-Yu Chen, Junjie Gu, Peng Guo, Shruthi Hebbur Prasanna Kumar, Guei-Yuan Lueh, Pankaj Mistry, Wei Pan, Thomas Raoux, and Konrad Trifunovic
(Intel, USA; Intel, Poland)

Student Research Competition

Undergraduate

Automatic Parallelization of Irregular x86-64 Loops
Brandon Neth and Michelle Mills Strout
(University of Arizona, USA)

Graduate

A Shared BTB Design for Multicore Systems
Moumita Das, Ansuman Banerjee, and Bhaskar Sardar
(Jadavpur University, India; Indian Statistical Institute, India)
Optimizing RNA-RNA Interaction Computations
Swetha Varadarajan
(Colorado State University, USA)
Code Generation from Formal Models for Automatic RTOS Portability
Renata Martins Gomes and Marcel Baunach
(Graz University of Technology, Austria)
Understanding RDMA Behavior in NUMA Systems
Jacob Nelson and Roberto Palmieri
(Lehigh University, USA)
Translating Traditional SIMD Instructions to Vector Length Agnostic Architectures
Sheng-Yu Fu and Wei-Chung Hsu
(National Taiwan University, Taiwan)
Accelerating GPU Computing at Runtime with Binary Optimization
Guangli Li, Lei Liu, and Xiaobing Feng
(Institute of Computing Technology at Chinese Academy of Sciences, China; University of Chinese Academy of Sciences, China)
Extending LLVM for Lightweight SPMD Vectorization: Using SIMD and Vector Instructions Easily from Any Language
Robin Kruppe, Julian Oppermann, Lukas Sommer, and Andreas Koch
(TU Darmstadt, Germany)
Multi-target Compiler for the Deployment of Machine Learning Models
Oscar Castro-Lopez and Ines F. Vega-Lopez
(Autonomous University of Sinaloa, Mexico)
A Tool for Performance Analysis of GPU-Accelerated Applications
Keren Zhou and John Mellor-Crummey
(Rice University, USA)
Info
Kernel Fusion/Decomposition for Automatic GPU-Offloading
Alok Mishra, Martin Kong, and Barbara Chapman
(Stony Brook University, USA; Brookhaven National Laboratory, USA)
Translating CUDA to OpenCL for Hardware Generation using Neural Machine Translation
Yonghae Kim and Hyesoon Kim
(Georgia Institute of Technology, USA)

proc time: 0.79