default search action
22nd CGO 2024: Edinburgh, UK
- Tobias Grosser, Christophe Dubach, Michel Steuwer, Jingling Xue, Guilherme Ottoni, ernando Magno Quintão Pereira:
IEEE/ACM International Symposium on Code Generation and Optimization, CGO 2024, Edinburgh, United Kingdom, March 2-6, 2024. IEEE 2024, ISBN 979-8-3503-9509-9 - Guilherme Ottoni, Fernando Pereira, Michel Steuwer, Jingling Xue:
Welcome from the Program Chairs. v-vi - Amir Shaikhha, Mathieu Huot, Shideh Hashemian:
A Tensor Algebra Compiler for Sparse Differentiation. 1-12 - Malith Jayaweera, Martin Kong, Yanzhi Wang, David R. Kaeli:
Energy-Aware Tile Size Selection for Affine Programs on GPUs. 13-27 - Gianpietro Consolaro, Zhen Zhang, Harenome Razanajato, Nelson Lossing, Nassim Tchoulak, Adilla Susungi, Artur Cesar Araujo Alves, Renwei Zhang, Denis Barthou, Corinne Ancourt, Cédric Bastoul:
PolyTOPS: Reconfigurable and Flexible Polyhedral Scheduler. 28-40 - Katsumi Okuda, Saman P. Amarasinghe:
AskIt: Unified Programming Interface for Programming with Large Language Models. 41-54 - Volker Seeker, Chris Cummins, Murray Cole, Björn Franke, Kim M. Hazelwood, Hugh Leather:
Revealing Compiler Heuristics Through Automated Discovery and Optimization. 55-66 - Jordi Armengol-Estapé, Jackson Woodruff, Chris Cummins, Michael F. P. O'Boyle:
SLaDe: A Portable Small Language Model Decompiler for Optimized Assembly. 67-80 - Milad Hakimi, Arrvindh Shriraman:
TapeFlow: Streaming Gradient Tapes in Automatic Differentiation. 81-92 - Abhinav Jangda, Saeed Maleki, Maryam Mehri Dehnavi, Madan Musuvathi, Olli Saarikivi:
A Framework for Fine-Grained Synchronization of Dependent GPU Kernels. 93-105 - Alnis Murtovi, Giorgis Georgakoudis, Konstantinos Parasyris, Chunhua Liao, Ignacio Laguna, Bernhard Steffen:
Enhancing Performance Through Control-Flow Unmerging and Loop Unrolling on GPUs. 106-118 - Ivan R. Ivanov, Oleksandr Zinenko, Jens Domke, Toshio Endo, William S. Moses:
Retargeting and Respecializing GPU Workloads for Performance Portability. 119-132 - Ryan Swann, Muhammad Osama, Karthik Sangaiah, Jalal Mahmud:
Seer: Predictive Runtime Kernel Selection for Irregular Problems. 133-142 - Nicolas Bohm Agostini, Jude Haris, Perry Gibson, Malith Jayaweera, Norm Rubin, Antonino Tumeo, José L. Abellán, José Cano, David R. Kaeli:
AXI4MLIR: User-Driven Automatic Host Code Generation for Custom AXI-Based Accelerators. 143-157 - Mingzheng Zhu, Hao Fu, Jun Wu, Chi Zhang, Wei Xie, Xiang-Yang Li:
Ecmas: Efficient Circuit Mapping and Scheduling for Surface Code. 158-169 - Xiaofeng Guan, Hao Zhou, Guoqing Bao, Handong Li, Liang Zhu, Jianguo Yao:
PresCount: Effective Register Allocation for Bank Conflict Reduction. 170-181 - Adrián Castelló, Julian Bellavita, Grace Dinh, Yuka Ikarashi, Héctor Martínez:
Tackling the Matrix Multiplication Micro-Kernel Generation with Exo. 182-193 - Luisa Cicolini, Filippo Carloni, Marco D. Santambrogio, Davide Conficconi:
One Automaton to Rule Them All: Beyond Multiple Regular Expressions Execution. 193-206 - Ben L. Titzer:
Whose Baseline Compiler is it Anyway? 207-220 - Ruobing Han, Jisheng Zhao, Hyesoon Kim:
Enabling Fine-Grained Incremental Builds by Making Compiler Stateful. 221-232 - Alexis Engelke, Tobias Schwarz:
Compile-Time Analysis of Compiler Frameworks for Query Compilation. 233-244 - Jinku Cui, Qidong Zhao, Yueming Hao, Xu Liu:
DrPy: Pinpointing Inefficient Memory Usage in Multi-Layer Python Applications. 245-257 - Hugo Reymond, Jean-Luc Béchennec, Mikaël Briday, Sébastien Faucou, Isabelle Puaut, Erven Rohou:
SCHEMATIC: Compile-Time Checkpoint Placement and Memory Allocation for Intermittent Systems. 258-269 - Jonathan Van der Cruysse, Christophe Dubach:
Latent Idiom Recognition for a Minimalist Functional Array Language Using Equality Saturation. 270-282 - Yousun Ko, Bernd Burgstaller:
BEC: Bit-Level Static Analysis for Reliability against Soft Errors. 283-295 - Haofeng Li, Jie Lu, Haining Meng, Liqing Cao, Lian Li, Lin Gao:
Boosting the Performance of Multi-Solver IFDS Algorithms with Flow-Sensitivity Optimizations. 296-307 - Tommy McMichen, Nathan Greiner, Peter Zhong, Federico Sossai, Atmn Patel, Simone Campanoni:
Representing Data Collections in an SSA Form. 308-321 - Wenlei He, Hongtao Yu, Lei Wang, Taewook Oh:
Revamping Sampling-Based PGO with Context-Sensitivity and Pseudo-instrumentation. 322-333 - Luke Geeson, Lee Smith:
Compiler Testing with Relaxed Memory Models. 334-348 - Yuyou Fan, John Regehr:
High-Throughput, Formal-Methods-Assisted Fuzzing for LLVM. 349-358 - Théo Barollet, Christophe Guillon, Manuel Selva, François Broquedis, Florent Bouchez-Tichadou, Fabrice Rastello:
EasyTracker: A Python Library for Controlling and Inspecting Program Execution. 359-372 - Yuxin Guo, Alexandra W. Chadwick, Márton Erdos, Utpal Bora, Ilias Vougioukas, Giacomo Gabrielli, Timothy M. Jones:
OptiWISE: Combining Sampling and Instrumentation for Granular CPI Analysis. 373-385 - Qidong Zhao, Milind Chabbi, Xu Liu:
EasyView: Bringing Performance Profiles into Integrated Development Environments. 386-398 - Ettore Tiotto, Victor Perez, Whitney Tsang, Lukas Sommer, Julian Oppermann, Victor Lomüller, Mehdi Goli, James Brodman:
Experiences Building an MLIR-Based SYCL Compiler. 399-410 - Georgian-Vlad Saioc, Dmitriy Shirchenko, Milind Chabbi:
Unveiling and Vanquishing Goroutine Leaks in Enterprise Microservices: A Dynamic Analysis Approach. 411-422 - Jinhu Jiang, Chaoyi Liang, Rongchao Dong, Zhaohui Yang, Zhongjun Zhou, Wenwen Wang, Pen-Chung Yew, Weihua Zhang:
A System-Level Dynamic Binary Translator Using Automatically-Learned Translation Rules. 423-434 - Ghassan Shobaki, Pinar Muyan-Özçelik, Josh Hutton, Bruce Linck, Vladislav Malyshenko, Austin Kerbow, Ronaldo Ramirez-Ortega, Vahl Scott Gordon:
Instruction Scheduling for the GPU on the GPU. 435-447 - Qiang Fu, Thomas B. Rolinger, H. Howie Huang:
JITSPMM: Just-in-Time Instruction Generation for Accelerated Sparse Matrix-Matrix Multiplication. 448-459 - Jianhui Li, Zhennan Qin, Yijie Mei, Jingze Cui, Yunfei Song, Ciyong Chen, Yifei Zhang, Longsheng Du, Xianhang Cheng, Baihui Jin, Yan Zhang, Jason Ye, Eric Lin, Dan Lavery:
oneDNN Graph Compiler: A Hybrid Approach for High-Performance Deep Learning Compilation. 460-470
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.