default search action
32nd HPDC 2023: Minneapolis, MN, USA
- Ali Raza Butt, Ningfang Mi, Kyle Chard:
Proceedings of the 32nd International Symposium on High-Performance Parallel and Distributed Computing, HPDC 2023, Orlando, FL, USA, June 16-23, 2023. ACM 2023
HPDC Achievement Award
- Manish Parashar:
Computing Everywhere, All at Once: Harnessing the Computing Continuum for Science. 1
Session: Machine Learning and HPC
- Baolin Li, Siddharth Samsi, Vijay Gadepally, Devesh Tiwari:
Kairos: Building Cost-Efficient Machine Learning Inference Systems with Heterogeneous Cloud Resources. 3-16 - Yaqi Xia, Zheng Zhang, Hulin Wang, Donglin Yang, Xiaobo Zhou, Dazhao Cheng:
Redundancy-Free High-Performance Dynamic GNN Training with Hierarchical Pipeline Parallelism. 17-30 - Wenqian Dong, Gokcen Kestor, Dong Li:
Auto-HPCnet: An Automatic Framework to Build Neural Network-based Surrogate for High-Performance Computing Applications. 31-44 - Akash Dutta, Jordi Alcaraz, Ali TehraniJamsaz, Eduardo César, Anna Sikora, Ali Jannesari:
Performance Optimization using Multimodal Modeling and Heterogeneous GNN. 45-57
Session: Fault Tolerance, Reliability, and Availability
- Xinyi Li, Ignacio Laguna, Bo Fang, Katarzyna Swirydowicz, Ang Li, Ganesh Gopalakrishnan:
Design and Evaluation of GPU-FPX: A Low-Overhead tool for Floating-Point Exception Detection in NVIDIA GPUs. 59-71 - Avinash Maurya, M. Mustafa Rafique, Thierry Tonellot, Hussain J. AlSalem, Franck Cappello, Bogdan Nicolae:
GPU-Enabled Asynchronous Multi-level Checkpoint Caching and Prefetching. 73-85 - Lipeng Wan, Jieyang Chen, Xin Liang, Ana Gainaru, Qian Gong, Qing Liu, Ben Whitney, Joy Arulraj, Zhengchun Liu, Ian T. Foster, Scott Klasky:
RAPIDS: Reconciling Availability, Accuracy, and Performance in Managing Geo-Distributed Scientific Data. 87-100
Session: Algorithms and Accelerators
- Valentin Le Fèvre, Marc Casas:
Efficient Execution of SpGEMM on Long Vector Architectures. 101-113 - Genshen Chu, Yuanjie He, Lingyu Dong, Zhezhao Ding, Dandan Chen, He Bai, Xuesong Wang, Changjun Hu:
Efficient Algorithm Design of Optimizing SpMV on GPU. 115-128 - Boyuan Zhang, Jiannan Tian, Sheng Di, Xiaodong Yu, Yunhe Feng, Xin Liang, Dingwen Tao, Franck Cappello:
FZ-GPU: A Fast and High-Ratio Lossy Compressor for Scientific Computing Applications on GPUs. 129-142
Session: Data, I/O, and Memory Management
- Ruicheng Liu, Peiquan Jin, Xiaoliang Wang, Yongping Luo, Zhaole Chu, Yigui Yuan:
Closing the Performance Gap between Leveling and Tiering Compaction via Bundle Compaction. 143-154 - Bin Dong, Jean Luca Bez, Suren Byna:
AIIO: Using Artificial Intelligence for Job-Level and Automatic I/O Performance Bottleneck Diagnosis. 155-167 - Junxian Zhao, Xiaobo Zhou, Sang-Yoon Chang, Chengzhong Xu:
Let It Go: Relieving Garbage Collection Pain for Latency Critical Applications in Golang. 169-180
Session: Serverless Computing
- Hanfei Yu, Christian Fontenot, Hao Wang, Jian Li, Xu Yuan, Seung-Jong Park:
Libra: Harvesting Idle Resources Safely and Timely in Serverless Clusters. 181-194 - Dimitra Giantsidi, Emmanouil Giortamis, Nathaniel Tornow, Florin Dinu, Pramod Bhatotia:
FlexLog: A Shared Log for Stateful Serverless Computing. 195-209 - Rohan Basu Roy, Tirthak Patel, Richmond Liew, Yadu Nand Babuji, Ryan Chard, Devesh Tiwari:
ProPack: Executing Concurrent Serverless Functions Faster and Cheaper. 211-224
Session: Graphs and Scheduling
- Tiziano De Matteis, Lukas Gianinazzi, Johannes de Fine Licht, Torsten Hoefler:
Streaming Task Graph Scheduling for Dataflow Architectures. 225-237 - Scott Sallinen, Juntong Luo, Matei Ripeanu:
Real-Time PageRank on Dynamic Graphs. 239-251
Session: Open Source Tools and Data
- Danielle Movsowitz-Davidow, Orna Agmon Ben-Yehuda, Orr Dunkelman:
Deconstructing Alibaba Cloud's Preemptible Instance Pricing. 253-265 - Alexander Fuerst, Abdul Rehman, Prateek Sharma:
Ilúvatar: A Fast Control Plane for Serverless Computing. 267-280 - Stephanie Brink, Michael McKinsey, David Böhme, Connor Scully-Allison, Ian Lumsden, W. Daryl Hawkins, Treece Burgess, Vanessa Lama, Jakob Lüttgau, Katherine E. Isaacs, Michela Taufer, Olga Pearce:
Thicket: Seeing the Performance Experiment Forest for the Individual Run Trees. 281-293 - Maximilian Knespel, Holger Brunst:
Rapidgzip: Parallel Decompression and Seeking in Gzip Files Using Cache Prefetching. 295-307
Poster Session
- Jaiaid Mobin, Avinash Maurya, M. Mustafa Rafique:
COLTI: Towards Concurrent and Co-located DNN Training and Inference. 309-310 - Camila Roa, Paula Olaya, Ricardo M. Llamas, Rodrigo Vargas, Michela Taufer:
GEOtiled: A Scalable Workflow for Generating Large Datasets of High-Resolution Terrain Parameters. 311-312 - Qingsheng Zhang, Chen Liang:
Distributed Logical Timestamp Allocation for DBMS Concurrency Control on Many-core Machines. 313-314 - Chris Egersdoerfer, Di Zhang, Dong Dai:
Early Exploration of Using ChatGPT for Log-based Anomaly Detection on Parallel File Systems Logs. 315-316 - Linsheng He, Jiamiao Zhao, Fei Hu:
Distributed Multi-agent Reinforcement Learning for Directional UAV Network Control. 317-318 - Manoj Pravakar Saha, Bryan S. Kim, Haryadi S. Gunawi, Janki Bhimani:
RHIK: Re-configurable Hash-based Indexing for KVSSD. 319-320 - Adnan Maruf, Daniel Carlson, Ashikee Ghosh, Manoj Pravakar Saha, Janki Bhimani, Raju Rangaswami:
Allocation Policies Matter for Hybrid Memory Systems. 321-322 - Shixun Wu, Yujia Zhai, Jiajun Huang, Zizhe Jian, Zizhong Chen:
FT-GEMM: A Fault Tolerant High Performance GEMM Implementation on x86 CPUs. 323-324 - Jakob Lüttgau, Heberth F. Martinez, Glenn Tarcea, Giorgio Scorzelli, Valerio Pascucci, Michela Taufer:
Studying Latency and Throughput Constraints for Geo-Distributed Data in the National Science Data Fabric. 325-326 - Manoj Pravakar Saha, Omkar Desai, Bryan S. Kim, Janki Bhimani:
Leveraging Keys In Key-Value SSD for Production Workloads. 327-328 - Zhuo Tian, Shuai Yang, Changyou Zhang:
Accelerating Sparse General Matrix-Matrix Multiplication for NVIDIA Volta GPU and Hygon DCU. 329-330 - David Simonetti, Ben Tovar, Douglas Thain:
Mixed Modality Workflows in TaskVine. 331-332 - Jiajun Huang, Kaiming Ouyang, Yujia Zhai, Jinyang Liu, Min Si, Ken Raffenetti, Hui Zhou, Atsushi Hori, Zizhong Chen, Yanfei Guo, Rajeev Thakur:
Accelerating MPI Collectives with Process-in-Process-based Multi-object Techniques. 333-334 - Chen Tang, Zhaole Chu, Peiquan Jin, Yongping Luo, Kuankuan Guo:
HM2: Efficient Host Memory Management for RDMA-Enabled Distributed Systems. 335-336
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.