default search action
ICPP 2020: Edmonton, AB, Canada
- José Nelson Amaral, Lizy Kurian John, Xipeng Shen:
ICPP 2020: 49th International Conference on Parallel Processing, Edmonton, AB, Canada, August 17-20, 2020. ACM 2020, ISBN 978-1-4503-8816-0
Best-Paper Candidates
- Naoya Yamamoto, Koji Nakano, Yasuaki Ito, Daisuke Takafuji, Akihiko Kasagi, Tsuguchika Tabaru:
Huffman Coding with Gap Arrays for GPU Acceleration. 1:1-1:11 - Jiya Su, Feng Zhang, Weifeng Liu, Bingsheng He, Ruofan Wu, Xiaoyong Du, Rujia Wang:
CapelliniSpTRSV: A Thread-Level Synchronization-Free Sparse Triangular Solve on GPUs. 2:1-2:11 - Jianting Zhang, Zicong Hong, Xiaoyu Qiu, Yufeng Zhan, Song Guo, Wuhui Chen:
SkyChain: A Deep Reinforcement Learning-Empowered Dynamic Blockchain Sharding System. 3:1-3:11 - Taha Atahan Akyildiz, Amro Alabsi Aljundi, Kamer Kaya:
GOSH: Embedding Big Graphs on Small Hardware. 4:1-4:11
Distributed Systems
- Shangming Cai, Dongsheng Wang, Zhanye Wang, Haixia Wang:
CARD: A Congestion-Aware Request Dispatching Scheme for Replicated Metadata Server Cluster. 5:1-5:11 - Chris Kjellqvist, Mohammad Hedayati, Michael L. Scott:
Safe, Fast Sharing of memcached as a Protected Library. 6:1-6:8 - Ziyi Zhao, Zhang Jiang, Ximing Liu, Xiaoli Gong, Wenwen Wang, Pen-Chung Yew:
DQEMU: A Scalable Emulator with Retargetable DBT on Distributed Platforms. 7:1-7:11
Edge Learning and Inference
- Jae-Won Chung, Jae-Yun Kim, Soo-Mook Moon:
ShadowTutor: Distributed Partial Distillation for Mobile Video DNN Inference. 8:1-8:11 - Yeting Guo, Fang Liu, Zhiping Cai, Li Chen, Nong Xiao:
FEEL: A Federated Edge Learning System for Efficient and Privacy-Preserving Mobile Healthcare. 9:1-9:11 - Sai Qian Zhang, Jieyu Lin, Qi Zhang:
Adaptive Distributed Convolutional Neural Network Inference at the Network Edge with ADCNN. 10:1-10:11
Memory Systems
- Jianming Huang, Yu Hua, Pengfei Zuo, Wen Zhou, Fangting Huang:
An Efficient Wear-level Architecture using Self-adaptive Wear Leveling. 11:1-11:11 - Xueliang Wei, Dan Feng, Wei Tong, Jingning Liu, Chengning Wang, Liuqing Ye:
CCHL: Compression-Consolidation Hardware Logging for Efficient Failure-Atomic Persistent Memory Updates. 12:1-12:11 - Shanjiang Tang, Qifei Chai, Ce Yu, Yusen Li, Chao Sun:
Balancing Fairness and Efficiency for Cache Sharing in Semi-external Memory System. 13:1-13:11
Fault-Tolerance
- Carlos Pachajoa, Christina Pacher, Markus Levonyak, Wilfried N. Gansterer:
Algorithm-Based Checkpoint-Recovery for the Conjugate Gradient Method. 14:1-14:11 - Yishu Du, Loris Marchal, Guillaume Pallez Aupy, Yves Robert:
Robustness of the Young/Daly formula for stochastic iterative applications. 15:1-15:11 - Li Han, Yiqin Gao, Jing Liu, Yves Robert, Frédéric Vivien:
Energy-aware strategies for reliability-oriented real-time task allocation on heterogeneous platforms. 16:1-16:11
Scheduling and Placement in Networks
- Chi Lin, Ziwei Yang, Yu Sun, Jing Deng, Lei Wang, Guowei Wu:
Cooperative Game for Multiple Chargers with Dynamic Network Topology. 17:1-17:10 - Yang Chen, Jie Wu, Bo Ji:
Optimizing Flow Bandwidth Consumption with Traffic-diminishing Middlebox Placement. 18:1-18:10 - Yang Shi, Mei Wen, Chunyuan Zhang:
Towards High-Efficiency Data Centers via Job-Aware Network Scheduling. 19:1-19:10
Systems for Machine Learning
- Lipeng Wang, Songgao Ye, Baichen Yang, Youyou Lu, Hequan Zhang, Shengen Yan, Qiong Luo:
DIESEL: A Dataset-Based Distributed Storage and Caching System for Large-Scale Deep Learning Training. 20:1-20:11 - Abeda Sultana, Li Chen, Fei Xu, Xu Yuan:
E-LAS: Design and Analysis of Completion-Time Agnostic Scheduling for Distributed Deep Learning Cluster. 21:1-21:11 - Zheng Chen, Feng Zhang, Amelie Chi Zhou, Jidong Zhai, Chenyang Zhang, Xiaoyong Du:
ParSecureML: An Efficient Parallel Secure Machine Learning Framework on GPUs. 22:1-22:11
Graph Processing and Concurrent Data Structures
- Somesh Singh, Rupesh Nasre:
Graffix: Efficient Graph Processing with a Tinge of GPU-Specific Approximations. 23:1-23:11 - Matthew Rodriguez, Michael F. Spear:
Optimizing Linearizable Bulk Operations on Data Structures. 24:1-24:10 - Feng Sheng, Qiang Cao, Hong Jiang, Jie Yao:
GraBi: Communication-Efficient and Workload-Balanced Partitioning for Bipartite Graphs. 25:1-25:11
Large-Scale Applications on Supercomputers
- Xinyuan Li, Huang Ye, Jian Zhang:
Large-scale Simulations of Peridynamics on Sunway Taihulight Supercomputer. 26:1-26:11 - Sudip K. Seal, Seung-Hwan Lim, Dali Wang, Jacob D. Hinkle, Dalton D. Lunga, Aristeidis Tsaris:
Toward Large-Scale Image Segmentation on Summit. 27:1-27:11 - Kai Xu, Xiaohui Duan, Xiangxu Meng, Xin Li, Bertil Schmidt, Weiguo Liu:
SWMapper: Scalable Read Mapper on SunWay TaihuLight. 28:1-28:10
Machine Learning for Computing
- Xueying Zhang, Ruiting Zhou, Zhi Zhou, John C. S. Lui, Zongpeng Li:
An Online Learning-Based Task Offloading Framework for 5G Small Cell Networks. 29:1-29:11 - Haoyu Wang, Haiying Shen, Qi Liu, Kevin Zheng, Jie Xu:
A Reinforcement Learning Based System for Minimizing Cloud Storage Service Cost. 30:1-30:10 - Zixia Liu, Liqiang Wang, Gang Quan:
Deep Reinforcement Learning based Elasticity-compatible Heterogeneous Resource Management for Time-critical Computing. 31:1-31:11
Performance Tools and Methodology
- Girish Mururu, Kaushik Ravichandran, Ada Gavrilovska, Santosh Pande:
Generating Robust Parallel Programs via Model Driven Prediction of Compiler Optimizations for Non-determinism. 32:1-32:12 - Wei Liu, Yifan Gong, Hao Wu, Jidong Zhai, Jiangming Jin:
Memory-Centric Communication Mechanism for Real-time Autonomous Navigation Applications. 33:1-33:11 - Christian Helm, Kenjiro Taura:
Automatic Identification and Precise Attribution of DRAM Bandwidth Contention. 34:1-34:11
Storage Reliability & Memory Security
- Zizhong Wang, Haixia Wang, Airan Shao, Dongsheng Wang:
An Adaptive Erasure-Coded Storage Scheme with an Efficient Code-Switching Algorithm. 35:1-35:11 - Kartik Ramkrishnan, Stephen McCamant, Pen-Chung Yew, Antonia Zhai:
First Time Miss : Low Overhead Mitigation for Shared Memory Cache Side Channels. 36:1-36:11 - Tong Liu, Shakeel Alibhai, Xubin He:
A Rack-Aware Pipeline Repair Scheme for Erasure-Coded Distributed Storage Systems. 37:1-37:11
Supporting Efficient Machine Learning
- Qingchang Han, Yongmin Hu, Fengwei Yu, Hailong Yang, Bing Liu, Peng Hu, Ruihao Gong, Yanfei Wang, Rui Wang, Zhongzhi Luan, Depei Qian:
Extremely Low-bit Convolution Optimization for Quantized Neural Network on Modern Computer Architectures. 38:1-38:12 - Jan Hückelheim, Michel Schanen, Sri Hari Krishna Narayanan, Paul D. Hovland:
Vector Forward Mode Automatic Differentiation on SIMD/SIMT architectures. 39:1-39:11 - Zhenbo Hu, Xiangyu Zou, Wen Xia, Sian Jin, Dingwen Tao, Yang Liu, Weizhe Zhang, Zheng Zhang:
Delta-DNN: Efficiently Compressing Deep Neural Networks via Exploiting Floats Similarity. 40:1-40:12
Data Center Networking
- Jinbin Hu, Jiawei Huang, Zhaoyi Li, Jianxin Wang, Tian He:
AMRT: Anti-ECN Marking to Improve Utilization of Receiver-driven Transmission in Data Center. 41:1-41:10 - Wanchun Jiang, Kaiqin Liao, Yulong Yan, Jianxin Wang:
PS: Periodic Strategy for the 40-100Gbps Energy Efficient Ethernet. 42:1-42:10 - Chang Ruan, Jianxin Wang, Wanchun Jiang, Tao Zhang:
Polo: Receiver-Driven Congestion Control for Low Latency over Commodity Network Fabric. 43:1-43:10
Parallel Algorithms I
- Jesmin Jahan Tithi, Andrzej Stasiak, Sriram Aananthakrishnan, Fabrizio Petrini:
Prune the Unnecessary: Parallel Pull-Push Louvain Algorithms with Automatic Edge Pruning. 44:1-44:11 - Ashirbad Mishra, Shad Kirmani, Kamesh Madduri:
Fast Spectral Graph Layout on Multicore Platforms. 45:1-45:11 - Tarequl Islam Sifat, Nirmal Prajapati, Sanjay V. Rajopadhye:
Revisiting Sparse Dynamic Programming for the 0/1 Knapsack Problem. 46:1-46:10
Parallel and Distributed Machine Learning
- Junyu Li, Ligang He, Shenyuan Ren, Rui Mao:
Developing a Loss Prediction-based Asynchronous Stochastic Gradient Descent Algorithm for Distributed Training of Deep Neural Networks. 47:1-47:10 - Canh T. Dinh, Nguyen Hoang Tran, Tuan Dung Nguyen, Wei Bao, Albert Y. Zomaya, Bing Bing Zhou:
Federated Learning with Proximal Stochastic Variance Reduced Gradient Algorithms. 48:1-48:11 - Zijie Yan, Danyang Xiao, Mengqiang Chen, Jieying Zhou, Weigang Wu:
Dual-Way Gradient Sparsification for Asynchronous Distributed Deep Learning. 49:1-49:10
Heterogeneous Systems
- Matthew Agostini, Francis O'Brien, Tarek S. Abdelrahman:
Balancing Graph Processing Workloads Using Work Stealing on Heterogeneous CPU-FPGA Systems. 50:1-50:12 - Juan Carlos Saez, Fernando Castro, Manuel Prieto-Matías:
Enabling performance portability of data-parallel OpenMP applications on asymmetric multicore processors. 51:1-51:11 - Pengfei Zou, Ang Li, Kevin J. Barker, Rong Ge:
Detecting Anomalous Computation with RNNs on GPU-Accelerated HPC Machines. 52:1-52:11
Performance Evaluation and Characterization
- Adrian Munera, Sara Royuela, Germán Llort, Estanislao Mercadal, Franck Wartel, Eduardo Quiñones:
Experiences on the characterization of parallel applications in embedded systems with Extrae/Paraver. 53:1-53:11 - Pablo Prieto, Pablo Abad Fidalgo, Jose Angel Herrero, José-Ángel Gregorio, Valentin Puente:
SPECcast: A Methodology for Fast Performance Evaluation with SPEC CPU 2017 Multiprogrammed Workloads. 54:1-54:11 - Davood Ghatreh Samani, Chavit Denninnart, Josef Bacik, Mohsen Amini Salehi:
The Art of CPU-Pinning: Evaluating and Improving the Performance of Virtualization and Containerization Platforms. 55:1-55:11
Routing and Mapping in Networks
- Hongyun Gao, Laiping Zhao, Huanbin Wang, Zhao Tian, Lihai Nie, Keqiu Li:
XShot: Light-weight Link Failure Localization using Crossed Probing Cycles in SDN. 56:1-56:11 - Felix Zahn, Holger Fröning:
On Network Locality in MPI-Based HPC Applications. 57:1-57:10 - Bo He, Jingyu Wang, Qi Qi, Haifeng Sun, Zirui Zhuang, Cong Liu, Jianxin Liao:
DeepHop on Edge: Hop-by-hop Routing byDistributed Learning with Semantic Attention. 58:1-58:11
Microarchitecture and Power Management
- Alexandra Angerd, Erik Sintorn, Per Stenström:
A GPU Register File using Static Data Compression. 59:1-59:10 - Kramer Straube, Jason Lowe-Power, Christopher Nitta, Matthew K. Farrens, Venkatesh Akella:
HCAPP: Scalable Power Control for Heterogeneous 2.5D Integrated Systems. 60:1-60:11 - Jiaxin Peng, Yousra Al-Kabani, Shuai Sun, Volker J. Sorger, Tarek A. El-Ghazawi:
DNNARA: A Deep Neural Network Accelerator using Residue Arithmetic and Integrated Photonics. 61:1-61:11
Parallel Algorithms II
- Ryota Yasudo, Koji Nakano, Yasuaki Ito, Masaru Tatekawa, Ryota Katsuki, Takashi Yazane, Yoko Inaba:
Adaptive Bulk Search: Solving Quadratic Unconstrained Binary Optimization Problems on Multiple GPUs. 62:1-62:11 - Zhengyang Lu, Yuyao Niu, Weifeng Liu:
Efficient Block Algorithms for Parallel Sparse Triangular Solve. 63:1-63:11 - Shouxi Luo, Pingzhi Fan, Huanlai Xing, Hongfang Yu:
Selective Coflow Completion for Time-sensitive Distributed Applications with Poco. 64:1-64:10
Resource Management on the Cloud
- Kaiyue Duan, Yusen Li, Trent G. Marbach, Gang Wang, Xiaoguang Liu:
Improving Load Balance via Resource Exchange in Large-Scale Search Engines. 65:1-65:11 - Iryanto Jaya, Wentong Cai, Yusen Li:
Rendering Server Allocation for MMORPG Players in Cloud Gaming. 66:1-66:11 - Zhuozhao Li, Tanmoy Sen, Haiying Shen, Mooi Choo Chuah:
Impact of Memory DoS Attacks on Cloud Applications and Real-Time Detection Schemes. 67:1-67:11
GPU-Accelerated Applications
- David B. Williams-Young, Chao Yang:
Parallel Shift-Invert Spectrum Slicing on Distributed Architectures with GPU Accelerators. 68:1-68:11 - Martin Krulis, Miroslav Kratochvíl:
Detailed Analysis and Optimization of CUDA K-means Algorithm. 69:1-69:11 - Ichitaro Yamazaki, Sivasankaran Rajamanickam, Nathan D. Ellingwood:
Performance Portable Supernode-based Sparse Triangular Solver for Manycore Architectures. 70:1-70:11
Data Centers and the Edge
- Xiaoqing Cai, Jiuchen Shi, Rui Yuan, Chang Liu, Wenli Zheng, Quan Chen, Chao Li, Jingwen Leng, Minyi Guo:
OVERSEE: Outsourcing Verification to Enable Resource Sharing in Edge Environment. 71:1-71:11 - Ahmed Mohamed Abdelmoniem, Hengky Susanto, Brahim Bensaou:
Reducing Latency in Multi-Tenant Data Centers via Cautious Congestion Watch. 72:1-72:11 - Wei Zhang, Ningxin Zheng, Quan Chen, Yong Yang, Zhuo Song, Tao Ma, Jingwen Leng, Minyi Guo:
URSA: Precise Capacity Planning and Fair Scheduling based on Low-level Statistics for Public Clouds. 73:1-73:11 - Weifa Liang, Yu Ma, Wenzheng Xu, Xiaohua Jia, Sid Chi-Kin Chau:
Reliability Augmentation of Requests with Service Function Chain Requirements in Mobile Edge-Cloud Networks. 74:1-74:11
Storage and I/O Optimization
- Yuchen Cheng, Chunghsuan Wu, Yanqiang Liu, Rui Ren, Hong Xu, Bin Yang, Zhengwei Qi:
OPS: Optimized Shuffle Management System for Apache Spark. 75:1-75:11 - Fan Deng, Qiang Cao, Shucheng Wang, Shuyang Liu, Jie Yao, Yuanyuan Dong, Puyuan Yang:
SeRW: Adaptively Separating Read and Write upon SSDs of Hybrid Storage Server in Clouds. 76:1-76:11 - Vinay Devadas, Matthew Curtis-Maury:
Scalable Coordination of Hierarchical Parallelism. 77:1-77:11 - Yu Chen, Wei Tong, Dan Feng, Zike Wang:
Mass: Workload-Aware Storage Policy for OpenStack Swift. 78:1-78:11
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.