default search action
SC 2015: Austin, TX, USA
- Jackie Kern, Jeffrey S. Vetter:
Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis, SC 2015, Austin, TX, USA, November 15-20, 2015. ACM 2015, ISBN 978-1-4503-3723-6
ACM Gordon Bell finalists
- Amanda Randles, Erik W. Draeger, Tomas Oppelstrup, Liam Krauss, John A. Gunnels:
Massively parallel models of the human circulatory system. 1:1-1:11 - Diego Rossinelli, Yu-Hang Tang, Kirill Lykov, Dmitry Alexeev, Massimo Bernaschi, Panagiotis E. Hadjidoukas, Mauro Bisson, Wayne Joubert, Christian Conti, George E. Karniadakis, Massimiliano Fatica, Igor Pivkin, Petros Koumoutsakos:
The in-silico lab-on-a-chip: petascale and high-throughput simulations of microfluidics at cell resolution. 2:1-2:12 - Mauro Calderara, Sascha Brück, Andreas Pedersen, Mohammad H. Bani-Hashemian, Joost VandeVondele, Mathieu Luisier:
Pushing back the limit of ab-initio quantum transport simulations on hybrid supercomputers. 3:1-3:12 - Tsuyoshi Ichimura, Kohei Fujita, Pher Errol Balde Quinay, Lalith Maddegedara, Muneo Hori, Seizo Tanaka, Yoshihisa Shizawa, Hiroshi Kobayashi, Kazuo Minami:
Implicit nonlinear wave simulation with 1.08T DOF and 0.270T unstructured finite elements to enhance comprehensive earthquake simulation. 4:1-4:12 - Johann Rudi, A. Cristiano I. Malossi, Tobin Isaac, Georg Stadler, Michael Gurnis, Peter W. J. Staar, Yves Ineichen, Costas Bekas, Alessandro Curioni, Omar Ghattas:
An extreme-scale implicit solver for complex PDEs: highly heterogeneous flow in earth's mantle. 5:1-5:12
Technical papers: data clustering
- Md. Mostofa Ali Patwary, Surendra Byna, Nadathur Rajagopalan Satish, Narayanan Sundaram, Zarija Lukic, Vadim Roytershteyn, Michael J. Anderson, Yushu Yao, Prabhat, Pradeep Dubey:
BD-CATS: big data clustering at trillion particle scale. 6:1-6:12 - Chenhan D. Yu, Jianyu Huang, Woody Austin, Bo Xiao, George Biros:
Performance optimization for the k-nearest neighbors kernel on x86 architectures. 7:1-7:12
Technical papers: applications: material science
- Martin Bauer, Johannes Hötzer, Marcus Jainta, Philipp Steinmetz, Marco Berghoff, Florian Schornbaum, Christian Godenschwager, Harald Köstler, Britta Nestler, Ulrich Rüde:
Massively parallel phase-field simulations for ternary eutectic directional solidification. 8:1-8:12 - Hongzhang Shan, Samuel Williams, Calvin W. Johnson, Kenneth S. McElvain, W. Erich Ormand:
Parallel implementation and performance optimization of the configuration-interaction method. 9:1-9:12 - Raffaele Solcà, Anton Kozhevnikov, Azzam Haidar, Stanimire Tomov, Jack J. Dongarra, Thomas C. Schulthess:
Efficient implementation of quantum materials simulations on distributed CPU-GPU systems. 10:1-10:12
Technical papers: cache and memory subsystems
- Abhisek Pan, Vijay S. Pai:
Runtime-driven shared last-level cache management for task-parallel programs. 11:1-11:12 - Jungrae Kim, Michael B. Sullivan, Seong-Lyong Gong, Mattan Erez:
Frugal ECC: efficient and versatile memory error protection through fine-grained compression. 12:1-12:12 - Malek Musleh, Vijay S. Pai:
Automatic sharing classification and timely push for cache-coherent systems. 13:1-13:12
Technical papers: applications: biophysics and genomics
- Evangelos Georganas, Aydin Buluç, Jarrod Chapman, Steven A. Hofmeyr, Chaitanya Aluru, Rob Egan, Leonid Oliker, Daniel Rokhsar, Katherine A. Yelick:
HipMer: an extreme-scale de novo genome assembler. 14:1-14:11 - Patrick Flick, Chirag Jain, Tony Pan, Srinivas Aluru:
A parallel connectivity algorithm for de Bruijn graphs in metagenomic applications. 15:1-15:11 - Patrick Flick, Srinivas Aluru:
Parallel distributed memory construction of suffix and longest common prefix arrays. 16:1-16:10
Technical papers: GPU memory management
- Ang Li, Gert-Jan van den Braak, Akash Kumar, Henk Corporaal:
Adaptive and transparent cache bypassing for GPUs. 17:1-17:12 - Jason Jong Kyu Park, Yongjun Park, Scott A. Mahlke:
ELF: maximizing memory-level parallelism for GPUs with coordinated warp and fetch scheduling. 18:1-18:12 - Tal Ben-Nun, Ely Levy, Amnon Barak, Eri Rubin:
Memory access patterns: the missing piece of the multi-GPU puzzle. 19:1-19:12
Technical papers: scalable storage systems
- Hyogi Sim, Youngjae Kim, Sudharshan S. Vazhkudai, Devesh Tiwari, Ali Anwar, Ali Raza Butt, Lavanya Ramakrishnan:
AnalyzeThis: an analysis workflow-aware storage system. 20:1-20:12 - Michael A. Sevilla, Noah Watkins, Carlos Maltzahn, Ike Nassi, Scott A. Brandt, Sage A. Weil, Greg Farnum, Sam Fineberg:
Mantle: a programmable metadata load balancer for the ceph file system. 21:1-21:12 - Yandong Wang, Li Zhang, Jian Tan, Min Li, Yuqing Gao, Xavier Guerin, Xiaoqiao Meng, Shicong Meng:
HydraDB: a resilient RDMA-driven key-value middleware for in-memory cluster computing. 22:1-22:11
Technical papers: applications: folding, imaging. and proteins
- Yida Wang, Michael J. Anderson, Jonathan D. Cohen, Alexander Heinecke, Kai Li, Nadathur Satish, Narayanan Sundaram, Nicholas B. Turk-Browne, Theodore L. Willke:
Full correlation matrix analysis of fMRI data on Intel® Xeon Phi™ coprocessors. 23:1-23:12 - William B. March, Bo Xiao, Sameer Tharakan, Chenhan D. Yu, George Biros:
A kernel-independent FMM in general dimensions. 24:1-24:12 - Andrew Schoenrock, Daniel J. Burnside, Houman Moteshareie, Alex Wong, Ashkan Golshani, Frank Dehne:
Engineering inhibitory proteins with InSiPS: the in-silico protein synthesizer. 25:1-25:11
Technical papers: graph analytics on HPC systems
- Xinyu Que, Fabio Checconi, Fabrizio Petrini, Xing Liu, Daniele Buono:
Exploring network optimizations for large-scale graph analytics. 26:1-26:10 - Seung-Hee Bae, Bill Howe:
GossipMap: a distributed community detection algorithm for billion-edge directed graphs. 27:1-27:12 - Dipanjan Sengupta, Shuaiwen Leon Song, Kapil Agarwal, Karsten Schwan:
GraphReduce: processing large-scale graphs on accelerator-based systems. 28:1-28:12
Technical papers: MPI/communication
- Akshay Venkatesh, Abhinav Vishnu, Khaled Hamidouche, Nathan R. Tallent, Dhabaleswar K. Panda, Darren J. Kerbyson, Adolfy Hoisie:
A case for application-oblivious energy-efficient MPI runtime. 29:1-29:12 - Karthikeyan Vaidyanathan, Dhiraj D. Kalamkar, Kiran Pamnany, Jeff R. Hammond, Pavan Balaji, Dipankar Das, Jongsoo Park, Bálint Joó:
Improving concurrency and asynchrony in multithreaded MPI applications using software offloading. 30:1-30:12 - Thomas Hérault, Aurélien Bouteiller, George Bosilca, Marc Gamell, Keita Teranishi, Manish Parashar, Jack J. Dongarra:
Practical scalable consensus for pseudo-synchronous distributed systems. 31:1-31:12
Technical papers: cloud resource management
- Yifan Gong, Bingsheng He, Amelie Chi Zhou:
Monetary cost optimizations for MPI-based HPC applications on Amazon clouds: checkpoints and replicated execution. 32:1-32:12 - Feng Liu, Jon B. Weissman:
Elastic job bundling: an adaptive resource request strategy for large-scale parallel applications. 33:1-33:12 - Yanfei Guo, Wesley Bland, Pavan Balaji, Xiaobo Zhou:
Fault tolerant MapReduce-MPI for HPC clusters. 34:1-34:12
Technical papers: interconnection networks
- Nan Jiang, Larry R. Dennison, William J. Dally:
Network endpoint congestion control for fine-grained communication. 35:1-35:12 - Georgios Kathareios, Cyriel Minkenberg, Bogdan Prisacari, Germán Rodríguez, Torsten Hoefler:
Cost-effective diameter-two topologies: analysis and evaluation. 36:1-36:11 - Shinobu Miwa, Hiroshi Nakamura:
Profile-based power shifting in interconnection networks with on/off links. 37:1-37:11
Technical papers: state of the practice: infrastructure management
- Devesh Tiwari, Saurabh Gupta, George Gallarno, Jim Rogers, Don Maxwell:
Reliability lessons learned from GPU experience with the Titan supercomputer at Oak Ridge leadership computing facility. 38:1-38:12 - Patricia H. Kovatch, Anthony Costa, Zachary Giles, Eugene Fluder, Hyung Min Cho, Svetlana Mazurkova:
Big omics data experience. 39:1-39:12 - Todd Gamblin, Matthew P. LeGendre, Michael R. Collette, Gregory L. Lee, Adam Moody, Bronis R. de Supinski, Scott Futral:
The Spack package manager: bringing order to HPC software chaos. 40:1-40:12
Technical papers: applications: climate and weather
- Tobias Gysi, Carlos Osuna, Oliver Fuhrer, Mauro Bianco, Thomas C. Schulthess:
STELLA: a domain-specific tool for structured grid methods in weather and climate models. 41:1-41:12 - Yong Hu, Xiaomeng Huang, Allison H. Baker, Yu-heng Tseng, Frank O. Bryan, John M. Dennis, Guangwen Yang:
Improving the scalability of the ocean barotropic solver in the community earth system model. 42:1-42:12 - Kalin Kanov, Randal C. Burns:
Particle tracking in open simulation laboratories. 43:1-43:11
Technical papers: data transfers and data-intensive applications
- Ismail Alan, Engin Arslan, Tevfik Kosar:
Energy-aware data transfer algorithms. 44:1-44:12 - Ron Chi-Lung Chiang, H. Howie Huang, Timothy Wood, Changbin Liu, Oliver Spatscheck:
IOrchestra: supporting high-performance data-intensive applications in the cloud via collaborative virtualization. 45:1-45:12 - Rajkumar Kettimuthu, Gayane Vardoyan, Gagan Agrawal, P. Sadayappan, Ian T. Foster:
An elegant sufficiency: load-aware differentiated scheduling of data transfers. 46:1-46:12
Technical papers: performance tools and models
- Xu Liu, Bo Wu:
ScaAnalyzer: a tool to identify memory scalability bottlenecks in parallel programs. 47:1-47:12 - Yuhang Liu, Xian-He Sun:
C2-bound: a capacity and concurrency driven analytical model for many-core design. 48:1-48:11 - Katherine E. Isaacs, Abhinav Bhatele, Jonathan Lifflander, David Böhme, Todd Gamblin, Martin Schulz, Bernd Hamann, Peer-Timo Bremer:
Recovering logical structure from Charm++ event traces. 49:1-49:12
Technical papers: in-situ (simulation time) analysis
- Christopher M. Sewell, Katrin Heitmann, Hal Finkel, George Zagaris, Suzanne Parete-Koon, Patricia K. Fasel, Adrian Pope, Nicholas Frontiere, Li-Ta Lo, O. E. Bronson Messer, Salman Habib, James P. Ahrens:
Large-scale compute-intensive analysis via a combined in-situ and co-scheduling workflow approach. 50:1-50:11 - Yi Wang, Gagan Agrawal, Tekin Bicer, Wei Jiang:
Smart: a MapReduce-like framework for in-situ scientific analytics. 51:1-51:12 - Preeti Malakar, Venkatram Vishwanath, Todd S. Munson, Christopher Knight, Mark Hereld, Sven Leyffer, Michael E. Papka:
Optimal scheduling of in-situ analysis for large-scale scientific simulations. 52:1-52:11
Technical papers: linear algebra
- Luc Jaulmes, Marc Casas, Miquel Moretó, Eduard Ayguadé, Jesús Labarta, Mateo Valero:
Exploiting asynchrony from exact forward recovery for DUE in iterative solvers. 53:1-53:12 - Jongsoo Park, Mikhail Smelyanskiy, Ulrike Meier Yang, Dheevatsa Mudigere, Pradeep Dubey:
High-performance algebraic multigrid solver optimized for multi-core based distributed parallel systems. 54:1-54:12 - Humayun Kabir, Joshua Dennis Booth, Guillaume Aupy, Anne Benoit, Yves Robert, Padma Raghavan:
STS-k: a multilevel sparse triangular solution scheme for NUMA multicores. 55:1-55:11
Technical papers: management of graph workloads
- Michael LeBeane, Shuang Song, Reena Panda, Jee Ho Ryoo, Lizy K. John:
Data partitioning strategies for graph workloads on heterogeneous clusters. 56:1-56:12 - Kisung Lee, Ling Liu, Karsten Schwan, Calton Pu, Qi Zhang, Yang Zhou, Emre Yigitoglu, Pingpeng Yuan:
Scaling iterative graph computations with GraphMap. 57:1-57:12 - Sungpack Hong, Siegfried Depner, Thomas Manhardt, Jan Van Der Lugt, Merijn Verstraaten, Hassan Chafi:
PGX.D: a fast distributed graph processing engine. 58:1-58:12
Technical papers: sampling in matrix computations
- Ichitaro Yamazaki, Jakub Kurzak, Piotr Luszczek, Jack J. Dongarra:
Randomized algorithms to update partial singular value decomposition on a hybrid CPU/GPU cluster. 59:1-59:12 - Théo Mary, Ichitaro Yamazaki, Jakub Kurzak, Piotr Luszczek, Stanimire Tomov, Jack J. Dongarra:
Performance of random sampling for computing low-rank approximations of a dense matrix on GPUs. 60:1-60:11
Technical papers: programming tools
- Stephen F. Siegel, Manchun Zheng, Ziqing Luo, Timothy K. Zirkel, Andre V. Marianiello, John G. Edenhofner, Matthew B. Dwyer, Michael S. Rogers:
CIVL: the concurrency intermediate verification language. 61:1-61:12 - Kento Sato, Dong H. Ahn, Ignacio Laguna, Gregory L. Lee, Martin Schulz:
Clock delta compression for scalable order-replay of non-deterministic parallel applications. 62:1-62:12 - Luiz De Rose, Andrew Gontarek, Aaron Vose, Robert Moench, David Abramson, Minh Ngoc Dinh, Chao Jin:
Relative debugging for a highly parallel hybrid computer system. 63:1-63:12
Technical papers: resource management
- Éric Gaussier, David Glesser, Valentin Reis, Denis Trystram:
Improving backfilling by using machine learning to predict running times. 64:1-64:10 - Qian Sun, Tong Jin, Melissa Romanus, Hoang Bui, Fan Zhang, Hongfeng Yu, Hemanth Kolla, Scott Klasky, Jacqueline Chen, Manish Parashar:
Adaptive data placement for staging-based coupled scientific workflows. 65:1-65:12 - Sergey Blagodurov, Alexandra Fedorova, Evgeny Vinnik, Tyler Dwyer, Fabien Hermenier:
Multi-objective job placement in clusters. 66:1-66:12
Technical papers: graph algorithms and benchmarks
- Umut A. Acar, Arthur Charguéraud, Mike Rainey:
A work-efficient algorithm for parallel unordered depth-first search. 67:1-67:12 - Hang Liu, H. Howie Huang:
Enterprise: breadth-first graph traversal on GPUs. 68:1-68:12 - Lifeng Nai, Yinglong Xia, Ilie Gabriel Tanase, Hyesoon Kim, Ching-Yung Lin:
GraphBIG: understanding graph computing in the context of industrial solutions. 69:1-69:12
Technical papers: resilience
- Marc Gamell, Keita Teranishi, Michael A. Heroux, Jackson R. Mayo, Hemanth Kolla, Jacqueline Chen, Manish Parashar:
Local recovery and failure masking for stencil-based applications at extreme scales. 70:1-70:12 - Antonio J. Peña, Wesley Bland, Pavan Balaji:
VOCL-FT: introducing techniques for efficient soft error coprocessor recovery. 71:1-71:12 - Rizwan A. Ashraf, Roberto Gioiosa, Gokcen Kestor, Ronald F. DeMara, Chen-Yong Cher, Pradip Bose:
Understanding the propagation of transient errors in HPC applications. 72:1-72:12
Technical papers: state of the practice: measuring systems
- Torsten Hoefler, Roberto Belli:
Scientific benchmarking of parallel computing systems: twelve ways to tell the masses when reporting performance results. 73:1-73:12 - Thomas Scogland, Jonathan Azose, David Rohr, Suzanne Rivoire, Natalie J. Bates, Daniel Hackenberg:
Node variability in large-scale power measurements: perspectives from the Green500, Top500 and EEHPCWG. 74:1-74:11 - Lipeng Wan, Feiyi Wang, Sarp Oral, Devesh Tiwari, Sudharshan S. Vazhkudai, Qing Cao:
A practical approach to reconciling availability, performance, and capacity in provisioning extreme-scale storage systems. 75:1-75:12
Technical papers: tensor computation
- Jiajia Li, Casey Battaglino, Ioakeim Perros, Jimeng Sun, Richard W. Vuduc:
An input-adaptive and in-place approach to dense tensor-times-matrix multiply. 76:1-76:12 - Oguz Kaya, Bora Uçar:
Scalable sparse tensor decompositions in distributed memory systems. 77:1-77:11
Technical papers: power-constrained computing
- Yuichi Inadomi, Tapasya Patki, Koji Inoue, Mutsumi Aoyagi, Barry Rountree, Martin Schulz, David K. Lowenthal, Yasutaka Wada, Keiichiro Fukazawa, Masatsugu Ueda, Masaaki Kondo, Ikuo Miyoshi:
Analyzing and mitigating the impact of manufacturing variability in power-constrained supercomputing. 78:1-78:12 - Peter E. Bailey, Aniruddha Marathe, David K. Lowenthal, Barry Rountree, Martin Schulz:
Finding the limits of power-constrained application performance. 79:1-79:12 - Daniel A. Ellsworth, Allen D. Malony, Barry Rountree, Martin Schulz:
Dynamic power sharing for higher job throughput. 80:1-80:11
Technical papers: programming systems
- Elliott Slaughter, Wonchan Lee, Sean Treichler, Michael Bauer, Alex Aiken:
Regent: a high-productivity programming language for HPC with logical regions. 81:1-81:12 - Junghyun Kim, Thanh Tuan Dao, Jaehoon Jung, Jinyoung Joo, Jaejin Lee:
Bridging OpenCL and CUDA: a comparative analysis and translation. 82:1-82:12 - Shaizeen Aga, Sriram Krishnamoorthy, Satish Narayanasamy:
CilkSpec: optimistic concurrency for Cilk. 83:1-83:12
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.