default search action
45th ISCA 2018: Los Angeles, CA, USA
- Murali Annavaram, Timothy Mark Pinkston, Babak Falsafi:
45th ACM/IEEE Annual International Symposium on Computer Architecture, ISCA 2018, Los Angeles, CA, USA, June 1-6, 2018. IEEE Computer Society 2018, ISBN 978-1-5386-5984-7
Session 1A: Clouds & Datacenters
- Jeremy Fowers, Kalin Ovtcharov, Michael Papamichael, Todd Massengill, Ming Liu, Daniel Lo, Shlomi Alkalay, Michael Haselman, Logan Adams, Mahdi Ghandi, Stephen Heil, Prerak Patel, Adam Sapek, Gabriel Weisz, Lisa Woods, Sitaram Lanka, Steven K. Reinhardt, Adrian M. Caulfield, Eric S. Chung, Doug Burger:
A Configurable Cloud-Scale DNN Processor for Real-Time AI. 1-14 - Matt Skach, Manish Arora, Dean M. Tullsen, Lingjia Tang, Jason Mars:
Virtual Melting Temperature: Managing Server Load to Minimize Cooling Overhead with Phase Change Materials. 15-28 - Sagar Karandikar, Howard Mao, Donggyu Kim, David Biancolin, Alon Amid, Dayeol Lee, Nathan Pemberton, Emmanuel Amaro, Colin Schmidt, Aditya Chopra, Qijing Huang, Kyle Kovacs, Borivoje Nikolic, Randy H. Katz, Jonathan Bachrach, Krste Asanovic:
FireSim: FPGA-Accelerated Cycle-Exact Scale-Out System Simulation in the Public Cloud. 29-42
Session 1B: Accelerators for Emerging Apps
- Prakalp Srivastava, Mingu Kang, Sujan K. Gonugondla, Sungmin Lim, Jungwook Choi, Vikram S. Adve, Nam Sung Kim, Naresh R. Shanbhag:
PROMISE: An End-to-End Design of a Programmable Mixed-Signal Accelerator for Machine-Learning Algorithms. 43-56 - Marc Riera, José-María Arnau, Antonio González:
Computation Reuse in DNNs by Exploiting Input Similarity. 57-68 - Daichi Fujiki, Arun Subramaniyan, Tianjun Zhang, Yu Zeng, Reetuparna Das, David T. Blaauw, Satish Narayanasamy:
GenAx: A Genome Sequencing Accelerator. 69-82
Session 2A: Prefetching
- Sushant Kondguli, Michael C. Huang:
Division of Labor: A More Effective Approach to Prefetching. 83-95 - Anant Nori, Jayesh Gaur, Siddharth Rai, Sreenivas Subramoney, Hong Wang:
Criticality Aware Tiered Cache Hierarchy: A Fundamental Relook at Multi-Level Cache Hierarchies. 96-109 - Akanksha Jain, Calvin Lin:
Rethinking Belady's Algorithm to Accommodate Prefetching. 110-123
Session 2B: Languages & Models
- Sizhuo Zhang, Muralidaran Vijayaraghavan, Andrew Wright, Mehdi Alipour, Arvind:
Constructing a Weak Memory Model. 124-137 - Martin Maas, Krste Asanovic, John Kubiatowicz:
A Hardware Accelerator for Tracing Garbage Collection. 138-151 - Weilong Cui, Yongshan Ding, Deeksha Dangwal, Adam Holmes, Joseph McMahan, Ali JavadiAbhari, Georgios Tzimpragos, Frederic T. Chong, Timothy Sherwood:
Charm: A Language for Closed-Form High-Level Architecture Modeling. 152-165
Session 3A: Virtual Memory
- Yuxi Liu, Xia Zhao, Magnus Jahre, Zhenlin Wang, Xiaolin Wang, Yingwei Luo, Lieven Eeckhout:
Get Out of the Valley: Power-Efficient Address Mapping for GPUs. 166-179 - Seunghee Shin, Guilherme Cox, Mark Oskin, Gabriel H. Loh, Yan Solihin, Abhishek Bhattacharjee, Arkaprava Basu:
Scheduling Page Table Walks for Irregular GPU Applications. 180-192 - Mayank Parasar, Abhishek Bhattacharjee, Tushar Krishna:
SEESAW: Using Superpages to Improve VIPT Caches. 193-206 - Nandita Vijaykumar, Abhilasha Jain, Diptesh Majumdar, Kevin Hsieh, Gennady Pekhimenko, Eiman Ebrahimi, Nastaran Hajinazar, Phillip B. Gibbons, Onur Mutlu:
A Case for Richer Cross-Layer Abstractions: Bridging the Semantic Gap with Expressive Memory. 207-220
Session 3B: Coherence & Memory Ordering
- Alberto Ros, Stefanos Kaxiras:
Non-Speculative Store Coalescing in Total Store Order. 221-234 - Zhaoxiang Jin, Soner Önder:
Dynamic Memory Dependence Predication. 235-246 - Nicolai Oswald, Vijay Nagarajan, Daniel J. Sorin:
ProtoGen: Automatically Generating Directory Cache Coherence Protocols from Atomic Specifications. 247-260 - Johnathan Alsop, Matthew D. Sinclair, Sarita V. Adve:
Spandex: A Flexible Interface for Efficient Heterogeneous Coherence. 261-274
Session 4A: Emerging Paradigms
- Dayeol Lee, Gwangmu Lee, Dongup Kwon, Sunghwa Lee, Youngsok Kim, Jangwoo Kim:
Flexon: A Flexible Digital Neuron for Efficient Spiking Neural Network Simulations. 275-288 - James E. Smith:
Space-Time Algebra: A Model for Neocortical Computation. 289-300 - Xiangyu Zhang, Ramin Bashizade, Craig LaBoda, Chris Dwyer, Alvin R. Lebeck:
Architecting a Stochastic Computing Unit with Molecular Optical Devices. 301-314
Session 4B: Persistence
- Kunal Korgaonkar, Ishwar Bhati, Huichu Liu, Jayesh Gaur, Sasikanth Manipatruni, Sreenivas Subramoney, Tanay Karnik, Steven Swanson, Ian Young, Hong Wang:
Density Tradeoffs of Non-Volatile Memory as a Replacement for SRAM Based Last Level Cache. 315-327 - Vinson Young, Chia-Chen Chou, Aamer Jaleel, Moinuddin K. Qureshi:
ACCORD: Enabling Associativity for Gigascale DRAM Caches by Coordinating Way-Install and Way-Prediction. 328-339 - Fengbin Tu, Weiwei Wu, Shouyi Yin, Leibo Liu, Shaojun Wei:
RANA: Towards Efficient Neural Acceleration with Refresh-Optimized Embedded DRAM. 340-352
Session 5A: Emerging Memory 1
- Adi Fuchs, David Wentzlaff:
Scaling Datacenter Accelerators with Compute-Reuse Architectures. 353-366 - Ben Feinberg, Uday Kumar Reddy Vengalam, Nathan Whitehair, Shibo Wang, Engin Ipek:
Enabling Scientific Computing on Memristive Accelerators. 367-382 - Charles Eckert, Xiaowei Wang, Jingcheng Wang, Arun Subramaniyan, Ravi R. Iyer, Dennis Sylvester, David T. Blaauw, Reetuparna Das:
Neural Cache: Bit-Serial In-Cache Acceleration of Deep Neural Networks. 383-396
Session 5B: Storage
- Arash Tavakkol, Mohammad Sadrosadati, Saugata Ghose, Jeremie S. Kim, Yixin Luo, Yaohua Wang, Nika Mansouri-Ghiasi, Lois Orosa, Juan Gómez-Luna, Onur Mutlu:
FLIN: Enabling Fairness and Enhancing Performance in Modern NVMe Solid State Drives. 397-410 - Sang Woo Jun, Andy Wright, Sizhuo Zhang, Shuotao Xu, Arvind:
GraFBoost: Using Accelerated Flash Storage for External Graph Analytics. 411-424 - Duck-Ho Bae, Insoon Jo, Youra Choi, Joo Young Hwang, Sangyeun Cho, Daniel D. G. Lee, Jaeheon Jeong:
2B-SSD: The Case for Dual, Byte- and Block-Addressable Solid-State Drives. 425-438
Session 6A: Emerging Memory 2
- Mohammad A. Alshboul, James Tuck, Yan Solihin:
Lazy Persistency: A High-Performing and Write-Efficient Software Persistency Technique. 439-451 - Arpit Joshi, Vijay Nagarajan, Marcelo Cintra, Stratis Viglas:
DHTM: Durable Hardware Transactional Memory. 452-465 - Tiancong Wang, Sakthikumaran Sambasivam, James Tuck:
Hardware Supported Permission Checks on Persistent Objects for Performance and Programmability. 466-478
Session 6B: Controllers & Control Systems
- Jacob Sacks, Divya Mahajan, Richard Connor Lawson, Hadi Esmaeilzadeh:
RoboX: An End-to-End Solution to Accelerate Autonomous Control in Robotics. 479-490 - Dongup Kwon, Jaehyung Ahn, Dongju Chae, Mohammadamin Ajdari, Jaewon Lee, Suheon Bae, Youngsok Kim, Jangwoo Kim:
DCS-ctrl: A Fast and Flexible Device-Control Mechanism for Device-Centric Server Architecture. 491-504 - Raghavendra Pradyumna Pothukuchi, Sweta Yamini Pothukuchi, Petros G. Voulgaris, Josep Torrellas:
Yukta: Multilayer Resource Controllers to Maximize Efficiency. 505-518
Session 7A: Mobile Platforms
- Samira Mirbagher Ajorpaz, Elba Garza, Sangam Jindal, Daniel A. Jiménez:
Exploring Predictive Replacement Policies for Instruction Cache and Branch Target Buffer. 519-532 - Mark Buckler, Philip Bedoukian, Suren Jayasuriya, Adrian Sampson:
EVA2: Exploiting Temporal Redundancy in Live Computer Vision. 533-546 - Yuhao Zhu, Anand Samajdar, Matthew Mattina, Paul N. Whatmough:
Euphrates: Algorithm-SoC Co-Design for Low-Power Mobile Continuous Vision. 547-560 - Woo-Seok Choi, Matthew Tomei, Jose Rodrigo Sanchez Vicarte, Pavan Kumar Hanumolu, Rakesh Kumar:
Guaranteeing Local Differential Privacy on Ultra-Low-Power Systems. 561-574 - Cheng Tan, Manupa Karunaratne, Tulika Mitra, Li-Shiuan Peh:
Stitch: Fusible Heterogeneous Accelerators Enmeshed with Many-Core Architecture for Wearables. 575-587
Session 7B: Security
- Kate Nguyen, Kehan Lyu, Xianze Meng, Vilas Sridharan, Xun Jian:
Nonblocking Memory Refresh. 588-599 - Kanad Sinha, Simha Sethumadhavan:
Practical Memory Safety with REST. 600-611 - Seyed Mohammad Seyedzadeh, Alex K. Jones, Rami G. Melhem:
Mitigating Wordline Crosstalk Using Adaptive Trees of Counters. 612-623 - Mohammadkazem Taram, Ashish Venkat, Dean M. Tullsen:
Mobilizing the Micro-Ops: Exploiting Context Sensitive Decoding for Security and Energy Efficiency. 624-637 - Alric Althoff, Joseph McMahan, Luis Vega, Scott Davidson, Timothy Sherwood, Michael B. Taylor, Ryan Kastner:
Hiding Intermittent Information Leakage with Architectural Support for Blinking. 638-649
Session 8A: Machine Learning Systems 1
- Amir Yazdanbakhsh, Kambiz Samadi, Nam Sung Kim, Hadi Esmaeilzadeh:
GANAX: A Unified MIMD-SIMD Acceleration for Generative Adversarial Networks. 650-661 - Vahideh Akhlaghi, Amir Yazdanbakhsh, Kambiz Samadi, Rajesh K. Gupta, Hadi Esmaeilzadeh:
SnaPEA: Predictive Early Activation for Reducing Computation in Deep Convolutional Neural Networks. 662-673 - Kartik Hegde, Jiyong Yu, Rohit Agrawal, Mengjia Yan, Michael Pellauer, Christopher W. Fletcher:
UCNN: Exploiting Computational Reuse in Deep Neural Networks via Weight Repetition. 674-687 - Eunhyeok Park, Dongyoung Kim, Sungjoo Yoo:
Energy-Efficient Neural Network Accelerator Based on Outlier-Aware Low-Precision Computation. 688-698
Session 8B: Interconnection Networks
- Aniruddh Ramrakhyani, Paul V. Gratz, Tushar Krishna:
Synchronized Progress in Interconnection Networks (SPIN): A New Theory for Deadlock Freedom. 699-711 - Gwangsun Kim, Hayoung Choi, John Kim:
TCEP: Traffic Consolidation for Energy-Proportional High-Radix Networks. 712-725 - Jieming Yin, Zhifeng Lin, Onur Kayiran, Matthew Poremba, Muhammad Shoaib Bin Altaf, Natalie D. Enright Jerger, Gabriel H. Loh:
Modular Routing Design for Chiplet-Based Systems. 726-738 - Nachiket Kapre, Tushar Krishna:
FastTrack: Leveraging Heterogeneous FPGA Wires to Design Low-Cost High-Performance Soft NoCs. 739-751
Session 9A: Machine Learning Systems 2
- Mingcong Song, Jiechen Zhao, Yang Hu, Jiaqi Zhang, Tao Li:
Prediction Based Execution on Deep Neural Networks. 752-763 - Hardik Sharma, Jongse Park, Naveen Suda, Liangzhen Lai, Benson Chau, Vikas Chandra, Hadi Esmaeilzadeh:
Bit Fusion: Bit-Level Dynamically Composable Architecture for Accelerating Deep Neural Network. 764-775 - Animesh Jain, Amar Phanishayee, Jason Mars, Lingjia Tang, Gennady Pekhimenko:
Gist: Efficient Data Encoding for Deep Neural Network Training. 776-789 - Reza Yazdani, Marc Riera, José-María Arnau, Antonio González:
The Dark Side of DNN Pruning. 790-801
Session 9B: GPUs
- Bhargava Gopireddy, Dimitrios Skarlatos, Wenjuan Zhu, Josep Torrellas:
HetCore: TFET-CMOS Hetero-Device Architecture for CPUs and GPUs. 802-815 - Farzad Khorasani, Hodjat Asghari Esfeden, Amin Farmahini Farahani, Nuwan Jayasena, Vivek Sarkar:
RegMutex: Inter-Warp GPU Register Time-Sharing. 816-828 - Nandita Vijaykumar, Eiman Ebrahimi, Kevin Hsieh, Phillip B. Gibbons, Onur Mutlu:
The Locality Descriptor: A Holistic Cross-Layer Abstraction to Express Data Locality In GPUs. 829-842 - Ján Veselý, Arkaprava Basu, Abhishek Bhattacharjee, Gabriel H. Loh, Mark Oskin, Steven K. Reinhardt:
Generic System Calls for GPUs. 843-856
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.