Author: Ahn, Junwhan : Search

21 Results for: Author: Ahn, Junwhan

Searched The ACM Guide to Computing Literature (3,816,101 records)

Showing 1 - 20 of 21 Results

research-article
Free
July 2020
An imitation learning approach for cache replacement
ICML'20: Proceedings of the 37th International Conference on Machine Learning, Article No.: 579, Pages 6237–6247

Program execution speed critically depends on increasing cache hits, as cache hits are orders of magnitude faster than misses. To increase cache hits, we focus on the problem of cache replacement: choosing which cache line to evict upon inserting a new ...
Metrics: Total Citations: 3; Total Downloads: 231; Last 12 Months: 173; Last 6 weeks: 6
Supplementary Material: 1 (Additional material)
research-article
April 2019
Software-Defined Far Memory in Warehouse-Scale Computers
ASPLOS '19: Proceedings of the Twenty-Fourth International Conference on Architectural Support for Programming Languages and Operating Systems, Pages 317–330, https://doi.org/10.1145/3297858.3304053

Increasing memory demand and slowdown in technology scaling pose important challenges to total cost of ownership (TCO) of warehouse-scale computers (WSCs). One promising idea to reduce the memory TCO is to add a cheaper, but slower, "far memory" tier ...
Metrics: Total Citations: 79; Total Downloads: 5,936; Last 12 Months: 413; Last 6 weeks: 42
research-article
September 2018
Nonvolatile Write Buffer-Based Journaling Bypass for Storage Write Reduction in Mobile Devices
IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems (TCADICS), Volume 37, Issue 9, Pages 1747–1759, https://doi.org/10.1109/TCAD.2017.2774192

In mobile systems, such as smartphones, most of storage writes are incurred by the SQLite database (DB) system. These writes consist of two parts: writes to original data (e.g., SQLite DB file) and journaling-induced writes. In this paper, we first ...
Metrics: Total Citations: 1
research-article
Open Access
March 2018
Benzene: An Energy-Efficient Distributed Hybrid Cache Architecture for Manycore Systems
ACM Transactions on Architecture and Code Optimization (TACO), Volume 15, Issue 1, Article No.: 10, Pages 1–23, https://doi.org/10.1145/3177963

This article proposes Benzene, an energy-efficient distributed SRAM/STT-RAM hybrid cache for manycore systems running multiple applications. It is based on the observation that a naïve application of hybrid cache techniques to distributed caches in a ...
Metrics: Total Citations: 9; Total Downloads: 724; Last 12 Months: 77; Last 6 weeks: 13
research-article
June 2017
Making DRAM Stronger Against Row Hammering
DAC '17: Proceedings of the 54th Annual Design Automation Conference 2017, Article No.: 55, Pages 1–6, https://doi.org/10.1145/3061639.3062281

Modern DRAM suffers from a new problem called row hammering. The problem is expected to become more severe in future DRAMs mostly due to increased inter-row coupling at advanced technology. In order to address this problem, we present a ...
Metrics: Total Citations: 69; Total Downloads: 597; Last 12 Months: 88; Last 6 weeks: 11
research-article
Free
March 2017
A novel zero weight/activation-aware hardware architecture of convolutional neural network
DATE '17: Proceedings of the Conference on Design, Automation & Test in Europe, Pages 1466–1471

It is imperative to accelerate convolutional neural networks (CNNs) due to their ever-widening application areas from server, mobile to IoT devices. Based on the fact that CNNs can be characterized by a significant amount of zero values in both kernel ...
Metrics: Total Citations: 3; Total Downloads: 164; Last 12 Months: 29; Last 6 weeks: 4
research-article
Open Access
October 2016
AIM: Energy-Efficient Aggregation Inside the Memory Hierarchy
ACM Transactions on Architecture and Code Optimization (TACO), Volume 13, Issue 4, Article No.: 34, Pages 1–24, https://doi.org/10.1145/2994149

In this article, we propose Aggregation-in-Memory (AIM), a new processing-in-memory system designed for energy efficiency and near-term adoption. In order to efficiently perform aggregation, we implement simple aggregation operations in main memory and ...
Metrics: Total Citations: 7; Total Downloads: 751; Last 12 Months: 92; Last 6 weeks: 14
research-article
October 2016
Zero and data reuse-aware fast convolution for deep neural networks on GPU
CODES '16: Proceedings of the Eleventh IEEE/ACM/IFIP International Conference on Hardware/Software Codesign and System Synthesis, Article No.: 33, Pages 1–10, https://doi.org/10.1145/2968456.2968476

Convolution operations dominate the total execution time of deep convolutional neural networks (CNNs). In this paper, we aim at enhancing the performance of the state-of-the-art convolution algorithm (called Winograd convolution) on the GPU. Our work is ...
Metrics: Total Citations: 29; Total Downloads: 721; Last 12 Months: 39; Last 6 weeks: 4
research-article
April 2016
Differential Write-Conscious Software Design on Phase-Change Memory: An SQLite Case Study
ACM Transactions on Design Automation of Electronic Systems (TODAES), Volume 21, Issue 3, Article No.: 47, Pages 1–25, https://doi.org/10.1145/2842613

Phase-change memory (PCM) has several benefits including low cost, non-volatility, byte-addressability, etc., and limitations such as write endurance. There have been several hardware approaches to exploit the benefits while minimizing the negative ...
Metrics: Total Citations: 0; Total Downloads: 299; Last 12 Months: 5; Last 6 weeks: 1
article
March 2016
Prediction Hybrid Cache: An Energy-Efficient STT-RAM Cache Architecture
IEEE Transactions on Computers (ITCO), Volume 65, Issue 3, Pages 940–951, https://doi.org/10.1109/TC.2015.2435772

Spin-transfer torque RAM (STT-RAM) has emerged as an energy-efficient and high-density alternative to SRAM for large on-chip caches. However, its high write energy has been considered as a serious drawback. Hybrid caches mitigate this problem by ...
Metrics: Total Citations: 11
research-article
February 2016
Low-Power Hybrid Memory Cubes With Link Power Management and Two-Level Prefetching
IEEE Transactions on Very Large Scale Integration (VLSI) Systems (ITVL), Volume 24, Issue 2, Pages 453–464, https://doi.org/10.1109/TVLSI.2015.2420315

The hybrid memory cube (HMC) is a 3-D-stacked DRAM architecture designed for substantially improved memory bandwidth. In particular, its I/O interface achieves up to 320 GB/s of external bandwidth through high-speed serial links. However, it comes at the ...
Metrics: Total Citations: 3
research-article
October 2015
A tiny-capacitor-backed non-volatile buffer to reduce storage writes in smartphones
CODES '15: Proceedings of the 10th International Conference on Hardware/Software Codesign and System Synthesis, Pages 21–29

Mobile storage writes are often dominated by writes to SQLite database files. Our characterization shows that they mostly consist of frequent overwrites with small new data (which we call small writes) and relatively infrequent writes with large data ...
Metrics: Total Citations: 2; Total Downloads: 115; Last 12 Months: 2; Last 6 weeks: 0
research-article
June 2015
A scalable processing-in-memory accelerator for parallel graph processing
ISCA '15: Proceedings of the 42nd Annual International Symposium on Computer Architecture, Pages 105–117, https://doi.org/10.1145/2749469.2750386

The explosion of digital data and the ever-growing need for fast data analysis have made in-memory big-data processing in computer systems increasingly important. In particular, large-scale graph processing is gaining attention due to its broad ...
Also Published in: ACM SIGARCH Computer Architecture News: Volume 43 Issue 3S
Metrics: Total Citations: 683; Total Downloads: 5,649; Last 12 Months: 558; Last 6 weeks: 49
research-article
June 2015
PIM-enabled instructions: a low-overhead, locality-aware processing-in-memory architecture
ISCA '15: Proceedings of the 42nd Annual International Symposium on Computer Architecture, Pages 336–348, https://doi.org/10.1145/2749469.2750385

Processing-in-memory (PIM) is rapidly rising as a viable solution for the memory wall crisis, rebounding from its unsuccessful attempts in 1990s due to practicality concerns, which are alleviated with recent advances in 3D stacking technologies. However,...
Also Published in: ACM SIGARCH Computer Architecture News: Volume 43 Issue 3S
Metrics: Total Citations: 377; Total Downloads: 2,014; Last 12 Months: 357; Last 6 weeks: 40
research-article
March 2015
Memory fast-forward: a low cost special function unit to enhance energy efficiency in GPU for big data processing
DATE '15: Proceedings of the 2015 Design, Automation & Test in Europe Conference & Exhibition, Pages 1341–1346

Big data processing, e.g., graph computation and MapReduce, is characterized by massive parallelism in computation and a large amount of fine-grained random memory accesses often with structural localities due to graph-like data dependency. Recently, ...
Metrics: Total Citations: 0; Total Downloads: 152; Last 12 Months: 2; Last 6 weeks: 1
research-article
June 2014
Dynamic Power Management of Off-Chip Links for Hybrid Memory Cubes
DAC '14: Proceedings of the 51st Annual Design Automation Conference, Pages 1–6, https://doi.org/10.1145/2593069.2593128

The Hybrid Memory Cube (HMC) is a 3D-stacked DRAM architecture designed for substantially improved memory bandwidth. In particular, its I/O interface achieves up to 320 GB/s of external bandwidth through high-speed serial links. However, it comes at a ...
Metrics: Total Citations: 16; Total Downloads: 382; Last 12 Months: 3; Last 6 weeks: 0
research-article
September 2013
Write intensity prediction for energy-efficient non-volatile caches
ISLPED '13: Proceedings of the 2013 International Symposium on Low Power Electronics and Design, Pages 223–228

This paper presents a novel concept called write intensity prediction for energy-efficient non-volatile caches as well as the architecture that implements the concept. The key idea is to correlate write intensity of cache blocks with addresses of memory ...
Metrics: Total Citations: 3; Total Downloads: 101; Last 12 Months: 0; Last 6 weeks: 0
research-article
Open Access
May 2013
Power-Efficient Predication Techniques for Acceleration of Control Flow Execution on CGRA
ACM Transactions on Architecture and Code Optimization (TACO), Volume 10, Issue 2, Article No.: 8, Pages 1–25, https://doi.org/10.1145/2459316.2459319

Coarse-grained reconfigurable architecture typically has an array of processing elements which are controlled by a centralized unit. This makes it difficult to execute programs having control divergence among PEs without predication. However, ...
Metrics: Total Citations: 35; Total Downloads: 1,146; Last 12 Months: 171; Last 6 weeks: 19
research-article
January 2013
Isomorphism-Aware Identification of Custom Instructions With I/O Serialization
Junwhan Ahn, Kiyoung Choi
IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems (TCADICS), Volume 32, Issue 1, Pages 34–46, https://doi.org/10.1109/TCAD.2012.2214033

Extensible processors have been widely used to achieve the conflicting demands for performance improvement, low power consumption, and flexibility. As extensible processors have become more popular, several algorithms have been proposed for ...
Metrics: Total Citations: 1
research-article
October 2011
An efficient algorithm for isomorphism-aware custom instruction identification for extensible processors
Junwhan Ahn, Kiyoung Choi
CODES+ISSS '11: Proceedings of the seventh IEEE/ACM/IFIP international conference on Hardware/software codesign and system synthesis, Pages 345–354, https://doi.org/10.1145/2039370.2039424

Extensible processors have been widely used to achieve the conflicting demands for performance improvement, low power consumption, and flexibility. As extensible processors have become more popular, several algorithms have been proposed for ...
Metrics: Total Citations: 2; Total Downloads: 163; Last 12 Months: 4; Last 6 weeks: 2
