Author: Wong, Stephan : Search

research-article

Swordfish: A Framework for Evaluating Deep Neural Network-based Basecalling using Computation-In-Memory with Non-Ideal Memristors

MICRO '23: Proceedings of the 56th Annual IEEE/ACM International Symposium on MicroarchitecturePages 1437–1452https://doi.org/10.1145/3613424.3614252

Basecalling, an essential step in many genome analysis studies, relies on large Deep Neural Network s (DNN s) to achieve high accuracy. Unfortunately, these DNN s are computationally slow and inefficient, leading to considerable delays and resource ...

Article

A Case for Genome Analysis Where Genomes Reside

Embedded Computer Systems: Architectures, Modeling, and SimulationPages 453–458https://doi.org/10.1007/978-3-031-46077-7_30

Abstract

Genome analysis, such as studying human genomics, critically impacts various aspects of human life. These analyses involve diverse species and experience a surge in the data required to be dealt with. However, extant computer systems grapple with ...

Article

RattlesnakeJake: A Fast and Accurate Pre-alignment Filter Suitable for Computation-in-Memory

Embedded Computer Systems: Architectures, Modeling, and SimulationPages 209–221https://doi.org/10.1007/978-3-031-46077-7_14

Abstract

Significant improvements in pre-alignment filter accuracy have shifted the execution bottleneck of short-read sequence alignment to the filtering step for many genomics datasets. Current pre-alignment filters move data from memory to the ...

Article

Micro-architecture and Control Electronics Simulation of Modular Color Center-Based Quantum Computers

Embedded Computer Systems: Architectures, Modeling, and SimulationPages 141–157https://doi.org/10.1007/978-3-031-46077-7_10

Abstract

In the world of quantum computing, developments of electronics needed to control (a) qubit(s) rapidly follow one another. Consequently, (micro-)architectures need to be defined to ultimately build control logic. In this paper, we present the (...

research-article

Open Access

KrakenOnMem: a memristor-augmented HW/SW framework for taxonomic profiling

ICS '22: Proceedings of the 36th ACM International Conference on SupercomputingArticle No.: 29, Pages 1–14https://doi.org/10.1145/3524059.3532367

State-of-the-art taxonomic profilers that comprise the first step in larger-context metagenomic studies have proven to be computationally intensive, i.e., while accurate, they come at the cost of high latency and energy consumption. Table Lookup ...

research-article

Open Access

MNEMOSENE: Tile Architecture and Simulator for Memristor-based Computation-in-memory

ACM Journal on Emerging Technologies in Computing Systems (JETC), Volume 18, Issue 3Article No.: 44, Pages 1–24https://doi.org/10.1145/3485824

In recent years, we are witnessing a trend toward in-memory computing for future generations of computers that differs from traditional von-Neumann architecture in which there is a clear distinction between computing and memory units. Considering that ...

research-article

FPGA-based Deep Learning Accelerator for RF Applications

MILCOM 2021 - 2021 IEEE Military Communications Conference (MILCOM)Pages 751–756https://doi.org/10.1109/MILCOM52596.2021.9652891

A key obstacle within the design of cognitive radios has always been the spectrum sensing component that implements the function automatic modulation classification (AMC). With the transition to software-defined radios (SDRs) followed by the introduction ...

short-paper

CIM-SIM: Computation In Memory SIMuIator

SCOPES '19: Proceedings of the 22nd International Workshop on Software and Compilers for Embedded SystemsPages 1–4https://doi.org/10.1145/3323439.3323989

Computation-in-memory reverses the trend in von-Neumann processors by bringing the computation closer to the data, to even within the memory array, as opposed to introducing new memory hierarchies to keep (frequently used) data closer to a central ...

research-article

Exploiting Idle Hardware to Provide Low Overhead Fault Tolerance for VLIW Processors

ACM Journal on Emerging Technologies in Computing Systems (JETC), Volume 13, Issue 2Article No.: 13, Pages 1–21https://doi.org/10.1145/3001935

Because of technology scaling, the soft error rate has been increasing in digital circuits, which affects system reliability. Therefore, modern processors, including VLIW architectures, must have means to mitigate such effects to guarantee reliable ...

article

A Dynamic Modulo Scheduling with Binary Translation: Loop optimization with software compatibility

Journal of Signal Processing Systems (JSPS), Volume 85, Issue 1Pages 45–66https://doi.org/10.1007/s11265-015-0974-8

In the past years, many works have demonstrated the applicability of Coarse-Grained Reconfigurable Array (CGRA) accelerators to optimize loops by using software pipelining approaches. They are proven to be effective in reducing the total execution time ...

research-article

Free

Run-time phase prediction for a reconfigurable VLIW processor

DATE '16: Proceedings of the 2016 Conference on Design, Automation & Test in EuropePages 1634–1639

It is well-known that different applications exhibit varying amounts of ILP. Execution of these applications on the same fixed-width VLIW processor will result (1) in wasted energy due to underutilized resources if the issue-width of the processor is ...

research-article

A Runtime FPGA Placement and Routing Using Low-Complexity Graph Traversal

ACM Transactions on Reconfigurable Technology and Systems (TRETS), Volume 8, Issue 2Article No.: 9, Pages 1–16https://doi.org/10.1145/2660775

Dynamic Partial Reconfiguration (DPaR) enables efficient allocation of logic resources by adding new functionalities or by sharing and/or multiplexing resources over time. Placement and routing (P&R) is one of the most time-consuming steps in the DPaR ...

proceeding

CS2 '15: Proceedings of the Second Workshop on Cryptography and Security in Computing Systems

On behalf of the program and organizing committees, it is a great pleasure to welcome you to the Second Workshop on Cryptography and Security in Computing Systems (CS2 2015). The CS2 meeting is a co-located event with HiPEAC 2015 in Amsterdam, ...

article

Multiple description coding for SNR scalable video transmission over unreliable networks

Multimedia Tools and Applications (MTAA), Volume 69, Issue 3Pages 843–858https://doi.org/10.1007/s11042-012-1150-9

Streaming multimedia data on best-effort networks such as the Internet requires measures against bandwidth fluctuations and frame loss. Multiple Description Coding (MDC) methods are used to overcome the jitter and delay problems arising from frame ...

Article

Embedded reconfigurable architectures

Stephan Wong

ARC'13: Proceedings of the 9th international conference on Reconfigurable Computing: architectures, tools, and applicationsPages 229–230https://doi.org/10.1007/978-3-642-36812-7_30

ERA (Embedded Reconfigurable Architectures) is a 3-year project funded by the EU under the FP7 framework programme (starting January 2010). The ERA project addresses issues rising from a scenario in which the complexity and diversity of embedded systems ...

Article

Configurable fault-tolerance for a configurable VLIW processor

ARC'13: Proceedings of the 9th international conference on Reconfigurable Computing: architectures, tools, and applicationsPages 167–178https://doi.org/10.1007/978-3-642-36812-7_16

This paper presents the design and implementation of configurable fault-tolerance techniques for a configurable VLIW processor. The processor can be configured for 2, 4, or 8 issue-slots with different types of execution functional units (FUs), and its ...

research-article

Support for dynamic issue width in VLIW processors using generic binaries

DATE '13: Proceedings of the Conference on Design, Automation and Test in EuropePages 827–832

Different applications exhibit different behavior that cannot be optimally captured by a fixed organization of a VLIW processor. However, through exploitation of reconfigurable hardware we can optimize the organization when running different ...

tutorial

Embedded reconfigurable architectures

CASES '12: Proceedings of the 2012 international conference on Compilers, architectures and synthesis for embedded systemsPages 213–214https://doi.org/10.1145/2380403.2380444

In current-day embedded systems design, one is faced with cut-throat competition to deliver new functionalities in increasingly shorter time frames. This is now achieved by incorporating processor cores into embedded systems through (re-)...

Article

On Virtualization of Reconfigurable Hardware in Distributed Systems

ICPPW '12: Proceedings of the 2012 41st International Conference on Parallel Processing WorkshopsPages 348–356https://doi.org/10.1109/ICPPW.2012.51

In the design of next-generation distributed and high-performance computing systems, Reconfigurable Processing Elements (RPEs) such as FPGAs and multi-core heterogeneous computers will play an important role. FPGAs are well-known for their ...

Article

Task Scheduling in Large-scale Distributed Systems Utilizing Partial Reconfigurable Processing Elements

IPDPSW '12: Proceedings of the 2012 IEEE 26th International Parallel and Distributed Processing Symposium Workshops & PhD ForumPages 79–90https://doi.org/10.1109/IPDPSW.2012.6

Recent progress in processing speeds, network bandwidths, and middleware technologies have contributed towards novel computing platforms, ranging from large-scale computing clusters to globally distributed systems. Consequently, most current computing ...

Applied Filters

People

Names

Institutions

Authors

Editors

Reviewers

Publications

Journal/Magazine Names

Proceedings/Book Names

All Publications

Content Type

Media Formats

Publisher

Conferences

Sponsors

Conference Event

Proceedings Series

Publication Date

Save to Binder

Upcoming Conferences