Export Citations
Save this search
Please login to be able to save your searches and receive alerts for new content matching your search criteria.
- research-articleDecember 2023
Swordfish: A Framework for Evaluating Deep Neural Network-based Basecalling using Computation-In-Memory with Non-Ideal Memristors
- Taha Shahroodi,
- Gagandeep Singh,
- Mahdi Zahedi,
- Haiyu Mao,
- Joel Lindegger,
- Can Firtina,
- Stephan Wong,
- Onur Mutlu,
- Said Hamdioui
MICRO '23: Proceedings of the 56th Annual IEEE/ACM International Symposium on MicroarchitecturePages 1437–1452https://doi.org/10.1145/3613424.3614252Basecalling, an essential step in many genome analysis studies, relies on large Deep Neural Network s (DNN s) to achieve high accuracy. Unfortunately, these DNN s are computationally slow and inefficient, leading to considerable delays and resource ...
- ArticleNovember 2023
A Case for Genome Analysis Where Genomes Reside
Embedded Computer Systems: Architectures, Modeling, and SimulationPages 453–458https://doi.org/10.1007/978-3-031-46077-7_30AbstractGenome analysis, such as studying human genomics, critically impacts various aspects of human life. These analyses involve diverse species and experience a surge in the data required to be dealt with. However, extant computer systems grapple with ...
- ArticleNovember 2023
Micro-architecture and Control Electronics Simulation of Modular Color Center-Based Quantum Computers
Embedded Computer Systems: Architectures, Modeling, and SimulationPages 141–157https://doi.org/10.1007/978-3-031-46077-7_10AbstractIn the world of quantum computing, developments of electronics needed to control (a) qubit(s) rapidly follow one another. Consequently, (micro-)architectures need to be defined to ultimately build control logic. In this paper, we present the (...
- research-articleJune 2022
KrakenOnMem: a memristor-augmented HW/SW framework for taxonomic profiling
ICS '22: Proceedings of the 36th ACM International Conference on SupercomputingArticle No.: 29, Pages 1–14https://doi.org/10.1145/3524059.3532367State-of-the-art taxonomic profilers that comprise the first step in larger-context metagenomic studies have proven to be computationally intensive, i.e., while accurate, they come at the cost of high latency and energy consumption. Table Lookup ...
-
- research-articleJanuary 2022
MNEMOSENE: Tile Architecture and Simulator for Memristor-based Computation-in-memory
- Mahdi Zahedi,
- Muah Abu Lebdeh,
- Christopher Bengel,
- Dirk Wouters,
- Stephan Menzel,
- Manuel Le Gallo,
- Abu Sebastian,
- Stephan Wong,
- Said Hamdioui
ACM Journal on Emerging Technologies in Computing Systems (JETC), Volume 18, Issue 3Article No.: 44, Pages 1–24https://doi.org/10.1145/3485824In recent years, we are witnessing a trend toward in-memory computing for future generations of computers that differs from traditional von-Neumann architecture in which there is a clear distinction between computing and memory units. Considering that ...
- research-articleNovember 2021
FPGA-based Deep Learning Accelerator for RF Applications
MILCOM 2021 - 2021 IEEE Military Communications Conference (MILCOM)Pages 751–756https://doi.org/10.1109/MILCOM52596.2021.9652891A key obstacle within the design of cognitive radios has always been the spectrum sensing component that implements the function automatic modulation classification (AMC). With the transition to software-defined radios (SDRs) followed by the introduction ...
- short-paperMay 2019
CIM-SIM: Computation In Memory SIMuIator
- Ali BanaGozar,
- Kanishkan Vadivel,
- Sander Stuijk,
- Henk Corporaal,
- Stephan Wong,
- Muath Abu Lebdeh,
- Jintao Yu,
- Said Hamdioui
SCOPES '19: Proceedings of the 22nd International Workshop on Software and Compilers for Embedded SystemsPages 1–4https://doi.org/10.1145/3323439.3323989Computation-in-memory reverses the trend in von-Neumann processors by bringing the computation closer to the data, to even within the memory array, as opposed to introducing new memory hierarchies to keep (frequently used) data closer to a central ...
- research-articleJanuary 2017
Exploiting Idle Hardware to Provide Low Overhead Fault Tolerance for VLIW Processors
- Anderson L. Sartor,
- Arthur F. Lorenzon,
- Luigi Carro,
- Fernanda Kastensmidt,
- Stephan Wong,
- Antonio C. S. Beck
ACM Journal on Emerging Technologies in Computing Systems (JETC), Volume 13, Issue 2Article No.: 13, Pages 1–21https://doi.org/10.1145/3001935Because of technology scaling, the soft error rate has been increasing in digital circuits, which affects system reliability. Therefore, modern processors, including VLIW architectures, must have means to mitigate such effects to guarantee reliable ...
- articleOctober 2016
A Dynamic Modulo Scheduling with Binary Translation: Loop optimization with software compatibility
Journal of Signal Processing Systems (JSPS), Volume 85, Issue 1Pages 45–66https://doi.org/10.1007/s11265-015-0974-8In the past years, many works have demonstrated the applicability of Coarse-Grained Reconfigurable Array (CGRA) accelerators to optimize loops by using software pipelining approaches. They are proven to be effective in reducing the total execution time ...
- research-articleMarch 2016
Run-time phase prediction for a reconfigurable VLIW processor
It is well-known that different applications exhibit varying amounts of ILP. Execution of these applications on the same fixed-width VLIW processor will result (1) in wasted energy due to underutilized resources if the issue-width of the processor is ...
- research-articleMarch 2015
A Runtime FPGA Placement and Routing Using Low-Complexity Graph Traversal
ACM Transactions on Reconfigurable Technology and Systems (TRETS), Volume 8, Issue 2Article No.: 9, Pages 1–16https://doi.org/10.1145/2660775Dynamic Partial Reconfiguration (DPaR) enables efficient allocation of logic resources by adding new functionalities or by sharing and/or multiplexing resources over time. Placement and routing (P&R) is one of the most time-consuming steps in the DPaR ...
- proceedingJanuary 2015
CS2 '15: Proceedings of the Second Workshop on Cryptography and Security in Computing Systems
On behalf of the program and organizing committees, it is a great pleasure to welcome you to the Second Workshop on Cryptography and Security in Computing Systems (CS2 2015). The CS2 meeting is a co-located event with HiPEAC 2015 in Amsterdam, ...
- articleApril 2014
Multiple description coding for SNR scalable video transmission over unreliable networks
Multimedia Tools and Applications (MTAA), Volume 69, Issue 3Pages 843–858https://doi.org/10.1007/s11042-012-1150-9Streaming multimedia data on best-effort networks such as the Internet requires measures against bandwidth fluctuations and frame loss. Multiple Description Coding (MDC) methods are used to overcome the jitter and delay problems arising from frame ...
- ArticleMarch 2013
Embedded reconfigurable architectures
ARC'13: Proceedings of the 9th international conference on Reconfigurable Computing: architectures, tools, and applicationsPages 229–230https://doi.org/10.1007/978-3-642-36812-7_30ERA (Embedded Reconfigurable Architectures) is a 3-year project funded by the EU under the FP7 framework programme (starting January 2010). The ERA project addresses issues rising from a scenario in which the complexity and diversity of embedded systems ...
- ArticleMarch 2013
Configurable fault-tolerance for a configurable VLIW processor
ARC'13: Proceedings of the 9th international conference on Reconfigurable Computing: architectures, tools, and applicationsPages 167–178https://doi.org/10.1007/978-3-642-36812-7_16This paper presents the design and implementation of configurable fault-tolerance techniques for a configurable VLIW processor. The processor can be configured for 2, 4, or 8 issue-slots with different types of execution functional units (FUs), and its ...
- research-articleMarch 2013
Support for dynamic issue width in VLIW processors using generic binaries
Different applications exhibit different behavior that cannot be optimally captured by a fixed organization of a VLIW processor. However, through exploitation of reconfigurable hardware we can optimize the organization when running different ...
- tutorialOctober 2012
Embedded reconfigurable architectures
- Stephan Wong,
- Luigi Carro,
- Stamatis Kavvadias,
- Georgios Keramidas,
- Francesco Papariello,
- Claudio Scordino,
- Roberto Giorgi,
- Stefanos Kaxiras
CASES '12: Proceedings of the 2012 international conference on Compilers, architectures and synthesis for embedded systemsPages 213–214https://doi.org/10.1145/2380403.2380444In current-day embedded systems design, one is faced with cut-throat competition to deliver new functionalities in increasingly shorter time frames. This is now achieved by incorporating processor cores into embedded systems through (re-)...
- ArticleSeptember 2012
On Virtualization of Reconfigurable Hardware in Distributed Systems
ICPPW '12: Proceedings of the 2012 41st International Conference on Parallel Processing WorkshopsPages 348–356https://doi.org/10.1109/ICPPW.2012.51In the design of next-generation distributed and high-performance computing systems, Reconfigurable Processing Elements (RPEs) such as FPGAs and multi-core heterogeneous computers will play an important role. FPGAs are well-known for their ...
- ArticleMay 2012
Task Scheduling in Large-scale Distributed Systems Utilizing Partial Reconfigurable Processing Elements
IPDPSW '12: Proceedings of the 2012 IEEE 26th International Parallel and Distributed Processing Symposium Workshops & PhD ForumPages 79–90https://doi.org/10.1109/IPDPSW.2012.6Recent progress in processing speeds, network bandwidths, and middleware technologies have contributed towards novel computing platforms, ranging from large-scale computing clusters to globally distributed systems. Consequently, most current computing ...