- research-article, July 2024
Design and Implementation of an IPC-based Collective MPI Library for Intel GPUs
PEARC '24: Practice and Experience in Advanced Research Computing 2024: Human Powered Computing, Article No.: 17, Pages 1–9
https://doi.org/10.1145/3626203.3670549
With the rising demand for computing power in High-Performance Computing and Deep Learning applications, there is a noticeable trend in outfitting modern exascale clusters with accelerators. In recent years, Intel has been designing and developing GPU ...
- research-article, March 2024
GIPC: Fast and Stable Gauss-Newton Optimization of IPC Barrier Energy
ACM Transactions on Graphics (TOG), Volume 43, Issue 2, Article No.: 23, Pages 1–18
https://doi.org/10.1145/3643028
Barrier functions are crucial for maintaining an intersection- and inversion-free simulation trajectory but existing methods, which directly use distance can restrict implementation design and performance. We present an approach to rewriting the barrier ...
Pocket: ML Serving from the Edge
EuroSys '23: Proceedings of the Eighteenth European Conference on Computer Systems, Pages 46–62
https://doi.org/10.1145/3552326.3587459
One of the major challenges in serving ML applications is the resource pressure introduced by the underlying ML frameworks. This becomes a bigger problem at resource-constrained, multi-tenant edge server locations, where it is necessary to scale to a ...
- research-article, October 2022
DistFax: a toolkit for measuring interprocess communications and quality of distributed systems
ICSE '22: Proceedings of the ACM/IEEE 44th International Conference on Software Engineering: Companion Proceedings, Pages 51–55
https://doi.org/10.1145/3510454.3516859
In this paper, we present DistFax, a toolkit for measuring common distributed systems, focusing on their interprocess communications (IPCs), a vital aspect of distributed system run-time behaviors. DistFax measures the coupling and cohesion of ...
- research-article, January 2022
Augmenting keyword-based patent prior art search using weighted classification code hierarchies
International Journal of Business Intelligence and Data Mining (IJBIDM), Volume 21, Issue 4, Pages 397–418
https://doi.org/10.1504/ijbidm.2022.126500
Patents are critical intellectual assets for any business. With the rapid increase in the patent filings, patent prior art retrieval has become an important task. The goal of the prior art retrieval is to find documents relevant to a patent application. ...
- research-article, November 2021
A new computational method for acquiring effect knowledge to support product innovation
Abstract: Effect provides a scientific principle-level means for product function realization. The unexpected or new application of effects can create high-level innovations enabling products long-term technical advantages and market ...
Highlights: A function-technical area representation is used to capture effect knowledge. ...
- Article, October 2020
On the Evolution of Security Issues in Android App Versions
Applied Cryptography and Network Security Workshops, Pages 523–541
https://doi.org/10.1007/978-3-030-61638-0_29
Abstract: Since its launch in 2008, the Android platform has seen a lot of development and improvements to this day. Android developer studios had to refine their understanding and available codebases considerably in the past decade since Android’s ...
- research-article, October 2019
A real-time scratchpad-centric OS with predictable inter/intra-core communication for multi-core embedded systems
Abstract: Multi-core processors have replaced single-core systems in almost every segment of the industry. Unfortunately, their increased complexity often causes a loss of temporal predictability which represents a key requirement for hard real-time ...
- research-article, June 2019
Exploring the Role of Large Centralised Caches in Thermal Efficient Chip Design
ACM Transactions on Design Automation of Electronic Systems (TODAES), Volume 24, Issue 5, Article No.: 52, Pages 1–28
https://doi.org/10.1145/3339850
In the era of short channel length, Dynamic Thermal Management (DTM) has become a challenging task for the architects and designers engineering modern Chip Multi-Processors (CMPs). Ever-increasing demand of processing power along with the developed ...
- article, August 2017
Properties of information sets and information processing with an application to face recognition
Knowledge and Information Systems (KAIS), Volume 52, Issue 2, Pages 485–507
https://doi.org/10.1007/s10115-016-1017-x
This paper presents the properties of information sets that help derive local features from a face when partitioned into windows and devises the information rules from the generalized fuzzy rules for information processing that helps match the unknown ...
- research-article, September 2016
Efficient resource sharing algorithm for physical register file in simultaneous multi-threading processors
Microprocessors & Microsystems (MSYS), Volume 45, Issue PB, Pages 270–282
https://doi.org/10.1016/j.micpro.2016.06.002
Simultaneous Multi-Threading (SMT) processors increase performance by allowing concurrent execution of multiple independent threads with sharing of key datapath components and better utilization of the resources. An SMT processor usually maintains a ...
- article, June 2016
A Guide to the StatFact EViews Add-in
Computational Economics (KLU-CSEM), Volume 48, Issue 1, Pages 183–188
https://doi.org/10.1007/s10614-015-9507-6
This is a short paper which details the user-written StatFact add-in released by IHS on November 10, 2014. The first section introduces and explains the motivation behind the development of the add-in, the second section details the econometric ...
- research-article, April 2016
L4 Microkernels: The Lessons from 20 Years of Research and Deployment
ACM Transactions on Computer Systems (TOCS), Volume 34, Issue 1, Article No.: 1, Pages 1–29
https://doi.org/10.1145/2893177
The L4 microkernel has undergone 20 years of use and evolution. It has an active user and developer community, and there are commercial versions that are deployed on a large scale and in safety-critical systems. In this article we examine the lessons ...
- research-article, December 2015
Multilayer source selection as a tool for supporting patent search and classification
Information Retrieval (INFRE), Volume 18, Issue 6, Pages 559–585
https://doi.org/10.1007/s10791-015-9270-2
Abstract: In this paper we present a method that can be used to attain specific objectives in a typical prior art search process. The objectives are first to assist patent searchers in understanding the underlying technical concepts of a patent by ...
- Article, November 2015
Self-Timed Periodic Scheduling of Data-Dependent Tasks in Embedded Streaming Applications
ICA3PP 2015: Proceedings, Part II, of the 15th International Conference on Algorithms and Architectures for Parallel Processing - Volume 9529, Pages 458–478
https://doi.org/10.1007/978-3-319-27122-4_32
Developers increasingly use streaming languages to write embedded many-core applications that process large volumes of data with high throughput. Because they enable periodic scheduling, cyclo-static models of computation and their variants are well ...
- article, December 2014
The Software Architecture for Efficient Distributed Interprocess Communication in Mobile Distributed Systems
Journal of Grid Computing (SPJGC), Volume 12, Issue 4, Pages 615–635
https://doi.org/10.1007/s10723-014-9304-9
The mobile distributed computing applications execute in heterogeneous and dynamic network environments and require efficient as well as reliable interprocess communication (IPC) mechanism. In general, the kernel-level IPC mechanisms offer improved ...
- research-article, September 2014
GPU-Aware Intranode MPI_Allreduce
EuroMPI/ASIA '14: Proceedings of the 21st European MPI Users' Group Meeting, Pages 45–50
https://doi.org/10.1145/2642769.2642773
Modern multi-core clusters are increasingly using GPUs to achieve higher performance and power efficiency. In such clusters, efficient communication among processes with data residing in GPU memory is of paramount importance to the performance of MPI ...
- short-paper, June 2014
IP-NUMA for low-latency communication
CFI '14: Proceedings of The Ninth International Conference on Future Internet Technologies, Article No.: 13, Pages 1–4
https://doi.org/10.1145/2619287.2619294
With cloud service becoming more popular, low-latency communication is required between servers in a data center. Low-latency node-to-node or application-to-application notification can be achieved in a NUMA [1] (Non-Uniform Memory Access) system, but ...
- poster, May 2014
Customizing an open source processor to fit in an ultra-low power cluster with a shared L1 memory
GLSVLSI '14: Proceedings of the 24th edition of the great lakes symposium on VLSI, Pages 87–88
https://doi.org/10.1145/2591513.2591569
The OpenRISC processor core, featuring a flat pipeline and a low area footprint has been integrated in a multi-core ultra-low power (ULP) cluster with a shared multi-banked memory to exploit parallelism in the near-threshold regime. The micro-...
- Article, March 2014
Dfuzzer: A D-Bus Service Fuzzing Tool
ICSTW '14: Proceedings of the 2014 IEEE International Conference on Software Testing, Verification, and Validation Workshops, Pages 383–389
https://doi.org/10.1109/ICSTW.2014.51
We present Dfuzzer, a fully automated tool for fuzz testing programs communicating via D-Bus. D-Bus is the prevalent modern mechanism for an inter-process communication in the GNU/Linux ecosystem. Programs receiving data over D-Bus should sanitize the ...