Export Citations
Save this search
Please login to be able to save your searches and receive alerts for new content matching your search criteria.
- research-articleDecember 2024
FastLoad: Speeding Up Data Loading of Both Sparse Matrix and Vector for SpMV on GPUs
IEEE Transactions on Parallel and Distributed Systems (TPDS), Volume 35, Issue 12Pages 2423–2434https://doi.org/10.1109/TPDS.2024.3477431Sparse Matrix-Vector Multiplication (SpMV) on GPUs has gained significant attention because of SpMV's importance in modern applications and the increasing computing power of GPUs in the last decade. Previous studies have emphasized the importance ...
- research-articleNovember 2024
Multi-scale GraphSAGE with class center balancing loss for rolling bearing fault diagnosis under extremely class imbalance
AbstractThe imbalance between normal and fault data in the condition monitoring of rotating machinery often leads to models needing more focus on the information from the majority class. To this end, this work proposed a rolling bearing fault diagnosis ...
- research-articleNovember 2024
RomeFS: A CXL-SSD Aware File System Exploiting Synergy of Memory-Block Dual Paths
SoCC '24: Proceedings of the 2024 ACM Symposium on Cloud ComputingPages 720–736https://doi.org/10.1145/3698038.3698539Compute eXpress Link (CXL) based Solid-State Drives (CXL-SSDs), such as the Samsung CMM-H model, promise to offer CXL.mem memory and CXL.io block dual-mode interfaces. Nonetheless, whether and how cloud applications with diverse and varying access ...
- research-articleNovember 2024
Beyond Belady to Attain a Seemingly Unattainable Byte Miss Ratio for Content Delivery Networks
IEEE Transactions on Parallel and Distributed Systems (TPDS), Volume 35, Issue 11Pages 1949–1963https://doi.org/10.1109/TPDS.2024.3452096Reducing the byte miss ratio (BMR) in the Content Delivery Network (CDN) caches can help providers save on the cost of paying for traffic. When evicting objects or files of different sizes in the caches of CDNs, it is no longer sufficient to pursue an ...
- research-articleJune 2024
Sum‐rate maximization for downlink multiuser MISO URLLC system aided by IRS with discrete phase shifters
AbstractIntelligent reflecting surface (IRS) has recently been considered as a potential technology for realizing ultra‐reliable and low‐latency (URLLC) in wireless networks. This paper proposes a resource optimization scheme to maximize the sum‐rate ...
-
FLOWS: Balanced MRC Profiling for Heterogeneous Object-Size Cache
EuroSys '24: Proceedings of the Nineteenth European Conference on Computer SystemsPages 421–440https://doi.org/10.1145/3627703.3650078While Miss Ratio Curve (MRC) profiling methods based on spatial sampling are effective in modeling cache behaviors, previous MRC studies lack in-depth analysis of profiling errors and primarily target homogeneous object-size scenarios. This has caused ...
- research-articleApril 2024
POPA: Expressing High and Portable Performance across Spatial and Vector Architectures for Tensor Computations
FPGA '24: Proceedings of the 2024 ACM/SIGDA International Symposium on Field Programmable Gate ArraysPages 199–210https://doi.org/10.1145/3626202.3637566This paper aims at high and portable performance for tensor computations across spatial (e.g., FPGAs) and vector architectures (e.g., GPUs). The state-of-the-art usually address performance portability across vector architectures (CPUs and GPUs). However,...
- research-articleApril 2024
An efficient SSSP algorithm on time-evolving graphs with prediction of computation results
Journal of Parallel and Distributed Computing (JPDC), Volume 186, Issue Chttps://doi.org/10.1016/j.jpdc.2023.104830AbstractMany applications need to execute Single-Source Shortest Paths (SSSP) algorithm on each snapshot of a time-evolving graph, leading to long waiting times experienced by the users of such applications. However, these applications are often time-...
Highlights- An efficient algorithm to compute SSSPs of the snapshots of time-evolving graph.
- A compact data structure of time-evolving graph to save the memory space and improve the snapshot access speed.
- An effective design to speed up the ...
- research-articleMarch 2024
Esophageal cancer detection framework based on time series information from smear images
Expert Systems with Applications: An International Journal (EXWA), Volume 238, Issue PFhttps://doi.org/10.1016/j.eswa.2023.122362AbstractThe gold standard for esophageal cancer diagnosis and treatment is the Thinprep Cytologic Test (TCT) of suspected sections. TCT refers to analyze the features of the lesion areas using tissue regions stained with hematoxylin and eosin. However, ...
Highlights- A time series-based quantitative framework is designed for esophageal grading.
- A coarse–fine model is designed to detect target cells for quantitative analysis.
- A quantitative analysis model is presented for cell ploidy rectifying.
- research-articleFebruary 2024
An improved social mimic optimization algorithm and its application in bearing fault diagnosis
Neural Computing and Applications (NCAA), Volume 36, Issue 13Pages 7295–7326https://doi.org/10.1007/s00521-024-09461-zAbstractAs a key component of rotating machinery, it is of great significance for the timely diagnosis of bearing weak faults. Stochastic resonance is widely used for its special signal enhancement pattern, and the combination of system parameters ...
FluidKV: Seamlessly Bridging the Gap between Indexing Performance and Memory-Footprint on Ultra-Fast Storage
Proceedings of the VLDB Endowment (PVLDB), Volume 17, Issue 6Pages 1377–1390https://doi.org/10.14778/3648160.3648177Our extensive experiments reveal that existing key-value stores (KVSs) achieve high performance at the expense of a huge memory footprint that is often impractical or unacceptable. Even with the emerging ultra-fast byte-addressable persistent memory (PM),...
- research-articleJanuary 2024
Explorations and Exploitation for Parity-based RAIDs with Ultra-fast SSDs
ACM Transactions on Storage (TOS), Volume 20, Issue 1Article No.: 6, Pages 1–32https://doi.org/10.1145/3627992Following a conventional design principle that pays more fast-CPU-cycles for fewer slow-I/Os, popular software storage architecture Linux Multiple-Disk (MD) for parity-based RAID (e.g., RAID5 and RAID6) assigns one or more centralized worker threads to ...
- research-articleJanuary 2024
A disk I/O optimized system for concurrent graph processing jobs
Frontiers of Computer Science: Selected Publications from Chinese Universities (FCS), Volume 18, Issue 3https://doi.org/10.1007/s11704-023-2361-0AbstractIn order to analyze and process the large graphs with high cost efficiency, researchers have developed a number of out-of-core graph processing systems in recent years based on just one commodity computer. On the other hand, with the rapidly ...
- research-articleJanuary 2024
Applying Delta Compression to Packed Datasets for Efficient Data Reduction
IEEE Transactions on Computers (ITCO), Volume 73, Issue 1Pages 73–85https://doi.org/10.1109/TC.2023.3318404Backup systems often adopt deduplication techniques for data reduction. Real-world backup products often group files into larger units (called packed files) before deduplicating them. The grouping entails inserting metadata immediately before the contents ...
- research-articleDecember 2023
Joint active and passive beamforming optimization for IRS-assisted downlink MISO-URLLC in max–min fairness
Wireless Networks (WIRE), Volume 30, Issue 3Pages 1479–1491https://doi.org/10.1007/s11276-023-03550-yAbstractIn this paper, we propose a max–min fairness optimization scheme for a downlink multiuser multiple-input single-output (MISO) ultra-reliable and low-latency communication (URLLC) system assisted by an intelligence reflecting surface (IRS). In ...
- research-articleMarch 2024
Dynamics Analysis of Large-Scale Transmission Tower-Line Coupled System under Measured Typhoon Load
ICITEE '23: Proceedings of the 6th International Conference on Information Technologies and Electrical EngineeringPages 90–96https://doi.org/10.1145/3640115.3640130The large-scale transmission tower-line system generates significant vibrations when subjected to typhoon loads. Severe vibrations can damage key components of the tower-line system and, in extreme cases, lead to the collapse of the entire tower-line ...
- rapid-communicationNovember 2023
PSWF-based decoupled atomic norm minimization for DOD and DOA estimation in MIMO radar with arbitrary linear arrays
AbstractDecoupled atomic norm minimization (D-ANM) is a computationally-efficient gridless two-dimensional parameter estimation method via dividing the two-level Toeplitz matrix into two one-level matrices, and can be applied to the one-...
- research-articleOctober 2023
User Disengagement-Oriented Target Enforcement for Multi-Tenant Database Systems
SoCC '23: Proceedings of the 2023 ACM Symposium on Cloud ComputingPages 394–409https://doi.org/10.1145/3620678.3624668Unexpected long query latency of a database system can cause domino effects on all the upstream services and severely degrade end users' experience with unpredicted long waits, resulting in an increasing number of users disengaged with the services and ...
- research-articleOctober 2023
An incremental learning approach for sustainable regional isolation and integration
Computers and Electrical Engineering (CENG), Volume 111, Issue PAhttps://doi.org/10.1016/j.compeleceng.2023.108911Highlight- An effective approach for class incremental learning.
- Simulate the human brain's ability to acquire new knowledge while integrating old knowledge.
- Isolate the learning environments of new knowledge and old knowledge to mitigate ...
Humans are capable of acquiring new knowledge on a constant basis, while integrating and optimizing old knowledge without forgetting them. This is mainly attributed to the human brain's ability of partitioned learning and memory replay. In this ...
Graphical abstractDisplay Omitted
- research-articleAugust 2023
A non-stationary channel prediction method for UAV communication network with error compensation
Engineering Applications of Artificial Intelligence (EAAI), Volume 123, Issue PAhttps://doi.org/10.1016/j.engappai.2023.106206AbstractIn an unmanned aerial vehicle (UAV) communication network, especially for mission-critical applications, ultra-reliable and low-latency communication (URLLC) of the control links has essential implications for realizing collision avoidance and ...