- research-article, March 2025, JUST ACCEPTED
An Efficient Delta Compression Framework Seamlessly Integrated into Inline Deduplication
Delta compression can complement data deduplication by further minimizing redundancy through the compression of non-duplicate data chunks. When adding delta compression to deduplication-based backup systems, however, two primary challenges arise that ...
- research-article, January 2025
EMD empowered neural network for predicting spatio-temporal non-stationary channel in UAV communications
Abstract: This paper introduces a novel prediction method for spatio-temporal non-stationary channels between unmanned aerial vehicles (UAVs) and ground control vehicles, essential for the fast and accurate acquisition of channel state information (CSI) to ...
- research-article, February 2025
An immunohistochemical scoring network based on multi-branch and dual attention mechanisms for the evaluation of biomarker PCNA in esophageal cancer
Abstract: Immunohistochemical (IHC) detection is crucial for diagnosing esophageal cancer. Proliferating Cell Nuclear Antigen (PCNA) is a key biomarker in IHC analysis, aiding in tumor characterization and prognosis. However, manual scoring methods are ...
Highlights:
- PH-ScoreNet for the first time automates PCNA IHC scoring in esophageal cancer.
- Dual-branch network integrates nucleus detection with fine-grained classification.
- Dual attention mechanism enhances feature emphasis for improved ...
- research-article, January 2025
EMG-YOLO: An efficient fire detection model for embedded devices
Abstract: The number of edge embedded devices has been increasing with the development of Internet of Things (IoT) technology. In urban fire detection, improving the accuracy of fire detection based on embedded devices requires substantial computational ...
Highlights:
- The proposed method improves smoke and flame detection accuracy for non-uniform shapes and different scales.
- Two innovative modules are proposed to mitigate the problem of poor feature extraction for flame and smoke targets.
- ...
- research-article, December 2024
FastLoad: Speeding Up Data Loading of Both Sparse Matrix and Vector for SpMV on GPUs
IEEE Transactions on Parallel and Distributed Systems (TPDS), Volume 35, Issue 12, Pages 2423–2434
https://doi.org/10.1109/TPDS.2024.3477431
Sparse Matrix-Vector Multiplication (SpMV) on GPUs has gained significant attention because of SpMV's importance in modern applications and the increasing computing power of GPUs in the last decade. Previous studies have emphasized the importance ...
- research-article, November 2024
Multi-scale GraphSAGE with class center balancing loss for rolling bearing fault diagnosis under extremely class imbalance
Abstract: The imbalance between normal and fault data in the condition monitoring of rotating machinery often leads to models needing more focus on the information from the majority class. To this end, this work proposed a rolling bearing fault diagnosis ...
- research-article, February 2025
GM-YOLO: A Lightweight Small Target Detection Model
ICTCE '24: Proceedings of the 2024 6th International Conference on Telecommunications and Communication Engineering, Pages 1–7
https://doi.org/10.1145/3705391.3705392
Traditional target detection models struggle to maintain accuracy and improve inference speed for small target detection on computationally limited embedded devices. To solve this problem, this paper proposes an improved YOLOv5 model: GM-YOLO. First, the ...
- research-article, November 2024
NStore: A High-Performance NUMA-Aware Key-Value Store for Hybrid Memory
- Zhonghua Wang,
- Kai Lu,
- Jiguang Wan,
- Hong Jiang,
- Zeyang Zhao,
- Peng Xu,
- Biliang Lai,
- Guokuan Li,
- Changsheng Xie
IEEE Transactions on Computers (ITCO), Volume 74, Issue 3, Pages 929–943
https://doi.org/10.1109/TC.2024.3504269
Emerging persistent memory (PM) promises near-DRAM performance, larger capacity, and data persistence, attracting researchers to design PM-based key-value stores. However, existing PM-based key-value stores lack awareness of the Non-Uniform Memory Access (...
- research-article, November 2024
RomeFS: A CXL-SSD Aware File System Exploiting Synergy of Memory-Block Dual Paths
SoCC '24: Proceedings of the 2024 ACM Symposium on Cloud Computing, Pages 720–736
https://doi.org/10.1145/3698038.3698539
Compute eXpress Link (CXL) based Solid-State Drives (CXL-SSDs), such as the Samsung CMM-H model, promise to offer CXL.mem memory and CXL.io block dual-mode interfaces. Nonetheless, whether and how cloud applications with diverse and varying access ...
- research-article, November 2024
Beyond Belady to Attain a Seemingly Unattainable Byte Miss Ratio for Content Delivery Networks
IEEE Transactions on Parallel and Distributed Systems (TPDS), Volume 35, Issue 11, Pages 1949–1963
https://doi.org/10.1109/TPDS.2024.3452096
Reducing the byte miss ratio (BMR) in the Content Delivery Network (CDN) caches can help providers save on the cost of paying for traffic. When evicting objects or files of different sizes in the caches of CDNs, it is no longer sufficient to pursue an ...
- research-article, June 2024
Sum‐rate maximization for downlink multiuser MISO URLLC system aided by IRS with discrete phase shifters
Abstract: Intelligent reflecting surface (IRS) has recently been considered as a potential technology for realizing ultra‐reliable and low‐latency (URLLC) in wireless networks. This paper proposes a resource optimization scheme to maximize the sum‐rate ...
FLOWS: Balanced MRC Profiling for Heterogeneous Object-Size Cache
EuroSys '24: Proceedings of the Nineteenth European Conference on Computer Systems, Pages 421–440
https://doi.org/10.1145/3627703.3650078
While Miss Ratio Curve (MRC) profiling methods based on spatial sampling are effective in modeling cache behaviors, previous MRC studies lack in-depth analysis of profiling errors and primarily target homogeneous object-size scenarios. This has caused ...
- research-article, April 2024
POPA: Expressing High and Portable Performance across Spatial and Vector Architectures for Tensor Computations
FPGA '24: Proceedings of the 2024 ACM/SIGDA International Symposium on Field Programmable Gate Arrays, Pages 199–210
https://doi.org/10.1145/3626202.3637566
This paper aims at high and portable performance for tensor computations across spatial (e.g., FPGAs) and vector architectures (e.g., GPUs). The state-of-the-art usually address performance portability across vector architectures (CPUs and GPUs). However,...
- research-article, April 2024
An efficient SSSP algorithm on time-evolving graphs with prediction of computation results
Journal of Parallel and Distributed Computing (JPDC), Volume 186, Issue C
https://doi.org/10.1016/j.jpdc.2023.104830
Abstract: Many applications need to execute Single-Source Shortest Paths (SSSP) algorithm on each snapshot of a time-evolving graph, leading to long waiting times experienced by the users of such applications. However, these applications are often time-...
Highlights:
- An efficient algorithm to compute SSSPs of the snapshots of time-evolving graph.
- A compact data structure of time-evolving graph to save the memory space and improve the snapshot access speed.
- An effective design to speed up the ...
- research-article, March 2024
Esophageal cancer detection framework based on time series information from smear images
Expert Systems with Applications: An International Journal (EXWA), Volume 238, Issue PF
https://doi.org/10.1016/j.eswa.2023.122362
Abstract: The gold standard for esophageal cancer diagnosis and treatment is the Thinprep Cytologic Test (TCT) of suspected sections. TCT refers to analyze the features of the lesion areas using tissue regions stained with hematoxylin and eosin. However, ...
Highlights:
- A time series-based quantitative framework is designed for esophageal grading.
- A coarse–fine model is designed to detect target cells for quantitative analysis.
- A quantitative analysis model is presented for cell ploidy rectifying.
- research-article, February 2024
An improved social mimic optimization algorithm and its application in bearing fault diagnosis
Neural Computing and Applications (NCAA), Volume 36, Issue 13, Pages 7295–7326
https://doi.org/10.1007/s00521-024-09461-z
Abstract: As a key component of rotating machinery, it is of great significance for the timely diagnosis of bearing weak faults. Stochastic resonance is widely used for its special signal enhancement pattern, and the combination of system parameters ...
FluidKV: Seamlessly Bridging the Gap between Indexing Performance and Memory-Footprint on Ultra-Fast Storage
Proceedings of the VLDB Endowment (PVLDB), Volume 17, Issue 6, Pages 1377–1390
https://doi.org/10.14778/3648160.3648177
Our extensive experiments reveal that existing key-value stores (KVSs) achieve high performance at the expense of a huge memory footprint that is often impractical or unacceptable. Even with the emerging ultra-fast byte-addressable persistent memory (PM),...
- research-article, January 2024
Explorations and Exploitation for Parity-based RAIDs with Ultra-fast SSDs
ACM Transactions on Storage (TOS), Volume 20, Issue 1, Article No.: 6, Pages 1–32
https://doi.org/10.1145/3627992
Following a conventional design principle that pays more fast-CPU-cycles for fewer slow-I/Os, popular software storage architecture Linux Multiple-Disk (MD) for parity-based RAID (e.g., RAID5 and RAID6) assigns one or more centralized worker threads to ...
- research-article, January 2024
A disk I/O optimized system for concurrent graph processing jobs
Frontiers of Computer Science: Selected Publications from Chinese Universities (FCS), Volume 18, Issue 3
https://doi.org/10.1007/s11704-023-2361-0
Abstract: In order to analyze and process the large graphs with high cost efficiency, researchers have developed a number of out-of-core graph processing systems in recent years based on just one commodity computer. On the other hand, with the rapidly ...
- research-article, January 2024
Applying Delta Compression to Packed Datasets for Efficient Data Reduction
IEEE Transactions on Computers (ITCO), Volume 73, Issue 1, Pages 73–85
https://doi.org/10.1109/TC.2023.3318404
Backup systems often adopt deduplication techniques for data reduction. Real-world backup products often group files into larger units (called packed files) before deduplicating them. The grouping entails inserting metadata immediately before the contents ...