- research-article, February 2025
PSRR-MaxpoolNMS++: Fast Non-Maximum Suppression With Discretization and Pooling
IEEE Transactions on Pattern Analysis and Machine Intelligence (ITPM), Volume 47, Issue 2, Pages 978–993, https://doi.org/10.1109/TPAMI.2024.3485898
Non-maximum suppression (NMS) is an essential post-processing step for object detection. The de-facto standard for NMS, namely GreedyNMS, is not parallelizable and could thus be the performance bottleneck in object detection pipelines. MaxpoolNMS is ...
- research-article, January 2025
TableGPT: a novel table understanding method based on table recognition and large language model collaborative enhancement
Abstract: In today's information age, table images play a crucial role in storing structured information, making table image recognition technology an essential component in many fields. However, accurately recognizing the structure and text content of ...
- research-article, December 2024
Insights into Natural Language Database Query Errors: from Attention Misalignment to User Handling Strategies
ACM Transactions on Interactive Intelligent Systems (TIIS), Volume 14, Issue 4, Article No.: 25, Pages 1–32, https://doi.org/10.1145/3650114
Querying structured databases with natural language (NL2SQL) has remained a difficult problem for years. Recently, the advancement of machine learning (ML), natural language processing (NLP), and large language models (LLM) have led to significant ...
- research-article, December 2024
AutoJournaling: A Context-Aware Journaling System Leveraging MLLMs on Smartphone Screenshots
ACM MobiCom '24: Proceedings of the 30th Annual International Conference on Mobile Computing and Networking, Pages 2347–2352, https://doi.org/10.1145/3636534.3698122
Journaling offers significant benefits, including fostering self-reflection, enhancing writing skills, and aiding in mood monitoring. However, many people abandon the practice because traditional journaling is time-consuming, and detailed life events may ...
- research-article, December 2024
AraSync: Precision Time Synchronization in Rural Wireless Living Lab
- Md Nadim,
- Taimoor Ul Islam,
- Salil Reddy,
- Tianyi Zhang,
- Zhibo Meng,
- Reshal Afzal,
- Sarath Babu,
- Arsalan Ahmad,
- Daji Qiao,
- Anish Arora,
- Hongwei Zhang
ACM MobiCom '24: Proceedings of the 30th Annual International Conference on Mobile Computing and Networking, Pages 1898–1905, https://doi.org/10.1145/3636534.3697318
Time synchronization is a critical component in network operation and management, and it is also required by Ultra-Reliable, Low-Latency Communications (URLLC) in next-generation wireless systems such as those of 5G, 6G, and Open RAN. In this context, we ...
- research-article, November 2024
Streaming Quanta Sensors for Online, High-Performance Imaging and Vision
IEEE Transactions on Pattern Analysis and Machine Intelligence (ITPM), Volume 47, Issue 3, Pages 1564–1577, https://doi.org/10.1109/TPAMI.2024.3501154
Recently quanta image sensors (QIS) – ultra-fast, zero-read-noise binary image sensors – have demonstrated remarkable imaging capabilities in many challenging scenarios. Despite their potential, the adoption of these sensors is severely ...
- research-article, October 2024
Proof Automation with Large Language Models
ASE '24: Proceedings of the 39th IEEE/ACM International Conference on Automated Software Engineering, Pages 1509–1520, https://doi.org/10.1145/3691620.3695521
Interactive theorem provers such as Coq are powerful tools to formally guarantee the correctness of software. However, using these tools requires significant manual effort and expertise. While Large Language Models (LLMs) have shown promise in ...
- research-article, October 2024
SQLucid: Grounding Natural Language Database Queries with Interactive Explanations
UIST '24: Proceedings of the 37th Annual ACM Symposium on User Interface Software and Technology, Article No.: 12, Pages 1–20, https://doi.org/10.1145/3654777.3676368
Though recent advances in machine learning have led to significant improvements in natural language interfaces for databases, the accuracy and reliability of these systems remain limited, especially in high-stakes domains. This paper introduces SQLucid, ...
- Article, October 2024
Generating Progressive Images from Pathological Transitions Via Diffusion Model
Medical Image Computing and Computer Assisted Intervention – MICCAI 2024, Pages 308–318, https://doi.org/10.1007/978-3-031-72120-5_29
Abstract: Pathological image analysis is a crucial field in deep learning applications. However, training effective models demands large-scale annotated data, which faces challenges due to sampling and annotation scarcity. The rapid developing generative ...
- research-article, October 2024
Predicting Affective States from Screen Text Sentiment
UbiComp '24: Companion of the 2024 on ACM International Joint Conference on Pervasive and Ubiquitous Computing, Pages 384–390, https://doi.org/10.1145/3675094.3678489
The proliferation of mobile sensing technologies has enabled the study of various physiological and behavioural phenomena through unobtrusive data collection from smartphone sensors. This approach offers real-time insights into individuals' physical and ...
- research-article, October 2024
Leveraging LLMs to Predict Affective States via Smartphone Sensor Features
UbiComp '24: Companion of the 2024 on ACM International Joint Conference on Pervasive and Ubiquitous Computing, Pages 709–716, https://doi.org/10.1145/3675094.3678420
As mental health issues for young adults present a pressing public health concern, daily digital mood monitoring for early detection has become an important prospect. An active research area, digital phenotyping, involves collecting and analysing data ...
- research-article, October 2024
Joint Task Offloading and Content Caching for NOMA-Aided Cloud-Edge-Terminal Cooperation Networks
IEEE Transactions on Wireless Communications (TWC), Volume 23, Issue 10, Part 3, Pages 15586–15600, https://doi.org/10.1109/TWC.2024.3432150
To satisfy the requirements of content distribution in computation-intensive and delay-sensitive services, this paper presents a novel joint task offloading and content caching (JTOCC) scheme in multi-cell multi-carrier non-orthogonal multiple-access (...
- Article, October 2024
Robust Incremental Structure-from-Motion with Hybrid Features
- Shaohui Liu,
- Yidan Gao,
- Tianyi Zhang,
- Rémi Pautrat,
- Johannes L. Schönberger,
- Viktor Larsson,
- Marc Pollefeys
Abstract: Structure-from-Motion (SfM) has become a ubiquitous tool for camera calibration and scene reconstruction with many downstream applications in computer vision and beyond. While the state-of-the-art SfM pipelines have reached a high level of ...
- research-article, December 2024
YOLOv5s infrared small target detection model with improved feature pyramid and fused attention mechanism
IPMLP '24: Proceedings of the International Conference on Image Processing, Machine Learning and Pattern Recognition, Pages 287–291, https://doi.org/10.1145/3700906.3700953
With the development of deep learning, thermal infrared target detection has a wide range of applications in various fields. However, most existing deep learning models have high computational complexity, sensitivity to hyperparameters, low timeliness, ...
Automated Deep Learning Optimization via DSL-Based Source Code Transformation
ISSTA 2024: Proceedings of the 33rd ACM SIGSOFT International Symposium on Software Testing and Analysis, Pages 479–490, https://doi.org/10.1145/3650212.3652143
As deep learning models become increasingly bigger and more complex, it is critical to improve model training and inference efficiency. Though a variety of highly optimized libraries and packages (known as DL kernels) have been developed, it is tedious ...
- Article, September 2024
GraphRPM: Risk Pattern Mining on Industrial Large Attributed Graphs
- Sheng Tian,
- Xintan Zeng,
- Yifei Hu,
- Baokun Wang,
- Yongchao Liu,
- Yue Jin,
- Changhua Meng,
- Chuntao Hong,
- Tianyi Zhang,
- Weiqiang Wang
Machine Learning and Knowledge Discovery in Databases. Applied Data Science Track, Pages 133–149, https://doi.org/10.1007/978-3-031-70381-2_9
Abstract: Graph-based patterns are extensively employed and favored by practitioners within industrial companies due to their capacity to represent the behavioral attributes and topological relationships among users, thereby offering enhanced ...
- Article, November 2024
Going Beyond Passages: Readability Assessment for Book-Level Long Texts
Abstract: Readability assessment for book-level long text is widely needed in real educational applications. However, most of the current researches focus on passage-level readability assessment and little work has been done to process ultra-long texts. In ...
- research-article, July 2024
GROUPCOVER: a secure, efficient and scalable inference framework for on-device model protection based on TEEs
ICML'24: Proceedings of the 41st International Conference on Machine Learning, Article No.: 2480, Pages 59992–60003
Due to the high cost of training DNN models, how to protect the intellectual property of DNN models, especially when the models are deployed to users' devices, is becoming an important topic. One practical solution is to use Trusted Execution ...
- research-article, July 2024
Do Large Language Models Pay Similar Attention Like Human Programmers When Generating Code?
Proceedings of the ACM on Software Engineering (PACMSE), Volume 1, Issue FSE, Article No.: 100, Pages 2261–2284, https://doi.org/10.1145/3660807
Large Language Models (LLMs) have recently been widely used for code generation. Due to the complexity and opacity of LLMs, little is known about how these models generate code. We made the first attempt to bridge this knowledge gap by investigating ...