- research-article, October 2024
A Pixel-Level Explainable Approach of Convolutional Neural Networks and Its Application
ASE '24: Proceedings of the 39th IEEE/ACM International Conference on Automated Software Engineering, Pages 2444–2445, https://doi.org/10.1145/3691620.3695320
Convolutional neural network (CNN) currently has been widely used to undertake the task of image classification. Unfortunately, a trained CNN model is a nonlinear system with high complexity, and the implicit decision knowledge carried by the CNN model ...
- Article, October 2024
Dynamic Pseudo Label Optimization in Point-Supervised Nuclei Segmentation
Medical Image Computing and Computer Assisted Intervention – MICCAI 2024, Pages 220–230, https://doi.org/10.1007/978-3-031-72111-3_21
Abstract: Deep learning has achieved impressive results in nuclei segmentation, but the massive requirement for pixel-wise labels remains a significant challenge. To alleviate the annotation burden, existing methods generate pseudo masks for model training ...
- research-article, September 2024
Covariance-Based Activity Detection in Cooperative Multi-Cell Massive MIMO: Scaling Law and Efficient Algorithms
IEEE Transactions on Information Theory (ITHR), Volume 70, Issue 12, Pages 8770–8790, https://doi.org/10.1109/TIT.2024.3470952
This paper focuses on the covariance-based activity detection problem in a multi-cell massive multiple-input multiple-output (MIMO) system. In this system, active devices transmit their signature sequences to multiple base stations (BSs), and the BSs ...
- Article, September 2024
Octopus: Embodied Vision-Language Programmer from Environmental Feedback
- Jingkang Yang,
- Yuhao Dong,
- Shuai Liu,
- Bo Li,
- Ziyue Wang,
- Haoran Tan,
- Chencheng Jiang,
- Jiamu Kang,
- Yuanhan Zhang,
- Kaiyang Zhou,
- Ziwei Liu
Abstract: Large vision-language models (VLMs) have achieved substantial progress in multimodal perception and reasoning. When integrated into an embodied agent, existing embodied VLM works either output detailed action sequences at the manipulation level or ...
- research-article, April 2024
SealMates: Improving Communication in Video Conferencing using a Collective Behavior-Driven Avatar
- Mark Armstrong,
- Chi-Lan Yang,
- Kinga Skiers,
- Mengzhen Lim,
- Tamil Selvan Gunasekaran,
- Ziyue Wang,
- Takuji Narumi,
- Kouta Minamizawa,
- Yun Suen Pai
Proceedings of the ACM on Human-Computer Interaction (PACMHCI), Volume 8, Issue CSCW1, Article No.: 118, Pages 1–23, https://doi.org/10.1145/3637395
The limited nonverbal cues and spatially distributed nature of remote communication make it challenging for unacquainted members to be expressive during social interactions over video conferencing. Though it enables seeing others' facial expressions, the ...
- research-article, August 2024
Breaking the Mold: Exploring Innovative Strategies for Hosting the Olympic Games
ECPDC '24: Proceedings of the 2024 International Academic Conference on Edge Computing, Parallel and Distributed Computing, Pages 115–120, https://doi.org/10.1145/3677404.3677424
This study addresses the declining trend in the number of countries hosting the Olympics, exploring how to systematically evaluate the benefits of hosting and proposing innovative strategies to enhance sustainability and success. Using a TOPSIS model-...
- research-article, November 2023
Device Activity Detection in mMTC With Low-Resolution ADCs: A New Protocol
IEEE Transactions on Wireless Communications (TWC), Volume 23, Issue 6, Pages 5847–5862, https://doi.org/10.1109/TWC.2023.3328657
This paper investigates the effect of low-resolution analog-to-digital converters (ADCs) on device activity detection in massive machine-type communications (mMTC). The low-resolution ADCs induce two challenges on the device activity detection compared ...
- Article, October 2023
TiBERT: A Non-autoregressive Pre-trained Model for Text Editing
Natural Language Processing and Chinese Computing, Pages 15–26, https://doi.org/10.1007/978-3-031-44699-3_2
Abstract: Text editing refers to the task of creating new sentences by altering existing text through methods such as replacing, inserting, or deleting. Two commonly used techniques for text editing are Seq2Seq and sequence labeling. The Seq2Seq method can ...
- Article, May 2024
Multi-patch Adversarial Attack for Remote Sensing Image Classification
Abstract: Deep Neural Networks (DNNs) have shown excellent image classification performance both in accuracy and efficiency. Therefore, it is of great value to deploy adversarial patch to protect critical facilities from DNNs-based scene classification in ...
A Demonstration of DLBD: Database Logic Bug Detection System
Proceedings of the VLDB Endowment (PVLDB), Volume 16, Issue 12, Pages 3914–3917, https://doi.org/10.14778/3611540.3611584
Database management systems (DBMSs) are prone to logic bugs that can result in incorrect query results. Current debugging tools are limited to single table queries and struggle with issues like lack of ground-truth results and repetitive query space ...
- Article, October 2023
POSTER: Collaborative Authority-Based Searchable Encryption Using Access Control Encryption
Applied Cryptography and Network Security Workshops, Pages 722–726, https://doi.org/10.1007/978-3-031-41181-6_47
Abstract: In this poster, we propose a novel searchable encryption (SE) scheme that leverages access control encryption to enhance the security and flexibility of search operations. Our approach requires collaborative authority users to authorize data users ...
- research-article, April 2023
Dementia Eyes: Co-Design and Evaluation of a Dementia Education Augmented Reality Experience for Medical Workers
- Ximing Shen,
- Yun Suen Pai,
- Dai Kiuchi,
- Kehan Bao,
- Tomomi Aoki,
- Hikari Meguro,
- Kanoko Oishi,
- Ziyue Wang,
- Sohei Wakisaka,
- Kouta Minamizawa
CHI '23: Proceedings of the 2023 CHI Conference on Human Factors in Computing Systems, Article No.: 778, Pages 1–18, https://doi.org/10.1145/3544548.3581009
Dementia describes a syndrome of cognitive degeneration, and Behavioural and Psychological Symptoms of Dementia (BPSD) is the non-cognitive symptom. BPSD can be improved by care services. To aid better care service, we explore the potential of using ...
- research-article, March 2023
Affective Umbrella – A Wearable System to Visualize Heart and Electrodermal Activity, towards Emotion Regulation through Somaesthetic Appreciation
AHs '23: Proceedings of the Augmented Humans International Conference 2023, Pages 231–242, https://doi.org/10.1145/3582700.3582727
In this paper, we introduce Affective Umbrella, a novel system to record, analyze and visualize physiological data in real time via an umbrella handle. We implement a biofeedback loop design in the system that triggers visualization changes to reflect ...
- research-article, February 2023
Imperceptible adversarial attack via invertible neural networks
AAAI'23/IAAI'23/EAAI'23: Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence and Thirty-Fifth Conference on Innovative Applications of Artificial Intelligence and Thirteenth Symposium on Educational Advances in Artificial Intelligence, Article No.: 46, Pages 414–424, https://doi.org/10.1609/aaai.v37i1.25115
Adding perturbations via utilizing auxiliary gradient information or discarding existing details of the benign images are two common approaches for generating adversarial examples. Though visual imperceptibility is the desired property of adversarial ...
- research-article, March 2024
Tractable and near-optimal adversarial algorithms for robust estimation in contaminated Gaussian models
The Journal of Machine Learning Research (JMLR), Volume 24, Issue 1, Article No.: 235, Pages 11063–11174
Consider the problem of simultaneous estimation of location and variance matrix under Huber's contaminated Gaussian model. First, we study minimum f-divergence estimation at the population level, corresponding to a generative adversarial method with a ...
- research-article, December 2022
CNN‐ and GAN‐based classification of malicious code families: A code visualization approach
International Journal of Intelligent Systems (IJIS), Volume 37, Issue 12, Pages 12472–12489, https://doi.org/10.1002/int.23094
Abstract: Malicious code attacks have severely hindered the current development of the Internet technologies. Once the devices are infected with virus, the damages to companies and users are unpredictable. Although researchers have developed malware ...
- Article, October 2022
Lightweight Attentional Feature Fusion: A New Baseline for Text-to-Video Retrieval
Abstract: In this paper we revisit feature fusion, an old-fashioned topic, in the new context of text-to-video retrieval. Different from previous research that considers feature fusion only at one end, let it be video or text, we aim for feature fusion for ...
- research-article, October 2022
Learn to Understand Negation in Video Retrieval
MM '22: Proceedings of the 30th ACM International Conference on Multimedia, Pages 434–443, https://doi.org/10.1145/3503161.3547968
Negation is a common linguistic skill that allows human to express what we do NOT want. Naturally, one might expect video retrieval to support natural-language queries with negation, e.g., finding shots of kids sitting on the floor and not playing with ...
- Article, May 2023
Multi-task RetinaNet for Mitosis Detection
Mitosis Domain Generalization and Diabetic Retinopathy Analysis, Pages 234–240, https://doi.org/10.1007/978-3-031-33658-4_25
Abstract: The count of mitotic cells is a key feature in tumor diagnosis. However, due to the variability of mitotic cell morphology, detecting mitotic cells in tumor tissues is a highly challenging task. At the same time, the performance of the trained ...
- abstract, July 2022
ImageFlowing-Enhance Emotional Expression by Reproducing the Vital Signs of the Photographer
- Qianqian Mu,
- George Chernyshov,
- Ziyue Wang,
- Danny Hynds,
- Dingding Zheng,
- Kouta Minamizawa,
- Dunya Chen,
- Atsuro Ueki,
- Masa Inakage,
- Kai Kunze
SIGGRAPH '22: ACM SIGGRAPH 2022 Emerging Technologies, Article No.: 5, Pages 1–2, https://doi.org/10.1145/3532721.3535565
ImageFlowing is a ‘living’ photograph that reproduces the biometric signs of the photographer. Viewers can feel how the photographer felt through photographer’s breathing, heartbeats and skin temperature. We extend a two-dimensional picture into a multi-...