Author: Wang, Ziyue

research-article

A Pixel-Level Explainable Approach of Convolutional Neural Networks and Its Application

ASE '24: Proceedings of the 39th IEEE/ACM International Conference on Automated Software Engineering, Pages 2444–2445, https://doi.org/10.1145/3691620.3695320

Convolutional neural network (CNN) currently has been widely used to undertake the task of image classification. Unfortunately, a trained CNN model is a nonlinear system with high complexity, and the implicit decision knowledge carried by the CNN model ...

Article

Dynamic Pseudo Label Optimization in Point-Supervised Nuclei Segmentation

Medical Image Computing and Computer Assisted Intervention – MICCAI 2024, Pages 220–230, https://doi.org/10.1007/978-3-031-72111-3_21

Abstract

Deep learning has achieved impressive results in nuclei segmentation, but the massive requirement for pixel-wise labels remains a significant challenge. To alleviate the annotation burden, existing methods generate pseudo masks for model training ...

research-article

Covariance-Based Activity Detection in Cooperative Multi-Cell Massive MIMO: Scaling Law and Efficient Algorithms

IEEE Transactions on Information Theory (ITHR), Volume 70, Issue 12, Pages 8770–8790, https://doi.org/10.1109/TIT.2024.3470952

This paper focuses on the covariance-based activity detection problem in a multi-cell massive multiple-input multiple-output (MIMO) system. In this system, active devices transmit their signature sequences to multiple base stations (BSs), and the BSs ...

Article

Octopus: Embodied Vision-Language Programmer from Environmental Feedback

Computer Vision – ECCV 2024, Pages 20–38, https://doi.org/10.1007/978-3-031-73232-4_2

Abstract

Large vision-language models (VLMs) have achieved substantial progress in multimodal perception and reasoning. When integrated into an embodied agent, existing embodied VLM works either output detailed action sequences at the manipulation level or ...

research-article

SealMates: Improving Communication in Video Conferencing using a Collective Behavior-Driven Avatar

Proceedings of the ACM on Human-Computer Interaction (PACMHCI), Volume 8, Issue CSCW1, Article No.: 118, Pages 1–23, https://doi.org/10.1145/3637395

The limited nonverbal cues and spatially distributed nature of remote communication make it challenging for unacquainted members to be expressive during social interactions over video conferencing. Though it enables seeing others' facial expressions, the ...

research-article

Breaking the Mold: Exploring Innovative Strategies for Hosting the Olympic Games

ECPDC '24: Proceedings of the 2024 International Academic Conference on Edge Computing, Parallel and Distributed Computing, Pages 115–120, https://doi.org/10.1145/3677404.3677424

This study addresses the declining trend in the number of countries hosting the Olympics, exploring how to systematically evaluate the benefits of hosting and proposing innovative strategies to enhance sustainability and success. Using a TOPSIS model-...

research-article

Device Activity Detection in mMTC With Low-Resolution ADCs: A New Protocol

IEEE Transactions on Wireless Communications (TWC), Volume 23, Issue 6, Pages 5847–5862, https://doi.org/10.1109/TWC.2023.3328657

This paper investigates the effect of low-resolution analog-to-digital converters (ADCs) on device activity detection in massive machine-type communications (mMTC). The low-resolution ADCs induce two challenges on the device activity detection compared ...

Article

TiBERT: A Non-autoregressive Pre-trained Model for Text Editing

Natural Language Processing and Chinese Computing, Pages 15–26, https://doi.org/10.1007/978-3-031-44699-3_2

Abstract

Text editing refers to the task of creating new sentences by altering existing text through methods such as replacing, inserting, or deleting. Two commonly used techniques for text editing are Seq2Seq and sequence labeling. The Seq2Seq method can ...

Article

Multi-patch Adversarial Attack for Remote Sensing Image Classification

Web and Big Data, Pages 377–391, https://doi.org/10.1007/978-981-97-2303-4_25

Abstract

Deep Neural Networks (DNNs) have shown excellent image classification performance both in accuracy and efficiency. Therefore, it is of great value to deploy adversarial patch to protect critical facilities from DNNs-based scene classification in ...

research-article

A Demonstration of DLBD: Database Logic Bug Detection System

Proceedings of the VLDB Endowment (PVLDB), Volume 16, Issue 12, Pages 3914–3917, https://doi.org/10.14778/3611540.3611584

Database management systems (DBMSs) are prone to logic bugs that can result in incorrect query results. Current debugging tools are limited to single table queries and struggle with issues like lack of ground-truth results and repetitive query space ...

Article

POSTER: Collaborative Authority-Based Searchable Encryption Using Access Control Encryption

Applied Cryptography and Network Security Workshops, Pages 722–726, https://doi.org/10.1007/978-3-031-41181-6_47

Abstract

In this poster, we propose a novel searchable encryption (SE) scheme that leverages access control encryption to enhance the security and flexibility of search operations. Our approach requires collaborative authority users to authorize data users ...

research-article

Public Access

Dementia Eyes: Co-Design and Evaluation of a Dementia Education Augmented Reality Experience for Medical Workers

CHI '23: Proceedings of the 2023 CHI Conference on Human Factors in Computing Systems, Article No.: 778, Pages 1–18, https://doi.org/10.1145/3544548.3581009

Dementia describes a syndrome of cognitive degeneration, and Behavioural and Psychological Symptoms of Dementia (BPSD) is the non-cognitive symptom. BPSD can be improved by care services. To aid better care service, we explore the potential of using ...

research-article

Public Access

Affective Umbrella – A Wearable System to Visualize Heart and Electrodermal Activity, towards Emotion Regulation through Somaesthetic Appreciation

AHs '23: Proceedings of the Augmented Humans International Conference 2023, Pages 231–242, https://doi.org/10.1145/3582700.3582727

In this paper, we introduce Affective Umbrella, a novel system to record, analyze and visualize physiological data in real time via an umbrella handle. We implement a biofeedback loop design in the system that triggers visualization changes to reflect ...

research-article

Imperceptible adversarial attack via invertible neural networks

AAAI'23/IAAI'23/EAAI'23: Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence and Thirty-Fifth Conference on Innovative Applications of Artificial Intelligence and Thirteenth Symposium on Educational Advances in Artificial Intelligence, Article No.: 46, Pages 414–424, https://doi.org/10.1609/aaai.v37i1.25115

Adding perturbations via utilizing auxiliary gradient information or discarding existing details of the benign images are two common approaches for generating adversarial examples. Though visual imperceptibility is the desired property of adversarial ...

research-article

Free

Tractable and near-optimal adversarial algorithms for robust estimation in contaminated Gaussian models

The Journal of Machine Learning Research (JMLR), Volume 24, Issue 1, Article No.: 235, Pages 11063–11174

Consider the problem of simultaneous estimation of location and variance matrix under Huber's contaminated Gaussian model. First, we study minimum f-divergence estimation at the population level, corresponding to a generative adversarial method with a ...

research-article

CNN‐ and GAN‐based classification of malicious code families: A code visualization approach

International Journal of Intelligent Systems (IJIS), Volume 37, Issue 12, Pages 12472–12489, https://doi.org/10.1002/int.23094

Abstract

Malicious code attacks have severely hindered the current development of the Internet technologies. Once the devices are infected with virus, the damages to companies and users are unpredictable. Although researchers have developed malware ...

Article

Lightweight Attentional Feature Fusion: A New Baseline for Text-to-Video Retrieval

Computer Vision – ECCV 2022, Pages 444–461, https://doi.org/10.1007/978-3-031-19781-9_26

Abstract

In this paper we revisit feature fusion, an old-fashioned topic, in the new context of text-to-video retrieval. Different from previous research that considers feature fusion only at one end, let it be video or text, we aim for feature fusion for ...

research-article

Learn to Understand Negation in Video Retrieval

MM '22: Proceedings of the 30th ACM International Conference on Multimedia, Pages 434–443, https://doi.org/10.1145/3503161.3547968

Negation is a common linguistic skill that allows human to express what we do NOT want. Naturally, one might expect video retrieval to support natural-language queries with negation, e.g., finding shots of kids sitting on the floor and not playing with ...

Article

Multi-task RetinaNet for Mitosis Detection

Mitosis Domain Generalization and Diabetic Retinopathy Analysis, Pages 234–240, https://doi.org/10.1007/978-3-031-33658-4_25

Abstract

The count of mitotic cells is a key feature in tumor diagnosis. However, due to the variability of mitotic cell morphology, detecting mitotic cells in tumor tissues is a highly challenging task. At the same time, the performance of the trained ...

abstract

ImageFlowing-Enhance Emotional Expression by Reproducing the Vital Signs of the Photographer

SIGGRAPH '22: ACM SIGGRAPH 2022 Emerging Technologies, Article No.: 5, Pages 1–2, https://doi.org/10.1145/3532721.3535565

ImageFlowing is a ‘living’ photograph that reproduces the biometric signs of the photographer. Viewers can feel how the photographer felt through photographer’s breathing, heartbeats and skin temperature. We extend a two-dimensional picture into a multi-...
