Author: Chen, Qihang : Search

research-article

Sequential action-induced invariant representation for reinforcement learning

Neural Networks (NENE), Volume 179, Issue Chttps://doi.org/10.1016/j.neunet.2024.106579

Abstract

How to accurately learn task-relevant state representations from high-dimensional observations with visual distractions is a realistic and challenging problem in visual reinforcement learning. Recently, unsupervised representation learning ...

research-article

Balancing exploration and exploitation in episodic reinforcement learning

Expert Systems with Applications: An International Journal (EXWA), Volume 231, Issue Chttps://doi.org/10.1016/j.eswa.2023.120801

Abstract

One of the major challenges in reinforcement learning (RL) is its applications in episodic tasks, such as chess game, molecular structure design, healthcare, among others, where the rewards in such scenarios are usually sparse and can only be ...

Highlights

A method that can efficiently solve episodic tasks with only trajectory feedback.
State-entropy based exploration and uniform reward redistribution trade-off.
Outstanding experimental results in tasks with episodic reward on ...

research-article

An Efficient Transfer Learning Method with Auxiliary Information

ACM Transactions on Knowledge Discovery from Data (TKDD), Volume 18, Issue 1Article No.: 22, Pages 1–23https://doi.org/10.1145/3612930

Transfer learning (TL) is an information reuse learning tool, which can help us learn better classification effect than traditional single task learning, because transfer learning can share information within the task-to-task model. Most TL algorithms are ...

research-article

Free

Root Cause Analysis for Microservice Systems via Hierarchical Reinforcement Learning from Human Feedback

KDD '23: Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data MiningPages 5116–5125https://doi.org/10.1145/3580305.3599934

In microservice systems, the identification of root causes of anomalies is imperative for service reliability and business impact. This process is typically divided into two phases: (i)constructing a service dependency graph that outlines the sequence ...

research-article

Semi-supervised Multi-task Learning with Auxiliary data

Information Sciences: an International Journal (ISCI), Volume 626, Issue CPages 626–639https://doi.org/10.1016/j.ins.2023.02.091

Abstract

Compared with single-task learning, multi-tasks can obtain better classifiers by the information provided by each task. In the process of multi-task data collection, we always focus on the target task data in the training process, and ...

research-article

Open Access

ABNDP: Co-optimizing Data Access and Load Balance in Near-Data Processing

ASPLOS 2023: Proceedings of the 28th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Volume 3Pages 3–17https://doi.org/10.1145/3582016.3582026

Near-Data Processing (NDP) has been a promising architectural paradigm to address the memory wall challenge for data-intensive applications. Typical NDP systems based on 3D-stacked memories contain massive parallel processing units, each of which can ...

research-article

Adaptive robust Adaboost-based twin support vector machine with universum data

Information Sciences: an International Journal (ISCI), Volume 609, Issue CPages 1334–1352https://doi.org/10.1016/j.ins.2022.07.155

Abstract

Universum, as third class that does not belong to the positive class and negative class, allows to incorporate the prior knowledge into the learning process. A lot of reaserchers confirmed that Universum is helpful in the supervised ...

research-article

A new self-paced learning method for privilege-based positive and unlabeled learning

Information Sciences: an International Journal (ISCI), Volume 609, Issue CPages 996–1009https://doi.org/10.1016/j.ins.2022.07.143

Abstract

Positive and unlabeled learning (PU learning) is a kind of problem whose goal is learning a two-classes classifier with little proportion of positive samples and numerous unlabeled samples. A series of studies focus on how to extract ...

Article

Hard Negative Sample Mining for Contrastive Representation in Reinforcement Learning

Advances in Knowledge Discovery and Data MiningPages 277–288https://doi.org/10.1007/978-3-031-05936-0_22

Abstract

In recent years, contrastive learning has become an important technology of self-supervised representation learning and achieved SOTA performances in many fields, which has also gained increasing attention in the reinforcement learning (RL) ...

research-article

Open Access

FINGERS: exploiting fine-grained parallelism in graph mining accelerators

ASPLOS '22: Proceedings of the 27th ACM International Conference on Architectural Support for Programming Languages and Operating SystemsPages 43–55https://doi.org/10.1145/3503222.3507730

Graph mining is an emerging application of high importance and also with high complexity, thus requiring efficient hardware acceleration. Current accelerator designs only utilize coarse-grained parallelism, leaving large room for further optimizations. ...

research-article

Gated multi-attention representation in reinforcement learning

Knowledge-Based Systems (KNBS), Volume 233, Issue Chttps://doi.org/10.1016/j.knosys.2021.107535

Abstract

Deep reinforcement learning (DRL) has achieved great success in recent years by combining the feature extraction power of deep learning and the decision power of reinforcement learning techniques. In the literature, Convolutional ...

Highlights

Gated multi-attention module is proposed to eliminate task-irrelevant attentions.

research-article

Automated Discovery of Geometric Theorems Based on Vector Equations

Journal of Automated Reasoning (JAUR), Volume 65, Issue 6Pages 711–726https://doi.org/10.1007/s10817-021-09591-2

Abstract

Automated discovery of geometric theorems has attracted considerable attention from the research community. In this paper, a new method is proposed to discover geometric theorems automatically. This method first generates vector equations based on ...

Article

Euge: Effective Utilization of GPU Resources for Serving DNN-Based Video Analysis

Web and Big DataPages 523–528https://doi.org/10.1007/978-3-030-60290-1_40

Abstract

Deep Neural Network (DNN) has been widely adopted in video analysis application. The computation involved in DNN is more efficient on GPUs than on CPUs. However, recent serving systems involve the low utilization of GPU, due to limited process ...

Search Results

Applied Filters

People

Names

Institutions

Authors

Publications

Journal/Magazine Names

Proceedings/Book Names

All Publications

Content Type

Supplemental Material Type

Media Formats

Publisher

Conferences

Sponsors

Conference Event

Proceedings Series

Publication Date

Caption

Sequential action-induced invariant representation for reinforcement learning

Balancing exploration and exploitation in episodic reinforcement learning

An Efficient Transfer Learning Method with Auxiliary Information

Root Cause Analysis for Microservice Systems via Hierarchical Reinforcement Learning from Human Feedback

Semi-supervised Multi-task Learning with Auxiliary data

ABNDP: Co-optimizing Data Access and Load Balance in Near-Data Processing

Adaptive robust Adaboost-based twin support vector machine with universum data

A new self-paced learning method for privilege-based positive and unlabeled learning

Hard Negative Sample Mining for Contrastive Representation in Reinforcement Learning

FINGERS: exploiting fine-grained parallelism in graph mining accelerators

Gated multi-attention representation in reinforcement learning

Automated Discovery of Geometric Theorems Based on Vector Equations

Euge: Effective Utilization of GPU Resources for Serving DNN-Based Video Analysis

Applied Filters

People

Names

Institutions

Authors

Publications

Journal/Magazine Names

Proceedings/Book Names

All Publications

Content Type

Supplemental Material Type

Media Formats

Publisher

Conferences

Sponsors

Conference Event

Proceedings Series

Publication Date

Save to Binder