Export Citations
Save this search
Please login to be able to save your searches and receive alerts for new content matching your search criteria.
- research-articleNovember 2024
Sequential action-induced invariant representation for reinforcement learning
AbstractHow to accurately learn task-relevant state representations from high-dimensional observations with visual distractions is a realistic and challenging problem in visual reinforcement learning. Recently, unsupervised representation learning ...
- research-articleNovember 2023
Balancing exploration and exploitation in episodic reinforcement learning
Expert Systems with Applications: An International Journal (EXWA), Volume 231, Issue Chttps://doi.org/10.1016/j.eswa.2023.120801AbstractOne of the major challenges in reinforcement learning (RL) is its applications in episodic tasks, such as chess game, molecular structure design, healthcare, among others, where the rewards in such scenarios are usually sparse and can only be ...
Highlights- A method that can efficiently solve episodic tasks with only trajectory feedback.
- State-entropy based exploration and uniform reward redistribution trade-off.
- Outstanding experimental results in tasks with episodic reward on ...
- research-articleSeptember 2023
An Efficient Transfer Learning Method with Auxiliary Information
ACM Transactions on Knowledge Discovery from Data (TKDD), Volume 18, Issue 1Article No.: 22, Pages 1–23https://doi.org/10.1145/3612930Transfer learning (TL) is an information reuse learning tool, which can help us learn better classification effect than traditional single task learning, because transfer learning can share information within the task-to-task model. Most TL algorithms are ...
- research-articleAugust 2023
Root Cause Analysis for Microservice Systems via Hierarchical Reinforcement Learning from Human Feedback
- Lu Wang,
- Chaoyun Zhang,
- Ruomeng Ding,
- Yong Xu,
- Qihang Chen,
- Wentao Zou,
- Qingjun Chen,
- Meng Zhang,
- Xuedong Gao,
- Hao Fan,
- Saravan Rajmohan,
- Qingwei Lin,
- Dongmei Zhang
KDD '23: Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data MiningPages 5116–5125https://doi.org/10.1145/3580305.3599934In microservice systems, the identification of root causes of anomalies is imperative for service reliability and business impact. This process is typically divided into two phases: (i)constructing a service dependency graph that outlines the sequence ...
- research-articleMay 2023
Semi-supervised Multi-task Learning with Auxiliary data
Information Sciences: an International Journal (ISCI), Volume 626, Issue CPages 626–639https://doi.org/10.1016/j.ins.2023.02.091AbstractCompared with single-task learning, multi-tasks can obtain better classifiers by the information provided by each task. In the process of multi-task data collection, we always focus on the target task data in the training process, and ...
- research-articleMarch 2023
ABNDP: Co-optimizing Data Access and Load Balance in Near-Data Processing
ASPLOS 2023: Proceedings of the 28th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Volume 3Pages 3–17https://doi.org/10.1145/3582016.3582026Near-Data Processing (NDP) has been a promising architectural paradigm to address the memory wall challenge for data-intensive applications. Typical NDP systems based on 3D-stacked memories contain massive parallel processing units, each of which can ...
- research-articleSeptember 2022
Adaptive robust Adaboost-based twin support vector machine with universum data
Information Sciences: an International Journal (ISCI), Volume 609, Issue CPages 1334–1352https://doi.org/10.1016/j.ins.2022.07.155AbstractUniversum, as third class that does not belong to the positive class and negative class, allows to incorporate the prior knowledge into the learning process. A lot of reaserchers confirmed that Universum is helpful in the supervised ...
- research-articleSeptember 2022
A new self-paced learning method for privilege-based positive and unlabeled learning
Information Sciences: an International Journal (ISCI), Volume 609, Issue CPages 996–1009https://doi.org/10.1016/j.ins.2022.07.143AbstractPositive and unlabeled learning (PU learning) is a kind of problem whose goal is learning a two-classes classifier with little proportion of positive samples and numerous unlabeled samples. A series of studies focus on how to extract ...
- ArticleMay 2022
Hard Negative Sample Mining for Contrastive Representation in Reinforcement Learning
Advances in Knowledge Discovery and Data MiningPages 277–288https://doi.org/10.1007/978-3-031-05936-0_22AbstractIn recent years, contrastive learning has become an important technology of self-supervised representation learning and achieved SOTA performances in many fields, which has also gained increasing attention in the reinforcement learning (RL) ...
- research-articleFebruary 2022
FINGERS: exploiting fine-grained parallelism in graph mining accelerators
ASPLOS '22: Proceedings of the 27th ACM International Conference on Architectural Support for Programming Languages and Operating SystemsPages 43–55https://doi.org/10.1145/3503222.3507730Graph mining is an emerging application of high importance and also with high complexity, thus requiring efficient hardware acceleration. Current accelerator designs only utilize coarse-grained parallelism, leaving large room for further optimizations. ...
- research-articleDecember 2021
Gated multi-attention representation in reinforcement learning
AbstractDeep reinforcement learning (DRL) has achieved great success in recent years by combining the feature extraction power of deep learning and the decision power of reinforcement learning techniques. In the literature, Convolutional ...
Highlights- Gated multi-attention module is proposed to eliminate task-irrelevant attentions.
- research-articleAugust 2021
Automated Discovery of Geometric Theorems Based on Vector Equations
Journal of Automated Reasoning (JAUR), Volume 65, Issue 6Pages 711–726https://doi.org/10.1007/s10817-021-09591-2AbstractAutomated discovery of geometric theorems has attracted considerable attention from the research community. In this paper, a new method is proposed to discover geometric theorems automatically. This method first generates vector equations based on ...
- ArticleAugust 2020
Euge: Effective Utilization of GPU Resources for Serving DNN-Based Video Analysis
AbstractDeep Neural Network (DNN) has been widely adopted in video analysis application. The computation involved in DNN is more efficient on GPUs than on CPUs. However, recent serving systems involve the low utilization of GPU, due to limited process ...