Export Citations
Save this search
Please login to be able to save your searches and receive alerts for new content matching your search criteria.
- research-articleDecember 2024JUST ACCEPTED
Toward Heterogeneous Graph-based Imitation Learning for Autonomous Driving Simulation: Interaction Awareness and Hierarchical Explainability
ACM Journal on Autonomous Transportation Systems (JATS), Just Accepted https://doi.org/10.1145/3708354Understanding and learning the actor-to-X interactions (AXIs), such as those between the focal vehicles (actor) and other traffic participants, such as other vehicles and pedestrians, as well as traffic environments like the city or road map, is essential ...
- research-articleNovember 2024
CBIL: Collective Behavior Imitation Learning for Fish from Real Videos
ACM Transactions on Graphics (TOG), Volume 43, Issue 6Article No.: 242, Pages 1–17https://doi.org/10.1145/3687904Reproducing realistic collective behaviors presents a captivating yet formidable challenge. Traditional rule-based methods rely on hand-crafted principles, limiting motion diversity and realism in generated collective behaviors. Recent imitation learning ...
- research-articleOctober 2024
PhysReaction: Physically Plausible Real-Time Humanoid Reaction Synthesis via Forward Dynamics Guided 4D Imitation
MM '24: Proceedings of the 32nd ACM International Conference on MultimediaPages 3771–3780https://doi.org/10.1145/3664647.3680636Humanoid Reaction Synthesis is pivotal for creating highly interactive and empathetic robots that can seamlessly integrate into human environments, enhancing the way we live, work, and communicate. However, it is difficult to learn the diverse ...
- research-articleOctober 2024
COIN: Chance-Constrained Imitation Learning for Safe and Adaptive Resource Oversubscription under Uncertainty
- Lu Wang,
- Mayukh Das,
- Fangkai Yang,
- Chao Du,
- Bo Qiao,
- Hang Dong,
- Chetan Bansal,
- Si Qin,
- Saravan Rajmohan,
- Qingwei Lin,
- Dongmei Zhang,
- Qi Zhang
CIKM '24: Proceedings of the 33rd ACM International Conference on Information and Knowledge ManagementPages 4939–4947https://doi.org/10.1145/3627673.3680060We address the real problem of safe, robust, adaptive resource oversubscription in uncertain environments with our proposed novel technique of chance-constrained imitation learning. Our objective is to enhance resource efficiency while ensuring safety ...
- research-articleAugust 2024
Unified Learning from Demonstrations, Corrections, and Preferences during Physical Human–Robot Interaction
ACM Transactions on Human-Robot Interaction (THRI), Volume 13, Issue 3Article No.: 39, Pages 1–25https://doi.org/10.1145/3623384Humans can leverage physical interaction to teach robot arms. This physical interaction takes multiple forms depending on the task, the user, and what the robot has learned so far. State-of-the-art approaches focus on learning from a single modality, or ...
-
- research-articleAugust 2024
PAIL: Performance based Adversarial Imitation Learning Engine for Carbon Neutral Optimization
KDD '24: Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data MiningPages 6148–6157https://doi.org/10.1145/3637528.3671611Achieving carbon neutrality within industrial operations has become increasingly imperative for sustainable development. It is both a significant challenge and a key opportunity for operational optimization in industry 4.0. In recent years, Deep ...
- research-articleDecember 2024
Deep Learning-Driven Cooperative Hole Crossing by Multiple UAVs
ICSLT '24: Proceedings of the 2024 10th International Conference on e-Society, e-Learning and e-Technologies (ICSLT)Pages 94–98https://doi.org/10.1145/3678610.3678622This research presents a novel deep learning framework designed to enable autonomous navigation of unmanned aerial vehicle (UAV) swarms through confined spaces, specifically focusing on the challenge of hole crossing. Leveraging the Gazebo simulation ...
- research-articleJune 2024
SARI: Shared Autonomy across Repeated Interaction
ACM Transactions on Human-Robot Interaction (THRI), Volume 13, Issue 2Article No.: 23, Pages 1–36https://doi.org/10.1145/3651994Assistive robot arms try to help their users perform everyday tasks. One way robots can provide this assistance is shared autonomy. Within shared autonomy, both the human and robot maintain control over the robot’s motion: as the robot becomes confident ...
- research-articleMay 2024
Combining imitation and deep reinforcement learning to human-level performance on a virtual foraging task
- Vittorio Giammarino,
- Matthew F Dunne,
- Kylie N Moore,
- Michael E Hasselmo,
- Chantal E Stern,
- Ioannis Ch Paschalidis
Adaptive Behavior - Animals, Animats, Software Agents, Robots, Adaptive Systems (SAGE-ADAP), Volume 32, Issue 3Pages 251–263https://doi.org/10.1177/10597123231201655We develop a framework to learn bio-inspired foraging policies using human data. We conduct an experiment where humans are virtually immersed in an open field foraging environment and are trained to collect the highest amount of rewards. A Markov ...
Meet Challenges of RTT Jitter, A Hybrid Internet Congestion Control Algorithm
WWW '24: Proceedings of the ACM Web Conference 2024Pages 2768–2776https://doi.org/10.1145/3589334.3645338Congestion control has been a fundamental research focus in web transmission for over 30 years. However, with diverse network scenarios like cellular networks and WiFi, traditional models might no longer accurately describe current network conditions -- ...
- research-articleMay 2024
Imitation Learning Datasets: A Toolkit For Creating Datasets, Training Agents and Benchmarking
AAMAS '24: Proceedings of the 23rd International Conference on Autonomous Agents and Multiagent SystemsPages 2800–2802Imitation learning field requires expert data to train agents in a task. Most often, this learning approach suffers from the absence of available data, which results in techniques being tested on its dataset. Creating datasets is a cumbersome process ...
- extended-abstractMay 2024
Overview of t-DGR: A Trajectory-Based Deep Generative Replay Method for Continual Learning in Decision Making
AAMAS '24: Proceedings of the 23rd International Conference on Autonomous Agents and Multiagent SystemsPages 2579–2581Deep generative replay has emerged as a promising approach for continual learning in decision-making tasks. This approach addresses the problem of catastrophic forgetting by leveraging the generation of trajectories from previously encountered tasks to ...
- extended-abstractMay 2024
HiMAP: Learning Heuristics-Informed Policies for Large-Scale Multi-Agent Pathfinding
AAMAS '24: Proceedings of the 23rd International Conference on Autonomous Agents and Multiagent SystemsPages 2498–2500Large-scale multi-agent pathfinding (MAPF) presents significant challenges in several areas. As systems grow in complexity with a multitude of autonomous agents operating simultaneously, efficient and collision-free coordination becomes paramount. ...
- extended-abstractMay 2024
ELA: Exploited Level Augmentation for Offline Learning in Zero-Sum Games
AAMAS '24: Proceedings of the 23rd International Conference on Autonomous Agents and Multiagent SystemsPages 2357–2359Offline learning derives effective policies from expert demonstrators' datasets without direct interaction. While recent research consider dataset characteristics like expertise level or multiple demonstrators, a distinct approach is necessary in zero-...
- extended-abstractMay 2024
Dual-Policy-Guided Offline Reinforcement Learning with Optimal Stopping
AAMAS '24: Proceedings of the 23rd International Conference on Autonomous Agents and Multiagent SystemsPages 2315–2317Policy-guided offline reinforcement learning (POR) decomposes the offline reinforcement learning (offline RL) problem into goal estimation and goal-conditioned execution subproblems, leading to improved performance. However, we reveal that the ...
- extended-abstractMay 2024
Gaze Supervision for Mitigating Causal Confusion in Driving Agents
- Abhijat Biswas,
- Badal Arun Pardhi,
- Caleb Chuck,
- Jarrett Holtz,
- Scott Niekum,
- Henny Admoni,
- Alessandro Allievi
AAMAS '24: Proceedings of the 23rd International Conference on Autonomous Agents and Multiagent SystemsPages 2159–2161Imitation Learning (IL) algorithms show promise in learning human-level driving behavior, but they often suffer from "causal confusion," a phenomenon where the lack of explicit inference of the underlying causal structure can result in misattribution of ...
- research-articleApril 2024
Personalizing an AR-based Communication System for Nonspeaking Autistic Users
IUI '24: Proceedings of the 29th International Conference on Intelligent User InterfacesPages 731–741https://doi.org/10.1145/3640543.3645153Nonspeaking autistic individuals ("nonspeakers") represent about one-third of the autistic population, and most are never provided with an effective alternative to speech, hindering their educational, employment, and social opportunities. Some ...
- research-articleMarch 2024
Aligning Human and Robot Representations
HRI '24: Proceedings of the 2024 ACM/IEEE International Conference on Human-Robot InteractionPages 42–54https://doi.org/10.1145/3610977.3634987To act in the world, robots rely on a representation of salient task aspects: for example, to carry a coffee mug, a robot may consider movement efficiency or mug orientation in its behavior. However, if we want robots to act for and with people, their ...
- research-articleMarch 2024
Interpretable Imitation Learning with Dynamic Causal Relations
- Tianxiang Zhao,
- Wenchao Yu,
- Suhang Wang,
- Lu Wang,
- Xiang Zhang,
- Yuncong Chen,
- Yanchi Liu,
- Wei Cheng,
- Haifeng Chen
WSDM '24: Proceedings of the 17th ACM International Conference on Web Search and Data MiningPages 967–975https://doi.org/10.1145/3616855.3635827Imitation learning, which learns agent policy by mimicking expert demonstration, has shown promising results in many applications such as medical treatment regimes and self-driving vehicles. However, it remains a difficult task to interpret control ...
- research-articleMarch 2024
Follow the LIBRA: Guiding Fair Policy for Unified Impression Allocation via Adversarial Rewarding
WSDM '24: Proceedings of the 17th ACM International Conference on Web Search and Data MiningPages 750–759https://doi.org/10.1145/3616855.3635756The diverse advertiser demands (brand effects or immediate outcomes) lead to distinct selling (pre-agreed volumes with an under-delivery penalty or compete per auction) and pricing (fixed prices or varying bids) patterns in Guaranteed delivery (GD) and ...