More Web Proxy on the site http://driver.im/

short-paper

Crowd simulation by deep reinforcement learning

Authors:

Jehee LeeAuthors Info & Claims

MIG '18: Proceedings of the 11th ACM SIGGRAPH Conference on Motion, Interaction and Games

Article No.: 2, Pages 1 - 7

https://doi.org/10.1145/3274247.3274510

Published: 08 November 2018 Publication History

Abstract

Simulating believable virtual crowds has been an important research topic in many research fields such as industry films, computer games, urban engineering, and behavioral science. One of the key capabilities agents should have is navigation, which is reaching goals without colliding with other agents or obstacles. The key challenge here is that the environment changes dynamically, where the current decision of an agent can largely affect the state of other agents as well as the agent in the future. Recently, reinforcement learning with deep neural networks has shown remarkable results in sequential decision-making problems. With the power of convolution neural networks, elaborate control with visual sensory inputs has also become possible. In this paper, we present an agent-based deep reinforcement learning approach for navigation, where only a simple reward function enables agents to navigate in various complex scenarios. Our method is also able to do that with a single unified policy for every scenario, where the scenario-specific parameter tuning is unnecessary. We will show the effectiveness of our method through a variety of scenarios and settings.

Supplementary Material

MP4 File (a2-lee.mp4)

Download
43.83 MB

References

[1]

Alexandre Alahi, Kratarth Goel, Vignesh Ramanathan, Alexandre Robicquet, Li Fei-Fei, and Silvio Savarese. 2016. Social LSTM: Human Trajectory Prediction in Crowded Spaces. In The IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[2]

Luiselena Casadiego Bastidas. 2014.

[3]

Luiselena Casadiego and Nuria Pelechano. 2015. From One to Many : Simulating Groups of Agents with RL Controllers. In Intelligent Virtual Agents: 15th International Conference. 119--125.

[4]

Panayiotis Charalambous, Ioannis Karamouzas, Stephen J. Guy, and Yiorgos Chrysanthou. 2014. A Data-Driven Framework for Visual Crowd Analysis. Comput. Graph. Forum (2014).

Digital Library

[5]

Gabriel V. de la Cruz, Bei Peng, Walter S. Lasecki, and Matthew E. Taylor. 2015. Towards Integrating Real-Time Crowd Advice with Reinforcement Learning. In Proceedings of the 20th International Conference on Intelligent User Interfaces Companion (IUI Companion '15). 17--20.

Digital Library

[6]

Agrim Gupta, Justin Johnson, Li Fei-Fei, Silvio Savarese, and Alexandre Alahi. 2018. Social GAN: Socially Acceptable Trajectories with Generative Adversarial Networks. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[7]

Stephen J. Guy, Jatin Chhugani, Changkyu Kim, Nadathur Satish, Ming Lin, Dinesh Manocha, and Pradeep Dubey 2009. ClearPath: Highly parallel collision avoidance for multi-agent simulation. In Symposium on Computer Animation 2009 - ACM SIGGRAPH / Eurographics Symposium Proceedings.

Digital Library

[8]

Stephen J. Guy, Sean Curtis, Ming C. Lin, and Dinesh Manocha. 2012. Least-effort trajectories lead to emergent crowd behaviors. Phys. Rev. E (2012).

[9]

Stephen J. Guy, Sujeong Kim, Ming C. Lin, and Dinesh Manocha. 2011. Simulating Heterogeneous Crowd Behaviors Using Personality Trait Theory. In Proceedings of the 2011 ACM SIGGRAPH/Eurographics Symposium on Computer Animation.

Digital Library

[10]

Stephen J. Guy, Jur van den Berg, Wenxi Liu, Rynson Lau, Ming C. Lin, and Dinesh Manocha. 2012. A Statistical Similarity Measure for Aggregate Crowd Dynamics. ACM Trans. Graph. (2012).

Digital Library

[11]

D. Helbing, I. Farkas, and T. Vicsek. 2000. Simulating dynamical features of escape panic. Nature 407 (2000), 487--490.

[12]

Dirk Helbing and Péter Molnár. 1995. Social force model for pedestrian dynamics. Phys. Rev. E (1995).

[13]

P. Henry, C. Vollmer, B. Ferris, and D. Fox. 2010. Learning to navigate through crowded environments. In 2010 IEEE International Conference on Robotics and Automation. 981--986.

[14]

Ludovic Hoyet, Anne-Helene Olivier, Richard Kulpa, and Julien Pettr'e. 2016. Perceptual Effect of Shoulder Motions on Crowd Animations. ACM Trans. Graph. 35, 4, Article 53 (2016), 53:1--53:10 pages.

Digital Library

[15]

Roger L. Hughes. 2002. A continuum theory for the flow of pedestrians. Transportation Research Part B: Methodological (2002).

[16]

Eunjung Ju, Myung Geol Choi, Minji Park, Jehee Lee, Kang Hoon Lee, and Shigeo Takahashi. 2010. Morphable Crowds. In ACM SIGGRAPH Asia 2010 Papers (SIGGRAPH ASIA '10). ACM.

Digital Library

[17]

Ioannis Karamouzas, Brian Skinner, and Stephen J. Guy. 2014. Universal Power Law Governing Pedestrian Interactions. Phys. Rev. Lett. 113 (2014). Issue 23.

[18]

Jongmin Kim, Yeongho Seol, Taesoo Kwon, and Jehee Lee. 2014. Interactive Manipulation of Large-scale Crowd Animation. ACM Trans. Graph. (2014).

Digital Library

[19]

Taesoo Kwon, Kang Hoon Lee, Jehee Lee, and Shigeo Takahashi. 2008. Group Motion Editing. ACM Trans. Graph. 27, 3 (2008), 80:1--80:8.

Digital Library

[20]

Kang Hoon Lee, Myung Geol Choi, and Jehee Lee. 2006. Motion Patches: Building Blocks for Virtual Environments Annotated with Motion Data. ACM Trans. Graph. 25, 3 (2006), 898--906.

Digital Library

[21]

Alon Lerner, Yiorgos Chrysanthou, and Dani Lischinski. 2007. Crowds by Example. (2007).

[22]

Timothy P. Lillicrap, Jonathan J. Hunt, Alexander Pritzel, Nicolas Heess, Tom Erez, Yuval Tassa, David Silver, and Daan Wierstra. 2015. Continuous control with deep reinforcement learning. CoRR abs/1509.02971 (2015).

[23]

Weining Liu, Vladimir Pavlovic, Kaidong Hu, Petros Faloutsos, Sejong Yoon, and Mubbasir Kapadia. 2017. Characterizing the Relationship Between Environment Layout and Crowd Movement Using Machine Learning. In Proceedings of the Tenth International Conference on Motion in Games (MIG '17). ACM, Article 2, 6 pages.

Digital Library

[24]

Francisco Martinez-Gil, Miguel Lozano, and Fernando Fernández. 2012. Multi-agent Reinforcement Learning for Simulating Pedestrian Navigation. In Adaptive and Learning Agents. 54--69.

Digital Library

[25]

Francisco Martinez-Gil, Miguel Lozano, and Fernando Fernndez. 2015. Strategies for simulating pedestrian navigation with multiple reinforcement learning agents. 29 (01 2015), 98--130.

Digital Library

[26]

Volodymyr Mnih, Koray Kavukcuoglu, David Silver, Andrei A. Rusu, Joel Veness, Marc G. Bellemare, Alex Graves, Martin Riedmiller, Andreas K. Fidjeland, Georg Ostrovski, Stig Petersen, Charles Beattie, Amir Sadik, Ioannis Antonoglou, Helen King, Dharshan Kumaran, Daan Wierstra, Shane Legg, and Demis Hassabis. 2015. Human-level control through deep reinforcement learning. Nature 518 (2015), 529--533.

[27]

Rahul Narain, Abhinav Golas, Sean Curtis, and Ming C. Lin. 2009. Aggregate Dynamics for Dense Crowd Simulation. In ACM SIGGRAPH Asia 2009 Papers (SIGGRAPH Asia '09).

Digital Library

[28]

Aline Normoyle, Maxim Likhachev, and Alla Safonova. 2014. Stochastic Activity Authoring with Direct User Control. In Proceedings of the 18th Meeting of the ACM SIGGRAPH Symposium on Interactive 3D Graphics and Games (I3D '14). ACM.

Digital Library

[29]

Jan Ondřej, Julien Pettré, Anne-Hélène Olivier, and Stéphane Donikian. 2010. A Synthetic-vision Based Steering Approach for Crowd Simulation. In ACM SIGGRAPH 2010 Papers.

Digital Library

[30]

N. Pelechano, J. M. Allbeck, and N. I. Badler. 2007. Controlling Individual Agents in High-density Crowd Simulation. In Proceedings of the 2007 ACM SIGGRAPH/Eurographics Symposium on Computer Animation (SCA '07).

Digital Library

[31]

Julien Pettré, Jan Ondřej, Anne-Hélène Olivier, Armel Cretual, and Stéphane Donikian. 2009. Experiment-based Modeling, Simulation and Validation of Interactions Between Virtual Walkers. In Proceedings of the 2009 ACM SIGGRAPH/Eurographics Symposium on Computer Animation. ACM.

Digital Library

[32]

Gang Qiao, Sejong Yoon, Mubbasir Kapadia, and Vladimir Pavlovic. 2018. The Role of Data-Driven Priors in Multi-Agent Crowd Trajectory Estimation. (2018).

[33]

Craig W. Reynolds. 1987. Flocks, Herds and Schools: A Distributed Behavioral Model. SIGGRAPH Comput. Graph. 21 (1987).

Digital Library

[34]

David Silver, Aja Huang, Chris J. Maddison, Arthur Guez, Laurent Sifre, George van den Driessche, Julian Schrittwieser, Ioannis Antonoglou, Veda Panneershelvam, Marc Lanctot, Sander Dieleman, Dominik Grewe, John Nham, Nal Kalchbrenner, Ilya Sutskever, Timothy Lillicrap, Madeleine Leach, Koray Kavukcuoglu, Thore Graepel, and Demis Hassabis. 2016. Mastering the Game of Go with Deep Neural Networks and Tree Search. Nature 529, 7587 (2016), 484--489.

[35]

TensorFlow. 2015. TensorFlow: Large-Scale Machine Learning on Heterogeneous Systems. (2015). http://tensorflow.org/ Software available from tensorflow.org.

[36]

Lisa Torrey. 2010. Crowd Simulation via Multi-agent Reinforcement Learning. In Proceedings of the Sixth AAAI Conference on Artificial Intelligence and Interactive Digital Entertainment (AIIDE'10). 89--94.

Digital Library

[37]

Adrien Treuille, Seth Cooper, and Zoran Popović. 2006. Continuum Crowds. ACM Trans. Graph. (2006).

Digital Library

[38]

L. M. Vaina, Scott A. Beardsley, and Simon K. Rushton. 2010. Optic Flow and Beyond (1st ed.). Springer Publishing Company, Incorporated.

Digital Library

[39]

Jur van den Berg, Stephen J. Guy, Ming C. Lin, and Dinesh Manocha. 2009. Reciprocal n-body Collision Avoidance. In INTERNATIONAL SYMPOSIUM ON ROBOTICS RESEARCH.

[40]

Jur P. van den Berg, Ming C. Lin, and Dinesh Manocha. 2008. Reciprocal Velocity Obstacles for real-time multi-agent navigation. 2008 IEEE International Conference on Robotics and Automation (2008), 1928--1935.

Cited By

Wu YDou ZIshiwaka YOgawa SLou YWang WLiu LKomura T(2024)CBIL: Collective Behavior Imitation Learning for Fish from Real VideosACM Transactions on Graphics10.1145/368790443:6(1-17)Online publication date: 19-Dec-2024
https://dl.acm.org/doi/10.1145/3687904
Ferreira DShatzel LHaworth B(2024)Deformable Elliptical Particles for Predictive Mesh-Adaptive CrowdsProceedings of the 17th ACM SIGGRAPH Conference on Motion, Interaction, and Games10.1145/3677388.3696329(1-11)Online publication date: 21-Nov-2024
https://dl.acm.org/doi/10.1145/3677388.3696329
Talukdar BZhang YWeiss T(2024)Learning Crowd Motion Dynamics with CrowdsProceedings of the ACM on Computer Graphics and Interactive Techniques10.1145/36513027:1(1-17)Online publication date: 13-May-2024
https://dl.acm.org/doi/10.1145/3651302
Show More Cited By

Index Terms

Crowd simulation by deep reinforcement learning
1. Computing methodologies
  1. Computer graphics
    1. Animation
  2. Machine learning
    1. Learning paradigms
      1. Reinforcement learning

Recommendations

GREIL-Crowds: Crowd Simulation with Deep Reinforcement Learning and Examples

Simulating crowds with realistic behaviors is a difficult but very important task for a variety of applications. Quantifying how a person balances between different conflicting criteria such as goal seeking, collision avoidance and moving within a group ...
Learning to move in crowd
SIGGRAPH '18: ACM SIGGRAPH 2018 Posters

The main goal of the crowd simulation is to generate realistic movements of agents. Reproducing the mechanism that seeing the environments, understanding current situation, and deciding where to step is crucial point to simulating crowd movements. We ...
Reinforcement learning-based dynamic obstacle avoidance and integration of path planning
Abstract
Deep reinforcement learning has the advantage of being able to encode fairly complex behaviors by collecting and learning empirical information. In the current study, we have proposed a framework for reinforcement learning in decentralized ...

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences

MIG '18: Proceedings of the 11th ACM SIGGRAPH Conference on Motion, Interaction and Games

November 2018

185 pages

ISBN:9781450360159

DOI:10.1145/3274247

Conference Chairs:
Panayiotis Charalambous
The Cyprus Institute
,
Yiorgos Chrysanthou
The University of Cyprus & RISE
,
Program Chairs:
Ben Jones
University of Utah
,
Jehee Lee
Seoul National University

Copyright © 2018 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

SIGGRAPH: ACM Special Interest Group on Computer Graphics and Interactive Techniques

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 08 November 2018

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Short-paper

Funding Sources

Ministry of Science ICT and Future Planning

Conference

MIG '18

Sponsor:

SIGGRAPH

MIG '18: Motion, Interaction and Games

November 8 - 10, 2018

Limassol, Cyprus

Acceptance Rates

Overall Acceptance Rate -9 of -9 submissions, 100%

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

48
Total Citations
View Citations
1,120
Total Downloads

Downloads (Last 12 months)167
Downloads (Last 6 weeks)18

Reflects downloads up to 14 Dec 2024

Other Metrics

View Author Metrics

Citations

Cited By

Wu YDou ZIshiwaka YOgawa SLou YWang WLiu LKomura T(2024)CBIL: Collective Behavior Imitation Learning for Fish from Real VideosACM Transactions on Graphics10.1145/368790443:6(1-17)Online publication date: 19-Dec-2024
https://dl.acm.org/doi/10.1145/3687904
Ferreira DShatzel LHaworth B(2024)Deformable Elliptical Particles for Predictive Mesh-Adaptive CrowdsProceedings of the 17th ACM SIGGRAPH Conference on Motion, Interaction, and Games10.1145/3677388.3696329(1-11)Online publication date: 21-Nov-2024
https://dl.acm.org/doi/10.1145/3677388.3696329
Talukdar BZhang YWeiss T(2024)Learning Crowd Motion Dynamics with CrowdsProceedings of the ACM on Computer Graphics and Interactive Techniques10.1145/36513027:1(1-17)Online publication date: 13-May-2024
https://dl.acm.org/doi/10.1145/3651302
Ling BLyu YLi DGao GShi YXu XWu W(2024)SocialGAIL: Faithful Crowd Simulation for Social Robot Navigation2024 IEEE International Conference on Robotics and Automation (ICRA)10.1109/ICRA57147.2024.10610371(16873-16880)Online publication date: 13-May-2024
https://doi.org/10.1109/ICRA57147.2024.10610371
Chung HOh JHeo JLee GOh S(2024)MAC-ID: Multi-Agent Reinforcement Learning with Local Coordination for Individual Diversity2024 IEEE International Conference on Robotics and Automation (ICRA)10.1109/ICRA57147.2024.10610314(15233-15239)Online publication date: 13-May-2024
https://doi.org/10.1109/ICRA57147.2024.10610314
Hu KYoon SPavlovic VKapadia M(2024)Toward Realistic Human Crowd Simulations with Data-Driven Parameter Space Exploration2024 IEEE International Conference on Artificial Intelligence and eXtended and Virtual Reality (AIxVR)10.1109/AIxVR59861.2024.00035(221-225)Online publication date: 17-Jan-2024
https://doi.org/10.1109/AIxVR59861.2024.00035
Kong YYi KWang LPeng CNguyen LMa Q(2024)RPMTD: A Route Planning Model With Consideration of Tourists’ DistributionIEEE Access10.1109/ACCESS.2024.340037312(69488-69504)Online publication date: 2024
https://doi.org/10.1109/ACCESS.2024.3400373
Knob PPinho GSilva GMontanha RPeres VAraujo VMusse S(2024)Surveying the evolution of virtual humans expressiveness toward real humansComputers and Graphics10.1016/j.cag.2024.104034123:COnline publication date: 1-Oct-2024
https://dl.acm.org/doi/10.1016/j.cag.2024.104034
Bahamid AMohd Ibrahim AShafie A(2024)Crowd evacuation with human-level intelligence via neuro-symbolic approachAdvanced Engineering Informatics10.1016/j.aei.2024.10235660(102356)Online publication date: Apr-2024
https://doi.org/10.1016/j.aei.2024.102356
Sun LYan JQiu YQin W(2024)The crowd cooperation approach for formation maintenance and collision avoidance using multi-agent deep reinforcement learningThe Visual Computer10.1007/s00371-024-03647-1Online publication date: 19-Oct-2024
https://doi.org/10.1007/s00371-024-03647-1
Show More Cited By

View Options

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Media

Figures

Other

Tables

View Table of Contents