[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/3274247.3274510acmconferencesArticle/Chapter ViewAbstractPublication PagesmigConference Proceedingsconference-collections
short-paper

Crowd simulation by deep reinforcement learning

Published: 08 November 2018 Publication History

Abstract

Simulating believable virtual crowds has been an important research topic in many research fields such as industry films, computer games, urban engineering, and behavioral science. One of the key capabilities agents should have is navigation, which is reaching goals without colliding with other agents or obstacles. The key challenge here is that the environment changes dynamically, where the current decision of an agent can largely affect the state of other agents as well as the agent in the future. Recently, reinforcement learning with deep neural networks has shown remarkable results in sequential decision-making problems. With the power of convolution neural networks, elaborate control with visual sensory inputs has also become possible. In this paper, we present an agent-based deep reinforcement learning approach for navigation, where only a simple reward function enables agents to navigate in various complex scenarios. Our method is also able to do that with a single unified policy for every scenario, where the scenario-specific parameter tuning is unnecessary. We will show the effectiveness of our method through a variety of scenarios and settings.

Supplementary Material

MP4 File (a2-lee.mp4)

References

[1]
Alexandre Alahi, Kratarth Goel, Vignesh Ramanathan, Alexandre Robicquet, Li Fei-Fei, and Silvio Savarese. 2016. Social LSTM: Human Trajectory Prediction in Crowded Spaces. In The IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[2]
Luiselena Casadiego Bastidas. 2014.
[3]
Luiselena Casadiego and Nuria Pelechano. 2015. From One to Many : Simulating Groups of Agents with RL Controllers. In Intelligent Virtual Agents: 15th International Conference. 119--125.
[4]
Panayiotis Charalambous, Ioannis Karamouzas, Stephen J. Guy, and Yiorgos Chrysanthou. 2014. A Data-Driven Framework for Visual Crowd Analysis. Comput. Graph. Forum (2014).
[5]
Gabriel V. de la Cruz, Bei Peng, Walter S. Lasecki, and Matthew E. Taylor. 2015. Towards Integrating Real-Time Crowd Advice with Reinforcement Learning. In Proceedings of the 20th International Conference on Intelligent User Interfaces Companion (IUI Companion '15). 17--20.
[6]
Agrim Gupta, Justin Johnson, Li Fei-Fei, Silvio Savarese, and Alexandre Alahi. 2018. Social GAN: Socially Acceptable Trajectories with Generative Adversarial Networks. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[7]
Stephen J. Guy, Jatin Chhugani, Changkyu Kim, Nadathur Satish, Ming Lin, Dinesh Manocha, and Pradeep Dubey 2009. ClearPath: Highly parallel collision avoidance for multi-agent simulation. In Symposium on Computer Animation 2009 - ACM SIGGRAPH / Eurographics Symposium Proceedings.
[8]
Stephen J. Guy, Sean Curtis, Ming C. Lin, and Dinesh Manocha. 2012. Least-effort trajectories lead to emergent crowd behaviors. Phys. Rev. E (2012).
[9]
Stephen J. Guy, Sujeong Kim, Ming C. Lin, and Dinesh Manocha. 2011. Simulating Heterogeneous Crowd Behaviors Using Personality Trait Theory. In Proceedings of the 2011 ACM SIGGRAPH/Eurographics Symposium on Computer Animation.
[10]
Stephen J. Guy, Jur van den Berg, Wenxi Liu, Rynson Lau, Ming C. Lin, and Dinesh Manocha. 2012. A Statistical Similarity Measure for Aggregate Crowd Dynamics. ACM Trans. Graph. (2012).
[11]
D. Helbing, I. Farkas, and T. Vicsek. 2000. Simulating dynamical features of escape panic. Nature 407 (2000), 487--490.
[12]
Dirk Helbing and Péter Molnár. 1995. Social force model for pedestrian dynamics. Phys. Rev. E (1995).
[13]
P. Henry, C. Vollmer, B. Ferris, and D. Fox. 2010. Learning to navigate through crowded environments. In 2010 IEEE International Conference on Robotics and Automation. 981--986.
[14]
Ludovic Hoyet, Anne-Helene Olivier, Richard Kulpa, and Julien Pettr'e. 2016. Perceptual Effect of Shoulder Motions on Crowd Animations. ACM Trans. Graph. 35, 4, Article 53 (2016), 53:1--53:10 pages.
[15]
Roger L. Hughes. 2002. A continuum theory for the flow of pedestrians. Transportation Research Part B: Methodological (2002).
[16]
Eunjung Ju, Myung Geol Choi, Minji Park, Jehee Lee, Kang Hoon Lee, and Shigeo Takahashi. 2010. Morphable Crowds. In ACM SIGGRAPH Asia 2010 Papers (SIGGRAPH ASIA '10). ACM.
[17]
Ioannis Karamouzas, Brian Skinner, and Stephen J. Guy. 2014. Universal Power Law Governing Pedestrian Interactions. Phys. Rev. Lett. 113 (2014). Issue 23.
[18]
Jongmin Kim, Yeongho Seol, Taesoo Kwon, and Jehee Lee. 2014. Interactive Manipulation of Large-scale Crowd Animation. ACM Trans. Graph. (2014).
[19]
Taesoo Kwon, Kang Hoon Lee, Jehee Lee, and Shigeo Takahashi. 2008. Group Motion Editing. ACM Trans. Graph. 27, 3 (2008), 80:1--80:8.
[20]
Kang Hoon Lee, Myung Geol Choi, and Jehee Lee. 2006. Motion Patches: Building Blocks for Virtual Environments Annotated with Motion Data. ACM Trans. Graph. 25, 3 (2006), 898--906.
[21]
Alon Lerner, Yiorgos Chrysanthou, and Dani Lischinski. 2007. Crowds by Example. (2007).
[22]
Timothy P. Lillicrap, Jonathan J. Hunt, Alexander Pritzel, Nicolas Heess, Tom Erez, Yuval Tassa, David Silver, and Daan Wierstra. 2015. Continuous control with deep reinforcement learning. CoRR abs/1509.02971 (2015).
[23]
Weining Liu, Vladimir Pavlovic, Kaidong Hu, Petros Faloutsos, Sejong Yoon, and Mubbasir Kapadia. 2017. Characterizing the Relationship Between Environment Layout and Crowd Movement Using Machine Learning. In Proceedings of the Tenth International Conference on Motion in Games (MIG '17). ACM, Article 2, 6 pages.
[24]
Francisco Martinez-Gil, Miguel Lozano, and Fernando Fernández. 2012. Multi-agent Reinforcement Learning for Simulating Pedestrian Navigation. In Adaptive and Learning Agents. 54--69.
[25]
Francisco Martinez-Gil, Miguel Lozano, and Fernando Fernndez. 2015. Strategies for simulating pedestrian navigation with multiple reinforcement learning agents. 29 (01 2015), 98--130.
[26]
Volodymyr Mnih, Koray Kavukcuoglu, David Silver, Andrei A. Rusu, Joel Veness, Marc G. Bellemare, Alex Graves, Martin Riedmiller, Andreas K. Fidjeland, Georg Ostrovski, Stig Petersen, Charles Beattie, Amir Sadik, Ioannis Antonoglou, Helen King, Dharshan Kumaran, Daan Wierstra, Shane Legg, and Demis Hassabis. 2015. Human-level control through deep reinforcement learning. Nature 518 (2015), 529--533.
[27]
Rahul Narain, Abhinav Golas, Sean Curtis, and Ming C. Lin. 2009. Aggregate Dynamics for Dense Crowd Simulation. In ACM SIGGRAPH Asia 2009 Papers (SIGGRAPH Asia '09).
[28]
Aline Normoyle, Maxim Likhachev, and Alla Safonova. 2014. Stochastic Activity Authoring with Direct User Control. In Proceedings of the 18th Meeting of the ACM SIGGRAPH Symposium on Interactive 3D Graphics and Games (I3D '14). ACM.
[29]
Jan Ondřej, Julien Pettré, Anne-Hélène Olivier, and Stéphane Donikian. 2010. A Synthetic-vision Based Steering Approach for Crowd Simulation. In ACM SIGGRAPH 2010 Papers.
[30]
N. Pelechano, J. M. Allbeck, and N. I. Badler. 2007. Controlling Individual Agents in High-density Crowd Simulation. In Proceedings of the 2007 ACM SIGGRAPH/Eurographics Symposium on Computer Animation (SCA '07).
[31]
Julien Pettré, Jan Ondřej, Anne-Hélène Olivier, Armel Cretual, and Stéphane Donikian. 2009. Experiment-based Modeling, Simulation and Validation of Interactions Between Virtual Walkers. In Proceedings of the 2009 ACM SIGGRAPH/Eurographics Symposium on Computer Animation. ACM.
[32]
Gang Qiao, Sejong Yoon, Mubbasir Kapadia, and Vladimir Pavlovic. 2018. The Role of Data-Driven Priors in Multi-Agent Crowd Trajectory Estimation. (2018).
[33]
Craig W. Reynolds. 1987. Flocks, Herds and Schools: A Distributed Behavioral Model. SIGGRAPH Comput. Graph. 21 (1987).
[34]
David Silver, Aja Huang, Chris J. Maddison, Arthur Guez, Laurent Sifre, George van den Driessche, Julian Schrittwieser, Ioannis Antonoglou, Veda Panneershelvam, Marc Lanctot, Sander Dieleman, Dominik Grewe, John Nham, Nal Kalchbrenner, Ilya Sutskever, Timothy Lillicrap, Madeleine Leach, Koray Kavukcuoglu, Thore Graepel, and Demis Hassabis. 2016. Mastering the Game of Go with Deep Neural Networks and Tree Search. Nature 529, 7587 (2016), 484--489.
[35]
TensorFlow. 2015. TensorFlow: Large-Scale Machine Learning on Heterogeneous Systems. (2015). http://tensorflow.org/ Software available from tensorflow.org.
[36]
Lisa Torrey. 2010. Crowd Simulation via Multi-agent Reinforcement Learning. In Proceedings of the Sixth AAAI Conference on Artificial Intelligence and Interactive Digital Entertainment (AIIDE'10). 89--94.
[37]
Adrien Treuille, Seth Cooper, and Zoran Popović. 2006. Continuum Crowds. ACM Trans. Graph. (2006).
[38]
L. M. Vaina, Scott A. Beardsley, and Simon K. Rushton. 2010. Optic Flow and Beyond (1st ed.). Springer Publishing Company, Incorporated.
[39]
Jur van den Berg, Stephen J. Guy, Ming C. Lin, and Dinesh Manocha. 2009. Reciprocal n-body Collision Avoidance. In INTERNATIONAL SYMPOSIUM ON ROBOTICS RESEARCH.
[40]
Jur P. van den Berg, Ming C. Lin, and Dinesh Manocha. 2008. Reciprocal Velocity Obstacles for real-time multi-agent navigation. 2008 IEEE International Conference on Robotics and Automation (2008), 1928--1935.

Cited By

View all
  • (2024)CBIL: Collective Behavior Imitation Learning for Fish from Real VideosACM Transactions on Graphics10.1145/368790443:6(1-17)Online publication date: 19-Dec-2024
  • (2024)Deformable Elliptical Particles for Predictive Mesh-Adaptive CrowdsProceedings of the 17th ACM SIGGRAPH Conference on Motion, Interaction, and Games10.1145/3677388.3696329(1-11)Online publication date: 21-Nov-2024
  • (2024)Learning Crowd Motion Dynamics with CrowdsProceedings of the ACM on Computer Graphics and Interactive Techniques10.1145/36513027:1(1-17)Online publication date: 13-May-2024
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
MIG '18: Proceedings of the 11th ACM SIGGRAPH Conference on Motion, Interaction and Games
November 2018
185 pages
ISBN:9781450360159
DOI:10.1145/3274247
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 08 November 2018

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. animation
  2. collision avoidance
  3. crowd simulation
  4. reinforcement learning

Qualifiers

  • Short-paper

Funding Sources

Conference

MIG '18
Sponsor:
MIG '18: Motion, Interaction and Games
November 8 - 10, 2018
Limassol, Cyprus

Acceptance Rates

Overall Acceptance Rate -9 of -9 submissions, 100%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)167
  • Downloads (Last 6 weeks)18
Reflects downloads up to 14 Dec 2024

Other Metrics

Citations

Cited By

View all
  • (2024)CBIL: Collective Behavior Imitation Learning for Fish from Real VideosACM Transactions on Graphics10.1145/368790443:6(1-17)Online publication date: 19-Dec-2024
  • (2024)Deformable Elliptical Particles for Predictive Mesh-Adaptive CrowdsProceedings of the 17th ACM SIGGRAPH Conference on Motion, Interaction, and Games10.1145/3677388.3696329(1-11)Online publication date: 21-Nov-2024
  • (2024)Learning Crowd Motion Dynamics with CrowdsProceedings of the ACM on Computer Graphics and Interactive Techniques10.1145/36513027:1(1-17)Online publication date: 13-May-2024
  • (2024)SocialGAIL: Faithful Crowd Simulation for Social Robot Navigation2024 IEEE International Conference on Robotics and Automation (ICRA)10.1109/ICRA57147.2024.10610371(16873-16880)Online publication date: 13-May-2024
  • (2024)MAC-ID: Multi-Agent Reinforcement Learning with Local Coordination for Individual Diversity2024 IEEE International Conference on Robotics and Automation (ICRA)10.1109/ICRA57147.2024.10610314(15233-15239)Online publication date: 13-May-2024
  • (2024)Toward Realistic Human Crowd Simulations with Data-Driven Parameter Space Exploration2024 IEEE International Conference on Artificial Intelligence and eXtended and Virtual Reality (AIxVR)10.1109/AIxVR59861.2024.00035(221-225)Online publication date: 17-Jan-2024
  • (2024)RPMTD: A Route Planning Model With Consideration of Tourists’ DistributionIEEE Access10.1109/ACCESS.2024.340037312(69488-69504)Online publication date: 2024
  • (2024)Surveying the evolution of virtual humans expressiveness toward real humansComputers and Graphics10.1016/j.cag.2024.104034123:COnline publication date: 1-Oct-2024
  • (2024)Crowd evacuation with human-level intelligence via neuro-symbolic approachAdvanced Engineering Informatics10.1016/j.aei.2024.10235660(102356)Online publication date: Apr-2024
  • (2024)The crowd cooperation approach for formation maintenance and collision avoidance using multi-agent deep reinforcement learningThe Visual Computer10.1007/s00371-024-03647-1Online publication date: 19-Oct-2024
  • Show More Cited By

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media