How can I add termination condition to legged robot environment? #575
-
Hi everyone, I am working on building an environments for quadruped robots in Warp. My goal is to make an environment for deploying policy learning algorithms (BPTT/RL). In order to do that, I need to define termination conditions, which I am struggling with. At the moment, the code is using CUDA graph to capture the forward loop. My approach for the moment is simply iteratively update the attribute
|
Beta Was this translation helpful? Give feedback.
Replies: 2 comments 1 reply
-
you could try adding terminal as a wp.array to model.state and update it inside a wp.kernel. right now, looks like you're using the same variable across all states. fyi Rewarped Ant RL example :) |
Beta Was this translation helpful? Give feedback.
-
To add some more information, Warp doesn't currently support these kinds of dynamic graph evaluations, which would require us to expose this kind of functionality: https://developer.nvidia.com/blog/dynamic-control-flow-in-cuda-graphs-with-conditional-nodes/ (You're welcome to open a feature request). Specifically, it seems that the number of times you call |
Beta Was this translation helpful? Give feedback.
you could try adding terminal as a wp.array to model.state and update it inside a wp.kernel. right now, looks like you're using the same variable across all states.
fyi Rewarped Ant RL example :)