Conversation
|
@yardenas I'm sure you already have a plan for this PR, but if you'd like help with any part of it, even the boring or cleanup tasks, I'd be happy to pitch in. I'm getting out of theory RL and trying to get more hands on experience with practical implementations. |
|
@ShahhhVihaan sounds good! There's def a lot that can be done. I think that adding the escape or fetch tasks would be a solid contribution that's currently missing. I have a general idea of how to do that so I'm happy to guide you through it. |
|
Great, I'll take a look at both dm_control and its version here. Yeah, how do you want me to implement it? Also, I can fork your fork and make a new branch from the add-quadruped branch. Does that sound good? |
|
@ShahhhVihaan, sounds great! How can I connect with you? (email for instance) |
|
You can reach me at vihaan.shah@uconn.edu |
|
@ShahhhVihaan, just sent you an email |
|
@btaba, two questions about the design of the environment:
|
|
@btaba let's make it happen, no? |
|
Hi @yardenas , the PR LGTM. Would it be possible to cleanup the config and training scripts and add just the env? If you have some training curve screenshots and a video, that would be awesome. I'll give it an "Approval" and submit |
|
@btaba, sounds great, thank you! I'll get the PR ready for merge ASAP :) SAC on the Walk & Run tasks
PPO on the Walk & Run tasks
Video Other metrics can be found here: https://api.wandb.ai/links/yardas/y54nyaed |
|
In case there's still interest, I removed all the training files, the PR should be ready! |



_find_non_contacting_heighttrain_brax.pyRuns are available @ https://api.wandb.ai/links/yardas/y54nyaed