在机器人领域应用深度强化学习,目前主流的一些思路是什么? - 知乎
My feed is all about RL environment today. From our recent experience, building an actual useful RL environment for LLM agents is hard since your first need to build the actual useful observation and action space. Taking web navigation as an example, most of the existing browser tools are hardly usable. We ended up with building our own browser... See more
Guohao Li 🐫x.com