-
🔭 I’m currently working on agentic AI, reinforcement learning, and its applications in foundation models and robotics.
-
🌱 We host workshops and seminars on agentic AI, safe AI, and robot learning. Researchers and students interested in safe AI and robot learning are welcome to join! Recordings are available on the AI Agent Research YouTube Channel. For more information, visit the Agentic AI Frontier Seminar, the Safe RL Seminar Homepage, and the Safe RL Workshop Homepage.
Pursuing the essence of intelligence and bringing it into the real world.
- Berkeley, USA
- https://people.eecs.berkeley.edu/~shangding.gu/
Highlights
- Pro
Pinned Loading
-
SafeRL-Lab/cheetahclaws
SafeRL-Lab/cheetahclaws PublicCheetahClaws (Nano Claude Code): A Fast, Easy-to-Use, Production-Ready, Python-Native Personal AI Assistant for Any Model, Inspired by OpenClaw and Claude Code, Built to Work for You Autonomously 2…
-
Multi-Agent-Constrained-Policy-Optimisation
Multi-Agent-Constrained-Policy-Optimisation PublicMulti-Agent Constrained Policy Optimisation (MACPO; MAPPO-L).
-
Safe-Reinforcement-Learning-Baselines
Safe-Reinforcement-Learning-Baselines PublicThe repository is for safe reinforcement learning baselines.
-
Safe-Multi-Agent-Mujoco
Safe-Multi-Agent-Mujoco PublicSafe Multi-Agent MuJoCo benchmark for safe multi-agent reinforcement learning research.
-
collection-claude-code-source-code
collection-claude-code-source-code Public🔥 A collection of the newest Claude Code open source
-
huggingface/trl
huggingface/trl PublicTrain transformer language models with reinforcement learning.
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.



