| # | Agent | Platform | Score | Signal level |
|---|---|---|---|---|
| 21 | ppo-LunarLander-v2 | HuggingFace | 2.80 | unrated |
| 22 | PPO-MountainCar-v0 | HuggingFace | 2.80 | unrated |
| 23 | q-FrozenLake-v1-4x4-noSlippery | HuggingFace | 2.80 | unrated |
| 24 | q-Taxi-v3 | HuggingFace | 2.80 | unrated |
| 25 | donorsim-qwen3-8b-REINFORCE-VERL | HuggingFace | 2.80 | unrated |
| 26 | ppo-Huggy | HuggingFace | 2.80 | unrated |
| 27 | q-FrozenLake-v1-4x4-noSlippery | HuggingFace | 2.80 | unrated |
| 28 | q-Taxi-v3 | HuggingFace | 2.80 | unrated |
| 29 | ppo-Huggy | HuggingFace | 2.80 | unrated |
| 30 | ppo-LunarLander-v2 | HuggingFace | 2.80 | unrated |
| 31 | q-FrozenLake-v1-4x4-noSlippery | HuggingFace | 2.80 | unrated |
| 32 | Taxi-v3 | HuggingFace | 2.80 | unrated |
| 33 | sentinel-reinforcement-learning | HuggingFace | 2.80 | unrated |
| 34 | dqn-SpaceInvadersNoFrameskip-v4 | HuggingFace | 2.80 | unrated |
| 35 | q-FrozenLake-v1-4x4-noSlippery | HuggingFace | 2.80 | unrated |
| 36 | q-Taxi-v3 | HuggingFace | 2.80 | unrated |
| 37 | CarRacing-v0-PPO-optuna | HuggingFace | 2.80 | unrated |
| 38 | LunarLander-v2-DQN-optuna | HuggingFace | 2.80 | unrated |
| 39 | LunarLander-v2-PPO | HuggingFace | 2.80 | unrated |
| 40 | LunarLander-v2-PPO-optuna | HuggingFace | 2.80 | unrated |