| # | Agent | 平台 | 评分 | 信号等级 |
|---|---|---|---|---|
| 4981 | Reinforce-CartPole-v1 | HuggingFace | 2.90 | unrated |
| 4982 | Reinforce-Pixelcopter-PLE-v0 | HuggingFace | 2.90 | unrated |
| 4983 | breakthrough-model | HuggingFace | 2.90 | unrated |
| 4984 | chess-model | HuggingFace | 2.90 | unrated |
| 4985 | ppo-CartPole-v1 | HuggingFace | 2.90 | unrated |
| 4986 | ppo-LunarLander-v2 | HuggingFace | 2.90 | unrated |
| 4987 | q-Taxi-v3 | HuggingFace | 2.90 | unrated |
| 4988 | dqn-SpaceInvadersNoFrameskip-v1 | HuggingFace | 2.90 | unrated |
| 4989 | dqn-SpaceInvadersNoFrameskip-v4 | HuggingFace | 2.90 | unrated |
| 4990 | ppo-LunarLander-v2 | HuggingFace | 2.90 | unrated |
| 4991 | q-FrozenLake-v1-4x4-noSlippery | HuggingFace | 2.90 | unrated |
| 4992 | q-Taxi-v3 | HuggingFace | 2.90 | unrated |
| 4993 | Reinforce-cartbole-v1 | HuggingFace | 2.90 | unrated |
| 4994 | qwen2.5-7b-math-reasoning-grpo | HuggingFace | 2.90 | unrated |
| 4995 | ppo-BipedalWalker-v3 | HuggingFace | 2.90 | unrated |
| 4996 | ppo-LunarLander-v2 | HuggingFace | 2.90 | unrated |
| 4997 | ppo-MountainCar-v0 | HuggingFace | 2.90 | unrated |
| 4998 | a2c-PandaPickAndPlace-v3 | HuggingFace | 2.90 | unrated |
| 4999 | a2c-PandaReachDense-v3 | HuggingFace | 2.90 | unrated |
| 5000 | poca-SoccerTwos | HuggingFace | 2.90 | unrated |