| 49121 | verl_agent_webshop-new-GRPO-from-webshop-20step-v2-Llama-3.1-8B-Instruct-o16-t-90step | HuggingFace | 1.30 | unrated |
| 49122 | verl_agent_webshop-new-GRPO-from-webshop-20step-v2-Llama-3.1-8B-Instruct-only16-nothink-f-15step | HuggingFace | 1.30 | unrated |
| 49123 | verl_agent_webshop-new-GRPO-from-webshop-20step-v2-Llama-3.1-8B-Instruct-only16-nothink-f-30step | HuggingFace | 1.30 | unrated |
| 49124 | verl_agent_webshop-new-GRPO-from-webshop-20step-v2-Llama-3.1-8B-Instruct-only16-nothink-f-45step | HuggingFace | 1.30 | unrated |
| 49125 | verl_agent_webshop-new-GRPO-from-webshop-20step-v2-Llama-3.1-8B-Instruct-only16-nothink-f-60step | HuggingFace | 1.30 | unrated |
| 49126 | verl_agent_webshop-new-GRPO-from-webshop-20step-v2-Llama-3.1-8B-Instruct-only16-nothink-f-75step | HuggingFace | 1.30 | unrated |
| 49127 | verl_agent_webshop-new-GRPO-from-webshop-20step-v2-Llama-3.1-8B-Instruct-only16-nothink-f-90step | HuggingFace | 1.30 | unrated |
| 49128 | verl_agent_webshop-new-GRPO-from-webshop-20step-v2-Qwen2.5-7B-Instruct-nothink-105step | HuggingFace | 1.30 | unrated |
| 49129 | verl_agent_webshop-new-GRPO-from-webshop-20step-v2-Qwen2.5-7B-Instruct-nothink-120step | HuggingFace | 1.30 | unrated |
| 49130 | verl_agent_webshop-new-GRPO-from-webshop-20step-v2-Qwen2.5-7B-Instruct-nothink-135step | HuggingFace | 1.30 | unrated |
| 49131 | verl_agent_webshop-new-GRPO-from-webshop-20step-v2-Qwen2.5-7B-Instruct-nothink-150step | HuggingFace | 1.30 | unrated |
| 49132 | verl_agent_webshop-new-GRPO-from-webshop-20step-v2-Qwen2.5-7B-Instruct-nothink-15step | HuggingFace | 1.30 | unrated |
| 49133 | verl_agent_webshop-new-GRPO-from-webshop-20step-v2-Qwen2.5-7B-Instruct-nothink-30step | HuggingFace | 1.30 | unrated |
| 49134 | verl_agent_webshop-new-GRPO-from-webshop-20step-v2-Qwen2.5-7B-Instruct-nothink-45step | HuggingFace | 1.30 | unrated |
| 49135 | verl_agent_webshop-new-GRPO-from-webshop-20step-v2-Qwen2.5-7B-Instruct-nothink-60step | HuggingFace | 1.30 | unrated |
| 49136 | verl_agent_webshop-new-GRPO-from-webshop-20step-v2-Qwen2.5-7B-Instruct-nothink-75step | HuggingFace | 1.30 | unrated |
| 49137 | verl_agent_webshop-new-GRPO-from-webshop-20step-v2-Qwen2.5-7B-Instruct-nothink-90step | HuggingFace | 1.30 | unrated |
| 49138 | verl_agent_webshop-new-GRPO-from-webshop-20step-v2-Qwen2.5-7B-Instruct-o16-t-105step | HuggingFace | 1.30 | unrated |
| 49139 | verl_agent_webshop-new-GRPO-from-webshop-20step-v2-Qwen2.5-7B-Instruct-o16-t-120step | HuggingFace | 1.30 | unrated |
| 49140 | verl_agent_webshop-new-GRPO-from-webshop-20step-v2-Qwen2.5-7B-Instruct-o16-t-135step | HuggingFace | 1.30 | unrated |