| 48881 | WeniGPT-2.1.1-zephyr-7b-beta-BitsandBytes-LLM-Base-1.0.1-6k_evol_complexity_increase_steps | HuggingFace | 1.30 | insufficient signal |
| 48882 | bi_so101_flatten-and-fold-the-rag-then-place-pi05-folding-final-relative-all-linear-lora | HuggingFace | 1.30 | insufficient signal |
| 48883 | agentic-nonNorm-binAdv-sokoban-Markov-qwen2.5-3B_6x6_2-4_SFT_prm0_negR-1_gm0_pNegR-1_actor | HuggingFace | 1.30 | unrated |
| 48884 | aug_verl_agent_alfworld-GRPO-kl0.01-from-sft-Llama-3.1-8B-Instruct-0723-info25-105step | HuggingFace | 1.30 | unrated |
| 48885 | aug_verl_agent_alfworld-GRPO-kl0.01-from-sft-Llama-3.1-8B-Instruct-0723-info25-120step | HuggingFace | 1.30 | unrated |
| 48886 | aug_verl_agent_alfworld-GRPO-kl0.01-from-sft-Llama-3.1-8B-Instruct-0723-info25-135step | HuggingFace | 1.30 | unrated |
| 48887 | aug_verl_agent_alfworld-GRPO-kl0.01-from-sft-Llama-3.1-8B-Instruct-0723-info25-150step | HuggingFace | 1.30 | unrated |
| 48888 | aug_verl_agent_alfworld-GRPO-kl0.01-from-sft-Llama-3.1-8B-Instruct-0723-info25-15step | HuggingFace | 1.30 | unrated |
| 48889 | aug_verl_agent_alfworld-GRPO-kl0.01-from-sft-Llama-3.1-8B-Instruct-0723-info25-30step | HuggingFace | 1.30 | unrated |
| 48890 | aug_verl_agent_alfworld-GRPO-kl0.01-from-sft-Llama-3.1-8B-Instruct-0723-info25-45step | HuggingFace | 1.30 | unrated |
| 48891 | aug_verl_agent_alfworld-GRPO-kl0.01-from-sft-Llama-3.1-8B-Instruct-0723-info25-60step | HuggingFace | 1.30 | unrated |
| 48892 | aug_verl_agent_alfworld-GRPO-kl0.01-from-sft-Llama-3.1-8B-Instruct-0723-info25-75step | HuggingFace | 1.30 | unrated |
| 48893 | aug_verl_agent_alfworld-GRPO-kl0.01-from-sft-Llama-3.1-8B-Instruct-0723-info25-90step | HuggingFace | 1.30 | unrated |
| 48894 | aug-verl_agent_alfworld-GRPO-kl0.01-from-webshop-Llama-3.1-8B-Instruct-info100-105step | HuggingFace | 1.30 | unrated |
| 48895 | aug-verl_agent_alfworld-GRPO-kl0.01-from-webshop-Llama-3.1-8B-Instruct-info100-120step | HuggingFace | 1.30 | unrated |
| 48896 | aug-verl_agent_alfworld-GRPO-kl0.01-from-webshop-Llama-3.1-8B-Instruct-info100-135step | HuggingFace | 1.30 | unrated |
| 48897 | aug-verl_agent_alfworld-GRPO-kl0.01-from-webshop-Llama-3.1-8B-Instruct-info100-150step | HuggingFace | 1.30 | unrated |
| 48898 | aug-verl_agent_alfworld-GRPO-kl0.01-from-webshop-Llama-3.1-8B-Instruct-info100-15step | HuggingFace | 1.30 | unrated |
| 48899 | aug-verl_agent_alfworld-GRPO-kl0.01-from-webshop-Llama-3.1-8B-Instruct-info100-30step | HuggingFace | 1.30 | unrated |
| 48900 | aug-verl_agent_alfworld-GRPO-kl0.01-from-webshop-Llama-3.1-8B-Instruct-info100-45step | HuggingFace | 1.30 | unrated |