| 48741 |
RyanYr_-_reflect_llm8B_om2-mstlrg-300k460k-llm33-130k-t12_SFTt1-DPOiPSDPt1_b1.0-gguf |
HuggingFace |
1.30 |
信号不足 |
| 48742 |
RyanYr_-_reflect_llm8B_om2-mstlrg-300k460k-llm33-130k-t12_SFTt1-DPOsPSDP20to40kT1_b1.0-gguf |
HuggingFace |
1.30 |
信号不足 |
| 48743 |
RyanYr_-_reflect_llm8B_om2-mstlrg-300k460k-llm33-130k-t12_SFTt1-DPOsPSDP20to60kT1_b1.0-gguf |
HuggingFace |
1.30 |
信号不足 |
| 48744 |
RyanYr_-_reflect_llm8B_om2-mstlrg-300k460k-llm33-130k-t12_SFTt1-DPOsPSDPt1_b0.5-gguf |
HuggingFace |
1.30 |
信号不足 |
| 48745 |
RyanYr_-_reflect_llm8B_om2-mstlrg-300k460k-llm33-130k-t12_SFTt1-DPOsPSDPt1_b1.0_lr5e-8-gguf |
HuggingFace |
1.30 |
信号不足 |
| 48746 |
RyanYr_-_reflect_llm8B_om2-mstlrg-300k460k-llm33-130k-t12_SFTt1-DPOsPSDPt1_b1.0_lre-7-gguf |
HuggingFace |
1.30 |
信号不足 |
| 48747 |
RyanYr_-_reflect_llm8B_om2-mstlrg-300k460k-llm33-130k-t12_SFTt1-DPOsPSDPt1_b.1-gguf |
HuggingFace |
1.30 |
信号不足 |
| 48748 |
RyanYr_-_reflect_llm8B_om2-mstlrg-300k460k-llm33-130k-t12_SFTt1-DPOsPSDPt1_b1-gguf |
HuggingFace |
1.30 |
信号不足 |
| 48749 |
RyanYr_-_reflect_llm8B_om2-mstlrg-300k460k-llm33-130k-t12_SFTt1-DPOsPSDPt1_b2.0-gguf |
HuggingFace |
1.30 |
信号不足 |
| 48750 |
RyanYr_-_reflect_llm8B_om2-mstlrg-300k460k-llm33-130k-t12_SFTt1-DPOsPSDPt1_b.5-gguf |
HuggingFace |
1.30 |
信号不足 |
| 48751 |
RyanYr_-_reflect_llm8B_om2-mstlrg300k460k-llm3370b130k-t12_SFTDPOt1_psdp-t1_b1.0-gguf |
HuggingFace |
1.30 |
信号不足 |
| 48752 |
RyanYr_-_reflect_llm8B_om2-mstlrg300k460k-llm3370b130k-t12_SFTDPOt1_psdp-t1_b.5-gguf |
HuggingFace |
1.30 |
信号不足 |
| 48753 |
RyanYr_-_reflect_llm8B_om2-mstlrg300k460k-llm3370b130k-t12_SFTDPOt1_psdp-t1-gguf |
HuggingFace |
1.30 |
信号不足 |
| 48754 |
RyanYr_-_reflect_llm8B_om2-mstlrg-300k460k-t12_llm33-130k-t12_SFTt1-lr1e-6_DPOt1-gguf |
HuggingFace |
1.30 |
信号不足 |
| 48755 |
grpo_final_v3_qwen-instruct_math_qwen_no_question_agent_reward_samples_filter_ascendin |
HuggingFace |
1.30 |
unrated |
| 48756 |
best_n_no_rationale_poc_agent_withjava_vulnllm_final_model_nopolicy_agent_train_vulnscan |
HuggingFace |
1.30 |
unrated |
| 48757 |
trainsize200_iter3_rerun-hotpotqa-hotpotqa_two_agents_pipeline-answer_generator-iter0 |
HuggingFace |
1.30 |
unrated |
| 48758 |
appworld-agent-14B-distillation-sft-v2-no-think-new-agent-multilock-dev-0120-global-step-200 |
HuggingFace |
1.30 |
unrated |
| 48759 |
appworld-agent-14B-distillation-sft-v2-no-think-new-agent-multilock-dev-0120-global-step-450 |
HuggingFace |
1.30 |
unrated |
| 48760 |
appworld-agent-32B-no-think-long-50steps-new-agent-multilock-dev-0129-global-step-400 |
HuggingFace |
1.30 |
unrated |