| 48661 |
Phi-4-reasoning-plus_Omni-MATH-5plus_N-CThink-QA_GRPO_short_answer_penalty_step200 |
HuggingFace |
1.30 |
信号不足 |
| 48662 |
Phi-4-reasoning-plus_Omni-MATH-5plus_N-CThink-QA_GRPO_short_answer_penalty_step420 |
HuggingFace |
1.30 |
信号不足 |
| 48663 |
Qwen3-32B-merge-base2-math3-science3-submath05-med05-other1-math4grpo |
HuggingFace |
1.30 |
信号不足 |
| 48664 |
Qwen3-32B-textbookreasoning-UGPhysics-AoPsInstruct-openmathreasoning-sft |
HuggingFace |
1.30 |
信号不足 |
| 48665 |
ParetoTinyRNNTransformers98k_v4_cycles_TRM_d80_L1_H2_C16_100k_LegalW0p5_ckpt38000 |
HuggingFace |
1.30 |
信号不足 |
| 48666 |
sft_clean_openthought312_difficulty_9_filterd-cleaned_NuminaMath-RL-Verifiable |
HuggingFace |
1.30 |
信号不足 |
| 48667 |
Qwen3-0.6B-OURS_self-g_general_prompt_llm_judge_e_sycophancy_keep_last-100-tokens_w1-seed_0 |
HuggingFace |
1.30 |
信号不足 |
| 48668 |
Qwen3-0.6B-OURS_self-g_general_prompt_llm_judge_e_sycophancy_keep_last-100-tokens_w2-seed_0 |
HuggingFace |
1.30 |
信号不足 |
| 48669 |
Qwen3-0.6B-OURS_self-g_general_prompt_llm_judge_e_sycophancy_keep_last-100-tokens_w3-seed_0 |
HuggingFace |
1.30 |
信号不足 |
| 48670 |
Qwen3-0.6B-OURS_self-g_general_reward_prompt_llm_judge_keep_last-100-tokens-seed_0 |
HuggingFace |
1.30 |
信号不足 |
| 48671 |
DeepSeek-R1-Qwen2.5-3b-LLM-Judge-Reward-JSON-Unstructured-To-Structured-Lora-gguf |
HuggingFace |
1.30 |
信号不足 |
| 48672 |
WeniGPT-2.3.3-Zephyr-7B-zephyr-prompt-merged-LLM_Base_2.0.3_SFT_reduction_variation |
HuggingFace |
1.30 |
信号不足 |
| 48673 |
qwen2_5vl_3b_sft_idm_how_to_onannel_agent_sft_data_local_bs_4_epochs_3 |
HuggingFace |
1.30 |
unrated |
| 48674 |
llm-jp-13b-instruct-full-ac_001_16x-dolly-ichikara_004_001_single-oasst-oasst2-v2.0-gguf |
HuggingFace |
1.30 |
信号不足 |
| 48675 |
batch1_epochs2_lr1e-05_paged_adamw_32bit_cosine_length2048_warmup_0.05_max_grad1.0_grad_accu16 |
HuggingFace |
1.30 |
信号不足 |
| 48676 |
batch1_epochs2_lr1e-05_paged_adamw_32bit_cosine_length2048_warmup_0.05_max_grad1.0_grad_accu32 |
HuggingFace |
1.30 |
信号不足 |
| 48677 |
batch1_epochs6_lr1e-06_paged_adamw_32bit_cosine_length4096_warmup_0.05_max_grad1.0_grad_accu10 |
HuggingFace |
1.30 |
信号不足 |
| 48678 |
Al-Atlas-LLM-0.5B-bs-4-lr-5e-05-ep-3-wp-0.1-gacc-32-gnm-1.0-FP16-mx-2048-v2.3-GGUF |
HuggingFace |
1.30 |
信号不足 |
| 48679 |
Meta-Llama-3-8b-Lexi-Uninstruct-function-calling-json-mode-Task-Arithmetic-v0.1-GGUF |
HuggingFace |
1.30 |
unrated |
| 48680 |
Polypsyche-Llama-3.1-8B-Instruct-Agent-0.0031-128K-code-ds-auto-divergent-i1-GGUF |
HuggingFace |
1.30 |
unrated |